How many models does the RL training process have to train? #3346
Unanswered
aijianiula0601
asked this question in
Q&A
Replies: 0 comments
I saw in the source code that the RM and SFT models are deployed in the Triton service. So which models need to be loaded by the RL training process itself?