You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am studying by referring to your CQL code.
But, I think Line 68 not be proper to Offline RL when I run the train.py of CQL-SAC.
Line 68 : buffer.add(state, action, reward, next_state, done)
Isn't this line an off-policy model by putting data that interacts with the agent and the environment into a buffer?
I thank you for your hard works.
The text was updated successfully, but these errors were encountered:
I am studying by referring to your CQL code.
But, I think Line 68 not be proper to Offline RL when I run the train.py of CQL-SAC.
Line 68 : buffer.add(state, action, reward, next_state, done)
Isn't this line an off-policy model by putting data that interacts with the agent and the environment into a buffer?
I thank you for your hard works.
The text was updated successfully, but these errors were encountered: