-
Notifications
You must be signed in to change notification settings - Fork 47
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Did you manage to solve the action saturation issue? #1
Comments
I got the same problem. When outputting both linear and angular velocity, it got 1 or 0 after some gradient decent. I have checked the code and tested it using Pendulum-v0 environment. |
Hello, I also meet the problem of the action saturation, I noticed that the author used a batch normalization after each layer in actor network. I'm wondering if it is the solution to fix the problem and did you fix it now? |
Hello, I have the same problem, do you have a solution? I would be grateful if you could help me! |
I am using SAC and meet the same issue too. I changed many hype-parameters (lr, batch size, initial memory, state format, action range) and the issues have been fixed. I am not sure that which factor is the key, but I am pretty sure that is not batch norm, because I not using batch norm in model design. |
Hi, I see in this code you are outputting both linear and angular velocity and would like to know how you got around the action saturation problem?
The text was updated successfully, but these errors were encountered: