Skip to content

Commit

Permalink
add veRL (#617)
Browse files Browse the repository at this point in the history
  • Loading branch information
zhimin-z authored Nov 1, 2024
1 parent 8139fb6 commit 3856555
Showing 1 changed file with 1 addition and 0 deletions.
1 change: 1 addition & 0 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -539,6 +539,7 @@ Please review our [CONTRIBUTING.md](https://github.com/EthicalML/awesome-product
* [SuperSuit](https://github.com/Farama-Foundation/SuperSuit) ![](https://img.shields.io/github/stars/Farama-Foundation/SuperSuit.svg?style=social) - SuperSuit introduces a collection of small functions which can wrap reinforcement learning environments to do preprocessing ('microwrappers').
* [TF-Agents](https://github.com/tensorflow/agents) ![](https://img.shields.io/github/stars/tensorflow/agents.svg?style=social) - A reliable, scalable and easy to use TensorFlow library for contextual bandits and reinforcement learning.
* [TRL](https://github.com/huggingface/trl) ![](https://img.shields.io/github/stars/huggingface/trl.svg?style=social) - Train transformer language models with reinforcement learning.
* [veRL](https://github.com/volcengine/veRL) ![](https://img.shields.io/github/stars/volcengine/veRL.svg?style=social) - veRL (HybridFlow) is a flexible, efficient and industrial-level RL(HF) training framework designed for LLMs.


## Industry Strength Visualisation
Expand Down

0 comments on commit 3856555

Please sign in to comment.