CATX is a library for training and using contextual bandits in a continuous action space.
CATX builds on the work presented in "Efficient Contextual Bandits with Continuous Actions (CATS)" by Majzoubi et al. CATX brings forth the freedom to implement custom neural network architectures as decision agents within the learning algorithm. It allows for greater scalability and context modalities while also leveraging the computational speed of JAX.
CATX is aimed at users facing continuous action contextual bandits problems - any problem where you need to take continuous actions while maximising the desired reward (and, consequently, minimising cost, time or effort expenditures). Contextual bandits settings, where the exploration-exploitation trade-off needs to be dealt with, can be found in many industries and use cases. CATX offers a valuable boost to this type of problem, by implementing contextual bandits with continuous actions in JAX, and allowing custom neural networks in the tree structure of the CATS algorithm.
Go to documentation
to find everything you need to know about CATX
from the installation with pip install catx
to a quick getting started example
and much more on CATX and its inner workings.
@software{catx2022github,
author = {Wissam Bejjani and Cyprien Courtot},
title = {CATX: contextual bandits library for Continuous Action Trees with smoothing in JAX},
url = {https://github.com/instadeepai/catx/},
version = {0.2.1},
year = {2022},
}
In this bibtex entry, the version number is intended to be from
catx/VERSION
,
and the year corresponds to the project's open-source release.