PPOSIL, fix continuous actions and PPO
This release adds PPOSIL, fixes some small issues with continuous actions, and PPO ratio computation.
Implementations
#145 Implement PPOSIL. Improve debug logging
#143 add Arch installer thanks to @angel-ayala
Bug Fixes
#138 kill hanging processes of Electron for plotting
#145 fix PPO wrong graph update sequence causing ratio to be 1. Fix continuous action output construction. add guards.
#146 fix continuous actions and add full tests