Data Preprocessing, LSTM Model, and Analysis for something.
TODO:
- Unify single passes and training functions if they're the same (seems to be at least in case of LSTM and NFO, I don't see why we can't get a general training function for all models)
- Separate dataset and data loader for the CustomSequence (will eff up training code but we'll fix it)
- Fix the transformer model (needs proper testing)
This is a continuation of [https://github.com/jhirschm/NeurIPS2023]