RecSys Challenge 2019 - benchmark models

This repository contains code to reproduce the benchmarks calculated for the data from the ACM RecSys Challenge 2019 organized by trivago, TU Wien, Politecnico di Milano, and Karlsruhe Institute of Technology.

Installation and usage

[Optional] Before installing the code, you can create a virtual environment so that the installed packages don't get mixed with the ones on your system. To do so, execute the following commands in your terminal:

python3 -m venv trvrecsys2019benchmarks
source trvrecsys2019benchmarks/bin/activate

This creates a folder named trvrecsys2019benchmarks in the current directory, which contains the Python executable files, and activates the environment in the current shell.
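
To confirm that the environment is actually active before installing anything, a small check along these lines can help. It is a minimal sketch that relies only on the standard library; the function name in_virtualenv is made up for illustration and is not part of this package.

```python
import sys


def in_virtualenv() -> bool:
    """Return True when the running interpreter belongs to a virtual environment."""
    # Inside a venv, sys.prefix points at the environment directory, while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
```

If this reports False after running the activate command, the script was most likely started with a different Python interpreter than the one inside the environment folder.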

To install the package and its dependencies use:

pip install git+https://github.com/trivago/recsys-challenge-2019-benchmarks.git#egg=trvrecsys2019benchmarks

This installs all required Python packages. Note that on a Mac, the version of the lightgbm library used here requires the OpenMP library, which is needed to run LightGBM on systems with the Apple Clang compiler. It can be installed via Homebrew:

brew install libomp
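
One way to check whether LightGBM can be loaded after installation is to try importing it. The sketch below assumes that a missing OpenMP runtime surfaces as an ImportError or OSError at import time, which is the usual behaviour when a shared library cannot be loaded but is not something this README confirms. The function name lightgbm_loads is hypothetical.

```python
def lightgbm_loads() -> bool:
    """Return True if the lightgbm package can be imported without error."""
    try:
        # Importing lightgbm loads its compiled library, which in turn
        # needs the OpenMP runtime to be available on the system.
        import lightgbm  # noqa: F401
    except (ImportError, OSError):
        # Assumption: a missing package or a missing shared library
        # shows up as one of these two exception types.
        return False
    return True


if __name__ == "__main__":
    print("lightgbm can be imported:", lightgbm_loads())
```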

Producing example data

Before running any models on the bigger RecSys Challenge data sets, you can create a small example data set to test that the models execute:

produce-example-data --data-path <target-location-for-csv-files>

Running a single model

To run an individual model, run:

run-single-model --data-path <path-to-csv-files-directory> --train-file <training-data-file> --test-file <test-data-file> --subm-file <submission-file-to-be-created> --model-name <name-of-model>
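
If you prefer to start a run from a Python script, the command above can be assembled as an argument list and passed to subprocess. This is a sketch, not part of the package: the helper names are hypothetical, the flags are the ones shown above, and all file and model names in the example are placeholders.

```python
import shlex
import subprocess
from pathlib import Path


def single_model_command(data_path, train_file, test_file, subm_file, model_name):
    """Build the argument list for the run-single-model command."""
    return [
        "run-single-model",
        "--data-path", str(Path(data_path)),
        "--train-file", train_file,
        "--test-file", test_file,
        "--subm-file", subm_file,
        "--model-name", model_name,
    ]


def run_single_model(*args):
    """Run the command; check=True raises CalledProcessError on a non-zero exit."""
    return subprocess.run(single_model_command(*args), check=True)


if __name__ == "__main__":
    # Placeholder values only; substitute your own files and a model name
    # taken from the output of `run-single-model --help`.
    cmd = single_model_command(
        "data", "train.csv", "test.csv", "submission.csv", "<name-of-model>"
    )
    print(shlex.join(cmd))
```

Passing the arguments as a list rather than a single string avoids shell quoting problems when paths contain spaces.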

To see what models are available, run:

run-single-model --help

Note that some models need a different set of training data. For example, the nn-item model relies on item_metadata.csv as input.
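
Because the required input files differ between models, it can be useful to verify that they exist before starting a longer run. The following sketch uses only the standard library. The function name missing_files is hypothetical, and apart from item_metadata.csv the file names in the example are placeholders.

```python
from pathlib import Path


def missing_files(data_path, file_names):
    """Return the names from file_names that do not exist under data_path."""
    base = Path(data_path)
    return [name for name in file_names if not (base / name).is_file()]


if __name__ == "__main__":
    # train.csv and test.csv are placeholder names for your own data files.
    required = ["train.csv", "test.csv", "item_metadata.csv"]
    missing = missing_files("data", required)
    if missing:
        print("missing input files:", ", ".join(missing))
    else:
        print("all input files found")
```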

Running all models

You can run all models at once. This might take a while, and the names of the submission files are inferred from the model names. To do so, run:

run-all-models --data-path <path-to-csv-files-directory> --train-file <training-data-file> --test-file <test-data-file> --meta-file <item-metadata-file>
