Welcome to Bodo examples!
First make sure you have Bodo installed.
The data generation scripts also require two other packages, pandas-datareader and scikit-learn:
conda install -c conda-forge pandas-datareader
conda install -c conda-forge scikit-learn
Many of the data generation and example scripts accept optional arguments. To see a script's usage, run:
python path/script.py --help
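As a rough illustration of where that usage text comes from, here is a minimal sketch of a script that declares optional arguments with Python's standard argparse module. The option names (--rows, --output) and the program name are hypothetical and are not taken from any script in this repository; each real script defines its own arguments.

```python
import argparse


def build_parser():
    # Hypothetical options for illustration only; the real scripts
    # define their own arguments.
    parser = argparse.ArgumentParser(
        prog="script.py",
        description="Generate sample data (illustrative sketch).",
    )
    parser.add_argument("--rows", type=int, default=1000,
                        help="number of rows to generate")
    parser.add_argument("--output", default="data.csv",
                        help="path of the file to write")
    return parser


# format_help() returns the same text that `--help` would print.
print(build_parser().format_help())
```

Because argparse builds the help text from the argument declarations, the --help output stays in sync with whatever options a script actually accepts.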
For more information on data generation and the examples, see the docstring at the top of each Python script.
- Monte Carlo Pi Calculation
- k-means
- Linear Regression
- Logistic Regression
- Kernel Density Estimation
- Intraday Mean
- Some TPC-H queries (more information on TPC-H can be found here):
  Query #1, 3, 4, 5, 6, 9, 10, 12, 14, 18, 19, 20.
  - Data generation: generated data will be available at data/tpch-datagen/data:
    # To generate data with a scale of 1, equivalent to 1GB of data
    data/tpch-datagen/generateData.sh 1
    # To generate data with a scale of 2, equivalent to 2GB of data
    data/tpch-datagen/generateData.sh 2
An example performing Monte Carlo Pi Calculation:
# run the example on a single core
python examples/pi.py
# run the example on 4 cores
mpiexec -n 4 python examples/pi.py
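For readers unfamiliar with the technique, the following is a minimal single-process sketch of Monte Carlo Pi estimation in plain Python. It is not the contents of examples/pi.py, which may be structured differently; the function name estimate_pi and its parameters are assumptions made for this illustration.

```python
import random


def estimate_pi(num_samples, seed=None):
    """Estimate pi by sampling points uniformly in the square [-1, 1] x [-1, 1]."""
    rng = random.Random(seed)
    inside = 0
    for _ in range(num_samples):
        x = rng.uniform(-1.0, 1.0)
        y = rng.uniform(-1.0, 1.0)
        # Count the points that fall inside the unit circle.
        if x * x + y * y <= 1.0:
            inside += 1
    # Area ratio circle/square is pi/4, so scale the hit fraction by 4.
    return 4.0 * inside / num_samples


print(estimate_pi(200_000, seed=0))
```

Each sample is independent, which is why this kind of loop parallelizes well: every core can count hits on its own share of the samples, and only the final counts need to be combined.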
An example performing linear regression:
# generate data
python data/linear_regression_datagen.py
# run example on 4 cores
mpiexec -n 4 python examples/linear_regression.py
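As a reminder of what the example computes, here is a minimal sketch of fitting a line to one-dimensional data with the closed-form least-squares solution. It is only an illustration in plain Python: examples/linear_regression.py may use a different formulation (for instance an iterative method over many features), and the function name fit_line and the sample data are assumptions.

```python
def fit_line(xs, ys):
    """Return (slope, intercept) minimizing the squared error of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations and cross deviations around the means.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical sample data lying exactly on y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0]
ys = [1.0, 3.0, 5.0, 7.0]
slope, intercept = fit_line(xs, ys)
print(slope, intercept)
```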
An example performing TPC-H query #1:
# generate data
data/tpch-datagen/generateData.sh 1
# run example on 4 cores
mpiexec -n 4 python examples/tpch/q01.py
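To give a sense of what query #1 does, the sketch below computes a reduced form of the TPC-H pricing summary report in plain Python: it keeps line items shipped on or before a cutoff date, groups them by return flag and line status, and aggregates quantities and prices. This is an illustration under stated assumptions, not the code in examples/tpch/q01.py; the sample rows, the function name pricing_summary, and the 90-day delta are chosen for the example, and the averages computed by the full query are omitted.

```python
from collections import defaultdict
from datetime import date, timedelta

# Hypothetical sample rows:
# (returnflag, linestatus, quantity, extendedprice, discount, tax, shipdate)
LINEITEM = [
    ("A", "F", 10.0, 100.0, 0.10, 0.05, date(1995, 1, 1)),
    ("A", "F", 20.0, 200.0, 0.00, 0.10, date(1996, 6, 1)),
    ("N", "O", 5.0, 50.0, 0.20, 0.00, date(1998, 8, 1)),
    ("N", "O", 7.0, 70.0, 0.00, 0.00, date(1998, 11, 30)),  # after the cutoff
]


def pricing_summary(rows, delta_days=90):
    """Aggregate line items shipped on or before 1998-12-01 minus delta_days."""
    cutoff = date(1998, 12, 1) - timedelta(days=delta_days)
    groups = defaultdict(lambda: {
        "sum_qty": 0.0, "sum_base_price": 0.0,
        "sum_disc_price": 0.0, "sum_charge": 0.0, "count_order": 0,
    })
    for flag, status, qty, price, disc, tax, shipdate in rows:
        if shipdate > cutoff:
            continue  # filtered out by the ship date predicate
        g = groups[(flag, status)]
        g["sum_qty"] += qty
        g["sum_base_price"] += price
        g["sum_disc_price"] += price * (1 - disc)
        g["sum_charge"] += price * (1 - disc) * (1 + tax)
        g["count_order"] += 1
    # Order groups by (returnflag, linestatus), as the query does.
    return dict(sorted(groups.items()))


for key, totals in pricing_summary(LINEITEM).items():
    print(key, totals)
```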
More documentation can be found at http://docs.bodo.ai.
Bodo tutorial can be found here