This project implements a main image selection pipeline for hotel images using Apache Spark and Python. The pipeline reads image data, calculates scores based on various criteria, and selects the best main image for each hotel.
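The selection step described above can be illustrated with a minimal plain-Python sketch. It is not the project's actual Spark code: the `Image` fields, the `score` function, and the resolution-based criterion are hypothetical stand-ins, since the real scoring criteria are not listed here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Image:
    # Hypothetical schema; the real pipeline's columns may differ.
    hotel_id: int
    image_id: int
    width: int
    height: int


def score(image: Image) -> float:
    # Hypothetical criterion: prefer higher resolution (in megapixels).
    return image.width * image.height / 1_000_000


def select_main_images(images):
    """Return {hotel_id: image_id} with the best-scoring image per hotel."""
    best = {}
    for image in images:
        current = best.get(image.hotel_id)
        # On a tie, the first image seen for the hotel is kept.
        if current is None or score(image) > score(current):
            best[image.hotel_id] = image
    return {hotel_id: image.image_id for hotel_id, image in best.items()}


images = [
    Image(hotel_id=1, image_id=10, width=800, height=600),
    Image(hotel_id=1, image_id=11, width=1920, height=1080),
    Image(hotel_id=2, image_id=20, width=1024, height=768),
]
main_images = select_main_images(images)
print(main_images)  # {1: 11, 2: 20}
```

In Spark, the same "best row per group" pattern is commonly expressed with a window partitioned by hotel and ordered by score, though whether this project does so is an assumption.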
- Docker
- Make
Python and Spark are installed inside the Docker image, so they do not need to be installed locally.
To run this project on your local machine, follow these steps:
- Build the Docker image: navigate to the directory where the Dockerfile is located and build the image.
cd path/to/dockerfile
make build
- Run the Docker container: from the directory where the Dockerfile is located, start the container.
make run