Skip to content

saikatmitra1998/Data-Pipeline-for-Hotel-Thumbnail-Selection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Main Image Selection Pipeline Implementation

Overview

This project implements a main image selection pipeline for hotel images using Apache Spark and Python. The pipeline reads image data, calculates scores based on various criteria, and selects the best main image for each hotel.

Prerequisites

  • Docker
  • Make

Please note that Python and Spark are being installed by Docker.

How to Execute the Code

To run this project on your local machine, follow these steps:

  1. Build the Docker Image: Navigate to the directory where the Dockerfile is located and build the Docker image.

    cd path/to/dockerfile
    make build
  2. Run the Docker Container: Navigate to the directory where the Dockerfile is located and build the Docker image.

    make run

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published