Skip to content

pisa-engine/docker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

20 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PISA Docker Image

Build Status

This project provides a Docker image for PISA. It provides the ability to run command-line tools in an isolated environment. The image can be furthermore used as a base for other images that provide means of reproducing specific scenarios using PISA engine.

The image is available in Docker Hub and can be pulled with the command docker pull pisa/pisa.

Building Image

git clone https://github.com/pisa-engine/docker.git
cd docker
make image

Push image to Docker Hub

make push-image

Command-Line Usage

Once built, you can execute a container, which will have all command-line tools included in its PATH variable.

# docker run --rm pisa create_freq_index -h

# docker run -rm -v /path/to/data:/data \ 
    zcat /data/GOV2/**/*.gz | parse_collection \
        -f trecweb \
        --stemmer porter2 \
        --content-parser html \
        -o /data/output