Evaluating dense model-based approaches for Multimodal Medical Case retrieval

Installation and Use

Import imageclefenv environment: conda env create --file imageclefenv.yaml
Activate environment: conda activate imageclefenv

Then follow the usage instructions here to run the code of each step of the pipeline, after setting the environment variables explained below.

Workflow pipeline of the retrieval system: dataset collection and article encoding (step 1), storage and indexing of embeddings (step 2), query encoding (step 3), results fusion (step 4), and retrieval of a final ranked list of results (step 5). The query workflow is turquoise, whereas the articles' workflow is black.

Note: To use LongCLIP, download the checkpoint of the model LongCLIP-B and place it under ./checkpoints.

Environment Variables

In order to run the project, you need to set all of the following environment variables in a .env file:

HF_TOKEN: HuggingFace token.
HF_HOME: HuggingFace home directory, where data will be locally stored.
DATA_DIR_PATH: Dataset directory (file structure detailed next).
OUTPUT_DIR_PATH: Output directory.

Dataset

Case-based retrieval task from ImageCLEFmed 2013 Task detailed here.

The data directory must follow the following structure:

data/
│   case-based-topics.xml   
│
└───CaseQueryImages2013/
│   │   01_1.jpg
│   │   01_2.jpg
|   |   ...
│
└───figures/
|   │   scrt68-3.jpg
|   │   rr11-4.jpg
|   |   ...
|
└───meta-xml/
│   │
│   └───one-file-per-article/
│       │   article_126217.xml
│       │   article_29062.xml
│       │   ...
│
└───qrels/
    │   qrel-2013-case_based.txt

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
docs		docs
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
imageclefenv.yaml		imageclefenv.yaml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Evaluating dense model-based approaches for Multimodal Medical Case retrieval

Installation and Use

Environment Variables

Dataset

About

Languages

catarinaopires/eval-multimodal-medical-case-retrieval

Folders and files

Latest commit

History

Repository files navigation

Evaluating dense model-based approaches for Multimodal Medical Case retrieval

Installation and Use

Environment Variables

Dataset

About

Topics

Resources

Stars

Watchers

Forks

Languages