Solution for the Kaggle competition "The Learning Agency Lab - PII Data Detection" (41st place, silver medal)
pip install -r requirements.txt
cd src
For training on a single GPU with mixed precision:
accelerate launch --mixed_precision=fp16 train_accelerate.py fold=0
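For reference, the core of an accelerate-based mixed-precision loop looks roughly like the sketch below. The model, optimizer, and data here are toy placeholders for illustration, not the actual setup in train_accelerate.py:

import torch
from accelerate import Accelerator
from torch.utils.data import DataLoader, TensorDataset

accelerator = Accelerator()  # precision is taken from the --mixed_precision launch flag
model = torch.nn.Linear(10, 2)  # toy model standing in for the real one
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
dataset = TensorDataset(torch.randn(32, 10), torch.randint(0, 2, (32,)))
dataloader = DataLoader(dataset, batch_size=8)

# prepare() moves everything to the right device and wraps it for the chosen precision
model, optimizer, dataloader = accelerator.prepare(model, optimizer, dataloader)

for inputs, labels in dataloader:
    optimizer.zero_grad()
    loss = torch.nn.functional.cross_entropy(model(inputs), labels)
    accelerator.backward(loss)  # replaces loss.backward() so fp16 loss scaling is applied
    optimizer.step()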
For training on multiple GPUs with mixed precision:
Note: Make sure to set the number of processes according to the number of GPUs you have. The following command is for 2 GPUs.
accelerate launch --multi_gpu --mixed_precision=fp16 --num_processes=2 train_accelerate.py fold=0
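If you are unsure how many GPUs are visible, a quick check in Python (assumes PyTorch is installed):

import torch
print(torch.cuda.device_count())  # set --num_processes to at most this value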
Take a look at src/config/config_accelerate.py to change configuration details. Also set your wandb API key as an environment variable to log the metrics to wandb.
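One way to set the key, assuming the standard WANDB_API_KEY variable that the wandb client reads (shown in Python; exporting it in your shell before launching works the same way):

import os
os.environ["WANDB_API_KEY"] = "<your-api-key>"  # placeholder; use your key from wandb.ai/authorize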
For training the Llama-3-8B LLM on multiple GPUs (the configuration uses LoRA during fine-tuning):
accelerate launch --multi_gpu --mixed_precision=bf16 --num_processes=2 train_llm.py fold=2 upload_models=True
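For context, attaching LoRA adapters with the peft library typically looks like the sketch below. The base model id, rank, target modules, and task type are assumptions for illustration, not the exact values used in train_llm.py:

from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

# Loading the gated Llama 3 checkpoint requires Hugging Face access approval.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Meta-Llama-3-8B")
lora_config = LoraConfig(
    r=16,                                 # adapter rank (assumed value)
    lora_alpha=32,                        # scaling factor (assumed value)
    target_modules=["q_proj", "v_proj"],  # attention projections (assumed)
    lora_dropout=0.05,
    task_type="CAUSAL_LM",                # assumed task type
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the small LoRA matrices are trainable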