This repository contains the code that generates the data and plots for the ICLR 2024 paper *How do language models bind entities in context?*. It is built on top of the TransformerLens library and contains code from Danny Halawi and Evan Hernandez.
Use Python 3.10 and install the dependencies listed in the `requirements.txt` file.
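A minimal setup sketch (the virtual-environment name is illustrative):

```bash
# Create and activate a Python 3.10 virtual environment,
# then install the pinned dependencies.
python3.10 -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt
```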
You will need to obtain a local copy of the LLaMA weights in a format compatible with Hugging Face; follow the instructions in the HF documentation. Once you have a path to the LLaMA weights, save it in the shell environment variable `LLAMA_WEIGHTS` (refer to `models.py`).
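For example (the path below is a placeholder for wherever you stored the converted weights):

```bash
# Point the code at your local copy of the HF-compatible LLaMA weights.
export LLAMA_WEIGHTS=/path/to/llama-weights
```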
Experiments were conducted on 4 A100 GPUs.
Run:

```bash
python -m scripts.run_experiment \
    --model "llama-30b-hf" \
    --force_time 'results' \
    --num_devices 4 \
    --cyclic \
    --factorizability \
    --position \
    --means_intervention \
    --means_intervention_baseline \
    --all_tasks_means \
    --cross_tasks_means \
    --cross_tasks_subspace \
    --baseline \
    --local_position
```
Here's a breakdown of the flags:
- `factorizability`: Fig 3a, 3b
- `position`: Fig 4
- `means_intervention`, `means_intervention_baseline`: Table 1
- `all_tasks_means`: Fig 6 right
- `cross_tasks_subspace`: Table 2
- `baseline`: baselines for Fig 6 right
- `local_position`: position plots for all tasks (appendix)
To generate Fig 5, run:

```bash
python -m scripts.run_experiment \
    --model "llama-13b-hf" \
    --force_time "results" \
    --num_devices 3 \
    --subspace
```
To generate Fig 6 left, run:

```bash
models="EleutherAI/pythia-70m EleutherAI/pythia-160m EleutherAI/pythia-410m EleutherAI/pythia-1b EleutherAI/pythia-1.4b EleutherAI/pythia-2.8b EleutherAI/pythia-6.9b EleutherAI/pythia-12b llama-7b-hf llama-13b-hf"
for model in $models; do
    echo "$model"
    python -m scripts.run_experiment \
        --model "$model" \
        --force_time "results" \
        --num_devices 3 \
        --capitals_mean \
        --cyclic \
        --capitals_baseline
done
```
You may add the `--factorizability` and `--position` flags if you want to run those experiments for all models, as in the sketch below.
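For example, the inner command would then look like this (a sketch; the two extra flags are simply appended to the call above):

```bash
python -m scripts.run_experiment \
    --model "$model" \
    --force_time "results" \
    --num_devices 3 \
    --capitals_mean \
    --cyclic \
    --capitals_baseline \
    --factorizability \
    --position
```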
To generate the one-hop results, run:

```bash
python -m scripts.run_experiment \
    --model "llama-65b-hf" \
    --force_time 'results' \
    --country_width 2 \
    --num_devices 4 \
    --onehop
```
To generate the MCQ results, run:

```bash
python -m scripts.run_experiment \
    --model "tulu-13b" \
    --force_time 'results' \
    --country_width 2 \
    --num_devices 4 \
    --mcq
```