Monkeypatch for Llama 3.2-Vision #282

Merged: 30 commits into linkedin:main on Oct 10, 2024

Conversation

@tyler-romero (Contributor) commented Sep 29, 2024:

Summary

Add monkeypatch to support Llama 3.2-Vision models.

Details

Llama 3.2-Vision is a multimodal model, and it is only available in transformers>=4.45.0.

Torchvision is required to run the multimodal tests for Llama 3.2-Vision (the image processor requires it).
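
For reference, a minimal usage sketch of the new patch (assuming it is exposed as `apply_liger_kernel_to_mllama`, mirroring the existing `apply_liger_kernel_to_llama` entry points; the exact API may differ):

```python
import transformers
from liger_kernel.transformers import apply_liger_kernel_to_mllama

# Swap in Liger kernels for the HF Mllama modeling code before building the model.
apply_liger_kernel_to_mllama()

model = transformers.MllamaForConditionalGeneration.from_pretrained(
    "meta-llama/Llama-3.2-11B-Vision-Instruct"
)
```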

Testing Done

  • Hardware Type: RTX 4090
  • run `make test` to ensure correctness
  • run `make checkstyle` to ensure code style
  • run `make test-convergence` to ensure convergence

@tyler-romero (Contributor, Author):

I wanted to add support for LayerNorm in the vision tower as well, but it is hard to patch the way it is used in modeling_mllama.py - a patch would end up changing torch.nn globally.
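
To illustrate the problem (a hypothetical sketch; `LigerLayerNorm` is a stand-in name, not part of this PR): modeling_mllama.py instantiates `nn.LayerNorm` directly, so the only class-level patch point is the shared `torch.nn` namespace.

```python
import torch.nn as nn

class LigerLayerNorm(nn.LayerNorm):  # hypothetical Liger replacement, not part of this PR
    ...

# modeling_mllama.py calls nn.LayerNorm(...) directly, so the only class-level
# hook is reassigning the name on torch.nn itself...
nn.LayerNorm = LigerLayerNorm
# ...which silently swaps LayerNorm for every model in the process, not just Mllama.
```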

@tyler-romero marked this pull request as ready for review on September 30, 2024 17:23
@ByronHsu mentioned this pull request on Sep 30, 2024
shimizust and others added 9 commits October 1, 2024 11:26
## Summary
- Previously, the pre-trained weights were not being loaded if patching
model post-initialization
- Instead of loading weights, just patch the model instance module's
forward method (see linkedin#279; a sketch follows these notes)

## Testing Done
- In convergence tests, check that pre-init patching and post-init
patching match results from original model

- Hardware Type: A100
- [x] run `make test` to ensure correctness
- [x] run `make checkstyle` to ensure code style
- [ ] run `make test-convergence` to ensure convergence --> most tests
working, waiting for other fixes for all tests to pass
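
A sketch of that instance-level patching idea (`LigerRMSNorm` does exist in `liger_kernel.transformers.rms_norm`, but the helper and loop here are illustrative; the real patch also sets any extra attributes the Liger forward expects, such as the offset and casting mode):

```python
from types import MethodType

from liger_kernel.transformers.rms_norm import LigerRMSNorm

def patch_rms_norm_module(module):
    # Rebind forward on the live module instance instead of re-instantiating it,
    # so the pre-trained weights already loaded into `module` stay untouched.
    module.forward = MethodType(LigerRMSNorm.forward, module)

# `model` is assumed to be an already-initialized (post-init) HF model.
for module in model.modules():
    if module.__class__.__name__ == "MllamaTextRMSNorm":
        patch_rms_norm_module(module)
```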
- Make the `transformers` dependency optional so we only have `torch`
and `triton` as required deps, which is helpful if you're not using
`transformers` for modeling code. This was also causing installation
issues for people using slightly older transformers versions.
- If transformers is needed, make it compatible with any 4.x version.
The specific model being used should dictate the transformers version
compatibility (see the dependency sketch below).

`pip install -e .[transformers]`
`pip install -e .[dev]`
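
A sketch of the resulting dependency layout (shown in `setup.py` form; the field values are illustrative, and the repo may declare this in `pyproject.toml` instead):

```python
from setuptools import setup

setup(
    name="liger_kernel",  # illustrative; real metadata lives in the repo's build config
    install_requires=["torch", "triton"],  # the only hard requirements
    extras_require={
        # transformers is optional, and any 4.x release is accepted: the
        # specific model being used dictates the real version constraint.
        "transformers": ["transformers~=4.0"],
        "dev": ["transformers~=4.0", "pytest", "torchvision"],  # illustrative dev deps
    },
)
```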

- Hardware Type: A100-80G-PCIe
- [x] run `make test` to ensure correctness
- [x] run `make checkstyle` to ensure code style
- [x] run `make test-convergence` to ensure convergence
@tyler-romero (Contributor, Author):

Just need someone to set an HF token for an account that has accepted the Llama license in the CI/CD env:
https://discord.com/channels/1189498204333543425/1275130785933951039/1290378722653896744

>           raise EnvironmentError(
                "You are trying to access a gated repo.\nMake sure to have access to it at "
                f"[https://huggingface.co/{path_or_repo_id}.\n{str(e)}](https://huggingface.co/%7Bpath_or_repo_id%7D./n%7Bstr(e)%7D)"
            ) from e
E           OSError: You are trying to access a gated repo.
E           Make sure to have access to it at https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct.
E           401 Client Error. (Request ID: Root=1-66fc5343-253e99011960556f10a26970;1bfe5fb8-1fa8-4ff0-a21e-28041d5b6752)
E           
E           Cannot access gated repo for url https://huggingface.co/meta-llama/Llama-3.2-11B-Vision-Instruct/resolve/main/config.json.
E           Access to model meta-llama/Llama-3.2-11B-Vision-Instruct is restricted. You must have access to it and be authenticated to access it. Please log in.
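
For what it's worth, once the secret exists, the test session can authenticate with something like the following (assuming the workflow exposes the secret as an `HF_TOKEN` environment variable):

```python
import os

from huggingface_hub import login

# The token must belong to an account that has accepted the Llama license.
login(token=os.environ["HF_TOKEN"])
```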

@shivam15s (Collaborator) commented Oct 1, 2024:

Added an HF Hub read secret to the repo and got access to the gated Llama 3.2 models.

@tyler-romero (Contributor, Author):

Thanks - for some reason I'm still seeing the same issue. Could you try triggering a CI/CD run on this branch?

@shivam15s (Collaborator):

Let me get back to you. Still figuring out how to configure the GPU runner to use the access token.

@shivam15s (Collaborator):

Seems like using gated models for tests isn't a good idea. I'm looking into using fake/mirrored models and processors for the tests.
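
One version of that idea, sketched here with a text-only `LlamaConfig` for brevity (the actual tests would need an analogous tiny multimodal config): instantiate a randomly initialized mini-model directly from a config, so nothing gated ever has to be downloaded.

```python
from transformers import LlamaConfig, LlamaForCausalLM

# A tiny, randomly initialized model: no checkpoint (gated or otherwise) is fetched.
config = LlamaConfig(
    vocab_size=256,
    hidden_size=64,
    intermediate_size=128,
    num_hidden_layers=2,
    num_attention_heads=4,
    num_key_value_heads=2,
)
model = LlamaForCausalLM(config)
```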

@tyler-romero (Contributor, Author):

Any update, @shivam15s?

@shivam15s (Collaborator):

Hey @tyler-romero, I created a PR on your branch. Let me know if the proposed solution makes sense.

@tyler-romero (Contributor, Author):

Really nice solution, thank you!

@shivam15s (Collaborator):

Seems like a convergence test is failing because of a precision issue.
Could you please look into it? Thanks!

@tyler-romero (Contributor, Author):

Now that you've made custom tokenizers for the multimodal tests, I was able to reduce the vocab sizes of the mini-models (bringing them in line with the vocab sizes of the other mini-models), and the convergence tests are now passing.

@shivam15s (Collaborator) left a review:

Great work, lgtm

@tyler-romero (Contributor, Author):

Sounds good, feel free to merge (only collaborators/linkedin employees can merge) - and please add yourself as a contributor to this change if GitHub doesn't do that automatically.

@ByronHsu (Collaborator):

Thanks to you both for driving this!!

@ByronHsu merged commit 9b10f48 into linkedin:main on Oct 10, 2024
2 checks passed