Verify chatglm3 6b #1119

Open
wants to merge 39 commits into base: master
Conversation

Aniruddha521

I proceeded as mentioned in task #259 with the following changes.
1) Extended the nightly_models list in the file openvino.genai/tests/python_tests/ov_genai_test_utils.py

nightly_models = [
        "TinyLlama/TinyLlama-1.1B-Chat-v1.0",
        "facebook/opt-125m",
        "microsoft/phi-1_5",
        "microsoft/phi-2",
        "THUDM/chatglm2-6b",
        "THUDM/chatglm3-6b", # no beam_search
        "Qwen/Qwen2-0.5B-Instruct",
        "Qwen/Qwen-7B-Chat",
        "Qwen/Qwen1.5-7B-Chat",
        "argilla/notus-7b-v1",
        "HuggingFaceH4/zephyr-7b-beta",
        "ikala/redpajama-3b-chat",
        "mistralai/Mistral-7B-v0.1",

2) Added the model to openvino.genai/.github/workflows/causal_lm_cpp.yml: the jobs cpp-greedy_causal_lm-Chatglm3-6b and cpp-prompt_lookup_decoding_lm-ubuntu-Chatglm3-6b.

3) Extended the supported_model_list and added a note.

Note

beam_search_causal_lm is not supported for the ChatGLM3-6B model.

@github-actions github-actions bot added the category: sampling (Sampling / Decoding algorithms) and category: GHA (CI based on Github actions) labels Oct 31, 2024
A:' > ./prompt.txt

./build/samples/cpp/prompt_lookup_decoding_lm/prompt_lookup_decoding_lm ./TinyLlama-1.1B-Chat-v1.0/ "$(<prompt.txt)" > predictions_prompt_lookup.txt
./build/samples/cpp/greedy_causal_lm/greedy_causal_lm ./TinyLlama-1.1B-Chat-v1.0/ "$(<prompt.txt)" > predictions_greedy.txt
Collaborator

Test chatglm3-6b instead.

Author

That's a mistake, thank you for pointing it out. In my updated commit I have removed cpp-prompt_lookup_decoding_lm-ubuntu-Chatglm3-6b and just added the necessary part to cpp-prompt_lookup_decoding_lm-ubuntu.

assert predicted_greedy == predicted_prompt_lookup
"
echo "Prompt lookup" passed
- name: run and compare (model with seq_length_axis = 1)
Collaborator

It seems this step is just a copy of another one. Remove it.

Collaborator

https://github.com/openvinotoolkit/openvino.genai/blob/master/src/docs/SUPPORTED_MODELS.md isn't updated. Please extend the note below the table: optimum-cli requires --task text-generation-with-past for THUDM/chatglm3-6b, and beam search isn't supported for that model.
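For illustration, such a note could read roughly as follows (a sketch only; the exact wording and its placement below the table are up to the author):

    Note: THUDM/chatglm3-6b must be exported by optimum-cli with --task text-generation-with-past, and beam search is not supported for this model.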

@@ -25,6 +25,7 @@ def get_models_list():
"microsoft/phi-1_5",
"microsoft/phi-2",
"THUDM/chatglm2-6b",
"THUDM/chatglm3-6b", # no beam_search
Collaborator

Does every test pass with THUDM/chatglm3-6b? If not, please, mark the failing tests to be skipped.

Collaborator

Skips must happen only for that particular model.
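For illustration, a per-model skip in the Python tests could look roughly like this (a sketch only: get_models_list exists in ov_genai_test_utils.py, but the test name, the parametrization shape, and the run_beam_search_check helper are hypothetical):

import pytest
from ov_genai_test_utils import get_models_list

@pytest.mark.parametrize("model_id", get_models_list())
def test_beam_search_decoding(model_id):  # hypothetical test name
    if model_id == "THUDM/chatglm3-6b":
        # Skip only this particular model; every other model still runs the check.
        pytest.skip("beam_search is not supported for THUDM/chatglm3-6b")
    run_beam_search_check(model_id)  # hypothetical helper containing the actual assertions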

@@ -274,6 +274,41 @@ jobs:
&& call .\ov\setupvars.bat
&& python samples\python\greedy_causal_lm\lora.py .\TinyLlama\TinyLlama-1.1B-intermediate-step-1431k-3T\ adapter_model.safetensors "How to create a table with two columns, one of them has type float, another one has type int?"

cpp-greedy_causal_lm-Chatglm3-6b:
runs-on: ubuntu-24.04
Collaborator

Suggested change
- runs-on: ubuntu-24.04
+ runs-on: ubuntu-20.04-4-cores

@Aniruddha521
Author

@Wovchena can you please guide me on what could have gone wrong for cpp-greedy_causal_lm-Chatglm3-6b and cpp-prompt_lookup_decoding_lm-ubuntu? Both greedy_causal_lm and prompt_lookup_decoding_lm work for chatglm3-6b on my local machine.

@mlukasze mlukasze linked an issue Nov 4, 2024 that may be closed by this pull request
optimum-cli export openvino --trust-remote-code --weight-format fp16 --model THUDM/chatglm3-6b chatglm3-6b --task text-generation-with-past
- run: >
. ./ov/setupvars.sh
&& timeout 2m ./build/samples/cpp/greedy_causal_lm/greedy_causal_lm ./chatglm3-6b/ 69 | diff <(timeout 2m samples/python/greedy_causal_lm/greedy_causal_lm.py ./chatglm3-6b/ 69) -
Collaborator

@Wovchena can you please guide me on what could have gone wrong for cpp-greedy_causal_lm-Chatglm3-6b and cpp-prompt_lookup_decoding_lm-ubuntu? Both greedy_causal_lm and prompt_lookup_decoding_lm work for chatglm3-6b on my local machine.

The samples work, but the comparison fails, which means the samples produced different results. Try running the samples again and pay attention to their output.

By the way, here's a newer and more verbose implementation of the same idea using tee and timeout-minutes. It is also split into multiple steps to make it clear which command fails. You can rewrite your tests in a similar way to simplify reasoning about what happened.

- name: Run visual_language_chat C++ sample - MiniCPM-V-2_6
  run: >
    source ./ov/setupvars.sh
    && ./build/samples/cpp/visual_language_chat/visual_language_chat ./MiniCPM-V-2_6/ ./images/
    <<< $'Describe the images?' | tee cpp.txt
  timeout-minutes: 2
- run: diff cpp.txt ref.txt
- name: Run visual_language_chat Python sample - MiniCPM-V-2_6
  run: >
    source ./ov/setupvars.sh
    && ./samples/python/visual_language_chat/visual_language_chat.py ./MiniCPM-V-2_6/ ./images/
    <<< $'Describe the images?' | tee py.txt
  env:
    PYTHONPATH: "./build/"
- run: diff py.txt ref.txt
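Applied to this PR, the chatglm3-6b greedy C++ vs Python comparison could be laid out in the same multi-step style, roughly like this (a sketch only; the step names are assumptions, while the paths and the prompt 69 mirror the commands already used in this thread):

- name: Run greedy_causal_lm C++ sample - chatglm3-6b
  run: >
    source ./ov/setupvars.sh
    && ./build/samples/cpp/greedy_causal_lm/greedy_causal_lm ./chatglm3-6b/ 69 | tee cpp.txt
  timeout-minutes: 2
- name: Run greedy_causal_lm Python sample - chatglm3-6b
  run: >
    source ./ov/setupvars.sh
    && samples/python/greedy_causal_lm/greedy_causal_lm.py ./chatglm3-6b/ 69 | tee py.txt
  env:
    PYTHONPATH: "./build/"
  timeout-minutes: 2
- run: diff cpp.txt py.txt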

@ilya-lavrenov ilya-lavrenov removed the category: sampling (Sampling / Decoding algorithms) label Nov 5, 2024
@github-actions github-actions bot added the category: sampling (Sampling / Decoding algorithms) label Nov 5, 2024
@github-actions github-actions bot added the category: tokenizers (Tokenizer class or submodule update) label Nov 5, 2024
….genai into verify_chatglm3-6b

merging-remote-and-local
@Aniruddha521
Author

@Wovchena
I used the following commands, but it doesn't seem to be working:

git pull --rebase origin verify-chatglm-6b
git add resolved-file
git rebase --continue
git push origin verify-chatglm-6b --force

Can you suggest anything I may have done wrong or missed?
Additionally, my forked repo is up to date with openvinotoolkit:master.
Please help me with this matter!

@Wovchena
Collaborator

You can check out thirdparty/openvino_tokenizers from master and push to your branch to align the tokenizers with master:

git checkout master thirdparty/openvino_tokenizers/
git add thirdparty/openvino_tokenizers/
git commit -m tokenizers
git push

@Wovchena
Collaborator

Apparently that was the wrong master. Try git checkout a0268cd5c5fe71ccbc4dc773b502075867c859fe thirdparty/openvino_tokenizers instead of git checkout master thirdparty/openvino_tokenizers/.

@Aniruddha521
Author

@Wovchena
Why are prompt_lookup and greedy generating different output for chatglm3? Here:

- name: run and compare
        run: |
          source ./ov/setupvars.sh

          echo 'Code:```python
          def add(a, b):
              return a + b
          ```
          Question: Can you please add 2 and 3
          A:' > ./prompt.txt

          ./build/samples/cpp/prompt_lookup_decoding_lm/prompt_lookup_decoding_lm ./chatglm3-6b/ "$(<prompt.txt)" > predictions_prompt_lookup.txt
          ./build/samples/cpp/greedy_causal_lm/greedy_causal_lm ./chatglm3-6b/ "$(<prompt.txt)" > predictions_greedy.txt
          diff predictions_prompt_lookup.txt predictions_greedy.txt
          python -c "
          with open('predictions_greedy.txt', 'r') as f:
              predicted_greedy = f.readline()
          with open('predictions_prompt_lookup.txt', 'r') as f:
              predicted_prompt_lookup = f.readline()
          print(predicted_greedy)
          print(predicted_prompt_lookup)
          "
          echo "Prompt lookup" passed

Based on the error raised, I think it is due to the different outputs generated by greedy and prompt lookup.
But it is probably generating the same results for the Qwen model, since that one passes the tests.

@Wovchena
Collaborator

I don't know. That is the point of this task: to test whether chatglm is fine. Apparently it's not. You can print the results to see them.

@Aniruddha521
Author

@Wovchena
As far as I can see, cpp-greedy_causal_lm-Chatglm3-6b is failing because the outputs generated by the Python sample and the C++ sample are different, whereas cpp-prompt_lookup_decoding_lm-ubuntu is failing because the prompt lookup and greedy generated texts are different.
In both cases I am using the diff command to compare the generated text, which may be causing the assertion error. So should I only remove the comparison part, or remove the whole cpp-greedy_causal_lm-Chatglm3-6b and cpp-prompt_lookup_decoding_lm-ubuntu jobs?
I also don't know why the tests below are failing:

  1. Linux (Ubuntu 20.04, Python 3.9) / OpenVINO genai extension (cmake + wheel) (pull_request) Failing after 27m
  2. Windows (VS 2019, Python 3.11) / OpenVINO genai extension (cmake + wheel) (pull_request)
  3. macOS (12, Python 3.9) / OpenVINO genai extension (cmake + wheel) (pull_request)

I also proceeded as you mentioned earlier:

git checkout a0268cd5c5fe71ccbc4dc773b502075867c859fe thirdparty/openvino_tokenizers
git add thirdparty/openvino_tokenizers/
git commit --amend
git push

@Wovchena
Collaborator

You need to find out why C++ and Python greedy produce different outputs and fix it. Other model runs are aligned, which means the problem is not in the samples themselves.

This PR diff shows that you've modified thirdparty/openvino_tokenizers. Eliminate that diff; master's version of openvino_tokenizers should be kept. Maybe this is the reason for the Linux (Ubuntu 20.04, Python 3.9) / OpenVINO genai extension (cmake + wheel) (pull_request) failure.
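For illustration, one way to check that the submodule diff is gone and to align it if not (standard git commands; the upstream remote name is an assumption for the openvinotoolkit/openvino.genai remote):

git fetch upstream
git diff upstream/master -- thirdparty/openvino_tokenizers    # should print nothing once aligned
git checkout upstream/master -- thirdparty/openvino_tokenizers
git add thirdparty/openvino_tokenizers
git commit -m "Align openvino_tokenizers with master"
git push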

Labels
category: GHA (CI based on Github actions), category: sampling (Sampling / Decoding algorithms), category: tokenizers (Tokenizer class or submodule update)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Good First Issue]: Verify chatglm3-6b with GenAI text_generation
3 participants