[model] add support for mixtral moe model #128

Open
wants to merge 14 commits into base: main

Conversation

@936187425 (Collaborator) commented Apr 16, 2024

Adds support for the Mixtral-8x7B-v0.1 mixture-of-experts (MoE) model.
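For context on what the MoE layer adds, here is a minimal sketch of Mixtral-style top-2 expert routing. This is a libtorch-flavored C++ illustration; `sparse_moe_forward`, `router_w`, and the expert callables are hypothetical names, not this PR's actual code.

```cpp
#include <torch/torch.h>

#include <functional>
#include <vector>

// Illustrative top-2 routing for a Mixtral-style sparse MoE layer.
// Each expert is modeled as a callable mapping [n_tokens, dim] -> [n_tokens, dim].
torch::Tensor sparse_moe_forward(
    const torch::Tensor& hidden,    // [n_tokens, dim]
    const torch::Tensor& router_w,  // [n_experts, dim] gating weight
    const std::vector<std::function<torch::Tensor(const torch::Tensor&)>>& experts,
    int64_t top_k = 2) {
  // Router logits for every expert, then keep only the top-k experts per token.
  auto logits = torch::matmul(hidden, router_w.t());             // [n_tokens, n_experts]
  auto topk = torch::topk(logits, top_k, /*dim=*/-1);
  auto weights = torch::softmax(std::get<0>(topk), /*dim=*/-1);  // renormalized over chosen experts
  auto indices = std::get<1>(topk);                              // [n_tokens, top_k]

  auto output = torch::zeros_like(hidden);
  for (int64_t e = 0; e < static_cast<int64_t>(experts.size()); ++e) {
    // Tokens that routed to expert e in any of their top-k slots.
    auto mask = (indices == e);
    auto token_idx = mask.any(/*dim=*/-1).nonzero().squeeze(-1);
    if (token_idx.numel() == 0) {
      continue;  // this expert receives no tokens in this batch
    }
    auto expert_out = experts[e](hidden.index_select(0, token_idx));
    // Routing weight each selected token assigned to expert e.
    auto w = weights.index_select(0, token_idx)
                 .masked_select(mask.index_select(0, token_idx))
                 .unsqueeze(-1);
    output.index_add_(0, token_idx, expert_out * w);  // weighted sum of expert outputs
  }
  return output;
}
```

The renormalized softmax over only the selected top-k logits is what keeps the layer sparse: each token runs through just two expert FFNs per forward pass.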

// Per-rank head counts when the attention heads are sharded across
// world_size ranks (tensor parallelism).
const int64_t head_dim = args.head_dim();
const int64_t n_kv_heads = args.n_kv_heads().value_or(n_heads);
const int64_t n_local_heads = n_heads / world_size;
const int64_t n_local_kv_heads = n_kv_heads / world_size;
Collaborator commented:

Just a heads up: I added support for MQA and GQA, so please also include that support in your change. FYI dff774e

You can learn about MQA and GQA from this blog: https://iamshobhitagarwal.medium.com/navigating-the-attention-landscape-mha-mqa-and-gqa-decoded-288217d0a7d1
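For readers following along, here is a minimal sketch of how GQA/MQA changes the attention shapes. It is illustrative libtorch-style C++; `grouped_query_attention` and its parameters are hypothetical names, not the code in dff774e.

```cpp
#include <torch/torch.h>

#include <cmath>

// Illustrative grouped-query attention over a single sequence (no KV cache,
// no mask) showing how n_kv_heads interacts with n_heads.
torch::Tensor grouped_query_attention(torch::Tensor q,  // [n_tokens, n_heads, head_dim]
                                      torch::Tensor k,  // [n_tokens, n_kv_heads, head_dim]
                                      torch::Tensor v,  // [n_tokens, n_kv_heads, head_dim]
                                      int64_t n_heads,
                                      int64_t n_kv_heads,
                                      int64_t head_dim) {
  // Each KV head serves a group of query heads:
  //   MHA: group_size == 1, GQA: 1 < group_size < n_heads, MQA: group_size == n_heads.
  const int64_t group_size = n_heads / n_kv_heads;
  if (group_size > 1) {
    // Expand KV heads so every query head has a matching K/V head.
    k = k.repeat_interleave(group_size, /*dim=*/1);
    v = v.repeat_interleave(group_size, /*dim=*/1);
  }
  // Scaled dot-product attention per head: [n_heads, n_tokens, n_tokens].
  auto scores = torch::matmul(q.transpose(0, 1),
                              k.transpose(0, 1).transpose(-2, -1)) /
                std::sqrt(static_cast<double>(head_dim));
  auto probs = torch::softmax(scores, /*dim=*/-1);
  auto out = torch::matmul(probs, v.transpose(0, 1));  // [n_heads, n_tokens, head_dim]
  return out.transpose(0, 1).reshape({-1, n_heads * head_dim});  // [n_tokens, n_heads * head_dim]
}
```

Note that the per-rank split in the snippet above (n_kv_heads / world_size) implicitly assumes n_kv_heads divides evenly by world_size, which is worth keeping in mind once n_kv_heads is small under GQA/MQA.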

@936187425 changed the title from "[model] added support for mixtral moe model" to "[model] add support for mixtral moe model" on May 16, 2024