forked from NVIDIA/Megatron-LM
Issues: bigcode-project/Megatron-LM
Issues list
RuntimeError: Error building extension 'scaled_upper_triang_masked_softmax_cuda'
#61
opened Jun 14, 2023 by
KOVVURISATYANARAYANAREDDY
ValueError: Invalid attention arguments: AttnType.self_attn, None
#59
opened Jun 6, 2023 by
chen-lee-li
Support interleaved pipeline schedules in checkpoint merging tools
enhancement
New feature or request
#45
opened Mar 29, 2023 by
RaymondLi0
Improve loading of the data-paths
enhancement
New feature or request
#38
opened Mar 21, 2023 by
RaymondLi0
OOM on preprocessing dataset with large number of documents
bug
Something isn't working
#34
opened Mar 10, 2023 by
RaymondLi0
Conversion of Huggingface bigcode/santacoder to Nvidia Triton Inference server
#17
opened Jan 24, 2023 by
michaelfeil
Timeout on creating the index mappings
bug
Something isn't working
#15
opened Jan 4, 2023 by
RaymondLi0