Skip to content
@bigscience-workshop

BigScience Workshop

Research workshop on large language models - The Summer of Language Models 21

Popular repositories Loading

  1. petals petals Public

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    Python 9.3k 523

  2. promptsource promptsource Public

    Toolkit for creating, sharing and using natural language prompts.

    Python 2.7k 354

  3. Megatron-DeepSpeed Megatron-DeepSpeed Public

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    Python 1.3k 220

  4. bigscience bigscience Public

    Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

    Shell 981 101

  5. xmtf xmtf Public

    Crosslingual Generalization through Multitask Finetuning

    Jupyter Notebook 518 38

  6. biomedical biomedical Public

    Tools for curating biomedical training data for large-scale language modeling

    Python 461 116

Repositories

Showing 10 of 35 repositories
  • biomedical Public

    Tools for curating biomedical training data for large-scale language modeling

    bigscience-workshop/biomedical’s past year of commit activity
    Python 461 116 162 (6 issues need help) 16 Updated Dec 9, 2024
  • data_tooling Public

    Tools for managing datasets for governance and training.

    bigscience-workshop/data_tooling’s past year of commit activity
    HTML 79 Apache-2.0 48 138 (2 issues need help) 3 Updated Oct 28, 2024
  • xmtf Public

    Crosslingual Generalization through Multitask Finetuning

    bigscience-workshop/xmtf’s past year of commit activity
    Jupyter Notebook 518 Apache-2.0 38 11 0 Updated Sep 22, 2024
  • petals Public

    🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading

    bigscience-workshop/petals’s past year of commit activity
    Python 9,296 MIT 523 87 (9 issues need help) 18 Updated Sep 7, 2024
  • bigscience Public

    Central place for the engineering/scaling WG: documentation, SLURM scripts and logs, compute environment and data.

    bigscience-workshop/bigscience’s past year of commit activity
    Shell 981 101 13 8 Updated Jul 29, 2024
  • Megatron-DeepSpeed Public

    Ongoing research training transformer language models at scale, including: BERT & GPT-2

    bigscience-workshop/Megatron-DeepSpeed’s past year of commit activity
    Python 1,342 220 74 (10 issues need help) 45 Updated Mar 20, 2024
  • multilingual-modeling Public

    BLOOM+1: Adapting BLOOM model to support a new unseen language

    bigscience-workshop/multilingual-modeling’s past year of commit activity
    Python 70 Apache-2.0 16 13 6 Updated Mar 2, 2024
  • promptsource Public

    Toolkit for creating, sharing and using natural language prompts.

    bigscience-workshop/promptsource’s past year of commit activity
    Python 2,720 Apache-2.0 354 11 32 Updated Oct 23, 2023
  • massive-probing-framework Public Forked from AIRI-Institute/Probing_framework

    Framework for BLOOM probing

    bigscience-workshop/massive-probing-framework’s past year of commit activity
    Python 8 10 0 0 Updated Oct 17, 2023
  • bigscience-workshop/architecture-objective’s past year of commit activity
    Python 93 Apache-2.0 314 4 5 Updated Jul 25, 2023