From cf43c82816399ac276cb5d0741eecf1b345a9321 Mon Sep 17 00:00:00 2001
From: pytorchbot
Date: Fri, 20 Dec 2024 18:56:59 +0000
Subject: [PATCH] auto-generating sphinx docs
---
main/_sources/tutorials/e2e_flow.rst.txt | 320 +++++++++----------
main/objects.inv | Bin 11857 -> 11856 bytes
main/searchindex.js | 2 +-
main/tutorials/e2e_flow.html | 374 +++++++++++------------
4 files changed, 312 insertions(+), 384 deletions(-)
diff --git a/main/_sources/tutorials/e2e_flow.rst.txt b/main/_sources/tutorials/e2e_flow.rst.txt
index a6c1d561b..21571d2e3 100644
--- a/main/_sources/tutorials/e2e_flow.rst.txt
+++ b/main/_sources/tutorials/e2e_flow.rst.txt
@@ -7,7 +7,7 @@ End-to-End Workflow with torchtune
In this tutorial, we'll walk through an end-to-end example of how you can fine-tune,
evaluate, optionally quantize and then run generation with your favorite LLM using
torchtune. We'll also go over how you can use some popular tools and libraries
-from the community seemlessly with torchtune.
+from the community seamlessly with torchtune.
.. grid:: 2
@@ -25,49 +25,32 @@ from the community seemlessly with torchtune.
:ref:`checkpoints `
-Overview
---------
+Finetune your model
+-------------------
-Fine-tuning an LLM is usually only one step in a larger workflow. An example workflow that you
-might have can look something like this:
+First, let's download a model using the tune CLI. The following command will download the `Llama3.2 3B Instruct `_
+model from the Hugging Face Hub and save it to the local filesystem. The Hugging Face repo contains both the original
+weights (``consolidated.00.pth``) and weights compatible with the `from_pretrained() `_ API (``*.safetensors``).
+We don't need both, so we'll ignore the original weights when downloading.
-- Download a popular model from `HF Hub `_
-- Fine-tune the model using a relevant fine-tuning technique. The exact technique used
- will depend on factors such as the model, amount and nature of training data, your hardware
- setup and the end task for which the model will be used
-- Evaluate the model on some benchmarks to validate model quality
-- Run some generations to make sure the model output looks reasonable
-- Quantize the model for efficient inference
-- [Optional] Export the model for specific environments such as inference on a mobile phone
+.. code-block:: text
+
-In this tutorial, we'll cover how you can use torchtune for all of the above, leveraging
-integrations with popular tools and libraries from the ecosystem.
-
-We'll use the Llama-3.2-3B-Instruct model for this tutorial. You can find a complete set of models supported
-by torchtune `here `_.
-
-|
-
-Download Llama-3.2-3B-Instruct
-------------------------------
-
-For more information on checkpoint formats and how these are handled in torchtune, take a look at
-this tutorial on :ref:`checkpoints `.
-
-To download the HF format Llama-3.2-3B-Instruct, we'll use the tune CLI.
-
-.. code-block:: bash
-
- tune download meta-llama/Llama-3.2-3B-Instruct \
- --output-dir /tmp/Llama-3.2-3B-Instruct \
- --ignore-patterns "original/consolidated.00.pth"
-
-Make a note of ````, we'll use this many times in this tutorial.
+ $ tune download meta-llama/Llama-3.2-3B-Instruct --ignore-patterns "original/consolidated.00.pth"
+ Successfully downloaded model repo and wrote to the following locations:
+ /tmp/Llama-3.2-3B-Instruct/.cache
+ /tmp/Llama-3.2-3B-Instruct/.gitattributes
+ /tmp/Llama-3.2-3B-Instruct/LICENSE.txt
+ /tmp/Llama-3.2-3B-Instruct/README.md
+ /tmp/Llama-3.2-3B-Instruct/USE_POLICY.md
+ /tmp/Llama-3.2-3B-Instruct/config.json
+ /tmp/Llama-3.2-3B-Instruct/generation_config.json
+ /tmp/Llama-3.2-3B-Instruct/model-00001-of-00002.safetensors
+ ...
-|
+.. note::
-Finetune the model using LoRA
------------------------------
+ For a list of all other models you can finetune out-of-the-box with torchtune, check out
+ our :ref:`models page`.
For this tutorial, we'll fine-tune the model using LoRA. LoRA is a parameter-efficient fine-tuning
technique which is especially helpful when you don't have a lot of GPU memory to play with. LoRA
@@ -77,22 +60,11 @@ fine-tune a Llama-3.2-3B-Instruct model with LoRA in less than 16GB of GPU memor
RTX 3090/4090. For more information on how to use LoRA, take a look at our
:ref:`LoRA Tutorial `.
-We'll fine-tune using our
-`single device LoRA recipe `_
-and use the standard settings from the
-`default config `_.
-
-This will fine-tune our model using a ``batch_size=2`` and ``dtype=bfloat16``. With these settings the model
-should have a peak memory usage of ~16GB and total training time of around two hours for each epoch.
-We'll need to make some changes to the config to make sure our recipe can access the
-right checkpoints.
-
Let's look for the right config for this use case by using the tune CLI.
-.. code-block:: bash
-
- tune ls
+.. code-block:: text
+
+ $ tune ls
RECIPE CONFIG
full_finetune_single_device llama2/7B_full_low_memory
code_llama2/7B_full_low_memory
@@ -125,105 +97,102 @@ Let's look for the right config for this use case by using the tune CLI.
...
-For this tutorial we'll use the ``llama3_2/3B_lora_single_device`` config.
-
-The config already points to the HF Checkpointer and the right checkpoint files.
-All we need to do is update the checkpoint directory for both the model and the
-tokenizer. Let's do this using the overrides in the tune CLI while starting training!
-
-
-.. code-block:: bash
-
- tune run lora_finetune_single_device --config llama3_2/3B_lora_single_device
-
-
-Preparing your artifacts for inference
---------------------------------------
+We'll fine-tune using our
+:ref:`single device LoRA recipe `
+and use the standard settings from the
+`default config `_.
-Congrats for getting this far! You have loaded your weights, trained your model, now it's time to visualize
-the outputs. A simple way of doing this is by running `tree -a path/to/outputdir`, which should show something like the tree below.
-There are 4 types of folders:
+This will fine-tune our model with ``batch_size=4`` and ``dtype=bfloat16``. With these settings, the model
+should have a peak memory usage of ~16GB and a total training time of around 2-3 hours per epoch.
+
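+If you'd like to tweak any of these settings, the tune CLI lets you override config fields directly from the
+command line with ``key=value`` arguments. As a hypothetical example (not something this tutorial needs), you
+could bump the batch size:
+
+.. code-block:: text
+
+   $ tune run lora_finetune_single_device --config llama3_2/3B_lora_single_device batch_size=8
+
+Now let's kick off training with the default settings:
+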
+.. code-block:: text
+
+ $ tune run lora_finetune_single_device --config llama3_2/3B_lora_single_device
+ Setting manual seed to local seed 3977464327. Local seed is seed + rank = 3977464327 + 0
+ Hint: enable_activation_checkpointing is True, but enable_activation_offloading isn't. Enabling activation offloading should reduce memory further.
+ Writing logs to /tmp/torchtune/llama3_2_3B/lora_single_device/logs/log_1734708879.txt
+ Model is initialized with precision torch.bfloat16.
+ Memory stats after model init:
+ GPU peak memory allocation: 6.21 GiB
+ GPU peak memory reserved: 6.27 GiB
+ GPU peak memory active: 6.21 GiB
+ Tokenizer is initialized from file.
+ Optimizer and loss are initialized.
+ Loss is initialized.
+ Dataset and Sampler are initialized.
+ Learning rate scheduler is initialized.
+ Profiling disabled.
+ Profiler config after instantiation: {'enabled': False}
+ 1|3|Loss: 1.943998098373413: 0%| | 3/1617 [00:21<3:04:47, 6.87s/it]
+
+Congrats on training your model! Let's take a look at the artifacts torchtune produced. A simple way to do this is to run :code:`tree -a path/to/outputdir`, which should show something like the tree below.
+There are 3 types of folders:
1) **recipe_state**: Holds ``recipe_state.pt`` with the information necessary to restart training from the last intermediate epoch. For more information, please check our deep-dive :ref:`Checkpointing in torchtune `.
-2) **logs**: Defined in your config in metric_logger;
-3) **epoch_{}**: Contains your new trained model weights plus all original files of the model, except the checkpoints, making it easy for you to choose an specific epoch to run inference on or push to a model hub;
-
-.. code-block:: bash
-
- >>> tree -a /tmp/torchtune/llama3_2_3B/lora_single_device
- /tmp/torchtune/llama3_2_3B/lora_single_device
- ├── epoch_0
- │ ├── adapter_config.json
- │ ├── adapter_model.pt
- │ ├── adapter_model.safetensors
- │ ├── config.json
- │ ├── ft-model-00001-of-00002.safetensors
- │ ├── ft-model-00002-of-00002.safetensors
- │ ├── generation_config.json
- │ ├── LICENSE.txt
- │ ├── model.safetensors.index.json
- │ ├── original
- │ │ ├── orig_params.json
- │ │ ├── params.json
- │ │ └── tokenizer.model
- │ ├── original_repo_id.json
- │ ├── README.md
- │ ├── special_tokens_map.json
- │ ├── tokenizer_config.json
- │ ├── tokenizer.json
- │ └── USE_POLICY.md
- ├── epoch_1
- │ ├── adapter_config.json
- │ ├── adapter_model.pt
- │ ├── adapter_model.safetensors
- │ ├── config.json
- │ ├── ft-model-00001-of-00002.safetensors
- │ ├── ft-model-00002-of-00002.safetensors
- │ ├── generation_config.json
- │ ├── LICENSE.txt
- │ ├── model.safetensors.index.json
- │ ├── original
- │ │ ├── orig_params.json
- │ │ ├── params.json
- │ │ └── tokenizer.model
- │ ├── original_repo_id.json
- │ ├── README.md
- │ ├── special_tokens_map.json
- │ ├── tokenizer_config.json
- │ ├── tokenizer.json
- │ └── USE_POLICY.md
- ├── logs
- │ └── log_1734652101.txt
- └── recipe_state
- └── recipe_state.pt
+2) **logs**: Contains all the logging output from your training run: loss, memory, exceptions, etc.
+3) **epoch_{}**: Contains your trained model weights plus model metadata. If running inference or pushing to a model hub, you should use this folder directly.
+
+
+.. code-block:: text
+
+ $ tree -a /tmp/torchtune/llama3_2_3B/lora_single_device
+ /tmp/torchtune/llama3_2_3B/lora_single_device
+ ├── epoch_0
+ │ ├── adapter_config.json
+ │ ├── adapter_model.pt
+ │ ├── adapter_model.safetensors
+ │ ├── config.json
+ │ ├── ft-model-00001-of-00002.safetensors
+ │ ├── ft-model-00002-of-00002.safetensors
+ │ ├── generation_config.json
+ │ ├── LICENSE.txt
+ │ ├── model.safetensors.index.json
+ │ ├── original
+ │ │ ├── orig_params.json
+ │ │ ├── params.json
+ │ │ └── tokenizer.model
+ │ ├── original_repo_id.json
+ │ ├── README.md
+ │ ├── special_tokens_map.json
+ │ ├── tokenizer_config.json
+ │ ├── tokenizer.json
+ │ └── USE_POLICY.md
+ ├── epoch_1
+ │ ├── adapter_config.json
+ │ ...
+ ├── logs
+ │ └── log_1734652101.txt
+ └── recipe_state
+ └── recipe_state.pt
Let's understand the files:
-- `adapter_model.safetensors` and `adapter_model.pt` are your LoRA trained adapter weights. We save a duplicated .pt version of it to facilitate resuming from checkpoint.
-- `ft-model-{}-of-{}.safetensors` are your trained full model weights (not adapters). When LoRA finetuning, these are only present if we set ``save_adapter_weights_only=False``. In that case, we merge the merged base model with trained adapters, making inference easier.
-- `adapter_config.json` is used by Huggingface PEFT when loading an adapter (more on that later);
-- `model.safetensors.index.json` is used by Huggingface .from_pretrained when loading the model weights (more on that later)
+- ``adapter_model.safetensors`` and ``adapter_model.pt`` are your trained LoRA adapter weights. We save a duplicate ``.pt`` version to make resuming from a checkpoint easier.
+- ``ft-model-{}-of-{}.safetensors`` are your trained full model weights (not adapters). When LoRA finetuning, these are only present if we set ``save_adapter_weights_only=False``. In that case, we merge the base model with the trained adapters, making inference easier.
+- ``adapter_config.json`` is used by Hugging Face PEFT when loading an adapter (more on that later);
+- ``model.safetensors.index.json`` is used by Hugging Face ``from_pretrained()`` when loading the model weights (more on that later, and see the snippet just below);
- All other files were originally in the ``checkpoint_dir``. They are automatically copied during training. Files over 100MiB ending in ``.safetensors``, ``.pth``, ``.pt``, or ``.bin`` are skipped, keeping the copy lightweight.
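+
+To make the index file concrete, here is a quick way to peek inside it. This is a plain-Python sketch; the path assumes the output directory used above:
+
+.. code-block:: python
+
+   import json
+
+   index_path = "/tmp/torchtune/llama3_2_3B/lora_single_device/epoch_0/model.safetensors.index.json"
+   with open(index_path) as f:
+       index = json.load(f)
+
+   # "weight_map" maps each tensor name to the shard file that stores it
+   shards = set(index["weight_map"].values())
+   print(f"{len(index['weight_map'])} tensors across {len(shards)} shard files")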
-|
+Evaluate your model
+-------------------
-.. _eval_harness_label:
+We've fine-tuned a model. But how well does it really do? Let's find out through some structured evaluation and by playing around with it.
-Run Evaluation using EleutherAI's Eval Harness
-----------------------------------------------
+.. _eval_harness_label:
-We've fine-tuned a model. But how well does this model really do? Let's run some Evaluations!
+Run evals using EleutherAI's Eval Harness
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
.. TODO (SalmanMohammadi) ref eval recipe docs
torchtune integrates with
`EleutherAI's evaluation harness `_.
An example of this is available through the
-``eleuther_eval`` recipe. In this tutorial, we're going to directly use this recipe by
-modifying its associated config ``eleuther_evaluation.yaml``.
+`eleuther_eval `_ recipe. In this tutorial, we're going to directly use this recipe by
+modifying its associated config `eleuther_evaluation.yaml `_.
.. note::
- For this section of the tutorial, you should first run :code:`pip install lm_eval==0.4.*`
+ For this section of the tutorial, you should first run :code:`pip install "lm_eval>=0.4.5"`
to install the EleutherAI evaluation harness.
Since we plan to update all of the checkpoint files to point to our fine-tuned checkpoints,
@@ -231,10 +200,10 @@ let's first copy over the config to our local working directory so we can make c
.. code-block:: bash
- tune cp eleuther_evaluation ./custom_eval_config.yaml \
+ $ tune cp eleuther_evaluation ./custom_eval_config.yaml
+ Copied file to custom_eval_config.yaml
-Then, in your config, you only need to replace two fields: ``output_dir`` and ``checkpoint_files``. Notice
-that we are using the merged weights, and not the LoRA adapters.
+Notice that we are using the merged weights, and not the LoRA adapters.
.. code-block:: yaml
@@ -281,19 +250,16 @@ For this tutorial we'll use the `truthfulqa_mc2 `_.
-
Let's first copy over the config to our local working directory so we can make changes.
-.. code-block:: bash
+.. code-block:: text
+
- tune cp generation ./custom_generation_config.yaml
+ $ tune cp generation ./custom_generation_config.yaml
+ Copied file to custom_generation_config.yaml
Let's modify ``custom_generation_config.yaml`` to include the following changes. Again, you only need
to replace two fields: ``output_dir`` and ``checkpoint_files``.
@@ -362,27 +328,17 @@ default settings for sampling with ``top_k=300`` and a
sampling are computed. We recommend inspecting the model with these before playing around with
these parameters.
-.. code-block:: bash
-
- tune run generate --config ./custom_generation_config.yaml \
- prompt="tell me a joke. "
-
-
-Once generation is complete, you'll see the following in the logs.
-
-
-.. code-block::
+.. code-block:: text
+
+ $ tune run generate --config ./custom_generation_config.yaml prompt="tell me a joke. "
Tell me a joke. Here's a joke for you:
What do you call a fake noodle?
An impasta!
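+
+You can also tweak the sampling behavior straight from the command line. A hypothetical example, assuming your generation config exposes ``temperature`` and ``top_k`` fields:
+
+.. code-block:: text
+
+   $ tune run generate --config ./custom_generation_config.yaml prompt="tell me a joke. " top_k=50 temperature=0.6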
-|
-
-Speeding up Generation using Quantization
------------------------------------------
+Introduce some quantization
+~~~~~~~~~~~~~~~~~~~~~~~~~~~
We rely on `torchao `_ for `post-training quantization `_.
To quantize the fine-tuned model after installing torchao, we can run the following command::
@@ -401,22 +357,20 @@ For Llama models, you can run generation directly in torchao on the quantized mo
discussed in `this readme `_. This way you can compare your own results
to those in the previously-linked table.
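+
+To make the quantization step concrete, here is a minimal programmatic sketch using torchao's ``quantize_`` API with an int8 weight-only scheme. The toy module and choice of scheme are illustrative assumptions; use the command above for this tutorial:
+
+.. code-block:: python
+
+   import torch.nn as nn
+   from torchao.quantization import int8_weight_only, quantize_
+
+   # Stand-in for the fine-tuned model; in practice you would load your checkpoint
+   model = nn.Sequential(nn.Linear(4096, 4096), nn.ReLU(), nn.Linear(4096, 4096))
+
+   # quantize_ rewrites the linear layers to use int8 weights in place
+   quantize_(model, int8_weight_only())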
-|
-
-Using torchtune checkpoints with other libraries
-------------------------------------------------
+Use your model in the wild
+--------------------------
-As we mentioned above, one of the benefits of handling of the checkpoint
-conversion is that you can directly work with standard formats. This helps
-with interoperability with other libraries since torchtune doesn't add yet
-another format to the mix.
+Let's say we're happy with how our model is performing at this point. Now we want to do something with it: productionize it for serving, publish it on the Hugging Face Hub, etc.
+As we mentioned above, one of the benefits of torchtune handling the checkpoint conversion is that you can work directly with standard formats. This helps
+with interoperability with other libraries since torchtune doesn't add yet another format to the mix.
-Let's start with huggingface
+Use with Hugging Face ``from_pretrained()``
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
-**Case 1: HF using BASE MODEL + trained adapter**
+**Case 1: Hugging Face using base model + trained adapters**
-Here we load the base model from HF model hub. Then we load the adapters on top of it using PeftModel.
-It will look for the files adapter_model.safetensors for the weights and adapter_config.json for where to insert them.
+Here we load the base model from the Hugging Face model hub. Then we load the adapters on top of it using `PeftModel `_.
+It will look for the ``adapter_model.safetensors`` file for the weights and ``adapter_config.json`` for where to insert them.
.. code-block:: python
@@ -446,9 +400,9 @@ It will look for the files adapter_model.safetensors for the weights and adapter
prompt = "tell me a joke: '"
print("Base model output:", generate_text(peft_model, tokenizer, prompt))
-**Case 2: HF using merged full+adapter weights**
+**Case 2: Hugging Face using merged weights**
-In this case, HF will check in model.safetensors.index.json which files it should load.
+In this case, Hugging Face will check ``model.safetensors.index.json`` to determine which files it should load.
.. code-block:: python
@@ -475,16 +429,20 @@ In this case, HF will check in model.safetensors.index.json which files it shoul
prompt = "Complete the sentence: 'Once upon a time...'"
print("Base model output:", generate_text(model, tokenizer, prompt))
-**Case 3: vLLM using merged full+adapter weights**
+Use with vLLM
+~~~~~~~~~~~~~
-It will load any .safetensors file. Since here we mixed both the full model weights and adapter weights, we have to delete the
+`vLLM `_ is a fast and easy-to-use library for LLM inference and serving. It includes a lot of awesome features like
+state-of-the-art serving throughput, continuous batching of incoming requests, quantization, and speculative decoding.
+
+The library will load any ``.safetensors`` files it finds. Since our output directory mixes the full model weights and the adapter weights, we have to delete the
adapter weights to successfully load it.
-.. code-block:: bash
+.. code-block:: bash
rm /tmp/torchtune/llama3_2_3B/lora_single_device/base_model/adapter_model.safetensors
-Now we can run the script
+Now we can run the following script:
.. code-block:: python
@@ -517,8 +475,8 @@ Now we can run the script
outputs = llm.chat(conversation, sampling_params=sampling_params, use_tqdm=False)
print_outputs(outputs)
-Uploading your model to the Hugging Face Hub
---------------------------------------------
+Upload your model to the Hugging Face Hub
+~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Your new model is working great and you want to share it with the world. The easiest way to do this
is to use the `huggingface_hub `_ library.
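+
+A minimal sketch of an upload, assuming you've already authenticated with ``huggingface-cli login`` and substituting your own repo id for the placeholder below:
+
+.. code-block:: python
+
+   from huggingface_hub import HfApi
+
+   api = HfApi()
+   # Create the repo if it doesn't exist yet, then upload one epoch folder
+   api.create_repo("your-username/my-llama3-2-finetune", exist_ok=True)
+   api.upload_folder(
+       folder_path="/tmp/torchtune/llama3_2_3B/lora_single_device/epoch_1",
+       repo_id="your-username/my-llama3-2-finetune",
+   )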
diff --git a/main/objects.inv b/main/objects.inv
index a4b14299ac9befdcb8570197b38ccd65e4194d3f..fcc0cfbc4af617012f7a7a0c9d03ba4f65f62880 100644
GIT binary patch
diff --git a/main/searchindex.js b/main/searchindex.js
index cf2ded471..6f2a079e3 100644
--- a/main/searchindex.js
+++ b/main/searchindex.js
@@ -1 +1 @@
-Search.setIndex({"docnames": ["api_ref_config", "api_ref_data", "api_ref_datasets", "api_ref_generation", "api_ref_models", "api_ref_modules", "api_ref_rlhf", "api_ref_training", "api_ref_utilities", "basics/chat_datasets", "basics/custom_components", "basics/datasets_overview", "basics/instruct_datasets", "basics/message_transforms", "basics/messages", "basics/model_transforms", "basics/multimodal_datasets", "basics/packing", "basics/preference_datasets", "basics/prompt_templates", "basics/text_completion_datasets", "basics/tokenizers", "deep_dives/checkpointer", "deep_dives/comet_logging", "deep_dives/configs", "deep_dives/recipe_deepdive", "deep_dives/wandb_logging", "generated/torchtune.config.instantiate", "generated/torchtune.config.log_config", "generated/torchtune.config.parse", "generated/torchtune.config.validate", "generated/torchtune.data.AlpacaToMessages", "generated/torchtune.data.ChatMLTemplate", "generated/torchtune.data.ChosenRejectedToMessages", "generated/torchtune.data.GrammarErrorCorrectionTemplate", "generated/torchtune.data.InputOutputToMessages", "generated/torchtune.data.Message", "generated/torchtune.data.OpenAIToMessages", "generated/torchtune.data.PromptTemplate", "generated/torchtune.data.PromptTemplateInterface", "generated/torchtune.data.QuestionAnswerTemplate", "generated/torchtune.data.Role", "generated/torchtune.data.ShareGPTToMessages", "generated/torchtune.data.SummarizeTemplate", "generated/torchtune.data.format_content_with_images", "generated/torchtune.data.left_pad_sequence", "generated/torchtune.data.load_image", "generated/torchtune.data.padded_collate", "generated/torchtune.data.padded_collate_dpo", "generated/torchtune.data.padded_collate_sft", "generated/torchtune.data.padded_collate_tiled_images_and_mask", "generated/torchtune.data.truncate", "generated/torchtune.data.validate_messages", "generated/torchtune.datasets.ConcatDataset", "generated/torchtune.datasets.PackedDataset", "generated/torchtune.datasets.PreferenceDataset", "generated/torchtune.datasets.SFTDataset", "generated/torchtune.datasets.TextCompletionDataset", "generated/torchtune.datasets.alpaca_cleaned_dataset", "generated/torchtune.datasets.alpaca_dataset", "generated/torchtune.datasets.chat_dataset", "generated/torchtune.datasets.cnn_dailymail_articles_dataset", "generated/torchtune.datasets.grammar_dataset", "generated/torchtune.datasets.hh_rlhf_helpful_dataset", "generated/torchtune.datasets.instruct_dataset", "generated/torchtune.datasets.multimodal.llava_instruct_dataset", "generated/torchtune.datasets.multimodal.the_cauldron_dataset", "generated/torchtune.datasets.multimodal.vqa_dataset", "generated/torchtune.datasets.preference_dataset", "generated/torchtune.datasets.samsum_dataset", "generated/torchtune.datasets.slimorca_dataset", "generated/torchtune.datasets.stack_exchange_paired_dataset", "generated/torchtune.datasets.text_completion_dataset", "generated/torchtune.datasets.wikitext_dataset", "generated/torchtune.generation.generate", "generated/torchtune.generation.generate_next_token", "generated/torchtune.generation.get_causal_mask_from_padding_mask", "generated/torchtune.generation.get_position_ids_from_padding_mask", "generated/torchtune.generation.sample", "generated/torchtune.models.clip.TilePositionalEmbedding", "generated/torchtune.models.clip.TiledTokenPositionalEmbedding", "generated/torchtune.models.clip.TokenPositionalEmbedding", "generated/torchtune.models.clip.clip_vision_encoder", "generated/torchtune.models.code_llama2.code_llama2_13b", 
"generated/torchtune.models.code_llama2.code_llama2_70b", "generated/torchtune.models.code_llama2.code_llama2_7b", "generated/torchtune.models.code_llama2.lora_code_llama2_13b", "generated/torchtune.models.code_llama2.lora_code_llama2_70b", "generated/torchtune.models.code_llama2.lora_code_llama2_7b", "generated/torchtune.models.code_llama2.qlora_code_llama2_13b", "generated/torchtune.models.code_llama2.qlora_code_llama2_70b", "generated/torchtune.models.code_llama2.qlora_code_llama2_7b", "generated/torchtune.models.gemma.gemma", "generated/torchtune.models.gemma.gemma_2b", "generated/torchtune.models.gemma.gemma_7b", "generated/torchtune.models.gemma.gemma_tokenizer", "generated/torchtune.models.gemma.lora_gemma", "generated/torchtune.models.gemma.lora_gemma_2b", "generated/torchtune.models.gemma.lora_gemma_7b", "generated/torchtune.models.gemma.qlora_gemma_2b", "generated/torchtune.models.gemma.qlora_gemma_7b", "generated/torchtune.models.gemma2.gemma2", "generated/torchtune.models.gemma2.gemma2_27b", "generated/torchtune.models.gemma2.gemma2_2b", "generated/torchtune.models.gemma2.gemma2_9b", "generated/torchtune.models.gemma2.lora_gemma2", "generated/torchtune.models.gemma2.lora_gemma2_27b", "generated/torchtune.models.gemma2.lora_gemma2_2b", "generated/torchtune.models.gemma2.lora_gemma2_9b", "generated/torchtune.models.gemma2.qlora_gemma2_27b", "generated/torchtune.models.gemma2.qlora_gemma2_2b", "generated/torchtune.models.gemma2.qlora_gemma2_9b", "generated/torchtune.models.llama2.Llama2ChatTemplate", "generated/torchtune.models.llama2.llama2", "generated/torchtune.models.llama2.llama2_13b", "generated/torchtune.models.llama2.llama2_70b", "generated/torchtune.models.llama2.llama2_7b", "generated/torchtune.models.llama2.llama2_reward_7b", "generated/torchtune.models.llama2.llama2_tokenizer", "generated/torchtune.models.llama2.lora_llama2", "generated/torchtune.models.llama2.lora_llama2_13b", "generated/torchtune.models.llama2.lora_llama2_70b", "generated/torchtune.models.llama2.lora_llama2_7b", "generated/torchtune.models.llama2.lora_llama2_reward_7b", "generated/torchtune.models.llama2.qlora_llama2_13b", "generated/torchtune.models.llama2.qlora_llama2_70b", "generated/torchtune.models.llama2.qlora_llama2_7b", "generated/torchtune.models.llama2.qlora_llama2_reward_7b", "generated/torchtune.models.llama3.llama3", "generated/torchtune.models.llama3.llama3_70b", "generated/torchtune.models.llama3.llama3_8b", "generated/torchtune.models.llama3.llama3_tokenizer", "generated/torchtune.models.llama3.lora_llama3", "generated/torchtune.models.llama3.lora_llama3_70b", "generated/torchtune.models.llama3.lora_llama3_8b", "generated/torchtune.models.llama3.qlora_llama3_70b", "generated/torchtune.models.llama3.qlora_llama3_8b", "generated/torchtune.models.llama3_1.llama3_1", "generated/torchtune.models.llama3_1.llama3_1_405b", "generated/torchtune.models.llama3_1.llama3_1_70b", "generated/torchtune.models.llama3_1.llama3_1_8b", "generated/torchtune.models.llama3_1.lora_llama3_1", "generated/torchtune.models.llama3_1.lora_llama3_1_405b", "generated/torchtune.models.llama3_1.lora_llama3_1_70b", "generated/torchtune.models.llama3_1.lora_llama3_1_8b", "generated/torchtune.models.llama3_1.qlora_llama3_1_405b", "generated/torchtune.models.llama3_1.qlora_llama3_1_70b", "generated/torchtune.models.llama3_1.qlora_llama3_1_8b", "generated/torchtune.models.llama3_2.llama3_2_1b", "generated/torchtune.models.llama3_2.llama3_2_3b", "generated/torchtune.models.llama3_2.lora_llama3_2_1b", 
"generated/torchtune.models.llama3_2.lora_llama3_2_3b", "generated/torchtune.models.llama3_2.qlora_llama3_2_1b", "generated/torchtune.models.llama3_2.qlora_llama3_2_3b", "generated/torchtune.models.llama3_2_vision.Llama3VisionEncoder", "generated/torchtune.models.llama3_2_vision.Llama3VisionProjectionHead", "generated/torchtune.models.llama3_2_vision.Llama3VisionTransform", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_11b", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_decoder", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_encoder", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_transform", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_11b", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_decoder", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_encoder", "generated/torchtune.models.llama3_2_vision.qlora_llama3_2_vision_11b", "generated/torchtune.models.llama3_3.llama3_3_70b", "generated/torchtune.models.llama3_3.lora_llama3_3_70b", "generated/torchtune.models.llama3_3.qlora_llama3_3_70b", "generated/torchtune.models.mistral.MistralChatTemplate", "generated/torchtune.models.mistral.lora_mistral", "generated/torchtune.models.mistral.lora_mistral_7b", "generated/torchtune.models.mistral.lora_mistral_classifier", "generated/torchtune.models.mistral.lora_mistral_reward_7b", "generated/torchtune.models.mistral.mistral", "generated/torchtune.models.mistral.mistral_7b", "generated/torchtune.models.mistral.mistral_classifier", "generated/torchtune.models.mistral.mistral_reward_7b", "generated/torchtune.models.mistral.mistral_tokenizer", "generated/torchtune.models.mistral.qlora_mistral_7b", "generated/torchtune.models.mistral.qlora_mistral_reward_7b", "generated/torchtune.models.phi3.lora_phi3", "generated/torchtune.models.phi3.lora_phi3_mini", "generated/torchtune.models.phi3.phi3", "generated/torchtune.models.phi3.phi3_mini", "generated/torchtune.models.phi3.phi3_mini_tokenizer", "generated/torchtune.models.phi3.qlora_phi3_mini", "generated/torchtune.models.qwen2.lora_qwen2", "generated/torchtune.models.qwen2.lora_qwen2_0_5b", "generated/torchtune.models.qwen2.lora_qwen2_1_5b", "generated/torchtune.models.qwen2.lora_qwen2_7b", "generated/torchtune.models.qwen2.qwen2", "generated/torchtune.models.qwen2.qwen2_0_5b", "generated/torchtune.models.qwen2.qwen2_1_5b", "generated/torchtune.models.qwen2.qwen2_7b", "generated/torchtune.models.qwen2.qwen2_tokenizer", "generated/torchtune.models.qwen2_5.lora_qwen2_5_0_5b", "generated/torchtune.models.qwen2_5.lora_qwen2_5_14b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_14b_instruct", "generated/torchtune.models.qwen2_5.lora_qwen2_5_1_5b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_1_5b_instruct", "generated/torchtune.models.qwen2_5.lora_qwen2_5_32b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_32b_instruct", "generated/torchtune.models.qwen2_5.lora_qwen2_5_3b", "generated/torchtune.models.qwen2_5.lora_qwen2_5_72b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_72b_instruct", "generated/torchtune.models.qwen2_5.lora_qwen2_5_7b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_7b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_0_5b", "generated/torchtune.models.qwen2_5.qwen2_5_14b_base", "generated/torchtune.models.qwen2_5.qwen2_5_14b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_1_5b_base", "generated/torchtune.models.qwen2_5.qwen2_5_1_5b_instruct", 
"generated/torchtune.models.qwen2_5.qwen2_5_32b_base", "generated/torchtune.models.qwen2_5.qwen2_5_32b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_3b", "generated/torchtune.models.qwen2_5.qwen2_5_72b_base", "generated/torchtune.models.qwen2_5.qwen2_5_72b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_7b_base", "generated/torchtune.models.qwen2_5.qwen2_5_7b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_tokenizer", "generated/torchtune.modules.FeedForward", "generated/torchtune.modules.Fp32LayerNorm", "generated/torchtune.modules.KVCache", "generated/torchtune.modules.LayerDropout", "generated/torchtune.modules.MultiHeadAttention", "generated/torchtune.modules.RMSNorm", "generated/torchtune.modules.RotaryPositionalEmbeddings", "generated/torchtune.modules.TanhGate", "generated/torchtune.modules.TiedLinear", "generated/torchtune.modules.TransformerCrossAttentionLayer", "generated/torchtune.modules.TransformerDecoder", "generated/torchtune.modules.TransformerSelfAttentionLayer", "generated/torchtune.modules.VisionTransformer", "generated/torchtune.modules.common_utils.delete_kv_caches", "generated/torchtune.modules.common_utils.disable_kv_cache", "generated/torchtune.modules.common_utils.local_kv_cache", "generated/torchtune.modules.common_utils.reparametrize_as_dtype_state_dict_post_hook", "generated/torchtune.modules.loss.CEWithChunkedOutputLoss", "generated/torchtune.modules.loss.ForwardKLLoss", "generated/torchtune.modules.loss.ForwardKLWithChunkedOutputLoss", "generated/torchtune.modules.model_fusion.DeepFusionModel", "generated/torchtune.modules.model_fusion.FusionEmbedding", "generated/torchtune.modules.model_fusion.FusionLayer", "generated/torchtune.modules.model_fusion.get_fusion_params", "generated/torchtune.modules.model_fusion.register_fusion_module", "generated/torchtune.modules.peft.AdapterModule", "generated/torchtune.modules.peft.DoRALinear", "generated/torchtune.modules.peft.LoRALinear", "generated/torchtune.modules.peft.disable_adapter", "generated/torchtune.modules.peft.get_adapter_params", "generated/torchtune.modules.peft.get_adapter_state_dict", "generated/torchtune.modules.peft.set_trainable_params", "generated/torchtune.modules.peft.validate_missing_and_unexpected_for_lora", "generated/torchtune.modules.prepare_layer_dropout", "generated/torchtune.modules.tokenizers.BaseTokenizer", "generated/torchtune.modules.tokenizers.ModelTokenizer", "generated/torchtune.modules.tokenizers.SentencePieceBaseTokenizer", "generated/torchtune.modules.tokenizers.TikTokenBaseTokenizer", "generated/torchtune.modules.tokenizers.parse_hf_tokenizer_json", "generated/torchtune.modules.tokenizers.tokenize_messages_no_special_tokens", "generated/torchtune.modules.transforms.Transform", "generated/torchtune.modules.transforms.VisionCrossAttentionMask", "generated/torchtune.rlhf.estimate_advantages", "generated/torchtune.rlhf.get_rewards_ppo", "generated/torchtune.rlhf.loss.DPOLoss", "generated/torchtune.rlhf.loss.PPOLoss", "generated/torchtune.rlhf.loss.RSOLoss", "generated/torchtune.rlhf.loss.SimPOLoss", "generated/torchtune.rlhf.truncate_sequence_at_first_stop_token", "generated/torchtune.training.FormattedCheckpointFiles", "generated/torchtune.training.FullModelHFCheckpointer", "generated/torchtune.training.FullModelMetaCheckpointer", "generated/torchtune.training.FullModelTorchTuneCheckpointer", "generated/torchtune.training.ModelType", "generated/torchtune.training.OptimizerInBackwardWrapper", 
"generated/torchtune.training.apply_selective_activation_checkpointing", "generated/torchtune.training.create_optim_in_bwd_wrapper", "generated/torchtune.training.gather_cpu_state_dict", "generated/torchtune.training.get_cosine_schedule_with_warmup", "generated/torchtune.training.get_dtype", "generated/torchtune.training.get_lr", "generated/torchtune.training.get_memory_stats", "generated/torchtune.training.get_quantizer_mode", "generated/torchtune.training.get_unmasked_sequence_lengths", "generated/torchtune.training.init_distributed", "generated/torchtune.training.is_distributed", "generated/torchtune.training.log_memory_stats", "generated/torchtune.training.metric_logging.CometLogger", "generated/torchtune.training.metric_logging.DiskLogger", "generated/torchtune.training.metric_logging.StdoutLogger", "generated/torchtune.training.metric_logging.TensorBoardLogger", "generated/torchtune.training.metric_logging.WandBLogger", "generated/torchtune.training.register_optim_in_bwd_hooks", "generated/torchtune.training.set_activation_checkpointing", "generated/torchtune.training.set_default_dtype", "generated/torchtune.training.set_seed", "generated/torchtune.training.setup_torch_profiler", "generated/torchtune.training.update_state_dict_for_classifier", "generated/torchtune.training.validate_expected_param_dtype", "generated/torchtune.utils.batch_to_device", "generated/torchtune.utils.get_device", "generated/torchtune.utils.get_logger", "generated/torchtune.utils.get_world_size_and_rank", "generated/torchtune.utils.torch_version_ge", "generated_examples/index", "generated_examples/sg_execution_times", "index", "install", "overview", "recipes/dpo", "recipes/lora_finetune_single_device", "recipes/qat_distributed", "recipes/recipes_overview", "sg_execution_times", "tune_cli", "tutorials/chat", "tutorials/e2e_flow", "tutorials/first_finetune_tutorial", "tutorials/llama3", "tutorials/llama_kd_tutorial", "tutorials/lora_finetune", "tutorials/memory_optimizations", "tutorials/qat_finetune", "tutorials/qlora_finetune"], "filenames": ["api_ref_config.rst", "api_ref_data.rst", "api_ref_datasets.rst", "api_ref_generation.rst", "api_ref_models.rst", "api_ref_modules.rst", "api_ref_rlhf.rst", "api_ref_training.rst", "api_ref_utilities.rst", "basics/chat_datasets.rst", "basics/custom_components.rst", "basics/datasets_overview.rst", "basics/instruct_datasets.rst", "basics/message_transforms.rst", "basics/messages.rst", "basics/model_transforms.rst", "basics/multimodal_datasets.rst", "basics/packing.rst", "basics/preference_datasets.rst", "basics/prompt_templates.rst", "basics/text_completion_datasets.rst", "basics/tokenizers.rst", "deep_dives/checkpointer.rst", "deep_dives/comet_logging.rst", "deep_dives/configs.rst", "deep_dives/recipe_deepdive.rst", "deep_dives/wandb_logging.rst", "generated/torchtune.config.instantiate.rst", "generated/torchtune.config.log_config.rst", "generated/torchtune.config.parse.rst", "generated/torchtune.config.validate.rst", "generated/torchtune.data.AlpacaToMessages.rst", "generated/torchtune.data.ChatMLTemplate.rst", "generated/torchtune.data.ChosenRejectedToMessages.rst", "generated/torchtune.data.GrammarErrorCorrectionTemplate.rst", "generated/torchtune.data.InputOutputToMessages.rst", "generated/torchtune.data.Message.rst", "generated/torchtune.data.OpenAIToMessages.rst", "generated/torchtune.data.PromptTemplate.rst", "generated/torchtune.data.PromptTemplateInterface.rst", "generated/torchtune.data.QuestionAnswerTemplate.rst", "generated/torchtune.data.Role.rst", 
"generated/torchtune.data.ShareGPTToMessages.rst", "generated/torchtune.data.SummarizeTemplate.rst", "generated/torchtune.data.format_content_with_images.rst", "generated/torchtune.data.left_pad_sequence.rst", "generated/torchtune.data.load_image.rst", "generated/torchtune.data.padded_collate.rst", "generated/torchtune.data.padded_collate_dpo.rst", "generated/torchtune.data.padded_collate_sft.rst", "generated/torchtune.data.padded_collate_tiled_images_and_mask.rst", "generated/torchtune.data.truncate.rst", "generated/torchtune.data.validate_messages.rst", "generated/torchtune.datasets.ConcatDataset.rst", "generated/torchtune.datasets.PackedDataset.rst", "generated/torchtune.datasets.PreferenceDataset.rst", "generated/torchtune.datasets.SFTDataset.rst", "generated/torchtune.datasets.TextCompletionDataset.rst", "generated/torchtune.datasets.alpaca_cleaned_dataset.rst", "generated/torchtune.datasets.alpaca_dataset.rst", "generated/torchtune.datasets.chat_dataset.rst", "generated/torchtune.datasets.cnn_dailymail_articles_dataset.rst", "generated/torchtune.datasets.grammar_dataset.rst", "generated/torchtune.datasets.hh_rlhf_helpful_dataset.rst", "generated/torchtune.datasets.instruct_dataset.rst", "generated/torchtune.datasets.multimodal.llava_instruct_dataset.rst", "generated/torchtune.datasets.multimodal.the_cauldron_dataset.rst", "generated/torchtune.datasets.multimodal.vqa_dataset.rst", "generated/torchtune.datasets.preference_dataset.rst", "generated/torchtune.datasets.samsum_dataset.rst", "generated/torchtune.datasets.slimorca_dataset.rst", "generated/torchtune.datasets.stack_exchange_paired_dataset.rst", "generated/torchtune.datasets.text_completion_dataset.rst", "generated/torchtune.datasets.wikitext_dataset.rst", "generated/torchtune.generation.generate.rst", "generated/torchtune.generation.generate_next_token.rst", "generated/torchtune.generation.get_causal_mask_from_padding_mask.rst", "generated/torchtune.generation.get_position_ids_from_padding_mask.rst", "generated/torchtune.generation.sample.rst", "generated/torchtune.models.clip.TilePositionalEmbedding.rst", "generated/torchtune.models.clip.TiledTokenPositionalEmbedding.rst", "generated/torchtune.models.clip.TokenPositionalEmbedding.rst", "generated/torchtune.models.clip.clip_vision_encoder.rst", "generated/torchtune.models.code_llama2.code_llama2_13b.rst", "generated/torchtune.models.code_llama2.code_llama2_70b.rst", "generated/torchtune.models.code_llama2.code_llama2_7b.rst", "generated/torchtune.models.code_llama2.lora_code_llama2_13b.rst", "generated/torchtune.models.code_llama2.lora_code_llama2_70b.rst", "generated/torchtune.models.code_llama2.lora_code_llama2_7b.rst", "generated/torchtune.models.code_llama2.qlora_code_llama2_13b.rst", "generated/torchtune.models.code_llama2.qlora_code_llama2_70b.rst", "generated/torchtune.models.code_llama2.qlora_code_llama2_7b.rst", "generated/torchtune.models.gemma.gemma.rst", "generated/torchtune.models.gemma.gemma_2b.rst", "generated/torchtune.models.gemma.gemma_7b.rst", "generated/torchtune.models.gemma.gemma_tokenizer.rst", "generated/torchtune.models.gemma.lora_gemma.rst", "generated/torchtune.models.gemma.lora_gemma_2b.rst", "generated/torchtune.models.gemma.lora_gemma_7b.rst", "generated/torchtune.models.gemma.qlora_gemma_2b.rst", "generated/torchtune.models.gemma.qlora_gemma_7b.rst", "generated/torchtune.models.gemma2.gemma2.rst", "generated/torchtune.models.gemma2.gemma2_27b.rst", "generated/torchtune.models.gemma2.gemma2_2b.rst", 
"generated/torchtune.models.gemma2.gemma2_9b.rst", "generated/torchtune.models.gemma2.lora_gemma2.rst", "generated/torchtune.models.gemma2.lora_gemma2_27b.rst", "generated/torchtune.models.gemma2.lora_gemma2_2b.rst", "generated/torchtune.models.gemma2.lora_gemma2_9b.rst", "generated/torchtune.models.gemma2.qlora_gemma2_27b.rst", "generated/torchtune.models.gemma2.qlora_gemma2_2b.rst", "generated/torchtune.models.gemma2.qlora_gemma2_9b.rst", "generated/torchtune.models.llama2.Llama2ChatTemplate.rst", "generated/torchtune.models.llama2.llama2.rst", "generated/torchtune.models.llama2.llama2_13b.rst", "generated/torchtune.models.llama2.llama2_70b.rst", "generated/torchtune.models.llama2.llama2_7b.rst", "generated/torchtune.models.llama2.llama2_reward_7b.rst", "generated/torchtune.models.llama2.llama2_tokenizer.rst", "generated/torchtune.models.llama2.lora_llama2.rst", "generated/torchtune.models.llama2.lora_llama2_13b.rst", "generated/torchtune.models.llama2.lora_llama2_70b.rst", "generated/torchtune.models.llama2.lora_llama2_7b.rst", "generated/torchtune.models.llama2.lora_llama2_reward_7b.rst", "generated/torchtune.models.llama2.qlora_llama2_13b.rst", "generated/torchtune.models.llama2.qlora_llama2_70b.rst", "generated/torchtune.models.llama2.qlora_llama2_7b.rst", "generated/torchtune.models.llama2.qlora_llama2_reward_7b.rst", "generated/torchtune.models.llama3.llama3.rst", "generated/torchtune.models.llama3.llama3_70b.rst", "generated/torchtune.models.llama3.llama3_8b.rst", "generated/torchtune.models.llama3.llama3_tokenizer.rst", "generated/torchtune.models.llama3.lora_llama3.rst", "generated/torchtune.models.llama3.lora_llama3_70b.rst", "generated/torchtune.models.llama3.lora_llama3_8b.rst", "generated/torchtune.models.llama3.qlora_llama3_70b.rst", "generated/torchtune.models.llama3.qlora_llama3_8b.rst", "generated/torchtune.models.llama3_1.llama3_1.rst", "generated/torchtune.models.llama3_1.llama3_1_405b.rst", "generated/torchtune.models.llama3_1.llama3_1_70b.rst", "generated/torchtune.models.llama3_1.llama3_1_8b.rst", "generated/torchtune.models.llama3_1.lora_llama3_1.rst", "generated/torchtune.models.llama3_1.lora_llama3_1_405b.rst", "generated/torchtune.models.llama3_1.lora_llama3_1_70b.rst", "generated/torchtune.models.llama3_1.lora_llama3_1_8b.rst", "generated/torchtune.models.llama3_1.qlora_llama3_1_405b.rst", "generated/torchtune.models.llama3_1.qlora_llama3_1_70b.rst", "generated/torchtune.models.llama3_1.qlora_llama3_1_8b.rst", "generated/torchtune.models.llama3_2.llama3_2_1b.rst", "generated/torchtune.models.llama3_2.llama3_2_3b.rst", "generated/torchtune.models.llama3_2.lora_llama3_2_1b.rst", "generated/torchtune.models.llama3_2.lora_llama3_2_3b.rst", "generated/torchtune.models.llama3_2.qlora_llama3_2_1b.rst", "generated/torchtune.models.llama3_2.qlora_llama3_2_3b.rst", "generated/torchtune.models.llama3_2_vision.Llama3VisionEncoder.rst", "generated/torchtune.models.llama3_2_vision.Llama3VisionProjectionHead.rst", "generated/torchtune.models.llama3_2_vision.Llama3VisionTransform.rst", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_11b.rst", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_decoder.rst", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_encoder.rst", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_transform.rst", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_11b.rst", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_decoder.rst", 
"generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_encoder.rst", "generated/torchtune.models.llama3_2_vision.qlora_llama3_2_vision_11b.rst", "generated/torchtune.models.llama3_3.llama3_3_70b.rst", "generated/torchtune.models.llama3_3.lora_llama3_3_70b.rst", "generated/torchtune.models.llama3_3.qlora_llama3_3_70b.rst", "generated/torchtune.models.mistral.MistralChatTemplate.rst", "generated/torchtune.models.mistral.lora_mistral.rst", "generated/torchtune.models.mistral.lora_mistral_7b.rst", "generated/torchtune.models.mistral.lora_mistral_classifier.rst", "generated/torchtune.models.mistral.lora_mistral_reward_7b.rst", "generated/torchtune.models.mistral.mistral.rst", "generated/torchtune.models.mistral.mistral_7b.rst", "generated/torchtune.models.mistral.mistral_classifier.rst", "generated/torchtune.models.mistral.mistral_reward_7b.rst", "generated/torchtune.models.mistral.mistral_tokenizer.rst", "generated/torchtune.models.mistral.qlora_mistral_7b.rst", "generated/torchtune.models.mistral.qlora_mistral_reward_7b.rst", "generated/torchtune.models.phi3.lora_phi3.rst", "generated/torchtune.models.phi3.lora_phi3_mini.rst", "generated/torchtune.models.phi3.phi3.rst", "generated/torchtune.models.phi3.phi3_mini.rst", "generated/torchtune.models.phi3.phi3_mini_tokenizer.rst", "generated/torchtune.models.phi3.qlora_phi3_mini.rst", "generated/torchtune.models.qwen2.lora_qwen2.rst", "generated/torchtune.models.qwen2.lora_qwen2_0_5b.rst", "generated/torchtune.models.qwen2.lora_qwen2_1_5b.rst", "generated/torchtune.models.qwen2.lora_qwen2_7b.rst", "generated/torchtune.models.qwen2.qwen2.rst", "generated/torchtune.models.qwen2.qwen2_0_5b.rst", "generated/torchtune.models.qwen2.qwen2_1_5b.rst", "generated/torchtune.models.qwen2.qwen2_7b.rst", "generated/torchtune.models.qwen2.qwen2_tokenizer.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_0_5b.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_14b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_14b_instruct.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_1_5b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_1_5b_instruct.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_32b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_32b_instruct.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_3b.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_72b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_72b_instruct.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_7b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_7b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_0_5b.rst", "generated/torchtune.models.qwen2_5.qwen2_5_14b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_14b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_1_5b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_1_5b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_32b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_32b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_3b.rst", "generated/torchtune.models.qwen2_5.qwen2_5_72b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_72b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_7b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_7b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_tokenizer.rst", "generated/torchtune.modules.FeedForward.rst", "generated/torchtune.modules.Fp32LayerNorm.rst", "generated/torchtune.modules.KVCache.rst", 
"generated/torchtune.modules.LayerDropout.rst", "generated/torchtune.modules.MultiHeadAttention.rst", "generated/torchtune.modules.RMSNorm.rst", "generated/torchtune.modules.RotaryPositionalEmbeddings.rst", "generated/torchtune.modules.TanhGate.rst", "generated/torchtune.modules.TiedLinear.rst", "generated/torchtune.modules.TransformerCrossAttentionLayer.rst", "generated/torchtune.modules.TransformerDecoder.rst", "generated/torchtune.modules.TransformerSelfAttentionLayer.rst", "generated/torchtune.modules.VisionTransformer.rst", "generated/torchtune.modules.common_utils.delete_kv_caches.rst", "generated/torchtune.modules.common_utils.disable_kv_cache.rst", "generated/torchtune.modules.common_utils.local_kv_cache.rst", "generated/torchtune.modules.common_utils.reparametrize_as_dtype_state_dict_post_hook.rst", "generated/torchtune.modules.loss.CEWithChunkedOutputLoss.rst", "generated/torchtune.modules.loss.ForwardKLLoss.rst", "generated/torchtune.modules.loss.ForwardKLWithChunkedOutputLoss.rst", "generated/torchtune.modules.model_fusion.DeepFusionModel.rst", "generated/torchtune.modules.model_fusion.FusionEmbedding.rst", "generated/torchtune.modules.model_fusion.FusionLayer.rst", "generated/torchtune.modules.model_fusion.get_fusion_params.rst", "generated/torchtune.modules.model_fusion.register_fusion_module.rst", "generated/torchtune.modules.peft.AdapterModule.rst", "generated/torchtune.modules.peft.DoRALinear.rst", "generated/torchtune.modules.peft.LoRALinear.rst", "generated/torchtune.modules.peft.disable_adapter.rst", "generated/torchtune.modules.peft.get_adapter_params.rst", "generated/torchtune.modules.peft.get_adapter_state_dict.rst", "generated/torchtune.modules.peft.set_trainable_params.rst", "generated/torchtune.modules.peft.validate_missing_and_unexpected_for_lora.rst", "generated/torchtune.modules.prepare_layer_dropout.rst", "generated/torchtune.modules.tokenizers.BaseTokenizer.rst", "generated/torchtune.modules.tokenizers.ModelTokenizer.rst", "generated/torchtune.modules.tokenizers.SentencePieceBaseTokenizer.rst", "generated/torchtune.modules.tokenizers.TikTokenBaseTokenizer.rst", "generated/torchtune.modules.tokenizers.parse_hf_tokenizer_json.rst", "generated/torchtune.modules.tokenizers.tokenize_messages_no_special_tokens.rst", "generated/torchtune.modules.transforms.Transform.rst", "generated/torchtune.modules.transforms.VisionCrossAttentionMask.rst", "generated/torchtune.rlhf.estimate_advantages.rst", "generated/torchtune.rlhf.get_rewards_ppo.rst", "generated/torchtune.rlhf.loss.DPOLoss.rst", "generated/torchtune.rlhf.loss.PPOLoss.rst", "generated/torchtune.rlhf.loss.RSOLoss.rst", "generated/torchtune.rlhf.loss.SimPOLoss.rst", "generated/torchtune.rlhf.truncate_sequence_at_first_stop_token.rst", "generated/torchtune.training.FormattedCheckpointFiles.rst", "generated/torchtune.training.FullModelHFCheckpointer.rst", "generated/torchtune.training.FullModelMetaCheckpointer.rst", "generated/torchtune.training.FullModelTorchTuneCheckpointer.rst", "generated/torchtune.training.ModelType.rst", "generated/torchtune.training.OptimizerInBackwardWrapper.rst", "generated/torchtune.training.apply_selective_activation_checkpointing.rst", "generated/torchtune.training.create_optim_in_bwd_wrapper.rst", "generated/torchtune.training.gather_cpu_state_dict.rst", "generated/torchtune.training.get_cosine_schedule_with_warmup.rst", "generated/torchtune.training.get_dtype.rst", "generated/torchtune.training.get_lr.rst", "generated/torchtune.training.get_memory_stats.rst", 
"generated/torchtune.training.get_quantizer_mode.rst", "generated/torchtune.training.get_unmasked_sequence_lengths.rst", "generated/torchtune.training.init_distributed.rst", "generated/torchtune.training.is_distributed.rst", "generated/torchtune.training.log_memory_stats.rst", "generated/torchtune.training.metric_logging.CometLogger.rst", "generated/torchtune.training.metric_logging.DiskLogger.rst", "generated/torchtune.training.metric_logging.StdoutLogger.rst", "generated/torchtune.training.metric_logging.TensorBoardLogger.rst", "generated/torchtune.training.metric_logging.WandBLogger.rst", "generated/torchtune.training.register_optim_in_bwd_hooks.rst", "generated/torchtune.training.set_activation_checkpointing.rst", "generated/torchtune.training.set_default_dtype.rst", "generated/torchtune.training.set_seed.rst", "generated/torchtune.training.setup_torch_profiler.rst", "generated/torchtune.training.update_state_dict_for_classifier.rst", "generated/torchtune.training.validate_expected_param_dtype.rst", "generated/torchtune.utils.batch_to_device.rst", "generated/torchtune.utils.get_device.rst", "generated/torchtune.utils.get_logger.rst", "generated/torchtune.utils.get_world_size_and_rank.rst", "generated/torchtune.utils.torch_version_ge.rst", "generated_examples/index.rst", "generated_examples/sg_execution_times.rst", "index.rst", "install.rst", "overview.rst", "recipes/dpo.rst", "recipes/lora_finetune_single_device.rst", "recipes/qat_distributed.rst", "recipes/recipes_overview.rst", "sg_execution_times.rst", "tune_cli.rst", "tutorials/chat.rst", "tutorials/e2e_flow.rst", "tutorials/first_finetune_tutorial.rst", "tutorials/llama3.rst", "tutorials/llama_kd_tutorial.rst", "tutorials/lora_finetune.rst", "tutorials/memory_optimizations.rst", "tutorials/qat_finetune.rst", "tutorials/qlora_finetune.rst"], "titles": ["torchtune.config", "torchtune.data", "torchtune.datasets", "torchtune.generation", "torchtune.models", "torchtune.modules", "torchtune.rlhf", "torchtune.training", "torchtune.utils", "Chat Datasets", "Custom Components and Recipes", "Datasets Overview", "Instruct Datasets", "Message Transforms", "Messages", "Multimodal Transforms", "Multimodal Datasets", "Sample packing", "Preference Datasets", "Prompt Templates", "Text-completion Datasets", "Tokenizers", "Checkpointing in torchtune", "Logging to Comet", "All About Configs", "What Are Recipes?", "Logging to Weights & Biases", "instantiate", "log_config", "parse", "validate", "AlpacaToMessages", "ChatMLTemplate", "ChosenRejectedToMessages", "torchtune.data.GrammarErrorCorrectionTemplate", "InputOutputToMessages", "Message", "OpenAIToMessages", "PromptTemplate", "PromptTemplateInterface", "torchtune.data.QuestionAnswerTemplate", "torchtune.data.Role", "ShareGPTToMessages", "torchtune.data.SummarizeTemplate", "format_content_with_images", "left_pad_sequence", "load_image", "padded_collate", "padded_collate_dpo", "padded_collate_sft", "padded_collate_tiled_images_and_mask", "truncate", "validate_messages", "ConcatDataset", "PackedDataset", "PreferenceDataset", "SFTDataset", "TextCompletionDataset", "alpaca_cleaned_dataset", "alpaca_dataset", "chat_dataset", "cnn_dailymail_articles_dataset", "grammar_dataset", "hh_rlhf_helpful_dataset", "instruct_dataset", "llava_instruct_dataset", "the_cauldron_dataset", "vqa_dataset", "preference_dataset", "samsum_dataset", "slimorca_dataset", "stack_exchange_paired_dataset", "text_completion_dataset", "wikitext_dataset", "generate", "generate_next_token", "get_causal_mask_from_padding_mask", 
"get_position_ids_from_padding_mask", "sample", "TilePositionalEmbedding", "TiledTokenPositionalEmbedding", "TokenPositionalEmbedding", "clip_vision_encoder", "code_llama2_13b", "code_llama2_70b", "code_llama2_7b", "lora_code_llama2_13b", "lora_code_llama2_70b", "lora_code_llama2_7b", "qlora_code_llama2_13b", "qlora_code_llama2_70b", "qlora_code_llama2_7b", "gemma", "gemma_2b", "gemma_7b", "gemma_tokenizer", "lora_gemma", "lora_gemma_2b", "lora_gemma_7b", "qlora_gemma_2b", "qlora_gemma_7b", "gemma2", "gemma2_27b", "gemma2_2b", "gemma2_9b", "lora_gemma2", "lora_gemma2_27b", "lora_gemma2_2b", "lora_gemma2_9b", "qlora_gemma2_27b", "qlora_gemma2_2b", "qlora_gemma2_9b", "Llama2ChatTemplate", "llama2", "llama2_13b", "llama2_70b", "llama2_7b", "llama2_reward_7b", "llama2_tokenizer", "lora_llama2", "lora_llama2_13b", "lora_llama2_70b", "lora_llama2_7b", "lora_llama2_reward_7b", "qlora_llama2_13b", "qlora_llama2_70b", "qlora_llama2_7b", "qlora_llama2_reward_7b", "llama3", "llama3_70b", "llama3_8b", "llama3_tokenizer", "lora_llama3", "lora_llama3_70b", "lora_llama3_8b", "qlora_llama3_70b", "qlora_llama3_8b", "llama3_1", "llama3_1_405b", "llama3_1_70b", "llama3_1_8b", "lora_llama3_1", "lora_llama3_1_405b", "lora_llama3_1_70b", "lora_llama3_1_8b", "qlora_llama3_1_405b", "qlora_llama3_1_70b", "qlora_llama3_1_8b", "llama3_2_1b", "llama3_2_3b", "lora_llama3_2_1b", "lora_llama3_2_3b", "qlora_llama3_2_1b", "qlora_llama3_2_3b", "Llama3VisionEncoder", "Llama3VisionProjectionHead", "Llama3VisionTransform", "llama3_2_vision_11b", "llama3_2_vision_decoder", "llama3_2_vision_encoder", "llama3_2_vision_transform", "lora_llama3_2_vision_11b", "lora_llama3_2_vision_decoder", "lora_llama3_2_vision_encoder", "qlora_llama3_2_vision_11b", "llama3_3_70b", "lora_llama3_3_70b", "qlora_llama3_3_70b", "MistralChatTemplate", "lora_mistral", "lora_mistral_7b", "lora_mistral_classifier", "lora_mistral_reward_7b", "mistral", "mistral_7b", "mistral_classifier", "mistral_reward_7b", "mistral_tokenizer", "qlora_mistral_7b", "qlora_mistral_reward_7b", "lora_phi3", "lora_phi3_mini", "phi3", "phi3_mini", "phi3_mini_tokenizer", "qlora_phi3_mini", "lora_qwen2", "lora_qwen2_0_5b", "lora_qwen2_1_5b", "lora_qwen2_7b", "qwen2", "qwen2_0_5b", "qwen2_1_5b", "qwen2_7b", "qwen2_tokenizer", "lora_qwen2_5_0_5b", "lora_qwen2_5_14b_base", "lora_qwen2_5_14b_instruct", "lora_qwen2_5_1_5b_base", "lora_qwen2_5_1_5b_instruct", "lora_qwen2_5_32b_base", "lora_qwen2_5_32b_instruct", "lora_qwen2_5_3b", "lora_qwen2_5_72b_base", "lora_qwen2_5_72b_instruct", "lora_qwen2_5_7b_base", "lora_qwen2_5_7b_instruct", "qwen2_5_0_5b", "qwen2_5_14b_base", "qwen2_5_14b_instruct", "qwen2_5_1_5b_base", "qwen2_5_1_5b_instruct", "qwen2_5_32b_base", "qwen2_5_32b_instruct", "qwen2_5_3b", "qwen2_5_72b_base", "qwen2_5_72b_instruct", "qwen2_5_7b_base", "qwen2_5_7b_instruct", "qwen2_5_tokenizer", "FeedForward", "Fp32LayerNorm", "KVCache", "LayerDropout", "MultiHeadAttention", "RMSNorm", "RotaryPositionalEmbeddings", "TanhGate", "TiedLinear", "TransformerCrossAttentionLayer", "TransformerDecoder", "TransformerSelfAttentionLayer", "VisionTransformer", "delete_kv_caches", "disable_kv_cache", "local_kv_cache", "reparametrize_as_dtype_state_dict_post_hook", "CEWithChunkedOutputLoss", "ForwardKLLoss", "ForwardKLWithChunkedOutputLoss", "DeepFusionModel", "FusionEmbedding", "FusionLayer", "get_fusion_params", "register_fusion_module", "AdapterModule", "DoRALinear", "LoRALinear", "disable_adapter", "get_adapter_params", "get_adapter_state_dict", "set_trainable_params", 
"validate_missing_and_unexpected_for_lora", "prepare_layer_dropout", "BaseTokenizer", "ModelTokenizer", "SentencePieceBaseTokenizer", "TikTokenBaseTokenizer", "parse_hf_tokenizer_json", "tokenize_messages_no_special_tokens", "Transform", "VisionCrossAttentionMask", "estimate_advantages", "get_rewards_ppo", "DPOLoss", "PPOLoss", "RSOLoss", "torchtune.rlhf.loss.SimPOLoss", "truncate_sequence_at_first_stop_token", "FormattedCheckpointFiles", "FullModelHFCheckpointer", "FullModelMetaCheckpointer", "FullModelTorchTuneCheckpointer", "ModelType", "OptimizerInBackwardWrapper", "apply_selective_activation_checkpointing", "create_optim_in_bwd_wrapper", "gather_cpu_state_dict", "get_cosine_schedule_with_warmup", "get_dtype", "get_lr", "get_memory_stats", "get_quantizer_mode", "get_unmasked_sequence_lengths", "init_distributed", "is_distributed", "log_memory_stats", "CometLogger", "DiskLogger", "StdoutLogger", "TensorBoardLogger", "WandBLogger", "register_optim_in_bwd_hooks", "set_activation_checkpointing", "set_default_dtype", "set_seed", "setup_torch_profiler", "update_state_dict_for_classifier", "validate_expected_param_dtype", "batch_to_device", "get_device", "get_logger", "get_world_size_and_rank", "torch_version_ge", "<no title>", "Computation times", "Welcome to the torchtune Documentation", "Install Instructions", "torchtune Overview", "Direct Preference Optimization", "LoRA Single Device Finetuning", "Distributed Quantization-Aware Training (QAT)", "Recipes Overview", "Computation times", "torchtune CLI", "Fine-Tuning Llama3 with Chat Data", "End-to-End Workflow with torchtune", "Fine-Tune Your First LLM", "Meta Llama3 in torchtune", "Distilling Llama3.1 8B into Llama3.2 1B using Knowledge Distillation", "Fine-Tuning Llama2 with LoRA", "Memory Optimization Overview", "Fine-Tuning Llama3 with QAT", "Fine-Tuning Llama2 with QLoRA"], "terms": {"instruct": [1, 2, 4, 9, 10, 11, 13, 15, 16, 17, 18, 19, 21, 22, 31, 32, 33, 35, 37, 42, 54, 56, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 72, 157, 160, 161, 168, 176, 182, 183, 184, 191, 192, 193, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 306, 309, 310, 311, 314, 315, 317, 319, 320, 322, 323], "prompt": [1, 9, 10, 11, 12, 13, 18, 31, 33, 34, 35, 36, 37, 38, 39, 40, 42, 43, 55, 56, 59, 60, 62, 63, 64, 67, 68, 69, 70, 71, 74, 75, 95, 112, 118, 131, 156, 160, 168, 177, 184, 194, 219, 230, 240, 259, 316, 318], "chat": [1, 2, 11, 13, 16, 18, 32, 37, 42, 56, 60, 112, 184, 310, 316], "includ": [1, 9, 11, 12, 16, 18, 19, 21, 22, 24, 25, 38, 39, 56, 67, 78, 82, 92, 101, 113, 128, 137, 158, 159, 160, 162, 163, 173, 184, 190, 230, 246, 247, 254, 270, 271, 308, 309, 312, 314, 315, 316, 317, 318, 319, 320, 323], "some": [1, 17, 18, 20, 21, 22, 24, 32, 171, 241, 243, 249, 251, 306, 308, 309, 310, 311, 314, 315, 316, 317, 319, 320, 321, 322, 323], "specif": [1, 5, 11, 12, 15, 19, 21, 24, 25, 27, 55, 56, 65, 66, 67, 156, 255, 311, 315, 316, 321, 322, 323], "format": [1, 2, 7, 11, 19, 21, 36, 45, 46, 55, 56, 59, 60, 63, 64, 67, 68, 112, 156, 168, 255, 269, 270, 271, 272, 273, 314, 315, 316, 317, 318, 320, 321], "differ": [1, 9, 10, 17, 18, 19, 21, 24, 26, 48, 53, 60, 64, 67, 79, 80, 81, 156, 196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 228, 232, 246, 256, 264, 273, 298, 308, 309, 310, 311, 314, 315, 316, 318, 319, 320, 321, 322, 323], "dataset": [1, 10, 13, 14, 15, 17, 19, 24, 31, 33, 35, 36, 37, 42, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 
63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 264, 308, 309, 317, 318, 319, 322], "model": [1, 2, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 22, 23, 24, 25, 27, 31, 32, 33, 35, 36, 37, 42, 53, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 222, 224, 226, 228, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 258, 259, 260, 262, 263, 264, 265, 266, 267, 270, 271, 272, 273, 275, 276, 281, 286, 287, 292, 293, 297, 306, 308, 309, 310, 311, 315, 323], "convert": [1, 9, 11, 14, 21, 22, 33, 35, 37, 42, 49, 55, 56, 60, 65, 66, 68, 76, 154, 270, 277, 322, 323], "from": [1, 2, 4, 10, 11, 13, 14, 15, 17, 19, 23, 24, 25, 26, 27, 31, 33, 36, 37, 42, 45, 46, 47, 50, 53, 54, 55, 56, 57, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 77, 78, 79, 80, 81, 82, 83, 84, 85, 93, 94, 102, 103, 104, 112, 114, 115, 116, 117, 131, 155, 156, 160, 174, 176, 184, 191, 192, 193, 194, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 224, 229, 230, 231, 232, 233, 234, 235, 237, 238, 239, 242, 243, 244, 245, 246, 247, 249, 250, 253, 256, 258, 261, 264, 266, 267, 269, 270, 271, 272, 274, 276, 278, 287, 290, 291, 292, 297, 298, 305, 307, 309, 311, 313, 314, 316, 317, 318, 319, 320, 321, 322], "common": [1, 2, 5, 9, 14, 15, 24, 240, 259, 309, 314, 315, 318, 320, 321, 322], "schema": [1, 9, 11, 12, 16], "convers": [1, 13, 16, 18, 19, 21, 22, 33, 42, 52, 55, 56, 60, 65, 68, 70, 270, 272, 273, 308, 315, 316, 320, 321, 323], "json": [1, 9, 12, 13, 16, 18, 21, 22, 37, 42, 55, 56, 57, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 131, 156, 160, 184, 194, 219, 258, 270, 314, 315, 316, 322], "list": [1, 9, 11, 14, 15, 18, 19, 21, 22, 24, 33, 36, 38, 44, 45, 47, 48, 49, 50, 51, 52, 53, 55, 56, 60, 61, 65, 66, 68, 73, 74, 82, 86, 87, 88, 89, 90, 91, 95, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 131, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 155, 156, 159, 160, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 177, 178, 179, 180, 181, 184, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 230, 232, 237, 239, 240, 241, 242, 245, 246, 247, 252, 253, 254, 255, 256, 257, 259, 261, 269, 270, 271, 272, 287, 298, 301, 312, 315, 316, 317, 318, 321, 322], "us": [1, 2, 4, 5, 9, 10, 11, 12, 13, 14, 16, 17, 18, 20, 21, 22, 23, 26, 27, 29, 32, 35, 36, 38, 44, 47, 50, 53, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78, 80, 81, 82, 112, 113, 119, 128, 131, 132, 137, 141, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 180, 184, 186, 190, 194, 219, 220, 222, 224, 226, 228, 230, 231, 232, 233, 234, 235, 236, 237, 240, 241, 244, 246, 247, 248, 252, 
256, 257, 261, 262, 263, 264, 265, 267, 270, 271, 272, 273, 274, 277, 279, 281, 287, 288, 289, 290, 291, 295, 297, 299, 300, 306, 307, 308, 309, 310, 311, 312, 314, 317, 318, 320, 321, 322], "collect": [1, 24, 317], "sampl": [1, 9, 11, 12, 13, 14, 15, 16, 19, 20, 21, 23, 26, 33, 35, 36, 37, 42, 44, 50, 54, 55, 56, 57, 62, 63, 65, 66, 67, 68, 69, 70, 72, 74, 75, 224, 226, 230, 231, 232, 240, 260, 261, 266, 309, 315, 316, 321], "batch": [1, 11, 17, 25, 47, 48, 49, 50, 54, 59, 62, 65, 66, 69, 80, 154, 155, 222, 223, 224, 226, 229, 230, 231, 232, 235, 240, 242, 262, 263, 264, 266, 283, 296, 299, 308, 317, 318, 320, 321], "handl": [1, 13, 16, 17, 24, 29, 31, 53, 56, 156, 240, 256, 257, 315, 316, 320, 323], "ani": [1, 5, 10, 11, 13, 14, 15, 16, 17, 21, 22, 24, 25, 27, 29, 30, 33, 36, 37, 38, 42, 44, 47, 50, 51, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 77, 81, 221, 230, 236, 240, 242, 243, 249, 250, 251, 252, 254, 255, 256, 259, 270, 271, 272, 274, 277, 284, 287, 295, 298, 309, 314, 315, 316, 317, 320, 321, 322], "pad": [1, 45, 47, 48, 49, 50, 54, 74, 76, 77, 230, 232, 263, 265, 268, 283], "miscellan": 1, "modifi": [1, 10, 21, 22, 24, 25, 26, 234, 236, 246, 274, 308, 316, 318, 319, 320, 321, 322, 323], "For": [2, 7, 9, 10, 12, 14, 16, 18, 19, 20, 21, 22, 24, 25, 33, 35, 36, 37, 38, 42, 50, 53, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 79, 80, 81, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 155, 158, 159, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 224, 230, 232, 237, 240, 241, 244, 246, 247, 248, 260, 270, 276, 282, 287, 291, 293, 295, 307, 309, 310, 311, 312, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "detail": [2, 9, 10, 12, 13, 16, 21, 22, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 79, 80, 81, 82, 175, 232, 237, 265, 295, 309, 310, 311, 314, 316, 317, 318, 319, 320, 321, 322, 323], "usag": [2, 21, 236, 237, 239, 269, 273, 274, 296, 307, 314, 316, 317, 318, 321, 322, 323], "guid": [2, 23, 24, 26, 33, 35, 37, 42, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 267, 287, 308, 315, 317, 319, 320], "pleas": [2, 7, 22, 34, 40, 43, 79, 80, 81, 82, 89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 139, 143, 145, 146, 147, 152, 153, 164, 165, 166, 167, 178, 179, 185, 232, 237, 270, 271, 272, 293, 307, 311, 312, 316, 318, 323], "see": [2, 7, 9, 10, 11, 12, 13, 14, 16, 18, 19, 21, 22, 23, 26, 34, 40, 43, 47, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 75, 89, 90, 91, 99, 100, 109, 110, 111, 112, 124, 125, 126, 127, 135, 136, 139, 143, 145, 146, 147, 152, 153, 164, 165, 166, 167, 168, 175, 178, 179, 185, 195, 198, 199, 202, 207, 210, 211, 214, 225, 229, 231, 232, 242, 245, 254, 255, 260, 273, 287, 291, 293, 295, 301, 307, 308, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "overview": [2, 7, 24, 26, 240, 306, 309, 310, 311, 317, 319, 320, 323], "support": [2, 4, 10, 11, 15, 16, 17, 18, 21, 22, 23, 25, 26, 27, 36, 37, 54, 55, 56, 59, 60, 61, 62, 65, 66, 67, 68, 69, 70, 73, 78, 96, 105, 119, 132, 141, 154, 161, 162, 163, 168, 169, 171, 180, 183, 184, 186, 221, 224, 232, 240, 241, 242, 247, 266, 271, 272, 274, 279, 281, 282, 308, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "sever": [2, 309, 321], "wide": [2, 9, 224, 319], "onli": [2, 4, 10, 16, 18, 22, 23, 26, 35, 36, 42, 54, 55, 56, 61, 67, 68, 74, 78, 82, 96, 105, 119, 132, 141, 156, 161, 162, 163, 168, 169, 171, 180, 186, 224, 228, 230, 232, 237, 
239, 240, 243, 246, 247, 249, 250, 252, 256, 270, 271, 272, 274, 277, 279, 281, 282, 314, 316, 317, 319, 320, 321, 322, 323], "help": [2, 11, 18, 19, 22, 63, 112, 230, 232, 240, 270, 287, 306, 307, 308, 309, 314, 315, 316, 317, 319, 321, 322, 323], "quickli": [2, 11, 24, 38, 57, 310, 315, 321], "bootstrap": [2, 11], "your": [2, 7, 9, 11, 12, 13, 14, 16, 17, 18, 21, 22, 23, 26, 27, 38, 57, 60, 64, 67, 68, 80, 81, 82, 159, 163, 232, 241, 287, 290, 291, 297, 306, 307, 308, 309, 310, 311, 314, 315, 318, 319, 320, 321, 322, 323], "fine": [2, 9, 10, 11, 12, 16, 18, 19, 20, 22, 23, 25, 26, 36, 54, 55, 56, 72, 246, 297, 306, 308, 309, 310, 311, 312, 316], "tune": [2, 4, 9, 10, 11, 12, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 29, 36, 54, 55, 56, 72, 246, 297, 306, 307, 308, 309, 310, 311, 312, 314, 316], "also": [2, 9, 10, 12, 14, 16, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 53, 60, 64, 67, 68, 72, 74, 75, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 184, 186, 190, 224, 230, 233, 267, 281, 287, 291, 297, 300, 307, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "like": [2, 6, 12, 22, 23, 24, 25, 26, 184, 232, 237, 239, 241, 272, 300, 307, 314, 315, 316, 317, 319, 320, 321, 322], "These": [2, 5, 10, 13, 15, 18, 19, 21, 22, 24, 25, 27, 54, 55, 68, 232, 261, 309, 310, 312, 315, 316, 317, 318, 320, 321, 322, 323], "ar": [2, 5, 9, 10, 11, 12, 13, 14, 15, 16, 18, 19, 20, 21, 22, 23, 24, 26, 27, 31, 35, 38, 39, 42, 45, 47, 48, 52, 54, 55, 56, 59, 60, 64, 65, 66, 67, 68, 74, 76, 77, 80, 86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 112, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 144, 145, 146, 147, 150, 151, 152, 153, 156, 161, 162, 163, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 222, 229, 230, 231, 232, 234, 240, 241, 242, 246, 247, 248, 250, 252, 261, 263, 270, 271, 273, 274, 276, 279, 280, 281, 285, 296, 297, 307, 308, 310, 312, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "especi": [2, 308, 314, 316, 321], "specifi": [2, 10, 12, 16, 18, 20, 22, 24, 25, 27, 31, 33, 35, 37, 42, 44, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 74, 76, 78, 95, 113, 118, 119, 128, 131, 132, 137, 141, 158, 160, 162, 177, 184, 186, 190, 194, 219, 224, 230, 231, 238, 239, 240, 246, 247, 253, 282, 291, 293, 296, 311, 312, 314, 315, 317, 318, 321, 322, 323], "yaml": [2, 10, 17, 18, 20, 24, 25, 27, 28, 29, 53, 60, 64, 67, 68, 72, 291, 308, 312, 314, 315, 316, 317, 318, 320, 322, 323], "config": [2, 9, 12, 13, 16, 17, 18, 19, 20, 21, 22, 23, 26, 27, 28, 29, 30, 53, 60, 64, 67, 68, 72, 102, 103, 104, 224, 252, 270, 274, 287, 291, 296, 308, 309, 310, 311, 312, 315, 316, 318, 319, 320, 321, 322, 323], "represent": [2, 269, 319, 320, 322, 323], "abov": [2, 4, 9, 16, 17, 18, 20, 22, 55, 236, 285, 307, 311, 316, 318, 320, 321, 322, 323], "text": [4, 5, 9, 11, 12, 15, 18, 19, 21, 35, 36, 37, 38, 39, 42, 44, 50, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 156, 241, 242, 254, 256, 257, 259, 261, 273, 315, 316, 322], "version": [4, 58, 74, 96, 105, 119, 132, 141, 161, 169, 171, 180, 186, 196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 224, 303, 307, 316, 318, 321, 322, 323], "famili": [4, 22, 25, 59, 61, 65, 66, 70, 71, 73, 273, 308, 314, 318, 319], "import": [4, 9, 10, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 27, 60, 64, 65, 66, 67, 68, 
72, 78, 223, 232, 233, 234, 235, 253, 264, 287, 290, 291, 315, 316, 317, 318, 319, 320, 321, 322, 323], "you": [4, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 36, 38, 55, 56, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 112, 222, 230, 232, 235, 237, 239, 242, 244, 273, 287, 290, 291, 297, 306, 307, 308, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "need": [4, 9, 10, 12, 14, 16, 18, 19, 20, 22, 23, 24, 25, 26, 38, 54, 56, 224, 228, 230, 232, 240, 241, 267, 287, 290, 291, 292, 307, 309, 310, 311, 312, 314, 315, 316, 317, 318, 320, 321, 323], "request": [4, 279, 316], "access": [4, 10, 22, 24, 25, 53, 270, 276, 309, 310, 311, 314, 316, 317], "hug": [4, 11, 22, 32, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 131, 156, 160, 184, 194, 219, 258, 278, 308, 309, 314, 317, 318], "face": [4, 11, 22, 32, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 131, 156, 160, 184, 194, 219, 258, 278, 308, 309, 314, 317, 318], "befor": [4, 19, 22, 38, 52, 54, 65, 79, 80, 82, 159, 163, 224, 229, 230, 231, 232, 237, 239, 240, 242, 247, 257, 270, 287, 311, 314, 316, 321, 322], "download": [4, 10, 11, 16, 22, 65, 304, 307, 309, 310, 311, 315, 318, 319, 320, 322, 323], "To": [4, 9, 12, 13, 14, 16, 17, 18, 19, 20, 22, 24, 25, 26, 54, 65, 230, 232, 242, 270, 297, 307, 308, 309, 311, 312, 314, 316, 317, 318, 319, 320, 321, 322, 323], "70b": [4, 84, 87, 90, 115, 121, 125, 129, 133, 135, 139, 143, 146, 165, 166, 167, 318], "meta": [4, 10, 15, 16, 20, 21, 22, 112, 226, 246, 270, 271, 309, 310, 311, 314, 315, 316, 317, 319], "ignor": [4, 9, 10, 12, 22, 42, 72, 228, 229, 231, 238, 239, 275, 297, 309, 310, 311, 314, 316, 319], "pattern": [4, 10, 19, 22, 257, 309, 310, 311, 314, 316, 319], "origin": [4, 10, 15, 16, 17, 20, 21, 22, 58, 59, 63, 236, 241, 242, 246, 247, 309, 310, 311, 315, 316, 318, 319, 320, 321, 322, 323], "consolid": [4, 10, 22, 309, 310, 311, 314, 316, 319], "00": [4, 10, 16, 22, 60, 64, 305, 309, 310, 311, 313, 314, 316, 317, 319], "pth": [4, 10, 22, 269, 309, 310, 311, 314, 316, 319], "hf": [4, 9, 18, 20, 21, 22, 264, 266, 270, 314, 315, 316, 317, 318], "token": [4, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, 20, 22, 24, 25, 36, 42, 47, 49, 50, 51, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 92, 95, 96, 101, 105, 113, 118, 119, 128, 131, 132, 137, 141, 156, 158, 159, 160, 162, 163, 169, 171, 173, 175, 177, 180, 182, 184, 186, 190, 194, 219, 224, 226, 229, 230, 231, 232, 237, 239, 240, 241, 242, 254, 255, 256, 257, 258, 259, 261, 263, 265, 268, 283, 310, 314, 316, 317, 318, 319, 320, 321, 322, 323], "hf_token": [4, 21, 309, 311, 319], "The": [4, 9, 11, 12, 15, 16, 18, 19, 20, 21, 22, 23, 24, 25, 26, 29, 30, 32, 36, 46, 47, 52, 53, 54, 55, 56, 60, 63, 64, 65, 66, 67, 68, 71, 79, 80, 81, 82, 86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 154, 156, 159, 161, 162, 163, 169, 171, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 221, 223, 225, 226, 227, 228, 232, 236, 237, 238, 239, 240, 241, 242, 246, 247, 248, 253, 254, 255, 256, 257, 258, 259, 261, 262, 264, 265, 266, 267, 270, 272, 274, 278, 279, 280, 282, 287, 291, 294, 296, 300, 301, 303, 307, 308, 309, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "reus": [4, 308], "llama3_token": [4, 15, 17, 20, 21, 65, 66, 74, 
315, 316, 318], "class": [4, 10, 13, 14, 15, 21, 24, 26, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 42, 43, 53, 54, 55, 56, 57, 65, 66, 79, 80, 81, 82, 95, 112, 117, 118, 131, 154, 155, 156, 160, 168, 171, 175, 176, 177, 184, 194, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 237, 238, 239, 240, 241, 242, 243, 245, 246, 247, 249, 251, 253, 254, 255, 256, 257, 260, 261, 264, 265, 266, 269, 270, 271, 272, 273, 274, 287, 288, 289, 290, 291, 312, 315, 317, 319, 320, 321, 323], "1b": [4, 10, 17, 148, 150, 152, 306, 314, 316], "output": [4, 10, 12, 13, 14, 20, 21, 31, 35, 45, 53, 55, 56, 59, 62, 64, 67, 69, 70, 74, 82, 86, 87, 88, 92, 96, 101, 105, 113, 117, 119, 120, 121, 122, 123, 128, 132, 133, 134, 137, 141, 142, 144, 150, 151, 154, 155, 158, 159, 161, 162, 163, 169, 170, 171, 172, 173, 176, 180, 181, 186, 189, 190, 196, 197, 200, 201, 203, 204, 205, 206, 220, 221, 223, 224, 226, 227, 229, 230, 231, 232, 237, 239, 240, 241, 242, 246, 247, 251, 252, 261, 272, 286, 289, 296, 297, 307, 309, 310, 311, 314, 316, 317, 318, 319, 320, 321, 323], "dir": [4, 10, 21, 22, 291, 307, 310, 311, 314, 316, 317, 318, 319, 322], "tmp": [4, 9, 10, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 274, 310, 311, 314, 315, 316, 317, 319], "3b": [4, 22, 149, 151, 153, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218], "languag": [4, 10, 16, 32, 74, 162, 241, 242, 246, 247, 264, 297, 309, 320, 321], "11b": [4, 157, 164], "8b": [4, 15, 16, 20, 21, 130, 134, 136, 140, 142, 144, 147, 181, 306, 309, 310, 311, 314, 315, 322], "405b": [4, 138, 142, 145], "weight": [4, 21, 22, 25, 50, 86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 144, 145, 146, 147, 150, 151, 152, 153, 157, 160, 161, 162, 163, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 228, 236, 245, 246, 247, 252, 256, 264, 270, 271, 272, 273, 282, 291, 297, 306, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 322, 323], "can": [4, 5, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 30, 33, 35, 36, 37, 38, 39, 42, 50, 53, 55, 56, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 79, 80, 82, 156, 159, 163, 223, 226, 228, 229, 230, 232, 237, 239, 240, 242, 244, 248, 256, 257, 270, 273, 275, 287, 290, 291, 293, 296, 306, 307, 308, 309, 310, 311, 312, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "instead": [4, 9, 12, 14, 16, 22, 25, 31, 45, 54, 55, 72, 82, 141, 162, 163, 228, 232, 247, 267, 270, 271, 272, 314, 318, 320, 321, 322], "remov": [4, 321], "flag": [4, 24, 25, 36, 59, 60, 62, 64, 68, 69, 70, 270, 271, 272, 277, 309, 314, 321, 323], "builder": [4, 9, 10, 11, 12, 13, 15, 16, 17, 22, 58, 60, 61, 64, 67, 68, 83, 84, 85, 86, 87, 88, 89, 90, 91, 93, 94, 97, 98, 99, 100, 102, 103, 104, 106, 107, 108, 109, 110, 111, 114, 115, 116, 117, 120, 121, 122, 123, 124, 125, 126, 127, 129, 130, 133, 134, 135, 136, 138, 139, 140, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 164, 165, 166, 167, 170, 172, 174, 176, 178, 179, 181, 183, 185, 187, 188, 189, 191, 192, 193, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 315, 321, 323], "all": [4, 5, 10, 11, 14, 15, 19, 21, 25, 30, 35, 36, 38, 42, 45, 47, 50, 53, 54, 55, 56, 67, 
82, 131, 154, 156, 160, 184, 194, 196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 219, 224, 228, 230, 232, 233, 234, 235, 236, 240, 241, 242, 244, 248, 250, 253, 260, 270, 274, 276, 280, 285, 292, 298, 299, 304, 306, 308, 309, 310, 311, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322], "7b": [4, 9, 12, 14, 18, 19, 20, 21, 61, 73, 85, 88, 91, 94, 98, 116, 117, 122, 123, 126, 127, 170, 172, 174, 176, 179, 189, 193, 205, 206, 217, 218, 270, 271, 315, 317, 318, 320, 323], "13b": [4, 83, 86, 89, 114, 120, 124], "codellama": 4, "size": [4, 14, 15, 16, 22, 25, 27, 45, 50, 59, 62, 65, 66, 69, 80, 81, 82, 154, 155, 156, 157, 159, 160, 161, 163, 196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 222, 223, 224, 225, 226, 229, 230, 231, 232, 235, 237, 239, 240, 241, 242, 261, 262, 263, 283, 285, 302, 308, 311, 314, 317, 318, 320, 321, 322], "0": [4, 9, 10, 12, 14, 15, 16, 18, 20, 22, 25, 45, 47, 48, 49, 50, 54, 60, 64, 67, 68, 74, 75, 77, 78, 82, 86, 87, 88, 89, 90, 91, 92, 96, 97, 98, 99, 100, 101, 105, 106, 107, 108, 109, 110, 111, 113, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 132, 133, 134, 135, 136, 137, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 155, 156, 158, 159, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 173, 175, 178, 179, 180, 181, 182, 185, 186, 187, 188, 189, 190, 191, 192, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 222, 223, 224, 230, 232, 233, 234, 235, 241, 246, 247, 253, 259, 264, 265, 266, 267, 268, 277, 278, 283, 287, 290, 291, 295, 300, 303, 305, 309, 311, 313, 315, 316, 317, 318, 320, 321, 322, 323], "5b": [4, 187, 188, 191, 192, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 321], "14b": [4, 196, 197, 208, 209], "32b": [4, 200, 201, 212, 213], "72b": [4, 203, 204, 215, 216], "qwen2": [4, 10, 186, 187, 188, 189, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 273, 309, 316, 321], "exampl": [4, 10, 19, 21, 22, 23, 24, 25, 26, 27, 29, 33, 35, 37, 38, 42, 44, 45, 46, 47, 48, 49, 50, 53, 54, 56, 59, 60, 61, 62, 64, 65, 66, 67, 68, 69, 70, 72, 73, 74, 76, 77, 78, 82, 155, 156, 159, 163, 222, 223, 224, 232, 233, 234, 235, 237, 239, 240, 241, 242, 244, 245, 246, 247, 248, 253, 254, 255, 256, 257, 259, 260, 264, 266, 268, 269, 270, 271, 273, 274, 282, 283, 287, 290, 291, 294, 297, 300, 301, 303, 304, 305, 307, 309, 310, 311, 313, 314, 315, 316, 318, 319, 320, 321, 322, 323], "qwen2_5": [4, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219], "1_5b": 4, "mini": [4, 21, 181, 182, 183, 184, 185], "4k": [4, 21, 182, 183, 184], "microsoft": [4, 183, 184], "ai": [4, 10, 12, 14, 19, 55, 56, 174, 291, 315, 318], "thi": [4, 9, 10, 12, 13, 14, 16, 17, 18, 19, 20, 21, 23, 24, 25, 26, 27, 31, 33, 34, 35, 36, 37, 42, 43, 44, 45, 47, 48, 50, 53, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 80, 81, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 154, 156, 158, 159, 162, 163, 168, 169, 171, 173, 175, 180, 182, 183, 184, 186, 190, 220, 222, 224, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 240, 241, 242, 244, 245, 246, 247, 248, 252, 253, 254, 255, 256, 257, 259, 260, 261, 263, 264, 265, 267, 269, 270, 271, 272, 
274, 278, 279, 281, 283, 285, 287, 288, 290, 291, 292, 293, 295, 297, 299, 300, 306, 307, 308, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "v0": [4, 9, 14, 18, 19, 21, 168], "mistralai": [4, 21, 314], "safetensor": [4, 22, 269, 270, 314, 316, 319], "2b": [4, 93, 97, 103, 107], "googl": [4, 93, 94, 102, 103, 104], "gguf": 4, "9b": [4, 104, 108], "27b": [4, 102, 106], "model_s": 4, "b": [4, 15, 25, 45, 47, 154, 155, 222, 224, 226, 230, 231, 240, 247, 262, 263, 283, 291, 316, 320, 323], "compon": [4, 6, 14, 21, 22, 25, 30, 48, 55, 56, 65, 66, 246, 308, 312, 317, 319, 320, 323], "multimod": [4, 11, 14, 36, 42, 56, 65, 66, 67, 240, 307], "encod": [4, 5, 15, 21, 50, 56, 74, 75, 82, 154, 155, 157, 158, 159, 161, 162, 163, 224, 229, 230, 231, 235, 240, 241, 242, 244, 254, 256, 257, 259, 261, 264, 267, 273, 315], "perform": [5, 12, 13, 17, 19, 20, 21, 22, 54, 74, 232, 237, 248, 260, 267, 308, 310, 311, 315, 316, 318, 319, 321, 322, 323], "direct": [5, 18, 25, 48, 86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181, 246, 264, 307, 312, 321], "id": [5, 14, 17, 21, 22, 47, 48, 49, 50, 54, 61, 65, 66, 73, 74, 75, 77, 78, 156, 224, 226, 230, 231, 240, 254, 255, 256, 257, 258, 259, 261, 270, 272, 287, 315], "decod": [5, 9, 12, 14, 15, 16, 18, 20, 21, 60, 64, 67, 68, 74, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 155, 156, 157, 158, 159, 161, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 224, 229, 230, 231, 235, 240, 242, 244, 254, 256, 257, 315, 316], "typic": [5, 9, 12, 20, 24, 33, 37, 42, 50, 54, 55, 56, 57, 72, 184, 244, 264, 267, 321, 322, 323], "byte": [5, 21, 257, 321, 323], "pair": [5, 10, 18, 21, 24, 48, 49, 63, 68, 71, 257], "underli": [5, 13, 18, 21, 223, 256, 321, 323], "helper": 5, "method": [5, 13, 14, 15, 19, 21, 22, 24, 25, 26, 29, 46, 55, 57, 59, 60, 61, 62, 63, 64, 68, 69, 70, 71, 72, 73, 156, 230, 236, 237, 240, 243, 244, 245, 249, 254, 255, 274, 282, 307, 308, 320, 323], "two": [5, 15, 18, 19, 22, 24, 35, 50, 52, 65, 66, 74, 75, 80, 232, 241, 244, 246, 261, 268, 269, 308, 309, 311, 316, 317, 318, 320, 321, 322, 323], "pre": [5, 9, 11, 12, 17, 18, 19, 20, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 112, 157, 160, 161, 232, 240, 242, 244, 246, 311, 315, 321], "train": [5, 9, 10, 11, 12, 13, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 31, 33, 35, 50, 53, 54, 55, 56, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 112, 156, 157, 160, 161, 221, 223, 224, 226, 230, 231, 236, 237, 239, 240, 241, 242, 244, 246, 264, 267, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 306, 308, 309, 310, 312, 314, 315, 316, 318, 319, 320, 321, 322, 323], "function": [5, 10, 22, 24, 25, 27, 29, 45, 46, 47, 48, 60, 64, 67, 68, 74, 80, 81, 82, 159, 163, 220, 223, 224, 232, 233, 236, 248, 252, 253, 264, 265, 267, 270, 295, 297, 299, 300, 302, 308, 309, 316, 319, 323], "preprocess": [5, 54, 232], "imag": [5, 11, 15, 35, 36, 37, 42, 44, 46, 50, 56, 65, 66, 67, 79, 80, 81, 82, 154, 155, 156, 157, 159, 160, 161, 163, 232, 241, 261, 320], "loss": [6, 9, 12, 14, 24, 25, 36, 38, 55, 56, 59, 60, 62, 64, 68, 69, 70, 237, 238, 239, 264, 265, 266, 309, 317, 319, 320, 323], "algorithm": [6, 21, 262, 267, 295], "ppo": [6, 262, 263, 264, 265, 312], "dpo": [6, 18, 48, 55, 248, 264, 266, 267, 309, 312], "offer": 7, "allow": [7, 10, 53, 242, 246, 252, 290, 311, 314, 321, 322, 323], 
"seamless": 7, "transit": 7, "between": [7, 9, 18, 19, 21, 22, 55, 60, 68, 158, 162, 229, 230, 234, 240, 263, 265, 267, 270, 273, 287, 309, 318, 319, 320, 322, 323], "interoper": [7, 22, 25, 308, 316, 323], "rest": [7, 315, 321, 323], "ecosystem": [7, 22, 25, 308, 316, 318, 323], "comprehens": [7, 321], "deep": [7, 22, 23, 24, 25, 26, 240, 242, 244, 308, 312, 316, 317, 318, 321], "dive": [7, 22, 23, 24, 25, 26, 308, 311, 312, 316, 317, 318, 321], "util": [7, 14, 16, 22, 24, 25, 27, 45, 47, 50, 154, 275, 290, 292, 293, 299, 300, 301, 302, 303, 308, 316, 317, 321, 323], "work": [7, 22, 25, 35, 42, 67, 228, 241, 242, 308, 311, 314, 316, 318, 321, 323], "set": [7, 9, 12, 17, 18, 20, 22, 23, 24, 25, 26, 33, 36, 37, 42, 50, 54, 55, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 72, 73, 113, 119, 128, 132, 137, 141, 158, 161, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224, 226, 229, 230, 233, 234, 235, 240, 248, 251, 274, 285, 287, 293, 294, 295, 296, 299, 300, 308, 312, 314, 315, 316, 317, 318, 319, 320, 321, 322], "enabl": [7, 10, 11, 17, 21, 23, 24, 25, 26, 53, 86, 87, 88, 89, 90, 91, 97, 98, 99, 100, 106, 107, 108, 109, 110, 111, 120, 121, 122, 123, 124, 125, 126, 127, 133, 134, 135, 136, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 164, 166, 167, 170, 172, 178, 179, 181, 185, 187, 188, 189, 191, 192, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 210, 211, 214, 224, 229, 230, 231, 233, 234, 235, 240, 242, 295, 296, 311, 318, 320, 321, 323], "consumpt": [7, 53, 76, 310], "dure": [7, 10, 11, 22, 54, 59, 60, 62, 64, 68, 69, 70, 222, 223, 224, 226, 230, 231, 232, 236, 240, 241, 253, 267, 281, 309, 310, 311, 315, 316, 318, 320, 321, 322, 323], "control": [7, 13, 18, 21, 25, 36, 59, 60, 62, 64, 68, 69, 70, 234, 235, 242, 246, 247, 248, 287, 295, 311, 316, 321], "lr": [7, 24, 274, 278, 280, 319, 321], "process": [7, 11, 14, 15, 17, 25, 26, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 82, 159, 163, 232, 236, 277, 284, 295, 302, 309, 317, 322, 323], "variou": 7, "provid": [7, 10, 11, 12, 14, 22, 24, 25, 27, 32, 33, 35, 37, 42, 46, 47, 51, 53, 54, 74, 76, 82, 224, 228, 230, 232, 240, 246, 247, 248, 259, 264, 272, 287, 291, 296, 300, 308, 310, 311, 314, 315, 316, 317, 318, 321], "debug": [7, 22, 24, 25, 287, 314], "finetun": [7, 10, 24, 25, 86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 240, 306, 308, 311, 317, 318, 321], "job": [7, 10, 26, 295, 317], "involv": [9, 12, 17, 20, 56, 322], "multi": [9, 18, 25, 224, 318], "turn": [9, 18, 25, 33, 36, 37, 42, 52, 55, 68, 315, 321], "multipl": [9, 16, 17, 18, 22, 24, 25, 33, 36, 37, 42, 48, 53, 56, 68, 154, 155, 224, 230, 231, 232, 240, 247, 287, 288, 289, 290, 291, 296, 317, 318, 319, 321], "back": [9, 21, 22, 52, 248, 270, 320, 321, 323], "forth": [9, 52], "user": [9, 12, 13, 14, 15, 16, 18, 19, 21, 22, 25, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 46, 52, 55, 56, 60, 64, 67, 68, 95, 113, 118, 119, 128, 131, 132, 137, 141, 158, 160, 162, 169, 171, 173, 175, 177, 180, 182, 184, 186, 190, 194, 219, 224, 259, 312, 315, 316, 317, 322], "assist": [9, 12, 13, 14, 15, 16, 18, 19, 21, 31, 32, 33, 35, 36, 37, 38, 39, 41, 42, 44, 52, 55, 56, 60, 68, 74, 95, 112, 118, 131, 160, 177, 184, 194, 219, 259, 315, 316], "role": [9, 13, 14, 15, 16, 18, 19, 21, 33, 36, 37, 38, 39, 42, 44, 55, 56, 60, 68, 95, 118, 131, 156, 160, 177, 184, 194, 219, 259, 315, 316], "content": [9, 13, 15, 16, 18, 
19, 21, 22, 33, 36, 37, 38, 39, 42, 44, 55, 56, 60, 68, 259, 315, 316], "what": [9, 14, 15, 16, 18, 22, 23, 24, 26, 36, 37, 55, 56, 60, 64, 67, 68, 112, 168, 232, 306, 312, 315, 316, 317, 318, 321], "answer": [9, 15, 16, 19, 40, 64, 67, 316, 318], "ultim": [9, 322], "question": [9, 15, 16, 19, 40, 64, 67, 316, 318], "life": 9, "42": [9, 74, 232], "That": [9, 315], "s": [9, 10, 12, 13, 14, 16, 17, 18, 19, 20, 22, 24, 25, 26, 27, 29, 32, 37, 42, 52, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 86, 87, 88, 112, 119, 120, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 154, 155, 156, 161, 162, 163, 168, 169, 170, 171, 172, 180, 181, 184, 186, 189, 190, 196, 197, 200, 201, 203, 204, 205, 206, 222, 224, 226, 230, 231, 232, 236, 240, 243, 244, 245, 246, 247, 249, 250, 252, 253, 257, 264, 266, 267, 268, 270, 271, 274, 281, 283, 287, 290, 293, 294, 297, 299, 300, 308, 309, 314, 315, 317, 319, 320, 321, 322, 323], "ridicul": 9, "oh": 9, "i": [9, 12, 14, 18, 19, 20, 25, 36, 68, 74, 112, 154, 155, 168, 223, 224, 229, 230, 231, 232, 236, 240, 251, 269, 274, 316, 318, 321, 322, 323], "know": [9, 22, 315, 319, 320], "more": [9, 10, 11, 12, 13, 14, 16, 18, 19, 21, 22, 24, 25, 38, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 232, 237, 244, 252, 269, 272, 287, 291, 293, 295, 299, 308, 309, 310, 311, 312, 314, 316, 317, 318, 319, 320, 321, 322, 323], "structur": [9, 12, 13, 14, 19, 25, 37, 39, 42, 60, 131, 156, 160, 184, 194, 219, 261, 315, 322], "than": [9, 10, 12, 16, 18, 24, 50, 52, 74, 76, 222, 224, 232, 264, 272, 273, 298, 299, 303, 315, 316, 317, 318, 319, 320, 321, 323], "freeform": [9, 12, 57, 72], "associ": [9, 10, 11, 12, 22, 24, 25, 74, 75, 82, 92, 101, 113, 128, 137, 158, 162, 173, 190, 287, 316, 320], "where": [9, 10, 12, 14, 16, 18, 19, 20, 36, 38, 45, 48, 59, 74, 76, 77, 80, 117, 154, 155, 176, 220, 224, 230, 232, 234, 237, 239, 240, 247, 256, 261, 262, 264, 265, 268, 283, 316, 319, 321], "thei": [9, 11, 12, 19, 21, 22, 24, 25, 53, 65, 66, 67, 82, 154, 159, 163, 230, 232, 242, 314, 315, 316, 320, 321, 322], "learn": [9, 12, 25, 53, 241, 242, 244, 274, 278, 280, 308, 309, 310, 311, 312, 315, 317, 318, 320, 321, 322, 323], "simpli": [9, 12, 13, 14, 16, 20, 22, 24, 54, 56, 240, 264, 309, 314, 315, 318, 319, 321, 323], "predict": [9, 12, 74, 75, 78, 262, 263, 265, 310], "next": [9, 12, 54, 72, 74, 75, 82, 232, 261, 310, 318, 323], "respond": 9, "accur": 9, "primari": [9, 12, 16, 18, 20, 24, 25, 55, 56, 312, 317], "entri": [9, 12, 16, 18, 20, 24, 25, 47, 50, 309, 312, 317, 321], "point": [9, 10, 12, 16, 18, 20, 21, 24, 25, 46, 60, 259, 309, 312, 316, 317, 318, 320, 322, 323], "torchtun": [9, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 35, 36, 37, 38, 39, 42, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 
204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302, 303, 307, 309, 310, 311, 312, 315, 317, 321], "chat_dataset": [9, 12, 13, 18, 315], "let": [9, 10, 11, 12, 16, 18, 22, 24, 26, 240, 314, 315, 316, 317, 318, 319, 320, 321, 323], "follow": [9, 10, 11, 12, 15, 16, 19, 22, 25, 36, 37, 38, 42, 50, 54, 55, 56, 64, 67, 68, 156, 224, 229, 261, 265, 272, 273, 274, 278, 285, 291, 296, 306, 307, 311, 312, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "data": [9, 10, 12, 13, 14, 15, 16, 19, 21, 23, 31, 32, 33, 35, 36, 37, 38, 39, 42, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 55, 56, 57, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 112, 160, 232, 260, 264, 266, 281, 287, 288, 289, 290, 291, 299, 310, 311, 316, 321, 322, 323], "directli": [9, 10, 12, 13, 14, 16, 22, 24, 25, 27, 31, 55, 56, 60, 64, 65, 67, 68, 72, 264, 270, 314, 316, 317, 318, 320, 321, 322, 323], "llm": [9, 10, 11, 12, 21, 25, 240, 242, 306, 307, 308, 309, 310, 312, 316, 318, 319, 320], "my_data": [9, 12, 13, 16, 315], "human": [9, 16, 18, 36, 42, 60, 112, 264, 265, 266, 315], "valu": [9, 16, 22, 24, 33, 35, 37, 42, 45, 47, 48, 50, 59, 60, 62, 63, 64, 67, 68, 69, 70, 71, 74, 75, 77, 78, 83, 84, 85, 92, 93, 94, 96, 101, 102, 103, 104, 105, 113, 114, 115, 116, 117, 119, 128, 129, 130, 132, 137, 138, 139, 140, 141, 148, 149, 156, 158, 162, 165, 169, 171, 173, 174, 175, 176, 180, 182, 186, 190, 191, 192, 193, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 222, 224, 225, 229, 230, 231, 238, 239, 240, 242, 252, 262, 263, 265, 268, 270, 273, 274, 278, 283, 287, 288, 289, 290, 291, 295, 311, 314, 315, 316, 317, 318, 320, 321, 322], "gpt": [9, 16, 42, 60, 75, 315, 316], "mistral": [9, 14, 18, 19, 21, 156, 168, 169, 170, 171, 172, 174, 175, 176, 177, 178, 179, 273, 314, 315, 316, 317], "mistral_token": [9, 14, 18, 19, 21], "m_token": [9, 14, 18, 19, 20, 21], "path": [9, 10, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 25, 26, 27, 35, 42, 46, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 95, 118, 131, 156, 160, 177, 184, 194, 219, 256, 257, 258, 270, 271, 272, 296, 314, 315, 316, 318, 320], "1": [9, 14, 16, 18, 19, 20, 21, 22, 25, 35, 42, 45, 47, 48, 49, 50, 54, 67, 70, 74, 75, 77, 78, 79, 80, 113, 119, 128, 132, 137, 138, 140, 141, 142, 144, 145, 147, 155, 156, 158, 162, 168, 169, 171, 173, 175, 180, 182, 186, 187, 188, 190, 191, 192, 198, 199, 210, 211, 222, 223, 224, 230, 232, 233, 234, 235, 237, 238, 239, 253, 256, 257, 259, 264, 265, 266, 267, 271, 273, 278, 283, 285, 287, 290, 291, 294, 295, 308, 309, 310, 314, 315, 316, 317, 320, 321, 322, 323], "prompt_templ": [9, 12, 14, 16, 18, 19, 95, 118, 131, 156, 160, 177, 184, 194, 219, 316], "mistralchattempl": [9, 14, 18, 19, 177, 315], "max_seq_len": [9, 10, 12, 14, 16, 17, 18, 20, 21, 24, 27, 47, 50, 51, 54, 55, 59, 60, 61, 62, 64, 65, 66, 67, 69, 70, 72, 73, 92, 95, 96, 101, 105, 113, 118, 119, 128, 131, 132, 137, 141, 156, 158, 160, 162, 169, 171, 173, 175, 177, 180, 182, 184, 186, 190, 194, 219, 222, 224, 226, 230, 235, 322], "8192": [9, 12, 14, 16, 17, 18, 20, 21, 160, 320, 322], "ds": [9, 10, 12, 15, 16, 18, 
20, 54, 70, 315], "sourc": [9, 10, 12, 13, 16, 18, 20, 22, 24, 27, 28, 29, 30, 31, 32, 33, 35, 36, 37, 38, 39, 42, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 92, 93, 94, 95, 96, 97, 98, 101, 102, 103, 104, 105, 106, 107, 108, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 128, 129, 130, 131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 143, 144, 148, 149, 150, 151, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 180, 181, 182, 183, 184, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 256, 257, 258, 259, 260, 261, 262, 263, 264, 265, 266, 267, 268, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 299, 300, 301, 302, 303, 304, 314, 315, 322], "data_fil": [9, 12, 13, 16, 18, 20, 55, 56, 57, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 315], "split": [9, 10, 12, 13, 14, 16, 18, 20, 22, 44, 53, 54, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 257, 315, 322], "conversation_column": [9, 60, 315], "conversation_styl": [9, 60, 315], "By": [9, 12, 22, 246, 311, 314, 319, 320, 321, 322, 323], "default": [9, 10, 12, 16, 22, 24, 31, 32, 33, 35, 36, 37, 42, 45, 48, 49, 50, 51, 54, 55, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78, 82, 83, 84, 85, 86, 87, 88, 92, 93, 94, 95, 96, 97, 98, 101, 102, 103, 104, 105, 106, 107, 108, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 128, 129, 130, 131, 132, 133, 134, 137, 138, 139, 140, 141, 142, 144, 148, 149, 150, 151, 156, 157, 160, 161, 162, 163, 165, 169, 170, 171, 172, 173, 174, 175, 176, 177, 180, 181, 182, 184, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 223, 224, 225, 226, 229, 230, 231, 232, 236, 238, 239, 240, 246, 247, 250, 252, 253, 256, 257, 259, 262, 263, 264, 267, 270, 271, 272, 274, 277, 278, 279, 286, 287, 288, 291, 294, 295, 296, 302, 307, 311, 314, 315, 316, 318, 319, 320, 321, 322, 323], "true": [9, 10, 12, 13, 14, 15, 16, 17, 22, 24, 31, 36, 45, 53, 54, 55, 57, 58, 59, 60, 62, 64, 65, 66, 67, 68, 69, 70, 72, 73, 76, 77, 82, 89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 145, 146, 147, 152, 153, 156, 157, 164, 167, 178, 179, 185, 223, 224, 229, 230, 231, 232, 233, 234, 235, 236, 237, 238, 240, 242, 246, 247, 248, 253, 256, 257, 259, 261, 262, 265, 268, 270, 271, 272, 280, 281, 283, 284, 285, 287, 290, 296, 303, 310, 314, 315, 316, 318, 320, 321, 322, 323], "train_on_input": [9, 12, 13, 18, 24, 31, 33, 35, 37, 42, 53, 58, 59, 60, 62, 63, 64, 68, 69, 70, 71], "new_system_prompt": [9, 12, 13, 33, 35, 37, 42, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70], "none": [9, 16, 25, 26, 28, 30, 31, 33, 35, 37, 42, 50, 51, 52, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78, 82, 95, 101, 105, 113, 118, 119, 128, 131, 132, 137, 141, 154, 155, 156, 158, 160, 162, 177, 184, 194, 219, 220, 222, 223, 224, 226, 
320], "section": [22, 25, 281, 306, 316, 318, 321, 323], "adapter_checkpoint": [22, 270, 271, 272], "still": [22, 156, 237, 239, 241, 242, 246, 310, 320, 322, 323], "adapt": [22, 86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 155, 169, 170, 171, 172, 180, 181, 240, 242, 243, 245, 246, 247, 248, 249, 250, 251, 270, 271, 272, 309, 310, 315, 316, 320, 323], "influenc": 22, "resuming_from_checkpoint": 22, "save_adapter_weights_onli": [22, 316], "merg": [22, 27, 28, 194, 219, 270, 316, 318, 323], "howev": [22, 53, 184, 307, 319, 321], "therefor": [22, 237, 239, 321, 323], "untrain": [22, 315], "weigth": 22, "choos": [22, 60, 316, 320], "reduc": [22, 230, 264, 308, 310, 311, 319, 320, 321, 322, 323], "amount": [22, 316], "storag": [22, 246, 247, 314, 323], "knowledg": [22, 306], "forward": [22, 25, 79, 80, 81, 154, 155, 220, 221, 223, 224, 225, 226, 227, 229, 230, 231, 232, 234, 235, 237, 238, 239, 240, 241, 242, 246, 247, 253, 264, 265, 266, 281, 296, 318, 319, 320, 321, 323], "right": [22, 47, 50, 77, 230, 270, 316, 318, 320], "full_single_devic": 22, "pytorch_fil": 22, "torchtune_sd": 22, "load_state_dict": [22, 240, 241, 242, 252, 274, 297, 320], "vocab": [22, 27, 194, 219, 230, 240, 241, 318], "24": [22, 232, 317, 318], "randint": 22, "dtype": [22, 24, 25, 78, 222, 224, 229, 230, 231, 233, 234, 235, 236, 240, 242, 279, 294, 298, 316, 319, 321, 322, 323], "1658": 22, "2459": 22, "3259": 22, "3262": 22, "6": [22, 45, 47, 48, 49, 50, 54, 92, 96, 101, 105, 225, 232, 283, 311, 316, 322, 323], "5942": 22, "2284": 22, "4090": [22, 316], "0129": 22, "0121": 22, "0127": 22, "5": [22, 24, 45, 47, 48, 49, 50, 76, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 223, 232, 253, 264, 267, 268, 278, 309, 316, 317, 318, 319, 321], "6462": 22, "8787": 22, "0950": 22, "6460": 22, "6455": 22, "6457": 22, "4156": 22, "0626": 22, "0362": 22, "6432": 22, "6437": 22, "6427": 22, "5679": 22, "6902": 22, "5267": 22, "6137": 22, "6138": 22, "6127": 22, "3688": 22, "1350": 22, "1764": 22, "4563": 22, "4565": 22, "4564": 22, "find": [22, 23, 25, 26, 264, 314, 316, 317, 319, 320, 321], "hope": 22, "deeper": [22, 309, 310, 311, 317, 321], "insight": [22, 316], "happi": [22, 316], "cometlogg": 23, "checkpoint": [23, 24, 25, 236, 240, 242, 257, 269, 270, 271, 272, 273, 274, 275, 291, 293, 297, 308, 311, 314, 318, 319, 320, 322, 323], "workspac": [23, 26, 287], "seen": [23, 26, 320, 323], "screenshot": [23, 26], "comet_ml": [23, 287], "featur": [23, 25, 26, 55, 307, 308, 309, 310, 311, 317, 321], "pip": [23, 26, 287, 290, 291, 307, 316, 318, 321], "login": [23, 26, 287, 291, 314], "metric_log": [23, 24, 26, 287, 288, 289, 290, 291], "experiment_nam": [23, 287], "experi": [23, 24, 287, 291, 306, 308, 318, 319, 320], "grab": [23, 26, 318], "tab": [23, 26], "asset": 23, "artifact": [23, 26, 296], "click": [23, 26], "effect": [24, 223, 267, 319, 321, 322], "prerequisit": [24, 315, 316, 317, 318, 319, 320, 322, 323], "Be": [24, 315, 316, 317, 318, 319, 320, 321, 322, 323], "familiar": [24, 315, 316, 317, 318, 319, 320, 322, 323], "fundament": [24, 322], "reproduc": [24, 287], "overridden": [24, 296], "quick": 24, "seed": [24, 25, 26, 223, 295, 316, 317, 322], "shuffl": [24, 54, 322], "fp32": [24, 225, 230, 237, 239, 321, 322, 323], "enable_fsdp": 24, "keyword": [24, 27, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 223, 236, 246, 247, 315], "subfield": 24, "dotpath": [24, 95, 118, 131, 
160, 177, 184, 194, 219], "wish": [24, 222, 234, 297], "exact": [24, 27, 316], "normal": [24, 54, 156, 221, 224, 225, 229, 230, 231, 237, 238, 239, 256, 315, 320, 322, 323], "instanc": [24, 27, 53, 119, 132, 141, 161, 169, 171, 180, 186, 187, 188, 191, 192, 236, 243, 249, 251, 320], "preced": [24, 27, 314, 318, 320], "throw": 24, "miss": [24, 252, 296, 320], "llama2_token": [24, 315], "llama2token": [24, 118], "512": [24, 82, 323], "overwrit": [24, 272, 297, 307, 314], "duplic": [24, 25, 308, 314, 316], "refer": [24, 25, 226, 232, 237, 248, 263, 264, 265, 266, 267, 287, 308, 309, 320, 321, 322], "resolv": [24, 28, 317], "alpaca": [24, 31, 53, 58, 59, 86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 319], "disklogg": 24, "log_dir": [24, 288, 290, 291], "verifi": [24, 279, 300, 315, 317, 320], "properli": [24, 252, 285, 314], "wa": [24, 35, 42, 50, 67, 76, 80, 81, 82, 159, 163, 232, 252, 315, 320, 322, 323], "7b_lora_single_devic": [24, 316, 317, 320, 323], "my_config": [24, 314], "guidelin": 24, "tempt": 24, "put": [24, 25, 312, 317, 320, 322], "much": [24, 241, 267, 318, 320, 321, 322, 323], "switch": 24, "encourag": [24, 267, 320, 321], "clariti": 24, "significantli": [24, 264, 310, 311, 321], "dont": 24, "privat": [24, 314], "parent": [24, 314], "guarante": 24, "stabil": [24, 237, 239, 308, 311, 321, 322, 323], "underscor": 24, "_alpaca": 24, "k1": [24, 25], "v1": [24, 25, 73], "k2": [24, 25], "v2": [24, 25, 287], "my_model_checkpoint": 24, "file_1": 24, "file_2": 24, "my_tokenizer_path": 24, "nest": [24, 299], "dot": 24, "notat": [24, 50, 154, 155, 224, 226, 230, 240, 262, 263, 283], "bitsandbyt": [24, 321], "pagedadamw8bit": [24, 321], "delet": [24, 230, 233, 234, 235, 240, 316], "foreach": 24, "8b_full": [24, 314, 316], "adamw": [24, 320, 321], "2e": [24, 321], "fuse": [24, 158, 162, 240, 241, 242, 243, 292, 322], "nproc_per_nod": [24, 309, 311, 318, 320, 322], "full_finetune_distribut": [24, 280, 314, 316, 317], "thought": [25, 308, 312, 317, 323], "target": [25, 76, 238, 239, 267, 308, 319], "pipelin": [25, 308, 311], "eg": [25, 230, 240, 270, 308], "meaning": [25, 308, 316, 321], "fsdp": [25, 228, 274, 317, 318, 321], "activ": [25, 82, 220, 275, 281, 286, 293, 296, 308, 311, 322, 323], "gradient": [25, 238, 239, 292, 296, 308, 311, 316, 318, 320, 323], "accumul": [25, 292, 296, 308, 311], "mix": [25, 221, 314, 316, 321], "precis": [25, 221, 236, 279, 308, 311, 317, 323], "complex": 25, "becom": [25, 232, 307], "harder": 25, "anticip": 25, "methodolog": 25, "possibl": [25, 54, 269, 314, 321], "trade": [25, 321], "vs": [25, 317, 321], "qualiti": [25, 309, 316, 320, 322], "believ": 25, "suit": [25, 317, 321], "solut": 25, "result": [25, 65, 82, 159, 163, 232, 239, 259, 261, 277, 296, 309, 311, 316, 318, 319, 320, 321, 322, 323], "meant": [25, 236, 274], "expertis": 25, "routin": 25, "yourself": [25, 314, 318, 320], "exist": [25, 235, 242, 274, 287, 307, 314, 316, 317, 318, 323], "ones": [25, 50, 222], "modular": [25, 308], "wandb": [25, 26, 291, 317], "fulli": [25, 53, 161], "nativ": [25, 306, 308, 320, 322, 323], "numer": [25, 66, 308, 311, 322], "pariti": [25, 308], "verif": 25, "benchmark": [25, 295, 308, 316, 318, 320, 322], "limit": [25, 274, 316, 319, 322], "hidden": [25, 82, 155, 159, 163, 220, 230, 232], "behind": 25, "unnecessari": 25, "abstract": [25, 254, 255, 308, 317, 323], "No": [25, 272, 308], "go": [25, 82, 112, 159, 163, 168, 232, 259, 308, 316, 317, 
319, 321, 323], "figur": [25, 320, 323], "spectrum": 25, "decid": 25, "avail": [25, 35, 42, 67, 73, 240, 242, 279, 285, 300, 308, 314, 316, 318, 320, 321], "consist": [25, 33, 37, 42, 65, 66, 73, 312, 317], "overrid": [25, 28, 29, 33, 37, 42, 62, 63, 65, 66, 67, 68, 69, 70, 297, 312, 314, 316, 317, 318, 319, 323], "valid": [25, 52, 77, 238, 246, 247, 252, 263, 280, 297, 298, 307, 312, 316, 317], "closer": [25, 319, 320], "monolith": [25, 308], "trainer": [25, 264, 266, 267], "wrapper": [25, 221, 256, 257, 274, 276, 314, 320], "around": [25, 156, 221, 256, 257, 281, 314, 315, 316, 320, 321, 322, 323], "extern": 25, "primarili": [25, 53, 320], "eleutherai": [25, 73, 308, 319, 320, 322], "har": [25, 308, 319, 320, 322], "stage": [25, 232], "distil": [25, 306], "resum": [25, 278, 316, 323], "dataload": [25, 54, 59, 62, 65, 66, 69], "clean": [25, 26, 58, 319], "group": [25, 224, 284, 287, 288, 289, 290, 291, 302, 314, 318, 322], "init_process_group": [25, 284], "backend": [25, 314, 322], "gloo": 25, "nccl": 25, "fullfinetunerecipedistribut": 25, "cleanup": 25, "stuff": 25, "carri": [25, 56], "metric": [25, 317, 319, 321, 322], "logger": [25, 286, 287, 288, 289, 290, 291, 301, 317], "_devic": 25, "get_devic": 25, "_dtype": 25, "get_dtyp": 25, "ckpt_dict": 25, "wrap": [25, 242, 253, 275, 293, 315, 321], "_model": [25, 274], "_setup_model": 25, "_setup_token": 25, "_optim": 25, "_setup_optim": 25, "_loss_fn": 25, "_setup_loss": 25, "_sampler": 25, "_dataload": 25, "_setup_data": 25, "backward": [25, 274, 276, 292, 296, 323], "zero_grad": 25, "curr_epoch": 25, "rang": [25, 241, 264, 265, 267, 295, 314, 318, 322], "epochs_run": [25, 26], "total_epoch": [25, 26], "idx": [25, 54], "enumer": 25, "_autocast": 25, "logit": [25, 74, 75, 78, 237, 238, 239, 283, 319], "global_step": 25, "_log_every_n_step": 25, "_metric_logg": 25, "log_dict": [25, 287, 288, 289, 290, 291], "step": [25, 54, 55, 56, 65, 66, 230, 240, 262, 274, 276, 278, 287, 288, 289, 290, 291, 292, 296, 306, 311, 316, 320, 322, 323], "recipe_main": [25, 29], "fullfinetunerecip": 25, "wandblogg": [26, 320, 323], "tip": 26, "straggler": 26, "background": 26, "crash": 26, "otherwis": [26, 45, 47, 50, 80, 81, 82, 159, 163, 230, 232, 285, 287, 315, 322], "exit": [26, 234, 235, 248, 307, 314], "resourc": [26, 287, 288, 289, 290, 291, 321, 322], "kill": 26, "ps": 26, "aux": 26, "grep": 26, "awk": 26, "xarg": 26, "desir": [26, 55, 56, 246, 247, 294, 309, 315, 316, 321], "suggest": [26, 319], "approach": [26, 53, 309, 319], "full_finetun": 26, "joinpath": 26, "_checkpoint": 26, "_output_dir": [26, 270, 271, 272], "torchtune_model_": 26, "with_suffix": 26, "wandb_at": 26, "descript": [26, 314], "whatev": 26, "seed_kei": 26, "epochs_kei": 26, "total_epochs_kei": 26, "max_steps_kei": 26, "max_steps_per_epoch": [26, 322], "add_fil": 26, "log_artifact": 26, "hydra": 27, "facebook": 27, "research": 27, "com": [27, 75, 86, 87, 88, 97, 98, 102, 103, 104, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 226, 237, 238, 264, 265, 266, 267, 278, 287, 307, 314, 316, 318, 319], "facebookresearch": 27, "blob": [27, 75, 86, 87, 88, 97, 98, 102, 103, 104, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 184, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 226, 238, 264, 265, 266, 267, 278], "_intern": 27, "_instantiate2": 27, "l148": 27, "num_head": [27, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 159, 162, 163, 
169, 171, 173, 175, 180, 182, 186, 190, 224, 226, 230], "num_kv_head": [27, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 222, 224], "vocab_s": [27, 74, 75, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 237, 238, 239, 241], "32000": [27, 320], "parsed_yaml": 27, "4096": [27, 61, 73, 101, 105, 224, 226, 316, 320, 322], "embed_dim": [27, 79, 80, 81, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224, 226, 229, 230, 231, 232, 241, 242, 297, 320], "valueerror": [27, 33, 35, 37, 42, 44, 46, 47, 50, 52, 53, 55, 59, 60, 62, 64, 65, 66, 67, 69, 70, 72, 186, 222, 224, 232, 233, 234, 235, 246, 247, 270, 271, 272, 279, 281, 295, 298], "recipe_nam": 28, "rank": [28, 86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 277, 285, 295, 302, 309, 310, 317, 320, 323], "zero": [28, 50, 222, 225, 230, 240, 269, 316, 318, 322], "displai": 28, "callabl": [29, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 82, 223, 230, 248, 282, 293], "With": [29, 316, 319, 320, 322, 323], "my_recip": 29, "foo": 29, "bar": [29, 308, 317, 321], "configerror": 30, "cannot": [30, 46, 272, 318], "equival": [31, 35, 80, 266, 267], "condit": [31, 74, 285, 314], "dedic": 31, "due": [31, 256, 320, 321, 323], "keep": [31, 33, 35, 37, 42, 63, 64, 66, 67, 68, 71, 228, 241, 316, 320, 321], "openai": [32, 37, 60, 265], "markup": 32, "im_start": 32, "context": [32, 183, 234, 235, 248, 294, 296, 321], "im_end": 32, "goe": [32, 248], "a2": [33, 55], "functool": [34, 40, 43], "partial": [34, 40, 43], "_prompt_templ": [34, 40, 43], "assistant_messag": [34, 40, 43], "respect": [35, 53, 112, 222, 246, 247, 251, 296, 315], "final": [35, 42, 55, 56, 67, 86, 87, 88, 92, 96, 101, 105, 113, 119, 120, 121, 122, 123, 128, 132, 133, 134, 137, 141, 142, 144, 150, 151, 158, 159, 161, 162, 163, 169, 170, 171, 172, 173, 180, 181, 186, 189, 190, 196, 197, 200, 201, 203, 204, 205, 206, 220, 230, 240, 252, 318, 319, 320, 321, 323], "leav": [35, 42, 67, 321], "liter": [36, 38, 41, 86, 87, 88, 89, 90, 91, 95, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 131, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 160, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 177, 178, 179, 180, 181, 184, 185, 186, 187, 188, 189, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 219, 252], "union": [36, 46, 47, 58, 59, 60, 62, 64, 69, 70, 72, 73, 95, 118, 131, 160, 177, 184, 194, 219, 223, 230, 240, 246, 247, 251, 253, 270, 275, 280, 287, 288, 289, 290, 291, 293, 295], "interleav": [36, 261], "attach": 36, "writer": 36, "calcul": [36, 38, 77, 154, 156, 224, 229, 231, 232, 262, 263, 265, 318], "consecut": [36, 52, 222, 261], "properti": [36, 320, 321], "media": [36, 56], "classmethod": 36, "image_url": 37, "unmask": [37, 42, 238], "consid": [38, 53, 55, 56, 80, 81, 82, 159, 163, 232, 321], "come": [38, 52, 245, 246, 247, 320, 321], "nanswer": 40, "alia": 41, "alwai": [42, 287, 297, 315, 321], "nsummari": [43, 315], "summari": [43, 53, 69, 232, 281], "batch_first": 45, "padding_valu": 45, "float": [45, 74, 75, 78, 86, 87, 88, 89, 90, 91, 92, 96, 97, 98, 99, 100, 101, 105, 106, 107, 108, 109, 110, 
111, 113, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 132, 133, 134, 135, 136, 137, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 156, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 173, 175, 178, 179, 180, 181, 182, 185, 186, 187, 188, 189, 190, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 224, 225, 246, 247, 253, 262, 263, 264, 265, 266, 267, 274, 278, 280, 281, 286, 287, 288, 289, 290, 291, 320, 321, 322, 323], "rnn": [45, 47, 50], "pad_sequ": [45, 47, 50], "variabl": [45, 273, 285, 287, 321, 323], "left": [45, 47, 50, 230, 320], "longest": [45, 49, 50], "trail": 45, "dimens": [45, 50, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 154, 158, 159, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 220, 222, 223, 224, 226, 230, 232, 241, 246, 247, 318, 320, 321, 323], "element": [45, 47, 50, 53, 238, 283], "8": [45, 47, 48, 50, 59, 62, 65, 66, 69, 86, 87, 88, 89, 90, 91, 97, 98, 99, 100, 106, 107, 108, 109, 110, 111, 120, 121, 122, 123, 124, 125, 126, 127, 133, 134, 135, 136, 137, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 170, 172, 178, 179, 181, 185, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 222, 232, 237, 239, 316, 319, 320, 321, 322, 323], "9": [45, 47, 48, 50, 222, 232, 283, 322, 323], "12": [45, 47, 48, 70, 232, 307, 322], "image_loc": 46, "www": [46, 287], "org": [46, 65, 83, 84, 85, 86, 87, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 114, 115, 116, 117, 119, 120, 121, 122, 124, 125, 126, 127, 132, 133, 134, 135, 136, 144, 145, 146, 147, 150, 151, 152, 153, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 224, 225, 226, 232, 253, 261, 262, 264, 265, 266, 267, 285, 290, 293, 295, 301, 307], "en": [46, 57, 61, 63, 72, 73, 322], "pad_direct": [47, 50], "keys_to_pad": 47, "padding_idx": [47, 48, 49, 50, 54], "left_pad_sequ": [47, 50], "integ": [47, 49, 241, 269, 275, 295], "batch_siz": [47, 59, 62, 65, 66, 69, 222, 224, 229, 230, 231, 233, 234, 235, 237, 238, 239, 240, 241, 242, 264, 266, 268, 316, 321, 322], "ignore_idx": [48, 49, 50], "input_id": [48, 283], "chosen_input_id": [48, 68], "chosen_label": 48, "15": [48, 232, 315, 320, 323], "16": [48, 86, 87, 88, 89, 90, 91, 97, 98, 99, 100, 106, 107, 108, 109, 110, 111, 120, 121, 122, 123, 124, 125, 126, 127, 133, 134, 135, 136, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 170, 172, 178, 179, 181, 185, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 222, 232, 316, 320, 321, 323], "17": [48, 232, 320], "18": [48, 232, 318], "19": [48, 232, 323], "20": [48, 232, 268, 322], "token_pair": 49, "padded_col": 49, "pad_max_til": 50, "pad_max_imag": 50, "tile": [50, 79, 80, 81, 82, 154, 155, 156, 157, 159, 160, 161, 163, 232, 261], "aspect": [50, 79, 80, 308], "ratio": [50, 79, 80, 264, 265], "text_seq_len": [50, 261], "n_tile": [50, 79, 80, 232], "h": [50, 154, 222, 232, 237, 239, 307, 314], "w": [50, 83, 84, 85, 93, 94, 102, 103, 104, 114, 115, 116, 117, 129, 130, 138, 139, 140, 148, 149, 154, 165, 174, 176, 191, 192, 193, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 232, 287, 290, 291, 315, 320, 323], "h_ratio": 50, "w_ratio": 50, "encoder_mask": [50, 229, 230, 240], "image_seq_len": [50, 261], "channel": [50, 82, 154, 156, 159, 163, 232, 246, 322], "height": [50, 154], "largest": 50, "bsz": [50, 74, 75, 76, 77, 79, 80, 232, 237, 239], "max_num_imag": 50, "max_num_til": [50, 79, 80, 82, 156, 159, 163, 232], "tokens_per_til": 
50, "image_id": 50, "four": [50, 320], "model_input": 50, "max_text_seq_len": 50, "40": [50, 80, 81, 82, 159, 163, 232, 261, 321, 323], "did": [50, 318, 323], "extra": [50, 156, 240, 307, 315, 320, 321, 322, 323], "second": [50, 224, 241, 320, 321, 323], "eos_id": [51, 156, 257, 259], "shorter": [52, 230], "min": [52, 320], "invalid": 52, "sub": [53, 290], "unifi": [53, 176], "simplifi": [53, 264, 314, 319, 320], "simultan": 53, "intern": [53, 240], "aggreg": 53, "transpar": 53, "constitu": 53, "might": [53, 234, 241, 244, 314, 316, 321], "comput": [53, 55, 56, 113, 119, 128, 132, 137, 141, 154, 155, 158, 162, 186, 190, 224, 226, 230, 231, 237, 239, 240, 261, 264, 266, 281, 295, 311, 316, 319, 321, 322, 323], "cumul": 53, "maintain": [53, 242, 310, 321, 323], "deleg": 53, "retriev": [53, 55, 56, 230], "lead": [53, 256, 269, 311], "scale": [53, 74, 75, 78, 86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 137, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 225, 227, 229, 231, 246, 247, 253, 263, 267, 320, 321, 322, 323], "strategi": [53, 311], "stream": [53, 301, 321], "demand": 53, "deriv": [53, 220, 230, 231], "instans": 53, "dataset1": 53, "mycustomdataset": 53, "params1": 53, "dataset2": 53, "params2": 53, "concat_dataset": 53, "total": [53, 263, 265, 278, 302, 305, 313, 316, 318, 319, 320, 321], "data_point": 53, "1500": 53, "vicgal": 53, "gpt4": 53, "samsum": [53, 69], "focus": [53, 312, 317, 321], "enhanc": [53, 232, 267, 321, 323], "divers": 53, "machin": [53, 266, 300, 309, 314], "max_pack": 54, "outsid": [54, 295, 296, 320], "sampler": [54, 317], "part": [54, 223, 241, 266, 315, 323], "buffer": [54, 230, 240, 246, 247, 321], "enough": [54, 315, 321], "lower": [54, 311, 319, 320], "triangular": 54, "wise": 54, "made": [54, 60, 64, 67, 68, 72, 156], "smaller": [54, 241, 318, 319, 320, 321, 322, 323], "jam": 54, "s1": [54, 256], "s2": [54, 256], "s3": 54, "s4": 54, "contamin": 54, "input_po": [54, 75, 224, 226, 230, 231, 240], "matrix": [54, 229, 230, 240], "increment": 54, "move": [54, 72, 230, 246, 247, 250, 299, 321], "entir": [54, 72, 237, 244, 315, 323], "avoid": [54, 72, 225, 232, 236, 277, 295, 314, 322, 323], "truncat": [54, 61, 72, 73, 95, 118, 131, 156, 160, 177, 184, 194, 219, 257, 268], "sentenc": [54, 72, 316], "techniqu": [55, 308, 309, 310, 311, 316, 317, 318, 319, 320, 321, 322], "repons": 55, "At": [55, 56, 230, 240], "extract": [55, 56, 61, 258], "against": [55, 56, 267, 303, 322, 323], "unit": [55, 56, 308], "filepath": [55, 56, 57, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73], "filter": [55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 322], "prior": [55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 297], "doc": [55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 225, 285, 287, 290, 291, 295, 301, 314], "round": [56, 322], "incorpor": [56, 264], "happen": [56, 237, 239], "ti": [56, 96, 105, 186, 190, 228, 321], "agnost": 56, "treat": [56, 232, 248, 315], "minimum": [56, 65, 66, 67], "corpu": [57, 61, 72, 73], "package_refer": [57, 61, 63, 72, 73], "loading_method": [57, 61, 63, 72, 73], "tabular": [57, 72], "eo": [57, 72, 184, 256, 259, 315], "yahma": 58, "variant": [58, 62, 69, 321], "page": [58, 73, 307, 308, 314, 317, 318, 321], "tatsu": 59, "lab": [59, 75], "codebas": 59, "independ": 59, "contribut": [59, 60, 62, 64, 68, 69, 70, 238, 239, 263, 265], 
"alpacatomessag": 59, "alpaca_d": 59, "altern": [60, 64, 67, 68, 234, 317, 321], "toward": [60, 267, 309], "my_dataset": [60, 64, 67], "london": [60, 64], "ccdv": 61, "cnn_dailymail": 61, "textcompletiondataset": [61, 72, 73], "cnn": 61, "dailymail": 61, "articl": [61, 73], "highlight": [61, 323], "disabl": [61, 73, 223, 230, 234, 240, 248, 253, 295, 322], "highest": [61, 73], "conjunct": [62, 69, 71, 230], "grammar_d": 62, "rlhflow": 63, "hh": 63, "preferencedataset": [63, 68, 71], "liuhaotian": 65, "llava": 65, "150k": 65, "coco": 65, "train2017": 65, "llava_instruct_150k": 65, "2017": 65, "visit": 65, "cocodataset": 65, "wget": 65, "zip": [65, 304], "unzip": 65, "minim": [65, 66, 317, 319, 320, 321, 322, 323], "clip": [65, 66, 79, 80, 81, 82, 154, 155, 156, 159, 163, 232, 265, 273], "mymodeltransform": [65, 66], "tokenizer_path": [65, 66], "image_transform": [65, 66], "yet": [65, 66, 67, 168, 314, 315, 316], "llava_instruct_d": 65, "huggingfacem4": 66, "the_cauldron": 66, "cauldron": 66, "card": 66, "cauldron_d": 66, "pictur": 67, "logo": 67, "rgb_pytorch": 67, "png": 67, "compris": 68, "share": [68, 224, 228, 316], "c1": 68, "r1": 68, "chosen_messag": 68, "rejected_messag": 68, "samsung": 69, "samsum_d": 69, "351": 70, "82": 70, "391": 70, "221": 70, "220": 70, "193": 70, "471": 70, "lvwerra": 71, "stack": [71, 232, 296], "exchang": 71, "allenai": [72, 322], "data_dir": 72, "realnewslik": 72, "wikitext_document_level": 73, "wikitext": [73, 322], "103": 73, "transformerdecod": [74, 75, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 113, 114, 115, 116, 117, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 158, 162, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 178, 179, 180, 181, 182, 183, 185, 186, 187, 188, 189, 190, 191, 192, 193, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 229, 231, 240, 241, 242, 320], "max_generated_token": 74, "pad_id": [74, 268], "temperatur": [74, 75, 78, 264, 266, 267, 316], "top_k": [74, 75, 78, 316], "stop_token": [74, 268], "rng": 74, "custom_generate_next_token": 74, "seq_length": [74, 75, 76, 229, 231, 241, 242], "prune": [74, 78, 323], "probabl": [74, 78, 86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 246, 247, 253, 264, 265, 266, 267, 309, 316, 319], "stop": [74, 268], "random": [74, 223, 232, 295, 317], "compil": [74, 237, 316, 318, 321, 323], "generate_next_token": 74, "llama3_8b": [74, 134, 142, 240, 318, 321, 322], "manual_se": 74, "tolist": 74, "jeremi": 74, "m": [74, 236, 309, 315, 322], "seq_len": [74, 76, 77, 230], "num_generated_token": 74, "q": [75, 78, 224, 320], "randomli": [75, 78, 297], "softmax": [75, 78, 224, 230, 231, 240, 319], "trick": [75, 78], "fast": [75, 316], "32971d3129541c5bfb4f715abc33d1c5f408d204": 75, "l40": 75, "padding_mask": [76, 77, 265, 268], "target_seq_len": 76, "suitabl": 76, "scaled_dot_product_attent": [76, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 224], "static": 76, "kv": [76, 222, 224, 230, 231, 233, 234, 235, 240, 322], "cach": [76, 222, 224, 226, 229, 230, 231, 233, 234, 235, 240, 242, 307, 314], 
"longer": [76, 222, 321], "boolean": [76, 77, 82, 224, 229, 230, 231, 240, 242, 253, 283], "assertionerror": [76, 82, 222, 229, 230, 252, 297], "shift": [77, 230], "uniform_": 78, "int32": 78, "patch": [79, 80, 81, 82, 155, 156, 159, 163, 232, 261], "vision_transform": [79, 80, 81, 82], "visiontransform": [79, 80, 81, 82], "divid": [79, 80, 81, 82, 156, 159, 163, 232, 238, 239, 261], "dimension": [79, 80, 81, 82, 159, 163, 232], "n_img": [79, 80, 232], "n_tokens_per_til": [79, 80, 81], "crop": [79, 80, 81, 82, 154, 159, 163, 232], "local_token_positional_embed": 80, "_position_embed": [80, 232], "tokenpositionalembed": [80, 232], "gate": [80, 227, 273, 309, 310, 311, 314, 317], "global_token_positional_embed": 80, "400": [80, 81, 82, 159, 163, 232, 261], "10x10": [80, 81, 82, 159, 163, 232, 261], "grid": [80, 81, 82, 159, 163, 232, 261], "th": [80, 222], "silu": [82, 220], "cls_output_dim": [82, 232], "attn_bia": 82, "use_rop": 82, "out_indic": [82, 232], "output_cls_project": 82, "in_channel": [82, 159, 163, 232], "append_cls_token": [82, 232], "transformerencoderlay": 82, "cl": [82, 155, 232], "mlp": [82, 86, 87, 88, 92, 96, 97, 98, 101, 105, 106, 107, 108, 113, 119, 120, 121, 122, 123, 128, 132, 133, 134, 137, 141, 142, 144, 150, 151, 158, 161, 162, 163, 169, 170, 171, 172, 173, 175, 180, 181, 182, 186, 187, 188, 189, 190, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 229, 230, 231, 252, 318, 320, 321], "bia": [82, 228, 245, 246, 247, 297, 320, 322, 323], "2d": 82, "rope": [82, 137, 141, 186, 190, 224, 226], "intermedi": [82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 159, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 232, 272, 293, 316, 318, 323], "fourth": [82, 159, 163, 232], "determin": [82, 159, 163], "divis": [82, 225], "code_llama2": [83, 84, 85, 86, 87, 88, 89, 90, 91, 314, 316], "arxiv": [83, 84, 85, 86, 87, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 114, 115, 116, 117, 119, 120, 121, 122, 124, 125, 126, 127, 132, 133, 134, 135, 136, 144, 145, 146, 147, 150, 151, 152, 153, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 224, 226, 232, 253, 261, 262, 264, 265, 266, 267, 309], "pdf": [83, 84, 85, 261, 262], "2308": [83, 84, 85], "12950": [83, 84, 85], "lora_attn_modul": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 252, 310, 320, 321, 323], "q_proj": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 224, 252, 310, 320, 321, 322, 323], "k_proj": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 224, 252, 310, 320, 321, 322, 323], "v_proj": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 
108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 224, 252, 310, 320, 321, 322, 323], "output_proj": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 224, 252, 320, 321, 322, 323], "apply_lora_to_mlp": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 252, 310, 320, 321], "apply_lora_to_output": [86, 87, 88, 89, 90, 91, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 252, 320, 321], "lora_rank": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 310, 320, 321], "lora_alpha": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 310, 320, 321], "lora_dropout": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 321], "use_dora": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 143, 144, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 321], "quantize_bas": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 321, 323], "code_llama2_13b": 86, "tloen": [86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 
196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206], "8bb8579e403dc78e37fe81ffbb253c413007323f": [86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206], "l41": [86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206], "l43": [86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206], "linear": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 144, 145, 146, 147, 150, 151, 152, 153, 155, 161, 162, 163, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 228, 230, 245, 246, 247, 252, 253, 320, 321, 322, 323], "low": [86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 309, 310, 316, 319, 320, 323], "approxim": [86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 320], "factor": [86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 137, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 262, 316], "dropout": [86, 87, 88, 92, 96, 97, 98, 101, 105, 106, 107, 108, 113, 119, 121, 122, 123, 128, 132, 133, 134, 137, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 173, 175, 180, 181, 182, 186, 187, 188, 189, 190, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 224, 246, 247, 253, 320, 321, 323], "decompos": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181, 246, 309, 310], "magnitud": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181, 246, 250, 321], "dora": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 141, 144, 150, 151, 162, 163, 169, 170, 171, 172, 180, 181, 246, 309, 310], "ab": [86, 87, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 114, 115, 116, 117, 119, 120, 121, 122, 124, 125, 126, 127, 132, 133, 134, 135, 136, 144, 145, 146, 147, 150, 151, 152, 153, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 224, 226, 232, 253, 264, 265, 266, 267], "2402": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181], "09353": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181], "code_llama2_70b": 87, "code_llama2_7b": 88, "qlora": [89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 145, 146, 147, 152, 153, 164, 167, 178, 179, 185, 236, 306, 308, 309, 310, 318, 320], "paper": [89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 145, 146, 147, 152, 153, 164, 167, 178, 179, 185, 253, 261, 
264, 266, 267, 319, 320, 323], "2305": [89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 145, 146, 147, 152, 153, 164, 167, 178, 179, 185, 224, 264, 266], "14314": [89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 145, 146, 147, 152, 153, 164, 167, 178, 179, 185], "lora_code_llama2_13b": 89, "lora_code_llama2_70b": 90, "lora_code_llama2_7b": 91, "head_dim": [92, 96, 101, 105, 222, 224, 230], "intermediate_dim": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190], "attn_dropout": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 224, 230], "norm_ep": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190], "1e": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 225, 319, 321], "06": [92, 96, 101, 105, 225, 320], "rope_bas": [92, 96, 101, 105, 113, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190], "10000": [92, 96, 101, 105, 113, 169, 171, 173, 175, 180, 182, 226], "transformerselfattentionlay": [92, 101, 113, 128, 137, 173, 190, 229, 230, 240, 242], "rm": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 316], "norm": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 230], "space": [92, 101, 113, 128, 137, 158, 162, 173, 190, 230, 244, 321], "slide": [92, 173, 183], "window": [92, 173, 183], "vocabulari": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 237, 239, 320, 321], "mha": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224, 230], "onto": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 224, 244], "epsilon": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 265], "rotari": [92, 96, 101, 105, 113, 128, 137, 141, 169, 171, 173, 175, 180, 182, 226, 318], "10_000": [92, 96, 101, 105, 169, 171, 173, 175, 182], "blog": [93, 94], "technolog": [93, 94], "develop": [93, 94, 307, 323], "gemmatoken": 95, "_templatetyp": [95, 118, 131, 160, 177, 184, 194, 219], "gemma_2b": [97, 107], "gemma_7b": [98, 106, 108], "lora_gemma_2b": 99, "lora_gemma_7b": 100, "hidden_capping_valu": [101, 105], "50": [101, 105, 232, 268, 287, 316], "final_capping_valu": [101, 105], "30": [101, 105, 232, 268, 322], "sliding_window_s": [101, 105], "query_pre_attn_scalar": [101, 105], "gemma2": [102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 273, 316], "gemma_pytorch": [102, 103, 104], "lora_gemma2_27b": 109, "lora_gemm2a_2b": 110, "lora_gemma2_9b": 111, "taken": [112, 316, 320, 323], "sy": [112, 315], "honest": [112, 309, 315], "pari": [112, 168], "capit": [112, 168], "franc": [112, 168], "known": [112, 168, 282], "stun": [112, 168], "05": [113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190], "gqa": [113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224], "mqa": [113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224], "kvcach": [113, 119, 128, 132, 137, 141, 158, 162, 180, 186, 190, 224, 230, 233, 234, 235, 240], "scale_hidden_dim_for_mlp": [113, 119, 128, 132, 137, 141, 158, 162, 186, 190], "2307": [114, 115, 116, 117], "09288": [114, 115, 116, 117], "classif": [117, 171, 175, 176, 273], "llama2_13b": 120, "llama2_70b": 121, "llama2_7b": [122, 320], 
"classifi": [123, 171, 175, 176, 297, 321], "llama2_reward_7b": [123, 273], "lora_llama2_13b": 124, "lora_llama2_70b": 125, "lora_llama2_7b": [126, 320], "lora_llama2_reward_7b": 127, "500000": [128, 132, 137, 141, 158, 162], "500_000": [128, 137, 141], "70": 129, "llama3token": [131, 156, 255], "regist": [131, 156, 160, 184, 194, 219, 236, 292, 323], "canon": [131, 156, 160, 184, 194, 219], "llama3_70b": 133, "lora_llama3_70b": 135, "lora_llama3_8b": [136, 321], "scale_factor": [137, 141], "llama3_1": [138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 309, 310, 316, 319], "llama3_1_8b": 144, "lora_llama3_1_405b": 145, "lora_llama3_1_70b": [146, 166, 167], "lora_llama3_1_8b": 147, "llama3_2_1b": [150, 233, 234, 235], "lora_llama3_2_1b": 152, "lora_llama3_2_3b": 153, "projection_head": [154, 240, 244], "combin": [154, 156, 159, 163, 230, 240, 242, 244, 263, 319], "learnabl": [154, 227, 240, 242, 246, 316], "fusion": [154, 157, 158, 159, 161, 162, 163, 240, 241, 242, 243, 244], "encoder_dim": [154, 155], "decoder_dim": [154, 155], "num_img": [154, 155], "num_emb": [154, 155], "broken": [154, 155, 232, 242], "width": [154, 322], "clip_embeds_per_til": 154, "emb": [154, 155, 224, 229, 230, 240], "num_hidden_input": 155, "frozen": [155, 161, 164, 241, 264, 320, 321, 323], "sequenti": [155, 240, 244], "num_hidden": 155, "hidden_st": [155, 232], "image_mean": 156, "image_std": 156, "tranform": 156, "possible_resolut": 156, "448": [156, 157, 160, 161], "deviat": 156, "transformed_data": 156, "img1": [156, 261], "img2": [156, 261], "31587": [156, 256, 257], "29644": [156, 256, 257], "102": [156, 256, 257], "truncate_at_eo": [156, 257], "skip": [156, 224], "tokenize_head": 156, "tokenize_end": 156, "header": 156, "eom": 156, "wether": 156, "decoder_train": [157, 161, 164, 240], "encoder_train": [157, 161, 164, 240], "fusion_train": [157, 161, 164, 240], "deepfusionmodel": [157, 161, 164], "trainabl": [157, 161, 242, 247, 251, 320, 321, 323], "resiz": [157, 160, 161], "fusion_interv": [158, 162], "num_special_token": [158, 162], "encoder_max_seq_len": [158, 162, 229, 230, 231, 235, 240, 242], "causalselfattent": [158, 162], "interv": [158, 162, 317], "clip_embed_dim": [159, 163], "clip_num_lay": [159, 163], "clip_hidden_st": [159, 163], "num_layers_project": [159, 163], "decoder_embed_dim": [159, 163], "llama3visionencod": [159, 163], "spatial": [159, 163], "backbon": [159, 163], "trainbl": 161, "decoder_lora": 162, "fusion_lora": [162, 163], "encoder_lora": 163, "quantization_kwarg": [163, 246, 247], "lora_llama3_2_vision_11b": 164, "llama3_3": [165, 166, 167], "llama3_1_70b": 165, "num_class": [171, 175, 297], "announc": 174, "ray2333": 176, "feedback": [176, 264], "lora_mistral_7b": 178, "lora_mistral_reward_7b": 179, "phi3_mini": [181, 273], "128k": 183, "nor": 183, "phi3minitoken": 184, "spm": 184, "lm": [184, 265, 319], "bo": [184, 256, 259, 315], "unk": 184, "augment": [184, 323], "endoftext": 184, "phi3minisentencepiecebasetoken": 184, "lora_phi3_mini": 185, "1000000": [186, 190], "tie_word_embed": [186, 187, 188, 190, 191, 192, 195, 198, 199, 202, 207, 210, 211, 214], "qwen2transformerdecod": 186, "period": [186, 190], "word": [186, 190, 321, 322], "qwen2_0_5b": [187, 228], "qwen2_1_5b": [188, 228], "qwen2_7b": 189, "qwen": [191, 192, 193, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218], "merges_fil": [194, 219], "qwen2token": 194, "qwen2_5_0_5b": 195, "qwen2_5_14b_bas": 196, "slightli": [196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 
217, 218, 319, 321], "qwen2_5_14b_instruct": 197, "qwen2_5_1_5b_bas": 198, "qwen2_5_1_5b_instruct": 199, "qwen2_5_32b_bas": 200, "qwen2_5_32b_instruct": 201, "qwen2_5_3b": 202, "qwen2_5_72b_bas": 203, "qwen2_5_72b_instruct": 204, "qwen2_5_7b_bas": 205, "qwen2_5_7b_instruct": 206, "qwen2_5token": 219, "gate_proj": 220, "down_proj": 220, "up_proj": 220, "feed": [220, 229, 231], "network": [220, 248, 320, 323], "fed": [220, 315], "multipli": [220, 321], "in_dim": [220, 245, 246, 247, 320, 321, 323], "out_dim": [220, 230, 245, 246, 247, 320, 321, 323], "layernorm": 221, "past": 222, "dpython": [222, 224, 229, 230, 231, 235, 236, 240, 242, 294, 298], "reset": [222, 224, 229, 230, 231, 240, 242, 281], "k_val": 222, "v_val": 222, "fill": 222, "bfloat16": [222, 236, 294, 316, 317, 318, 320, 321, 322], "greater": [222, 232, 303], "prob": 223, "disable_on_ev": [223, 253], "portion": [223, 322, 323], "regular": [223, 264, 267, 321, 322, 323], "potenti": [223, 311, 320, 321], "layer_dropout": 223, "randn": 223, "pos_embed": [224, 229, 320, 322], "q_norm": 224, "k_norm": 224, "kv_cach": [224, 233, 234, 235], "is_caus": 224, "13245v1": 224, "multihead": 224, "extrem": 224, "credit": 224, "litgpt": 224, "v": [224, 230, 240, 320], "n_kv_head": 224, "rotarypositionalembed": [224, 320, 322], "rmsnorm": 224, "vice": [224, 314], "versa": [224, 314], "y": [224, 309], "s_x": 224, "s_y": 224, "_masktyp": [224, 230, 231], "score": [224, 230, 231, 263], "encoder_max_cache_seq_len": [224, 230, 231], "decoder_max_cache_seq_len": 224, "j": [224, 229, 230, 231, 240, 309], "blockmask": [224, 230, 231], "create_block_mask": [224, 230, 231], "flex_attent": [224, 230, 231], "n_h": [224, 226], "num": [224, 226], "n_kv": 224, "h_d": [224, 226], "reset_cach": [224, 229, 230, 231, 240, 242], "setup_cach": [224, 229, 230, 231, 233, 234, 240, 242], "ep": 225, "squar": 225, "stabl": [225, 285, 290, 295, 307, 321], "html": [225, 285, 290, 293, 295, 301, 306], "propos": [226, 321], "2104": 226, "09864": 226, "verfic": 226, "l80": 226, "init": [226, 281, 286, 291, 323], "exceed": 226, "freq": 226, "recomput": [226, 321], "geometr": 226, "progress": [226, 253, 312, 317, 321], "rotat": 226, "angl": 226, "basic": [227, 318], "tied_modul": 228, "pointer": [228, 308], "why": [228, 315, 317, 320], "whose": [228, 248, 287, 292], "attributeerror": [228, 299], "attn": [229, 231, 233, 234, 235, 320, 322, 323], "multiheadattent": [229, 231, 320, 322], "ca_norm": 229, "mlp_norm": [229, 231], "ca_scal": 229, "mlp_scale": [229, 231], "ff": [229, 231], "caches_are_en": [229, 230, 231, 233, 234, 235, 240, 242], "func": [229, 231, 242], "caches_are_setup": [229, 230, 231, 233, 234, 235, 240, 242], "token_sequ": 229, "embed_sequ": 229, "decoder_max_seq_len": [229, 230, 231, 233, 234, 235, 240, 242], "modulelist": [230, 253], "output_hidden_st": [230, 240], "belong": [230, 276], "statement": 230, "improv": [230, 257, 266, 309, 311, 318, 319, 320, 321], "readabl": 230, "behaviour": [230, 240, 297, 309], "alter": [230, 240], "common_util": [230, 233, 234, 235, 236], "disable_kv_cach": [230, 240], "chunked_output": 230, "last_hidden_st": 230, "chunk": [230, 237, 239, 257], "cewithchunkedoutputloss": [230, 240], "upcast": [230, 237, 239], "set_num_output_chunk": [230, 240], "num_chunk": [230, 237, 239], "s_e": [230, 240], "d_e": [230, 240], "arang": [230, 240], "prompt_length": [230, 240], "correspondingli": 230, "padded_prompt_length": 230, "m_": [230, 240], "realloc": [230, 240], "runtimeerror": [230, 246, 259, 274, 279, 280, 284], "num_output_chunk": 
[230, 237, 239, 240], "transformercrossattentionlay": [230, 240, 242], "fusionlay": [230, 240], "sa_norm": 231, "sa_scal": 231, "token_pos_embed": 232, "pre_tile_pos_emb": 232, "post_tile_pos_emb": 232, "cls_project": 232, "vit": 232, "2010": [232, 253], "11929": 232, "convolut": 232, "flatten": 232, "downscal": 232, "800x400": 232, "400x400": 232, "_transform": 232, "whole": [232, 319], "n_token": 232, "101": 232, "pool": 232, "tiledtokenpositionalembed": 232, "tilepositionalembed": 232, "tile_pos_emb": 232, "8x8": 232, "21": 232, "22": 232, "23": [232, 278], "25": [232, 319], "26": 232, "27": 232, "28": 232, "29": [232, 323], "31": [232, 318], "33": 232, "34": 232, "35": [232, 323], "36": [232, 309], "37": 232, "38": 232, "39": 232, "41": 232, "43": 232, "44": 232, "45": 232, "46": 232, "47": 232, "48": [232, 323], "49": 232, "51": 232, "52": [232, 317], "53": 232, "54": 232, "55": [232, 317], "56": 232, "57": [232, 320, 323], "58": 232, "59": [232, 323], "60": 232, "61": 232, "62": 232, "63": 232, "64": [232, 310, 320, 321], "num_patches_per_til": 232, "emb_dim": 232, "constain": 232, "anim": 232, "max_n_img": 232, "n_channel": 232, "vision_util": 232, "tile_crop": 232, "800": 232, "patch_grid_s": 232, "rand": 232, "nch": 232, "tile_cropped_imag": 232, "batch_imag": 232, "unsqueez": 232, "batch_aspect_ratio": 232, "clip_vision_encod": 232, "cache_en": 233, "float32": [233, 234, 235, 279, 319, 321], "1024": [233, 234, 235, 322], "temporarili": [234, 235, 248, 321], "enter": [234, 235], "overhead": [234, 264, 311, 321, 322], "untouch": [234, 315], "yield": [234, 235, 248], "caller": [234, 235, 248], "delete_kv_cach": 235, "offload_to_cpu": 236, "hook": [236, 292, 321, 323], "nf4": [236, 321, 323], "restor": 236, "higher": [236, 316, 318, 319, 321, 322, 323], "offload": [236, 323], "increas": [236, 253, 264, 278, 309, 318, 319, 320, 321, 322], "peak": [236, 277, 281, 286, 316, 318, 320, 323], "gpu": [236, 309, 311, 314, 316, 317, 318, 319, 320, 321, 322, 323], "_register_state_dict_hook": 236, "mymodul": 236, "_after_": 236, "nf4tensor": [236, 323], "unquant": [236, 322, 323], "unus": 236, "ignore_index": [237, 238, 239, 319], "entropi": [237, 239, 319], "bf16": [237, 239, 279, 316, 321, 323], "ce": [237, 319], "better": [237, 239, 267, 308, 315, 319, 322], "accuraci": [237, 239, 311, 316, 318, 319, 320, 321, 322, 323], "doubl": [237, 239, 323], "num_token": [237, 238, 239], "consider": [237, 239], "compute_cross_entropi": 237, "gain": [237, 311, 318], "won": [237, 315, 321], "realiz": 237, "pull": [237, 309, 310, 311, 314], "1390": 237, "loss_fn": [237, 239], "chunkedcrossentropyloss": 237, "output_chunk": [237, 239], "kullback": [238, 319], "leibler": [238, 319], "diverg": [238, 239, 263, 319], "jongwooko": [238, 319], "distillm": [238, 319], "17c0f98bc263b1861a02d5df578c84aea652ee65": 238, "student_logit": [238, 239, 319], "teacher_logit": [238, 239, 319], "student": [238, 239], "teacher": [238, 239], "kl": [238, 239, 263, 319], "teacher_chunk": 239, "teacher_model": 239, "model_fus": [240, 241, 242, 243, 244], "deepfus": 240, "evolut": 240, "interspers": 240, "assumpt": 240, "signatur": 240, "interchang": 240, "fusion_param": [240, 241, 242, 243, 244], "fusionembed": 240, "fusion_lay": [240, 242], "clip_vit_224": [240, 244], "feedforward": [240, 244], "register_fusion_modul": 240, "strict": [240, 241, 242, 252, 320], "freez": [240, 316, 320], "fusion_vocab_s": 241, "necessit": 241, "rout": 241, "128": [241, 310, 318, 320], "fusion_first": 242, "flamingo": [242, 261], "shot": [242, 316, 
318, 322], "infus": 242, "interpret": 242, "enocd": 242, "isn": [242, 279, 314, 321], "fused_lay": 242, "mark": [244, 315], "earli": 244, "peft": [245, 246, 247, 248, 249, 250, 251, 252, 270, 309, 310, 316, 320, 323], "adapter_param": [245, 246, 247, 248, 249, 251], "proj": 245, "loralinear": [245, 320, 321, 323], "alpha": [246, 247, 320, 321, 323], "use_bia": [246, 247], "scalar": [246, 287, 288, 289, 290, 291, 321], "orient": [246, 321], "bax": [246, 247], "distinct": [246, 323], "to_nf4": [246, 247, 323], "block_siz": [246, 247], "scaler_block_s": [246, 247], "granular": [246, 247], "scaler": [246, 247], "lora_a": [246, 247, 320, 323], "lora_b": [246, 247, 320, 323], "initialize_dora_magnitud": 246, "to_empti": [246, 247], "recurs": [246, 247, 290], "submodul": [246, 247], "perturb": 247, "decomposit": [247, 320, 321], "matric": [247, 320, 323], "mapsto": 247, "w_0x": 247, "r": [247, 309, 316, 320], "polici": [248, 263, 264, 265, 266, 267, 293, 309, 312], "neural": [248, 309, 320, 323], "shard": [250, 271, 277, 318], "get_adapter_param": [251, 320], "base_miss": 252, "base_unexpect": 252, "lora_miss": 252, "lora_unexpect": 252, "reli": [252, 259, 316, 318], "unexpect": 252, "nonempti": 252, "prob_max": 253, "prob_layer_scal": 253, "scaletyp": 253, "uniform": 253, "layers_str": 253, "modulelayerdropoutwrapp": 253, "inplac": [253, 299, 320], "mymodel": 253, "super": [253, 319], "uniformli": 253, "layerdrop": 253, "fan": 253, "et": [253, 320], "al": [253, 320], "1909": 253, "11556v1": 253, "linearli": [253, 278], "zhang": 253, "13369": 253, "exponenti": 253, "layerskip": 253, "elhoushi": 253, "2404": 253, "16710": 253, "exp": 253, "tiktokenbasetoken": 254, "light": 256, "sentencepieceprocessor": 256, "trim": 256, "whitespac": 256, "spm_model": [256, 315], "tokenized_text": [256, 257], "add_bo": [256, 257, 315], "trim_leading_whitespac": 256, "prefix": [256, 321], "unbatch": 256, "bos_id": [257, 259], "substr": 257, "repetit": 257, "speed": [257, 296, 318, 321, 322, 323], "identif": 257, "regex": 257, "absent": 257, "tt_model": 257, "tokenizer_json_path": 258, "heavili": 259, "concat": 259, "1788": 259, "2643": 259, "465": 259, "22137": 259, "join": 259, "satisfi": 259, "loos": 260, "image_token_id": 261, "particip": [261, 262], "laid": 261, "fig": 261, "2204": 261, "14198": 261, "immedi": [261, 321], "until": [261, 321], "img3": 261, "equal": [261, 303, 321], "gamma": [262, 266, 267, 309], "lmbda": 262, "estim": [262, 263], "1506": 262, "02438": 262, "response_len": [262, 263], "receiv": 262, "discount": 262, "gae": 262, "logprob": [263, 267], "ref_logprob": 263, "kl_coeff": 263, "valid_score_idx": 263, "coeffici": [263, 265], "total_reward": 263, "kl_reward": 263, "beta": [264, 267], "label_smooth": [264, 267], "18290": 264, "intuit": [264, 266, 267], "dispref": 264, "dynam": [264, 322], "degener": 264, "occur": [264, 311], "naiv": 264, "trl": [264, 266, 267], "5d1deb1445828cfd0e947cb3a7925b1c03a283fc": 264, "dpo_train": [264, 266], "l844": 264, "2009": 264, "01325": 264, "baselin": [264, 265, 319, 320], "uncertainti": [264, 267], "policy_chosen_logp": [264, 266], "policy_rejected_logp": [264, 266], "reference_chosen_logp": [264, 266], "reference_rejected_logp": [264, 266], "chosen_reward": [264, 266], "rejected_reward": [264, 266], "value_clip_rang": 265, "value_coeff": 265, "proxim": [265, 312], "1707": 265, "06347": 265, "eqn": 265, "vwxyzjn": 265, "ccc19538e817e98a60d3253242ac15e2a562cb49": 265, "lm_human_preference_detail": 265, "train_policy_acceler": 265, "l719": 265, 
"ea25b9e8b234e6ee1bca43083f8f3cf974143998": 265, "ppo2": 265, "l68": 265, "l75": 265, "pi_old_logprob": 265, "pi_logprob": 265, "phi_old_valu": 265, "phi_valu": 265, "value_padding_mask": 265, "old": 265, "participag": 265, "five": 265, "policy_loss": 265, "value_loss": 265, "clipfrac": 265, "fraction": 265, "statist": [266, 309, 321], "rso": [266, 309], "hing": [266, 309], "2309": [266, 309], "06657": [266, 309], "logist": 266, "regress": 266, "slic": 266, "10425": 266, "almost": [266, 320], "svm": 266, "counter": 266, "4dce042a3863db1d375358e8c8092b874b02934b": 266, "l1141": 266, "2405": 267, "14734": 267, "simpo": 267, "averag": [267, 319], "implicit": 267, "margin": [267, 309], "bradlei": 267, "terri": 267, "larger": [267, 272, 316, 318, 319, 321], "win": 267, "lose": 267, "98ad01ddfd1e1b67ec018014b83cba40e0caea66": 267, "cpo_train": 267, "l603": 267, "pretti": 267, "identitc": 267, "elimin": 267, "kind": 267, "ipoloss": 267, "fill_valu": 268, "sequence_length": 268, "stop_token_id": 268, "869": 268, "eos_mask": 268, "truncated_sequ": 268, "filename_format": [269, 270], "max_filenam": [269, 270], "concis": 269, "filenam": [269, 288], "file_": 269, "_of_": 269, "n_file": 269, "build_checkpoint_filenam": 269, "00003": [269, 319], "file_00001_of_00003": 269, "file_00002_of_00003": 269, "file_00003_of_00003": 269, "recipe_checkpoint": [270, 271, 272, 322], "safe_seri": 270, "should_load_recipe_st": [270, 271, 272], "from_pretrain": [270, 316], "0001_of_0003": 270, "0002_of_0003": 270, "largest_epoch": [270, 271, 272], "recipe_state_dirnam": [270, 271, 272], "receip": 270, "deprec": [270, 271, 272], "preserv": [270, 323], "weight_map": 270, "convert_weight": 270, "_model_typ": [270, 273], "intermediate_checkpoint": [270, 271, 272], "adapter_onli": [270, 271, 272], "_weight_map": 270, "wip": 271, "qualnam": 273, "boundari": 273, "distinguish": 273, "llama3_vis": 273, "llama3_2_vision_decod": 273, "mistral_reward_7b": 273, "clip_text": 273, "clip_text_encoder_larg": 273, "my_new_model": 273, "my_custom_state_dict_map": 273, "optim_map": 274, "bare": 274, "bone": 274, "optim_dict": [274, 276, 292], "p": [274, 309, 320, 322, 323], "cfg_optim": 274, "ckpt": 274, "optim_ckpt": 274, "placeholder_optim_dict": 274, "optiminbackwardwrapp": 274, "get_last_lr": 274, "rate": [274, 278, 280, 308, 317, 321], "schedul": [274, 278, 296, 317, 321], "get_optim_kei": 274, "arbitrari": [274, 320, 321], "optim_ckpt_map": 274, "set_lr_schedul": 274, "lr_schedul": [274, 278], "lrschedul": 274, "loadabl": 274, "step_lr_schedul": 274, "ac_mod": 275, "ac_opt": 275, "op": [275, 322], "ac": 275, "optimizerinbackwardwrapp": [276, 280], "named_paramet": [276, 297], "sharded_sd": 277, "dtensor": 277, "is_rank_zero": 277, "rank0": 277, "num_warmup_step": 278, "num_training_step": 278, "num_cycl": [278, 296], "last_epoch": 278, "lambdalr": 278, "decreas": [278, 320, 321, 322, 323], "cosin": 278, "v4": 278, "src": 278, "l104": 278, "warmup": [278, 296], "phase": 278, "wave": 278, "half": [278, 321], "kernel": 279, "warpper": 280, "optimizer_in_backward": 280, "reset_stat": 281, "track": [281, 287], "alloc": [281, 286, 318, 321, 323], "reserv": [281, 286, 315, 323], "stat": [281, 286, 323], "int4": [282, 322], "4w": 282, "recogn": 282, "int8dynactint4weightquant": [282, 311, 322], "8da4w": [282, 322], "int4weightonlyquant": [282, 322], "int8dynactint4weightqatquant": [282, 311, 322], "qat": [282, 306, 312], "int4weightonlyqatquant": 282, "master": 285, "port": [285, 314], "address": [285, 319, 321], "peak_memory_act": 286, 
"peak_memory_alloc": 286, "peak_memory_reserv": 286, "get_memory_stat": 286, "api_kei": 287, "experiment_kei": 287, "onlin": 287, "log_cod": 287, "comet": 287, "site": 287, "ml": 287, "team": 287, "compar": [287, 290, 303, 316, 318, 319, 320, 322, 323], "sdk": 287, "uncategor": 287, "alphanumer": 287, "charact": 287, "get_or_cr": 287, "fresh": 287, "persist": 287, "hpo": 287, "sweep": 287, "server": 287, "offlin": 287, "auto": [287, 314, 316], "creation": 287, "experimentconfig": 287, "project_nam": 287, "my_workspac": 287, "my_metr": [287, 290, 291], "importerror": [287, 291], "termin": [287, 290, 291], "comet_api_kei": 287, "flush": [287, 288, 289, 290, 291], "ndarrai": [287, 288, 289, 290, 291], "record": [287, 288, 289, 290, 291, 296], "log_config": [287, 291], "payload": [287, 288, 289, 290, 291], "log_": 288, "unixtimestamp": 288, "thread": 288, "safe": [288, 309], "organize_log": 290, "tensorboard": 290, "subdirectori": 290, "logdir": 290, "startup": 290, "tfevent": 290, "encount": 290, "frontend": 290, "organ": [290, 314], "accordingli": [290, 322], "my_log_dir": 290, "view": [290, 319], "entiti": 291, "bias": [291, 320, 323], "usernam": [291, 314, 316], "my_ent": 291, "my_group": 291, "account": [291, 320, 323], "link": [291, 316, 318], "capecap": 291, "6053ofw0": 291, "torchtune_config_j67sb73v": 291, "soon": [292, 321], "readi": [292, 306, 315, 322], "grad": 292, "auto_wrap_polici": 293, "acwrappolicytyp": 293, "author": [293, 308, 317, 321, 323], "fsdp_adavnced_tutori": 293, "insid": 294, "contextmanag": 294, "debug_mod": 295, "pseudo": 295, "commonli": [295, 309, 320, 321, 323], "numpi": 295, "determinist": 295, "global": [295, 321], "warn": 295, "nondeterminist": 295, "cudnn": 295, "set_deterministic_debug_mod": 295, "profile_memori": 296, "with_stack": 296, "record_shap": 296, "with_flop": 296, "wait_step": 296, "warmup_step": 296, "active_step": 296, "profil": 296, "layout": 296, "trace": 296, "profileract": 296, "gradient_accumul": 296, "sensibl": 296, "default_schedul": 296, "reduct": [296, 311, 320], "scope": 296, "flop": 296, "cycl": 296, "repeat": 296, "model_named_paramet": 297, "force_overrid": 297, "concret": [297, 321], "vocab_dim": 297, "named_param": 298, "exclude_param_nam": 298, "too": [299, 311, 318], "npu": 300, "xpu": 300, "handler": 301, "_log": 301, "aka": 302, "__version__": 303, "generated_examples_python": 304, "galleri": [304, 313], "sphinx": 304, "000": [305, 313, 318], "execut": [305, 313], "generated_exampl": 305, "mem": [305, 313], "mb": [305, 313], "gentl": 306, "introduct": 306, "first_finetune_tutori": 306, "maxim": [306, 308], "kd": 306, "torchvis": 307, "torchao": [307, 311, 316, 318, 321, 322, 323], "latest": [307, 311, 317, 321, 323], "whl": 307, "cu121": 307, "cu118": 307, "cu124": 307, "confirm": 307, "And": 307, "welcom": [307, 314], "greatest": [307, 317], "contributor": 307, "dev": 307, "commit": 307, "branch": 307, "therebi": [307, 321, 322, 323], "forc": [307, 319], "reinstal": 307, "opt": [307, 317], "suffix": 307, "On": [308, 320], "emphas": 308, "simplic": 308, "component": 308, "prove": 308, "democrat": 308, "zoo": 308, "varieti": [308, 309, 320], "integr": [308, 316, 317, 318, 320, 322, 323], "fsdp2": [308, 321], "excit": 308, "checkout": 308, "quickstart": 308, "attain": 308, "embodi": 308, "philosophi": 308, "usabl": 308, "composit": 308, "hard": 308, "outlin": 308, "unecessari": 308, "never": 308, "thoroughli": 308, "aim": [309, 319, 321], "steer": 309, "goal": [309, 322], "domain": 309, "interest": [309, 310, 311, 319], 
"sft": 309, "grant": [309, 310, 311, 317], "lora_dpo_single_devic": 309, "8b_lora_dpo_single_devic": 309, "lora_dpo_distribut": 309, "8b_lora_dpo": 309, "primer": 309, "un": [309, 311], "degrad": [309, 311, 321, 322, 323], "grow": 309, "art": 309, "good": [309, 315, 320, 321], "penal": 309, "gap": 309, "7b_lora_dpo_single_devic": 309, "rsoloss": 309, "lever": [309, 310, 311], "rafailov": 309, "sharma": 309, "mitchel": 309, "man": 309, "ermon": 309, "finn": 309, "2024": 309, "secretli": 309, "liu": 309, "zhao": 309, "joshi": 309, "khalman": 309, "saleh": 309, "2023": 309, "preprint": 309, "competit": 310, "8b_lora_single_devic": [310, 314, 315, 316, 318, 319, 321], "action": [310, 311], "simul": [311, 321, 322], "compromis": 311, "blogpost": [311, 321], "qat_distribut": [311, 322], "8b_qat_ful": [311, 322], "workload": [311, 321, 322], "least": [311, 318, 320, 321, 322], "vram": [311, 318, 320, 321, 322], "80gb": [311, 322], "a100": 311, "h100": 311, "delai": 311, "fake": [311, 316, 322], "empir": [311, 322], "fake_quant_after_n_step": [311, 322], "idea": [311, 319, 323], "roughli": [311, 321], "total_step": 311, "futur": [311, 322], "plan": [311, 316], "groupsiz": [311, 322], "256": [311, 318, 321, 322], "hackabl": [312, 317], "singularli": [312, 317], "technic": [312, 317], "awar": [312, 321, 322], "tracker": 312, "short": 314, "subcommand": 314, "anytim": 314, "kaggl": 314, "symlink": 314, "authent": [314, 317], "successfulli": [314, 317], "wrote": 314, "bin": [314, 316], "metaresearch": 314, "lot": [314, 316, 321], "recent": 314, "releas": [314, 318], "agre": 314, "term": [314, 321], "perman": 314, "eat": 314, "bandwith": 314, "00030": 314, "ootb": 314, "7b_full_low_memori": [314, 316, 317], "8b_full_single_devic": [314, 316], "mini_full_low_memori": [314, 316], "7b_full": [314, 316, 317], "13b_full": [314, 316, 317], "70b_full": 314, "edit": 314, "clobber": 314, "destin": 314, "lora_finetune_distribut": [314, 318, 320], "torchrun": 314, "launch": [314, 315, 317], "nproc": 314, "node": 314, "worker": 314, "nnode": [314, 320, 322], "minimum_nod": 314, "maximum_nod": 314, "fail": 314, "rdzv": 314, "rendezv": 314, "endpoint": 314, "8b_lora": [314, 318], "bypass": 314, "fancy_lora": 314, "8b_fancy_lora": 314, "nice": 315, "meet": 315, "overhaul": 315, "multiturn": 315, "accompani": 315, "who": 315, "influenti": 315, "hip": 315, "hop": 315, "artist": 315, "2pac": 315, "rakim": 315, "flavor": 315, "formatted_messag": 315, "nyou": 315, "nwho": 315, "518": 315, "25580": 315, "29962": 315, "3532": 315, "14816": 315, "29903": 315, "6778": 315, "_spm_model": 315, "piece_to_id": 315, "learnt": 315, "manual": [315, 323], "529": 315, "29879": 315, "29958": 315, "nhere": 315, "pure": 315, "mess": 315, "prime": 315, "strictli": 315, "ask": [315, 321], "though": 315, "robust": 315, "pretend": 315, "zuckerberg": 315, "seem": [315, 316], "altogeth": 315, "honor": 315, "custom_8b_lora_single_devic": 315, "favorit": [316, 320], "seemlessli": 316, "connect": [316, 322], "natur": 316, "export": 316, "leverag": [316, 318, 323], "percentag": 316, "16gb": [316, 320], "rtx": 316, "3090": 316, "hour": 316, "3b_full_single_devic": 316, "7b_full_single_devic": 316, "1b_full": 316, "3b_full": 316, "9b_full": 316, "27b_full": 316, "mini_ful": 316, "7b_qlora_single_devic": [316, 317, 323], "3b_lora_single_devic": 316, "facilit": 316, "100mib": 316, "But": [316, 320], "realli": 316, "eleuther_ev": [316, 318, 322], "eleuther_evalu": [316, 318, 322], "lm_eval": [316, 318], "custom_eval_config": [316, 318], "todo": 316, 
"TO": 316, "1234": 316, "truthfulqa_mc2": [316, 318, 319, 320], "max_seq_length": [316, 322], "enable_kv_cach": 316, "measur": [316, 318], "propens": [316, 318], "324": 316, "loglikelihood": 316, "custom_generation_config": [316, 318], "again": 316, "tell": 316, "me": 316, "joke": 316, "max_new_token": 316, "300": 316, "kick": 316, "noodl": 316, "impasta": 316, "int8_weight_onli": [316, 318], "int8_dynamic_activation_int8_weight": [316, 318], "ao": [316, 318], "quant_api": [316, 318], "quantize_": [316, 318], "int4_weight_onli": [316, 318], "previous": [316, 318, 320], "benefit": 316, "peftmodel": 316, "automodelforcausallm": 316, "autotoken": 316, "trained_model_path": 316, "original_model_nam": 316, "peft_model": 316, "generate_text": 316, "max_length": 316, "return_tensor": 316, "pretrained_model_name_or_path": 316, "vllm": 316, "succesfulli": 316, "base_model": [316, 320], "samplingparam": 316, "print_output": 316, "generated_text": 316, "80": 316, "load_format": 316, "kv_cache_dtyp": 316, "sampling_param": 316, "max_token": 316, "essai": 316, "educ": 316, "use_tqdm": 316, "great": [316, 321], "huggingface_hub": 316, "hfapi": 316, "whoami": 316, "repo_nam": 316, "repo_id": 316, "create_repo": 316, "upload_fold": 316, "folder_path": 316, "repo_typ": 316, "create_pr": 316, "hopefulli": 316, "gave": 316, "minut": 317, "agreement": 317, "depth": 317, "principl": 317, "boilerpl": 317, "substanti": [317, 320], "custom_config": 317, "replic": 317, "lorafinetunerecipesingledevic": 317, "lora_finetune_output": 317, "log_1713194212": 317, "3697006702423096": 317, "25880": [317, 323], "01": 317, "83it": 317, "monitor": 317, "tqdm": 317, "e2": 317, "focu": 318, "theta": 318, "observ": [318, 322], "gb": [318, 320, 322, 323], "consum": [318, 323], "overal": [318, 319], "8b_qlora_single_devic": [318, 321], "fact": [318, 320], "coupl": [318, 320, 323], "meta_model_0": [318, 322], "122": 318, "sarah": 318, "busi": 318, "mum": 318, "young": 318, "children": 318, "live": 318, "north": 318, "east": 318, "england": 318, "135": 318, "88": 318, "sec": 318, "94": 318, "138": 318, "bandwidth": 318, "346": 318, "09": 318, "139": 318, "broader": 318, "teach": [319, 320], "straight": [319, 320], "jump": [319, 320], "compress": 319, "transfer": 319, "capac": 319, "computation": 319, "expens": 319, "deploi": 319, "imit": 319, "diagram": 319, "minillm": 319, "forwardklloss": 319, "teacher_prob": 319, "student_logprob": 319, "log_softmax": 319, "prod_prob": 319, "sum": [319, 320], "forwardklwithchunkedoutputloss": 319, "knowledge_distillation_single_devic": 319, "bit": [319, 320, 321, 322, 323], "alpaca_cleaned_dataset": 319, "hellaswag": [319, 322], "commonsense_qa": 319, "kd_ratio": 319, "teacher_checkpoint": 319, "00004": 319, "truthfulqa": [319, 320], "commonsens": 319, "constant": 319, "hf_model_0001_0": 319, "boost": 319, "graph": [319, 321], "irrespect": 319, "3e": 319, "truthful_qa": 319, "wherea": 319, "unfamiliar": 320, "oppos": [320, 323], "momentum": [320, 321], "aghajanyan": 320, "hypothes": 320, "intrins": 320, "eight": 320, "practic": [320, 321], "blue": 320, "although": [320, 321, 322], "rememb": 320, "approx": 320, "15m": 320, "65k": 320, "99": 320, "requires_grad": [320, 323], "frozen_out": [320, 323], "lora_out": [320, 323], "lora_model": 320, "lora_llama_2_7b": [320, 323], "alon": 320, "in_featur": [320, 322], "out_featur": [320, 322], "validate_missing_and_unexpected_for_lora": 320, "peft_util": 320, "set_trainable_param": 320, "lora_param": 320, "total_param": 320, "numel": 320, 
"trainable_param": 320, "2f": 320, "6742609920": 320, "4194304": 320, "7b_lora": 320, "my_model_checkpoint_path": [320, 322, 323], "tokenizer_checkpoint": [320, 322, 323], "my_tokenizer_checkpoint_path": [320, 322, 323], "clone": [320, 322, 323], "constraint": 320, "factori": 320, "benefici": 320, "impact": [320, 321], "minor": 320, "lora_experiment_1": 320, "smooth": [320, 323], "curv": [320, 323], "500": 320, "ran": 320, "footprint": [320, 322], "commod": 320, "cogniz": 320, "ax": 320, "parallel": 320, "475": 320, "87": 320, "508": 320, "86": 320, "504": 320, "04": 320, "514": 320, "lowest": 320, "absolut": 320, "4gb": 320, "tradeoff": 320, "salman": 321, "mohammadi": 321, "brief": 321, "glossari": 321, "constrain": [321, 322], "oom": 321, "adam": 321, "gradient_accumulation_step": 321, "cost": 321, "ram": 321, "priorit": 321, "sebastian": 321, "raschka": 321, "fp16": 321, "sound": 321, "quot": 321, "aliv": 321, "region": 321, "enable_activation_checkpoint": 321, "bring": 321, "autograd": [321, 323], "saved_tensors_hook": 321, "cours": 321, "runtim": 321, "overlap": 321, "hide": 321, "unless": 321, "enable_activation_offload": 321, "total_batch_s": 321, "count": 321, "suppos": 321, "log_every_n_step": 321, "translat": 321, "frequent": 321, "slowli": 321, "num_devic": 321, "artifici": 321, "faster": 321, "prototyp": 321, "low_bit_optim": 321, "adamw8bit": 321, "bnb": 321, "modern": 321, "converg": 321, "stateless": 321, "stochast": 321, "descent": 321, "sacrif": 321, "optimizer_in_bwd": 321, "cpuoffloadoptim": 321, "offload_gradi": 321, "4e": 321, "hint": 321, "bottleneck": 321, "slowdown": 321, "amort": 321, "4x": 321, "fsdp_cpu_offload": 321, "fullyshardeddataparallel": 321, "fsdp1": 321, "greatli": 321, "lora_": 321, "lora_llama3": 321, "_lora": 321, "firstli": 321, "secondli": 321, "affect": 321, "fashion": 321, "slower": [321, 323], "jointli": 321, "sens": 321, "novel": 321, "normalfloat": [321, 323], "8x": [321, 323], "worth": 321, "cast": [321, 322], "datatyp": [321, 323], "incur": [321, 322, 323], "penalti": 321, "qlora_": 321, "qlora_llama3_8b": 321, "_qlora": 321, "particularli": 321, "reap": 321, "hood": [321, 323], "doralinear": 321, "swap": [321, 322], "perplex": 322, "ptq": 322, "kept": 322, "nois": 322, "henc": 322, "x_q": 322, "int8": 322, "zp": 322, "x_float": 322, "qmin": 322, "qmax": 322, "clamp": 322, "x_fq": 322, "dequant": 322, "proce": 322, "prepared_model": 322, "int8dynactint4weightqatlinear": 322, "int8dynactint4weightlinear": 322, "train_loop": 322, "converted_model": 322, "recov": 322, "custom_8b_qat_ful": 322, "2000": 322, "1000": 322, "memory_efficient_fsdp_wrap": 322, "led": 322, "presum": 322, "mutat": 322, "5gb": 322, "custom_quant": 322, "poorli": 322, "custom_eleuther_evalu": 322, "fullmodeltorchtunecheckpoint": 322, "my_eleuther_evalu": 322, "stderr": 322, "word_perplex": 322, "9148": 322, "byte_perplex": 322, "5357": 322, "bits_per_byt": 322, "6189": 322, "acc": 322, "5687": 322, "0049": 322, "acc_norm": 322, "7536": 322, "0043": 322, "74": 322, "048": 322, "190": 322, "7735": 322, "5598": 322, "6413": 322, "5481": 322, "0050": 322, "7390": 322, "0044": 322, "7251": 322, "4994": 322, "5844": 322, "5740": 322, "7610": 322, "outperform": 322, "importantli": 322, "characterist": 322, "958": 322, "halv": 322, "motiv": 322, "edg": 322, "smartphon": 322, "executorch": 322, "xnnpack": 322, "export_llama": 322, "use_sdpa_with_kv_cach": 322, "qmode": 322, "group_siz": 322, "get_bos_id": 322, "get_eos_id": 322, "output_nam": 322, "llama3_8da4w": 322, "pte": 322, 
"881": 322, "oneplu": 322, "709": 322, "tok": 322, "815": 322, "316": 322, "364": 322, "highli": 323, "vanilla": 323, "held": 323, "bespok": 323, "vast": 323, "major": 323, "normatfloat": 323, "deepdiv": 323, "de": 323, "counterpart": 323, "set_default_devic": 323, "qlora_linear": 323, "memory_alloc": 323, "177": 323, "152": 323, "del": 323, "empty_cach": 323, "lora_linear": 323, "081": 323, "344": 323, "qlora_llama2_7b": 323, "qlora_model": 323, "essenti": 323, "reparametrize_as_dtype_state_dict_post_hook": 323, "96": 323, "98": 323, "149": 323, "9157477021217346": 323, "02": 323, "08": 323, "15it": 323, "nightli": 323, "200": 323, "hundr": 323, "228": 323, "8158286809921265": 323, "95it": 323, "exercis": 323, "linear_nf4": 323, "linear_weight": 323, "incom": 323}, "objects": {"torchtune.config": [[27, 0, 1, "", "instantiate"], [28, 0, 1, "", "log_config"], [29, 0, 1, "", "parse"], [30, 0, 1, "", "validate"]], "torchtune.data": [[31, 1, 1, "", "AlpacaToMessages"], [32, 1, 1, "", "ChatMLTemplate"], [33, 1, 1, "", "ChosenRejectedToMessages"], [34, 2, 1, "", "GrammarErrorCorrectionTemplate"], [35, 1, 1, "", "InputOutputToMessages"], [36, 1, 1, "", "Message"], [37, 1, 1, "", "OpenAIToMessages"], [38, 1, 1, "", "PromptTemplate"], [39, 1, 1, "", "PromptTemplateInterface"], [40, 2, 1, "", "QuestionAnswerTemplate"], [41, 2, 1, "", "Role"], [42, 1, 1, "", "ShareGPTToMessages"], [43, 2, 1, "", "SummarizeTemplate"], [44, 0, 1, "", "format_content_with_images"], [45, 0, 1, "", "left_pad_sequence"], [46, 0, 1, "", "load_image"], [47, 0, 1, "", "padded_collate"], [48, 0, 1, "", "padded_collate_dpo"], [49, 0, 1, "", "padded_collate_sft"], [50, 0, 1, "", "padded_collate_tiled_images_and_mask"], [51, 0, 1, "", "truncate"], [52, 0, 1, "", "validate_messages"]], "torchtune.data.Message": [[36, 3, 1, "", "contains_media"], [36, 4, 1, "", "from_dict"], [36, 4, 1, "", "get_media"], [36, 3, 1, "", "text_content"]], "torchtune.datasets": [[53, 1, 1, "", "ConcatDataset"], [54, 1, 1, "", "PackedDataset"], [55, 1, 1, "", "PreferenceDataset"], [56, 1, 1, "", "SFTDataset"], [57, 1, 1, "", "TextCompletionDataset"], [58, 0, 1, "", "alpaca_cleaned_dataset"], [59, 0, 1, "", "alpaca_dataset"], [60, 0, 1, "", "chat_dataset"], [61, 0, 1, "", "cnn_dailymail_articles_dataset"], [62, 0, 1, "", "grammar_dataset"], [63, 0, 1, "", "hh_rlhf_helpful_dataset"], [64, 0, 1, "", "instruct_dataset"], [68, 0, 1, "", "preference_dataset"], [69, 0, 1, "", "samsum_dataset"], [70, 0, 1, "", "slimorca_dataset"], [71, 0, 1, "", "stack_exchange_paired_dataset"], [72, 0, 1, "", "text_completion_dataset"], [73, 0, 1, "", "wikitext_dataset"]], "torchtune.datasets.multimodal": [[65, 0, 1, "", "llava_instruct_dataset"], [66, 0, 1, "", "the_cauldron_dataset"], [67, 0, 1, "", "vqa_dataset"]], "torchtune.generation": [[74, 0, 1, "", "generate"], [75, 0, 1, "", "generate_next_token"], [76, 0, 1, "", "get_causal_mask_from_padding_mask"], [77, 0, 1, "", "get_position_ids_from_padding_mask"], [78, 0, 1, "", "sample"]], "torchtune.models.clip": [[79, 1, 1, "", "TilePositionalEmbedding"], [80, 1, 1, "", "TiledTokenPositionalEmbedding"], [81, 1, 1, "", "TokenPositionalEmbedding"], [82, 0, 1, "", "clip_vision_encoder"]], "torchtune.models.clip.TilePositionalEmbedding": [[79, 4, 1, "", "forward"]], "torchtune.models.clip.TiledTokenPositionalEmbedding": [[80, 4, 1, "", "forward"]], "torchtune.models.clip.TokenPositionalEmbedding": [[81, 4, 1, "", "forward"]], "torchtune.models.code_llama2": [[83, 0, 1, "", "code_llama2_13b"], [84, 0, 1, "", "code_llama2_70b"], 
[85, 0, 1, "", "code_llama2_7b"], [86, 0, 1, "", "lora_code_llama2_13b"], [87, 0, 1, "", "lora_code_llama2_70b"], [88, 0, 1, "", "lora_code_llama2_7b"], [89, 0, 1, "", "qlora_code_llama2_13b"], [90, 0, 1, "", "qlora_code_llama2_70b"], [91, 0, 1, "", "qlora_code_llama2_7b"]], "torchtune.models.gemma": [[92, 0, 1, "", "gemma"], [93, 0, 1, "", "gemma_2b"], [94, 0, 1, "", "gemma_7b"], [95, 0, 1, "", "gemma_tokenizer"], [96, 0, 1, "", "lora_gemma"], [97, 0, 1, "", "lora_gemma_2b"], [98, 0, 1, "", "lora_gemma_7b"], [99, 0, 1, "", "qlora_gemma_2b"], [100, 0, 1, "", "qlora_gemma_7b"]], "torchtune.models.gemma2": [[101, 0, 1, "", "gemma2"], [102, 0, 1, "", "gemma2_27b"], [103, 0, 1, "", "gemma2_2b"], [104, 0, 1, "", "gemma2_9b"], [105, 0, 1, "", "lora_gemma2"], [106, 0, 1, "", "lora_gemma2_27b"], [107, 0, 1, "", "lora_gemma2_2b"], [108, 0, 1, "", "lora_gemma2_9b"], [109, 0, 1, "", "qlora_gemma2_27b"], [110, 0, 1, "", "qlora_gemma2_2b"], [111, 0, 1, "", "qlora_gemma2_9b"]], "torchtune.models.llama2": [[112, 1, 1, "", "Llama2ChatTemplate"], [113, 0, 1, "", "llama2"], [114, 0, 1, "", "llama2_13b"], [115, 0, 1, "", "llama2_70b"], [116, 0, 1, "", "llama2_7b"], [117, 0, 1, "", "llama2_reward_7b"], [118, 0, 1, "", "llama2_tokenizer"], [119, 0, 1, "", "lora_llama2"], [120, 0, 1, "", "lora_llama2_13b"], [121, 0, 1, "", "lora_llama2_70b"], [122, 0, 1, "", "lora_llama2_7b"], [123, 0, 1, "", "lora_llama2_reward_7b"], [124, 0, 1, "", "qlora_llama2_13b"], [125, 0, 1, "", "qlora_llama2_70b"], [126, 0, 1, "", "qlora_llama2_7b"], [127, 0, 1, "", "qlora_llama2_reward_7b"]], "torchtune.models.llama3": [[128, 0, 1, "", "llama3"], [129, 0, 1, "", "llama3_70b"], [130, 0, 1, "", "llama3_8b"], [131, 0, 1, "", "llama3_tokenizer"], [132, 0, 1, "", "lora_llama3"], [133, 0, 1, "", "lora_llama3_70b"], [134, 0, 1, "", "lora_llama3_8b"], [135, 0, 1, "", "qlora_llama3_70b"], [136, 0, 1, "", "qlora_llama3_8b"]], "torchtune.models.llama3_1": [[137, 0, 1, "", "llama3_1"], [138, 0, 1, "", "llama3_1_405b"], [139, 0, 1, "", "llama3_1_70b"], [140, 0, 1, "", "llama3_1_8b"], [141, 0, 1, "", "lora_llama3_1"], [142, 0, 1, "", "lora_llama3_1_405b"], [143, 0, 1, "", "lora_llama3_1_70b"], [144, 0, 1, "", "lora_llama3_1_8b"], [145, 0, 1, "", "qlora_llama3_1_405b"], [146, 0, 1, "", "qlora_llama3_1_70b"], [147, 0, 1, "", "qlora_llama3_1_8b"]], "torchtune.models.llama3_2": [[148, 0, 1, "", "llama3_2_1b"], [149, 0, 1, "", "llama3_2_3b"], [150, 0, 1, "", "lora_llama3_2_1b"], [151, 0, 1, "", "lora_llama3_2_3b"], [152, 0, 1, "", "qlora_llama3_2_1b"], [153, 0, 1, "", "qlora_llama3_2_3b"]], "torchtune.models.llama3_2_vision": [[154, 1, 1, "", "Llama3VisionEncoder"], [155, 1, 1, "", "Llama3VisionProjectionHead"], [156, 1, 1, "", "Llama3VisionTransform"], [157, 0, 1, "", "llama3_2_vision_11b"], [158, 0, 1, "", "llama3_2_vision_decoder"], [159, 0, 1, "", "llama3_2_vision_encoder"], [160, 0, 1, "", "llama3_2_vision_transform"], [161, 0, 1, "", "lora_llama3_2_vision_11b"], [162, 0, 1, "", "lora_llama3_2_vision_decoder"], [163, 0, 1, "", "lora_llama3_2_vision_encoder"], [164, 0, 1, "", "qlora_llama3_2_vision_11b"]], "torchtune.models.llama3_2_vision.Llama3VisionEncoder": [[154, 4, 1, "", "forward"]], "torchtune.models.llama3_2_vision.Llama3VisionProjectionHead": [[155, 4, 1, "", "forward"]], "torchtune.models.llama3_2_vision.Llama3VisionTransform": [[156, 4, 1, "", "decode"], [156, 4, 1, "", "tokenize_message"], [156, 4, 1, "", "tokenize_messages"]], "torchtune.models.llama3_3": [[165, 0, 1, "", "llama3_3_70b"], [166, 0, 1, "", "lora_llama3_3_70b"], [167, 0, 
1, "", "qlora_llama3_3_70b"]], "torchtune.models.mistral": [[168, 1, 1, "", "MistralChatTemplate"], [169, 0, 1, "", "lora_mistral"], [170, 0, 1, "", "lora_mistral_7b"], [171, 0, 1, "", "lora_mistral_classifier"], [172, 0, 1, "", "lora_mistral_reward_7b"], [173, 0, 1, "", "mistral"], [174, 0, 1, "", "mistral_7b"], [175, 0, 1, "", "mistral_classifier"], [176, 0, 1, "", "mistral_reward_7b"], [177, 0, 1, "", "mistral_tokenizer"], [178, 0, 1, "", "qlora_mistral_7b"], [179, 0, 1, "", "qlora_mistral_reward_7b"]], "torchtune.models.phi3": [[180, 0, 1, "", "lora_phi3"], [181, 0, 1, "", "lora_phi3_mini"], [182, 0, 1, "", "phi3"], [183, 0, 1, "", "phi3_mini"], [184, 0, 1, "", "phi3_mini_tokenizer"], [185, 0, 1, "", "qlora_phi3_mini"]], "torchtune.models.qwen2": [[186, 0, 1, "", "lora_qwen2"], [187, 0, 1, "", "lora_qwen2_0_5b"], [188, 0, 1, "", "lora_qwen2_1_5b"], [189, 0, 1, "", "lora_qwen2_7b"], [190, 0, 1, "", "qwen2"], [191, 0, 1, "", "qwen2_0_5b"], [192, 0, 1, "", "qwen2_1_5b"], [193, 0, 1, "", "qwen2_7b"], [194, 0, 1, "", "qwen2_tokenizer"]], "torchtune.models.qwen2_5": [[195, 0, 1, "", "lora_qwen2_5_0_5b"], [196, 0, 1, "", "lora_qwen2_5_14b_base"], [197, 0, 1, "", "lora_qwen2_5_14b_instruct"], [198, 0, 1, "", "lora_qwen2_5_1_5b_base"], [199, 0, 1, "", "lora_qwen2_5_1_5b_instruct"], [200, 0, 1, "", "lora_qwen2_5_32b_base"], [201, 0, 1, "", "lora_qwen2_5_32b_instruct"], [202, 0, 1, "", "lora_qwen2_5_3b"], [203, 0, 1, "", "lora_qwen2_5_72b_base"], [204, 0, 1, "", "lora_qwen2_5_72b_instruct"], [205, 0, 1, "", "lora_qwen2_5_7b_base"], [206, 0, 1, "", "lora_qwen2_5_7b_instruct"], [207, 0, 1, "", "qwen2_5_0_5b"], [208, 0, 1, "", "qwen2_5_14b_base"], [209, 0, 1, "", "qwen2_5_14b_instruct"], [210, 0, 1, "", "qwen2_5_1_5b_base"], [211, 0, 1, "", "qwen2_5_1_5b_instruct"], [212, 0, 1, "", "qwen2_5_32b_base"], [213, 0, 1, "", "qwen2_5_32b_instruct"], [214, 0, 1, "", "qwen2_5_3b"], [215, 0, 1, "", "qwen2_5_72b_base"], [216, 0, 1, "", "qwen2_5_72b_instruct"], [217, 0, 1, "", "qwen2_5_7b_base"], [218, 0, 1, "", "qwen2_5_7b_instruct"], [219, 0, 1, "", "qwen2_5_tokenizer"]], "torchtune.modules": [[220, 1, 1, "", "FeedForward"], [221, 1, 1, "", "Fp32LayerNorm"], [222, 1, 1, "", "KVCache"], [223, 1, 1, "", "LayerDropout"], [224, 1, 1, "", "MultiHeadAttention"], [225, 1, 1, "", "RMSNorm"], [226, 1, 1, "", "RotaryPositionalEmbeddings"], [227, 1, 1, "", "TanhGate"], [228, 1, 1, "", "TiedLinear"], [229, 1, 1, "", "TransformerCrossAttentionLayer"], [230, 1, 1, "", "TransformerDecoder"], [231, 1, 1, "", "TransformerSelfAttentionLayer"], [232, 1, 1, "", "VisionTransformer"], [253, 0, 1, "", "prepare_layer_dropout"]], "torchtune.modules.FeedForward": [[220, 4, 1, "", "forward"]], "torchtune.modules.Fp32LayerNorm": [[221, 4, 1, "", "forward"]], "torchtune.modules.KVCache": [[222, 4, 1, "", "reset"], [222, 4, 1, "", "update"]], "torchtune.modules.LayerDropout": [[223, 4, 1, "", "forward"]], "torchtune.modules.MultiHeadAttention": [[224, 4, 1, "", "forward"], [224, 4, 1, "", "reset_cache"], [224, 4, 1, "", "setup_cache"]], "torchtune.modules.RMSNorm": [[225, 4, 1, "", "forward"]], "torchtune.modules.RotaryPositionalEmbeddings": [[226, 4, 1, "", "forward"]], "torchtune.modules.TanhGate": [[227, 4, 1, "", "forward"]], "torchtune.modules.TransformerCrossAttentionLayer": [[229, 4, 1, "", "caches_are_enabled"], [229, 4, 1, "", "caches_are_setup"], [229, 4, 1, "", "forward"], [229, 4, 1, "", "reset_cache"], [229, 4, 1, "", "setup_caches"]], "torchtune.modules.TransformerDecoder": [[230, 4, 1, "", "caches_are_enabled"], [230, 4, 1, 
"", "caches_are_setup"], [230, 4, 1, "", "chunked_output"], [230, 4, 1, "", "forward"], [230, 4, 1, "", "reset_caches"], [230, 4, 1, "", "set_num_output_chunks"], [230, 4, 1, "", "setup_caches"]], "torchtune.modules.TransformerSelfAttentionLayer": [[231, 4, 1, "", "caches_are_enabled"], [231, 4, 1, "", "caches_are_setup"], [231, 4, 1, "", "forward"], [231, 4, 1, "", "reset_cache"], [231, 4, 1, "", "setup_caches"]], "torchtune.modules.VisionTransformer": [[232, 4, 1, "", "forward"]], "torchtune.modules.common_utils": [[233, 0, 1, "", "delete_kv_caches"], [234, 0, 1, "", "disable_kv_cache"], [235, 0, 1, "", "local_kv_cache"], [236, 0, 1, "", "reparametrize_as_dtype_state_dict_post_hook"]], "torchtune.modules.loss": [[237, 1, 1, "", "CEWithChunkedOutputLoss"], [238, 1, 1, "", "ForwardKLLoss"], [239, 1, 1, "", "ForwardKLWithChunkedOutputLoss"]], "torchtune.modules.loss.CEWithChunkedOutputLoss": [[237, 4, 1, "", "compute_cross_entropy"], [237, 4, 1, "", "forward"]], "torchtune.modules.loss.ForwardKLLoss": [[238, 4, 1, "", "forward"]], "torchtune.modules.loss.ForwardKLWithChunkedOutputLoss": [[239, 4, 1, "", "forward"]], "torchtune.modules.model_fusion": [[240, 1, 1, "", "DeepFusionModel"], [241, 1, 1, "", "FusionEmbedding"], [242, 1, 1, "", "FusionLayer"], [243, 0, 1, "", "get_fusion_params"], [244, 0, 1, "", "register_fusion_module"]], "torchtune.modules.model_fusion.DeepFusionModel": [[240, 4, 1, "", "caches_are_enabled"], [240, 4, 1, "", "caches_are_setup"], [240, 4, 1, "", "forward"], [240, 4, 1, "", "reset_caches"], [240, 4, 1, "", "set_num_output_chunks"], [240, 4, 1, "", "setup_caches"]], "torchtune.modules.model_fusion.FusionEmbedding": [[241, 4, 1, "", "forward"], [241, 4, 1, "", "fusion_params"]], "torchtune.modules.model_fusion.FusionLayer": [[242, 4, 1, "", "caches_are_enabled"], [242, 4, 1, "", "caches_are_setup"], [242, 4, 1, "", "forward"], [242, 4, 1, "", "fusion_params"], [242, 4, 1, "", "reset_cache"], [242, 4, 1, "", "setup_caches"]], "torchtune.modules.peft": [[245, 1, 1, "", "AdapterModule"], [246, 1, 1, "", "DoRALinear"], [247, 1, 1, "", "LoRALinear"], [248, 0, 1, "", "disable_adapter"], [249, 0, 1, "", "get_adapter_params"], [250, 0, 1, "", "get_adapter_state_dict"], [251, 0, 1, "", "set_trainable_params"], [252, 0, 1, "", "validate_missing_and_unexpected_for_lora"]], "torchtune.modules.peft.AdapterModule": [[245, 4, 1, "", "adapter_params"]], "torchtune.modules.peft.DoRALinear": [[246, 4, 1, "", "adapter_params"], [246, 4, 1, "", "forward"], [246, 4, 1, "", "initialize_dora_magnitude"], [246, 4, 1, "", "to_empty"]], "torchtune.modules.peft.LoRALinear": [[247, 4, 1, "", "adapter_params"], [247, 4, 1, "", "forward"], [247, 4, 1, "", "to_empty"]], "torchtune.modules.tokenizers": [[254, 1, 1, "", "BaseTokenizer"], [255, 1, 1, "", "ModelTokenizer"], [256, 1, 1, "", "SentencePieceBaseTokenizer"], [257, 1, 1, "", "TikTokenBaseTokenizer"], [258, 0, 1, "", "parse_hf_tokenizer_json"], [259, 0, 1, "", "tokenize_messages_no_special_tokens"]], "torchtune.modules.tokenizers.BaseTokenizer": [[254, 4, 1, "", "decode"], [254, 4, 1, "", "encode"]], "torchtune.modules.tokenizers.ModelTokenizer": [[255, 4, 1, "", "tokenize_messages"]], "torchtune.modules.tokenizers.SentencePieceBaseTokenizer": [[256, 4, 1, "", "decode"], [256, 4, 1, "", "encode"]], "torchtune.modules.tokenizers.TikTokenBaseTokenizer": [[257, 4, 1, "", "decode"], [257, 4, 1, "", "encode"]], "torchtune.modules.transforms": [[260, 1, 1, "", "Transform"], [261, 1, 1, "", "VisionCrossAttentionMask"]], "torchtune.rlhf": [[262, 0, 
1, "", "estimate_advantages"], [263, 0, 1, "", "get_rewards_ppo"], [268, 0, 1, "", "truncate_sequence_at_first_stop_token"]], "torchtune.rlhf.loss": [[264, 1, 1, "", "DPOLoss"], [265, 1, 1, "", "PPOLoss"], [266, 1, 1, "", "RSOLoss"], [267, 2, 1, "", "SimPOLoss"]], "torchtune.rlhf.loss.DPOLoss": [[264, 4, 1, "", "forward"]], "torchtune.rlhf.loss.PPOLoss": [[265, 4, 1, "", "forward"]], "torchtune.rlhf.loss.RSOLoss": [[266, 4, 1, "", "forward"]], "torchtune.training": [[269, 1, 1, "", "FormattedCheckpointFiles"], [270, 1, 1, "", "FullModelHFCheckpointer"], [271, 1, 1, "", "FullModelMetaCheckpointer"], [272, 1, 1, "", "FullModelTorchTuneCheckpointer"], [273, 1, 1, "", "ModelType"], [274, 1, 1, "", "OptimizerInBackwardWrapper"], [275, 0, 1, "", "apply_selective_activation_checkpointing"], [276, 0, 1, "", "create_optim_in_bwd_wrapper"], [277, 0, 1, "", "gather_cpu_state_dict"], [278, 0, 1, "", "get_cosine_schedule_with_warmup"], [279, 0, 1, "", "get_dtype"], [280, 0, 1, "", "get_lr"], [281, 0, 1, "", "get_memory_stats"], [282, 0, 1, "", "get_quantizer_mode"], [283, 0, 1, "", "get_unmasked_sequence_lengths"], [284, 0, 1, "", "init_distributed"], [285, 0, 1, "", "is_distributed"], [286, 0, 1, "", "log_memory_stats"], [292, 0, 1, "", "register_optim_in_bwd_hooks"], [293, 0, 1, "", "set_activation_checkpointing"], [294, 0, 1, "", "set_default_dtype"], [295, 0, 1, "", "set_seed"], [296, 0, 1, "", "setup_torch_profiler"], [297, 0, 1, "", "update_state_dict_for_classifier"], [298, 0, 1, "", "validate_expected_param_dtype"]], "torchtune.training.FormattedCheckpointFiles": [[269, 4, 1, "", "build_checkpoint_filenames"]], "torchtune.training.FullModelHFCheckpointer": [[270, 4, 1, "", "load_checkpoint"], [270, 4, 1, "", "save_checkpoint"]], "torchtune.training.FullModelMetaCheckpointer": [[271, 4, 1, "", "load_checkpoint"], [271, 4, 1, "", "save_checkpoint"]], "torchtune.training.FullModelTorchTuneCheckpointer": [[272, 4, 1, "", "load_checkpoint"], [272, 4, 1, "", "save_checkpoint"]], "torchtune.training.OptimizerInBackwardWrapper": [[274, 4, 1, "", "get_last_lr"], [274, 4, 1, "", "get_optim_key"], [274, 4, 1, "", "load_state_dict"], [274, 4, 1, "", "set_lr_scheduler"], [274, 4, 1, "", "state_dict"], [274, 4, 1, "", "step_lr_scheduler"]], "torchtune.training.metric_logging": [[287, 1, 1, "", "CometLogger"], [288, 1, 1, "", "DiskLogger"], [289, 1, 1, "", "StdoutLogger"], [290, 1, 1, "", "TensorBoardLogger"], [291, 1, 1, "", "WandBLogger"]], "torchtune.training.metric_logging.CometLogger": [[287, 4, 1, "", "close"], [287, 4, 1, "", "log"], [287, 4, 1, "", "log_config"], [287, 4, 1, "", "log_dict"]], "torchtune.training.metric_logging.DiskLogger": [[288, 4, 1, "", "close"], [288, 4, 1, "", "log"], [288, 4, 1, "", "log_dict"]], "torchtune.training.metric_logging.StdoutLogger": [[289, 4, 1, "", "close"], [289, 4, 1, "", "log"], [289, 4, 1, "", "log_dict"]], "torchtune.training.metric_logging.TensorBoardLogger": [[290, 4, 1, "", "close"], [290, 4, 1, "", "log"], [290, 4, 1, "", "log_dict"]], "torchtune.training.metric_logging.WandBLogger": [[291, 4, 1, "", "close"], [291, 4, 1, "", "log"], [291, 4, 1, "", "log_config"], [291, 4, 1, "", "log_dict"]], "torchtune.utils": [[299, 0, 1, "", "batch_to_device"], [300, 0, 1, "", "get_device"], [301, 0, 1, "", "get_logger"], [302, 0, 1, "", "get_world_size_and_rank"], [303, 0, 1, "", "torch_version_ge"]]}, "objtypes": {"0": "py:function", "1": "py:class", "2": "py:data", "3": "py:property", "4": "py:method"}, "objnames": {"0": ["py", "function", "Python function"], "1": 
["py", "class", "Python class"], "2": ["py", "data", "Python data"], "3": ["py", "property", "Python property"], "4": ["py", "method", "Python method"]}, "titleterms": {"torchtun": [0, 1, 2, 3, 4, 5, 6, 7, 8, 10, 22, 34, 40, 41, 43, 267, 306, 308, 314, 316, 318, 319, 320, 322, 323], "config": [0, 10, 24, 25, 314, 317], "data": [1, 11, 34, 40, 41, 43, 315], "text": [1, 2, 14, 16, 20, 318], "templat": [1, 9, 12, 14, 19, 21, 315], "type": 1, "messag": [1, 13, 14, 36], "transform": [1, 5, 13, 14, 15, 260], "collat": 1, "helper": 1, "function": 1, "dataset": [2, 9, 11, 12, 16, 18, 20, 315], "imag": [2, 14, 16], "gener": [2, 3, 74, 316, 318], "builder": 2, "class": [2, 19, 25], "model": [4, 5, 15, 21, 26, 314, 316, 317, 318, 319, 320, 321, 322], "llama3": [4, 128, 315, 318, 319, 322], "3": [4, 316], "2": [4, 316, 319], "vision": [4, 5], "1": [4, 319], "llama2": [4, 113, 315, 320, 323], "code": 4, "llama": [4, 316], "qwen": 4, "5": 4, "phi": 4, "mistral": [4, 173], "gemma": [4, 92], "gemma2": [4, 101], "clip": 4, "modul": 5, "compon": [5, 10, 24, 321], "build": [5, 307, 323], "block": 5, "loss": [5, 267], "base": [5, 21], "token": [5, 14, 21, 315], "util": [5, 8], "peft": [5, 321], "fusion": 5, "rlhf": [6, 267], "train": [7, 311, 317], "checkpoint": [7, 22, 26, 316, 321], "reduc": 7, "precis": [7, 321], "distribut": [7, 311], "memori": [7, 320, 321, 323], "manag": 7, "schedul": 7, "metric": [7, 23, 26], "log": [7, 23, 26], "perform": [7, 320], "profil": 7, "miscellan": [7, 8], "chat": [9, 315], "exampl": [9, 12, 13, 15, 16, 18, 20], "format": [9, 12, 14, 16, 18, 20, 22], "load": [9, 12, 16, 18, 20, 21], "from": [9, 12, 16, 18, 20, 21, 22, 315, 323], "hug": [9, 12, 16, 18, 20, 21, 316], "face": [9, 12, 16, 18, 20, 21, 316], "local": [9, 12, 16, 18, 20], "remot": [9, 12, 16], "specifi": 9, "convers": 9, "style": 9, "sharegpt": 9, "openai": 9, "renam": [9, 12], "column": [9, 12], "built": [9, 12, 16, 18, 19, 20, 314], "custom": [10, 13, 19, 315], "recip": [10, 25, 312, 314, 317, 319, 320, 322], "set": [10, 21], "up": [10, 316], "your": [10, 24, 25, 316, 317], "project": 10, "launch": 10, "overview": [11, 22, 308, 312, 316, 321], "pipelin": 11, "instruct": [12, 307, 316, 318], "configur": [13, 24], "creat": [14, 15], "prompt": [14, 19, 21, 315], "access": [14, 318], "content": 14, "multimod": [15, 16], "us": [15, 19, 24, 25, 315, 316, 319, 323], "interleav": 16, "sampl": [17, 78], "pack": 17, "prefer": [18, 309], "defin": 19, "via": [19, 307, 318], "dotpath": 19, "string": 19, "dictionari": 19, "prompttempl": [19, 38], "complet": 20, "json": 20, "txt": 20, "download": [21, 314, 316, 317], "file": 21, "max": 21, "sequenc": 21, "length": 21, "special": [21, 315], "handl": 22, "differ": 22, "hfcheckpoint": 22, "metacheckpoint": 22, "torchtunecheckpoint": 22, "output": 22, "intermedi": 22, "vs": 22, "final": 22, "resum": 22, "full": 22, "finetun": [22, 310, 312, 316, 320, 322, 323], "lora": [22, 310, 316, 320, 321, 323], "put": [22, 323], "thi": 22, "all": [22, 24, 323], "togeth": [22, 323], "comet": 23, "logger": [23, 26], "about": 24, "where": 24, "do": 24, "paramet": [24, 321], "live": 24, "write": 24, "instanti": [24, 27], "referenc": 24, "other": [24, 316], "field": 24, "interpol": 24, "valid": [24, 30, 314], "best": 24, "practic": 24, "airtight": 24, "public": 24, "api": 24, "onli": 24, "command": 24, "line": 24, "overrid": 24, "remov": 24, "what": [25, 308, 319, 320, 322, 323], "ar": 25, "script": 25, "run": [25, 314, 316], "cli": [25, 314], "pars": [25, 29], "weight": [26, 321], "bias": 26, "w": 
26, "b": 26, "log_config": 28, "alpacatomessag": 31, "chatmltempl": 32, "chosenrejectedtomessag": 33, "grammarerrorcorrectiontempl": 34, "inputoutputtomessag": 35, "openaitomessag": 37, "prompttemplateinterfac": 39, "questionanswertempl": 40, "role": 41, "sharegpttomessag": 42, "summarizetempl": 43, "format_content_with_imag": 44, "left_pad_sequ": 45, "load_imag": 46, "padded_col": 47, "padded_collate_dpo": 48, "padded_collate_sft": 49, "padded_collate_tiled_images_and_mask": 50, "truncat": 51, "validate_messag": 52, "concatdataset": 53, "packeddataset": 54, "preferencedataset": 55, "sftdataset": 56, "textcompletiondataset": 57, "alpaca_cleaned_dataset": 58, "alpaca_dataset": 59, "chat_dataset": 60, "cnn_dailymail_articles_dataset": 61, "grammar_dataset": 62, "hh_rlhf_helpful_dataset": 63, "instruct_dataset": 64, "llava_instruct_dataset": 65, "the_cauldron_dataset": 66, "vqa_dataset": 67, "preference_dataset": 68, "samsum_dataset": 69, "slimorca_dataset": 70, "stack_exchange_paired_dataset": 71, "text_completion_dataset": 72, "wikitext_dataset": 73, "generate_next_token": 75, "get_causal_mask_from_padding_mask": 76, "get_position_ids_from_padding_mask": 77, "tilepositionalembed": 79, "tiledtokenpositionalembed": 80, "tokenpositionalembed": 81, "clip_vision_encod": 82, "code_llama2_13b": 83, "code_llama2_70b": 84, "code_llama2_7b": 85, "lora_code_llama2_13b": 86, "lora_code_llama2_70b": 87, "lora_code_llama2_7b": 88, "qlora_code_llama2_13b": 89, "qlora_code_llama2_70b": 90, "qlora_code_llama2_7b": 91, "gemma_2b": 93, "gemma_7b": 94, "gemma_token": 95, "lora_gemma": 96, "lora_gemma_2b": 97, "lora_gemma_7b": 98, "qlora_gemma_2b": 99, "qlora_gemma_7b": 100, "gemma2_27b": 102, "gemma2_2b": 103, "gemma2_9b": 104, "lora_gemma2": 105, "lora_gemma2_27b": 106, "lora_gemma2_2b": 107, "lora_gemma2_9b": 108, "qlora_gemma2_27b": 109, "qlora_gemma2_2b": 110, "qlora_gemma2_9b": 111, "llama2chattempl": 112, "llama2_13b": 114, "llama2_70b": 115, "llama2_7b": 116, "llama2_reward_7b": 117, "llama2_token": 118, "lora_llama2": 119, "lora_llama2_13b": 120, "lora_llama2_70b": 121, "lora_llama2_7b": 122, "lora_llama2_reward_7b": 123, "qlora_llama2_13b": 124, "qlora_llama2_70b": 125, "qlora_llama2_7b": 126, "qlora_llama2_reward_7b": 127, "llama3_70b": 129, "llama3_8b": 130, "llama3_token": 131, "lora_llama3": 132, "lora_llama3_70b": 133, "lora_llama3_8b": 134, "qlora_llama3_70b": 135, "qlora_llama3_8b": 136, "llama3_1": 137, "llama3_1_405b": 138, "llama3_1_70b": 139, "llama3_1_8b": 140, "lora_llama3_1": 141, "lora_llama3_1_405b": 142, "lora_llama3_1_70b": 143, "lora_llama3_1_8b": 144, "qlora_llama3_1_405b": 145, "qlora_llama3_1_70b": 146, "qlora_llama3_1_8b": 147, "llama3_2_1b": 148, "llama3_2_3b": 149, "lora_llama3_2_1b": 150, "lora_llama3_2_3b": 151, "qlora_llama3_2_1b": 152, "qlora_llama3_2_3b": 153, "llama3visionencod": 154, "llama3visionprojectionhead": 155, "llama3visiontransform": 156, "llama3_2_vision_11b": 157, "llama3_2_vision_decod": 158, "llama3_2_vision_encod": 159, "llama3_2_vision_transform": 160, "lora_llama3_2_vision_11b": 161, "lora_llama3_2_vision_decod": 162, "lora_llama3_2_vision_encod": 163, "qlora_llama3_2_vision_11b": 164, "llama3_3_70b": 165, "lora_llama3_3_70b": 166, "qlora_llama3_3_70b": 167, "mistralchattempl": 168, "lora_mistr": 169, "lora_mistral_7b": 170, "lora_mistral_classifi": 171, "lora_mistral_reward_7b": 172, "mistral_7b": 174, "mistral_classifi": 175, "mistral_reward_7b": 176, "mistral_token": 177, "qlora_mistral_7b": 178, "qlora_mistral_reward_7b": 179, "lora_phi3": 180, 
"lora_phi3_mini": 181, "phi3": 182, "phi3_mini": 183, "phi3_mini_token": 184, "qlora_phi3_mini": 185, "lora_qwen2": 186, "lora_qwen2_0_5b": 187, "lora_qwen2_1_5b": 188, "lora_qwen2_7b": 189, "qwen2": [190, 319], "qwen2_0_5b": 191, "qwen2_1_5b": 192, "qwen2_7b": 193, "qwen2_token": 194, "lora_qwen2_5_0_5b": 195, "lora_qwen2_5_14b_bas": 196, "lora_qwen2_5_14b_instruct": 197, "lora_qwen2_5_1_5b_bas": 198, "lora_qwen2_5_1_5b_instruct": 199, "lora_qwen2_5_32b_bas": 200, "lora_qwen2_5_32b_instruct": 201, "lora_qwen2_5_3b": 202, "lora_qwen2_5_72b_bas": 203, "lora_qwen2_5_72b_instruct": 204, "lora_qwen2_5_7b_bas": 205, "lora_qwen2_5_7b_instruct": 206, "qwen2_5_0_5b": 207, "qwen2_5_14b_bas": 208, "qwen2_5_14b_instruct": 209, "qwen2_5_1_5b_bas": 210, "qwen2_5_1_5b_instruct": 211, "qwen2_5_32b_bas": 212, "qwen2_5_32b_instruct": 213, "qwen2_5_3b": 214, "qwen2_5_72b_bas": 215, "qwen2_5_72b_instruct": 216, "qwen2_5_7b_bas": 217, "qwen2_5_7b_instruct": 218, "qwen2_5_token": 219, "feedforward": 220, "fp32layernorm": 221, "kvcach": 222, "layerdropout": 223, "multiheadattent": 224, "rmsnorm": 225, "rotarypositionalembed": 226, "tanhgat": 227, "tiedlinear": 228, "transformercrossattentionlay": 229, "transformerdecod": 230, "transformerselfattentionlay": 231, "visiontransform": 232, "delete_kv_cach": 233, "disable_kv_cach": 234, "local_kv_cach": 235, "reparametrize_as_dtype_state_dict_post_hook": 236, "cewithchunkedoutputloss": 237, "forwardklloss": 238, "forwardklwithchunkedoutputloss": 239, "deepfusionmodel": 240, "fusionembed": 241, "fusionlay": 242, "get_fusion_param": 243, "register_fusion_modul": 244, "adaptermodul": 245, "doralinear": 246, "loralinear": 247, "disable_adapt": 248, "get_adapter_param": 249, "get_adapter_state_dict": 250, "set_trainable_param": 251, "validate_missing_and_unexpected_for_lora": 252, "prepare_layer_dropout": 253, "basetoken": 254, "modeltoken": 255, "sentencepiecebasetoken": 256, "tiktokenbasetoken": 257, "parse_hf_tokenizer_json": 258, "tokenize_messages_no_special_token": 259, "visioncrossattentionmask": 261, "estimate_advantag": 262, "get_rewards_ppo": 263, "dpoloss": 264, "ppoloss": 265, "rsoloss": 266, "simpoloss": 267, "truncate_sequence_at_first_stop_token": 268, "formattedcheckpointfil": 269, "fullmodelhfcheckpoint": 270, "fullmodelmetacheckpoint": 271, "fullmodeltorchtunecheckpoint": 272, "modeltyp": 273, "optimizerinbackwardwrapp": 274, "apply_selective_activation_checkpoint": 275, "create_optim_in_bwd_wrapp": 276, "gather_cpu_state_dict": 277, "get_cosine_schedule_with_warmup": 278, "get_dtyp": 279, "get_lr": 280, "get_memory_stat": 281, "get_quantizer_mod": 282, "get_unmasked_sequence_length": 283, "init_distribut": 284, "is_distribut": 285, "log_memory_stat": 286, "cometlogg": 287, "disklogg": 288, "stdoutlogg": 289, "tensorboardlogg": 290, "wandblogg": 291, "register_optim_in_bwd_hook": 292, "set_activation_checkpoint": 293, "set_default_dtyp": 294, "set_se": 295, "setup_torch_profil": 296, "update_state_dict_for_classifi": 297, "validate_expected_param_dtyp": 298, "batch_to_devic": 299, "get_devic": 300, "get_logg": 301, "get_world_size_and_rank": 302, "torch_version_g": 303, "comput": [305, 313], "time": [305, 313], "welcom": 306, "document": 306, "get": [306, 314, 318], "start": [306, 314], "tutori": 306, "instal": 307, "pre": 307, "requisit": 307, "pypi": 307, "git": 307, "clone": 307, "nightli": 307, "kei": 308, "concept": 308, "design": 308, "principl": 308, "direct": 309, "optim": [309, 321], "singl": 310, "devic": [310, 322], "quantiz": [311, 316, 318, 
321, 322], "awar": 311, "qat": [311, 322], "list": 314, "copi": 314, "fine": [315, 317, 318, 319, 320, 321, 322, 323], "tune": [315, 317, 318, 319, 320, 321, 322, 323], "chang": 315, "when": 315, "should": 315, "i": 315, "end": 316, "workflow": 316, "3b": 316, "prepar": 316, "artifact": 316, "infer": 316, "evalu": [316, 318, 322], "eleutherai": [316, 318], "s": [316, 318], "eval": [316, 318], "har": [316, 318], "speed": 316, "librari": 316, "upload": 316, "hub": 316, "first": 317, "llm": 317, "select": 317, "modifi": 317, "next": 317, "step": [317, 321], "meta": 318, "8b": [318, 319], "our": 318, "faster": 318, "distil": 319, "1b": 319, "knowledg": 319, "how": [319, 320], "doe": [319, 320], "work": [319, 320], "kd": 319, "ablat": 319, "studi": 319, "teacher": 319, "student": 319, "hyperparamet": 319, "learn": 319, "rate": 319, "ratio": 319, "5b": 319, "0": 319, "appli": [320, 322], "trade": 320, "off": 320, "activ": 321, "offload": 321, "gradient": 321, "accumul": 321, "lower": [321, 322], "fuse": 321, "backward": 321, "pass": 321, "state": 321, "cpu": 321, "effici": 321, "low": 321, "rank": 321, "adapt": 321, "qlora": [321, 323], "decompos": 321, "dora": 321, "option": 322, "save": 323, "deep": 323, "dive": 323}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 6, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.intersphinx": 1, "sphinx.ext.todo": 2, "sphinx.ext.viewcode": 1, "sphinx": 56}})
\ No newline at end of file
+Search.setIndex({"docnames": ["api_ref_config", "api_ref_data", "api_ref_datasets", "api_ref_generation", "api_ref_models", "api_ref_modules", "api_ref_rlhf", "api_ref_training", "api_ref_utilities", "basics/chat_datasets", "basics/custom_components", "basics/datasets_overview", "basics/instruct_datasets", "basics/message_transforms", "basics/messages", "basics/model_transforms", "basics/multimodal_datasets", "basics/packing", "basics/preference_datasets", "basics/prompt_templates", "basics/text_completion_datasets", "basics/tokenizers", "deep_dives/checkpointer", "deep_dives/comet_logging", "deep_dives/configs", "deep_dives/recipe_deepdive", "deep_dives/wandb_logging", "generated/torchtune.config.instantiate", "generated/torchtune.config.log_config", "generated/torchtune.config.parse", "generated/torchtune.config.validate", "generated/torchtune.data.AlpacaToMessages", "generated/torchtune.data.ChatMLTemplate", "generated/torchtune.data.ChosenRejectedToMessages", "generated/torchtune.data.GrammarErrorCorrectionTemplate", "generated/torchtune.data.InputOutputToMessages", "generated/torchtune.data.Message", "generated/torchtune.data.OpenAIToMessages", "generated/torchtune.data.PromptTemplate", "generated/torchtune.data.PromptTemplateInterface", "generated/torchtune.data.QuestionAnswerTemplate", "generated/torchtune.data.Role", "generated/torchtune.data.ShareGPTToMessages", "generated/torchtune.data.SummarizeTemplate", "generated/torchtune.data.format_content_with_images", "generated/torchtune.data.left_pad_sequence", "generated/torchtune.data.load_image", "generated/torchtune.data.padded_collate", "generated/torchtune.data.padded_collate_dpo", "generated/torchtune.data.padded_collate_sft", "generated/torchtune.data.padded_collate_tiled_images_and_mask", "generated/torchtune.data.truncate", "generated/torchtune.data.validate_messages", "generated/torchtune.datasets.ConcatDataset", "generated/torchtune.datasets.PackedDataset", "generated/torchtune.datasets.PreferenceDataset", "generated/torchtune.datasets.SFTDataset", "generated/torchtune.datasets.TextCompletionDataset", "generated/torchtune.datasets.alpaca_cleaned_dataset", "generated/torchtune.datasets.alpaca_dataset", "generated/torchtune.datasets.chat_dataset", "generated/torchtune.datasets.cnn_dailymail_articles_dataset", "generated/torchtune.datasets.grammar_dataset", "generated/torchtune.datasets.hh_rlhf_helpful_dataset", "generated/torchtune.datasets.instruct_dataset", "generated/torchtune.datasets.multimodal.llava_instruct_dataset", "generated/torchtune.datasets.multimodal.the_cauldron_dataset", "generated/torchtune.datasets.multimodal.vqa_dataset", "generated/torchtune.datasets.preference_dataset", "generated/torchtune.datasets.samsum_dataset", "generated/torchtune.datasets.slimorca_dataset", "generated/torchtune.datasets.stack_exchange_paired_dataset", "generated/torchtune.datasets.text_completion_dataset", "generated/torchtune.datasets.wikitext_dataset", "generated/torchtune.generation.generate", "generated/torchtune.generation.generate_next_token", "generated/torchtune.generation.get_causal_mask_from_padding_mask", "generated/torchtune.generation.get_position_ids_from_padding_mask", "generated/torchtune.generation.sample", "generated/torchtune.models.clip.TilePositionalEmbedding", "generated/torchtune.models.clip.TiledTokenPositionalEmbedding", "generated/torchtune.models.clip.TokenPositionalEmbedding", "generated/torchtune.models.clip.clip_vision_encoder", "generated/torchtune.models.code_llama2.code_llama2_13b", 
"generated/torchtune.models.code_llama2.code_llama2_70b", "generated/torchtune.models.code_llama2.code_llama2_7b", "generated/torchtune.models.code_llama2.lora_code_llama2_13b", "generated/torchtune.models.code_llama2.lora_code_llama2_70b", "generated/torchtune.models.code_llama2.lora_code_llama2_7b", "generated/torchtune.models.code_llama2.qlora_code_llama2_13b", "generated/torchtune.models.code_llama2.qlora_code_llama2_70b", "generated/torchtune.models.code_llama2.qlora_code_llama2_7b", "generated/torchtune.models.gemma.gemma", "generated/torchtune.models.gemma.gemma_2b", "generated/torchtune.models.gemma.gemma_7b", "generated/torchtune.models.gemma.gemma_tokenizer", "generated/torchtune.models.gemma.lora_gemma", "generated/torchtune.models.gemma.lora_gemma_2b", "generated/torchtune.models.gemma.lora_gemma_7b", "generated/torchtune.models.gemma.qlora_gemma_2b", "generated/torchtune.models.gemma.qlora_gemma_7b", "generated/torchtune.models.gemma2.gemma2", "generated/torchtune.models.gemma2.gemma2_27b", "generated/torchtune.models.gemma2.gemma2_2b", "generated/torchtune.models.gemma2.gemma2_9b", "generated/torchtune.models.gemma2.lora_gemma2", "generated/torchtune.models.gemma2.lora_gemma2_27b", "generated/torchtune.models.gemma2.lora_gemma2_2b", "generated/torchtune.models.gemma2.lora_gemma2_9b", "generated/torchtune.models.gemma2.qlora_gemma2_27b", "generated/torchtune.models.gemma2.qlora_gemma2_2b", "generated/torchtune.models.gemma2.qlora_gemma2_9b", "generated/torchtune.models.llama2.Llama2ChatTemplate", "generated/torchtune.models.llama2.llama2", "generated/torchtune.models.llama2.llama2_13b", "generated/torchtune.models.llama2.llama2_70b", "generated/torchtune.models.llama2.llama2_7b", "generated/torchtune.models.llama2.llama2_reward_7b", "generated/torchtune.models.llama2.llama2_tokenizer", "generated/torchtune.models.llama2.lora_llama2", "generated/torchtune.models.llama2.lora_llama2_13b", "generated/torchtune.models.llama2.lora_llama2_70b", "generated/torchtune.models.llama2.lora_llama2_7b", "generated/torchtune.models.llama2.lora_llama2_reward_7b", "generated/torchtune.models.llama2.qlora_llama2_13b", "generated/torchtune.models.llama2.qlora_llama2_70b", "generated/torchtune.models.llama2.qlora_llama2_7b", "generated/torchtune.models.llama2.qlora_llama2_reward_7b", "generated/torchtune.models.llama3.llama3", "generated/torchtune.models.llama3.llama3_70b", "generated/torchtune.models.llama3.llama3_8b", "generated/torchtune.models.llama3.llama3_tokenizer", "generated/torchtune.models.llama3.lora_llama3", "generated/torchtune.models.llama3.lora_llama3_70b", "generated/torchtune.models.llama3.lora_llama3_8b", "generated/torchtune.models.llama3.qlora_llama3_70b", "generated/torchtune.models.llama3.qlora_llama3_8b", "generated/torchtune.models.llama3_1.llama3_1", "generated/torchtune.models.llama3_1.llama3_1_405b", "generated/torchtune.models.llama3_1.llama3_1_70b", "generated/torchtune.models.llama3_1.llama3_1_8b", "generated/torchtune.models.llama3_1.lora_llama3_1", "generated/torchtune.models.llama3_1.lora_llama3_1_405b", "generated/torchtune.models.llama3_1.lora_llama3_1_70b", "generated/torchtune.models.llama3_1.lora_llama3_1_8b", "generated/torchtune.models.llama3_1.qlora_llama3_1_405b", "generated/torchtune.models.llama3_1.qlora_llama3_1_70b", "generated/torchtune.models.llama3_1.qlora_llama3_1_8b", "generated/torchtune.models.llama3_2.llama3_2_1b", "generated/torchtune.models.llama3_2.llama3_2_3b", "generated/torchtune.models.llama3_2.lora_llama3_2_1b", 
"generated/torchtune.models.llama3_2.lora_llama3_2_3b", "generated/torchtune.models.llama3_2.qlora_llama3_2_1b", "generated/torchtune.models.llama3_2.qlora_llama3_2_3b", "generated/torchtune.models.llama3_2_vision.Llama3VisionEncoder", "generated/torchtune.models.llama3_2_vision.Llama3VisionProjectionHead", "generated/torchtune.models.llama3_2_vision.Llama3VisionTransform", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_11b", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_decoder", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_encoder", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_transform", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_11b", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_decoder", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_encoder", "generated/torchtune.models.llama3_2_vision.qlora_llama3_2_vision_11b", "generated/torchtune.models.llama3_3.llama3_3_70b", "generated/torchtune.models.llama3_3.lora_llama3_3_70b", "generated/torchtune.models.llama3_3.qlora_llama3_3_70b", "generated/torchtune.models.mistral.MistralChatTemplate", "generated/torchtune.models.mistral.lora_mistral", "generated/torchtune.models.mistral.lora_mistral_7b", "generated/torchtune.models.mistral.lora_mistral_classifier", "generated/torchtune.models.mistral.lora_mistral_reward_7b", "generated/torchtune.models.mistral.mistral", "generated/torchtune.models.mistral.mistral_7b", "generated/torchtune.models.mistral.mistral_classifier", "generated/torchtune.models.mistral.mistral_reward_7b", "generated/torchtune.models.mistral.mistral_tokenizer", "generated/torchtune.models.mistral.qlora_mistral_7b", "generated/torchtune.models.mistral.qlora_mistral_reward_7b", "generated/torchtune.models.phi3.lora_phi3", "generated/torchtune.models.phi3.lora_phi3_mini", "generated/torchtune.models.phi3.phi3", "generated/torchtune.models.phi3.phi3_mini", "generated/torchtune.models.phi3.phi3_mini_tokenizer", "generated/torchtune.models.phi3.qlora_phi3_mini", "generated/torchtune.models.qwen2.lora_qwen2", "generated/torchtune.models.qwen2.lora_qwen2_0_5b", "generated/torchtune.models.qwen2.lora_qwen2_1_5b", "generated/torchtune.models.qwen2.lora_qwen2_7b", "generated/torchtune.models.qwen2.qwen2", "generated/torchtune.models.qwen2.qwen2_0_5b", "generated/torchtune.models.qwen2.qwen2_1_5b", "generated/torchtune.models.qwen2.qwen2_7b", "generated/torchtune.models.qwen2.qwen2_tokenizer", "generated/torchtune.models.qwen2_5.lora_qwen2_5_0_5b", "generated/torchtune.models.qwen2_5.lora_qwen2_5_14b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_14b_instruct", "generated/torchtune.models.qwen2_5.lora_qwen2_5_1_5b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_1_5b_instruct", "generated/torchtune.models.qwen2_5.lora_qwen2_5_32b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_32b_instruct", "generated/torchtune.models.qwen2_5.lora_qwen2_5_3b", "generated/torchtune.models.qwen2_5.lora_qwen2_5_72b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_72b_instruct", "generated/torchtune.models.qwen2_5.lora_qwen2_5_7b_base", "generated/torchtune.models.qwen2_5.lora_qwen2_5_7b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_0_5b", "generated/torchtune.models.qwen2_5.qwen2_5_14b_base", "generated/torchtune.models.qwen2_5.qwen2_5_14b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_1_5b_base", "generated/torchtune.models.qwen2_5.qwen2_5_1_5b_instruct", 
"generated/torchtune.models.qwen2_5.qwen2_5_32b_base", "generated/torchtune.models.qwen2_5.qwen2_5_32b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_3b", "generated/torchtune.models.qwen2_5.qwen2_5_72b_base", "generated/torchtune.models.qwen2_5.qwen2_5_72b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_7b_base", "generated/torchtune.models.qwen2_5.qwen2_5_7b_instruct", "generated/torchtune.models.qwen2_5.qwen2_5_tokenizer", "generated/torchtune.modules.FeedForward", "generated/torchtune.modules.Fp32LayerNorm", "generated/torchtune.modules.KVCache", "generated/torchtune.modules.LayerDropout", "generated/torchtune.modules.MultiHeadAttention", "generated/torchtune.modules.RMSNorm", "generated/torchtune.modules.RotaryPositionalEmbeddings", "generated/torchtune.modules.TanhGate", "generated/torchtune.modules.TiedLinear", "generated/torchtune.modules.TransformerCrossAttentionLayer", "generated/torchtune.modules.TransformerDecoder", "generated/torchtune.modules.TransformerSelfAttentionLayer", "generated/torchtune.modules.VisionTransformer", "generated/torchtune.modules.common_utils.delete_kv_caches", "generated/torchtune.modules.common_utils.disable_kv_cache", "generated/torchtune.modules.common_utils.local_kv_cache", "generated/torchtune.modules.common_utils.reparametrize_as_dtype_state_dict_post_hook", "generated/torchtune.modules.loss.CEWithChunkedOutputLoss", "generated/torchtune.modules.loss.ForwardKLLoss", "generated/torchtune.modules.loss.ForwardKLWithChunkedOutputLoss", "generated/torchtune.modules.model_fusion.DeepFusionModel", "generated/torchtune.modules.model_fusion.FusionEmbedding", "generated/torchtune.modules.model_fusion.FusionLayer", "generated/torchtune.modules.model_fusion.get_fusion_params", "generated/torchtune.modules.model_fusion.register_fusion_module", "generated/torchtune.modules.peft.AdapterModule", "generated/torchtune.modules.peft.DoRALinear", "generated/torchtune.modules.peft.LoRALinear", "generated/torchtune.modules.peft.disable_adapter", "generated/torchtune.modules.peft.get_adapter_params", "generated/torchtune.modules.peft.get_adapter_state_dict", "generated/torchtune.modules.peft.set_trainable_params", "generated/torchtune.modules.peft.validate_missing_and_unexpected_for_lora", "generated/torchtune.modules.prepare_layer_dropout", "generated/torchtune.modules.tokenizers.BaseTokenizer", "generated/torchtune.modules.tokenizers.ModelTokenizer", "generated/torchtune.modules.tokenizers.SentencePieceBaseTokenizer", "generated/torchtune.modules.tokenizers.TikTokenBaseTokenizer", "generated/torchtune.modules.tokenizers.parse_hf_tokenizer_json", "generated/torchtune.modules.tokenizers.tokenize_messages_no_special_tokens", "generated/torchtune.modules.transforms.Transform", "generated/torchtune.modules.transforms.VisionCrossAttentionMask", "generated/torchtune.rlhf.estimate_advantages", "generated/torchtune.rlhf.get_rewards_ppo", "generated/torchtune.rlhf.loss.DPOLoss", "generated/torchtune.rlhf.loss.PPOLoss", "generated/torchtune.rlhf.loss.RSOLoss", "generated/torchtune.rlhf.loss.SimPOLoss", "generated/torchtune.rlhf.truncate_sequence_at_first_stop_token", "generated/torchtune.training.FormattedCheckpointFiles", "generated/torchtune.training.FullModelHFCheckpointer", "generated/torchtune.training.FullModelMetaCheckpointer", "generated/torchtune.training.FullModelTorchTuneCheckpointer", "generated/torchtune.training.ModelType", "generated/torchtune.training.OptimizerInBackwardWrapper", 
"generated/torchtune.training.apply_selective_activation_checkpointing", "generated/torchtune.training.create_optim_in_bwd_wrapper", "generated/torchtune.training.gather_cpu_state_dict", "generated/torchtune.training.get_cosine_schedule_with_warmup", "generated/torchtune.training.get_dtype", "generated/torchtune.training.get_lr", "generated/torchtune.training.get_memory_stats", "generated/torchtune.training.get_quantizer_mode", "generated/torchtune.training.get_unmasked_sequence_lengths", "generated/torchtune.training.init_distributed", "generated/torchtune.training.is_distributed", "generated/torchtune.training.log_memory_stats", "generated/torchtune.training.metric_logging.CometLogger", "generated/torchtune.training.metric_logging.DiskLogger", "generated/torchtune.training.metric_logging.StdoutLogger", "generated/torchtune.training.metric_logging.TensorBoardLogger", "generated/torchtune.training.metric_logging.WandBLogger", "generated/torchtune.training.register_optim_in_bwd_hooks", "generated/torchtune.training.set_activation_checkpointing", "generated/torchtune.training.set_default_dtype", "generated/torchtune.training.set_seed", "generated/torchtune.training.setup_torch_profiler", "generated/torchtune.training.update_state_dict_for_classifier", "generated/torchtune.training.validate_expected_param_dtype", "generated/torchtune.utils.batch_to_device", "generated/torchtune.utils.get_device", "generated/torchtune.utils.get_logger", "generated/torchtune.utils.get_world_size_and_rank", "generated/torchtune.utils.torch_version_ge", "generated_examples/index", "generated_examples/sg_execution_times", "index", "install", "overview", "recipes/dpo", "recipes/lora_finetune_single_device", "recipes/qat_distributed", "recipes/recipes_overview", "sg_execution_times", "tune_cli", "tutorials/chat", "tutorials/e2e_flow", "tutorials/first_finetune_tutorial", "tutorials/llama3", "tutorials/llama_kd_tutorial", "tutorials/lora_finetune", "tutorials/memory_optimizations", "tutorials/qat_finetune", "tutorials/qlora_finetune"], "filenames": ["api_ref_config.rst", "api_ref_data.rst", "api_ref_datasets.rst", "api_ref_generation.rst", "api_ref_models.rst", "api_ref_modules.rst", "api_ref_rlhf.rst", "api_ref_training.rst", "api_ref_utilities.rst", "basics/chat_datasets.rst", "basics/custom_components.rst", "basics/datasets_overview.rst", "basics/instruct_datasets.rst", "basics/message_transforms.rst", "basics/messages.rst", "basics/model_transforms.rst", "basics/multimodal_datasets.rst", "basics/packing.rst", "basics/preference_datasets.rst", "basics/prompt_templates.rst", "basics/text_completion_datasets.rst", "basics/tokenizers.rst", "deep_dives/checkpointer.rst", "deep_dives/comet_logging.rst", "deep_dives/configs.rst", "deep_dives/recipe_deepdive.rst", "deep_dives/wandb_logging.rst", "generated/torchtune.config.instantiate.rst", "generated/torchtune.config.log_config.rst", "generated/torchtune.config.parse.rst", "generated/torchtune.config.validate.rst", "generated/torchtune.data.AlpacaToMessages.rst", "generated/torchtune.data.ChatMLTemplate.rst", "generated/torchtune.data.ChosenRejectedToMessages.rst", "generated/torchtune.data.GrammarErrorCorrectionTemplate.rst", "generated/torchtune.data.InputOutputToMessages.rst", "generated/torchtune.data.Message.rst", "generated/torchtune.data.OpenAIToMessages.rst", "generated/torchtune.data.PromptTemplate.rst", "generated/torchtune.data.PromptTemplateInterface.rst", "generated/torchtune.data.QuestionAnswerTemplate.rst", "generated/torchtune.data.Role.rst", 
"generated/torchtune.data.ShareGPTToMessages.rst", "generated/torchtune.data.SummarizeTemplate.rst", "generated/torchtune.data.format_content_with_images.rst", "generated/torchtune.data.left_pad_sequence.rst", "generated/torchtune.data.load_image.rst", "generated/torchtune.data.padded_collate.rst", "generated/torchtune.data.padded_collate_dpo.rst", "generated/torchtune.data.padded_collate_sft.rst", "generated/torchtune.data.padded_collate_tiled_images_and_mask.rst", "generated/torchtune.data.truncate.rst", "generated/torchtune.data.validate_messages.rst", "generated/torchtune.datasets.ConcatDataset.rst", "generated/torchtune.datasets.PackedDataset.rst", "generated/torchtune.datasets.PreferenceDataset.rst", "generated/torchtune.datasets.SFTDataset.rst", "generated/torchtune.datasets.TextCompletionDataset.rst", "generated/torchtune.datasets.alpaca_cleaned_dataset.rst", "generated/torchtune.datasets.alpaca_dataset.rst", "generated/torchtune.datasets.chat_dataset.rst", "generated/torchtune.datasets.cnn_dailymail_articles_dataset.rst", "generated/torchtune.datasets.grammar_dataset.rst", "generated/torchtune.datasets.hh_rlhf_helpful_dataset.rst", "generated/torchtune.datasets.instruct_dataset.rst", "generated/torchtune.datasets.multimodal.llava_instruct_dataset.rst", "generated/torchtune.datasets.multimodal.the_cauldron_dataset.rst", "generated/torchtune.datasets.multimodal.vqa_dataset.rst", "generated/torchtune.datasets.preference_dataset.rst", "generated/torchtune.datasets.samsum_dataset.rst", "generated/torchtune.datasets.slimorca_dataset.rst", "generated/torchtune.datasets.stack_exchange_paired_dataset.rst", "generated/torchtune.datasets.text_completion_dataset.rst", "generated/torchtune.datasets.wikitext_dataset.rst", "generated/torchtune.generation.generate.rst", "generated/torchtune.generation.generate_next_token.rst", "generated/torchtune.generation.get_causal_mask_from_padding_mask.rst", "generated/torchtune.generation.get_position_ids_from_padding_mask.rst", "generated/torchtune.generation.sample.rst", "generated/torchtune.models.clip.TilePositionalEmbedding.rst", "generated/torchtune.models.clip.TiledTokenPositionalEmbedding.rst", "generated/torchtune.models.clip.TokenPositionalEmbedding.rst", "generated/torchtune.models.clip.clip_vision_encoder.rst", "generated/torchtune.models.code_llama2.code_llama2_13b.rst", "generated/torchtune.models.code_llama2.code_llama2_70b.rst", "generated/torchtune.models.code_llama2.code_llama2_7b.rst", "generated/torchtune.models.code_llama2.lora_code_llama2_13b.rst", "generated/torchtune.models.code_llama2.lora_code_llama2_70b.rst", "generated/torchtune.models.code_llama2.lora_code_llama2_7b.rst", "generated/torchtune.models.code_llama2.qlora_code_llama2_13b.rst", "generated/torchtune.models.code_llama2.qlora_code_llama2_70b.rst", "generated/torchtune.models.code_llama2.qlora_code_llama2_7b.rst", "generated/torchtune.models.gemma.gemma.rst", "generated/torchtune.models.gemma.gemma_2b.rst", "generated/torchtune.models.gemma.gemma_7b.rst", "generated/torchtune.models.gemma.gemma_tokenizer.rst", "generated/torchtune.models.gemma.lora_gemma.rst", "generated/torchtune.models.gemma.lora_gemma_2b.rst", "generated/torchtune.models.gemma.lora_gemma_7b.rst", "generated/torchtune.models.gemma.qlora_gemma_2b.rst", "generated/torchtune.models.gemma.qlora_gemma_7b.rst", "generated/torchtune.models.gemma2.gemma2.rst", "generated/torchtune.models.gemma2.gemma2_27b.rst", "generated/torchtune.models.gemma2.gemma2_2b.rst", 
"generated/torchtune.models.gemma2.gemma2_9b.rst", "generated/torchtune.models.gemma2.lora_gemma2.rst", "generated/torchtune.models.gemma2.lora_gemma2_27b.rst", "generated/torchtune.models.gemma2.lora_gemma2_2b.rst", "generated/torchtune.models.gemma2.lora_gemma2_9b.rst", "generated/torchtune.models.gemma2.qlora_gemma2_27b.rst", "generated/torchtune.models.gemma2.qlora_gemma2_2b.rst", "generated/torchtune.models.gemma2.qlora_gemma2_9b.rst", "generated/torchtune.models.llama2.Llama2ChatTemplate.rst", "generated/torchtune.models.llama2.llama2.rst", "generated/torchtune.models.llama2.llama2_13b.rst", "generated/torchtune.models.llama2.llama2_70b.rst", "generated/torchtune.models.llama2.llama2_7b.rst", "generated/torchtune.models.llama2.llama2_reward_7b.rst", "generated/torchtune.models.llama2.llama2_tokenizer.rst", "generated/torchtune.models.llama2.lora_llama2.rst", "generated/torchtune.models.llama2.lora_llama2_13b.rst", "generated/torchtune.models.llama2.lora_llama2_70b.rst", "generated/torchtune.models.llama2.lora_llama2_7b.rst", "generated/torchtune.models.llama2.lora_llama2_reward_7b.rst", "generated/torchtune.models.llama2.qlora_llama2_13b.rst", "generated/torchtune.models.llama2.qlora_llama2_70b.rst", "generated/torchtune.models.llama2.qlora_llama2_7b.rst", "generated/torchtune.models.llama2.qlora_llama2_reward_7b.rst", "generated/torchtune.models.llama3.llama3.rst", "generated/torchtune.models.llama3.llama3_70b.rst", "generated/torchtune.models.llama3.llama3_8b.rst", "generated/torchtune.models.llama3.llama3_tokenizer.rst", "generated/torchtune.models.llama3.lora_llama3.rst", "generated/torchtune.models.llama3.lora_llama3_70b.rst", "generated/torchtune.models.llama3.lora_llama3_8b.rst", "generated/torchtune.models.llama3.qlora_llama3_70b.rst", "generated/torchtune.models.llama3.qlora_llama3_8b.rst", "generated/torchtune.models.llama3_1.llama3_1.rst", "generated/torchtune.models.llama3_1.llama3_1_405b.rst", "generated/torchtune.models.llama3_1.llama3_1_70b.rst", "generated/torchtune.models.llama3_1.llama3_1_8b.rst", "generated/torchtune.models.llama3_1.lora_llama3_1.rst", "generated/torchtune.models.llama3_1.lora_llama3_1_405b.rst", "generated/torchtune.models.llama3_1.lora_llama3_1_70b.rst", "generated/torchtune.models.llama3_1.lora_llama3_1_8b.rst", "generated/torchtune.models.llama3_1.qlora_llama3_1_405b.rst", "generated/torchtune.models.llama3_1.qlora_llama3_1_70b.rst", "generated/torchtune.models.llama3_1.qlora_llama3_1_8b.rst", "generated/torchtune.models.llama3_2.llama3_2_1b.rst", "generated/torchtune.models.llama3_2.llama3_2_3b.rst", "generated/torchtune.models.llama3_2.lora_llama3_2_1b.rst", "generated/torchtune.models.llama3_2.lora_llama3_2_3b.rst", "generated/torchtune.models.llama3_2.qlora_llama3_2_1b.rst", "generated/torchtune.models.llama3_2.qlora_llama3_2_3b.rst", "generated/torchtune.models.llama3_2_vision.Llama3VisionEncoder.rst", "generated/torchtune.models.llama3_2_vision.Llama3VisionProjectionHead.rst", "generated/torchtune.models.llama3_2_vision.Llama3VisionTransform.rst", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_11b.rst", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_decoder.rst", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_encoder.rst", "generated/torchtune.models.llama3_2_vision.llama3_2_vision_transform.rst", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_11b.rst", "generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_decoder.rst", 
"generated/torchtune.models.llama3_2_vision.lora_llama3_2_vision_encoder.rst", "generated/torchtune.models.llama3_2_vision.qlora_llama3_2_vision_11b.rst", "generated/torchtune.models.llama3_3.llama3_3_70b.rst", "generated/torchtune.models.llama3_3.lora_llama3_3_70b.rst", "generated/torchtune.models.llama3_3.qlora_llama3_3_70b.rst", "generated/torchtune.models.mistral.MistralChatTemplate.rst", "generated/torchtune.models.mistral.lora_mistral.rst", "generated/torchtune.models.mistral.lora_mistral_7b.rst", "generated/torchtune.models.mistral.lora_mistral_classifier.rst", "generated/torchtune.models.mistral.lora_mistral_reward_7b.rst", "generated/torchtune.models.mistral.mistral.rst", "generated/torchtune.models.mistral.mistral_7b.rst", "generated/torchtune.models.mistral.mistral_classifier.rst", "generated/torchtune.models.mistral.mistral_reward_7b.rst", "generated/torchtune.models.mistral.mistral_tokenizer.rst", "generated/torchtune.models.mistral.qlora_mistral_7b.rst", "generated/torchtune.models.mistral.qlora_mistral_reward_7b.rst", "generated/torchtune.models.phi3.lora_phi3.rst", "generated/torchtune.models.phi3.lora_phi3_mini.rst", "generated/torchtune.models.phi3.phi3.rst", "generated/torchtune.models.phi3.phi3_mini.rst", "generated/torchtune.models.phi3.phi3_mini_tokenizer.rst", "generated/torchtune.models.phi3.qlora_phi3_mini.rst", "generated/torchtune.models.qwen2.lora_qwen2.rst", "generated/torchtune.models.qwen2.lora_qwen2_0_5b.rst", "generated/torchtune.models.qwen2.lora_qwen2_1_5b.rst", "generated/torchtune.models.qwen2.lora_qwen2_7b.rst", "generated/torchtune.models.qwen2.qwen2.rst", "generated/torchtune.models.qwen2.qwen2_0_5b.rst", "generated/torchtune.models.qwen2.qwen2_1_5b.rst", "generated/torchtune.models.qwen2.qwen2_7b.rst", "generated/torchtune.models.qwen2.qwen2_tokenizer.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_0_5b.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_14b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_14b_instruct.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_1_5b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_1_5b_instruct.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_32b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_32b_instruct.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_3b.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_72b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_72b_instruct.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_7b_base.rst", "generated/torchtune.models.qwen2_5.lora_qwen2_5_7b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_0_5b.rst", "generated/torchtune.models.qwen2_5.qwen2_5_14b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_14b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_1_5b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_1_5b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_32b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_32b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_3b.rst", "generated/torchtune.models.qwen2_5.qwen2_5_72b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_72b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_7b_base.rst", "generated/torchtune.models.qwen2_5.qwen2_5_7b_instruct.rst", "generated/torchtune.models.qwen2_5.qwen2_5_tokenizer.rst", "generated/torchtune.modules.FeedForward.rst", "generated/torchtune.modules.Fp32LayerNorm.rst", "generated/torchtune.modules.KVCache.rst", 
"generated/torchtune.modules.LayerDropout.rst", "generated/torchtune.modules.MultiHeadAttention.rst", "generated/torchtune.modules.RMSNorm.rst", "generated/torchtune.modules.RotaryPositionalEmbeddings.rst", "generated/torchtune.modules.TanhGate.rst", "generated/torchtune.modules.TiedLinear.rst", "generated/torchtune.modules.TransformerCrossAttentionLayer.rst", "generated/torchtune.modules.TransformerDecoder.rst", "generated/torchtune.modules.TransformerSelfAttentionLayer.rst", "generated/torchtune.modules.VisionTransformer.rst", "generated/torchtune.modules.common_utils.delete_kv_caches.rst", "generated/torchtune.modules.common_utils.disable_kv_cache.rst", "generated/torchtune.modules.common_utils.local_kv_cache.rst", "generated/torchtune.modules.common_utils.reparametrize_as_dtype_state_dict_post_hook.rst", "generated/torchtune.modules.loss.CEWithChunkedOutputLoss.rst", "generated/torchtune.modules.loss.ForwardKLLoss.rst", "generated/torchtune.modules.loss.ForwardKLWithChunkedOutputLoss.rst", "generated/torchtune.modules.model_fusion.DeepFusionModel.rst", "generated/torchtune.modules.model_fusion.FusionEmbedding.rst", "generated/torchtune.modules.model_fusion.FusionLayer.rst", "generated/torchtune.modules.model_fusion.get_fusion_params.rst", "generated/torchtune.modules.model_fusion.register_fusion_module.rst", "generated/torchtune.modules.peft.AdapterModule.rst", "generated/torchtune.modules.peft.DoRALinear.rst", "generated/torchtune.modules.peft.LoRALinear.rst", "generated/torchtune.modules.peft.disable_adapter.rst", "generated/torchtune.modules.peft.get_adapter_params.rst", "generated/torchtune.modules.peft.get_adapter_state_dict.rst", "generated/torchtune.modules.peft.set_trainable_params.rst", "generated/torchtune.modules.peft.validate_missing_and_unexpected_for_lora.rst", "generated/torchtune.modules.prepare_layer_dropout.rst", "generated/torchtune.modules.tokenizers.BaseTokenizer.rst", "generated/torchtune.modules.tokenizers.ModelTokenizer.rst", "generated/torchtune.modules.tokenizers.SentencePieceBaseTokenizer.rst", "generated/torchtune.modules.tokenizers.TikTokenBaseTokenizer.rst", "generated/torchtune.modules.tokenizers.parse_hf_tokenizer_json.rst", "generated/torchtune.modules.tokenizers.tokenize_messages_no_special_tokens.rst", "generated/torchtune.modules.transforms.Transform.rst", "generated/torchtune.modules.transforms.VisionCrossAttentionMask.rst", "generated/torchtune.rlhf.estimate_advantages.rst", "generated/torchtune.rlhf.get_rewards_ppo.rst", "generated/torchtune.rlhf.loss.DPOLoss.rst", "generated/torchtune.rlhf.loss.PPOLoss.rst", "generated/torchtune.rlhf.loss.RSOLoss.rst", "generated/torchtune.rlhf.loss.SimPOLoss.rst", "generated/torchtune.rlhf.truncate_sequence_at_first_stop_token.rst", "generated/torchtune.training.FormattedCheckpointFiles.rst", "generated/torchtune.training.FullModelHFCheckpointer.rst", "generated/torchtune.training.FullModelMetaCheckpointer.rst", "generated/torchtune.training.FullModelTorchTuneCheckpointer.rst", "generated/torchtune.training.ModelType.rst", "generated/torchtune.training.OptimizerInBackwardWrapper.rst", "generated/torchtune.training.apply_selective_activation_checkpointing.rst", "generated/torchtune.training.create_optim_in_bwd_wrapper.rst", "generated/torchtune.training.gather_cpu_state_dict.rst", "generated/torchtune.training.get_cosine_schedule_with_warmup.rst", "generated/torchtune.training.get_dtype.rst", "generated/torchtune.training.get_lr.rst", "generated/torchtune.training.get_memory_stats.rst", 
"generated/torchtune.training.get_quantizer_mode.rst", "generated/torchtune.training.get_unmasked_sequence_lengths.rst", "generated/torchtune.training.init_distributed.rst", "generated/torchtune.training.is_distributed.rst", "generated/torchtune.training.log_memory_stats.rst", "generated/torchtune.training.metric_logging.CometLogger.rst", "generated/torchtune.training.metric_logging.DiskLogger.rst", "generated/torchtune.training.metric_logging.StdoutLogger.rst", "generated/torchtune.training.metric_logging.TensorBoardLogger.rst", "generated/torchtune.training.metric_logging.WandBLogger.rst", "generated/torchtune.training.register_optim_in_bwd_hooks.rst", "generated/torchtune.training.set_activation_checkpointing.rst", "generated/torchtune.training.set_default_dtype.rst", "generated/torchtune.training.set_seed.rst", "generated/torchtune.training.setup_torch_profiler.rst", "generated/torchtune.training.update_state_dict_for_classifier.rst", "generated/torchtune.training.validate_expected_param_dtype.rst", "generated/torchtune.utils.batch_to_device.rst", "generated/torchtune.utils.get_device.rst", "generated/torchtune.utils.get_logger.rst", "generated/torchtune.utils.get_world_size_and_rank.rst", "generated/torchtune.utils.torch_version_ge.rst", "generated_examples/index.rst", "generated_examples/sg_execution_times.rst", "index.rst", "install.rst", "overview.rst", "recipes/dpo.rst", "recipes/lora_finetune_single_device.rst", "recipes/qat_distributed.rst", "recipes/recipes_overview.rst", "sg_execution_times.rst", "tune_cli.rst", "tutorials/chat.rst", "tutorials/e2e_flow.rst", "tutorials/first_finetune_tutorial.rst", "tutorials/llama3.rst", "tutorials/llama_kd_tutorial.rst", "tutorials/lora_finetune.rst", "tutorials/memory_optimizations.rst", "tutorials/qat_finetune.rst", "tutorials/qlora_finetune.rst"], "titles": ["torchtune.config", "torchtune.data", "torchtune.datasets", "torchtune.generation", "torchtune.models", "torchtune.modules", "torchtune.rlhf", "torchtune.training", "torchtune.utils", "Chat Datasets", "Custom Components and Recipes", "Datasets Overview", "Instruct Datasets", "Message Transforms", "Messages", "Multimodal Transforms", "Multimodal Datasets", "Sample packing", "Preference Datasets", "Prompt Templates", "Text-completion Datasets", "Tokenizers", "Checkpointing in torchtune", "Logging to Comet", "All About Configs", "What Are Recipes?", "Logging to Weights & Biases", "instantiate", "log_config", "parse", "validate", "AlpacaToMessages", "ChatMLTemplate", "ChosenRejectedToMessages", "torchtune.data.GrammarErrorCorrectionTemplate", "InputOutputToMessages", "Message", "OpenAIToMessages", "PromptTemplate", "PromptTemplateInterface", "torchtune.data.QuestionAnswerTemplate", "torchtune.data.Role", "ShareGPTToMessages", "torchtune.data.SummarizeTemplate", "format_content_with_images", "left_pad_sequence", "load_image", "padded_collate", "padded_collate_dpo", "padded_collate_sft", "padded_collate_tiled_images_and_mask", "truncate", "validate_messages", "ConcatDataset", "PackedDataset", "PreferenceDataset", "SFTDataset", "TextCompletionDataset", "alpaca_cleaned_dataset", "alpaca_dataset", "chat_dataset", "cnn_dailymail_articles_dataset", "grammar_dataset", "hh_rlhf_helpful_dataset", "instruct_dataset", "llava_instruct_dataset", "the_cauldron_dataset", "vqa_dataset", "preference_dataset", "samsum_dataset", "slimorca_dataset", "stack_exchange_paired_dataset", "text_completion_dataset", "wikitext_dataset", "generate", "generate_next_token", "get_causal_mask_from_padding_mask", 
"get_position_ids_from_padding_mask", "sample", "TilePositionalEmbedding", "TiledTokenPositionalEmbedding", "TokenPositionalEmbedding", "clip_vision_encoder", "code_llama2_13b", "code_llama2_70b", "code_llama2_7b", "lora_code_llama2_13b", "lora_code_llama2_70b", "lora_code_llama2_7b", "qlora_code_llama2_13b", "qlora_code_llama2_70b", "qlora_code_llama2_7b", "gemma", "gemma_2b", "gemma_7b", "gemma_tokenizer", "lora_gemma", "lora_gemma_2b", "lora_gemma_7b", "qlora_gemma_2b", "qlora_gemma_7b", "gemma2", "gemma2_27b", "gemma2_2b", "gemma2_9b", "lora_gemma2", "lora_gemma2_27b", "lora_gemma2_2b", "lora_gemma2_9b", "qlora_gemma2_27b", "qlora_gemma2_2b", "qlora_gemma2_9b", "Llama2ChatTemplate", "llama2", "llama2_13b", "llama2_70b", "llama2_7b", "llama2_reward_7b", "llama2_tokenizer", "lora_llama2", "lora_llama2_13b", "lora_llama2_70b", "lora_llama2_7b", "lora_llama2_reward_7b", "qlora_llama2_13b", "qlora_llama2_70b", "qlora_llama2_7b", "qlora_llama2_reward_7b", "llama3", "llama3_70b", "llama3_8b", "llama3_tokenizer", "lora_llama3", "lora_llama3_70b", "lora_llama3_8b", "qlora_llama3_70b", "qlora_llama3_8b", "llama3_1", "llama3_1_405b", "llama3_1_70b", "llama3_1_8b", "lora_llama3_1", "lora_llama3_1_405b", "lora_llama3_1_70b", "lora_llama3_1_8b", "qlora_llama3_1_405b", "qlora_llama3_1_70b", "qlora_llama3_1_8b", "llama3_2_1b", "llama3_2_3b", "lora_llama3_2_1b", "lora_llama3_2_3b", "qlora_llama3_2_1b", "qlora_llama3_2_3b", "Llama3VisionEncoder", "Llama3VisionProjectionHead", "Llama3VisionTransform", "llama3_2_vision_11b", "llama3_2_vision_decoder", "llama3_2_vision_encoder", "llama3_2_vision_transform", "lora_llama3_2_vision_11b", "lora_llama3_2_vision_decoder", "lora_llama3_2_vision_encoder", "qlora_llama3_2_vision_11b", "llama3_3_70b", "lora_llama3_3_70b", "qlora_llama3_3_70b", "MistralChatTemplate", "lora_mistral", "lora_mistral_7b", "lora_mistral_classifier", "lora_mistral_reward_7b", "mistral", "mistral_7b", "mistral_classifier", "mistral_reward_7b", "mistral_tokenizer", "qlora_mistral_7b", "qlora_mistral_reward_7b", "lora_phi3", "lora_phi3_mini", "phi3", "phi3_mini", "phi3_mini_tokenizer", "qlora_phi3_mini", "lora_qwen2", "lora_qwen2_0_5b", "lora_qwen2_1_5b", "lora_qwen2_7b", "qwen2", "qwen2_0_5b", "qwen2_1_5b", "qwen2_7b", "qwen2_tokenizer", "lora_qwen2_5_0_5b", "lora_qwen2_5_14b_base", "lora_qwen2_5_14b_instruct", "lora_qwen2_5_1_5b_base", "lora_qwen2_5_1_5b_instruct", "lora_qwen2_5_32b_base", "lora_qwen2_5_32b_instruct", "lora_qwen2_5_3b", "lora_qwen2_5_72b_base", "lora_qwen2_5_72b_instruct", "lora_qwen2_5_7b_base", "lora_qwen2_5_7b_instruct", "qwen2_5_0_5b", "qwen2_5_14b_base", "qwen2_5_14b_instruct", "qwen2_5_1_5b_base", "qwen2_5_1_5b_instruct", "qwen2_5_32b_base", "qwen2_5_32b_instruct", "qwen2_5_3b", "qwen2_5_72b_base", "qwen2_5_72b_instruct", "qwen2_5_7b_base", "qwen2_5_7b_instruct", "qwen2_5_tokenizer", "FeedForward", "Fp32LayerNorm", "KVCache", "LayerDropout", "MultiHeadAttention", "RMSNorm", "RotaryPositionalEmbeddings", "TanhGate", "TiedLinear", "TransformerCrossAttentionLayer", "TransformerDecoder", "TransformerSelfAttentionLayer", "VisionTransformer", "delete_kv_caches", "disable_kv_cache", "local_kv_cache", "reparametrize_as_dtype_state_dict_post_hook", "CEWithChunkedOutputLoss", "ForwardKLLoss", "ForwardKLWithChunkedOutputLoss", "DeepFusionModel", "FusionEmbedding", "FusionLayer", "get_fusion_params", "register_fusion_module", "AdapterModule", "DoRALinear", "LoRALinear", "disable_adapter", "get_adapter_params", "get_adapter_state_dict", "set_trainable_params", 
"validate_missing_and_unexpected_for_lora", "prepare_layer_dropout", "BaseTokenizer", "ModelTokenizer", "SentencePieceBaseTokenizer", "TikTokenBaseTokenizer", "parse_hf_tokenizer_json", "tokenize_messages_no_special_tokens", "Transform", "VisionCrossAttentionMask", "estimate_advantages", "get_rewards_ppo", "DPOLoss", "PPOLoss", "RSOLoss", "torchtune.rlhf.loss.SimPOLoss", "truncate_sequence_at_first_stop_token", "FormattedCheckpointFiles", "FullModelHFCheckpointer", "FullModelMetaCheckpointer", "FullModelTorchTuneCheckpointer", "ModelType", "OptimizerInBackwardWrapper", "apply_selective_activation_checkpointing", "create_optim_in_bwd_wrapper", "gather_cpu_state_dict", "get_cosine_schedule_with_warmup", "get_dtype", "get_lr", "get_memory_stats", "get_quantizer_mode", "get_unmasked_sequence_lengths", "init_distributed", "is_distributed", "log_memory_stats", "CometLogger", "DiskLogger", "StdoutLogger", "TensorBoardLogger", "WandBLogger", "register_optim_in_bwd_hooks", "set_activation_checkpointing", "set_default_dtype", "set_seed", "setup_torch_profiler", "update_state_dict_for_classifier", "validate_expected_param_dtype", "batch_to_device", "get_device", "get_logger", "get_world_size_and_rank", "torch_version_ge", "<no title>", "Computation times", "Welcome to the torchtune Documentation", "Install Instructions", "torchtune Overview", "Direct Preference Optimization", "LoRA Single Device Finetuning", "Distributed Quantization-Aware Training (QAT)", "Recipes Overview", "Computation times", "torchtune CLI", "Fine-Tuning Llama3 with Chat Data", "End-to-End Workflow with torchtune", "Fine-Tune Your First LLM", "Meta Llama3 in torchtune", "Distilling Llama3.1 8B into Llama3.2 1B using Knowledge Distillation", "Fine-Tuning Llama2 with LoRA", "Memory Optimization Overview", "Fine-Tuning Llama3 with QAT", "Fine-Tuning Llama2 with QLoRA"], "terms": {"instruct": [1, 2, 4, 9, 10, 11, 13, 15, 16, 17, 18, 19, 21, 22, 31, 32, 33, 35, 37, 42, 54, 56, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 72, 157, 160, 161, 168, 176, 182, 183, 184, 191, 192, 193, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 306, 309, 310, 311, 314, 315, 316, 317, 319, 320, 322, 323], "prompt": [1, 9, 10, 11, 12, 13, 18, 31, 33, 34, 35, 36, 37, 38, 39, 40, 42, 43, 55, 56, 59, 60, 62, 63, 64, 67, 68, 69, 70, 71, 74, 75, 95, 112, 118, 131, 156, 160, 168, 177, 184, 194, 219, 230, 240, 259, 316, 318], "chat": [1, 2, 11, 13, 16, 18, 32, 37, 42, 56, 60, 112, 184, 310, 316], "includ": [1, 9, 11, 12, 16, 18, 19, 21, 22, 24, 25, 38, 39, 56, 67, 78, 82, 92, 101, 113, 128, 137, 158, 159, 160, 162, 163, 173, 184, 190, 230, 246, 247, 254, 270, 271, 308, 309, 312, 314, 315, 316, 317, 318, 319, 320, 323], "some": [1, 17, 18, 20, 21, 22, 24, 32, 171, 241, 243, 249, 251, 306, 308, 309, 310, 311, 314, 315, 317, 319, 320, 321, 322, 323], "specif": [1, 5, 11, 12, 15, 19, 21, 24, 25, 27, 55, 56, 65, 66, 67, 156, 255, 311, 315, 316, 321, 322, 323], "format": [1, 2, 7, 11, 19, 21, 36, 45, 46, 55, 56, 59, 60, 63, 64, 67, 68, 112, 156, 168, 255, 269, 270, 271, 272, 273, 314, 315, 316, 317, 318, 320, 321], "differ": [1, 9, 10, 17, 18, 19, 21, 24, 26, 48, 53, 60, 64, 67, 79, 80, 81, 156, 196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 228, 232, 246, 256, 264, 273, 298, 308, 309, 310, 311, 314, 315, 316, 318, 319, 320, 321, 322, 323], "dataset": [1, 10, 13, 14, 15, 17, 19, 24, 31, 33, 35, 36, 37, 42, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 
63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 264, 308, 309, 316, 317, 318, 319, 322], "model": [1, 2, 9, 10, 11, 12, 13, 14, 16, 17, 18, 19, 20, 22, 23, 24, 25, 27, 31, 32, 33, 35, 36, 37, 42, 53, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 79, 80, 81, 82, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 95, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 112, 113, 114, 115, 116, 117, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 131, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173, 174, 175, 176, 177, 178, 179, 180, 181, 182, 183, 184, 185, 186, 187, 188, 189, 190, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 222, 224, 226, 228, 230, 231, 232, 233, 234, 235, 236, 237, 238, 239, 240, 241, 242, 243, 244, 245, 246, 247, 248, 249, 250, 251, 252, 253, 254, 255, 258, 259, 260, 262, 263, 264, 265, 266, 267, 270, 271, 272, 273, 275, 276, 281, 286, 287, 292, 293, 297, 306, 308, 309, 310, 311, 315, 323], "convert": [1, 9, 11, 14, 21, 22, 33, 35, 37, 42, 49, 55, 56, 60, 65, 66, 68, 76, 154, 270, 277, 322, 323], "from": [1, 2, 4, 10, 11, 13, 14, 15, 17, 19, 23, 24, 25, 26, 27, 31, 33, 36, 37, 42, 45, 46, 47, 50, 53, 54, 55, 56, 57, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 77, 78, 79, 80, 81, 82, 83, 84, 85, 93, 94, 102, 103, 104, 112, 114, 115, 116, 117, 131, 155, 156, 160, 174, 176, 184, 191, 192, 193, 194, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 220, 224, 229, 230, 231, 232, 233, 234, 235, 237, 238, 239, 242, 243, 244, 245, 246, 247, 249, 250, 253, 256, 258, 261, 264, 266, 267, 269, 270, 271, 272, 274, 276, 278, 287, 290, 291, 292, 297, 298, 305, 307, 309, 311, 313, 314, 316, 317, 318, 319, 320, 321, 322], "common": [1, 2, 5, 9, 14, 15, 24, 240, 259, 309, 314, 315, 318, 320, 321, 322], "schema": [1, 9, 11, 12, 16], "convers": [1, 13, 16, 18, 19, 21, 22, 33, 42, 52, 55, 56, 60, 65, 68, 70, 270, 272, 273, 308, 315, 316, 320, 321, 323], "json": [1, 9, 12, 13, 16, 18, 21, 22, 37, 42, 55, 56, 57, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 131, 156, 160, 184, 194, 219, 258, 270, 314, 315, 316, 322], "list": [1, 9, 11, 14, 15, 18, 19, 21, 22, 24, 33, 36, 38, 44, 45, 47, 48, 49, 50, 51, 52, 53, 55, 56, 60, 61, 65, 66, 68, 73, 74, 82, 86, 87, 88, 89, 90, 91, 95, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 131, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 155, 156, 159, 160, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 177, 178, 179, 180, 181, 184, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 230, 232, 237, 239, 240, 241, 242, 245, 246, 247, 252, 253, 254, 255, 256, 257, 259, 261, 269, 270, 271, 272, 287, 298, 301, 312, 315, 316, 317, 318, 321, 322], "us": [1, 2, 4, 5, 9, 10, 11, 12, 13, 14, 16, 17, 18, 20, 21, 22, 23, 26, 27, 29, 32, 35, 36, 38, 44, 47, 50, 53, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 78, 80, 81, 82, 112, 113, 119, 128, 131, 132, 137, 141, 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 180, 184, 186, 190, 194, 219, 220, 222, 224, 226, 228, 230, 231, 232, 233, 234, 235, 236, 237, 240, 241, 244, 246, 247, 248, 
252, 256, 257, 261, 262, 263, 264, 265, 267, 270, 271, 272, 273, 274, 277, 279, 281, 287, 288, 289, 290, 291, 295, 297, 299, 300, 306, 307, 308, 309, 310, 311, 312, 314, 317, 318, 320, 321, 322], "collect": [1, 24, 317], "sampl": [1, 9, 11, 12, 13, 14, 15, 16, 19, 20, 21, 23, 26, 33, 35, 36, 37, 42, 44, 50, 54, 55, 56, 57, 62, 63, 65, 66, 67, 68, 69, 70, 72, 74, 75, 224, 226, 230, 231, 232, 240, 260, 261, 266, 309, 315, 316, 321], "batch": [1, 11, 17, 25, 47, 48, 49, 50, 54, 59, 62, 65, 66, 69, 80, 154, 155, 222, 223, 224, 226, 229, 230, 231, 232, 235, 240, 242, 262, 263, 264, 266, 283, 296, 299, 308, 316, 317, 318, 320, 321], "handl": [1, 13, 16, 17, 24, 29, 31, 53, 56, 156, 240, 256, 257, 315, 316, 320, 323], "ani": [1, 5, 10, 11, 13, 14, 15, 16, 17, 21, 22, 24, 25, 27, 29, 30, 33, 36, 37, 38, 42, 44, 47, 50, 51, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 77, 81, 221, 230, 236, 240, 242, 243, 249, 250, 251, 252, 254, 255, 256, 259, 270, 271, 272, 274, 277, 284, 287, 295, 298, 309, 314, 315, 316, 317, 320, 321, 322], "pad": [1, 45, 47, 48, 49, 50, 54, 74, 76, 77, 230, 232, 263, 265, 268, 283], "miscellan": 1, "modifi": [1, 10, 21, 22, 24, 25, 26, 234, 236, 246, 274, 308, 316, 318, 319, 320, 321, 322, 323], "For": [2, 7, 9, 10, 12, 14, 16, 18, 19, 20, 21, 22, 24, 25, 33, 35, 36, 37, 38, 42, 50, 53, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 79, 80, 81, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 155, 158, 159, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 224, 230, 232, 237, 240, 241, 244, 246, 247, 248, 260, 270, 276, 282, 287, 291, 293, 295, 307, 309, 310, 311, 312, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "detail": [2, 9, 10, 12, 13, 16, 21, 22, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 79, 80, 81, 82, 175, 232, 237, 265, 295, 309, 310, 311, 314, 316, 317, 318, 319, 320, 321, 322, 323], "usag": [2, 21, 236, 237, 239, 269, 273, 274, 296, 307, 314, 316, 317, 318, 321, 322, 323], "guid": [2, 23, 24, 26, 33, 35, 37, 42, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 267, 287, 308, 315, 317, 319, 320], "pleas": [2, 7, 22, 34, 40, 43, 79, 80, 81, 82, 89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 139, 143, 145, 146, 147, 152, 153, 164, 165, 166, 167, 178, 179, 185, 232, 237, 270, 271, 272, 293, 307, 311, 312, 316, 318, 323], "see": [2, 7, 9, 10, 11, 12, 13, 14, 16, 18, 19, 21, 22, 23, 26, 34, 40, 43, 47, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 75, 89, 90, 91, 99, 100, 109, 110, 111, 112, 124, 125, 126, 127, 135, 136, 139, 143, 145, 146, 147, 152, 153, 164, 165, 166, 167, 168, 175, 178, 179, 185, 195, 198, 199, 202, 207, 210, 211, 214, 225, 229, 231, 232, 242, 245, 254, 255, 260, 273, 287, 291, 293, 295, 301, 307, 308, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "overview": [2, 7, 24, 26, 240, 306, 309, 310, 311, 316, 317, 319, 320, 323], "support": [2, 4, 10, 11, 15, 16, 17, 18, 21, 22, 23, 25, 26, 27, 36, 37, 54, 55, 56, 59, 60, 61, 62, 65, 66, 67, 68, 69, 70, 73, 78, 96, 105, 119, 132, 141, 154, 161, 162, 163, 168, 169, 171, 180, 183, 184, 186, 221, 224, 232, 240, 241, 242, 247, 266, 271, 272, 274, 279, 281, 282, 308, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "sever": [2, 309, 321], "wide": [2, 9, 224, 319], "onli": [2, 4, 10, 16, 18, 22, 23, 26, 35, 36, 42, 54, 55, 56, 61, 67, 68, 74, 78, 82, 96, 105, 119, 132, 141, 156, 161, 162, 163, 168, 169, 171, 180, 186, 224, 228, 
230, 232, 237, 239, 240, 243, 246, 247, 249, 250, 252, 256, 270, 271, 272, 274, 277, 279, 281, 282, 314, 316, 317, 319, 320, 321, 322, 323], "help": [2, 11, 18, 19, 22, 63, 112, 230, 232, 240, 270, 287, 306, 307, 308, 309, 314, 315, 316, 317, 319, 321, 322, 323], "quickli": [2, 11, 24, 38, 57, 310, 315, 321], "bootstrap": [2, 11], "your": [2, 7, 9, 11, 12, 13, 14, 16, 17, 18, 21, 22, 23, 26, 27, 38, 57, 60, 64, 67, 68, 80, 81, 82, 159, 163, 232, 241, 287, 290, 291, 297, 306, 307, 308, 309, 310, 311, 314, 315, 318, 319, 320, 321, 322, 323], "fine": [2, 9, 10, 11, 12, 16, 18, 19, 20, 22, 23, 25, 26, 36, 54, 55, 56, 72, 246, 297, 306, 308, 309, 310, 311, 312, 316], "tune": [2, 4, 9, 10, 11, 12, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 29, 36, 54, 55, 56, 72, 246, 297, 306, 307, 308, 309, 310, 311, 312, 314, 316], "also": [2, 9, 10, 12, 14, 16, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 53, 60, 64, 67, 68, 72, 74, 75, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 184, 186, 190, 224, 230, 233, 267, 281, 287, 291, 297, 300, 307, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "like": [2, 6, 12, 22, 23, 24, 25, 26, 184, 232, 237, 239, 241, 272, 300, 307, 314, 315, 316, 317, 319, 320, 321, 322], "These": [2, 5, 10, 13, 15, 18, 19, 21, 22, 24, 25, 27, 54, 55, 68, 232, 261, 309, 310, 312, 315, 316, 317, 318, 320, 321, 322, 323], "ar": [2, 5, 9, 10, 11, 12, 13, 14, 15, 16, 18, 19, 20, 21, 22, 23, 24, 26, 27, 31, 35, 38, 39, 42, 45, 47, 48, 52, 54, 55, 56, 59, 60, 64, 65, 66, 67, 68, 74, 76, 77, 80, 86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 112, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 144, 145, 146, 147, 150, 151, 152, 153, 156, 161, 162, 163, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 222, 229, 230, 231, 232, 234, 240, 241, 242, 246, 247, 248, 250, 252, 261, 263, 270, 271, 273, 274, 276, 279, 280, 281, 285, 296, 297, 307, 308, 310, 312, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "especi": [2, 308, 314, 316, 321], "specifi": [2, 10, 12, 16, 18, 20, 22, 24, 25, 27, 31, 33, 35, 37, 42, 44, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 74, 76, 78, 95, 113, 118, 119, 128, 131, 132, 137, 141, 158, 160, 162, 177, 184, 186, 190, 194, 219, 224, 230, 231, 238, 239, 240, 246, 247, 253, 282, 291, 293, 296, 311, 312, 314, 315, 317, 318, 321, 322, 323], "yaml": [2, 10, 17, 18, 20, 24, 25, 27, 28, 29, 53, 60, 64, 67, 68, 72, 291, 308, 312, 314, 315, 316, 317, 318, 320, 322, 323], "config": [2, 9, 12, 13, 16, 17, 18, 19, 20, 21, 22, 23, 26, 27, 28, 29, 30, 53, 60, 64, 67, 68, 72, 102, 103, 104, 224, 252, 270, 274, 287, 291, 296, 308, 309, 310, 311, 312, 315, 316, 318, 319, 320, 321, 322, 323], "represent": [2, 269, 319, 320, 322, 323], "abov": [2, 4, 9, 16, 17, 18, 20, 22, 55, 236, 285, 307, 311, 316, 318, 320, 321, 322, 323], "text": [4, 5, 9, 11, 12, 15, 18, 19, 21, 35, 36, 37, 38, 39, 42, 44, 50, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 156, 241, 242, 254, 256, 257, 259, 261, 273, 315, 316, 322], "version": [4, 58, 74, 96, 105, 119, 132, 141, 161, 169, 171, 180, 186, 196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 224, 303, 307, 316, 318, 321, 322, 323], "famili": [4, 22, 25, 59, 61, 65, 66, 70, 71, 73, 273, 308, 314, 318, 319], "import": [4, 9, 10, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 27, 60, 64, 
65, 66, 67, 68, 72, 78, 223, 232, 233, 234, 235, 253, 264, 287, 290, 291, 315, 316, 317, 318, 319, 320, 321, 322, 323], "you": [4, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 36, 38, 55, 56, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 112, 222, 230, 232, 235, 237, 239, 242, 244, 273, 287, 290, 291, 297, 306, 307, 308, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "need": [4, 9, 10, 12, 14, 16, 18, 19, 20, 22, 23, 24, 25, 26, 38, 54, 56, 224, 228, 230, 232, 240, 241, 267, 287, 290, 291, 292, 307, 309, 310, 311, 312, 314, 315, 316, 317, 318, 320, 321, 323], "request": [4, 279, 316], "access": [4, 10, 22, 24, 25, 53, 270, 276, 309, 310, 311, 314, 317], "hug": [4, 11, 22, 32, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 131, 156, 160, 184, 194, 219, 258, 278, 308, 309, 314, 317, 318], "face": [4, 11, 22, 32, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 131, 156, 160, 184, 194, 219, 258, 278, 308, 309, 314, 317, 318], "befor": [4, 19, 22, 38, 52, 54, 65, 79, 80, 82, 159, 163, 224, 229, 230, 231, 232, 237, 239, 240, 242, 247, 257, 270, 287, 311, 314, 316, 321, 322], "download": [4, 10, 11, 16, 22, 65, 304, 307, 309, 310, 311, 315, 316, 318, 319, 320, 322, 323], "To": [4, 9, 12, 13, 14, 16, 17, 18, 19, 20, 22, 24, 25, 26, 54, 65, 230, 232, 242, 270, 297, 307, 308, 309, 311, 312, 314, 316, 317, 318, 319, 320, 321, 322, 323], "70b": [4, 84, 87, 90, 115, 121, 125, 129, 133, 135, 139, 143, 146, 165, 166, 167, 318], "meta": [4, 10, 15, 16, 20, 21, 22, 112, 226, 246, 270, 271, 309, 310, 311, 314, 315, 316, 317, 319], "ignor": [4, 9, 10, 12, 22, 42, 72, 228, 229, 231, 238, 239, 275, 297, 309, 310, 311, 314, 316, 319], "pattern": [4, 10, 19, 22, 257, 309, 310, 311, 314, 316, 319], "origin": [4, 10, 15, 16, 17, 20, 21, 22, 58, 59, 63, 236, 241, 242, 246, 247, 309, 310, 311, 315, 316, 318, 319, 320, 321, 322, 323], "consolid": [4, 10, 22, 309, 310, 311, 314, 316, 319], "00": [4, 10, 16, 22, 60, 64, 305, 309, 310, 311, 313, 314, 316, 317, 319], "pth": [4, 10, 22, 269, 309, 310, 311, 314, 316, 319], "hf": [4, 9, 18, 20, 21, 22, 264, 266, 270, 314, 315, 317, 318], "token": [4, 9, 10, 11, 12, 13, 15, 16, 17, 18, 19, 20, 22, 24, 25, 36, 42, 47, 49, 50, 51, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 76, 77, 78, 79, 80, 81, 82, 92, 95, 96, 101, 105, 113, 118, 119, 128, 131, 132, 137, 141, 156, 158, 159, 160, 162, 163, 169, 171, 173, 175, 177, 180, 182, 184, 186, 190, 194, 219, 224, 226, 229, 230, 231, 232, 237, 239, 240, 241, 242, 254, 255, 256, 257, 258, 259, 261, 263, 265, 268, 283, 310, 314, 316, 317, 318, 319, 320, 321, 322, 323], "hf_token": [4, 21, 309, 311, 319], "The": [4, 9, 11, 12, 15, 16, 18, 19, 20, 21, 22, 23, 24, 25, 26, 29, 30, 32, 36, 46, 47, 52, 53, 54, 55, 56, 60, 63, 64, 65, 66, 67, 68, 71, 79, 80, 81, 82, 86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 154, 156, 159, 161, 162, 163, 169, 171, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 221, 223, 225, 226, 227, 228, 232, 236, 237, 238, 239, 240, 241, 242, 246, 247, 248, 253, 254, 255, 256, 257, 258, 259, 261, 262, 264, 265, 266, 267, 270, 272, 274, 278, 279, 280, 282, 287, 291, 294, 296, 300, 301, 303, 307, 308, 309, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "reus": [4, 308], "llama3_token": [4, 15, 17, 20, 21, 65, 
66, 74, 315, 316, 318], "class": [4, 10, 13, 14, 15, 21, 24, 26, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 42, 43, 53, 54, 55, 56, 57, 65, 66, 79, 80, 81, 82, 95, 112, 117, 118, 131, 154, 155, 156, 160, 168, 171, 175, 176, 177, 184, 194, 219, 220, 221, 222, 223, 224, 225, 226, 227, 228, 229, 230, 231, 232, 237, 238, 239, 240, 241, 242, 243, 245, 246, 247, 249, 251, 253, 254, 255, 256, 257, 260, 261, 264, 265, 266, 269, 270, 271, 272, 273, 274, 287, 288, 289, 290, 291, 312, 315, 317, 319, 320, 321, 323], "1b": [4, 10, 17, 148, 150, 152, 306, 314, 316], "output": [4, 10, 12, 13, 14, 20, 21, 31, 35, 45, 53, 55, 56, 59, 62, 64, 67, 69, 70, 74, 82, 86, 87, 88, 92, 96, 101, 105, 113, 117, 119, 120, 121, 122, 123, 128, 132, 133, 134, 137, 141, 142, 144, 150, 151, 154, 155, 158, 159, 161, 162, 163, 169, 170, 171, 172, 173, 176, 180, 181, 186, 189, 190, 196, 197, 200, 201, 203, 204, 205, 206, 220, 221, 223, 224, 226, 227, 229, 230, 231, 232, 237, 239, 240, 241, 242, 246, 247, 251, 252, 261, 272, 286, 289, 296, 297, 307, 309, 310, 311, 314, 317, 318, 319, 320, 321, 323], "dir": [4, 10, 21, 22, 291, 307, 310, 311, 314, 317, 318, 319, 322], "tmp": [4, 9, 10, 12, 14, 15, 16, 17, 18, 19, 20, 21, 22, 24, 274, 310, 311, 314, 315, 316, 317, 319], "3b": [4, 22, 149, 151, 153, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 316], "languag": [4, 10, 16, 32, 74, 162, 241, 242, 246, 247, 264, 297, 309, 320, 321], "11b": [4, 157, 164], "8b": [4, 15, 16, 20, 21, 130, 134, 136, 140, 142, 144, 147, 181, 306, 309, 310, 311, 314, 315, 322], "405b": [4, 138, 142, 145], "weight": [4, 21, 22, 25, 50, 86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 144, 145, 146, 147, 150, 151, 152, 153, 157, 160, 161, 162, 163, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 228, 236, 245, 246, 247, 252, 256, 264, 270, 271, 272, 273, 282, 291, 297, 306, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 322, 323], "can": [4, 5, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 30, 33, 35, 36, 37, 38, 39, 42, 50, 53, 55, 56, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 79, 80, 82, 156, 159, 163, 223, 226, 228, 229, 230, 232, 237, 239, 240, 242, 244, 248, 256, 257, 270, 273, 275, 287, 290, 291, 293, 296, 306, 307, 308, 309, 310, 311, 312, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "instead": [4, 9, 12, 14, 16, 22, 25, 31, 45, 54, 55, 72, 82, 141, 162, 163, 228, 232, 247, 267, 270, 271, 272, 314, 318, 320, 321, 322], "remov": [4, 321], "flag": [4, 24, 25, 36, 59, 60, 62, 64, 68, 69, 70, 270, 271, 272, 277, 309, 314, 321, 323], "builder": [4, 9, 10, 11, 12, 13, 15, 16, 17, 22, 58, 60, 61, 64, 67, 68, 83, 84, 85, 86, 87, 88, 89, 90, 91, 93, 94, 97, 98, 99, 100, 102, 103, 104, 106, 107, 108, 109, 110, 111, 114, 115, 116, 117, 120, 121, 122, 123, 124, 125, 126, 127, 129, 130, 133, 134, 135, 136, 138, 139, 140, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 164, 165, 166, 167, 170, 172, 174, 176, 178, 179, 181, 183, 185, 187, 188, 189, 191, 192, 193, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 315, 321, 323], "all": [4, 5, 10, 11, 14, 15, 19, 21, 25, 30, 35, 36, 38, 42, 45, 47, 50, 53, 54, 55, 56, 
67, 82, 131, 154, 156, 160, 184, 194, 196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 219, 224, 228, 230, 232, 233, 234, 235, 236, 240, 241, 242, 244, 248, 250, 253, 260, 270, 274, 276, 280, 285, 292, 298, 299, 304, 306, 308, 309, 310, 311, 313, 314, 315, 316, 317, 318, 319, 320, 321, 322], "7b": [4, 9, 12, 14, 18, 19, 20, 21, 61, 73, 85, 88, 91, 94, 98, 116, 117, 122, 123, 126, 127, 170, 172, 174, 176, 179, 189, 193, 205, 206, 217, 218, 270, 271, 315, 317, 318, 320, 323], "13b": [4, 83, 86, 89, 114, 120, 124], "codellama": 4, "size": [4, 14, 15, 16, 22, 25, 27, 45, 50, 59, 62, 65, 66, 69, 80, 81, 82, 154, 155, 156, 157, 159, 160, 161, 163, 196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 222, 223, 224, 225, 226, 229, 230, 231, 232, 235, 237, 239, 240, 241, 242, 261, 262, 263, 283, 285, 302, 308, 311, 314, 317, 318, 320, 321, 322], "0": [4, 9, 10, 12, 14, 15, 16, 18, 20, 22, 25, 45, 47, 48, 49, 50, 54, 60, 64, 67, 68, 74, 75, 77, 78, 82, 86, 87, 88, 89, 90, 91, 92, 96, 97, 98, 99, 100, 101, 105, 106, 107, 108, 109, 110, 111, 113, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 132, 133, 134, 135, 136, 137, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 155, 156, 158, 159, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 173, 175, 178, 179, 180, 181, 182, 185, 186, 187, 188, 189, 190, 191, 192, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 222, 223, 224, 230, 232, 233, 234, 235, 241, 246, 247, 253, 259, 264, 265, 266, 267, 268, 277, 278, 283, 287, 290, 291, 295, 300, 303, 305, 309, 311, 313, 315, 316, 317, 318, 320, 321, 322, 323], "5b": [4, 187, 188, 191, 192, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 321], "14b": [4, 196, 197, 208, 209], "32b": [4, 200, 201, 212, 213], "72b": [4, 203, 204, 215, 216], "qwen2": [4, 10, 186, 187, 188, 189, 191, 192, 193, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 273, 309, 316, 321], "exampl": [4, 10, 19, 21, 22, 23, 24, 25, 26, 27, 29, 33, 35, 37, 38, 42, 44, 45, 46, 47, 48, 49, 50, 53, 54, 56, 59, 60, 61, 62, 64, 65, 66, 67, 68, 69, 70, 72, 73, 74, 76, 77, 78, 82, 155, 156, 159, 163, 222, 223, 224, 232, 233, 234, 235, 237, 239, 240, 241, 242, 244, 245, 246, 247, 248, 253, 254, 255, 256, 257, 259, 260, 264, 266, 268, 269, 270, 271, 273, 274, 282, 283, 287, 290, 291, 294, 297, 300, 301, 303, 304, 305, 307, 309, 310, 311, 313, 314, 315, 316, 318, 319, 320, 321, 322, 323], "qwen2_5": [4, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219], "1_5b": 4, "mini": [4, 21, 181, 182, 183, 184, 185], "4k": [4, 21, 182, 183, 184], "microsoft": [4, 183, 184], "ai": [4, 10, 12, 14, 19, 55, 56, 174, 291, 315, 318], "thi": [4, 9, 10, 12, 13, 14, 16, 17, 18, 19, 20, 21, 23, 24, 25, 26, 27, 31, 33, 34, 35, 36, 37, 42, 43, 44, 45, 47, 48, 50, 53, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 76, 80, 81, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 154, 156, 158, 159, 162, 163, 168, 169, 171, 173, 175, 180, 182, 183, 184, 186, 190, 220, 222, 224, 226, 228, 229, 230, 231, 232, 233, 234, 235, 236, 237, 240, 241, 242, 244, 245, 246, 247, 248, 252, 253, 254, 255, 256, 257, 259, 260, 261, 263, 264, 265, 267, 269, 270, 271, 
272, 274, 278, 279, 281, 283, 285, 287, 288, 290, 291, 292, 293, 295, 297, 299, 300, 306, 307, 308, 309, 310, 311, 314, 315, 316, 317, 318, 319, 320, 321, 322, 323], "v0": [4, 9, 14, 18, 19, 21, 168], "mistralai": [4, 21, 314], "safetensor": [4, 22, 269, 270, 314, 316, 319], "2b": [4, 93, 97, 103, 107], "googl": [4, 93, 94, 102, 103, 104], "gguf": 4, "9b": [4, 104, 108], "27b": [4, 102, 106], "model_s": 4, "b": [4, 15, 25, 45, 47, 154, 155, 222, 224, 226, 230, 231, 240, 247, 262, 263, 283, 291, 316, 320, 323], "compon": [4, 6, 14, 21, 22, 25, 30, 48, 55, 56, 65, 66, 246, 308, 312, 317, 319, 320, 323], "multimod": [4, 11, 14, 36, 42, 56, 65, 66, 67, 240, 307], "encod": [4, 5, 15, 21, 50, 56, 74, 75, 82, 154, 155, 157, 158, 159, 161, 162, 163, 224, 229, 230, 231, 235, 240, 241, 242, 244, 254, 256, 257, 259, 261, 264, 267, 273, 315], "perform": [5, 12, 13, 17, 19, 20, 21, 22, 54, 74, 232, 237, 248, 260, 267, 308, 310, 311, 315, 316, 318, 319, 321, 322, 323], "direct": [5, 18, 25, 48, 86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181, 246, 264, 307, 312, 321], "id": [5, 14, 17, 21, 22, 47, 48, 49, 50, 54, 61, 65, 66, 73, 74, 75, 77, 78, 156, 224, 226, 230, 231, 240, 254, 255, 256, 257, 258, 259, 261, 270, 272, 287, 315], "decod": [5, 9, 12, 14, 15, 16, 18, 20, 21, 60, 64, 67, 68, 74, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 155, 156, 157, 158, 159, 161, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 224, 229, 230, 231, 235, 240, 242, 244, 254, 256, 257, 315, 316], "typic": [5, 9, 12, 20, 24, 33, 37, 42, 50, 54, 55, 56, 57, 72, 184, 244, 264, 267, 321, 322, 323], "byte": [5, 21, 257, 321, 323], "pair": [5, 10, 18, 21, 24, 48, 49, 63, 68, 71, 257], "underli": [5, 13, 18, 21, 223, 256, 321, 323], "helper": 5, "method": [5, 13, 14, 15, 19, 21, 22, 24, 25, 26, 29, 46, 55, 57, 59, 60, 61, 62, 63, 64, 68, 69, 70, 71, 72, 73, 156, 230, 236, 237, 240, 243, 244, 245, 249, 254, 255, 274, 282, 307, 308, 320, 323], "two": [5, 15, 18, 19, 22, 24, 35, 50, 52, 65, 66, 74, 75, 80, 232, 241, 244, 246, 261, 268, 269, 308, 309, 311, 316, 317, 318, 320, 321, 322, 323], "pre": [5, 9, 11, 12, 17, 18, 19, 20, 54, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 112, 157, 160, 161, 232, 240, 242, 244, 246, 311, 315, 321], "train": [5, 9, 10, 11, 12, 13, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 31, 33, 35, 50, 53, 54, 55, 56, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 112, 156, 157, 160, 161, 221, 223, 224, 226, 230, 231, 236, 237, 239, 240, 241, 242, 244, 246, 264, 267, 269, 270, 271, 272, 273, 274, 275, 276, 277, 278, 279, 280, 281, 282, 283, 284, 285, 286, 287, 288, 289, 290, 291, 292, 293, 294, 295, 296, 297, 298, 306, 308, 309, 310, 312, 314, 315, 316, 318, 319, 320, 321, 322, 323], "function": [5, 10, 22, 24, 25, 27, 29, 45, 46, 47, 48, 60, 64, 67, 68, 74, 80, 81, 82, 159, 163, 220, 223, 224, 232, 233, 236, 248, 252, 253, 264, 265, 267, 270, 295, 297, 299, 300, 302, 308, 309, 316, 319, 323], "preprocess": [5, 54, 232], "imag": [5, 11, 15, 35, 36, 37, 42, 44, 46, 50, 56, 65, 66, 67, 79, 80, 81, 82, 154, 155, 156, 157, 159, 160, 161, 163, 232, 241, 261, 320], "loss": [6, 9, 12, 14, 24, 25, 36, 38, 55, 56, 59, 60, 62, 64, 68, 69, 70, 237, 238, 239, 264, 265, 266, 309, 316, 317, 319, 320, 323], "algorithm": [6, 21, 262, 267, 295], "ppo": [6, 262, 263, 264, 265, 312], "dpo": [6, 18, 48, 55, 248, 264, 266, 267, 309, 312], "offer": 7, "allow": [7, 10, 53, 242, 246, 252, 290, 311, 314, 321, 322, 
323], "seamless": 7, "transit": 7, "between": [7, 9, 18, 19, 21, 22, 55, 60, 68, 158, 162, 229, 230, 234, 240, 263, 265, 267, 270, 273, 287, 309, 318, 319, 320, 322, 323], "interoper": [7, 22, 25, 308, 316, 323], "rest": [7, 315, 321, 323], "ecosystem": [7, 22, 25, 308, 318, 323], "comprehens": [7, 321], "deep": [7, 22, 23, 24, 25, 26, 240, 242, 244, 308, 312, 316, 317, 318, 321], "dive": [7, 22, 23, 24, 25, 26, 308, 311, 312, 316, 317, 318, 321], "util": [7, 14, 16, 22, 24, 25, 27, 45, 47, 50, 154, 275, 290, 292, 293, 299, 300, 301, 302, 303, 308, 316, 317, 321, 323], "work": [7, 22, 25, 35, 42, 67, 228, 241, 242, 308, 311, 314, 316, 318, 321, 323], "set": [7, 9, 12, 17, 18, 20, 22, 23, 24, 25, 26, 33, 36, 37, 42, 50, 54, 55, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 72, 73, 113, 119, 128, 132, 137, 141, 158, 161, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224, 226, 229, 230, 233, 234, 235, 240, 248, 251, 274, 285, 287, 293, 294, 295, 296, 299, 300, 308, 312, 314, 315, 316, 317, 318, 319, 320, 321, 322], "enabl": [7, 10, 11, 17, 21, 23, 24, 25, 26, 53, 86, 87, 88, 89, 90, 91, 97, 98, 99, 100, 106, 107, 108, 109, 110, 111, 120, 121, 122, 123, 124, 125, 126, 127, 133, 134, 135, 136, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 164, 166, 167, 170, 172, 178, 179, 181, 185, 187, 188, 189, 191, 192, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 210, 211, 214, 224, 229, 230, 231, 233, 234, 235, 240, 242, 295, 296, 311, 316, 318, 320, 321, 323], "consumpt": [7, 53, 76, 310], "dure": [7, 10, 11, 22, 54, 59, 60, 62, 64, 68, 69, 70, 222, 223, 224, 226, 230, 231, 232, 236, 240, 241, 253, 267, 281, 309, 310, 311, 315, 316, 318, 320, 321, 322, 323], "control": [7, 13, 18, 21, 25, 36, 59, 60, 62, 64, 68, 69, 70, 234, 235, 242, 246, 247, 248, 287, 295, 311, 316, 321], "lr": [7, 24, 274, 278, 280, 319, 321], "process": [7, 11, 14, 15, 17, 25, 26, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 82, 159, 163, 232, 236, 277, 284, 295, 302, 309, 317, 322, 323], "variou": 7, "provid": [7, 10, 11, 12, 14, 22, 24, 25, 27, 32, 33, 35, 37, 42, 46, 47, 51, 53, 54, 74, 76, 82, 224, 228, 230, 232, 240, 246, 247, 248, 259, 264, 272, 287, 291, 296, 300, 308, 310, 311, 314, 315, 316, 317, 318, 321], "debug": [7, 22, 24, 25, 287, 314], "finetun": [7, 10, 24, 25, 86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 240, 306, 308, 311, 317, 318, 321], "job": [7, 10, 26, 295, 317], "involv": [9, 12, 17, 20, 56, 322], "multi": [9, 18, 25, 224, 318], "turn": [9, 18, 25, 33, 36, 37, 42, 52, 55, 68, 315, 321], "multipl": [9, 16, 17, 18, 22, 24, 25, 33, 36, 37, 42, 48, 53, 56, 68, 154, 155, 224, 230, 231, 232, 240, 247, 287, 288, 289, 290, 291, 296, 317, 318, 319, 321], "back": [9, 21, 22, 52, 248, 270, 320, 321, 323], "forth": [9, 52], "user": [9, 12, 13, 14, 15, 16, 18, 19, 21, 22, 25, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 46, 52, 55, 56, 60, 64, 67, 68, 95, 113, 118, 119, 128, 131, 132, 137, 141, 158, 160, 162, 169, 171, 173, 175, 177, 180, 182, 184, 186, 190, 194, 219, 224, 259, 312, 315, 316, 317, 322], "assist": [9, 12, 13, 14, 15, 16, 18, 19, 21, 31, 32, 33, 35, 36, 37, 38, 39, 41, 42, 44, 52, 55, 56, 60, 68, 74, 95, 112, 118, 131, 160, 177, 184, 194, 219, 259, 315, 316], "role": [9, 13, 14, 15, 16, 18, 19, 21, 33, 36, 37, 38, 39, 42, 44, 55, 56, 60, 68, 95, 118, 131, 156, 160, 177, 184, 194, 219, 259, 315, 316], "content": [9, 13, 15, 16, 
170, 171, 172, 180, 181, 186, 187, 188, 189, 190, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 236, 238, 240, 246, 247, 252, 253, 256, 257, 279, 280, 281, 287, 297, 315, 319], "sai": [19, 22, 314, 315, 316, 317], "eureka": 19, "eurekatempl": 19, "formatted_dialogu": 19, "llama2chattempl": [19, 118, 168, 315], "paradigm": [20, 25, 309, 310, 321], "unstructur": [20, 57, 72, 73], "unlabel": 20, "text_complet": 20, "odyssei": 20, "clear": [20, 321], "river": 20, "oceanu": 20, "had": 20, "got": [20, 50], "sea": 20, "went": 20, "till": 20, "reach": 20, "aeaean": 20, "island": 20, "dawn": 20, "sunris": 20, "drew": 20, "ship": 20, "sand": 20, "shore": 20, "sleep": 20, "wait": [20, 296], "break": [20, 156, 257], "child": 20, "morn": 20, "rosi": 20, "finger": 20, "appear": [20, 321], "sent": [20, 291], "men": 20, "circ": 20, "hous": 20, "fetch": [20, 320], "bodi": 20, "elpenor": 20, "cut": 20, "firewood": 20, "wood": 20, "headland": 20, "jut": 20, "wept": 20, "over": [20, 21, 22, 25, 36, 56, 238, 239, 264, 278, 308, 311, 314, 316, 319, 320, 321, 323], "him": 20, "lament": 20, "funer": 20, "rite": 20, "armour": 20, "been": [20, 74, 76, 222, 230, 240, 268, 274, 315, 321, 322], "burn": 20, "ash": 20, "rais": [20, 22, 27, 30, 33, 35, 37, 42, 44, 46, 47, 50, 52, 53, 55, 59, 60, 62, 64, 65, 66, 67, 69, 70, 72, 76, 82, 186, 222, 224, 228, 229, 230, 232, 233, 234, 235, 246, 247, 252, 259, 270, 271, 272, 274, 279, 280, 281, 284, 287, 291, 295, 297, 298, 299], "cairn": 20, "stone": 20, "top": [20, 75, 78, 159, 163, 276, 316, 321, 323], "oar": 20, "he": 20, "row": [20, 55, 56, 224, 230, 231, 240], "text_completion_dataset": [20, 322], "128000": [20, 315, 322], "6153": 20, "584": 20, "1051": 20, "2867": 20, "279": 20, "15140": 20, "22302": 20, "355": 20, "11": [20, 45, 47, 48, 232, 322, 323], "323": 20, "1047": 20, "2751": 20, "704": 20, "1139": 20, "1825": 20, "9581": 20, "4024": 20, "389": 20, "12222": 20, "8813": 20, "362": 20, "12791": 20, "5420": 20, "13218": 20, "1405": 20, "1070": 20, "374": 20, "39493": 20, "64919": 20, "439": 20, "304": 20, "1023": 20, "7634": 20, "1226": 20, "1243": 20, "24465": 20, "1057": 20, "8448": 20, "311": 20, "70163": 20, "1077": 20, "31284": 20, "6212": 20, "30315": 20, "1938": 20, "1288": 20, "1464": 20, "128001": [20, 322], "similarli": [20, 22, 131, 156, 160, 184, 194, 219, 322], "wikimedia": 20, "wikipedia": [20, 46, 73], "cnn_dailymail_articles_dataset": 20, "index": [21, 22, 48, 49, 50, 53, 54, 224, 226, 231, 238, 240, 263, 278, 300, 307, 315, 316], "embed": [21, 22, 79, 80, 81, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 154, 155, 156, 158, 159, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 222, 224, 225, 226, 229, 230, 232, 240, 241, 242, 244, 315, 318, 321, 322], "vector": [21, 246, 266, 309, 315, 321], "understood": 21, "plai": [21, 316, 321], "necessari": [21, 22, 55, 56, 287, 288, 289, 290, 291, 315, 316, 320], "phi3": [21, 22, 180, 181, 183, 184, 185, 273, 314, 316], "phi3_mini_token": 21, "p_token": 21, "phi": [21, 183, 184, 273], "32010": 21, "29871": 21, "1792": [21, 259], "9508": [21, 259], "32007": 21, "32001": 21, "4299": [21, 22], "2933": [21, 259], "nuser": 21, "nmodel": 21, "sentencepiec": [21, 256, 318], "tiktoken": [21, 156, 257, 318], "host": [21, 307, 314, 317, 321], "distribut": [21, 78, 274, 284, 285, 293, 295, 300, 308, 309, 312, 314, 317, 318, 319, 321], "alongsid": [21, 309, 321], "alreadi": [21, 24, 33, 37, 42, 62, 63, 65, 66, 67, 68, 69, 70, 222, 224, 234, 235, 240, 273, 284, 307, 314, 316, 319, 320, 321], 
"_token": [21, 25], "mistraltoken": [21, 177, 315], "adher": [21, 37, 42], "arbitrarili": 21, "small": [21, 225, 316, 321], "seq": [21, 230, 240], "len": [21, 22, 50, 53, 59, 62, 65, 66, 69, 230, 232, 240], "demonstr": [21, 321, 322], "7": [21, 22, 45, 47, 48, 49, 50, 222, 232, 261, 265], "6312": 21, "28709": 21, "assign": [21, 24, 55, 56], "uniqu": [21, 55, 56, 250, 273], "abil": 21, "NOT": [21, 92, 156, 173, 316], "presenc": [21, 31], "certain": [21, 22, 24, 296, 315], "proper": [21, 307, 317], "end_of_text": 21, "special_token": [21, 257, 315], "added_token": 21, "128257": 21, "128258": 21, "remain": [21, 37, 42, 223, 228, 278, 319, 320, 321], "special_tokens_path": [21, 131, 156, 160, 184, 194, 219], "basetoken": 21, "actual": [21, 23, 24, 26, 31, 33, 35, 55, 56, 59, 62, 63, 64, 66, 67, 68, 69, 71, 156, 311, 315, 322], "string": [21, 22, 35, 36, 38, 44, 60, 61, 95, 118, 131, 156, 160, 177, 184, 194, 219, 245, 246, 247, 253, 254, 256, 257, 259, 269, 275, 279, 282, 287, 300, 314, 321], "kwarg": [21, 24, 27, 32, 39, 194, 219, 221, 223, 229, 231, 236, 242, 245, 246, 247, 254, 255, 260, 284, 287, 288, 289, 290, 291, 293, 296], "dict": [21, 22, 25, 26, 31, 33, 35, 36, 37, 38, 42, 44, 47, 48, 49, 50, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 95, 118, 131, 160, 177, 184, 194, 219, 229, 231, 236, 240, 242, 243, 249, 250, 251, 252, 254, 255, 257, 258, 260, 270, 271, 272, 274, 276, 277, 281, 284, 286, 287, 292, 297, 299, 321], "given": [21, 25, 27, 44, 47, 52, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 75, 77, 156, 234, 235, 247, 248, 254, 255, 263, 279, 282, 292, 300, 303, 308, 320], "token_id": [21, 156, 254, 257], "its": [21, 54, 112, 168, 171, 224, 226, 230, 231, 240, 242, 246, 274, 292, 295, 314, 315, 316, 318, 320, 321], "sentencepiecebasetoken": [21, 254], "bpe": 21, "sp_token": 21, "reason": [21, 22, 25, 74, 321, 322], "walk": [22, 25, 290, 308, 315, 316, 317, 322, 323], "design": [22, 25, 267, 314], "cover": [22, 23, 24, 25, 26, 315, 316, 323], "scenario": [22, 53, 156], "compos": [22, 232], "plug": [22, 321], "evalu": [22, 25, 223, 253, 306, 308, 311, 312, 317, 319, 320, 323], "gener": [22, 25, 47, 54, 61, 72, 75, 76, 77, 78, 156, 223, 225, 234, 235, 248, 262, 280, 287, 294, 295, 296, 304, 306, 311, 315, 319, 320, 321, 322, 323], "easi": [22, 25, 308, 309, 316, 320, 321], "understand": [22, 24, 25, 242, 306, 308, 309, 310, 315, 316, 320, 321, 323], "concept": [22, 309, 312, 316, 317, 321], "talk": 22, "close": [22, 25, 287, 288, 289, 290, 291, 320], "veri": [22, 53, 230, 240, 314, 316], "dictat": 22, "state_dict": [22, 236, 241, 242, 250, 252, 270, 271, 272, 273, 274, 297, 320, 323], "store": [22, 55, 56, 287, 288, 291, 320, 321, 323], "disk": [22, 57, 288], "identifi": [22, 287], "state": [22, 25, 155, 230, 232, 234, 236, 240, 243, 249, 250, 251, 252, 262, 264, 270, 271, 272, 274, 276, 277, 297, 309, 316, 318, 320, 323], "match": [22, 44, 287, 297, 307, 314, 316, 318, 320], "up": [22, 23, 25, 26, 50, 54, 61, 73, 156, 230, 234, 235, 240, 257, 261, 276, 287, 296, 310, 311, 312, 314, 315, 317, 318, 320, 321, 323], "exactli": [22, 269, 322], "definit": [22, 320], "either": [22, 47, 55, 56, 74, 224, 230, 231, 270, 287, 293, 307, 314, 320, 322, 323], "explicit": 22, "error": [22, 24, 34, 52, 270, 295, 314], "except": [22, 36, 168, 196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 211, 212, 213, 215, 216, 217, 218, 259, 316, 319], "wors": [22, 321], "silent": 22, "succe": 22, "popular": [22, 240, 308, 316], "offici": [22, 
112, 315, 317, 318], "websit": 22, "inspect": [22, 316, 320, 323], "mmap": 22, "weights_onli": [22, 272], "map_loc": 22, "cpu": [22, 25, 235, 236, 250, 277, 279, 296, 300, 307, 314, 323], "tensor": [22, 45, 47, 48, 49, 50, 74, 75, 76, 77, 78, 79, 80, 81, 82, 154, 155, 220, 221, 222, 223, 224, 225, 226, 227, 229, 230, 231, 232, 236, 237, 238, 239, 240, 241, 242, 246, 247, 253, 262, 263, 264, 265, 266, 268, 270, 277, 283, 287, 288, 289, 290, 291, 294, 297, 299, 319, 320, 321, 323], "item": 22, "f": [22, 26, 59, 62, 65, 66, 69, 269, 315, 316, 319, 320, 323], "tok_embed": [22, 230, 240, 241], "128256": 22, "3072": 22, "255": 22, "tabl": [22, 241, 315, 316, 318, 319, 321, 323], "layer": [22, 25, 82, 86, 87, 88, 89, 90, 91, 92, 96, 97, 98, 99, 100, 101, 105, 106, 107, 108, 109, 110, 111, 113, 117, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 132, 133, 134, 135, 136, 137, 141, 142, 144, 145, 146, 147, 150, 151, 152, 153, 155, 158, 159, 161, 162, 163, 164, 167, 169, 170, 171, 172, 173, 175, 176, 178, 179, 180, 181, 182, 185, 186, 187, 188, 189, 190, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 224, 227, 228, 229, 230, 231, 232, 233, 234, 235, 240, 242, 244, 246, 247, 252, 253, 275, 308, 310, 318, 320, 321, 322, 323], "dim": [22, 50, 154, 155, 220, 223, 224, 225, 226, 230, 237, 239, 240, 319], "within": [22, 24, 27, 54, 74, 78, 79, 96, 105, 119, 132, 141, 161, 162, 163, 169, 171, 180, 186, 232, 234, 235, 290, 295, 296, 314, 320, 323], "big": 22, "piec": 22, "safe_open": 22, "00001": [22, 314, 316, 319], "00002": [22, 314, 316, 319], "framework": [22, 25, 308], "pt": [22, 26, 270, 271, 272, 316, 318, 319, 322], "k": [22, 75, 78, 80, 224, 320], "get_tensor": 22, "embed_token": 22, "187": [22, 322], "Not": [22, 321], "fewer": [22, 224], "sinc": [22, 24, 27, 55, 56, 239, 270, 272, 315, 316, 318, 321, 322], "mismatch": 22, "caus": [22, 256], "re": [22, 24, 234, 242, 267, 272, 308, 309, 310, 311, 315, 316, 317, 320, 321], "end": [22, 25, 36, 57, 72, 82, 156, 232, 257, 259, 306, 308, 315, 318, 320, 322], "number": [22, 25, 44, 50, 54, 61, 73, 74, 79, 80, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 154, 155, 156, 158, 159, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 222, 223, 224, 230, 232, 238, 239, 261, 270, 271, 272, 274, 275, 278, 295, 296, 302, 309, 314, 317, 319, 320, 321], "save": [22, 25, 26, 230, 236, 237, 239, 240, 270, 271, 272, 274, 291, 306, 311, 314, 315, 316, 318, 320, 321, 322], "less": [22, 50, 74, 316, 317, 318, 321, 323], "prone": 22, "invari": 22, "accept": [22, 24, 317, 321, 323], "explicitli": [22, 245, 308, 320], "produc": [22, 274, 309, 311, 316, 322, 323], "One": [22, 50, 322], "advantag": [22, 262, 265, 311, 320], "abl": [22, 25, 316, 317, 322], "post": [22, 232, 292, 296, 311, 316, 318, 322, 323], "quantiz": [22, 86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 272, 282, 306, 307, 309, 310, 312, 317, 323], "eval": [22, 306, 308, 322], "without": [22, 24, 26, 224, 228, 230, 234, 240, 246, 247, 252, 307, 308, 311, 315, 320, 321, 322], "OR": 22, "surround": [22, 25, 308], "load_checkpoint": [22, 25, 270, 271, 272, 273], "save_checkpoint": [22, 25, 26, 270, 271, 272], "permut": 22, "behav": 22, "further": [22, 232, 267, 314, 316, 319, 320, 
321, 322, 323], "illustr": [22, 65, 66, 318], "whilst": [22, 309, 310, 321], "read": [22, 270, 271, 272, 308, 321], "compat": [22, 270, 272, 316, 321], "mention": [22, 316, 321, 323], "assum": [22, 35, 42, 45, 47, 65, 67, 95, 118, 131, 160, 177, 184, 194, 219, 222, 224, 226, 231, 240, 241, 243, 249, 250, 257, 274, 276, 278, 279, 280, 315, 320], "checkpoint_dir": [22, 24, 270, 271, 272, 316, 318, 319, 322], "easiest": [22, 316, 317], "everyth": [22, 25, 308, 312, 317], "flow": [22, 54, 322, 323], "output_dir": [22, 24, 270, 271, 272, 296, 316, 318, 320, 322, 323], "snippet": 22, "explain": [22, 321], "fullmodelhfcheckpoint": [22, 316, 319], "sort": [22, 270, 272], "order": [22, 23, 25, 270, 272, 290, 291, 317, 321], "matter": [22, 270, 272, 314, 320], "checkpoint_fil": [22, 24, 26, 270, 271, 272, 316, 318, 319, 320, 322, 323], "model_typ": [22, 270, 271, 272, 316, 318, 322], "restart": [22, 314, 316], "later": [22, 316, 321], "resume_from_checkpoint": [22, 270, 271, 272], "discrep": [22, 270], "github": [22, 27, 75, 86, 87, 88, 97, 98, 102, 103, 104, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 226, 237, 238, 264, 265, 266, 267, 278, 307, 316, 318, 319], "repositori": [22, 55, 56, 57, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 112, 309, 310, 311, 317], "fullmodelmetacheckpoint": [22, 318, 322], "test": [22, 24, 25, 74, 308, 311, 315, 321], "congrat": [22, 316], "far": 22, "tree": [22, 290, 316, 318], "outputdir": [22, 316], "recipe_st": [22, 270, 271, 272, 316], "hold": [22, 285, 316, 317], "last": [22, 36, 51, 54, 72, 230, 263, 274, 278, 316], "epoch": [22, 25, 26, 270, 271, 272, 274, 278, 314, 315, 316, 317, 318, 322], "log": [22, 25, 28, 264, 265, 266, 267, 281, 286, 287, 288, 289, 290, 291, 301, 309, 316, 317, 318, 319, 320, 321, 323], "metric_logg": [22, 23, 24, 25, 26], "epoch_": [22, 270, 271, 272, 316], "plu": [22, 316], "metadata": [22, 26, 316, 322], "push": [22, 316], "exclud": [22, 283, 298], "larg": [22, 53, 237, 239, 246, 247, 296, 314, 321, 323], "lightweight": [22, 257, 315, 316], "mostli": 22, "easier": [22, 24, 316, 317], "applic": [22, 25, 270, 271, 291], "check": [22, 79, 80, 81, 82, 229, 230, 231, 232, 240, 242, 252, 277, 279, 285, 298, 303, 306, 308, 309, 310, 311, 312, 315, 316, 317, 320, 321], "tutori": [22, 293, 308, 310, 311, 315, 316, 317, 318, 319, 320, 321, 322, 323], "llama3_2_3b": [22, 151, 316], "lora_single_devic": [22, 316], "epoch_0": [22, 316], "adapter_config": [22, 316], "adapter_model": [22, 270, 271, 272, 316], "ft": [22, 316], "generation_config": [22, 316], "licens": [22, 316], "txt": [22, 57, 72, 194, 219, 288, 316, 317], "orig_param": [22, 316], "original_repo_id": [22, 316], "readm": [22, 314, 316, 318], "md": [22, 314, 316], "special_tokens_map": [22, 316], "tokenizer_config": [22, 184, 316], "use_polici": [22, 316], "epoch_1": [22, 316], "log_1734652101": [22, 316], "written": [22, 24, 25, 270, 271, 287, 288, 289, 290, 291, 308], "partit": [22, 270, 323], "key_1": [22, 272], "weight_1": 22, "key_2": 22, "weight_2": 22, "mid": 22, "chekpoint": 22, "middl": [22, 242, 321], "subsequ": [22, 25, 222, 230, 232, 261], "etc": [22, 25, 155, 270, 281, 309, 316, 317], "flood": 22, "overwritten": 22, "sometim": [22, 24], "interrupt": 22, "previou": [22, 54, 270, 271, 272, 319], "updat": [22, 24, 25, 38, 222, 224, 230, 240, 246, 260, 264, 265, 274, 296, 299, 307, 315, 316, 317, 318, 320, 321, 322, 323], "your_epoch": 22, "notic": [22, 24, 79, 80, 
81, 232, 315, 316, 320], "section": [22, 25, 281, 306, 316, 318, 321, 323], "adapter_checkpoint": [22, 270, 271, 272], "still": [22, 156, 237, 239, 241, 242, 246, 310, 320, 322, 323], "adapt": [22, 86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 155, 169, 170, 171, 172, 180, 181, 240, 242, 243, 245, 246, 247, 248, 249, 250, 251, 270, 271, 272, 309, 310, 315, 316, 320, 323], "influenc": 22, "resuming_from_checkpoint": 22, "save_adapter_weights_onli": [22, 316], "merg": [22, 27, 28, 194, 219, 270, 316, 318, 323], "howev": [22, 53, 184, 307, 319, 321], "therefor": [22, 237, 239, 321, 323], "untrain": [22, 315], "weigth": 22, "choos": [22, 60, 320], "reduc": [22, 230, 264, 308, 310, 311, 316, 319, 320, 321, 322, 323], "amount": 22, "storag": [22, 246, 247, 314, 323], "knowledg": [22, 306], "forward": [22, 25, 79, 80, 81, 154, 155, 220, 221, 223, 224, 225, 226, 227, 229, 230, 231, 232, 234, 235, 237, 238, 239, 240, 241, 242, 246, 247, 253, 264, 265, 266, 281, 296, 318, 319, 320, 321, 323], "right": [22, 47, 50, 77, 230, 270, 316, 318, 320], "full_single_devic": 22, "pytorch_fil": 22, "torchtune_sd": 22, "load_state_dict": [22, 240, 241, 242, 252, 274, 297, 320], "vocab": [22, 27, 194, 219, 230, 240, 241, 318], "24": [22, 232, 317, 318], "randint": 22, "dtype": [22, 24, 25, 78, 222, 224, 229, 230, 231, 233, 234, 235, 236, 240, 242, 279, 294, 298, 316, 319, 321, 322, 323], "1658": 22, "2459": 22, "3259": 22, "3262": 22, "6": [22, 45, 47, 48, 49, 50, 54, 92, 96, 101, 105, 225, 232, 283, 311, 316, 322, 323], "5942": 22, "2284": 22, "4090": [22, 316], "0129": 22, "0121": 22, "0127": 22, "5": [22, 24, 45, 47, 48, 49, 50, 76, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 219, 223, 232, 253, 264, 267, 268, 278, 309, 316, 317, 318, 319, 321], "6462": 22, "8787": 22, "0950": 22, "6460": 22, "6455": 22, "6457": 22, "4156": 22, "0626": 22, "0362": 22, "6432": 22, "6437": 22, "6427": 22, "5679": 22, "6902": 22, "5267": 22, "6137": 22, "6138": 22, "6127": 22, "3688": 22, "1350": 22, "1764": 22, "4563": 22, "4565": 22, "4564": 22, "find": [22, 23, 25, 26, 264, 314, 316, 317, 319, 320, 321], "hope": 22, "deeper": [22, 309, 310, 311, 317, 321], "insight": [22, 316], "happi": [22, 316], "cometlogg": 23, "checkpoint": [23, 24, 25, 236, 240, 242, 257, 269, 270, 271, 272, 273, 274, 275, 291, 293, 297, 308, 311, 314, 316, 318, 319, 320, 322, 323], "workspac": [23, 26, 287], "seen": [23, 26, 320, 323], "screenshot": [23, 26], "comet_ml": [23, 287], "featur": [23, 25, 26, 55, 307, 308, 309, 310, 311, 316, 317, 321], "pip": [23, 26, 287, 290, 291, 307, 316, 318, 321], "login": [23, 26, 287, 291, 314], "metric_log": [23, 24, 26, 287, 288, 289, 290, 291], "experiment_nam": [23, 287], "experi": [23, 24, 287, 291, 306, 308, 318, 319, 320], "grab": [23, 26, 318], "tab": [23, 26], "asset": 23, "artifact": [23, 26, 296, 316], "click": [23, 26], "effect": [24, 223, 267, 319, 321, 322], "prerequisit": [24, 315, 316, 317, 318, 319, 320, 322, 323], "Be": [24, 315, 316, 317, 318, 319, 320, 321, 322, 323], "familiar": [24, 315, 316, 317, 318, 319, 320, 322, 323], "fundament": [24, 322], "reproduc": [24, 287], "overridden": [24, 296], "quick": 24, "seed": [24, 25, 26, 223, 295, 316, 317, 322], "shuffl": [24, 54, 322], "fp32": [24, 225, 230, 237, 239, 321, 322, 323], "enable_fsdp": 24, "keyword": [24, 27, 55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 223, 236, 246, 247, 315], "subfield": 24, 
"dotpath": [24, 95, 118, 131, 160, 177, 184, 194, 219], "wish": [24, 222, 234, 297], "exact": [24, 27], "normal": [24, 54, 156, 221, 224, 225, 229, 230, 231, 237, 238, 239, 256, 315, 320, 322, 323], "instanc": [24, 27, 53, 119, 132, 141, 161, 169, 171, 180, 186, 187, 188, 191, 192, 236, 243, 249, 251, 320], "preced": [24, 27, 314, 318, 320], "throw": 24, "miss": [24, 252, 296, 320], "llama2_token": [24, 315], "llama2token": [24, 118], "512": [24, 82, 323], "overwrit": [24, 272, 297, 307, 314], "duplic": [24, 25, 308, 314, 316], "refer": [24, 25, 226, 232, 237, 248, 263, 264, 265, 266, 267, 287, 308, 309, 320, 321, 322], "resolv": [24, 28, 317], "alpaca": [24, 31, 53, 58, 59, 86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 319], "disklogg": 24, "log_dir": [24, 288, 290, 291], "verifi": [24, 279, 300, 315, 317, 320], "properli": [24, 252, 285, 314], "wa": [24, 35, 42, 50, 67, 76, 80, 81, 82, 159, 163, 232, 252, 315, 320, 322, 323], "7b_lora_single_devic": [24, 316, 317, 320, 323], "my_config": [24, 314], "guidelin": 24, "tempt": 24, "put": [24, 25, 312, 317, 320, 322], "much": [24, 241, 267, 318, 320, 321, 322, 323], "switch": 24, "encourag": [24, 267, 320, 321], "clariti": 24, "significantli": [24, 264, 310, 311, 321], "dont": 24, "privat": [24, 314], "parent": [24, 314], "guarante": 24, "stabil": [24, 237, 239, 308, 311, 321, 322, 323], "underscor": 24, "_alpaca": 24, "k1": [24, 25], "v1": [24, 25, 73], "k2": [24, 25], "v2": [24, 25, 287], "my_model_checkpoint": 24, "file_1": 24, "file_2": 24, "my_tokenizer_path": 24, "nest": [24, 299], "dot": 24, "notat": [24, 50, 154, 155, 224, 226, 230, 240, 262, 263, 283], "bitsandbyt": [24, 321], "pagedadamw8bit": [24, 321], "delet": [24, 230, 233, 234, 235, 240, 316], "foreach": 24, "8b_full": [24, 314, 316], "adamw": [24, 320, 321], "2e": [24, 321], "fuse": [24, 158, 162, 240, 241, 242, 243, 292, 322], "nproc_per_nod": [24, 309, 311, 318, 320, 322], "full_finetune_distribut": [24, 280, 314, 316, 317], "thought": [25, 308, 312, 317, 323], "target": [25, 76, 238, 239, 267, 308, 319], "pipelin": [25, 308, 311], "eg": [25, 230, 240, 270, 308], "meaning": [25, 308, 316, 321], "fsdp": [25, 228, 274, 317, 318, 321], "activ": [25, 82, 220, 275, 281, 286, 293, 296, 308, 311, 316, 322, 323], "gradient": [25, 238, 239, 292, 296, 308, 311, 316, 318, 320, 323], "accumul": [25, 292, 296, 308, 311], "mix": [25, 221, 314, 316, 321], "precis": [25, 221, 236, 279, 308, 311, 316, 317, 323], "complex": 25, "becom": [25, 232, 307], "harder": 25, "anticip": 25, "methodolog": 25, "possibl": [25, 54, 269, 314, 321], "trade": [25, 321], "vs": [25, 317, 321], "qualiti": [25, 309, 320, 322], "believ": 25, "suit": [25, 317, 321], "solut": 25, "result": [25, 65, 82, 159, 163, 232, 239, 259, 261, 277, 296, 309, 311, 316, 318, 319, 320, 321, 322, 323], "meant": [25, 236, 274], "expertis": 25, "routin": 25, "yourself": [25, 314, 318, 320], "exist": [25, 235, 242, 274, 287, 307, 314, 316, 317, 318, 323], "ones": [25, 50, 222], "modular": [25, 308], "wandb": [25, 26, 291, 317], "fulli": [25, 53, 161], "nativ": [25, 306, 308, 320, 322, 323], "numer": [25, 66, 308, 311, 322], "pariti": [25, 308], "verif": 25, "benchmark": [25, 295, 308, 318, 320, 322], "limit": [25, 274, 316, 319, 322], "hidden": [25, 82, 155, 159, 163, 220, 230, 232], "behind": 25, "unnecessari": 25, "abstract": [25, 254, 255, 308, 317, 323], "No": [25, 272, 308], "go": [25, 82, 112, 159, 163, 168, 
232, 259, 308, 316, 317, 319, 321, 323], "figur": [25, 320, 323], "spectrum": 25, "decid": 25, "avail": [25, 35, 42, 67, 73, 240, 242, 279, 285, 300, 308, 314, 316, 318, 320, 321], "consist": [25, 33, 37, 42, 65, 66, 73, 312, 317], "overrid": [25, 28, 29, 33, 37, 42, 62, 63, 65, 66, 67, 68, 69, 70, 297, 312, 314, 317, 318, 319, 323], "valid": [25, 52, 77, 238, 246, 247, 252, 263, 280, 297, 298, 307, 312, 317], "closer": [25, 319, 320], "monolith": [25, 308], "trainer": [25, 264, 266, 267], "wrapper": [25, 221, 256, 257, 274, 276, 314, 320], "around": [25, 156, 221, 256, 257, 281, 314, 315, 316, 320, 321, 322, 323], "extern": 25, "primarili": [25, 53, 320], "eleutherai": [25, 73, 308, 319, 320, 322], "har": [25, 308, 319, 320, 322], "stage": [25, 232], "distil": [25, 306], "resum": [25, 278, 316, 323], "dataload": [25, 54, 59, 62, 65, 66, 69], "clean": [25, 26, 58, 319], "group": [25, 224, 284, 287, 288, 289, 290, 291, 302, 314, 318, 322], "init_process_group": [25, 284], "backend": [25, 314, 322], "gloo": 25, "nccl": 25, "fullfinetunerecipedistribut": 25, "cleanup": 25, "stuff": 25, "carri": [25, 56], "metric": [25, 317, 319, 321, 322], "logger": [25, 286, 287, 288, 289, 290, 291, 301, 317], "_devic": 25, "get_devic": 25, "_dtype": 25, "get_dtyp": 25, "ckpt_dict": 25, "wrap": [25, 242, 253, 275, 293, 315, 321], "_model": [25, 274], "_setup_model": 25, "_setup_token": 25, "_optim": 25, "_setup_optim": 25, "_loss_fn": 25, "_setup_loss": 25, "_sampler": 25, "_dataload": 25, "_setup_data": 25, "backward": [25, 274, 276, 292, 296, 323], "zero_grad": 25, "curr_epoch": 25, "rang": [25, 241, 264, 265, 267, 295, 314, 318, 322], "epochs_run": [25, 26], "total_epoch": [25, 26], "idx": [25, 54], "enumer": 25, "_autocast": 25, "logit": [25, 74, 75, 78, 237, 238, 239, 283, 319], "global_step": 25, "_log_every_n_step": 25, "_metric_logg": 25, "log_dict": [25, 287, 288, 289, 290, 291], "step": [25, 54, 55, 56, 65, 66, 230, 240, 262, 274, 276, 278, 287, 288, 289, 290, 291, 292, 296, 306, 311, 320, 322, 323], "recipe_main": [25, 29], "fullfinetunerecip": 25, "wandblogg": [26, 320, 323], "tip": 26, "straggler": 26, "background": 26, "crash": 26, "otherwis": [26, 45, 47, 50, 80, 81, 82, 159, 163, 230, 232, 285, 287, 315, 322], "exit": [26, 234, 235, 248, 307, 314], "resourc": [26, 287, 288, 289, 290, 291, 321, 322], "kill": 26, "ps": 26, "aux": 26, "grep": 26, "awk": 26, "xarg": 26, "desir": [26, 55, 56, 246, 247, 294, 309, 315, 316, 321], "suggest": [26, 319], "approach": [26, 53, 309, 319], "full_finetun": 26, "joinpath": 26, "_checkpoint": 26, "_output_dir": [26, 270, 271, 272], "torchtune_model_": 26, "with_suffix": 26, "wandb_at": 26, "descript": [26, 314], "whatev": 26, "seed_kei": 26, "epochs_kei": 26, "total_epochs_kei": 26, "max_steps_kei": 26, "max_steps_per_epoch": [26, 322], "add_fil": 26, "log_artifact": 26, "hydra": 27, "facebook": 27, "research": 27, "com": [27, 75, 86, 87, 88, 97, 98, 102, 103, 104, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 226, 237, 238, 264, 265, 266, 267, 278, 287, 307, 314, 316, 318, 319], "facebookresearch": 27, "blob": [27, 75, 86, 87, 88, 97, 98, 102, 103, 104, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 184, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 226, 238, 264, 265, 266, 267, 278], "_intern": 27, "_instantiate2": 27, "l148": 27, "num_head": [27, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 159, 
162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 224, 226, 230], "num_kv_head": [27, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 222, 224], "vocab_s": [27, 74, 75, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 237, 238, 239, 241], "32000": [27, 320], "parsed_yaml": 27, "4096": [27, 61, 73, 101, 105, 224, 226, 316, 320, 322], "embed_dim": [27, 79, 80, 81, 82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224, 226, 229, 230, 231, 232, 241, 242, 297, 320], "valueerror": [27, 33, 35, 37, 42, 44, 46, 47, 50, 52, 53, 55, 59, 60, 62, 64, 65, 66, 67, 69, 70, 72, 186, 222, 224, 232, 233, 234, 235, 246, 247, 270, 271, 272, 279, 281, 295, 298], "recipe_nam": 28, "rank": [28, 86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 277, 285, 295, 302, 309, 310, 316, 317, 320, 323], "zero": [28, 50, 222, 225, 230, 240, 269, 316, 318, 322], "displai": 28, "callabl": [29, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 74, 82, 223, 230, 248, 282, 293], "With": [29, 316, 319, 320, 322, 323], "my_recip": 29, "foo": 29, "bar": [29, 308, 317, 321], "configerror": 30, "cannot": [30, 46, 272, 318], "equival": [31, 35, 80, 266, 267], "condit": [31, 74, 285, 314], "dedic": 31, "due": [31, 256, 320, 321, 323], "keep": [31, 33, 35, 37, 42, 63, 64, 66, 67, 68, 71, 228, 241, 316, 320, 321], "openai": [32, 37, 60, 265], "markup": 32, "im_start": 32, "context": [32, 183, 234, 235, 248, 294, 296, 321], "im_end": 32, "goe": [32, 248], "a2": [33, 55], "functool": [34, 40, 43], "partial": [34, 40, 43], "_prompt_templ": [34, 40, 43], "assistant_messag": [34, 40, 43], "respect": [35, 53, 112, 222, 246, 247, 251, 296, 315], "final": [35, 42, 55, 56, 67, 86, 87, 88, 92, 96, 101, 105, 113, 119, 120, 121, 122, 123, 128, 132, 133, 134, 137, 141, 142, 144, 150, 151, 158, 159, 161, 162, 163, 169, 170, 171, 172, 173, 180, 181, 186, 189, 190, 196, 197, 200, 201, 203, 204, 205, 206, 220, 230, 240, 252, 318, 319, 320, 321, 323], "leav": [35, 42, 67, 321], "liter": [36, 38, 41, 86, 87, 88, 89, 90, 91, 95, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 118, 119, 120, 121, 122, 123, 124, 125, 126, 127, 131, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 160, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 177, 178, 179, 180, 181, 184, 185, 186, 187, 188, 189, 194, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 219, 252], "union": [36, 46, 47, 58, 59, 60, 62, 64, 69, 70, 72, 73, 95, 118, 131, 160, 177, 184, 194, 219, 223, 230, 240, 246, 247, 251, 253, 270, 275, 280, 287, 288, 289, 290, 291, 293, 295], "interleav": [36, 261], "attach": 36, "writer": 36, "calcul": [36, 38, 77, 154, 156, 224, 229, 231, 232, 262, 263, 265, 318], "consecut": [36, 52, 222, 261], "properti": [36, 320, 321], "media": [36, 56], "classmethod": 36, "image_url": 37, "unmask": [37, 42, 238], "consid": [38, 53, 55, 56, 80, 81, 82, 159, 163, 232, 321], "come": [38, 52, 245, 246, 247, 320, 321], "nanswer": 40, "alia": 41, "alwai": [42, 287, 297, 315, 321], "nsummari": [43, 315], "summari": [43, 53, 69, 232, 281], "batch_first": 45, "padding_valu": 45, "float": [45, 74, 75, 78, 86, 87, 88, 89, 90, 91, 92, 96, 97, 98, 99, 100, 101, 105, 106, 107, 
108, 109, 110, 111, 113, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 132, 133, 134, 135, 136, 137, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 156, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 173, 175, 178, 179, 180, 181, 182, 185, 186, 187, 188, 189, 190, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 224, 225, 246, 247, 253, 262, 263, 264, 265, 266, 267, 274, 278, 280, 281, 286, 287, 288, 289, 290, 291, 320, 321, 322, 323], "rnn": [45, 47, 50], "pad_sequ": [45, 47, 50], "variabl": [45, 273, 285, 287, 321, 323], "left": [45, 47, 50, 230, 320], "longest": [45, 49, 50], "trail": 45, "dimens": [45, 50, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 154, 158, 159, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 220, 222, 223, 224, 226, 230, 232, 241, 246, 247, 318, 320, 321, 323], "element": [45, 47, 50, 53, 238, 283], "8": [45, 47, 48, 50, 59, 62, 65, 66, 69, 86, 87, 88, 89, 90, 91, 97, 98, 99, 100, 106, 107, 108, 109, 110, 111, 120, 121, 122, 123, 124, 125, 126, 127, 133, 134, 135, 136, 137, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 170, 172, 178, 179, 181, 185, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 222, 232, 237, 239, 316, 319, 320, 321, 322, 323], "9": [45, 47, 48, 50, 222, 232, 283, 322, 323], "12": [45, 47, 48, 70, 232, 307, 322], "image_loc": 46, "www": [46, 287], "org": [46, 65, 83, 84, 85, 86, 87, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 114, 115, 116, 117, 119, 120, 121, 122, 124, 125, 126, 127, 132, 133, 134, 135, 136, 144, 145, 146, 147, 150, 151, 152, 153, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 224, 225, 226, 232, 253, 261, 262, 264, 265, 266, 267, 285, 290, 293, 295, 301, 307], "en": [46, 57, 61, 63, 72, 73, 322], "pad_direct": [47, 50], "keys_to_pad": 47, "padding_idx": [47, 48, 49, 50, 54], "left_pad_sequ": [47, 50], "integ": [47, 49, 241, 269, 275, 295], "batch_siz": [47, 59, 62, 65, 66, 69, 222, 224, 229, 230, 231, 233, 234, 235, 237, 238, 239, 240, 241, 242, 264, 266, 268, 316, 321, 322], "ignore_idx": [48, 49, 50], "input_id": [48, 283], "chosen_input_id": [48, 68], "chosen_label": 48, "15": [48, 232, 315, 320, 323], "16": [48, 86, 87, 88, 89, 90, 91, 97, 98, 99, 100, 106, 107, 108, 109, 110, 111, 120, 121, 122, 123, 124, 125, 126, 127, 133, 134, 135, 136, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 170, 172, 178, 179, 181, 185, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 222, 232, 316, 320, 321, 323], "17": [48, 232, 320], "18": [48, 232, 318], "19": [48, 232, 323], "20": [48, 232, 268, 322], "token_pair": 49, "padded_col": 49, "pad_max_til": 50, "pad_max_imag": 50, "tile": [50, 79, 80, 81, 82, 154, 155, 156, 157, 159, 160, 161, 163, 232, 261], "aspect": [50, 79, 80, 308], "ratio": [50, 79, 80, 264, 265], "text_seq_len": [50, 261], "n_tile": [50, 79, 80, 232], "h": [50, 154, 222, 232, 237, 239, 307, 314], "w": [50, 83, 84, 85, 93, 94, 102, 103, 104, 114, 115, 116, 117, 129, 130, 138, 139, 140, 148, 149, 154, 165, 174, 176, 191, 192, 193, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 232, 287, 290, 291, 315, 320, 323], "h_ratio": 50, "w_ratio": 50, "encoder_mask": [50, 229, 230, 240], "image_seq_len": [50, 261], "channel": [50, 82, 154, 156, 159, 163, 232, 246, 322], "height": [50, 154], "largest": 50, "bsz": [50, 74, 75, 76, 77, 79, 80, 232, 237, 239], "max_num_imag": 50, "max_num_til": [50, 79, 80, 82, 156, 159, 163, 232], 
"tokens_per_til": 50, "image_id": 50, "four": [50, 320], "model_input": 50, "max_text_seq_len": 50, "40": [50, 80, 81, 82, 159, 163, 232, 261, 321, 323], "did": [50, 318, 323], "extra": [50, 156, 240, 307, 315, 320, 321, 322, 323], "second": [50, 224, 241, 320, 321, 323], "eos_id": [51, 156, 257, 259], "shorter": [52, 230], "min": [52, 320], "invalid": 52, "sub": [53, 290], "unifi": [53, 176], "simplifi": [53, 264, 314, 319, 320], "simultan": 53, "intern": [53, 240], "aggreg": 53, "transpar": 53, "constitu": 53, "might": [53, 234, 241, 244, 314, 321], "comput": [53, 55, 56, 113, 119, 128, 132, 137, 141, 154, 155, 158, 162, 186, 190, 224, 226, 230, 231, 237, 239, 240, 261, 264, 266, 281, 295, 311, 316, 319, 321, 322, 323], "cumul": 53, "maintain": [53, 242, 310, 321, 323], "deleg": 53, "retriev": [53, 55, 56, 230], "lead": [53, 256, 269, 311], "scale": [53, 74, 75, 78, 86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 137, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 225, 227, 229, 231, 246, 247, 253, 263, 267, 320, 321, 322, 323], "strategi": [53, 311], "stream": [53, 301, 321], "demand": 53, "deriv": [53, 220, 230, 231], "instans": 53, "dataset1": 53, "mycustomdataset": 53, "params1": 53, "dataset2": 53, "params2": 53, "concat_dataset": 53, "total": [53, 263, 265, 278, 302, 305, 313, 316, 318, 319, 320, 321], "data_point": 53, "1500": 53, "vicgal": 53, "gpt4": 53, "samsum": [53, 69], "focus": [53, 312, 317, 321], "enhanc": [53, 232, 267, 321, 323], "divers": 53, "machin": [53, 266, 300, 309, 314], "max_pack": 54, "outsid": [54, 295, 296, 320], "sampler": [54, 316, 317], "part": [54, 223, 241, 266, 315, 323], "buffer": [54, 230, 240, 246, 247, 321], "enough": [54, 315, 321], "lower": [54, 311, 319, 320], "triangular": 54, "wise": 54, "made": [54, 60, 64, 67, 68, 72, 156], "smaller": [54, 241, 318, 319, 320, 321, 322, 323], "jam": 54, "s1": [54, 256], "s2": [54, 256], "s3": 54, "s4": 54, "contamin": 54, "input_po": [54, 75, 224, 226, 230, 231, 240], "matrix": [54, 229, 230, 240], "increment": 54, "move": [54, 72, 230, 246, 247, 250, 299, 321], "entir": [54, 72, 237, 244, 315, 323], "avoid": [54, 72, 225, 232, 236, 277, 295, 314, 322, 323], "truncat": [54, 61, 72, 73, 95, 118, 131, 156, 160, 177, 184, 194, 219, 257, 268], "sentenc": [54, 72, 316], "techniqu": [55, 308, 309, 310, 311, 316, 317, 318, 319, 320, 321, 322], "repons": 55, "At": [55, 56, 230, 240], "extract": [55, 56, 61, 258], "against": [55, 56, 267, 303, 322, 323], "unit": [55, 56, 308], "filepath": [55, 56, 57, 59, 60, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73], "filter": [55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 322], "prior": [55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 297], "doc": [55, 56, 57, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73, 225, 285, 287, 290, 291, 295, 301, 314], "round": [56, 322], "incorpor": [56, 264], "happen": [56, 237, 239], "ti": [56, 96, 105, 186, 190, 228, 321], "agnost": 56, "treat": [56, 232, 248, 315], "minimum": [56, 65, 66, 67], "corpu": [57, 61, 72, 73], "package_refer": [57, 61, 63, 72, 73], "loading_method": [57, 61, 63, 72, 73], "tabular": [57, 72], "eo": [57, 72, 184, 256, 259, 315], "yahma": 58, "variant": [58, 62, 69, 321], "page": [58, 73, 307, 308, 314, 316, 317, 318, 321], "tatsu": 59, "lab": [59, 75], "codebas": 59, "independ": 59, "contribut": [59, 60, 62, 64, 68, 69, 
70, 238, 239, 263, 265], "alpacatomessag": 59, "alpaca_d": 59, "altern": [60, 64, 67, 68, 234, 317, 321], "toward": [60, 267, 309], "my_dataset": [60, 64, 67], "london": [60, 64], "ccdv": 61, "cnn_dailymail": 61, "textcompletiondataset": [61, 72, 73], "cnn": 61, "dailymail": 61, "articl": [61, 73], "highlight": [61, 323], "disabl": [61, 73, 223, 230, 234, 240, 248, 253, 295, 316, 322], "highest": [61, 73], "conjunct": [62, 69, 71, 230], "grammar_d": 62, "rlhflow": 63, "hh": 63, "preferencedataset": [63, 68, 71], "liuhaotian": 65, "llava": 65, "150k": 65, "coco": 65, "train2017": 65, "llava_instruct_150k": 65, "2017": 65, "visit": 65, "cocodataset": 65, "wget": 65, "zip": [65, 304], "unzip": 65, "minim": [65, 66, 317, 319, 320, 321, 322, 323], "clip": [65, 66, 79, 80, 81, 82, 154, 155, 156, 159, 163, 232, 265, 273], "mymodeltransform": [65, 66], "tokenizer_path": [65, 66], "image_transform": [65, 66], "yet": [65, 66, 67, 168, 314, 315, 316], "llava_instruct_d": 65, "huggingfacem4": 66, "the_cauldron": 66, "cauldron": 66, "card": 66, "cauldron_d": 66, "pictur": 67, "logo": 67, "rgb_pytorch": 67, "png": 67, "compris": 68, "share": [68, 224, 228, 316], "c1": 68, "r1": 68, "chosen_messag": 68, "rejected_messag": 68, "samsung": 69, "samsum_d": 69, "351": 70, "82": 70, "391": 70, "221": 70, "220": 70, "193": 70, "471": 70, "lvwerra": 71, "stack": [71, 232, 296], "exchang": 71, "allenai": [72, 322], "data_dir": 72, "realnewslik": 72, "wikitext_document_level": 73, "wikitext": [73, 322], "103": 73, "transformerdecod": [74, 75, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94, 96, 97, 98, 99, 100, 101, 102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 113, 114, 115, 116, 117, 119, 120, 121, 122, 123, 124, 125, 126, 127, 128, 129, 130, 132, 133, 134, 135, 136, 137, 138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 148, 149, 150, 151, 152, 153, 158, 162, 165, 166, 167, 169, 170, 171, 172, 173, 174, 175, 176, 178, 179, 180, 181, 182, 183, 185, 186, 187, 188, 189, 190, 191, 192, 193, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218, 229, 231, 240, 241, 242, 320], "max_generated_token": 74, "pad_id": [74, 268], "temperatur": [74, 75, 78, 264, 266, 267, 316], "top_k": [74, 75, 78, 316], "stop_token": [74, 268], "rng": 74, "custom_generate_next_token": 74, "seq_length": [74, 75, 76, 229, 231, 241, 242], "prune": [74, 78, 323], "probabl": [74, 78, 86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 246, 247, 253, 264, 265, 266, 267, 309, 316, 319], "stop": [74, 268], "random": [74, 223, 232, 295, 317], "compil": [74, 237, 316, 318, 321, 323], "generate_next_token": 74, "llama3_8b": [74, 134, 142, 240, 318, 321, 322], "manual_se": 74, "tolist": 74, "jeremi": 74, "m": [74, 236, 309, 315, 322], "seq_len": [74, 76, 77, 230], "num_generated_token": 74, "q": [75, 78, 224, 320], "randomli": [75, 78, 297], "softmax": [75, 78, 224, 230, 231, 240, 319], "trick": [75, 78], "fast": [75, 316], "32971d3129541c5bfb4f715abc33d1c5f408d204": 75, "l40": 75, "padding_mask": [76, 77, 265, 268], "target_seq_len": 76, "suitabl": 76, "scaled_dot_product_attent": [76, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 224], "static": 76, "kv": [76, 222, 224, 230, 231, 233, 234, 235, 240, 322], "cach": [76, 222, 224, 226, 229, 230, 231, 233, 234, 235, 
240, 242, 307, 314, 316], "longer": [76, 222, 321], "boolean": [76, 77, 82, 224, 229, 230, 231, 240, 242, 253, 283], "assertionerror": [76, 82, 222, 229, 230, 252, 297], "shift": [77, 230], "uniform_": 78, "int32": 78, "patch": [79, 80, 81, 82, 155, 156, 159, 163, 232, 261], "vision_transform": [79, 80, 81, 82], "visiontransform": [79, 80, 81, 82], "divid": [79, 80, 81, 82, 156, 159, 163, 232, 238, 239, 261], "dimension": [79, 80, 81, 82, 159, 163, 232], "n_img": [79, 80, 232], "n_tokens_per_til": [79, 80, 81], "crop": [79, 80, 81, 82, 154, 159, 163, 232], "local_token_positional_embed": 80, "_position_embed": [80, 232], "tokenpositionalembed": [80, 232], "gate": [80, 227, 273, 309, 310, 311, 314, 317], "global_token_positional_embed": 80, "400": [80, 81, 82, 159, 163, 232, 261], "10x10": [80, 81, 82, 159, 163, 232, 261], "grid": [80, 81, 82, 159, 163, 232, 261], "th": [80, 222], "silu": [82, 220], "cls_output_dim": [82, 232], "attn_bia": 82, "use_rop": 82, "out_indic": [82, 232], "output_cls_project": 82, "in_channel": [82, 159, 163, 232], "append_cls_token": [82, 232], "transformerencoderlay": 82, "cl": [82, 155, 232], "mlp": [82, 86, 87, 88, 92, 96, 97, 98, 101, 105, 106, 107, 108, 113, 119, 120, 121, 122, 123, 128, 132, 133, 134, 137, 141, 142, 144, 150, 151, 158, 161, 162, 163, 169, 170, 171, 172, 173, 175, 180, 181, 182, 186, 187, 188, 189, 190, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 229, 230, 231, 252, 318, 320, 321], "bia": [82, 228, 245, 246, 247, 297, 320, 322, 323], "2d": 82, "rope": [82, 137, 141, 186, 190, 224, 226], "intermedi": [82, 92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 159, 162, 163, 169, 171, 173, 175, 180, 182, 186, 190, 232, 272, 293, 316, 318, 323], "fourth": [82, 159, 163, 232], "determin": [82, 159, 163, 316], "divis": [82, 225], "code_llama2": [83, 84, 85, 86, 87, 88, 89, 90, 91, 314, 316], "arxiv": [83, 84, 85, 86, 87, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 114, 115, 116, 117, 119, 120, 121, 122, 124, 125, 126, 127, 132, 133, 134, 135, 136, 144, 145, 146, 147, 150, 151, 152, 153, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 224, 226, 232, 253, 261, 262, 264, 265, 266, 267, 309], "pdf": [83, 84, 85, 261, 262], "2308": [83, 84, 85], "12950": [83, 84, 85], "lora_attn_modul": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 252, 310, 320, 321, 323], "q_proj": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 224, 252, 310, 320, 321, 322, 323], "k_proj": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 224, 252, 310, 320, 321, 322, 323], "v_proj": [86, 87, 88, 89, 90, 91, 96, 97, 
98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 224, 252, 310, 320, 321, 322, 323], "output_proj": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 224, 252, 320, 321, 322, 323], "apply_lora_to_mlp": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 252, 310, 320, 321], "apply_lora_to_output": [86, 87, 88, 89, 90, 91, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 252, 320, 321], "lora_rank": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 310, 320, 321], "lora_alpha": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 310, 320, 321], "lora_dropout": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 321], "use_dora": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 143, 144, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 321], "quantize_bas": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 143, 144, 145, 146, 147, 150, 151, 152, 153, 161, 162, 163, 164, 166, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 321, 323], "code_llama2_13b": 86, "tloen": [86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 
151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206], "8bb8579e403dc78e37fe81ffbb253c413007323f": [86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206], "l41": [86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206], "l43": [86, 87, 88, 97, 98, 106, 107, 108, 120, 121, 122, 123, 133, 134, 142, 144, 150, 151, 181, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206], "linear": [86, 87, 88, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 119, 120, 121, 122, 123, 124, 125, 126, 127, 132, 133, 134, 135, 136, 141, 142, 144, 145, 146, 147, 150, 151, 152, 153, 155, 161, 162, 163, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 228, 230, 245, 246, 247, 252, 253, 320, 321, 322, 323], "low": [86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 309, 310, 316, 319, 320, 323], "approxim": [86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 320], "factor": [86, 87, 88, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 123, 132, 133, 134, 137, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 180, 181, 186, 187, 188, 189, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 246, 247, 262], "dropout": [86, 87, 88, 92, 96, 97, 98, 101, 105, 106, 107, 108, 113, 119, 121, 122, 123, 128, 132, 133, 134, 137, 141, 142, 144, 150, 151, 161, 162, 163, 169, 170, 171, 172, 173, 175, 180, 181, 182, 186, 187, 188, 189, 190, 195, 196, 197, 198, 199, 200, 201, 202, 203, 204, 205, 206, 223, 224, 246, 247, 253, 320, 321, 323], "decompos": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181, 246, 309, 310], "magnitud": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181, 246, 250, 321], "dora": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 141, 144, 150, 151, 162, 163, 169, 170, 171, 172, 180, 181, 246, 309, 310], "ab": [86, 87, 89, 90, 91, 96, 97, 98, 99, 100, 105, 106, 107, 108, 109, 110, 111, 114, 115, 116, 117, 119, 120, 121, 122, 124, 125, 126, 127, 132, 133, 134, 135, 136, 144, 145, 146, 147, 150, 151, 152, 153, 164, 167, 169, 170, 171, 172, 178, 179, 180, 181, 185, 224, 226, 232, 253, 264, 265, 266, 267], "2402": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181], "09353": [86, 87, 96, 97, 98, 105, 106, 107, 108, 119, 120, 121, 122, 132, 133, 134, 144, 150, 151, 169, 170, 171, 172, 180, 181], "code_llama2_70b": 87, "code_llama2_7b": 88, "qlora": [89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 145, 146, 147, 152, 153, 164, 167, 178, 179, 185, 236, 306, 308, 309, 310, 318, 320], "paper": [89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 145, 146, 147, 152, 153, 164, 167, 
178, 179, 185, 253, 261, 264, 266, 267, 319, 320, 323], "2305": [89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 145, 146, 147, 152, 153, 164, 167, 178, 179, 185, 224, 264, 266], "14314": [89, 90, 91, 99, 100, 109, 110, 111, 124, 125, 126, 127, 135, 136, 145, 146, 147, 152, 153, 164, 167, 178, 179, 185], "lora_code_llama2_13b": 89, "lora_code_llama2_70b": 90, "lora_code_llama2_7b": 91, "head_dim": [92, 96, 101, 105, 222, 224, 230], "intermediate_dim": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190], "attn_dropout": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 224, 230], "norm_ep": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190], "1e": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 225, 319, 321], "06": [92, 96, 101, 105, 225, 320], "rope_bas": [92, 96, 101, 105, 113, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190], "10000": [92, 96, 101, 105, 113, 169, 171, 173, 175, 180, 182, 226], "transformerselfattentionlay": [92, 101, 113, 128, 137, 173, 190, 229, 230, 240, 242], "rm": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 316], "norm": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 230], "space": [92, 101, 113, 128, 137, 158, 162, 173, 190, 230, 244, 321], "slide": [92, 173, 183], "window": [92, 173, 183], "vocabulari": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 237, 239, 320, 321], "mha": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224, 230], "onto": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 224, 244], "epsilon": [92, 96, 101, 105, 113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190, 265], "rotari": [92, 96, 101, 105, 113, 128, 137, 141, 169, 171, 173, 175, 180, 182, 226, 318], "10_000": [92, 96, 101, 105, 169, 171, 173, 175, 182], "blog": [93, 94], "technolog": [93, 94], "develop": [93, 94, 307, 323], "gemmatoken": 95, "_templatetyp": [95, 118, 131, 160, 177, 184, 194, 219], "gemma_2b": [97, 107], "gemma_7b": [98, 106, 108], "lora_gemma_2b": 99, "lora_gemma_7b": 100, "hidden_capping_valu": [101, 105], "50": [101, 105, 232, 268, 287, 316], "final_capping_valu": [101, 105], "30": [101, 105, 232, 268, 322], "sliding_window_s": [101, 105], "query_pre_attn_scalar": [101, 105], "gemma2": [102, 103, 104, 105, 106, 107, 108, 109, 110, 111, 273, 316], "gemma_pytorch": [102, 103, 104], "lora_gemma2_27b": 109, "lora_gemm2a_2b": 110, "lora_gemma2_9b": 111, "taken": [112, 316, 320, 323], "sy": [112, 315], "honest": [112, 309, 315], "pari": [112, 168], "capit": [112, 168], "franc": [112, 168], "known": [112, 168, 282], "stun": [112, 168], "05": [113, 119, 128, 132, 137, 141, 169, 171, 173, 175, 180, 182, 186, 190], "gqa": [113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224], "mqa": [113, 119, 128, 132, 137, 141, 158, 162, 169, 171, 173, 175, 180, 182, 186, 190, 224], "kvcach": [113, 119, 128, 132, 137, 141, 158, 162, 180, 186, 190, 224, 230, 233, 234, 235, 240], "scale_hidden_dim_for_mlp": [113, 119, 128, 132, 137, 141, 158, 162, 186, 190], "2307": [114, 115, 116, 117], "09288": [114, 115, 116, 117], "classif": [117, 171, 175, 176, 273], "llama2_13b": 120, "llama2_70b": 121, 
"llama2_7b": [122, 320], "classifi": [123, 171, 175, 176, 297, 321], "llama2_reward_7b": [123, 273], "lora_llama2_13b": 124, "lora_llama2_70b": 125, "lora_llama2_7b": [126, 320], "lora_llama2_reward_7b": 127, "500000": [128, 132, 137, 141, 158, 162], "500_000": [128, 137, 141], "70": 129, "llama3token": [131, 156, 255], "regist": [131, 156, 160, 184, 194, 219, 236, 292, 323], "canon": [131, 156, 160, 184, 194, 219], "llama3_70b": 133, "lora_llama3_70b": 135, "lora_llama3_8b": [136, 321], "scale_factor": [137, 141], "llama3_1": [138, 139, 140, 141, 142, 143, 144, 145, 146, 147, 309, 310, 316, 319], "llama3_1_8b": 144, "lora_llama3_1_405b": 145, "lora_llama3_1_70b": [146, 166, 167], "lora_llama3_1_8b": 147, "llama3_2_1b": [150, 233, 234, 235], "lora_llama3_2_1b": 152, "lora_llama3_2_3b": 153, "projection_head": [154, 240, 244], "combin": [154, 156, 159, 163, 230, 240, 242, 244, 263, 319], "learnabl": [154, 227, 240, 242, 246, 316], "fusion": [154, 157, 158, 159, 161, 162, 163, 240, 241, 242, 243, 244], "encoder_dim": [154, 155], "decoder_dim": [154, 155], "num_img": [154, 155], "num_emb": [154, 155], "broken": [154, 155, 232, 242], "width": [154, 322], "clip_embeds_per_til": 154, "emb": [154, 155, 224, 229, 230, 240], "num_hidden_input": 155, "frozen": [155, 161, 164, 241, 264, 320, 321, 323], "sequenti": [155, 240, 244], "num_hidden": 155, "hidden_st": [155, 232], "image_mean": 156, "image_std": 156, "tranform": 156, "possible_resolut": 156, "448": [156, 157, 160, 161], "deviat": 156, "transformed_data": 156, "img1": [156, 261], "img2": [156, 261], "31587": [156, 256, 257], "29644": [156, 256, 257], "102": [156, 256, 257], "truncate_at_eo": [156, 257], "skip": [156, 224], "tokenize_head": 156, "tokenize_end": 156, "header": 156, "eom": 156, "wether": 156, "decoder_train": [157, 161, 164, 240], "encoder_train": [157, 161, 164, 240], "fusion_train": [157, 161, 164, 240], "deepfusionmodel": [157, 161, 164], "trainabl": [157, 161, 242, 247, 251, 320, 321, 323], "resiz": [157, 160, 161], "fusion_interv": [158, 162], "num_special_token": [158, 162], "encoder_max_seq_len": [158, 162, 229, 230, 231, 235, 240, 242], "causalselfattent": [158, 162], "interv": [158, 162, 317], "clip_embed_dim": [159, 163], "clip_num_lay": [159, 163], "clip_hidden_st": [159, 163], "num_layers_project": [159, 163], "decoder_embed_dim": [159, 163], "llama3visionencod": [159, 163], "spatial": [159, 163], "backbon": [159, 163], "trainbl": 161, "decoder_lora": 162, "fusion_lora": [162, 163], "encoder_lora": 163, "quantization_kwarg": [163, 246, 247], "lora_llama3_2_vision_11b": 164, "llama3_3": [165, 166, 167], "llama3_1_70b": 165, "num_class": [171, 175, 297], "announc": 174, "ray2333": 176, "feedback": [176, 264], "lora_mistral_7b": 178, "lora_mistral_reward_7b": 179, "phi3_mini": [181, 273], "128k": 183, "nor": 183, "phi3minitoken": 184, "spm": 184, "lm": [184, 265, 319], "bo": [184, 256, 259, 315], "unk": 184, "augment": [184, 323], "endoftext": 184, "phi3minisentencepiecebasetoken": 184, "lora_phi3_mini": 185, "1000000": [186, 190], "tie_word_embed": [186, 187, 188, 190, 191, 192, 195, 198, 199, 202, 207, 210, 211, 214], "qwen2transformerdecod": 186, "period": [186, 190], "word": [186, 190, 321, 322], "qwen2_0_5b": [187, 228], "qwen2_1_5b": [188, 228], "qwen2_7b": 189, "qwen": [191, 192, 193, 207, 208, 209, 210, 211, 212, 213, 214, 215, 216, 217, 218], "merges_fil": [194, 219], "qwen2token": 194, "qwen2_5_0_5b": 195, "qwen2_5_14b_bas": 196, "slightli": [196, 197, 198, 199, 200, 201, 203, 204, 205, 206, 208, 209, 210, 
211, 212, 213, 215, 216, 217, 218, 319, 321], "qwen2_5_14b_instruct": 197, "qwen2_5_1_5b_bas": 198, "qwen2_5_1_5b_instruct": 199, "qwen2_5_32b_bas": 200, "qwen2_5_32b_instruct": 201, "qwen2_5_3b": 202, "qwen2_5_72b_bas": 203, "qwen2_5_72b_instruct": 204, "qwen2_5_7b_bas": 205, "qwen2_5_7b_instruct": 206, "qwen2_5token": 219, "gate_proj": 220, "down_proj": 220, "up_proj": 220, "feed": [220, 229, 231], "network": [220, 248, 320, 323], "fed": [220, 315], "multipli": [220, 321], "in_dim": [220, 245, 246, 247, 320, 321, 323], "out_dim": [220, 230, 245, 246, 247, 320, 321, 323], "layernorm": 221, "past": 222, "dpython": [222, 224, 229, 230, 231, 235, 236, 240, 242, 294, 298], "reset": [222, 224, 229, 230, 231, 240, 242, 281], "k_val": 222, "v_val": 222, "fill": 222, "bfloat16": [222, 236, 294, 316, 317, 318, 320, 321, 322], "greater": [222, 232, 303], "prob": 223, "disable_on_ev": [223, 253], "portion": [223, 322, 323], "regular": [223, 264, 267, 321, 322, 323], "potenti": [223, 311, 320, 321], "layer_dropout": 223, "randn": 223, "pos_embed": [224, 229, 320, 322], "q_norm": 224, "k_norm": 224, "kv_cach": [224, 233, 234, 235], "is_caus": 224, "13245v1": 224, "multihead": 224, "extrem": 224, "credit": 224, "litgpt": 224, "v": [224, 230, 240, 320], "n_kv_head": 224, "rotarypositionalembed": [224, 320, 322], "rmsnorm": 224, "vice": [224, 314], "versa": [224, 314], "y": [224, 309], "s_x": 224, "s_y": 224, "_masktyp": [224, 230, 231], "score": [224, 230, 231, 263], "encoder_max_cache_seq_len": [224, 230, 231], "decoder_max_cache_seq_len": 224, "j": [224, 229, 230, 231, 240, 309], "blockmask": [224, 230, 231], "create_block_mask": [224, 230, 231], "flex_attent": [224, 230, 231], "n_h": [224, 226], "num": [224, 226], "n_kv": 224, "h_d": [224, 226], "reset_cach": [224, 229, 230, 231, 240, 242], "setup_cach": [224, 229, 230, 231, 233, 234, 240, 242], "ep": 225, "squar": 225, "stabl": [225, 285, 290, 295, 307, 321], "html": [225, 285, 290, 293, 295, 301, 306], "propos": [226, 321], "2104": 226, "09864": 226, "verfic": 226, "l80": 226, "init": [226, 281, 286, 291, 316, 323], "exceed": 226, "freq": 226, "recomput": [226, 321], "geometr": 226, "progress": [226, 253, 312, 317, 321], "rotat": 226, "angl": 226, "basic": [227, 318], "tied_modul": 228, "pointer": [228, 308], "why": [228, 315, 317, 320], "whose": [228, 248, 287, 292], "attributeerror": [228, 299], "attn": [229, 231, 233, 234, 235, 320, 322, 323], "multiheadattent": [229, 231, 320, 322], "ca_norm": 229, "mlp_norm": [229, 231], "ca_scal": 229, "mlp_scale": [229, 231], "ff": [229, 231], "caches_are_en": [229, 230, 231, 233, 234, 235, 240, 242], "func": [229, 231, 242], "caches_are_setup": [229, 230, 231, 233, 234, 235, 240, 242], "token_sequ": 229, "embed_sequ": 229, "decoder_max_seq_len": [229, 230, 231, 233, 234, 235, 240, 242], "modulelist": [230, 253], "output_hidden_st": [230, 240], "belong": [230, 276], "statement": 230, "improv": [230, 257, 266, 309, 311, 318, 319, 320, 321], "readabl": 230, "behaviour": [230, 240, 297, 309], "alter": [230, 240], "common_util": [230, 233, 234, 235, 236], "disable_kv_cach": [230, 240], "chunked_output": 230, "last_hidden_st": 230, "chunk": [230, 237, 239, 257], "cewithchunkedoutputloss": [230, 240], "upcast": [230, 237, 239], "set_num_output_chunk": [230, 240], "num_chunk": [230, 237, 239], "s_e": [230, 240], "d_e": [230, 240], "arang": [230, 240], "prompt_length": [230, 240], "correspondingli": 230, "padded_prompt_length": 230, "m_": [230, 240], "realloc": [230, 240], "runtimeerror": [230, 246, 259, 274, 279, 
280, 284], "num_output_chunk": [230, 237, 239, 240], "transformercrossattentionlay": [230, 240, 242], "fusionlay": [230, 240], "sa_norm": 231, "sa_scal": 231, "token_pos_embed": 232, "pre_tile_pos_emb": 232, "post_tile_pos_emb": 232, "cls_project": 232, "vit": 232, "2010": [232, 253], "11929": 232, "convolut": 232, "flatten": 232, "downscal": 232, "800x400": 232, "400x400": 232, "_transform": 232, "whole": [232, 319], "n_token": 232, "101": 232, "pool": 232, "tiledtokenpositionalembed": 232, "tilepositionalembed": 232, "tile_pos_emb": 232, "8x8": 232, "21": [232, 316], "22": 232, "23": [232, 278], "25": [232, 319], "26": 232, "27": [232, 316], "28": 232, "29": [232, 323], "31": [232, 318], "33": 232, "34": 232, "35": [232, 323], "36": [232, 309], "37": 232, "38": 232, "39": 232, "41": 232, "43": 232, "44": 232, "45": 232, "46": 232, "47": [232, 316], "48": [232, 323], "49": 232, "51": 232, "52": [232, 317], "53": 232, "54": 232, "55": [232, 317], "56": 232, "57": [232, 320, 323], "58": 232, "59": [232, 323], "60": 232, "61": 232, "62": 232, "63": 232, "64": [232, 310, 320, 321], "num_patches_per_til": 232, "emb_dim": 232, "constain": 232, "anim": 232, "max_n_img": 232, "n_channel": 232, "vision_util": 232, "tile_crop": 232, "800": 232, "patch_grid_s": 232, "rand": 232, "nch": 232, "tile_cropped_imag": 232, "batch_imag": 232, "unsqueez": 232, "batch_aspect_ratio": 232, "clip_vision_encod": 232, "cache_en": 233, "float32": [233, 234, 235, 279, 319, 321], "1024": [233, 234, 235, 322], "temporarili": [234, 235, 248, 321], "enter": [234, 235], "overhead": [234, 264, 311, 321, 322], "untouch": [234, 315], "yield": [234, 235, 248], "caller": [234, 235, 248], "delete_kv_cach": 235, "offload_to_cpu": 236, "hook": [236, 292, 321, 323], "nf4": [236, 321, 323], "restor": 236, "higher": [236, 316, 318, 319, 321, 322, 323], "offload": [236, 316, 323], "increas": [236, 253, 264, 278, 309, 318, 319, 320, 321, 322], "peak": [236, 277, 281, 286, 316, 318, 320, 323], "gpu": [236, 309, 311, 314, 316, 317, 318, 319, 320, 321, 322, 323], "_register_state_dict_hook": 236, "mymodul": 236, "_after_": 236, "nf4tensor": [236, 323], "unquant": [236, 322, 323], "unus": 236, "ignore_index": [237, 238, 239, 319], "entropi": [237, 239, 319], "bf16": [237, 239, 279, 316, 321, 323], "ce": [237, 319], "better": [237, 239, 267, 308, 315, 319, 322], "accuraci": [237, 239, 311, 316, 318, 319, 320, 321, 322, 323], "doubl": [237, 239, 323], "num_token": [237, 238, 239], "consider": [237, 239], "compute_cross_entropi": 237, "gain": [237, 311, 318], "won": [237, 315, 321], "realiz": 237, "pull": [237, 309, 310, 311, 314], "1390": 237, "loss_fn": [237, 239], "chunkedcrossentropyloss": 237, "output_chunk": [237, 239], "kullback": [238, 319], "leibler": [238, 319], "diverg": [238, 239, 263, 319], "jongwooko": [238, 319], "distillm": [238, 319], "17c0f98bc263b1861a02d5df578c84aea652ee65": 238, "student_logit": [238, 239, 319], "teacher_logit": [238, 239, 319], "student": [238, 239], "teacher": [238, 239], "kl": [238, 239, 263, 319], "teacher_chunk": 239, "teacher_model": 239, "model_fus": [240, 241, 242, 243, 244], "deepfus": 240, "evolut": 240, "interspers": 240, "assumpt": 240, "signatur": 240, "interchang": 240, "fusion_param": [240, 241, 242, 243, 244], "fusionembed": 240, "fusion_lay": [240, 242], "clip_vit_224": [240, 244], "feedforward": [240, 244], "register_fusion_modul": 240, "strict": [240, 241, 242, 252, 320], "freez": [240, 316, 320], "fusion_vocab_s": 241, "necessit": 241, "rout": 241, "128": [241, 310, 318, 320], 
"fusion_first": 242, "flamingo": [242, 261], "shot": [242, 316, 318, 322], "infus": 242, "interpret": 242, "enocd": 242, "isn": [242, 279, 314, 316, 321], "fused_lay": 242, "mark": [244, 315], "earli": 244, "peft": [245, 246, 247, 248, 249, 250, 251, 252, 270, 309, 310, 316, 320, 323], "adapter_param": [245, 246, 247, 248, 249, 251], "proj": 245, "loralinear": [245, 320, 321, 323], "alpha": [246, 247, 320, 321, 323], "use_bia": [246, 247], "scalar": [246, 287, 288, 289, 290, 291, 321], "orient": [246, 321], "bax": [246, 247], "distinct": [246, 323], "to_nf4": [246, 247, 323], "block_siz": [246, 247], "scaler_block_s": [246, 247], "granular": [246, 247], "scaler": [246, 247], "lora_a": [246, 247, 320, 323], "lora_b": [246, 247, 320, 323], "initialize_dora_magnitud": 246, "to_empti": [246, 247], "recurs": [246, 247, 290], "submodul": [246, 247], "perturb": 247, "decomposit": [247, 320, 321], "matric": [247, 320, 323], "mapsto": 247, "w_0x": 247, "r": [247, 309, 316, 320], "polici": [248, 263, 264, 265, 266, 267, 293, 309, 312], "neural": [248, 309, 320, 323], "shard": [250, 271, 277, 318], "get_adapter_param": [251, 320], "base_miss": 252, "base_unexpect": 252, "lora_miss": 252, "lora_unexpect": 252, "reli": [252, 259, 316, 318], "unexpect": 252, "nonempti": 252, "prob_max": 253, "prob_layer_scal": 253, "scaletyp": 253, "uniform": 253, "layers_str": 253, "modulelayerdropoutwrapp": 253, "inplac": [253, 299, 320], "mymodel": 253, "super": [253, 319], "uniformli": 253, "layerdrop": 253, "fan": 253, "et": [253, 320], "al": [253, 320], "1909": 253, "11556v1": 253, "linearli": [253, 278], "zhang": 253, "13369": 253, "exponenti": 253, "layerskip": 253, "elhoushi": 253, "2404": 253, "16710": 253, "exp": 253, "tiktokenbasetoken": 254, "light": 256, "sentencepieceprocessor": 256, "trim": 256, "whitespac": 256, "spm_model": [256, 315], "tokenized_text": [256, 257], "add_bo": [256, 257, 315], "trim_leading_whitespac": 256, "prefix": [256, 321], "unbatch": 256, "bos_id": [257, 259], "substr": 257, "repetit": 257, "speed": [257, 296, 318, 321, 322, 323], "identif": 257, "regex": 257, "absent": 257, "tt_model": 257, "tokenizer_json_path": 258, "heavili": 259, "concat": 259, "1788": 259, "2643": 259, "465": 259, "22137": 259, "join": 259, "satisfi": 259, "loos": 260, "image_token_id": 261, "particip": [261, 262], "laid": 261, "fig": 261, "2204": 261, "14198": 261, "immedi": [261, 321], "until": [261, 321], "img3": 261, "equal": [261, 303, 321], "gamma": [262, 266, 267, 309], "lmbda": 262, "estim": [262, 263], "1506": 262, "02438": 262, "response_len": [262, 263], "receiv": 262, "discount": 262, "gae": 262, "logprob": [263, 267], "ref_logprob": 263, "kl_coeff": 263, "valid_score_idx": 263, "coeffici": [263, 265], "total_reward": 263, "kl_reward": 263, "beta": [264, 267], "label_smooth": [264, 267], "18290": 264, "intuit": [264, 266, 267], "dispref": 264, "dynam": [264, 322], "degener": 264, "occur": [264, 311], "naiv": 264, "trl": [264, 266, 267], "5d1deb1445828cfd0e947cb3a7925b1c03a283fc": 264, "dpo_train": [264, 266], "l844": 264, "2009": 264, "01325": 264, "baselin": [264, 265, 319, 320], "uncertainti": [264, 267], "policy_chosen_logp": [264, 266], "policy_rejected_logp": [264, 266], "reference_chosen_logp": [264, 266], "reference_rejected_logp": [264, 266], "chosen_reward": [264, 266], "rejected_reward": [264, 266], "value_clip_rang": 265, "value_coeff": 265, "proxim": [265, 312], "1707": 265, "06347": 265, "eqn": 265, "vwxyzjn": 265, "ccc19538e817e98a60d3253242ac15e2a562cb49": 265, 
"lm_human_preference_detail": 265, "train_policy_acceler": 265, "l719": 265, "ea25b9e8b234e6ee1bca43083f8f3cf974143998": 265, "ppo2": 265, "l68": 265, "l75": 265, "pi_old_logprob": 265, "pi_logprob": 265, "phi_old_valu": 265, "phi_valu": 265, "value_padding_mask": 265, "old": 265, "participag": 265, "five": 265, "policy_loss": 265, "value_loss": 265, "clipfrac": 265, "fraction": 265, "statist": [266, 309, 321], "rso": [266, 309], "hing": [266, 309], "2309": [266, 309], "06657": [266, 309], "logist": 266, "regress": 266, "slic": 266, "10425": 266, "almost": [266, 320], "svm": 266, "counter": 266, "4dce042a3863db1d375358e8c8092b874b02934b": 266, "l1141": 266, "2405": 267, "14734": 267, "simpo": 267, "averag": [267, 319], "implicit": 267, "margin": [267, 309], "bradlei": 267, "terri": 267, "larger": [267, 272, 318, 319, 321], "win": 267, "lose": 267, "98ad01ddfd1e1b67ec018014b83cba40e0caea66": 267, "cpo_train": 267, "l603": 267, "pretti": 267, "identitc": 267, "elimin": 267, "kind": 267, "ipoloss": 267, "fill_valu": 268, "sequence_length": 268, "stop_token_id": 268, "869": 268, "eos_mask": 268, "truncated_sequ": 268, "filename_format": [269, 270], "max_filenam": [269, 270], "concis": 269, "filenam": [269, 288], "file_": 269, "_of_": 269, "n_file": 269, "build_checkpoint_filenam": 269, "00003": [269, 319], "file_00001_of_00003": 269, "file_00002_of_00003": 269, "file_00003_of_00003": 269, "recipe_checkpoint": [270, 271, 272, 322], "safe_seri": 270, "should_load_recipe_st": [270, 271, 272], "from_pretrain": 270, "0001_of_0003": 270, "0002_of_0003": 270, "largest_epoch": [270, 271, 272], "recipe_state_dirnam": [270, 271, 272], "receip": 270, "deprec": [270, 271, 272], "preserv": [270, 323], "weight_map": 270, "convert_weight": 270, "_model_typ": [270, 273], "intermediate_checkpoint": [270, 271, 272], "adapter_onli": [270, 271, 272], "_weight_map": 270, "wip": 271, "qualnam": 273, "boundari": 273, "distinguish": 273, "llama3_vis": 273, "llama3_2_vision_decod": 273, "mistral_reward_7b": 273, "clip_text": 273, "clip_text_encoder_larg": 273, "my_new_model": 273, "my_custom_state_dict_map": 273, "optim_map": 274, "bare": 274, "bone": 274, "optim_dict": [274, 276, 292], "p": [274, 309, 320, 322, 323], "cfg_optim": 274, "ckpt": 274, "optim_ckpt": 274, "placeholder_optim_dict": 274, "optiminbackwardwrapp": 274, "get_last_lr": 274, "rate": [274, 278, 280, 308, 316, 317, 321], "schedul": [274, 278, 296, 316, 317, 321], "get_optim_kei": 274, "arbitrari": [274, 320, 321], "optim_ckpt_map": 274, "set_lr_schedul": 274, "lr_schedul": [274, 278], "lrschedul": 274, "loadabl": 274, "step_lr_schedul": 274, "ac_mod": 275, "ac_opt": 275, "op": [275, 322], "ac": 275, "optimizerinbackwardwrapp": [276, 280], "named_paramet": [276, 297], "sharded_sd": 277, "dtensor": 277, "is_rank_zero": 277, "rank0": 277, "num_warmup_step": 278, "num_training_step": 278, "num_cycl": [278, 296], "last_epoch": 278, "lambdalr": 278, "decreas": [278, 320, 321, 322, 323], "cosin": 278, "v4": 278, "src": 278, "l104": 278, "warmup": [278, 296], "phase": 278, "wave": 278, "half": [278, 321], "kernel": 279, "warpper": 280, "optimizer_in_backward": 280, "reset_stat": 281, "track": [281, 287], "alloc": [281, 286, 316, 318, 321, 323], "reserv": [281, 286, 315, 316, 323], "stat": [281, 286, 316, 323], "int4": [282, 322], "4w": 282, "recogn": 282, "int8dynactint4weightquant": [282, 311, 322], "8da4w": [282, 322], "int4weightonlyquant": [282, 322], "int8dynactint4weightqatquant": [282, 311, 322], "qat": [282, 306, 312], "int4weightonlyqatquant": 282, 
"master": 285, "port": [285, 314], "address": [285, 319, 321], "peak_memory_act": 286, "peak_memory_alloc": 286, "peak_memory_reserv": 286, "get_memory_stat": 286, "api_kei": 287, "experiment_kei": 287, "onlin": 287, "log_cod": 287, "comet": 287, "site": 287, "ml": 287, "team": 287, "compar": [287, 290, 303, 316, 318, 319, 320, 322, 323], "sdk": 287, "uncategor": 287, "alphanumer": 287, "charact": 287, "get_or_cr": 287, "fresh": 287, "persist": 287, "hpo": 287, "sweep": 287, "server": 287, "offlin": 287, "auto": [287, 314, 316], "creation": 287, "experimentconfig": 287, "project_nam": 287, "my_workspac": 287, "my_metr": [287, 290, 291], "importerror": [287, 291], "termin": [287, 290, 291], "comet_api_kei": 287, "flush": [287, 288, 289, 290, 291], "ndarrai": [287, 288, 289, 290, 291], "record": [287, 288, 289, 290, 291, 296], "log_config": [287, 291], "payload": [287, 288, 289, 290, 291], "log_": 288, "unixtimestamp": 288, "thread": 288, "safe": [288, 309], "organize_log": 290, "tensorboard": 290, "subdirectori": 290, "logdir": 290, "startup": 290, "tfevent": 290, "encount": 290, "frontend": 290, "organ": [290, 314], "accordingli": [290, 322], "my_log_dir": 290, "view": [290, 319], "entiti": 291, "bias": [291, 320, 323], "usernam": [291, 314, 316], "my_ent": 291, "my_group": 291, "account": [291, 320, 323], "link": [291, 316, 318], "capecap": 291, "6053ofw0": 291, "torchtune_config_j67sb73v": 291, "soon": [292, 321], "readi": [292, 306, 315, 322], "grad": 292, "auto_wrap_polici": 293, "acwrappolicytyp": 293, "author": [293, 308, 317, 321, 323], "fsdp_adavnced_tutori": 293, "insid": 294, "contextmanag": 294, "debug_mod": 295, "pseudo": 295, "commonli": [295, 309, 320, 321, 323], "numpi": 295, "determinist": 295, "global": [295, 321], "warn": 295, "nondeterminist": 295, "cudnn": 295, "set_deterministic_debug_mod": 295, "profile_memori": 296, "with_stack": 296, "record_shap": 296, "with_flop": 296, "wait_step": 296, "warmup_step": 296, "active_step": 296, "profil": [296, 316], "layout": 296, "trace": 296, "profileract": 296, "gradient_accumul": 296, "sensibl": 296, "default_schedul": 296, "reduct": [296, 311, 320], "scope": 296, "flop": 296, "cycl": 296, "repeat": 296, "model_named_paramet": 297, "force_overrid": 297, "concret": [297, 321], "vocab_dim": 297, "named_param": 298, "exclude_param_nam": 298, "too": [299, 311, 318], "npu": 300, "xpu": 300, "handler": 301, "_log": 301, "aka": 302, "__version__": 303, "generated_examples_python": 304, "galleri": [304, 313], "sphinx": 304, "000": [305, 313, 318], "execut": [305, 313], "generated_exampl": 305, "mem": [305, 313], "mb": [305, 313], "gentl": 306, "introduct": 306, "first_finetune_tutori": 306, "maxim": [306, 308], "kd": 306, "torchvis": 307, "torchao": [307, 311, 316, 318, 321, 322, 323], "latest": [307, 311, 317, 321, 323], "whl": 307, "cu121": 307, "cu118": 307, "cu124": 307, "confirm": 307, "And": 307, "welcom": [307, 314], "greatest": [307, 317], "contributor": 307, "dev": 307, "commit": 307, "branch": 307, "therebi": [307, 321, 322, 323], "forc": [307, 319], "reinstal": 307, "opt": [307, 317], "suffix": 307, "On": [308, 320], "emphas": 308, "simplic": 308, "component": 308, "prove": 308, "democrat": 308, "zoo": 308, "varieti": [308, 309, 320], "integr": [308, 316, 317, 318, 320, 322, 323], "fsdp2": [308, 321], "excit": 308, "checkout": 308, "quickstart": 308, "attain": 308, "embodi": 308, "philosophi": 308, "usabl": 308, "composit": 308, "hard": 308, "outlin": 308, "unecessari": 308, "never": 308, "thoroughli": 308, "aim": [309, 319, 
321], "steer": 309, "goal": [309, 322], "domain": 309, "interest": [309, 310, 311, 319], "sft": 309, "grant": [309, 310, 311, 317], "lora_dpo_single_devic": 309, "8b_lora_dpo_single_devic": 309, "lora_dpo_distribut": 309, "8b_lora_dpo": 309, "primer": 309, "un": [309, 311], "degrad": [309, 311, 321, 322, 323], "grow": 309, "art": [309, 316], "good": [309, 315, 320, 321], "penal": 309, "gap": 309, "7b_lora_dpo_single_devic": 309, "rsoloss": 309, "lever": [309, 310, 311], "rafailov": 309, "sharma": 309, "mitchel": 309, "man": 309, "ermon": 309, "finn": 309, "2024": 309, "secretli": 309, "liu": 309, "zhao": 309, "joshi": 309, "khalman": 309, "saleh": 309, "2023": 309, "preprint": 309, "competit": 310, "8b_lora_single_devic": [310, 314, 315, 316, 318, 319, 321], "action": [310, 311], "simul": [311, 321, 322], "compromis": 311, "blogpost": [311, 321], "qat_distribut": [311, 322], "8b_qat_ful": [311, 322], "workload": [311, 321, 322], "least": [311, 318, 320, 321, 322], "vram": [311, 318, 320, 321, 322], "80gb": [311, 322], "a100": 311, "h100": 311, "delai": 311, "fake": [311, 316, 322], "empir": [311, 322], "fake_quant_after_n_step": [311, 322], "idea": [311, 319, 323], "roughli": [311, 321], "total_step": 311, "futur": [311, 322], "plan": [311, 316], "groupsiz": [311, 322], "256": [311, 318, 321, 322], "hackabl": [312, 317], "singularli": [312, 317], "technic": [312, 317], "awar": [312, 321, 322], "tracker": 312, "short": 314, "subcommand": 314, "anytim": 314, "kaggl": 314, "symlink": 314, "authent": [314, 317], "successfulli": [314, 316, 317], "wrote": [314, 316], "bin": [314, 316], "metaresearch": 314, "lot": [314, 316, 321], "recent": 314, "releas": [314, 318], "agre": 314, "term": [314, 321], "perman": 314, "eat": 314, "bandwith": 314, "00030": 314, "ootb": 314, "7b_full_low_memori": [314, 316, 317], "8b_full_single_devic": [314, 316], "mini_full_low_memori": [314, 316], "7b_full": [314, 316, 317], "13b_full": [314, 316, 317], "70b_full": 314, "edit": 314, "clobber": 314, "destin": 314, "lora_finetune_distribut": [314, 318, 320], "torchrun": 314, "launch": [314, 315, 317], "nproc": 314, "node": 314, "worker": 314, "nnode": [314, 320, 322], "minimum_nod": 314, "maximum_nod": 314, "fail": 314, "rdzv": 314, "rendezv": 314, "endpoint": 314, "8b_lora": [314, 318], "bypass": 314, "fancy_lora": 314, "8b_fancy_lora": 314, "nice": 315, "meet": 315, "overhaul": 315, "multiturn": 315, "accompani": 315, "who": 315, "influenti": 315, "hip": 315, "hop": 315, "artist": 315, "2pac": 315, "rakim": 315, "flavor": 315, "formatted_messag": 315, "nyou": 315, "nwho": 315, "518": 315, "25580": 315, "29962": 315, "3532": 315, "14816": 315, "29903": 315, "6778": 315, "_spm_model": 315, "piece_to_id": 315, "learnt": 315, "manual": [315, 316, 323], "529": 315, "29879": 315, "29958": 315, "nhere": 315, "pure": 315, "mess": 315, "prime": 315, "strictli": 315, "ask": [315, 321], "though": 315, "robust": 315, "pretend": 315, "zuckerberg": 315, "seem": [315, 316], "altogeth": 315, "honor": 315, "custom_8b_lora_single_devic": 315, "favorit": [316, 320], "seamlessli": 316, "connect": [316, 322], "filesystem": 316, "gitattribut": 316, "percentag": 316, "16gb": [316, 320], "rtx": 316, "3090": 316, "3b_full_single_devic": 316, "7b_full_single_devic": 316, "1b_full": 316, "3b_full": 316, "9b_full": 316, "27b_full": 316, "mini_ful": 316, "7b_qlora_single_devic": [316, 317, 323], "hour": 316, "3b_lora_single_devic": 316, "3977464327": 316, "hint": [316, 321], "enable_activation_checkpoint": [316, 321], 
"enable_activation_offload": [316, 321], "log_1734708879": 316, "gib": 316, "943998098373413": 316, "1617": 316, "04": [316, 320], "87": [316, 320], "facilit": 316, "100mib": 316, "But": [316, 320], "realli": 316, "eleuther_ev": [316, 318, 322], "eleuther_evalu": [316, 318, 322], "lm_eval": [316, 318], "custom_eval_config": [316, 318], "todo": 316, "TO": 316, "1234": 316, "truthfulqa_mc2": [316, 318, 319, 320], "max_seq_length": [316, 322], "enable_kv_cach": 316, "measur": [316, 318], "propens": [316, 318], "324": 316, "loglikelihood": 316, "custom_generation_config": [316, 318], "again": 316, "tell": 316, "me": 316, "joke": 316, "max_new_token": 316, "300": 316, "kick": 316, "noodl": 316, "impasta": 316, "int8_weight_onli": [316, 318], "int8_dynamic_activation_int8_weight": [316, 318], "ao": [316, 318], "quant_api": [316, 318], "quantize_": [316, 318], "int4_weight_onli": [316, 318], "previous": [316, 318, 320], "production": 316, "publish": 316, "benefit": 316, "peftmodel": 316, "automodelforcausallm": 316, "autotoken": 316, "trained_model_path": 316, "original_model_nam": 316, "peft_model": 316, "generate_text": 316, "max_length": 316, "return_tensor": 316, "pretrained_model_name_or_path": 316, "awesom": 316, "throughput": 316, "incom": [316, 323], "specul": 316, "succesfulli": 316, "base_model": [316, 320], "samplingparam": 316, "print_output": 316, "generated_text": 316, "80": 316, "load_format": 316, "kv_cache_dtyp": 316, "sampling_param": 316, "max_token": 316, "essai": 316, "educ": 316, "use_tqdm": 316, "great": [316, 321], "huggingface_hub": 316, "hfapi": 316, "whoami": 316, "repo_nam": 316, "repo_id": 316, "create_repo": 316, "upload_fold": 316, "folder_path": 316, "repo_typ": 316, "create_pr": 316, "hopefulli": 316, "gave": 316, "minut": 317, "agreement": 317, "depth": 317, "principl": 317, "boilerpl": 317, "substanti": [317, 320], "custom_config": 317, "replic": 317, "lorafinetunerecipesingledevic": 317, "lora_finetune_output": 317, "log_1713194212": 317, "3697006702423096": 317, "25880": [317, 323], "01": 317, "83it": 317, "monitor": 317, "tqdm": 317, "e2": 317, "focu": 318, "theta": 318, "observ": [318, 322], "gb": [318, 320, 322, 323], "consum": [318, 323], "overal": [318, 319], "leverag": [318, 323], "8b_qlora_single_devic": [318, 321], "fact": [318, 320], "coupl": [318, 320, 323], "meta_model_0": [318, 322], "122": 318, "sarah": 318, "busi": 318, "mum": 318, "young": 318, "children": 318, "live": 318, "north": 318, "east": 318, "england": 318, "135": 318, "88": 318, "sec": 318, "94": 318, "138": 318, "bandwidth": 318, "346": 318, "09": 318, "139": 318, "broader": 318, "teach": [319, 320], "straight": [319, 320], "jump": [319, 320], "compress": 319, "transfer": 319, "capac": 319, "computation": 319, "expens": 319, "deploi": 319, "imit": 319, "diagram": 319, "minillm": 319, "forwardklloss": 319, "teacher_prob": 319, "student_logprob": 319, "log_softmax": 319, "prod_prob": 319, "sum": [319, 320], "forwardklwithchunkedoutputloss": 319, "knowledge_distillation_single_devic": 319, "bit": [319, 320, 321, 322, 323], "alpaca_cleaned_dataset": 319, "hellaswag": [319, 322], "commonsense_qa": 319, "kd_ratio": 319, "teacher_checkpoint": 319, "00004": 319, "truthfulqa": [319, 320], "commonsens": 319, "constant": 319, "hf_model_0001_0": 319, "boost": 319, "graph": [319, 321], "irrespect": 319, "3e": 319, "truthful_qa": 319, "wherea": 319, "unfamiliar": 320, "oppos": [320, 323], "momentum": [320, 321], "aghajanyan": 320, "hypothes": 320, "intrins": 320, "eight": 320, "practic": [320, 
321], "blue": 320, "although": [320, 321, 322], "rememb": 320, "approx": 320, "15m": 320, "65k": 320, "99": 320, "requires_grad": [320, 323], "frozen_out": [320, 323], "lora_out": [320, 323], "lora_model": 320, "lora_llama_2_7b": [320, 323], "alon": 320, "in_featur": [320, 322], "out_featur": [320, 322], "validate_missing_and_unexpected_for_lora": 320, "peft_util": 320, "set_trainable_param": 320, "lora_param": 320, "total_param": 320, "numel": 320, "trainable_param": 320, "2f": 320, "6742609920": 320, "4194304": 320, "7b_lora": 320, "my_model_checkpoint_path": [320, 322, 323], "tokenizer_checkpoint": [320, 322, 323], "my_tokenizer_checkpoint_path": [320, 322, 323], "clone": [320, 322, 323], "constraint": 320, "factori": 320, "benefici": 320, "impact": [320, 321], "minor": 320, "lora_experiment_1": 320, "smooth": [320, 323], "curv": [320, 323], "500": 320, "ran": 320, "footprint": [320, 322], "commod": 320, "cogniz": 320, "ax": 320, "parallel": 320, "475": 320, "508": 320, "86": 320, "504": 320, "514": 320, "lowest": 320, "absolut": 320, "4gb": 320, "tradeoff": 320, "salman": 321, "mohammadi": 321, "brief": 321, "glossari": 321, "constrain": [321, 322], "oom": 321, "adam": 321, "gradient_accumulation_step": 321, "cost": 321, "ram": 321, "priorit": 321, "sebastian": 321, "raschka": 321, "fp16": 321, "sound": 321, "quot": 321, "aliv": 321, "region": 321, "bring": 321, "autograd": [321, 323], "saved_tensors_hook": 321, "cours": 321, "runtim": 321, "overlap": 321, "hide": 321, "unless": 321, "total_batch_s": 321, "count": 321, "suppos": 321, "log_every_n_step": 321, "translat": 321, "frequent": 321, "slowli": 321, "num_devic": 321, "artifici": 321, "faster": 321, "prototyp": 321, "low_bit_optim": 321, "adamw8bit": 321, "bnb": 321, "modern": 321, "converg": 321, "stateless": 321, "stochast": 321, "descent": 321, "sacrif": 321, "optimizer_in_bwd": 321, "cpuoffloadoptim": 321, "offload_gradi": 321, "4e": 321, "bottleneck": 321, "slowdown": 321, "amort": 321, "4x": 321, "fsdp_cpu_offload": 321, "fullyshardeddataparallel": 321, "fsdp1": 321, "greatli": 321, "lora_": 321, "lora_llama3": 321, "_lora": 321, "firstli": 321, "secondli": 321, "affect": 321, "fashion": 321, "slower": [321, 323], "jointli": 321, "sens": 321, "novel": 321, "normalfloat": [321, 323], "8x": [321, 323], "worth": 321, "cast": [321, 322], "datatyp": [321, 323], "incur": [321, 322, 323], "penalti": 321, "qlora_": 321, "qlora_llama3_8b": 321, "_qlora": 321, "particularli": 321, "reap": 321, "hood": [321, 323], "doralinear": 321, "swap": [321, 322], "perplex": 322, "ptq": 322, "kept": 322, "nois": 322, "henc": 322, "x_q": 322, "int8": 322, "zp": 322, "x_float": 322, "qmin": 322, "qmax": 322, "clamp": 322, "x_fq": 322, "dequant": 322, "proce": 322, "prepared_model": 322, "int8dynactint4weightqatlinear": 322, "int8dynactint4weightlinear": 322, "train_loop": 322, "converted_model": 322, "recov": 322, "custom_8b_qat_ful": 322, "2000": 322, "1000": 322, "memory_efficient_fsdp_wrap": 322, "led": 322, "presum": 322, "mutat": 322, "5gb": 322, "custom_quant": 322, "poorli": 322, "custom_eleuther_evalu": 322, "fullmodeltorchtunecheckpoint": 322, "my_eleuther_evalu": 322, "stderr": 322, "word_perplex": 322, "9148": 322, "byte_perplex": 322, "5357": 322, "bits_per_byt": 322, "6189": 322, "acc": 322, "5687": 322, "0049": 322, "acc_norm": 322, "7536": 322, "0043": 322, "74": 322, "048": 322, "190": 322, "7735": 322, "5598": 322, "6413": 322, "5481": 322, "0050": 322, "7390": 322, "0044": 322, "7251": 322, "4994": 322, "5844": 322, "5740": 322, 
"7610": 322, "outperform": 322, "importantli": 322, "characterist": 322, "958": 322, "halv": 322, "motiv": 322, "edg": 322, "smartphon": 322, "executorch": 322, "xnnpack": 322, "export_llama": 322, "use_sdpa_with_kv_cach": 322, "qmode": 322, "group_siz": 322, "get_bos_id": 322, "get_eos_id": 322, "output_nam": 322, "llama3_8da4w": 322, "pte": 322, "881": 322, "oneplu": 322, "709": 322, "tok": 322, "815": 322, "316": 322, "364": 322, "highli": 323, "vanilla": 323, "held": 323, "bespok": 323, "vast": 323, "major": 323, "normatfloat": 323, "deepdiv": 323, "de": 323, "counterpart": 323, "set_default_devic": 323, "qlora_linear": 323, "memory_alloc": 323, "177": 323, "152": 323, "del": 323, "empty_cach": 323, "lora_linear": 323, "081": 323, "344": 323, "qlora_llama2_7b": 323, "qlora_model": 323, "essenti": 323, "reparametrize_as_dtype_state_dict_post_hook": 323, "96": 323, "98": 323, "149": 323, "9157477021217346": 323, "02": 323, "08": 323, "15it": 323, "nightli": 323, "200": 323, "hundr": 323, "228": 323, "8158286809921265": 323, "95it": 323, "exercis": 323, "linear_nf4": 323, "linear_weight": 323}, "objects": {"torchtune.config": [[27, 0, 1, "", "instantiate"], [28, 0, 1, "", "log_config"], [29, 0, 1, "", "parse"], [30, 0, 1, "", "validate"]], "torchtune.data": [[31, 1, 1, "", "AlpacaToMessages"], [32, 1, 1, "", "ChatMLTemplate"], [33, 1, 1, "", "ChosenRejectedToMessages"], [34, 2, 1, "", "GrammarErrorCorrectionTemplate"], [35, 1, 1, "", "InputOutputToMessages"], [36, 1, 1, "", "Message"], [37, 1, 1, "", "OpenAIToMessages"], [38, 1, 1, "", "PromptTemplate"], [39, 1, 1, "", "PromptTemplateInterface"], [40, 2, 1, "", "QuestionAnswerTemplate"], [41, 2, 1, "", "Role"], [42, 1, 1, "", "ShareGPTToMessages"], [43, 2, 1, "", "SummarizeTemplate"], [44, 0, 1, "", "format_content_with_images"], [45, 0, 1, "", "left_pad_sequence"], [46, 0, 1, "", "load_image"], [47, 0, 1, "", "padded_collate"], [48, 0, 1, "", "padded_collate_dpo"], [49, 0, 1, "", "padded_collate_sft"], [50, 0, 1, "", "padded_collate_tiled_images_and_mask"], [51, 0, 1, "", "truncate"], [52, 0, 1, "", "validate_messages"]], "torchtune.data.Message": [[36, 3, 1, "", "contains_media"], [36, 4, 1, "", "from_dict"], [36, 4, 1, "", "get_media"], [36, 3, 1, "", "text_content"]], "torchtune.datasets": [[53, 1, 1, "", "ConcatDataset"], [54, 1, 1, "", "PackedDataset"], [55, 1, 1, "", "PreferenceDataset"], [56, 1, 1, "", "SFTDataset"], [57, 1, 1, "", "TextCompletionDataset"], [58, 0, 1, "", "alpaca_cleaned_dataset"], [59, 0, 1, "", "alpaca_dataset"], [60, 0, 1, "", "chat_dataset"], [61, 0, 1, "", "cnn_dailymail_articles_dataset"], [62, 0, 1, "", "grammar_dataset"], [63, 0, 1, "", "hh_rlhf_helpful_dataset"], [64, 0, 1, "", "instruct_dataset"], [68, 0, 1, "", "preference_dataset"], [69, 0, 1, "", "samsum_dataset"], [70, 0, 1, "", "slimorca_dataset"], [71, 0, 1, "", "stack_exchange_paired_dataset"], [72, 0, 1, "", "text_completion_dataset"], [73, 0, 1, "", "wikitext_dataset"]], "torchtune.datasets.multimodal": [[65, 0, 1, "", "llava_instruct_dataset"], [66, 0, 1, "", "the_cauldron_dataset"], [67, 0, 1, "", "vqa_dataset"]], "torchtune.generation": [[74, 0, 1, "", "generate"], [75, 0, 1, "", "generate_next_token"], [76, 0, 1, "", "get_causal_mask_from_padding_mask"], [77, 0, 1, "", "get_position_ids_from_padding_mask"], [78, 0, 1, "", "sample"]], "torchtune.models.clip": [[79, 1, 1, "", "TilePositionalEmbedding"], [80, 1, 1, "", "TiledTokenPositionalEmbedding"], [81, 1, 1, "", "TokenPositionalEmbedding"], [82, 0, 1, "", "clip_vision_encoder"]], 
"torchtune.models.clip.TilePositionalEmbedding": [[79, 4, 1, "", "forward"]], "torchtune.models.clip.TiledTokenPositionalEmbedding": [[80, 4, 1, "", "forward"]], "torchtune.models.clip.TokenPositionalEmbedding": [[81, 4, 1, "", "forward"]], "torchtune.models.code_llama2": [[83, 0, 1, "", "code_llama2_13b"], [84, 0, 1, "", "code_llama2_70b"], [85, 0, 1, "", "code_llama2_7b"], [86, 0, 1, "", "lora_code_llama2_13b"], [87, 0, 1, "", "lora_code_llama2_70b"], [88, 0, 1, "", "lora_code_llama2_7b"], [89, 0, 1, "", "qlora_code_llama2_13b"], [90, 0, 1, "", "qlora_code_llama2_70b"], [91, 0, 1, "", "qlora_code_llama2_7b"]], "torchtune.models.gemma": [[92, 0, 1, "", "gemma"], [93, 0, 1, "", "gemma_2b"], [94, 0, 1, "", "gemma_7b"], [95, 0, 1, "", "gemma_tokenizer"], [96, 0, 1, "", "lora_gemma"], [97, 0, 1, "", "lora_gemma_2b"], [98, 0, 1, "", "lora_gemma_7b"], [99, 0, 1, "", "qlora_gemma_2b"], [100, 0, 1, "", "qlora_gemma_7b"]], "torchtune.models.gemma2": [[101, 0, 1, "", "gemma2"], [102, 0, 1, "", "gemma2_27b"], [103, 0, 1, "", "gemma2_2b"], [104, 0, 1, "", "gemma2_9b"], [105, 0, 1, "", "lora_gemma2"], [106, 0, 1, "", "lora_gemma2_27b"], [107, 0, 1, "", "lora_gemma2_2b"], [108, 0, 1, "", "lora_gemma2_9b"], [109, 0, 1, "", "qlora_gemma2_27b"], [110, 0, 1, "", "qlora_gemma2_2b"], [111, 0, 1, "", "qlora_gemma2_9b"]], "torchtune.models.llama2": [[112, 1, 1, "", "Llama2ChatTemplate"], [113, 0, 1, "", "llama2"], [114, 0, 1, "", "llama2_13b"], [115, 0, 1, "", "llama2_70b"], [116, 0, 1, "", "llama2_7b"], [117, 0, 1, "", "llama2_reward_7b"], [118, 0, 1, "", "llama2_tokenizer"], [119, 0, 1, "", "lora_llama2"], [120, 0, 1, "", "lora_llama2_13b"], [121, 0, 1, "", "lora_llama2_70b"], [122, 0, 1, "", "lora_llama2_7b"], [123, 0, 1, "", "lora_llama2_reward_7b"], [124, 0, 1, "", "qlora_llama2_13b"], [125, 0, 1, "", "qlora_llama2_70b"], [126, 0, 1, "", "qlora_llama2_7b"], [127, 0, 1, "", "qlora_llama2_reward_7b"]], "torchtune.models.llama3": [[128, 0, 1, "", "llama3"], [129, 0, 1, "", "llama3_70b"], [130, 0, 1, "", "llama3_8b"], [131, 0, 1, "", "llama3_tokenizer"], [132, 0, 1, "", "lora_llama3"], [133, 0, 1, "", "lora_llama3_70b"], [134, 0, 1, "", "lora_llama3_8b"], [135, 0, 1, "", "qlora_llama3_70b"], [136, 0, 1, "", "qlora_llama3_8b"]], "torchtune.models.llama3_1": [[137, 0, 1, "", "llama3_1"], [138, 0, 1, "", "llama3_1_405b"], [139, 0, 1, "", "llama3_1_70b"], [140, 0, 1, "", "llama3_1_8b"], [141, 0, 1, "", "lora_llama3_1"], [142, 0, 1, "", "lora_llama3_1_405b"], [143, 0, 1, "", "lora_llama3_1_70b"], [144, 0, 1, "", "lora_llama3_1_8b"], [145, 0, 1, "", "qlora_llama3_1_405b"], [146, 0, 1, "", "qlora_llama3_1_70b"], [147, 0, 1, "", "qlora_llama3_1_8b"]], "torchtune.models.llama3_2": [[148, 0, 1, "", "llama3_2_1b"], [149, 0, 1, "", "llama3_2_3b"], [150, 0, 1, "", "lora_llama3_2_1b"], [151, 0, 1, "", "lora_llama3_2_3b"], [152, 0, 1, "", "qlora_llama3_2_1b"], [153, 0, 1, "", "qlora_llama3_2_3b"]], "torchtune.models.llama3_2_vision": [[154, 1, 1, "", "Llama3VisionEncoder"], [155, 1, 1, "", "Llama3VisionProjectionHead"], [156, 1, 1, "", "Llama3VisionTransform"], [157, 0, 1, "", "llama3_2_vision_11b"], [158, 0, 1, "", "llama3_2_vision_decoder"], [159, 0, 1, "", "llama3_2_vision_encoder"], [160, 0, 1, "", "llama3_2_vision_transform"], [161, 0, 1, "", "lora_llama3_2_vision_11b"], [162, 0, 1, "", "lora_llama3_2_vision_decoder"], [163, 0, 1, "", "lora_llama3_2_vision_encoder"], [164, 0, 1, "", "qlora_llama3_2_vision_11b"]], "torchtune.models.llama3_2_vision.Llama3VisionEncoder": [[154, 4, 1, "", "forward"]], 
"torchtune.models.llama3_2_vision.Llama3VisionProjectionHead": [[155, 4, 1, "", "forward"]], "torchtune.models.llama3_2_vision.Llama3VisionTransform": [[156, 4, 1, "", "decode"], [156, 4, 1, "", "tokenize_message"], [156, 4, 1, "", "tokenize_messages"]], "torchtune.models.llama3_3": [[165, 0, 1, "", "llama3_3_70b"], [166, 0, 1, "", "lora_llama3_3_70b"], [167, 0, 1, "", "qlora_llama3_3_70b"]], "torchtune.models.mistral": [[168, 1, 1, "", "MistralChatTemplate"], [169, 0, 1, "", "lora_mistral"], [170, 0, 1, "", "lora_mistral_7b"], [171, 0, 1, "", "lora_mistral_classifier"], [172, 0, 1, "", "lora_mistral_reward_7b"], [173, 0, 1, "", "mistral"], [174, 0, 1, "", "mistral_7b"], [175, 0, 1, "", "mistral_classifier"], [176, 0, 1, "", "mistral_reward_7b"], [177, 0, 1, "", "mistral_tokenizer"], [178, 0, 1, "", "qlora_mistral_7b"], [179, 0, 1, "", "qlora_mistral_reward_7b"]], "torchtune.models.phi3": [[180, 0, 1, "", "lora_phi3"], [181, 0, 1, "", "lora_phi3_mini"], [182, 0, 1, "", "phi3"], [183, 0, 1, "", "phi3_mini"], [184, 0, 1, "", "phi3_mini_tokenizer"], [185, 0, 1, "", "qlora_phi3_mini"]], "torchtune.models.qwen2": [[186, 0, 1, "", "lora_qwen2"], [187, 0, 1, "", "lora_qwen2_0_5b"], [188, 0, 1, "", "lora_qwen2_1_5b"], [189, 0, 1, "", "lora_qwen2_7b"], [190, 0, 1, "", "qwen2"], [191, 0, 1, "", "qwen2_0_5b"], [192, 0, 1, "", "qwen2_1_5b"], [193, 0, 1, "", "qwen2_7b"], [194, 0, 1, "", "qwen2_tokenizer"]], "torchtune.models.qwen2_5": [[195, 0, 1, "", "lora_qwen2_5_0_5b"], [196, 0, 1, "", "lora_qwen2_5_14b_base"], [197, 0, 1, "", "lora_qwen2_5_14b_instruct"], [198, 0, 1, "", "lora_qwen2_5_1_5b_base"], [199, 0, 1, "", "lora_qwen2_5_1_5b_instruct"], [200, 0, 1, "", "lora_qwen2_5_32b_base"], [201, 0, 1, "", "lora_qwen2_5_32b_instruct"], [202, 0, 1, "", "lora_qwen2_5_3b"], [203, 0, 1, "", "lora_qwen2_5_72b_base"], [204, 0, 1, "", "lora_qwen2_5_72b_instruct"], [205, 0, 1, "", "lora_qwen2_5_7b_base"], [206, 0, 1, "", "lora_qwen2_5_7b_instruct"], [207, 0, 1, "", "qwen2_5_0_5b"], [208, 0, 1, "", "qwen2_5_14b_base"], [209, 0, 1, "", "qwen2_5_14b_instruct"], [210, 0, 1, "", "qwen2_5_1_5b_base"], [211, 0, 1, "", "qwen2_5_1_5b_instruct"], [212, 0, 1, "", "qwen2_5_32b_base"], [213, 0, 1, "", "qwen2_5_32b_instruct"], [214, 0, 1, "", "qwen2_5_3b"], [215, 0, 1, "", "qwen2_5_72b_base"], [216, 0, 1, "", "qwen2_5_72b_instruct"], [217, 0, 1, "", "qwen2_5_7b_base"], [218, 0, 1, "", "qwen2_5_7b_instruct"], [219, 0, 1, "", "qwen2_5_tokenizer"]], "torchtune.modules": [[220, 1, 1, "", "FeedForward"], [221, 1, 1, "", "Fp32LayerNorm"], [222, 1, 1, "", "KVCache"], [223, 1, 1, "", "LayerDropout"], [224, 1, 1, "", "MultiHeadAttention"], [225, 1, 1, "", "RMSNorm"], [226, 1, 1, "", "RotaryPositionalEmbeddings"], [227, 1, 1, "", "TanhGate"], [228, 1, 1, "", "TiedLinear"], [229, 1, 1, "", "TransformerCrossAttentionLayer"], [230, 1, 1, "", "TransformerDecoder"], [231, 1, 1, "", "TransformerSelfAttentionLayer"], [232, 1, 1, "", "VisionTransformer"], [253, 0, 1, "", "prepare_layer_dropout"]], "torchtune.modules.FeedForward": [[220, 4, 1, "", "forward"]], "torchtune.modules.Fp32LayerNorm": [[221, 4, 1, "", "forward"]], "torchtune.modules.KVCache": [[222, 4, 1, "", "reset"], [222, 4, 1, "", "update"]], "torchtune.modules.LayerDropout": [[223, 4, 1, "", "forward"]], "torchtune.modules.MultiHeadAttention": [[224, 4, 1, "", "forward"], [224, 4, 1, "", "reset_cache"], [224, 4, 1, "", "setup_cache"]], "torchtune.modules.RMSNorm": [[225, 4, 1, "", "forward"]], "torchtune.modules.RotaryPositionalEmbeddings": [[226, 4, 1, "", "forward"]], 
"torchtune.modules.TanhGate": [[227, 4, 1, "", "forward"]], "torchtune.modules.TransformerCrossAttentionLayer": [[229, 4, 1, "", "caches_are_enabled"], [229, 4, 1, "", "caches_are_setup"], [229, 4, 1, "", "forward"], [229, 4, 1, "", "reset_cache"], [229, 4, 1, "", "setup_caches"]], "torchtune.modules.TransformerDecoder": [[230, 4, 1, "", "caches_are_enabled"], [230, 4, 1, "", "caches_are_setup"], [230, 4, 1, "", "chunked_output"], [230, 4, 1, "", "forward"], [230, 4, 1, "", "reset_caches"], [230, 4, 1, "", "set_num_output_chunks"], [230, 4, 1, "", "setup_caches"]], "torchtune.modules.TransformerSelfAttentionLayer": [[231, 4, 1, "", "caches_are_enabled"], [231, 4, 1, "", "caches_are_setup"], [231, 4, 1, "", "forward"], [231, 4, 1, "", "reset_cache"], [231, 4, 1, "", "setup_caches"]], "torchtune.modules.VisionTransformer": [[232, 4, 1, "", "forward"]], "torchtune.modules.common_utils": [[233, 0, 1, "", "delete_kv_caches"], [234, 0, 1, "", "disable_kv_cache"], [235, 0, 1, "", "local_kv_cache"], [236, 0, 1, "", "reparametrize_as_dtype_state_dict_post_hook"]], "torchtune.modules.loss": [[237, 1, 1, "", "CEWithChunkedOutputLoss"], [238, 1, 1, "", "ForwardKLLoss"], [239, 1, 1, "", "ForwardKLWithChunkedOutputLoss"]], "torchtune.modules.loss.CEWithChunkedOutputLoss": [[237, 4, 1, "", "compute_cross_entropy"], [237, 4, 1, "", "forward"]], "torchtune.modules.loss.ForwardKLLoss": [[238, 4, 1, "", "forward"]], "torchtune.modules.loss.ForwardKLWithChunkedOutputLoss": [[239, 4, 1, "", "forward"]], "torchtune.modules.model_fusion": [[240, 1, 1, "", "DeepFusionModel"], [241, 1, 1, "", "FusionEmbedding"], [242, 1, 1, "", "FusionLayer"], [243, 0, 1, "", "get_fusion_params"], [244, 0, 1, "", "register_fusion_module"]], "torchtune.modules.model_fusion.DeepFusionModel": [[240, 4, 1, "", "caches_are_enabled"], [240, 4, 1, "", "caches_are_setup"], [240, 4, 1, "", "forward"], [240, 4, 1, "", "reset_caches"], [240, 4, 1, "", "set_num_output_chunks"], [240, 4, 1, "", "setup_caches"]], "torchtune.modules.model_fusion.FusionEmbedding": [[241, 4, 1, "", "forward"], [241, 4, 1, "", "fusion_params"]], "torchtune.modules.model_fusion.FusionLayer": [[242, 4, 1, "", "caches_are_enabled"], [242, 4, 1, "", "caches_are_setup"], [242, 4, 1, "", "forward"], [242, 4, 1, "", "fusion_params"], [242, 4, 1, "", "reset_cache"], [242, 4, 1, "", "setup_caches"]], "torchtune.modules.peft": [[245, 1, 1, "", "AdapterModule"], [246, 1, 1, "", "DoRALinear"], [247, 1, 1, "", "LoRALinear"], [248, 0, 1, "", "disable_adapter"], [249, 0, 1, "", "get_adapter_params"], [250, 0, 1, "", "get_adapter_state_dict"], [251, 0, 1, "", "set_trainable_params"], [252, 0, 1, "", "validate_missing_and_unexpected_for_lora"]], "torchtune.modules.peft.AdapterModule": [[245, 4, 1, "", "adapter_params"]], "torchtune.modules.peft.DoRALinear": [[246, 4, 1, "", "adapter_params"], [246, 4, 1, "", "forward"], [246, 4, 1, "", "initialize_dora_magnitude"], [246, 4, 1, "", "to_empty"]], "torchtune.modules.peft.LoRALinear": [[247, 4, 1, "", "adapter_params"], [247, 4, 1, "", "forward"], [247, 4, 1, "", "to_empty"]], "torchtune.modules.tokenizers": [[254, 1, 1, "", "BaseTokenizer"], [255, 1, 1, "", "ModelTokenizer"], [256, 1, 1, "", "SentencePieceBaseTokenizer"], [257, 1, 1, "", "TikTokenBaseTokenizer"], [258, 0, 1, "", "parse_hf_tokenizer_json"], [259, 0, 1, "", "tokenize_messages_no_special_tokens"]], "torchtune.modules.tokenizers.BaseTokenizer": [[254, 4, 1, "", "decode"], [254, 4, 1, "", "encode"]], "torchtune.modules.tokenizers.ModelTokenizer": [[255, 4, 1, "", 
"tokenize_messages"]], "torchtune.modules.tokenizers.SentencePieceBaseTokenizer": [[256, 4, 1, "", "decode"], [256, 4, 1, "", "encode"]], "torchtune.modules.tokenizers.TikTokenBaseTokenizer": [[257, 4, 1, "", "decode"], [257, 4, 1, "", "encode"]], "torchtune.modules.transforms": [[260, 1, 1, "", "Transform"], [261, 1, 1, "", "VisionCrossAttentionMask"]], "torchtune.rlhf": [[262, 0, 1, "", "estimate_advantages"], [263, 0, 1, "", "get_rewards_ppo"], [268, 0, 1, "", "truncate_sequence_at_first_stop_token"]], "torchtune.rlhf.loss": [[264, 1, 1, "", "DPOLoss"], [265, 1, 1, "", "PPOLoss"], [266, 1, 1, "", "RSOLoss"], [267, 2, 1, "", "SimPOLoss"]], "torchtune.rlhf.loss.DPOLoss": [[264, 4, 1, "", "forward"]], "torchtune.rlhf.loss.PPOLoss": [[265, 4, 1, "", "forward"]], "torchtune.rlhf.loss.RSOLoss": [[266, 4, 1, "", "forward"]], "torchtune.training": [[269, 1, 1, "", "FormattedCheckpointFiles"], [270, 1, 1, "", "FullModelHFCheckpointer"], [271, 1, 1, "", "FullModelMetaCheckpointer"], [272, 1, 1, "", "FullModelTorchTuneCheckpointer"], [273, 1, 1, "", "ModelType"], [274, 1, 1, "", "OptimizerInBackwardWrapper"], [275, 0, 1, "", "apply_selective_activation_checkpointing"], [276, 0, 1, "", "create_optim_in_bwd_wrapper"], [277, 0, 1, "", "gather_cpu_state_dict"], [278, 0, 1, "", "get_cosine_schedule_with_warmup"], [279, 0, 1, "", "get_dtype"], [280, 0, 1, "", "get_lr"], [281, 0, 1, "", "get_memory_stats"], [282, 0, 1, "", "get_quantizer_mode"], [283, 0, 1, "", "get_unmasked_sequence_lengths"], [284, 0, 1, "", "init_distributed"], [285, 0, 1, "", "is_distributed"], [286, 0, 1, "", "log_memory_stats"], [292, 0, 1, "", "register_optim_in_bwd_hooks"], [293, 0, 1, "", "set_activation_checkpointing"], [294, 0, 1, "", "set_default_dtype"], [295, 0, 1, "", "set_seed"], [296, 0, 1, "", "setup_torch_profiler"], [297, 0, 1, "", "update_state_dict_for_classifier"], [298, 0, 1, "", "validate_expected_param_dtype"]], "torchtune.training.FormattedCheckpointFiles": [[269, 4, 1, "", "build_checkpoint_filenames"]], "torchtune.training.FullModelHFCheckpointer": [[270, 4, 1, "", "load_checkpoint"], [270, 4, 1, "", "save_checkpoint"]], "torchtune.training.FullModelMetaCheckpointer": [[271, 4, 1, "", "load_checkpoint"], [271, 4, 1, "", "save_checkpoint"]], "torchtune.training.FullModelTorchTuneCheckpointer": [[272, 4, 1, "", "load_checkpoint"], [272, 4, 1, "", "save_checkpoint"]], "torchtune.training.OptimizerInBackwardWrapper": [[274, 4, 1, "", "get_last_lr"], [274, 4, 1, "", "get_optim_key"], [274, 4, 1, "", "load_state_dict"], [274, 4, 1, "", "set_lr_scheduler"], [274, 4, 1, "", "state_dict"], [274, 4, 1, "", "step_lr_scheduler"]], "torchtune.training.metric_logging": [[287, 1, 1, "", "CometLogger"], [288, 1, 1, "", "DiskLogger"], [289, 1, 1, "", "StdoutLogger"], [290, 1, 1, "", "TensorBoardLogger"], [291, 1, 1, "", "WandBLogger"]], "torchtune.training.metric_logging.CometLogger": [[287, 4, 1, "", "close"], [287, 4, 1, "", "log"], [287, 4, 1, "", "log_config"], [287, 4, 1, "", "log_dict"]], "torchtune.training.metric_logging.DiskLogger": [[288, 4, 1, "", "close"], [288, 4, 1, "", "log"], [288, 4, 1, "", "log_dict"]], "torchtune.training.metric_logging.StdoutLogger": [[289, 4, 1, "", "close"], [289, 4, 1, "", "log"], [289, 4, 1, "", "log_dict"]], "torchtune.training.metric_logging.TensorBoardLogger": [[290, 4, 1, "", "close"], [290, 4, 1, "", "log"], [290, 4, 1, "", "log_dict"]], "torchtune.training.metric_logging.WandBLogger": [[291, 4, 1, "", "close"], [291, 4, 1, "", "log"], [291, 4, 1, "", "log_config"], [291, 4, 1, 
"", "log_dict"]], "torchtune.utils": [[299, 0, 1, "", "batch_to_device"], [300, 0, 1, "", "get_device"], [301, 0, 1, "", "get_logger"], [302, 0, 1, "", "get_world_size_and_rank"], [303, 0, 1, "", "torch_version_ge"]]}, "objtypes": {"0": "py:function", "1": "py:class", "2": "py:data", "3": "py:property", "4": "py:method"}, "objnames": {"0": ["py", "function", "Python function"], "1": ["py", "class", "Python class"], "2": ["py", "data", "Python data"], "3": ["py", "property", "Python property"], "4": ["py", "method", "Python method"]}, "titleterms": {"torchtun": [0, 1, 2, 3, 4, 5, 6, 7, 8, 10, 22, 34, 40, 41, 43, 267, 306, 308, 314, 316, 318, 319, 320, 322, 323], "config": [0, 10, 24, 25, 314, 317], "data": [1, 11, 34, 40, 41, 43, 315], "text": [1, 2, 14, 16, 20, 318], "templat": [1, 9, 12, 14, 19, 21, 315], "type": 1, "messag": [1, 13, 14, 36], "transform": [1, 5, 13, 14, 15, 260], "collat": 1, "helper": 1, "function": 1, "dataset": [2, 9, 11, 12, 16, 18, 20, 315], "imag": [2, 14, 16], "gener": [2, 3, 74, 316, 318], "builder": 2, "class": [2, 19, 25], "model": [4, 5, 15, 21, 26, 314, 316, 317, 318, 319, 320, 321, 322], "llama3": [4, 128, 315, 318, 319, 322], "3": 4, "2": [4, 319], "vision": [4, 5], "1": [4, 319], "llama2": [4, 113, 315, 320, 323], "code": 4, "llama": 4, "qwen": 4, "5": 4, "phi": 4, "mistral": [4, 173], "gemma": [4, 92], "gemma2": [4, 101], "clip": 4, "modul": 5, "compon": [5, 10, 24, 321], "build": [5, 307, 323], "block": 5, "loss": [5, 267], "base": [5, 21], "token": [5, 14, 21, 315], "util": [5, 8], "peft": [5, 321], "fusion": 5, "rlhf": [6, 267], "train": [7, 311, 317], "checkpoint": [7, 22, 26, 321], "reduc": 7, "precis": [7, 321], "distribut": [7, 311], "memori": [7, 320, 321, 323], "manag": 7, "schedul": 7, "metric": [7, 23, 26], "log": [7, 23, 26], "perform": [7, 320], "profil": 7, "miscellan": [7, 8], "chat": [9, 315], "exampl": [9, 12, 13, 15, 16, 18, 20], "format": [9, 12, 14, 16, 18, 20, 22], "load": [9, 12, 16, 18, 20, 21], "from": [9, 12, 16, 18, 20, 21, 22, 315, 323], "hug": [9, 12, 16, 18, 20, 21, 316], "face": [9, 12, 16, 18, 20, 21, 316], "local": [9, 12, 16, 18, 20], "remot": [9, 12, 16], "specifi": 9, "convers": 9, "style": 9, "sharegpt": 9, "openai": 9, "renam": [9, 12], "column": [9, 12], "built": [9, 12, 16, 18, 19, 20, 314], "custom": [10, 13, 19, 315], "recip": [10, 25, 312, 314, 317, 319, 320, 322], "set": [10, 21], "up": 10, "your": [10, 24, 25, 316, 317], "project": 10, "launch": 10, "overview": [11, 22, 308, 312, 321], "pipelin": 11, "instruct": [12, 307, 318], "configur": [13, 24], "creat": [14, 15], "prompt": [14, 19, 21, 315], "access": [14, 318], "content": 14, "multimod": [15, 16], "us": [15, 19, 24, 25, 315, 316, 319, 323], "interleav": 16, "sampl": [17, 78], "pack": 17, "prefer": [18, 309], "defin": 19, "via": [19, 307, 318], "dotpath": 19, "string": 19, "dictionari": 19, "prompttempl": [19, 38], "complet": 20, "json": 20, "txt": 20, "download": [21, 314, 317], "file": 21, "max": 21, "sequenc": 21, "length": 21, "special": [21, 315], "handl": 22, "differ": 22, "hfcheckpoint": 22, "metacheckpoint": 22, "torchtunecheckpoint": 22, "output": [22, 316], "intermedi": 22, "vs": 22, "final": 22, "resum": 22, "full": 22, "finetun": [22, 310, 312, 316, 320, 322, 323], "lora": [22, 310, 320, 321, 323], "put": [22, 323], "thi": 22, "all": [22, 24, 323], "togeth": [22, 323], "comet": 23, "logger": [23, 26], "about": 24, "where": 24, "do": 24, "paramet": [24, 321], "live": 24, "write": 24, "instanti": [24, 27], "referenc": 24, "other": 24, "field": 24, 
"interpol": 24, "valid": [24, 30, 314], "best": 24, "practic": 24, "airtight": 24, "public": 24, "api": 24, "onli": 24, "command": 24, "line": 24, "overrid": 24, "remov": 24, "what": [25, 308, 319, 320, 322, 323], "ar": 25, "script": 25, "run": [25, 314, 316], "cli": [25, 314], "pars": [25, 29], "weight": [26, 321], "bias": 26, "w": 26, "b": 26, "log_config": 28, "alpacatomessag": 31, "chatmltempl": 32, "chosenrejectedtomessag": 33, "grammarerrorcorrectiontempl": 34, "inputoutputtomessag": 35, "openaitomessag": 37, "prompttemplateinterfac": 39, "questionanswertempl": 40, "role": 41, "sharegpttomessag": 42, "summarizetempl": 43, "format_content_with_imag": 44, "left_pad_sequ": 45, "load_imag": 46, "padded_col": 47, "padded_collate_dpo": 48, "padded_collate_sft": 49, "padded_collate_tiled_images_and_mask": 50, "truncat": 51, "validate_messag": 52, "concatdataset": 53, "packeddataset": 54, "preferencedataset": 55, "sftdataset": 56, "textcompletiondataset": 57, "alpaca_cleaned_dataset": 58, "alpaca_dataset": 59, "chat_dataset": 60, "cnn_dailymail_articles_dataset": 61, "grammar_dataset": 62, "hh_rlhf_helpful_dataset": 63, "instruct_dataset": 64, "llava_instruct_dataset": 65, "the_cauldron_dataset": 66, "vqa_dataset": 67, "preference_dataset": 68, "samsum_dataset": 69, "slimorca_dataset": 70, "stack_exchange_paired_dataset": 71, "text_completion_dataset": 72, "wikitext_dataset": 73, "generate_next_token": 75, "get_causal_mask_from_padding_mask": 76, "get_position_ids_from_padding_mask": 77, "tilepositionalembed": 79, "tiledtokenpositionalembed": 80, "tokenpositionalembed": 81, "clip_vision_encod": 82, "code_llama2_13b": 83, "code_llama2_70b": 84, "code_llama2_7b": 85, "lora_code_llama2_13b": 86, "lora_code_llama2_70b": 87, "lora_code_llama2_7b": 88, "qlora_code_llama2_13b": 89, "qlora_code_llama2_70b": 90, "qlora_code_llama2_7b": 91, "gemma_2b": 93, "gemma_7b": 94, "gemma_token": 95, "lora_gemma": 96, "lora_gemma_2b": 97, "lora_gemma_7b": 98, "qlora_gemma_2b": 99, "qlora_gemma_7b": 100, "gemma2_27b": 102, "gemma2_2b": 103, "gemma2_9b": 104, "lora_gemma2": 105, "lora_gemma2_27b": 106, "lora_gemma2_2b": 107, "lora_gemma2_9b": 108, "qlora_gemma2_27b": 109, "qlora_gemma2_2b": 110, "qlora_gemma2_9b": 111, "llama2chattempl": 112, "llama2_13b": 114, "llama2_70b": 115, "llama2_7b": 116, "llama2_reward_7b": 117, "llama2_token": 118, "lora_llama2": 119, "lora_llama2_13b": 120, "lora_llama2_70b": 121, "lora_llama2_7b": 122, "lora_llama2_reward_7b": 123, "qlora_llama2_13b": 124, "qlora_llama2_70b": 125, "qlora_llama2_7b": 126, "qlora_llama2_reward_7b": 127, "llama3_70b": 129, "llama3_8b": 130, "llama3_token": 131, "lora_llama3": 132, "lora_llama3_70b": 133, "lora_llama3_8b": 134, "qlora_llama3_70b": 135, "qlora_llama3_8b": 136, "llama3_1": 137, "llama3_1_405b": 138, "llama3_1_70b": 139, "llama3_1_8b": 140, "lora_llama3_1": 141, "lora_llama3_1_405b": 142, "lora_llama3_1_70b": 143, "lora_llama3_1_8b": 144, "qlora_llama3_1_405b": 145, "qlora_llama3_1_70b": 146, "qlora_llama3_1_8b": 147, "llama3_2_1b": 148, "llama3_2_3b": 149, "lora_llama3_2_1b": 150, "lora_llama3_2_3b": 151, "qlora_llama3_2_1b": 152, "qlora_llama3_2_3b": 153, "llama3visionencod": 154, "llama3visionprojectionhead": 155, "llama3visiontransform": 156, "llama3_2_vision_11b": 157, "llama3_2_vision_decod": 158, "llama3_2_vision_encod": 159, "llama3_2_vision_transform": 160, "lora_llama3_2_vision_11b": 161, "lora_llama3_2_vision_decod": 162, "lora_llama3_2_vision_encod": 163, "qlora_llama3_2_vision_11b": 164, "llama3_3_70b": 165, 
"lora_llama3_3_70b": 166, "qlora_llama3_3_70b": 167, "mistralchattempl": 168, "lora_mistr": 169, "lora_mistral_7b": 170, "lora_mistral_classifi": 171, "lora_mistral_reward_7b": 172, "mistral_7b": 174, "mistral_classifi": 175, "mistral_reward_7b": 176, "mistral_token": 177, "qlora_mistral_7b": 178, "qlora_mistral_reward_7b": 179, "lora_phi3": 180, "lora_phi3_mini": 181, "phi3": 182, "phi3_mini": 183, "phi3_mini_token": 184, "qlora_phi3_mini": 185, "lora_qwen2": 186, "lora_qwen2_0_5b": 187, "lora_qwen2_1_5b": 188, "lora_qwen2_7b": 189, "qwen2": [190, 319], "qwen2_0_5b": 191, "qwen2_1_5b": 192, "qwen2_7b": 193, "qwen2_token": 194, "lora_qwen2_5_0_5b": 195, "lora_qwen2_5_14b_bas": 196, "lora_qwen2_5_14b_instruct": 197, "lora_qwen2_5_1_5b_bas": 198, "lora_qwen2_5_1_5b_instruct": 199, "lora_qwen2_5_32b_bas": 200, "lora_qwen2_5_32b_instruct": 201, "lora_qwen2_5_3b": 202, "lora_qwen2_5_72b_bas": 203, "lora_qwen2_5_72b_instruct": 204, "lora_qwen2_5_7b_bas": 205, "lora_qwen2_5_7b_instruct": 206, "qwen2_5_0_5b": 207, "qwen2_5_14b_bas": 208, "qwen2_5_14b_instruct": 209, "qwen2_5_1_5b_bas": 210, "qwen2_5_1_5b_instruct": 211, "qwen2_5_32b_bas": 212, "qwen2_5_32b_instruct": 213, "qwen2_5_3b": 214, "qwen2_5_72b_bas": 215, "qwen2_5_72b_instruct": 216, "qwen2_5_7b_bas": 217, "qwen2_5_7b_instruct": 218, "qwen2_5_token": 219, "feedforward": 220, "fp32layernorm": 221, "kvcach": 222, "layerdropout": 223, "multiheadattent": 224, "rmsnorm": 225, "rotarypositionalembed": 226, "tanhgat": 227, "tiedlinear": 228, "transformercrossattentionlay": 229, "transformerdecod": 230, "transformerselfattentionlay": 231, "visiontransform": 232, "delete_kv_cach": 233, "disable_kv_cach": 234, "local_kv_cach": 235, "reparametrize_as_dtype_state_dict_post_hook": 236, "cewithchunkedoutputloss": 237, "forwardklloss": 238, "forwardklwithchunkedoutputloss": 239, "deepfusionmodel": 240, "fusionembed": 241, "fusionlay": 242, "get_fusion_param": 243, "register_fusion_modul": 244, "adaptermodul": 245, "doralinear": 246, "loralinear": 247, "disable_adapt": 248, "get_adapter_param": 249, "get_adapter_state_dict": 250, "set_trainable_param": 251, "validate_missing_and_unexpected_for_lora": 252, "prepare_layer_dropout": 253, "basetoken": 254, "modeltoken": 255, "sentencepiecebasetoken": 256, "tiktokenbasetoken": 257, "parse_hf_tokenizer_json": 258, "tokenize_messages_no_special_token": 259, "visioncrossattentionmask": 261, "estimate_advantag": 262, "get_rewards_ppo": 263, "dpoloss": 264, "ppoloss": 265, "rsoloss": 266, "simpoloss": 267, "truncate_sequence_at_first_stop_token": 268, "formattedcheckpointfil": 269, "fullmodelhfcheckpoint": 270, "fullmodelmetacheckpoint": 271, "fullmodeltorchtunecheckpoint": 272, "modeltyp": 273, "optimizerinbackwardwrapp": 274, "apply_selective_activation_checkpoint": 275, "create_optim_in_bwd_wrapp": 276, "gather_cpu_state_dict": 277, "get_cosine_schedule_with_warmup": 278, "get_dtyp": 279, "get_lr": 280, "get_memory_stat": 281, "get_quantizer_mod": 282, "get_unmasked_sequence_length": 283, "init_distribut": 284, "is_distribut": 285, "log_memory_stat": 286, "cometlogg": 287, "disklogg": 288, "stdoutlogg": 289, "tensorboardlogg": 290, "wandblogg": 291, "register_optim_in_bwd_hook": 292, "set_activation_checkpoint": 293, "set_default_dtyp": 294, "set_se": 295, "setup_torch_profil": 296, "update_state_dict_for_classifi": 297, "validate_expected_param_dtyp": 298, "batch_to_devic": 299, "get_devic": 300, "get_logg": 301, "get_world_size_and_rank": 302, "torch_version_g": 303, "comput": [305, 313], "time": [305, 313], 
"welcom": 306, "document": 306, "get": [306, 314, 318], "start": [306, 314], "tutori": 306, "instal": 307, "pre": 307, "requisit": 307, "pypi": 307, "git": 307, "clone": 307, "nightli": 307, "kei": 308, "concept": 308, "design": 308, "principl": 308, "direct": 309, "optim": [309, 321], "singl": 310, "devic": [310, 322], "quantiz": [311, 316, 318, 321, 322], "awar": 311, "qat": [311, 322], "list": 314, "copi": 314, "fine": [315, 317, 318, 319, 320, 321, 322, 323], "tune": [315, 317, 318, 319, 320, 321, 322, 323], "chang": 315, "when": 315, "should": 315, "i": 315, "end": 316, "workflow": 316, "evalu": [316, 318, 322], "eval": [316, 318], "eleutherai": [316, 318], "s": [316, 318], "har": [316, 318], "some": 316, "introduc": 316, "wild": 316, "from_pretrain": 316, "vllm": 316, "upload": 316, "hub": 316, "first": 317, "llm": 317, "select": 317, "modifi": 317, "next": 317, "step": [317, 321], "meta": 318, "8b": [318, 319], "our": 318, "faster": 318, "distil": 319, "1b": 319, "knowledg": 319, "how": [319, 320], "doe": [319, 320], "work": [319, 320], "kd": 319, "ablat": 319, "studi": 319, "teacher": 319, "student": 319, "hyperparamet": 319, "learn": 319, "rate": 319, "ratio": 319, "5b": 319, "0": 319, "appli": [320, 322], "trade": 320, "off": 320, "activ": 321, "offload": 321, "gradient": 321, "accumul": 321, "lower": [321, 322], "fuse": 321, "backward": 321, "pass": 321, "state": 321, "cpu": 321, "effici": 321, "low": 321, "rank": 321, "adapt": 321, "qlora": [321, 323], "decompos": 321, "dora": 321, "option": 322, "save": 323, "deep": 323, "dive": 323}, "envversion": {"sphinx.domains.c": 2, "sphinx.domains.changeset": 1, "sphinx.domains.citation": 1, "sphinx.domains.cpp": 6, "sphinx.domains.index": 1, "sphinx.domains.javascript": 2, "sphinx.domains.math": 2, "sphinx.domains.python": 3, "sphinx.domains.rst": 2, "sphinx.domains.std": 2, "sphinx.ext.intersphinx": 1, "sphinx.ext.todo": 2, "sphinx.ext.viewcode": 1, "sphinx": 56}})
\ No newline at end of file
diff --git a/main/tutorials/e2e_flow.html b/main/tutorials/e2e_flow.html
index 4d927efee..94f1f7508 100644
--- a/main/tutorials/e2e_flow.html
+++ b/main/tutorials/e2e_flow.html
@@ -437,7 +437,7 @@
In this tutorial, we’ll walk through an end-to-end example of how you can fine-tune,
evaluate, optionally quantize and then run generation with your favorite LLM using
torchtune. We’ll also go over how you can use some popular tools and libraries
-from the community seemlessly with torchtune.
+from the community seamlessly with torchtune.
-
-Overview
-Fine-tuning an LLM is usually only one step in a larger workflow. An example workflow that you
-might have can look something like this:
-
-Download a popular model from HF Hub
-Fine-tune the model using a relevant fine-tuning technique. The exact technique used
-will depend on factors such as the model, amount and nature of training data, your hardware
-setup and the end task for which the model will be used
-Evaluate the model on some benchmarks to validate model quality
-Run some generations to make sure the model output looks reasonable
-Quantize the model for efficient inference
-[Optional] Export the model for specific environments such as inference on a mobile phone
-
-In this tutorial, we’ll cover how you can use torchtune for all of the above, leveraging
-integrations with popular tools and libraries from the ecosystem.
-We’ll use the Llama-3.2-3B-Instruct model for this tutorial. You can find a complete set of models supported
-by torchtune here.
-
-
-
-Download Llama-3.2-3B-Instruct
-For more information on checkpoint formats and how these are handled in torchtune, take a look at
-this tutorial on checkpoints.
-To download the HF format Llama-3.2-3B-Instruct, we’ll use the tune CLI.
-tune download meta-llama/Llama-3.2-3B-Instruct \
- --output-dir /tmp/Llama-3.2-3B-Instruct \
- --ignore-patterns "original/consolidated.00.pth"
+
+Finetune your model
+First, let’s download a model using the tune CLI. The following command will download the Llama3.2 3B Instruct
+model from the Hugging Face Hub and save it to the local filesystem. Hugging Face uploaded the original
+weights (consolidated.00.pth) and the weights compatible with the from_pretrained() API (*.safetensors).
+We don’t need both, so we’ll ignore the original weights when downloading.
+$ tune download meta-llama/Llama-3.2-3B-Instruct --ignore-patterns "original/consolidated.00.pth"
+Successfully downloaded model repo and wrote to the following locations:
+/tmp/Llama-3.2-3B-Instruct/.cache
+/tmp/Llama-3.2-3B-Instruct/.gitattributes
+/tmp/Llama-3.2-3B-Instruct/LICENSE.txt
+/tmp/Llama-3.2-3B-Instruct/README.md
+/tmp/Llama-3.2-3B-Instruct/USE_POLICY.md
+/tmp/Llama-3.2-3B-Instruct/config.json
+/tmp/Llama-3.2-3B-Instruct/generation_config.json
+/tmp/Llama-3.2-3B-Instruct/model-00001-of-00002.safetensors
+...
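+As a quick sanity check (a minimal sketch, not a required step; it assumes you have the transformers package installed), you can confirm the downloaded *.safetensors weights load with the from_pretrained() API mentioned above:
+# Illustrative only: verify the downloaded checkpoint loads with Hugging Face transformers.
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+checkpoint_dir = "/tmp/Llama-3.2-3B-Instruct"  # directory written by `tune download` above
+tokenizer = AutoTokenizer.from_pretrained(checkpoint_dir)
+model = AutoModelForCausalLM.from_pretrained(checkpoint_dir)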
-Make a note of <checkpoint_dir>, we’ll use this many times in this tutorial.
-
-
+
+
Note
+For a list of all other models you can finetune out-of-the-box with torchtune, check out
+our models page.
-
-
-Finetune the model using LoRA
For this tutorial, we’ll fine-tune the model using LoRA. LoRA is a parameter efficient fine-tuning
technique which is especially helpful when you don’t have a lot of GPU memory to play with. LoRA
freezes the base LLM and adds a very small percentage of learnable parameters. This helps keep
@@ -515,147 +500,134 @@
Finetune the model using LoRA.
-We’ll fine-tune using our
-single device LoRA recipe
-and use the standard settings from the
-default config.
-This will fine-tune our model using a batch_size=2 and dtype=bfloat16. With these settings the model
-should have a peak memory usage of ~16GB and total training time of around two hours for each epoch.
-We’ll need to make some changes to the config to make sure our recipe can access the
-right checkpoints.
Let’s look for the right config for this use case by using the tune CLI.
-tune ls
-
-RECIPE CONFIG
-full_finetune_single_device llama2/7B_full_low_memory
- code_llama2/7B_full_low_memory
- llama3/8B_full_single_device
- llama3_1/8B_full_single_device
- llama3_2/1B_full_single_device
- llama3_2/3B_full_single_device
- mistral/7B_full_low_memory
- phi3/mini_full_low_memory
- qwen2/7B_full_single_device
- ...
-
-
-full_finetune_distributed llama2/7B_full
- llama2/13B_full
- llama3/8B_full
- llama3_1/8B_full
- llama3_2/1B_full
- llama3_2/3B_full
- mistral/7B_full
- gemma2/9B_full
- gemma2/27B_full
- phi3/mini_full
- qwen2/7B_full
- ...
-
-lora_finetune_single_device llama2/7B_lora_single_device
- llama2/7B_qlora_single_device
- llama3/8B_lora_single_device
+$ tune ls
+RECIPE CONFIG
+full_finetune_single_device llama2/7B_full_low_memory
+ code_llama2/7B_full_low_memory
+ llama3/8B_full_single_device
+ llama3_1/8B_full_single_device
+ llama3_2/1B_full_single_device
+ llama3_2/3B_full_single_device
+ mistral/7B_full_low_memory
+ phi3/mini_full_low_memory
+ qwen2/7B_full_single_device
+ ...
+
+
+full_finetune_distributed llama2/7B_full
+ llama2/13B_full
+ llama3/8B_full
+ llama3_1/8B_full
+ llama3_2/1B_full
+ llama3_2/3B_full
+ mistral/7B_full
+ gemma2/9B_full
+ gemma2/27B_full
+ phi3/mini_full
+ qwen2/7B_full
+ ...
+
+lora_finetune_single_device llama2/7B_lora_single_device
+ llama2/7B_qlora_single_device
+ llama3/8B_lora_single_device
...
-For this tutorial we’ll use the llama3_2/3B_lora_single_device config.
-The config already points to the HF Checkpointer and the right checkpoint files.
-All we need to do is update the checkpoint directory for both the model and the
-tokenizer. Let’s do this using the overrides in the tune CLI while starting training!
-tune run lora_finetune_single_device --config llama3_2/3B_lora_single_device
+We’ll fine-tune using our
+single device LoRA recipe
+and use the standard settings from the
+default config.
+This will fine-tune our model using a batch_size=4 and dtype=bfloat16. With these settings the model
+should have a peak memory usage of ~16GB and total training time of around 2-3 hours for each epoch.
+$ tune run lora_finetune_single_device --config llama3_2/3B_lora_single_device
+Setting manual seed to local seed 3977464327. Local seed is seed + rank = 3977464327 + 0
+Hint: enable_activation_checkpointing is True, but enable_activation_offloading isn't. Enabling activation offloading should reduce memory further.
+Writing logs to /tmp/torchtune/llama3_2_3B/lora_single_device/logs/log_1734708879.txt
+Model is initialized with precision torch.bfloat16.
+Memory stats after model init:
+ GPU peak memory allocation: 6.21 GiB
+ GPU peak memory reserved: 6.27 GiB
+ GPU peak memory active: 6.21 GiB
+Tokenizer is initialized from file.
+Optimizer and loss are initialized.
+Loss is initialized.
+Dataset and Sampler are initialized.
+Learning rate scheduler is initialized.
+Profiling disabled.
+Profiler config after instantiation: {'enabled': False}
+1|3|Loss: 1.943998098373413: 0%| | 3/1617 [00:21<3:04:47, 6.87s/it]
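+Curious how small LoRA’s set of learnable parameters actually is? Here is a minimal sketch that counts the adapter parameters (it assumes torchtune’s lora_llama3_2_3b builder and get_adapter_params helper; the argument values shown are illustrative, not the recipe’s exact settings):
+# Illustrative only: build a LoRA-wrapped Llama 3.2 3B and compare adapter vs. total parameters.
+from torchtune.models.llama3_2 import lora_llama3_2_3b
+from torchtune.modules.peft import get_adapter_params
+
+model = lora_llama3_2_3b(lora_attn_modules=["q_proj", "v_proj"], lora_rank=8, lora_alpha=16)
+adapter = sum(p.numel() for p in get_adapter_params(model).values())
+total = sum(p.numel() for p in model.parameters())
+print(f"Adapter params: {adapter:,} of {total:,} ({100 * adapter / total:.4f}%)")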
-
-
-Preparing your artifacts for inference
-Congrats for getting this far! You have loaded your weights, trained your model, now it’s time to visualize
-the outputs. A simple way of doing this is by running tree -a path/to/outputdir, which should show something like the tree below.
-There are 4 types of folders:
+Congrats on training your model! Let’s take a look at the artifacts produced by torchtune. A simple way of doing this is by running tree -a path/to/outputdir, which should show something like the tree below.
+There are 3 types of folders:
recipe_state: Holds recipe_state.pt with the information necessary to restart training from the last intermediate epoch. For more information, please check our deep-dive Checkpointing in torchtune;
-logs: Defined in your config in metric_logger;
-epoch_{}: Contains your new trained model weights plus all original files of the model, except the checkpoints, making it easy for you to choose an specific epoch to run inference on or push to a model hub;
+logs: Contains all the logging output from your training run: loss, memory, exceptions, etc.
+epoch_{}: Contains your trained model weights plus model metadata. If running inference or pushing to a model hub, you should use this folder directly.
->>> tree -a /tmp/torchtune/llama3_2_3B/lora_single_device
- /tmp/torchtune/llama3_2_3B/lora_single_device
- ├── epoch_0
- │ ├── adapter_config.json
- │ ├── adapter_model.pt
- │ ├── adapter_model.safetensors
- │ ├── config.json
- │ ├── ft-model-00001-of-00002.safetensors
- │ ├── ft-model-00002-of-00002.safetensors
- │ ├── generation_config.json
- │ ├── LICENSE.txt
- │ ├── model.safetensors.index.json
- │ ├── original
- │ │ ├── orig_params.json
- │ │ ├── params.json
- │ │ └── tokenizer.model
- │ ├── original_repo_id.json
- │ ├── README.md
- │ ├── special_tokens_map.json
- │ ├── tokenizer_config.json
- │ ├── tokenizer.json
- │ └── USE_POLICY.md
- ├── epoch_1
- │ ├── adapter_config.json
- │ ├── adapter_model.pt
- │ ├── adapter_model.safetensors
- │ ├── config.json
- │ ├── ft-model-00001-of-00002.safetensors
- │ ├── ft-model-00002-of-00002.safetensors
- │ ├── generation_config.json
- │ ├── LICENSE.txt
- │ ├── model.safetensors.index.json
- │ ├── original
- │ │ ├── orig_params.json
- │ │ ├── params.json
- │ │ └── tokenizer.model
- │ ├── original_repo_id.json
- │ ├── README.md
- │ ├── special_tokens_map.json
- │ ├── tokenizer_config.json
- │ ├── tokenizer.json
- │ └── USE_POLICY.md
- ├── logs
- │ └── log_1734652101.txt
- └── recipe_state
- └── recipe_state.pt
+$ tree -a /tmp/torchtune/llama3_2_3B/lora_single_device
+/tmp/torchtune/llama3_2_3B/lora_single_device
+├── epoch_0
+│ ├── adapter_config.json
+│ ├── adapter_model.pt
+│ ├── adapter_model.safetensors
+│ ├── config.json
+│ ├── ft-model-00001-of-00002.safetensors
+│ ├── ft-model-00002-of-00002.safetensors
+│ ├── generation_config.json
+│ ├── LICENSE.txt
+│ ├── model.safetensors.index.json
+│ ├── original
+│ │ ├── orig_params.json
+│ │ ├── params.json
+│ │ └── tokenizer.model
+│ ├── original_repo_id.json
+│ ├── README.md
+│ ├── special_tokens_map.json
+│ ├── tokenizer_config.json
+│ ├── tokenizer.json
+│ └── USE_POLICY.md
+├── epoch_1
+│ ├── adapter_config.json
+│ ...
+├── logs
+│ └── log_1734652101.txt
+└── recipe_state
+ └── recipe_state.pt
Let’s understand the files:
-adapter_model.safetensors and adapter_model.pt are your LoRA trained adapter weights. We save a duplicated .pt version of it to facilitate resuming from checkpoint.
-ft-model-{}-of-{}.safetensors are your trained full model weights (not adapters). When LoRA finetuning, these are only present if we set save_adapter_weights_only=False. In that case, we merge the merged base model with trained adapters, making inference easier.
-adapter_config.json is used by Huggingface PEFT when loading an adapter (more on that later);
-model.safetensors.index.json is used by Huggingface .from_pretrained when loading the model weights (more on that later)
+adapter_model.safetensors and adapter_model.pt are your LoRA trained adapter weights. We save a duplicated .pt version to facilitate resuming from checkpoint.
+ft-model-{}-of-{}.safetensors are your trained full model weights (not adapters). When LoRA finetuning, these are only present if we set save_adapter_weights_only=False. In that case, we merge the base model with the trained adapters, making inference easier.
+adapter_config.json is used by Hugging Face PEFT when loading an adapter (more on that later);
+model.safetensors.index.json is used by Hugging Face from_pretrained() when loading the model weights (more on that later).
All other files were originally in the checkpoint_dir. They are automatically copied during training. Files over 100MiB ending in .safetensors, .pth, .pt, or .bin are ignored, keeping the copies lightweight.
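+Since adapter_config.json follows the PEFT format, you can load the trained adapter on top of the base model directly (a minimal sketch, assuming the peft and transformers packages are installed):
+# Illustrative only: attach the trained LoRA adapter to the base model via Hugging Face PEFT.
+from peft import PeftModel
+from transformers import AutoModelForCausalLM
+
+base = AutoModelForCausalLM.from_pretrained("/tmp/Llama-3.2-3B-Instruct")
+model = PeftModel.from_pretrained(base, "/tmp/torchtune/llama3_2_3B/lora_single_device/epoch_0")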
-
-
-Run Evaluation using EleutherAI’s Eval Harness
-We’ve fine-tuned a model. But how well does this model really do? Let’s run some Evaluations!
+
+Evaluate your model
+We’ve fine-tuned a model. But how well does this model really do? Let’s find out through structured evaluation and by playing around with it.
+
+Run evals using EleutherAI’s Eval Harness
torchtune integrates with
EleutherAI’s evaluation harness.
An example of this is available through the
-eleuther_eval recipe. In this tutorial, we’re going to directly use this recipe by
-modifying its associated config eleuther_evaluation.yaml.
+eleuther_eval recipe. In this tutorial, we’re going to directly use this recipe by
+modifying its associated config eleuther_evaluation.yaml.
Note
-For this section of the tutorial, you should first run pip install lm_eval==0.4.*
+For this section of the tutorial, you should first run pip install lm_eval>=0.4.5
to install the EleutherAI evaluation harness.
Since we plan to update all of the checkpoint files to point to our fine-tuned checkpoints,
let’s first copy over the config to our local working directory so we can make changes.
-tune cp eleuther_evaluation ./custom_eval_config.yaml \
+$ tune cp eleuther_evaluation ./custom_eval_config.yaml
+Copied file to custom_eval_config.yaml
-Then, in your config, you only need to replace two fields: output_dir and checkpoint_files. Notice
-that we are using the merged weights, and not the LoRA adapters.
+Notice that we are using the merged weights, and not the LoRA adapters.
# TODO: update to your desired epoch
output_dir: /tmp/torchtune/llama3_2_3B/lora_single_device/epoch_0
@@ -699,18 +671,15 @@ Preparing your artifacts for inference task from the harness.
This task measures a model’s propensity to be truthful when answering questions and
measures the model’s zero-shot accuracy on a question followed by one or more true
-responses and one or more false responses
-tune run eleuther_eval --config ./custom_eval_config.yaml
-
-[evaluator.py:324] Running loglikelihood requests
+responses and one or more false responses.
+$ tune run eleuther_eval --config ./custom_eval_config.yaml
+[evaluator.py:324] Running loglikelihood requests
+...
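+The same evaluation can also be scripted against the harness directly (a hedged sketch, assuming lm_eval>=0.4 and its HFLM wrapper; argument names may vary across harness versions):
+# Illustrative only: run a truthfulqa task programmatically with EleutherAI's harness.
+import lm_eval
+from lm_eval.models.huggingface import HFLM
+
+lm = HFLM(pretrained="/tmp/torchtune/llama3_2_3B/lora_single_device/epoch_0")
+results = lm_eval.simple_evaluate(model=lm, tasks=["truthfulqa_mc2"])
+print(results["results"])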
-
-
-Generation
+
+Generate some output
We’ve run some evaluations and the model seems to be doing well. But does it really
generate meaningful text for the prompts you care about? Let’s find out!
For this, we’ll use the
@@ -718,7 +687,8 @@
Generation.
Let’s first copy over the config to our local working directory so we can make changes.
-tune cp generation ./custom_generation_config.yaml
+$ tune cp generation ./custom_generation_config.yaml
+Copied file to custom_generation_config.yaml
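+If you want to poke at the model outside of the recipe first (a minimal sketch, assuming the transformers package; torchtune’s generation recipe below remains the primary path), the merged epoch_0 weights can be used directly:
+# Illustrative only: quick generation from the merged fine-tuned weights.
+from transformers import AutoModelForCausalLM, AutoTokenizer
+
+ckpt = "/tmp/torchtune/llama3_2_3B/lora_single_device/epoch_0"
+tokenizer = AutoTokenizer.from_pretrained(ckpt)
+model = AutoModelForCausalLM.from_pretrained(ckpt)
+inputs = tokenizer("Tell me a joke.", return_tensors="pt")
+output = model.generate(**inputs, max_new_tokens=64)
+print(tokenizer.decode(output[0], skip_special_tokens=True))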
@@ -772,24 +742,17 @@ Generation