For llama demo, instructions for trying everything with python first on colab notebook #7241

mergennachin · 2024-12-09T14:16:33Z

🚀 The feature, motivation and pitch

Can we try the full end-to-end flow in Python (tokenizer and sampler included), not only the decoder path?

During the hackathon, we received this concrete piece of user feedback: "Asking for simple docs or PT notebook that can easily run the .pte model from python (just for testing, so no need to compile the cmake version)"
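To make the request concrete, here is a rough sketch of the loop such a notebook would need to cover: tokenize, run one decoder step, sample, repeat. Everything in it is a stand-in and an assumption rather than ExecuTorch API: `toy_encode`, `toy_decode`, and `toy_decoder_step` are hypothetical names, the "tokenizer" is just UTF-8 bytes, and the "decoder" is a toy function. In a real notebook, the decoder step would presumably be a call into the loaded .pte program through the Python bindings, and the tokenizer would be the model's own.

```python
import math
import random
from typing import Callable, List, Sequence

# Hypothetical stand-ins, not ExecuTorch APIs. A real notebook would swap in
# the model's tokenizer and a call into the loaded .pte program.


def toy_encode(text: str) -> List[int]:
    """Byte-level stand-in tokenizer: one token id per UTF-8 byte."""
    return list(text.encode("utf-8"))


def toy_decode(ids: Sequence[int]) -> str:
    """Inverse of toy_encode; ids must be in the range 0..255."""
    return bytes(ids).decode("utf-8", errors="replace")


def toy_decoder_step(tokens: Sequence[int]) -> List[float]:
    """Stand-in decoder: logits over 256 ids, favoring (last token + 1)."""
    logits = [0.0] * 256
    logits[(tokens[-1] + 1) % 256] = 10.0
    return logits


def sample(logits: Sequence[float], temperature: float, rng: random.Random) -> int:
    """Greedy argmax when temperature <= 0, otherwise softmax sampling."""
    if temperature <= 0:
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(x - peak) for x in scaled]
    return rng.choices(range(len(logits)), weights=weights, k=1)[0]


def generate(
    prompt: str,
    decoder_step: Callable[[Sequence[int]], List[float]],
    max_new_tokens: int = 8,
    temperature: float = 0.0,
    seed: int = 0,
) -> str:
    """Tokenize a non-empty prompt, then alternate decoder step and sampling."""
    rng = random.Random(seed)
    tokens = toy_encode(prompt)
    for _ in range(max_new_tokens):
        logits = decoder_step(tokens)
        tokens.append(sample(logits, temperature, rng))
    return toy_decode(tokens)


if __name__ == "__main__":
    print(generate("a", toy_decoder_step, max_new_tokens=3))
```

Keeping the decoder behind a plain callable means the tokenizer and sampler can be exercised in a notebook before any .pte file is involved, and the real model can be dropped in later without touching the loop.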

Alternatives

No response

Additional context

No response

RFC (Optional)

No response

mergennachin added the module: llm LLM examples and apps, and the extensions/llm libraries label Dec 9, 2024

mergennachin moved this to To triage in ExecuTorch DevX improvements Dec 9, 2024

mergennachin added this to ExecuTorch DevX improvements Dec 9, 2024
