hiyouga LLaMA-Factory Discussions
- How can I hold a coherent multi-turn visual conversation with my exported fine-tuned qwen2-2b-instruct model from the local command line, i.e. give it an image and then carry on a multi-turn dialogue? [pending]
- Hello, is fine-tuning on SQL datasets supported? If so, could you provide a sample dataset_info.json configuration? Thanks! [pending]
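For the SQL dataset question above, a minimal sketch of a dataset_info.json entry using LLaMA-Factory's alpaca-style column mapping; the dataset name and file name here are hypothetical:

```json
{
  "sql_instruct": {
    "file_name": "sql_instruct.json",
    "columns": {
      "prompt": "instruction",
      "query": "input",
      "response": "output"
    }
  }
}
```

Each record in the (assumed) sql_instruct.json would then be an object like {"instruction": "natural-language question", "input": "table schema", "output": "SELECT ..."}, so SQL fine-tuning is just ordinary instruction tuning with SQL text in the output field.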
- The data for continued pre-training has a large token count and an uneven length distribution; how should this be handled? [pending]
- How should the training performance metrics in the training logs be analyzed? [pending]
- Does training qwen2.5-coder-33B consume more GPU memory than qwen2.5-33B-instruct? [wontfix]
- What is the minimum number of A100 SXM 80 GB GPUs required for LoRA SFT fine-tuning of Qwen2-VL-72B-instruct? [solved]
- If I want to call my fine-tuned model over HTTP, how should I implement that? [pending]
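On the HTTP question above: LLaMA-Factory can serve a fine-tuned model behind an OpenAI-compatible HTTP API (e.g. via `llamafactory-cli api`), after which any HTTP client can POST chat requests to it. A minimal client sketch follows; the port, endpoint path, and model name are assumptions for illustration, not values taken from this page:

```python
import json
from urllib import request

# OpenAI-style chat completion payload for the fine-tuned model.
# The model name is assumed; use whatever name your API server reports.
payload = {
    "model": "qwen2-2b-instruct",
    "messages": [{"role": "user", "content": "Hello"}],
}

def call_api(base_url="http://localhost:8000"):
    """POST the payload to the (assumed) /v1/chat/completions endpoint
    and return the decoded JSON response."""
    req = request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with request.urlopen(req) as resp:
        return json.loads(resp.read())
```

A response from an OpenAI-compatible server carries the reply under choices[0]["message"]["content"].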
- After SFT, the model's basic capabilities are lost. [pending]
- How can I set the LoRA alpha value? [solved]
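On setting the LoRA alpha value: LLaMA-Factory's training YAML accepts LoRA hyperparameters directly. A hedged fragment, with illustrative values only (required keys such as the model path and dataset are omitted here):

```yaml
finetuning_type: lora
lora_rank: 8
lora_alpha: 16      # often set to 2x lora_rank
lora_dropout: 0.05
lora_target: all
```

A larger alpha scales the LoRA update more strongly relative to the frozen weights, so rank and alpha are usually tuned together.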
- Full fine-tuning performs worse than LoRA fine-tuning. [pending]
- During full-parameter continued pre-training, is the loss curve normal, and how can it be improved? [pending]
- LoRA fine-tuning of Qwen2.5 0.5B runs on the CPU; how do I switch training to CUDA? [wontfix]
- After fine-tuning with llama3_lora_pretrain.yaml, the model's generation ability is severely degraded. [pending]
- The training-set outputs are about 18,000 tokens long and cutoff_len is set to 20,000 for fine-tuning, but results are poor; what can be done? [pending]
- How are labels constructed in version 0.9.0? [pending]