Running models of different parameter sizes on A100/A800 GPUs: confirming compute requirements #1334

Open
guoshiyin-666 opened this issue Dec 5, 2024 · 0 comments
Labels
question Further information is requested

Comments

@guoshiyin-666

Start Date

2024/12/05

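Implementation PR

Reference Issues

I currently have an A100 GPU and would like to test how efficiently the 8B, 32B, and 72B models run on it, and how many tokens each can generate per minute. I want to confirm where the practical upper limit of these models lies on this hardware.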

Summary

The model's inference capability

Basic Example
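
As a rough starting point for the question above, here is a minimal back-of-the-envelope sketch, not a benchmark. It assumes fp16/bf16 weights (2 bytes per parameter) and an 80 GB card (the A100 also ships with 40 GB), and it counts weights only, so real usage will be higher once the KV cache and activations are included. The function names are hypothetical helpers for illustration, not part of any library.

```python
import math

BYTES_PER_PARAM_FP16 = 2  # fp16/bf16 store each parameter in 2 bytes
GPU_MEMORY_GB = 80        # assumption: 80 GB card; the A100 also ships with 40 GB


def weight_memory_gb(params_billions, bytes_per_param=BYTES_PER_PARAM_FP16):
    """Weights-only memory in GB (1 GB = 1e9 bytes).

    Ignores KV cache, activations, and framework overhead.
    """
    # (params_billions * 1e9 params) * bytes_per_param / 1e9 bytes per GB
    return params_billions * bytes_per_param


def min_gpus_for_weights(params_billions, gpu_memory_gb=GPU_MEMORY_GB):
    """Lower bound on the number of GPUs needed just to hold the weights."""
    return math.ceil(weight_memory_gb(params_billions) / gpu_memory_gb)


def tokens_per_minute(generated_tokens, elapsed_seconds):
    """Convert a measured generation run into tokens per minute."""
    return generated_tokens / elapsed_seconds * 60.0


if __name__ == "__main__":
    for size in (8, 32, 72):
        print(
            f"{size}B: ~{weight_memory_gb(size)} GB of weights, "
            f"at least {min_gpus_for_weights(size)} GPU(s)"
        )
```

To get an actual tokens-per-minute figure, time a real generation run with whatever inference stack is in use and pass the number of generated tokens and the elapsed seconds to `tokens_per_minute`. Throughput depends heavily on batch size, sequence length, and quantization, so it has to be measured rather than derived from this estimate.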

Drawbacks

Unresolved questions

@guoshiyin-666 guoshiyin-666 added the question Further information is requested label Dec 5, 2024