GPU memory usage depends on the hd_num parameter, which controls the input image resolution.
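As a rough illustration of why hd_num drives memory use: more high-resolution crops mean more vision tokens, and the activation and KV-cache footprint grows with the token count. The sketch below estimates this relationship. Both the tokens-per-crop constant and the "one thumbnail plus hd_num crops" layout are illustrative assumptions, not values taken from the model's source.

```python
# Illustrative sketch only: the constants below are assumptions,
# not values read from the InternLM-XComposer2.5 implementation.

TOKENS_PER_CROP = 400  # assumed vision tokens emitted per high-res crop


def estimate_vision_tokens(hd_num: int) -> int:
    """Estimate vision tokens: one global thumbnail plus up to
    hd_num high-resolution crops (assumed layout)."""
    return (1 + hd_num) * TOKENS_PER_CROP


if __name__ == "__main__":
    for hd in (4, 9, 16, 25):
        print(f"hd_num={hd:2d} -> ~{estimate_vision_tokens(hd)} vision tokens")
```

Under this model, lowering hd_num shrinks the image-token sequence roughly proportionally, which is why a smaller value reduces peak GPU memory during generation.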
LightDXY
Running inference with the example code, the 4-bit model peaks at 38 GB of GPU memory for a single image while generating a response. Is that normal? The non-quantized model version fails outright with an error.
from lmdeploy import TurbomindEngineConfig, pipeline
from lmdeploy.vl import load_image

# Load the 4-bit AWQ-quantized weights with the TurboMind backend.
engine_config = TurbomindEngineConfig(model_format='awq')
pipe = pipeline('internlm/internlm-xcomposer2d5-7b-4bit', backend_config=engine_config)

# Run a single-image query.
image = load_image('/root/workspace/InternLM-XComposer/examples/cars1.jpg')
response = pipe(('describe this image', image))
print(response.text)