Loss functions used in the InternLM-XComposer2.5 & its working #420
khyati2396
started this conversation in
General
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hello, InterLM community,
I am getting straight to the point,
I need to know what all loss functions are being used in the InternLM-XComposer2.5 and how the loss functions work for this perticular VLM.
I have already read the previous interLM-XComposer papers but could not find any explanation of the loss functions.
Below are the documents that I have gone through,
As far as I know
I also found this piece of line in the internLM-XComposer2 paper,
Does it follow a similar learning technique in internLM-XComposer2.5?
I must admit this piece of research is a gem for people who need strong VLMs.
It just needs more information related to the loss functions and the model's in-depth training.
Any and all responses are welcomed.
Thanks in advance,
Khyati
Beta Was this translation helpful? Give feedback.
All reactions