You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Due to encode() sorting the sentences in decreasing order by length, if I pass in a large list of texts with large variation in length (I'm using a model that supports up to 8192 max_seq_length, but most of the texts are much shorter than that), the initial estimate of time remaining from tqdm is way too high (e.g., 4 hours instead of 30 minutes). Perhaps when show_progress_bar=True, it would be good to print a warning that the time remaining may be overestimated due to the sort order.
The text was updated successfully, but these errors were encountered:
Due to encode() sorting the sentences in decreasing order by length, if I pass in a large list of texts with large variation in length (I'm using a model that supports up to 8192 max_seq_length, but most of the texts are much shorter than that), the initial estimate of time remaining from tqdm is way too high (e.g., 4 hours instead of 30 minutes). Perhaps when show_progress_bar=True, it would be good to print a warning that the time remaining may be overestimated due to the sort order.
The text was updated successfully, but these errors were encountered: