Replies: 1 comment
-
Have you also tried using
When you merge models, it should retain the order of assignment according to the order of models that you put in the |
Beta Was this translation helpful? Give feedback.
-
Have you also tried using
When you merge models, it should retain the order of assignment according to the order of models that you put in the |
Beta Was this translation helpful? Give feedback.
-
Hello,
I am trying to topic model approximately ~4.5 million tweets.
I have been unable to successfully do so given the memory requirements in UMAP. I have tried the suggested solutions to memory problems (i.e., low_memory=True, setting min_df, and setting calculate probabilities to False). I also have tried online topic modelling, but the topics seem less sensible as compared to those in the traditional full model.
I am currently trying two new solutions, one is merging topic models and the other is to train on a sample and then transforming on the rest. My question is, with these methods, is there a way to then assign each tweet in an original df a topic. I know the index length remains the same with the traditional bertopic, but I wasn't sure that was the case in merged models.
Thank you for your help!
Beta Was this translation helpful? Give feedback.
All reactions