How to fine tune UMAP #2088

Answered by MaartenGr
powerhorse1986 asked this question in Q&A
Jul 16, 2024 · 1 comment · 2 replies

Then that will depend on your definition of "quality" in the context of the creation of clusters and/or assignment of documents. It will also depend on whether you have a ground truth available or not. There are a number of cluster-based metrics here. If it is the representation of the clusters (the topics) you are interested in, then OCTIS has many metrics implemented.
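As a rough illustration of what cluster-based metrics can look like, here is a minimal sketch using scikit-learn. It is not taken from the linked page, and the data is synthetic: `embeddings`, `pred_labels`, and `true_labels` are placeholder names, and the helper `evaluate_clusters` is hypothetical. In a real BERTopic run you would presumably pass the (reduced) document embeddings and the topic assignments instead. The silhouette score needs no ground truth, while the adjusted Rand index only applies when one is available.

```python
from sklearn.datasets import make_blobs
from sklearn.metrics import adjusted_rand_score, silhouette_score

# Synthetic stand-ins (assumptions): `embeddings` plays the role of reduced
# document embeddings and `true_labels` the role of an optional ground truth.
embeddings, true_labels = make_blobs(
    n_samples=300, centers=3, cluster_std=0.5, random_state=42
)

# Pretend these are the cluster assignments, with 10 documents marked as
# outliers using the label -1.
pred_labels = true_labels.copy()
pred_labels[:10] = -1


def evaluate_clusters(embeddings, pred_labels, true_labels=None):
    """Hypothetical helper: score a clustering, ignoring outliers (-1)."""
    mask = pred_labels != -1
    scores = {
        # Unsupervised: how well separated the clusters are in this space.
        "silhouette": silhouette_score(embeddings[mask], pred_labels[mask]),
    }
    if true_labels is not None:
        # Supervised: agreement with the ground truth, if you have one.
        scores["adjusted_rand"] = adjusted_rand_score(
            true_labels[mask], pred_labels[mask]
        )
    return scores


scores = evaluate_clusters(embeddings, pred_labels, true_labels)
print(scores)
```

Which of these numbers matters depends on the definition of "quality" you settle on. Note also that a silhouette score computed on reduced embeddings measures separation in that reduced space only.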

Do note though that it is important to first define exactly what it is that you want to evaluate. With unsupervised methods, such as BERTopic, there typically isn't a ground truth available. As such, and due to the nature of topic modeling, there is a degree of subjectivity involved in the evaluation. There are proxies (…
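To make the idea of evaluating without a ground truth a bit more concrete, below is a small pure-Python sketch of one commonly used measure, topic diversity: the fraction of unique words among the top words of all topics. This is an assumption about what such a measure could look like, not necessarily one the answer refers to, and it is not the OCTIS implementation. The function name `topic_diversity` and the example topics are made up for illustration.

```python
def topic_diversity(topics, top_k=10):
    """Fraction of unique words among the top_k words of every topic.

    `topics` is a list of word lists, one per topic, ordered by importance.
    Values near 1.0 mean topics share few words; values near 0.0 mean the
    topics are largely redundant.
    """
    top_words = [word for topic in topics for word in topic[:top_k]]
    if not top_words:
        return 0.0
    return len(set(top_words)) / len(top_words)


# Made-up topics for illustration; "cluster" and "embedding" appear twice.
topics = [
    ["umap", "embedding", "dimension", "neighbors"],
    ["cluster", "hdbscan", "density", "outlier"],
    ["topic", "cluster", "keyword", "embedding"],
]

print(topic_diversity(topics, top_k=4))
```

A score like this only captures one aspect (redundancy between topics), so it is best read alongside a manual inspection of the topics rather than as a single measure of quality.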

Answer selected by powerhorse1986