Is there a way to keep representative documents after merging models? #1972

rociortizb · 2024-05-06T15:26:29Z

rociortizb
May 6, 2024

I've been training a model for a few weeks now, updating it with novel topics as new data comes in (by training on the new data and using the merge_models function). I'd like to do some analysis now but when trying to access the representative documents for the topics I realized that they are empty, and upon investigating a bit I found that they are not saved after merging models "because of privacy reasons".

I understand how this is useful, however for my use case I'd like to actually keep them, which brings me to the question: is there a way to change the default settings such that the representative documents are kept even after merging the models?

Answered by MaartenGr

May 7, 2024

That is currently not possible since merge models was also created for federated learning, where privacy is of the utmost concern.

Although that is not possible with public functions, you could use the private ._extract_representative_docs function instead to recalculate the representative documents. It would require you to have access to all documents though.

I believe there are a couple of issues that describe how to use it.

View full answer

MaartenGr · 2024-05-07T13:57:37Z

MaartenGr
May 7, 2024
Maintainer

That is currently not possible since merge models was also created for federated learning, where privacy is of the utmost concern.

Although that is not possible with public functions, you could use the private ._extract_representative_docs function instead to recalculate the representative documents. It would require you to have access to all documents though.

I believe there are a couple of issues that describe how to use it.

1 reply

rociortizb May 7, 2024
Author

Thought this may be the case, but worth asking anyway. Don't really have access to the training documents anymore but will try to reproduce the ._extract_representative_docs function with sufficient new documents to see if I can get good representative documents for each topic. Thank you for your answer! :)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is there a way to keep representative documents after merging models? #1972

{{title}}

Replies: 1 comment 1 reply

{{title}}

{{title}}

Select a reply

Is there a way to keep representative documents after merging models? #1972

rociortizb May 6, 2024

Replies: 1 comment · 1 reply

MaartenGr May 7, 2024 Maintainer

rociortizb May 7, 2024 Author

rociortizb
May 6, 2024

Replies: 1 comment 1 reply

MaartenGr
May 7, 2024
Maintainer

rociortizb May 7, 2024
Author