Implement a Doc2Vec mechanism for document comparison #12

Open
liadmagen opened this issue Aug 2, 2018 · 0 comments
Labels
enhancement 🚀 New feature or request good first issue 🆕 Good for newcomers Hacktoberfest 🍁 For the Hacktoberfest Challenge help wanted ✋ Extra attention is needed NLP 🤖 Natural Language Processing

Comments

@liadmagen
Member

Create an easy way to compare documents by their contents, using Doc2Vec.
Read this blog post for background and details about the algorithm:
https://medium.com/scaleabout/a-gentle-introduction-to-doc2vec-db3e8c0cce5e

To do so, we first need REST API methods to:

  • convert a single document to a vector
  • convert multiple documents (array of strings) to an array of vectors
  • given two vectors, return the distance between them
  • given two documents, convert them to vectors and return the distance (similarity) between them
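
As a rough starting point, the four operations above could be sketched as plain Python functions that a REST layer would later wrap. Everything named here is hypothetical: the function names, the `embed` parameter, and the choice of cosine distance are assumptions, since the issue does not fix a metric or an API shape. `toy_embed` is a hashed bag-of-words stand-in, not Doc2Vec; the inference call of a trained Doc2Vec model (for example gensim's `infer_vector`) would be passed in as `embed` instead.

```python
import math
import zlib
from typing import Callable, List, Sequence

Vector = List[float]
Embedder = Callable[[str], Vector]


def toy_embed(document: str, dims: int = 16) -> Vector:
    """Placeholder embedder (hashed bag of words). This is NOT Doc2Vec."""
    vector = [0.0] * dims
    for token in document.lower().split():
        # crc32 gives a deterministic bucket for each token.
        vector[zlib.crc32(token.encode("utf-8")) % dims] += 1.0
    return vector


def document_to_vector(document: str, embed: Embedder = toy_embed) -> Vector:
    """Convert a single document to a vector."""
    return embed(document)


def documents_to_vectors(
    documents: Sequence[str], embed: Embedder = toy_embed
) -> List[Vector]:
    """Convert an array of documents to an array of vectors."""
    return [embed(document) for document in documents]


def cosine_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Return 1 - cosine similarity; 0.0 means the same direction."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # Assumption: treat an all-zero vector as maximally dissimilar.
        return 1.0
    return 1.0 - dot / (norm_a * norm_b)


def document_distance(
    doc_a: str, doc_b: str, embed: Embedder = toy_embed
) -> float:
    """Vectorize two documents and return the distance between them."""
    return cosine_distance(embed(doc_a), embed(doc_b))
```

Keeping the embedder as a parameter means the REST handlers and the distance logic can be written and tested before a Doc2Vec model is trained.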
@liadmagen added the enhancement 🚀 New feature or request, help wanted ✋ Extra attention is needed, and good first issue 🆕 Good for newcomers labels on Aug 2, 2018
@Createdd added the NLP 🤖 Natural Language Processing label on Aug 15, 2018
@Createdd added the Hacktoberfest 🍁 For the Hacktoberfest Challenge label on Oct 3, 2018