Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhancing Text Analysis with Advanced Algorithms #75

Merged

Conversation

UTSAVS26
Copy link
Contributor

Closes: #73

  • Title: Enhancing Text Analysis with Advanced Algorithms
  • Name: Utsav Singhal
  • Identify yourself: SSOC Contributor

Describe the add-ons or changes you've made 📃

I have integrated two powerful algorithms, TF-IDF (Term Frequency-Inverse Document Frequency) and Word2Vec, to enhance the text analysis capabilities of the project. TF-IDF helps in identifying important words in a document relative to the entire corpus, while Word2Vec captures the semantic meanings of words through vector representations. This combination will improve the accuracy and depth of text analysis in natural language processing tasks.

Type of change ☑️

What sort of change have you made:

  • New feature (non-breaking change which adds functionality)

How Has This Been Tested? ⚙️

I have verified the changes by running several test cases on different datasets to ensure that the integration of TF-IDF and Word2Vec algorithms works correctly. The tests included various text analysis tasks such as keyword extraction, similarity measurement, and semantic analysis. All tests passed successfully, and the new functionality performs as expected without any issues.

Checklist: ☑️

  • My code follows the Contributing Guidelines & Code of Conduct of this project.
  • This PR does not contain plagiarized content.
  • I have performed a self-review of my own code.
  • I have commented my code, particularly wherever it was hard to understand.
  • My changes generate no new warnings.

Note to reviewers 📄

I have thoroughly tested the new feature and ensured that it integrates seamlessly with the existing codebase. Please review the changes and let me know if any further modifications are needed. Thank you for your time and consideration.

Copy link
Contributor

Thank you for submitting your pull request! We'll review it as soon as possible. For further communication, join our discord server https://discord.gg/tSqtvHUJzE.

Copy link
Owner

@Avdhesh-Varshney Avdhesh-Varshney left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. But write the algorithm in python file and for explanation, use README file. No notebook contribution is here.

@Avdhesh-Varshney Avdhesh-Varshney added bug 🐛 Something isn't working Requested Changes ⚙️ Some changes have been requested in this PR labels Jul 24, 2024
@UTSAVS26
Copy link
Contributor Author

UTSAVS26 commented Jul 24, 2024

LGTM. But write the algorithm in python file and for explanation, use README file. No notebook contribution is here.

Okie, i will make proper changes today itself.

(Edit) All the changes have been made properly

@Avdhesh-Varshney Avdhesh-Varshney merged commit e8104ed into Avdhesh-Varshney:main Jul 25, 2024
@Avdhesh-Varshney Avdhesh-Varshney added Approved ✔️ This PR is approved by the PR or, Mentors SSOC These issue/pull request will be considered for SSOC'24 Intermediate Intermediate level issues/pull requests NLP This comes under the `NLP` directory Code Addition 🤖 Addition of new code or program or model. and removed bug 🐛 Something isn't working Requested Changes ⚙️ Some changes have been requested in this PR labels Jul 25, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Approved ✔️ This PR is approved by the PR or, Mentors Code Addition 🤖 Addition of new code or program or model. Intermediate Intermediate level issues/pull requests NLP This comes under the `NLP` directory SSOC These issue/pull request will be considered for SSOC'24
Projects
None yet
Development

Successfully merging this pull request may close these issues.

📃: Integration of TF-IDF and Word2Vec for Enhanced Text Analysis in NLP/Algorithms
2 participants