Software Engineer Intern
SalahKart- A Product of Jaspy Technologies Pvt. Ltd.
CIN: U78100UP2024PTC199951
Duration: 2 months
- NLP and Web Scraping:
- Employed BeautifulSoup and Selenium WebDriver to scrape data from LinkedIn profiles.
- Conducted resume parsing, extracting skills from resumes and job descriptions to calculate the matching percentage.
- Skill Matching and Semantic Textual Similarity (STS):
- Implemented three primary methods for skill matching:
- SentenceTransformer Model
- CrossEncoder Model
- Dictionary Method using a predefined skill list from a CSV file.
- Combined the methods using a normalized formula to compute the final skill matching percentage.
- Conducted Semantic Textual Similarity (STS) for accurate skill assessment.
- Implemented three primary methods for skill matching:
- Natural Language Processing (NLP):
- Utilized SpaCy and NLTK for advanced NLP tasks.
- Developed a Named Entity Recognition (NER) pipeline using Transformers by Hugging Face.
- Extracted nouns and verbs from resumes on a section-wise basis for enhanced resume analysis.
- Development and Testing:
- Developed and tested code in Python using NumPy, Pandas, and regular expressions.
- Used VS Code and Google Colab for coding and collaboration.
- Designed and documented workflow using Lucid Chart for flow diagrams and brainstorming sessions.
- Created a backend database with a Level 2 ER Diagram using the Eraser.io.
- Performed unit and integration testing manually to ensure code quality and reliability.
- Programming Languages: Python
- Libraries & Frameworks: SpaCy, NLTK, NumPy, Pandas, BeautifulSoup, Selenium WebDriver
- Machine Learning Models: SentenceTransformer, CrossEncoder
- Tools: VS Code, Google Colab, Lucid Chart, Eraser.io
- Database Design: Level 2 ER Diagrams
- Testing: Manual Unit and Integration Testing
- Additional Skills: Transformers, NLP, Web Scraping, Semantic Textual Similarity (STS), Named Entity Recognition (NER)
Provided Codes don't represent my complete work during internship and are Incomplete and partial. It's only for preview and educational purposes. These are not the Final production grade codes either. Kindly use it with caution ⚠