kavgan

Pinned

  1. nlp-in-practice (Public)

    Starter code to solve real-world text data problems. Includes: Gensim Word2Vec, phrase embeddings, text classification with logistic regression, word count with PySpark, simple text preprocessing, …

    Jupyter Notebook, 1.2k stars, 793 forks

  2. phrase-at-scale (Public)

    Detect common phrases in large amounts of text using a data-driven approach. Discovered phrases can be of arbitrary length. Can be used with languages other than English.

    Python, 125 stars, 45 forks

  3. ROUGE-2.0 (Public)

    ROUGE automatic summarization evaluation toolkit. Supports ROUGE-[N, L, S, SU], stemming and stopwords in different languages, Unicode text evaluation, and CSV output.

    Java, 212 stars, 37 forks

  4. OpinRank (Public)

    OpinRank Dataset. Contains user reviews of cars and hotels: full reviews from Tripadvisor (~259,000 reviews) and Edmunds (~42,230 reviews).

    43 stars, 12 forks

  5. word_cloud (Public)

    Python word cloud library for use within Jupyter notebooks and Python apps.

    Jupyter Notebook, 48 stars, 14 forks

  6. opinosis-summarization (Public)

    This repo contains the code and dataset for the Opinosis Summarization Framework.

    50 stars, 18 forks