Skip to content

Latest commit

 

History

History
19 lines (13 loc) · 526 Bytes

README.md

File metadata and controls

19 lines (13 loc) · 526 Bytes

SearchEngine

Simple search engine for the ICS domain

To Do:
  • Mappings

    • Map each term to a unique termID
    • Map each doc to a unique docID (doc1: 1, doc2: 2, doc3: 3, doc4: 4)
    • Map each docID to a list of termIDs based on terms contained in the doc
    • Inverse index of term2termid above, that is map each termID to a term
    • termID -> docID, term frequency
    • termID -> docID, tf-idf
    • termID -> document list
  • Parse JSON

    • Setup/Test Jsoup