Lyricslk.com provides a collection of songs with the following attributes:
- Lyrics (Sinhala)
- Artist’s name (Sinhala)
- Author’s name (English)
- Title (Sinhala)

Because the whole query is processed in Sinhala, the authors' names also need to be in Sinhala, so we converted them manually.

Scrapy was used to crawl the website, and a separate application was written to convert the raw data into the format accepted by Solr.
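
As an illustration of the conversion step, the sketch below turns scraped records into Solr's XML update format (`<add>`, `<doc>`, `<field>`). It is not the converter used in this project: the field names `title`, `artist`, `author` and `lyrics` and the placeholder values are assumptions made for the example.

```python
import xml.etree.ElementTree as ET


def songs_to_solr_xml(songs):
    """Serialize a list of song dicts into a Solr XML <add> document."""
    root = ET.Element("add")
    for song in songs:
        doc = ET.SubElement(root, "doc")
        for name, value in song.items():
            # One <field name="..."> element per attribute of the song.
            field = ET.SubElement(doc, "field", name=name)
            field.text = value
    # encoding="unicode" returns a str and keeps Sinhala text unescaped.
    return ET.tostring(root, encoding="unicode")


# Hypothetical record: field names and values are placeholders only.
example_songs = [
    {"title": "සඳ", "artist": "හඳ", "author": "සඳ", "lyrics": "ඔබ හා මම"},
]
print(songs_to_solr_xml(example_songs))
```

The resulting string can be saved to a file and indexed with Solr's post tool.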
- Wildcard Filter
- Stop Words Filter
- Hunspell Stem Filter Factory for Stemming
- Spell checker
- Stop Words Filter
- Synonym Filter
- Hunspell Stem Filter Factory for Stemming
- Search songs by author, title, and artist.
- Combinations of the above-mentioned fields.
- Wild-card queries.
- Spell checking for queries with incorrect spellings.
- Queries with stemming.
- Queries where stopwords were eliminated.
- Queries with synonyms.
- Songs by author, title and artist: බණ්ඩාර අතාවුද හිත අස්සට හීන ගොඩක්
- Keyword: සඳ
- Spell check: සඳවතයේ instead of සඳවතියේ
- Combinations: චන්දන සඳ
- Wild card: `සඳව*`, `*කුලට`
- Synonyms: හඳ
- Stop word elimination: ඔබ හා මම
- Stemming: ඔබත් මමත්
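
For reference, queries like the ones above can also be sent straight to Solr's standard `select` handler. The sketch below only builds the request URL. The base URL `http://localhost:8983/solr`, the core name `songs` and the field name `title` are assumptions, and this is not necessarily how the bundled application issues its queries.

```python
from urllib.parse import urlencode


def build_select_url(base_url, core, query, rows=10):
    """Return a URL for Solr's /select handler with the query URL-encoded."""
    params = {"q": query, "rows": rows, "wt": "json"}
    return f"{base_url}/{core}/select?{urlencode(params)}"


# Assumed Solr location, core and field name; the term is the wildcard example.
url = build_select_url("http://localhost:8983/solr", "songs", "title:සඳව*")
print(url)
```

Encoding the parameters with `urlencode` keeps the Sinhala text and the `*` wildcard intact when the URL is transmitted.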
- Configure your Solr IP, port and core in `config.yaml`.
- Run the `.jar` using the command `java -jar solr-engine-0.0.1-SNAPSHOT.jar`.
- By default, Tomcat runs on port `8080`. Navigate to `http://localhost:8080`, and you'll access Fake Google.
- Copy the template folder to `solr-xxx/server/solr/` and rename the folder to the name of the core you specify in `config.yaml`.
- Create the core using Solr admin.
- Windows: `java -Dc=songs -jar post.jar song.xml`
- Linux: `./post -c songs songs.xml`
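
As an alternative to the post tool, an XML file can be sent to the core's `update` handler over HTTP. The following sketch only constructs the request and leaves sending it to the caller. The Solr address `http://localhost:8983/solr` is an assumption, and the core name `songs` is taken from the commands above.

```python
import urllib.request


def build_update_request(base_url, core, xml_payload):
    """Create a POST request for Solr's XML update handler (not sent here)."""
    url = f"{base_url}/{core}/update?commit=true"
    return urllib.request.Request(
        url,
        data=xml_payload.encode("utf-8"),
        headers={"Content-Type": "text/xml; charset=utf-8"},
        method="POST",
    )


# Assumed Solr address; the payload is a minimal placeholder document.
request = build_update_request(
    "http://localhost:8983/solr",
    "songs",
    '<add><doc><field name="title">සඳ</field></doc></add>',
)
# To actually send it: urllib.request.urlopen(request)
```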
- @asmanjitha
- @drox2014
- @ashendes