jay-krishna/list-extractor

 
 

list-extractor - Extract Data from Wikipedia Lists

### How to run the script

`python listExtractor.py collect_mode source language`

  • collect_mode: use `s` to process a single resource, or `a` to process a whole class of resources. The choice determines how the next parameter is interpreted.
  • source: either a class of resources from the DBpedia ontology (currently only Writer and Actor are supported) or a single Wikipedia page of an actor or writer.
  • language: the two-letter code of the Wikipedia edition and SPARQL endpoint to query (currently only `en` or `it` is accepted).
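The three parameters and their accepted values can be sketched as a small argument parser. This is only an illustration built from the descriptions above, not the code in listExtractor.py; the function name `build_parser` and the help strings are assumptions made for the example.

```python
import argparse

# Accepted values as described in this README. The parser is an illustrative
# sketch and does not reproduce how listExtractor.py reads its arguments.
COLLECT_MODES = ("s", "a")
LANGUAGES = ("en", "it")


def build_parser():
    """Build a parser for: collect_mode source language (hypothetical helper)."""
    parser = argparse.ArgumentParser(prog="listExtractor.py")
    parser.add_argument(
        "collect_mode",
        choices=COLLECT_MODES,
        help="s for a single resource, a for a class of resources",
    )
    parser.add_argument(
        "source",
        help="DBpedia ontology class (Writer, Actor) or a Wikipedia page name",
    )
    parser.add_argument(
        "language",
        choices=LANGUAGES,
        help="two-letter code of the Wikipedia edition to query",
    )
    return parser


# Parse an explicit argument list, mirroring one of the README examples.
args = build_parser().parse_args(["s", "William_Gibson", "en"])
print(args.collect_mode, args.source, args.language)
```

With `choices` set, an unsupported mode or language makes the parser exit with a usage message instead of starting an extraction.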

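Examples:

  • `python listExtractor.py a Writer it` processes the Writer class using Italian Wikipedia.
  • `python listExtractor.py s William_Gibson en` processes the single page William_Gibson using English Wikipedia.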

If successful, a .ttl file containing RDF statements about the specified source is created inside a subdirectory called 'extracted'.
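To check what a run produced, the output directory can be inspected with a few lines of Python. This is a minimal sketch that assumes only what is stated above (a subdirectory named extracted holding .ttl files); the helper name `list_ttl_files` is hypothetical, and nothing is assumed about how the files are named.

```python
from pathlib import Path
from typing import List


def list_ttl_files(output_dir: str = "extracted") -> List[Path]:
    """Return the .ttl files found in output_dir, sorted by name.

    Hypothetical helper: returns an empty list if the directory is missing.
    """
    directory = Path(output_dir)
    if not directory.is_dir():
        return []
    return sorted(directory.glob("*.ttl"))


# Print each output file with its size; prints nothing if no run has succeeded yet.
for ttl_file in list_ttl_files():
    print(ttl_file.name, ttl_file.stat().st_size, "bytes")
```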

### Requirements
