Skip to content

Python binding to libpoppler with focus on text extraction

Notifications You must be signed in to change notification settings

friedelwolff/pdfparser

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Binding for libpoppler - focus on text extration from PDF documents. Also comparison with other approaches - pdfminer and GObject Introspection binding for libpoppler.

Requires recent libpoppler >= 0.40 - so it's recommended to compile it from source. Use script build_poppler.sh to clone and build. To install system wise:

make install
ldconfig

Available under GPL v3 license (libpoppler is also GPL).

About

Python binding to libpoppler with focus on text extraction

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 97.4%
  • Shell 2.6%