Skip to content

v1.1.0

Compare
Choose a tag to compare
@NastyBoget NastyBoget released this 24 Oct 10:01
· 15 commits to master since this release
b79dd4c
  • Add BBoxAnnotation to table cells for PdfTabbyReader.
  • Fix swagger, add api schema classes, remove to_dict method from ParsedDocument.
  • Improve parsing PDF by PdfTxtlayerReader, add benchmarks.
  • Fix BBoxAnnotation extraction for tables in PdfImageReader using table_type=split_last_column parameter.
  • Change base method of metadata extractors, rename it to extract_metadata.
  • Unify BBoxAnnotation extraction for all PDF readers - return only words bboxes.
  • Increase timeout value for all converters.