Skip to content

Jean-Baptiste-Camps/IIIF-Crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 

Repository files navigation

IIIF-Crawler

Tool to interrogate IIIF servers and get images of manuscripts.

Forked from PonteIneptique's bnfcrawler, https://gist.github.com/PonteIneptique/adbb7472b9ced07ca9287fbf2e1584ce !

Currently implemented:

  • gallica;
  • e-codices (beta);
  • bvmm
  • … (more to come).

Usage

python3 iiifcrawler.py ID --source SOURCE --start 1 --end 2

where,

  • ID is the identifier of the manuscript, e.g. the ark btv1b9059486c or the code bbb-0113, or Boulogne-sur-Mer/B621606201_MS0192;
  • SOURCE, the source from which to download (currently, only gallica, e-codices and bvmm);
  • and 1 and 2 are the beginning and end folios you want to download (N.B.: page number are different on Gallica, and E-Codices, so, on E-Codices, 15 to 16 will send back four pages, 15r to 16v).

Alternatively, you can pass a tsv file with the relevant informations (see example.tsv).

python3 iiifcrawler.py example.tsv

About

Interrogate IIIF servers and get images of manuscripts

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages