URL = http://ceo.uk.gov.in/pages/view/27-uttarakhand-state-electoral-rolls-%28in-pdf-format%29
The Script does two things:
-
It produces uttarakhand_20xx.csv that contains metadata about the pdfs. The CSV has the following fields:
year, ac_no, ac_name, filename
-
Downloads all the pdfs to a directory called
uttarakhand_20xx/
pip install -r requirements.txt
python uttarakhand_archives.py