Sikkim

URL = http://ceosikkim.nic.in/PS_Wise_ele_roll%20w.r.t.2013/PS%20AFTER%20RATIONALISATION/Polling%20Station%20wise%20final%20Electoral%20Roll%20published%20on%2010th%20January%202018.html

Year = 2018

Total number of files = 600

Script

The script iterates through the rows of the HTML table at the URL above and does two things:

  1. Writes sikkim.csv, which records metadata about each PDF. The CSV has the following columns: ac_number, ac_name, part_number, polling_station_name, relative_path_to_file

  2. Downloads all the PDFs to sikkim_pdfs/
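
The first step can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the code in sikkim.py. The order of the table cells (AC number, AC name, part number, polling station name, then a link to the PDF), the file naming scheme, and the helper names RowParser, table_to_records and write_csv are all assumptions made for the example.

```python
import csv
import io
from html.parser import HTMLParser

COLUMNS = ["ac_number", "ac_name", "part_number",
           "polling_station_name", "relative_path_to_file"]


class RowParser(HTMLParser):
    """Collects the cell texts and the first link of every <tr>."""

    def __init__(self):
        super().__init__()
        self.rows = []      # list of (cells, href) tuples
        self._cells = None  # cells of the row being parsed, or None
        self._text = None   # text pieces of the current cell, or None
        self._href = None   # first href seen in the current row

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._cells, self._href = [], None
        elif tag in ("td", "th") and self._cells is not None:
            self._text = []
        elif tag == "a" and self._cells is not None and self._href is None:
            self._href = dict(attrs).get("href")

    def handle_data(self, data):
        if self._text is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._text is not None:
            # Collapse runs of whitespace inside the cell.
            self._cells.append(" ".join("".join(self._text).split()))
            self._text = None
        elif tag == "tr" and self._cells is not None:
            self.rows.append((self._cells, self._href))
            self._cells = None


def table_to_records(html, out_dir="sikkim_pdfs"):
    """Return (csv_row, pdf_href) pairs; the cell order is an assumption."""
    parser = RowParser()
    parser.feed(html)
    records = []
    for cells, href in parser.rows:
        if href is None or len(cells) < 4:
            continue  # header rows, or rows without a PDF link
        ac_number, ac_name, part_number, station = cells[:4]
        row = {
            "ac_number": ac_number,
            "ac_name": ac_name,
            "part_number": part_number,
            "polling_station_name": station,
            # Hypothetical naming scheme for the downloaded file.
            "relative_path_to_file": f"{out_dir}/{ac_number}_{part_number}.pdf",
        }
        records.append((row, href))
    return records


def write_csv(records, fileobj):
    """Write the metadata rows with the five columns listed above."""
    writer = csv.DictWriter(fileobj, fieldnames=COLUMNS)
    writer.writeheader()
    writer.writerows(row for row, _ in records)
```

Keeping the link next to each row means the download step can reuse the same list, so the CSV and the files on disk stay in sync.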

Running the script

pip install -r requirements.txt
python sikkim.py

Missing PDF files

Some of the PDF files linked from the table are missing: requests for them return HTTP 404.
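
One way to keep a run going when a file is missing is to treat a 404 as "record and skip" rather than as a fatal error. The sketch below shows the idea with the standard library's urllib. It is not taken from sikkim.py, and the function names fetch_pdf and download_all are hypothetical. The opener parameter exists only so the logic can be exercised without network access.

```python
import urllib.error
import urllib.request


def fetch_pdf(url, opener=urllib.request.urlopen):
    """Return the response body, or None when the server answers HTTP 404."""
    try:
        with opener(url) as response:
            return response.read()
    except urllib.error.HTTPError as err:
        if err.code == 404:
            return None
        raise  # other HTTP errors are not expected, so let them surface


def download_all(urls, opener=urllib.request.urlopen):
    """Fetch every URL; return (bodies_by_url, list_of_missing_urls)."""
    fetched, missing = {}, []
    for url in urls:
        body = fetch_pdf(url, opener)
        if body is None:
            missing.append(url)
        else:
            fetched[url] = body
    return fetched, missing
```

The returned list of missing URLs can then be written out next to sikkim.csv so the gaps are documented.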