fabitortorelli/WebScraping

WebScraping

Context

Web scraping is the process of extracting data from websites. It involves sending an HTTP request to the target website, parsing the HTML in the response, and extracting the desired data.
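As a minimal sketch of that request-and-parse cycle, the snippet below fetches a page with requests and reads its title with BeautifulSoup. The `BASE_URL` value and the helper names `fetch_html` and `page_title` are assumptions made for illustration, not code taken from this repository.

```python
import requests
from bs4 import BeautifulSoup

# Assumed address of the Books to Scrape practice site.
BASE_URL = "https://books.toscrape.com/"


def fetch_html(url: str, timeout: float = 10.0) -> str:
    """Send a GET request and return the response body as text."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx responses
    return response.text


def page_title(html: str) -> str:
    """Parse an HTML document and return the text of its <title> tag."""
    soup = BeautifulSoup(html, "html.parser")
    return soup.title.get_text(strip=True) if soup.title else ""
```

Calling `page_title(fetch_html(BASE_URL))` would perform one request and one extraction. Keeping the download and the parsing in separate functions makes the parsing step easy to check against a saved HTML string without touching the network.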

I use the website Books to Scrape to practice and improve my understanding of web scraping.

Analysis Plan

  1. Task definition:
  • Extract data from every book category listed on the website
  • Specifically, gather each book's title and star rating
  2. Importing packages and making the request:
  • Use the requests package to send HTTP requests to the website
  • Use BeautifulSoup to parse the HTML response and navigate its structure
  3. Finding the link for each book category:
  • Identify and extract the URL of each book category from the website
  4. Grabbing all titles and star ratings from each book category:
  • Iterate through each category URL to extract titles and star ratings
  • Use suitable HTML parsing and selection methods to extract the data accurately
  5. Saving the information:
  • Save the extracted data in a structured format for further analysis and processing
  6. Descriptive analysis of the extracted data:
  • Perform a descriptive analysis of the collected data
  • Explore patterns and insights in the extracted book titles and star ratings
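The parsing, saving, and analysis steps of this plan could be sketched roughly as follows. This is an illustration under stated assumptions, not the project's actual code: the CSS selectors (`div.side_categories`, `article.product_pod`, `p.star-rating`) reflect how the Books to Scrape pages appear to be marked up, and all function names are hypothetical. Each function takes HTML that has already been downloaded, and following the "next" links of categories that span several pages is left out.

```python
import csv
from collections import Counter
from urllib.parse import urljoin

from bs4 import BeautifulSoup

# Assumption: the rating is encoded as a word in the CSS class,
# e.g. class="star-rating Three".
RATING_WORDS = {"One": 1, "Two": 2, "Three": 3, "Four": 4, "Five": 5}


def category_links(home_html, base_url):
    """Find category links: map each category name to its absolute URL."""
    soup = BeautifulSoup(home_html, "html.parser")
    anchors = soup.select("div.side_categories ul li ul li a")
    return {a.get_text(strip=True): urljoin(base_url, a["href"]) for a in anchors}


def books_on_page(category_html):
    """Grab titles and ratings: return (title, rating) pairs for one page."""
    soup = BeautifulSoup(category_html, "html.parser")
    books = []
    for pod in soup.select("article.product_pod"):
        title = pod.h3.a["title"]  # the full title is stored in the title attribute
        classes = pod.select_one("p.star-rating")["class"]
        # rating is None if no known rating word is present in the class list
        rating = next((RATING_WORDS[c] for c in classes if c in RATING_WORDS), None)
        books.append((title, rating))
    return books


def save_rows(rows, path):
    """Save the information: write (category, title, rating) rows to a CSV file."""
    with open(path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["category", "title", "rating"])
        writer.writerows(rows)


def rating_counts(rows):
    """Descriptive analysis: count how many books received each star rating."""
    return Counter(rating for _, _, rating in rows)
```

A driver loop would download the home page, call `category_links`, download each category URL, and extend a list of rows with the output of `books_on_page` before passing it to `save_rows` and `rating_counts`.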

Note: For a detailed presentation of this project, visit my Medium page Web Scraping: A Way to Collect Data
