Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The scraper seems to only pull the top 10 reviews regardless of how many pages it attempts to navigate to #57

Open
Drew-Smith-18 opened this issue Aug 8, 2021 · 2 comments

Comments

@Drew-Smith-18
Copy link

Does anyone know how to resolve this?

@Drew-Smith-18
Copy link
Author

Found a solution, it is pretty hacky but...replace go_to_next_page(): with below and it will work

def go_to_next_page():
logger.info(f'Going to page {page[0] + 1}')

  currentUrl = browser.current_url
  print(f'old url: {currentUrl}')

  currentUrl = currentUrl.split('.htm', 1)
  currentUrl = currentUrl[0].split('_')[0]
  currentUrl = [currentUrl]
  currentUrl.insert(1, f'_P{page[0] + 1}.htm')
  print(f'new url: {currentUrl}')

  browser.get(''.join(currentUrl))
  time.sleep(5) # wait for ads to load
  page[0] = page[0] + 1

@roscoe777
Copy link

roscoe777 commented Oct 25, 2021

@Drew-Smith-18 May I know if this scraper is still working? I tried example 1 but kept getting errors as below. The process stops at landing on the first reviews page. I would very appreciate it if you could give some instructions. Thank you!
Screenshot 2021-10-24 215142

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants