-
Michael Woo replied to the discussion What’s the best tool for scraping JavaScript-heavy websites? in the forum General Web Scraping a year ago
What’s the best tool for scraping JavaScript-heavy websites?
Selenium works too, but it’s slower and can be more cumbersome for complex JavaScript interactions.
-
FARHAN AHMED replied to the discussion Best practices for web scraping to avoid getting blocked by websites. in the forum General Web Scraping a year ago
Best practices for web scraping to avoid getting blocked by websites.
If a website doesn’t use JavaScript to load dynamic content, you can use Python’s
requestslibrary, passing headers and a proxy to avoid blocking and achieve a high success rate. If the site relies on JavaScript, however, you’ll need to use automation tools like Playwright or Selenium. From my testing, Selenium Stealth works best for avoiding… Read more -
FARHAN AHMED replied to the discussion Best techniques to rotate proxies when scraping websites. in the forum General Web Scraping a year ago
Best techniques to rotate proxies when scraping websites.
PADI proxies are better and more reliable than free proxies. If you have a list of URLs, you can use a for loop to iterate through them, applying a new proxy on each loop iteration. This technique helps you avoid blocks. Other factors also affect success, such as the type of automation browser you are using.
-
Koa Linn replied to the discussion Best techniques to rotate proxies when scraping websites. in the forum General Web Scraping a year ago
Best techniques to rotate proxies when scraping websites.
You can also use free proxies, but they tend to be less reliable and can lead to more blocks.
-
Koa Linn started the discussion What’s the best tool for scraping JavaScript-heavy websites? in the forum General Web Scraping a year ago
What’s the best tool for scraping JavaScript-heavy websites?
Playwright and Puppeteer are the best for handling JavaScript-heavy sites, especially modern web apps.
-
Koa Linn changed their photo a year ago
-
Koa Linn became a registered member a year ago
-
Ikaika Kapono replied to the discussion Best practices for web scraping to avoid getting blocked by websites. in the forum General Web Scraping a year ago
Best practices for web scraping to avoid getting blocked by websites.
Rotate user-agent strings to make your scraper look like different browsers.
-
Ikaika Kapono started the discussion How to avoid IP bans while scraping heavily trafficked sites? in the forum General Web Scraping a year ago
How to avoid IP bans while scraping heavily trafficked sites?
Rotate proxies or use a paid proxy service to avoid hitting the site from a single IP.
-
Ikaika Kapono changed their photo a year ago
- Load More