-
Sofie Davonte became a registered member a year ago
-
Ravi Ernestas replied to the discussion How can I dynamically manage request headers while scraping? in the forum General Web Scraping a year ago
How can I dynamically manage request headers while scraping?
Add headers like Accept-Language and Referer to each request to make your scraper appear more human.
-
Ravi Ernestas replied to the discussion How can I handle large amounts of data scraped from a website? in the forum General Web Scraping a year ago
How can I handle large amounts of data scraped from a website?
Store data in compressed formats like Parquet or Avro, which save space and load faster. Pandas and Dask in Python make working with large data easy.
-
Ravi Ernestas replied to the discussion What’s the best way to avoid triggering anti-bot mechanisms on large e-commerce in the forum General Web Scraping a year ago
What’s the best way to avoid triggering anti-bot mechanisms on large e-commerce
Varying request timings is crucial. Anti-bot mechanisms often monitor request rates, so random delays between requests can help avoid being flagged.
-
Ravi Ernestas replied to the discussion How can I scrape websites with infinite scroll without losing data? in the forum General Web Scraping a year ago
How can I scrape websites with infinite scroll without losing data?
I’ve also written scripts that detect the ‘load more’ button, which some sites use instead of infinite scrolling. Simulating clicks on this button in a loop allows you to retrieve all content without scrolling.
-
Ravi Ernestas replied to the discussion How can I optimize my scraping code for faster performance? in the forum General Web Scraping a year ago
How can I optimize my scraping code for faster performance?
Reduce redundancy in your scraping logic. Sometimes, caching responses or reusing selectors prevents extra processing, making the code leaner and faster.
-
Ravi Ernestas started the discussion How does TypeScript differ from Flow for JavaScript type checking? in the forum General Web Scraping a year ago
How does TypeScript differ from Flow for JavaScript type checking?
TypeScript has better tooling support and is more widely used than Flow, making it the preferred option for most large-scale JavaScript projects.
-
Ravi Ernestas changed their photo a year ago
-
Ravi Ernestas became a registered member a year ago
-
Ace Arash changed their photo a year ago
- Load More