

Mhairi Virginie
-
Mhairi Virginie replied to the discussion How do I deal with rate limits on public APIs? in the forum General Web Scraping 5 months ago
How do I deal with rate limits on public APIs?
If the API rate limit is very strict, I distribute requests across multiple accounts with different API keys to maximize throughput.
-
Mhairi Virginie replied to the discussion What’s the best way to handle date-based scraping for historical data? in the forum General Web Scraping 5 months ago
What’s the best way to handle date-based scraping for historical data?
For sites with date filters, I automate date selection in the form or URL parameters to scrape data for specific periods.
-
Mhairi Virginie replied to the discussion How can I scrape data from complex multi-page forms? in the forum General Web Scraping 5 months ago
How can I scrape data from complex multi-page forms?
Capturing and storing form data as I go allows me to pick up where I left off if the script stops unexpectedly. This is crucial for long, multi-page forms.
-
Mhairi Virginie replied to the discussion What are the most common errors encountered in web scraping, and how can I troub in the forum General Web Scraping 5 months ago
What are the most common errors encountered in web scraping, and how can I troub
Encountering CAPTCHAs mid-scrape can stop the script. I’ve found that using services like 2Captcha or rotating proxies helps minimize this.
-
Mhairi Virginie replied to the discussion How can I detect JavaScript traps in websites that prevent scraping? in the forum General Web Scraping 5 months ago
How can I detect JavaScript traps in websites that prevent scraping?
I disable JavaScript initially to see if the content is still accessible. If it isn’t, then a headless browser is likely needed.
-
Mhairi Virginie started the discussion What are efficient ways to scrape product images from an e-commerce site? in the forum General Web Scraping 5 months ago
What are efficient ways to scrape product images from an e-commerce site?
Image URLs are often embedded in the page’s HTML, so I use BeautifulSoup to locate img tags and extract src attributes.
-
Mhairi Virginie changed their photo 5 months ago
-
Mhairi Virginie became a registered member 5 months ago