

Gojko Diomedes
-
Gojko Diomedes replied to the discussion How do I handle CAPTCHA challenges that vary in difficulty or type? in the forum General Web Scraping 5 months ago
How do I handle CAPTCHA challenges that vary in difficulty or type?
For image CAPTCHAs, setting up an automated CAPTCHA bank where team members can manually solve them helps in real-time.
-
Gojko Diomedes replied to the discussion What are the best practices for scraping financial data from news or stock site? in the forum General Web Scraping 5 months ago
What are the best practices for scraping financial data from news or stock site?
If the data is only available on the website, I rotate proxies and user agents to avoid triggering blocks on high-traffic financial sites.
-
Gojko Diomedes replied to the discussion How do I approach scraping data that appears only on hover or in tooltips? in the forum General Web Scraping 5 months ago
How do I approach scraping data that appears only on hover or in tooltips?
Puppeteer’s hover function is another great option; it loads the tooltip content on hover without needing a visible browser.
-
Gojko Diomedes replied to the discussion How can I scrape structured data from sites without standard HTML tags? in the forum General Web Scraping 5 months ago
How can I scrape structured data from sites without standard HTML tags?
Scrapy’s XPath expressions are especially helpful for locating non-standard elements by their position in the DOM structure.
-
Gojko Diomedes replied to the discussion How can I scrape JavaScript-based content without headless browsers? in the forum General Web Scraping 5 months ago
How can I scrape JavaScript-based content without headless browsers?
Inspecting JavaScript functions in the source code can reveal data endpoints that load data independently of interactive content.
-
Gojko Diomedes started the discussion Best methods to scrape SKU or UPC metadata from Amazon or eBay? in the forum General Web Scraping 5 months ago
Best methods to scrape SKU or UPC metadata from Amazon or eBay?
Amazon often includes SKUs or ASINs directly in the page metadata, which I extract using BeautifulSoup to streamline data organization.
-
Gojko Diomedes changed their photo 5 months ago
-
Gojko Diomedes became a registered member 5 months ago