

Rhouth Vilma
-
Rhouth Vilma replied to the discussion How do I deal with scraped data that has inconsistent formatting? in the forum General Web Scraping 10 months ago
How do I deal with scraped data that has inconsistent formatting?
If I expect certain formats, like currency or dates, I parse those fields specifically to convert them into standardized formats.
-
Rhouth Vilma replied to the discussion How do I scrape data from sites using custom fonts or icons? in the forum General Web Scraping 10 months ago
How do I scrape data from sites using custom fonts or icons?
Extracting the text using Playwright and checking for any special CSS or JavaScript manipulations reveals additional data.
-
Rhouth Vilma replied to the discussion What strategies can I use to scrape websites with limited search functionality? in the forum General Web Scraping 10 months ago
What strategies can I use to scrape websites with limited search functionality?
I also check the site’s sitemap, as it often contains URLs that don’t show up in the internal search but are publicly accessible.
-
Rhouth Vilma started the discussion What are the top techniques for scraping the MediaMarkt site using PHP? in the forum General Web Scraping 10 months ago
What are the top techniques for scraping the MediaMarkt site using PHP?
PHP’s CURL library can fetch static product details effectively, but MediaMarkt’s JavaScript-heavy content may require a headless browser for full scraping.
-
Rhouth Vilma changed their photo 10 months ago
-
Rhouth Vilma became a registered member 10 months ago