

Rhouth Vilma
-
Rhouth Vilma replied to the discussion How do I deal with scraped data that has inconsistent formatting? in the forum General Web Scraping 4 months ago
How do I deal with scraped data that has inconsistent formatting?
If I expect certain formats, like currency or dates, I parse those fields specifically to convert them into standardized formats.
-
Rhouth Vilma replied to the discussion How do I scrape data from sites using custom fonts or icons? in the forum General Web Scraping 4 months ago
How do I scrape data from sites using custom fonts or icons?
Extracting the text using Playwright and checking for any special CSS or JavaScript manipulations reveals additional data.
-
Rhouth Vilma replied to the discussion What strategies can I use to scrape websites with limited search functionality? in the forum General Web Scraping 4 months ago
What strategies can I use to scrape websites with limited search functionality?
I also check the site’s sitemap, as it often contains URLs that don’t show up in the internal search but are publicly accessible.
-
Rhouth Vilma started the discussion What are the top techniques for scraping the MediaMarkt site using PHP? in the forum General Web Scraping 4 months ago
What are the top techniques for scraping the MediaMarkt site using PHP?
PHP’s CURL library can fetch static product details effectively, but MediaMarkt’s JavaScript-heavy content may require a headless browser for full scraping.
-
Rhouth Vilma changed their photo 4 months ago
-
Rhouth Vilma became a registered member 4 months ago