

Amatus Marlyn
-
Amatus Marlyn replied to the discussion What’s the best way to scrape map-based data from websites? in the forum General Web Scraping 9 months ago
What’s the best way to scrape map-based data from websites?
Watching the browser's network panel for tile requests reveals the URLs of the individual image tiles, which can then be downloaded and stitched together if needed.
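As a rough sketch of the stitching step: assuming the tile URLs copied from the network panel follow the common `/{z}/{x}/{y}.png` pattern (many map sites do, but that is an assumption to verify for each site), the paste position of each tile can be worked out from its URL. The function name `tile_layout`, the 256-pixel tile size, and the `tiles.example.com` host are all hypothetical.

```python
import re
from typing import Dict, Iterable, Tuple

# Assumed URL shape: .../{zoom}/{x}/{y}.png (optionally with a query string).
TILE_RE = re.compile(r"/(\d+)/(\d+)/(\d+)\.(?:png|jpe?g|webp)(?:\?.*)?$")


def tile_layout(urls: Iterable[str], zoom: int,
                tile_size: int = 256) -> Dict[str, Tuple[int, int]]:
    """Map each tile URL at the given zoom to its (left, top) pixel offset."""
    coords = {}
    for url in urls:
        match = TILE_RE.search(url)
        if match is None:
            continue  # not a tile request (CSS, scripts, API calls, ...)
        z, x, y = (int(part) for part in match.groups())
        if z == zoom:
            coords[url] = (x, y)
    if not coords:
        return {}
    # Offsets are relative to the top-left tile that was captured.
    min_x = min(x for x, _ in coords.values())
    min_y = min(y for _, y in coords.values())
    return {
        url: ((x - min_x) * tile_size, (y - min_y) * tile_size)
        for url, (x, y) in coords.items()
    }


captured = [
    "https://tiles.example.com/12/2048/1361.png",
    "https://tiles.example.com/12/2049/1361.png",
    "https://tiles.example.com/12/2048/1362.png?v=3",
    "https://tiles.example.com/style.css",
]
layout = tile_layout(captured, zoom=12)
```

Each offset can then be handed to whatever image library you use to paste the downloaded tile onto a blank canvas.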
-
Amatus Marlyn replied to the discussion How can I detect and manage duplicate data in my scraped results? in the forum General Web Scraping 9 months ago
How can I detect and manage duplicate data in my scraped results?
Implementing Levenshtein distance calculations helps spot near-duplicates, especially for text-based data with minor differences.
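For anyone who wants to try this, here is a minimal pure-Python sketch. The Levenshtein function is the standard dynamic-programming version; the `similarity` ratio, the 0.9 threshold, and the `drop_near_duplicates` helper are my own illustrative choices, not a fixed recipe, so tune them against your data.

```python
from typing import List


def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insertions, deletions, substitutions)."""
    if len(a) < len(b):
        a, b = b, a  # keep the row we store as short as possible
    previous = list(range(len(b) + 1))
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(
                previous[j] + 1,         # delete ch_a
                current[j - 1] + 1,      # insert ch_b
                previous[j - 1] + cost,  # substitute (or match)
            ))
        previous = current
    return previous[-1]


def similarity(a: str, b: str) -> float:
    """Normalise the distance to a 0..1 score, where 1.0 means identical."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def drop_near_duplicates(records: List[str], threshold: float = 0.9) -> List[str]:
    """Keep the first occurrence of each record; drop later near-duplicates."""
    kept = []  # (original, normalised) pairs
    for record in records:
        normalised = " ".join(record.lower().split())
        if all(similarity(normalised, seen) < threshold for _, seen in kept):
            kept.append((record, normalised))
    return [original for original, _ in kept]


titles = ["Acme Widget 3000", "Acme Widget 3000 ", "acme widgt 3000", "Other Product"]
unique_titles = drop_near_duplicates(titles)
```

This compares every record against every kept record, so it is only practical for small to medium result sets; for larger ones you would want to group records by some key first.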
-
Amatus Marlyn replied to the discussion How do I handle scraping pages with endless AJAX requests? in the forum General Web Scraping 9 months ago
How do I handle scraping pages with endless AJAX requests?
Tools like Selenium can trigger AJAX requests by interacting with the page, like clicking “load more” buttons, to display additional data.
-
Amatus Marlyn replied to the discussion What’s the best approach to scraping PDF documents online? in the forum General Web Scraping 9 months ago
What’s the best approach to scraping PDF documents online?
If the document layout is consistent, I automate a step that keeps only the relevant pages, which saves time when processing large documents.
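A small sketch of that page filter, assuming the text of each page has already been extracted with whichever PDF library you prefer. The function name `select_relevant_pages`, the sample page texts, and the "Invoice" pattern are hypothetical placeholders.

```python
import re
from typing import Iterable, List, Pattern


def select_relevant_pages(page_texts: Iterable[str],
                          pattern: Pattern[str]) -> List[int]:
    """Return the zero-based indexes of pages whose text matches the pattern."""
    return [
        index
        for index, text in enumerate(page_texts)
        if pattern.search(text or "")  # treat pages with no text as empty
    ]


# Stand-in for text extracted page by page from a real PDF.
sample_pages = [
    "Table of contents",
    "Invoice No. 1001\nTotal: 250.00",
    "Terms and conditions",
    "INVOICE no. 1002\nTotal: 99.50",
]
invoice_pages = select_relevant_pages(
    sample_pages, re.compile(r"invoice\s+no\.", re.IGNORECASE)
)
```

Only the pages at the returned indexes then need to go through the slower parsing step.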
-
Amatus Marlyn started the discussion What’s the best way to scrape Yahoo! Shopping Japan using Node.js? in the forum General Web Scraping 9 months ago
What’s the best way to scrape Yahoo! Shopping Japan using Node.js?
Using Puppeteer with Node.js is effective for scraping Yahoo! Shopping Japan’s dynamic content, as it allows full page rendering and DOM interaction.
-
Amatus Marlyn changed their photo 9 months ago
-
Amatus Marlyn became a registered member 9 months ago