

Filipp Maglocunos
-
Filipp Maglocunos replied to the discussion How can I detect and manage duplicate data in my scraped results? in the forum General Web Scraping 5 months ago
How can I detect and manage duplicate data in my scraped results?
By using unique constraints in SQL databases, I can prevent duplicates at the database level, which simplifies post-processing.
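For anyone who wants to try this, here is a minimal sketch, assuming SQLite and assuming the listing URL is what identifies a duplicate. The table name `items`, its columns, and the helper `save_items` are made up for illustration and are not from the original reply.

```python
import sqlite3


def save_items(conn, items):
    """Insert scraped (url, title) rows, skipping URLs that are already stored.

    Returns the number of rows actually inserted.
    """
    # The UNIQUE constraint on url is what rejects duplicates at the database level.
    conn.execute(
        "CREATE TABLE IF NOT EXISTS items ("
        " url TEXT NOT NULL UNIQUE,"
        " title TEXT)"
    )
    before = conn.total_changes
    # INSERT OR IGNORE silently skips rows that would violate the constraint.
    conn.executemany(
        "INSERT OR IGNORE INTO items (url, title) VALUES (?, ?)",
        items,
    )
    conn.commit()
    return conn.total_changes - before


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    scraped = [
        ("https://example.com/a", "Item A"),
        ("https://example.com/b", "Item B"),
        ("https://example.com/a", "Item A (scraped again)"),
    ]
    print(save_items(conn, scraped), "new rows stored")
```

Other databases have their own syntax for skipping conflicting rows, so the `INSERT OR IGNORE` line is the part to adapt if you are not on SQLite.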
-
Filipp Maglocunos replied to the discussion How do I handle scraping pages with endless AJAX requests? in the forum General Web Scraping 5 months ago
How do I handle scraping pages with endless AJAX requests?
Inspecting the URL structure of AJAX requests often reveals pagination parameters, which I can modify to control data retrieval directly.
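A rough sketch of that idea, assuming the AJAX endpoint takes its page number as a query parameter. The endpoint URL, the parameter name `page`, and the helper `with_page` are hypothetical; check the browser's network tab for the real ones on the site you are scraping.

```python
from urllib.parse import parse_qs, urlencode, urlsplit, urlunsplit


def with_page(url, page, param="page"):
    """Return url with the pagination parameter set to the given page."""
    parts = urlsplit(url)
    # Parse the existing query string so the other parameters are preserved.
    query = parse_qs(parts.query, keep_blank_values=True)
    query[param] = [str(page)]
    return urlunsplit(parts._replace(query=urlencode(query, doseq=True)))


if __name__ == "__main__":
    # Hypothetical endpoint copied from an observed AJAX request.
    base = "https://example.com/api/items?page=1&limit=50"
    for page in range(1, 4):
        print(with_page(base, page))
```

Each generated URL can then be requested directly, stopping when the endpoint returns an empty result, instead of scrolling the page in a browser.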
-
Filipp Maglocunos replied to the discussion What’s the best approach to scraping PDF documents online? in the forum General Web Scraping 5 months ago
What’s the best approach to scraping PDF documents online?
Cloud-based OCR services, such as the Google Cloud Vision API, handle complex PDFs more effectively, though there’s a cost involved.
-
Filipp Maglocunos started the discussion How can I track price changes on Mercari Japan using Ruby? in the forum General Web Scraping 5 months ago
How can I track price changes on Mercari Japan using Ruby?
I use the Nokogiri gem in Ruby for scraping Mercari Japan’s product listings, focusing on price data and seller information.
-
Filipp Maglocunos became a registered member 5 months ago