

Filipp Maglocunos
-
Filipp Maglocunos replied to the discussion "How can I detect and manage duplicate data in my scraped results?" in the forum General Web Scraping 10 months ago
How can I detect and manage duplicate data in my scraped results?
By using unique constraints in SQL databases, I can prevent duplicates at the database level, which simplifies post-processing.
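A minimal sketch of the idea, assuming SQLite and a hypothetical `items` table that treats the listing URL as the unique key (the reply names neither the database nor the schema). `INSERT OR IGNORE` is SQLite-specific syntax; other databases have their own equivalents.

```python
import sqlite3


def save_items(conn, items):
    """Insert scraped rows and return how many were actually new.

    Assumption: `url` identifies a record. The UNIQUE constraint makes the
    database itself reject a second row with the same url.
    """
    conn.execute(
        "CREATE TABLE IF NOT EXISTS items ("
        " url TEXT NOT NULL UNIQUE,"
        " title TEXT,"
        " price INTEGER)"
    )
    before = conn.total_changes
    # OR IGNORE skips rows that would violate the UNIQUE constraint
    # instead of raising sqlite3.IntegrityError.
    conn.executemany(
        "INSERT OR IGNORE INTO items (url, title, price) VALUES (?, ?, ?)",
        items,
    )
    conn.commit()
    return conn.total_changes - before


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")
    scraped = [
        ("https://example.com/a", "Item A", 100),
        ("https://example.com/b", "Item B", 200),
        ("https://example.com/a", "Item A (seen again)", 100),
    ]
    print(save_items(conn, scraped), "new rows stored")
```

Because the constraint lives in the schema, re-running the scraper over the same pages cannot add duplicate rows, so no separate clean-up pass is needed for that key.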
-
Filipp Maglocunos replied to the discussion "How do I handle scraping pages with endless AJAX requests?" in the forum General Web Scraping 10 months ago
How do I handle scraping pages with endless AJAX requests?
Inspecting the URL structure of AJAX requests often reveals pagination parameters, which I can modify to control data retrieval directly.
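As a rough illustration, assuming the AJAX endpoint exposes a query parameter such as `page` (the parameter name and the example URL are hypothetical; the real ones have to be read from the browser's network tab), the URL can be rewritten with the standard library:

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def with_page(url, page, param="page"):
    """Return `url` with its pagination query parameter set to `page`.

    Assumption: pagination is driven by a single query parameter
    (called "page" by default here). Other parameters are preserved;
    the pagination parameter is moved to the end of the query string.
    """
    parts = urlsplit(url)
    query = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key != param
    ]
    query.append((param, str(page)))
    return urlunsplit(parts._replace(query=urlencode(query)))


if __name__ == "__main__":
    base = "https://example.com/api/items?page=1&limit=20"
    for page in range(1, 4):
        print(with_page(base, page))
```

Requesting these URLs directly, and stopping once a response comes back empty, avoids having to simulate scrolling in a browser.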
-
Filipp Maglocunos replied to the discussion "What’s the best approach to scraping PDF documents online?" in the forum General Web Scraping 10 months ago
What’s the best approach to scraping PDF documents online?
Cloud-based OCR services, such as the Google Cloud Vision API, handle complex PDFs more effectively, though they are paid services.
-
Filipp Maglocunos started the discussion "How can I track price changes on Mercari Japan using Ruby?" in the forum General Web Scraping 10 months ago
How can I track price changes on Mercari Japan using Ruby?
I use the Nokogiri gem in Ruby for scraping Mercari Japan’s product listings, focusing on price data and seller information.
-
Filipp Maglocunos became a registered member 10 months ago