• Blog
  • All Scraping Projects
  • Forums
  • Rayobyte University
  • Sign In
  • Sign Up
  • Products
  • News Feed
  • Members
  • Forums
  • Rayobyte University
    Sign in Sign up
    • Blog
    • All Scraping Projects
    • Forums
    • Rayobyte University
    • Sign In
    • Sign Up
    cover image
    Profile photo of Saori Mariana
    @SaoriMariana

    Saori Mariana

    Outside of my IT work, I find solace and joy in painting.
    India
    Joined Nov 2024

    Remove Connection

    Are you sure you want to remove from your connections?

    Cancel Confirm
    • About
    • Timeline
    • Blog
    • Scraping Projects
    • Connections
    • Discussions
    • Profile photo of Saori Mariana

      Saori Mariana replied to the discussion How can I scrape JavaScript-based content without headless browsers? in the forum General Web Scraping 9 months ago

      9 months ago

      Reply to How can I scrape JavaScript-based content without headless browsers?

      I identify preloaded JSON data in HTML sources, which sometimes includes all necessary data without JavaScript.

    • Profile photo of Saori Mariana

      Saori Mariana replied to the discussion What’s the best way to scrape map-based data from websites? in the forum General Web Scraping 9 months ago

      9 months ago

      Reply to What’s the best way to scrape map-based data from websites?

      I sometimes screenshot map data and run OCR to extract names and locations, though it’s not as accurate as JSON-based scraping.

    • Profile photo of Saori Mariana

      Saori Mariana replied to the discussion How can I detect and manage duplicate data in my scraped results? in the forum General Web Scraping 9 months ago

      9 months ago

      Reply to How can I detect and manage duplicate data in my scraped results?

      For more complex data, I create custom matching algorithms to compare similar fields and flag duplicates with slight variations.

    • Profile photo of Saori Mariana

      Saori Mariana replied to the discussion How do I handle scraping for real-time data that updates frequently? in the forum General Web Scraping 9 months ago

      9 months ago

      Reply to How do I handle scraping for real-time data that updates frequently?

      Using a message queue like RabbitMQ or Kafka helps organize and process real-time data efficiently without overloading resources.

    • Profile photo of Saori Mariana

      Saori Mariana started the discussion What’s the best way to scrape product pages on Decathlon with Ruby? in the forum General Web Scraping 9 months ago

      9 months ago

      What’s the best way to scrape product pages on Decathlon with Ruby?

      Use the Nokogiri gem in Ruby for HTML parsing, which is effective for scraping Decathlon’s static pages with consistent HTML structure.

    • Profile photo of Saori Mariana

      Saori Mariana changed their photo 9 months ago

      9 months ago

      0 Comments
    • Profile photo of Saori Mariana

      Saori Mariana became a registered member 9 months ago

      9 months ago

      0 Comments

      Profile photo of

      Latest updates

      Profile photo of jennifer james

      jennifer james posted an update 5 months ago

      Profile photo of dali

      dali posted an update 8 months ago

      Profile photo of yew

      yew posted an update 8 months ago

      © 2025 - Rayobyte Community

      Report

      There was a problem reporting this post.

      Harassment or bullying behavior
      Contains mature or sensitive content
      Contains misleading or false information
      Contains abusive or derogatory content
      Contains spam, fake content or potential malware

      Block Member?

      Please confirm you want to block this member.

      You will no longer be able to:

      • See blocked member's posts
      • Mention this member in posts
      • Message this member
      • Add this member as a connection

      Please note: This action will also remove this member from your connections and send a report to the site admin. Please allow a few minutes for this process to complete.

      Report

      You have already reported this .