-
Aretha Melech started the discussion Compare Python and Ruby for scraping product reviews on Tiki Vietnam in the forum General Web Scraping a year ago
Compare Python and Ruby for scraping product reviews on Tiki Vietnam
How does scraping product reviews from Tiki, one of Vietnam’s largest e-commerce platforms, differ between Python and Ruby? Would Python’s BeautifulSoup library be more efficient for parsing static HTML, or does Ruby’s Nokogiri offer a simpler and more elegant solution? How do both languages handle dynamic content, such as paginated reviews…
-
Aretha Melech changed their photo a year ago
-
Aretha Melech became a registered member a year ago
-
Laleh Korina replied to the discussion Extract discounts, product reviews, seller details from The Entertainer UK -PHP in the forum General Web Scraping a year ago
Extract discounts, product reviews, seller details from The Entertainer UK -PHP
To improve security, the script could include input sanitization and validate the URLs being scraped. This would prevent potential vulnerabilities if user input is passed directly to the script, ensuring that only valid, whitelisted domains are processed. Additionally, adding SSL verification for requests would enhance security when accessing HTTPS sites.
-
Laleh Korina replied to the discussion Scrape bestsellers, delivery charges, and star ratings from WHSmith UK using Go in the forum General Web Scraping a year ago
Scrape bestsellers, delivery charges, and star ratings from WHSmith UK using Go
Adding functionality to scrape additional categories or pages dynamically would make the script more versatile. This can be achieved by extracting links to other sections and visiting them recursively.
-
Laleh Korina started the discussion Compare PHP and Node.js for scraping hotel details on Booking.com UAE in the forum General Web Scraping a year ago
Compare PHP and Node.js for scraping hotel details on Booking.com UAE
How would scraping hotel details from Booking.com UAE differ between PHP and Node.js? Is PHP’s cURL and DOMDocument better for parsing static content, or does Node.js with Puppeteer handle dynamic, JavaScript-rendered content more effectively? What happens when dealing with large-scale scraping tasks that require concurrency or interacting with…
-
Laleh Korina changed their photo a year ago
-
Laleh Korina became a registered member a year ago
-
Scilla Phoebe replied to the discussion Scrape flash sale details, customer reviews, return policies Debenhams on Python in the forum General Web Scraping a year ago
Scrape flash sale details, customer reviews, return policies Debenhams on Python
The script could be improved by adding a pagination handler to scrape customer reviews across multiple pages. This would ensure a comprehensive collection of reviews rather than limiting to the first page.
-
Scilla Phoebe replied to the discussion Scrape special offers, user rating, product info from Marks & Spencer UK on Ruby in the forum General Web Scraping a year ago
Scrape special offers, user rating, product info from Marks & Spencer UK on Ruby
The script could include better error handling to account for network errors or missing elements. For instance, wrapping scraping methods in begin-rescue blocks can prevent the script from crashing due to unexpected issues.
- Load More