-
Emilia Maachah replied to the discussion Scrape product reviews from Argos UK using Ruby in the forum General Web Scraping a year ago
Scrape product reviews from Argos UK using Ruby
To improve security, the script could validate the input URL to ensure that only trusted domains are processed. This would prevent potential vulnerabilities if the script is modified to accept user inputs for the target URL.
-
Emilia Maachah replied to the discussion Scrape flash sale details, customer reviews, return policies Debenhams on Python in the forum General Web Scraping a year ago
Scrape flash sale details, customer reviews, return policies Debenhams on Python
Saving the scraped data into a database or exporting it as a CSV file would improve data management. This approach would allow for easier querying and sharing of the collected information.
-
Emilia Maachah started the discussion Scraping book titles and authors from an online bookstore using Java in the forum General Web Scraping a year ago
Scraping book titles and authors from an online bookstore using Java
Scraping book titles and authors from an online bookstore can be achieved efficiently using Java. Java’s Jsoup library is a powerful tool for parsing HTML and extracting specific data from static web pages. For dynamic websites that rely on JavaScript, integrating Java with Selenium WebDriver is necessary to interact with and render the content.…
-
Emilia Maachah changed their photo a year ago
-
Emilia Maachah became a registered member a year ago
-
Fanni Marija replied to the discussion How does web scraping work using Python and BeautifulSoup? in the forum General Web Scraping a year ago
How does web scraping work using Python and BeautifulSoup?
One challenge I’ve faced is when websites dynamically load content using JavaScript. BeautifulSoup can’t handle that, so I had to switch to Selenium or Playwright to scrape the full page.
-
Fanni Marija replied to the discussion Which is better: Go or Node.js for scraping hotel prices from Agoda? in the forum General Web Scraping a year ago
Which is better: Go or Node.js for scraping hotel prices from Agoda?
Go’s Colly library is incredibly fast and efficient for scraping static HTML content. However, it may not handle JavaScript-rendered content as effectively as Node.js with Puppeteer.
-
Fanni Marija started the discussion Extracting property images and prices with PHP and DOMDocument in the forum General Web Scraping a year ago
Extracting property images and prices with PHP and DOMDocument
Scraping property images and prices from real estate websites is a common use case for data aggregation or market analysis. PHP’s DOMDocument and DOMXPath libraries provide robust methods for extracting structured data. For static pages, you can parse HTML using these libraries and extract elements like image URLs and prices. If the site uses…
-
Fanni Marija changed their photo a year ago
-
Fanni Marija became a registered member a year ago
- Load More