-
Ariah Alfred started the discussion Use Node.js to scrape seller ratings from JD.com product pages in the forum General Web Scraping a year ago
Use Node.js to scrape seller ratings from JD.com product pages
Scraping seller ratings from JD.com, one of the largest e-commerce websites in China, involves handling dynamic content rendered by JavaScript. Node.js, along with Puppeteer, is an excellent choice for this task as it provides the ability to interact with dynamically loaded pages and extract content effectively. Seller ratings on JD.com…
-
Ariah Alfred changed their photo a year ago
-
Ariah Alfred became a registered member a year ago
-
Ahmose Tetty replied to the discussion Extract top deals, shipping costs, and ratings from John Lewis UK using Go in the forum General Web Scraping a year ago
Extract top deals, shipping costs, and ratings from John Lewis UK using Go
One improvement to this script is handling edge cases where certain elements might be missing, such as a product without a rating or shipping cost. Adding conditional checks would ensure the script does not break when such elements are absent.
-
Ahmose Tetty replied to the discussion Scrape product reviews, pricing, and categories from Currys UK with Python in the forum General Web Scraping a year ago
Scrape product reviews, pricing, and categories from Currys UK with Python
The script could be improved by implementing error handling for cases where the desired HTML element does not exist on the page. For instance, if the product has no reviews, an exception might be raised when accessing reviews_section. Adding conditional checks or try-except blocks would make the code more robust.
-
Ahmose Tetty started the discussion Scrape product titles from Currys UK using Python in the forum General Web Scraping a year ago
Scrape product titles from Currys UK using Python
Scraping product titles from Currys UK involves using Python with the BeautifulSoup library for efficient HTML parsing. Product titles are generally located within specific tags, such as h1 or span, often accompanied by class attributes that help differentiate them from other elements. The first step is to inspect the HTML structure of the page…
-
Ahmose Tetty changed their photo a year ago
-
Ahmose Tetty became a registered member a year ago
-
Shyamala Laura replied to the discussion Scrape product specifications, images, shipping details -Amazon Brazil -Python in the forum General Web Scraping a year ago
Scrape product specifications, images, shipping details -Amazon Brazil -Python
To scrape product images, identify the img tags containing image URLs, usually part of a gallery. Use Selenium to extract the src attribute for all available images on the page.
from selenium import webdriver from selenium.webdriver.common.by import By
driver = webdriver.Chrome()
driver.get('https://www.amazon.com.br/dp/product-page')
# Scrape…
-
Shyamala Laura replied to the discussion Scrape product name, price, stock availability from Argos UK using Go and Colly? in the forum General Web Scraping a year ago
Scrape product name, price, stock availability from Argos UK using Go and Colly?
To scrape the price, use Colly to locate the price element, usually within a div or span tag. You can use the OnHTML method to fetch the element and extract its text.
package main import (
"fmt"
"log"
"github.com/gocolly/colly"
)
func main() {
// Create a new collector
c := colly.NewCollector()
// Extract product…
- Load More