News Feed Forums General Web Scraping How to scrape home product prices from Otto.de using JavaScript?

  • How to scrape home product prices from Otto.de using JavaScript?

    Posted by Luka Jaakob on 12/21/2024 at 7:22 am

    Scraping home product prices from Otto.de using JavaScript allows you to gather data about furniture, home decor, and appliances. Otto is a popular German e-commerce site, making it a valuable source for analyzing pricing trends and product availability. Using Node.js with Puppeteer, you can automate browser interactions to handle dynamic content and extract relevant details such as product names, prices, and categories. The process begins with inspecting the website structure to identify the HTML elements that contain the required data.
    Pagination is a critical feature for accessing all product pages, ensuring that the scraper captures the full dataset. Random delays between requests reduce detection risks and help mimic human browsing behavior. Once collected, the data can be stored in structured formats for analysis. Below is an example script for scraping Otto.de using JavaScript.

    const puppeteer = require('puppeteer');
    (async () => {
        const browser = await puppeteer.launch({ headless: true });
        const page = await browser.newPage();
        const url = 'https://www.otto.de/';
        await page.goto(url, { waitUntil: 'networkidle2' });
        const products = await page.evaluate(() => {
            const productList = [];
            const items = document.querySelectorAll('.product-card');
            items.forEach(item => {
                const name = item.querySelector('.product-name')?.textContent.trim() || 'Name not available';
                const price = item.querySelector('.product-price')?.textContent.trim() || 'Price not available';
                productList.push({ name, price });
            });
            return productList;
        });
        console.log(products);
        await browser.close();
    })();
    

    This script collects product names and prices from Otto.de’s product sections. Pagination handling allows the scraper to navigate through multiple pages and collect all available products. Random delays between requests help avoid detection, ensuring smooth operation.

    Luka Jaakob replied 1 day, 5 hours ago 1 Member · 0 Replies
  • 0 Replies

Sorry, there were no replies found.

Log in to reply.