
  • Use Go to scrape product categories from Media Markt Poland

    Posted by Jerilyn Shankar on 12/13/2024 at 10:19 am

    Media Markt is a major electronics and appliance retailer in Poland. Scraping its product categories means visiting the homepage or individual category pages and extracting the hierarchy of product groups. Categories usually appear in a menu or sidebar as links that lead to subcategories or product listings. With Go and the Colly library, a scraper can target those elements directly.
    Start by inspecting the site's HTML with browser developer tools to find the tags and attributes that hold the categories. The Colly script then loads the page, matches the category elements, and extracts each one's text and URL for further navigation. Below is a Go implementation for scraping product categories from Media Markt Poland; the `.category-menu-item` selector it uses should be checked against the live markup and adjusted if the class name differs:

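    package main

    import (
    	"fmt"
    	"log"
    	"strings"

    	"github.com/gocolly/colly"
    )

    func main() {
    	// Create a new Colly collector
    	c := colly.NewCollector()

    	// Extract category names and links. The selector is a placeholder:
    	// confirm the real class name in the browser's developer tools, and
    	// make sure it matches <a> elements so that href is present.
    	c.OnHTML(".category-menu-item", func(e *colly.HTMLElement) {
    		// Trim the whitespace that surrounds menu text in the markup
    		categoryName := strings.TrimSpace(e.Text)
    		// Resolve relative hrefs against the URL of the page being scraped
    		categoryURL := e.Request.AbsoluteURL(e.Attr("href"))
    		fmt.Printf("Category: %s\nLink: %s\n", categoryName, categoryURL)
    	})

    	// Log errors that occur during scraping
    	c.OnError(func(_ *colly.Response, err error) {
    		log.Printf("Error: %v", err)
    	})

    	// Visit the Media Markt Poland homepage
    	err := c.Visit("https://mediamarkt.pl/")
    	if err != nil {
    		log.Fatalf("Failed to visit website: %v", err)
    	}
    }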
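
Navigation menus may list the same category more than once or mix relative and absolute links, so it can help to normalize the extracted URLs before following them. The following is a minimal sketch using only the standard library. The `Category` type, the `resolveAndDedupe` helper, and the sample paths are illustrative assumptions, not part of the script above or of the real site structure.

```go
package main

import (
	"fmt"
	"net/url"
)

// Category is a hypothetical record type for one scraped menu entry.
type Category struct {
	Name string
	URL  string
}

// resolveAndDedupe resolves each category URL against base and drops
// entries whose resolved URL was already seen, keeping first-seen order.
func resolveAndDedupe(base string, cats []Category) ([]Category, error) {
	baseURL, err := url.Parse(base)
	if err != nil {
		return nil, fmt.Errorf("parse base URL: %w", err)
	}
	seen := make(map[string]bool)
	var out []Category
	for _, c := range cats {
		ref, err := url.Parse(c.URL)
		if err != nil {
			continue // skip malformed hrefs rather than abort the whole run
		}
		abs := baseURL.ResolveReference(ref)
		abs.Fragment = "" // "#section" variants point at the same page
		key := abs.String()
		if seen[key] {
			continue
		}
		seen[key] = true
		out = append(out, Category{Name: c.Name, URL: key})
	}
	return out, nil
}

func main() {
	// Made-up sample data; these paths are not taken from the real site.
	cats := []Category{
		{Name: "Laptopy", URL: "/example/laptopy"},
		{Name: "Laptopy", URL: "https://mediamarkt.pl/example/laptopy#top"},
		{Name: "Telefony", URL: "/example/telefony"},
	}
	unique, err := resolveAndDedupe("https://mediamarkt.pl/", cats)
	if err != nil {
		fmt.Println("error:", err)
		return
	}
	for _, c := range unique {
		fmt.Printf("%s -> %s\n", c.Name, c.URL)
	}
}
```

In a Colly script, the `OnHTML` callback could append to a slice of `Category` values instead of printing, and the slice could be passed through a helper like this once the visit completes.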
    
  • Roi Garrett

    Member
    12/17/2024 at 11:47 am

    Saving the scraped categories to a database, or to a JSON or CSV file, would make the data easier to analyze and integrate with other systems. This would be particularly useful for building a product classification system.
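
As a rough sketch of the file-based option, the categories could be encoded with Go's standard `encoding/json` and `encoding/csv` packages. The `Category` type, the `encodeJSON` and `encodeCSV` helpers, the output file names, and the sample rows are assumptions made for illustration; a real scraper would fill the slice from its `OnHTML` callback.

```go
package main

import (
	"bytes"
	"encoding/csv"
	"encoding/json"
	"log"
	"os"
)

// Category is a hypothetical record for one scraped category.
type Category struct {
	Name string `json:"name"`
	URL  string `json:"url"`
}

// encodeJSON renders the categories as an indented JSON array.
func encodeJSON(cats []Category) ([]byte, error) {
	return json.MarshalIndent(cats, "", "  ")
}

// encodeCSV renders a header row followed by one row per category.
func encodeCSV(cats []Category) ([]byte, error) {
	var buf bytes.Buffer
	w := csv.NewWriter(&buf)
	if err := w.Write([]string{"name", "url"}); err != nil {
		return nil, err
	}
	for _, c := range cats {
		if err := w.Write([]string{c.Name, c.URL}); err != nil {
			return nil, err
		}
	}
	w.Flush() // push buffered rows into buf before reading it
	if err := w.Error(); err != nil {
		return nil, err
	}
	return buf.Bytes(), nil
}

func main() {
	// Made-up sample rows; these URLs are not taken from the real site.
	cats := []Category{
		{Name: "Laptopy", URL: "https://mediamarkt.pl/example/laptopy"},
		{Name: "Telefony", URL: "https://mediamarkt.pl/example/telefony"},
	}

	jsonData, err := encodeJSON(cats)
	if err != nil {
		log.Fatalf("encode JSON: %v", err)
	}
	if err := os.WriteFile("categories.json", jsonData, 0o644); err != nil {
		log.Fatalf("write JSON file: %v", err)
	}

	csvData, err := encodeCSV(cats)
	if err != nil {
		log.Fatalf("encode CSV: %v", err)
	}
	if err := os.WriteFile("categories.csv", csvData, 0o644); err != nil {
		log.Fatalf("write CSV file: %v", err)
	}
}
```

JSON keeps the door open for nested subcategories later, while CSV is convenient for spreadsheets; writing both from the same slice keeps the two outputs in sync.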
