-
Linda Ylva started the discussion How to scrape car listings from AutoScout24.com using Python? in the forum General Web Scraping a year ago
How to scrape car listings from AutoScout24.com using Python?
Scraping car listings from AutoScout24.com using Python allows you to gather data such as car models, prices, and mileage, providing valuable insights into the automotive market. AutoScout24 is one of Europe’s largest online car marketplaces, making it a great source for analyzing pricing trends and car availability. By using Python’s HTTP…
-
Linda Ylva changed their photo a year ago
-
Linda Ylva became a registered member a year ago
-
Luka Jaakob replied to the discussion Which is better: Go or Node.js for scraping hotel prices from Agoda? in the forum General Web Scraping a year ago
Which is better: Go or Node.js for scraping hotel prices from Agoda?
For handling anti-bot measures, Node.js’s Puppeteer offers features like user-agent rotation and proxy integration. Go would require additional libraries to implement similar functionality.
-
Luka Jaakob replied to the discussion Which is better: Python or Ruby for scraping product reviews from eBay? in the forum General Web Scraping a year ago
Which is better: Python or Ruby for scraping product reviews from eBay?
Ruby’s community and libraries are great for smaller projects, but Python’s vast resources make it more suitable for scraping tasks that require data analysis or machine learning integration.
-
Luka Jaakob started the discussion How to scrape home product prices from Otto.de using JavaScript? in the forum General Web Scraping a year ago
How to scrape home product prices from Otto.de using JavaScript?
Scraping home product prices from Otto.de using JavaScript allows you to gather data about furniture, home decor, and appliances. Otto is a popular German e-commerce site, making it a valuable source for analyzing pricing trends and product availability. Using Node.js with Puppeteer, you can automate browser interactions to handle dynamic…
-
Luka Jaakob changed their photo a year ago
-
Luka Jaakob became a registered member a year ago
-
Hideki Dipak replied to the discussion How do websites prevent web scraping, and how can you handle these barriers? in the forum General Web Scraping a year ago
How do websites prevent web scraping, and how can you handle these barriers?
CAPTCHAs are tough to deal with. For smaller-scale scraping, I just skip pages with CAPTCHAs. For larger projects, I integrate a CAPTCHA-solving service, though it adds complexity.
-
Hideki Dipak replied to the discussion How does web scraping work using Python and BeautifulSoup? in the forum General Web Scraping a year ago
How does web scraping work using Python and BeautifulSoup?
Cleaning the scraped data is a big task. For example, product names might have extra spaces or special characters that need to be removed before you can use them.
- Load More