-
Deisy Swarna replied to the discussion Compare Ruby and Go to scrape shipping details from Yahoo! Taiwan in the forum General Web Scraping a year ago
Compare Ruby and Go to scrape shipping details from Yahoo! Taiwan
Go’s Colly library is faster and more efficient for large-scale scraping due to its concurrent request handling. If scalability is a concern, Go might be the better choice for scraping shipping details from multiple product pages.
-
Deisy Swarna started the discussion How to scrape restaurant menus and prices from online food delivery platforms? in the forum General Web Scraping a year ago
How to scrape restaurant menus and prices from online food delivery platforms?
Scraping restaurant menus and prices from online food delivery platforms requires handling structured but often dynamically loaded data. Menus are typically displayed in a hierarchical format, with categories like appetizers, mains, and desserts. Using BeautifulSoup, you can extract this data for static pages, while Puppeteer or Selenium is…
-
Deisy Swarna changed their photo a year ago
-
Deisy Swarna became a registered member a year ago
-
Nora Ramzan replied to the discussion Compare Python and Node.js to scrape product reviews from Momo Taiwan in the forum General Web Scraping a year ago
Compare Python and Node.js to scrape product reviews from Momo Taiwan
Node.js with Puppeteer is better suited for handling dynamic content since it can render JavaScript-heavy pages. It also allows for easier interaction with elements such as pop-ups or expandable sections, which are common on e-commerce sites like Momo.
-
Nora Ramzan replied to the discussion Use Node.js to scrape product titles from Books.com.tw in the forum General Web Scraping a year ago
Use Node.js to scrape product titles from Books.com.tw
If the product title element is dynamically loaded, Puppeteer is well-suited for the task. However, inspecting the network requests for API endpoints could reveal a direct way to fetch product data without rendering the entire page.
-
Nora Ramzan started the discussion How to extract sports team names and match schedules from a website? in the forum General Web Scraping a year ago
How to extract sports team names and match schedules from a website?
Scraping sports team names and match schedules can provide valuable data for analysis or personal use. Most sports websites structure this information in lists or tables, making it easy to locate with HTML inspection. For static pages, tools like BeautifulSoup are effective in extracting team names and match timings. Dynamic sites often…
-
Nora Ramzan changed their photo a year ago
-
Nora Ramzan became a registered member a year ago
-
Sunny Melanija replied to the discussion Use Python to scrape product availability from Ruten Taiwan in the forum General Web Scraping a year ago
Use Python to scrape product availability from Ruten Taiwan
Inspecting the network traffic in the browser’s developer tools could reveal API endpoints used by Ruten to fetch availability information. Querying these APIs directly might provide more reliable and efficient access to the data.
- Load More