-
Nekesa Wioletta replied to the discussion How to handle CAPTCHA challenges in web scraping projects? in the forum General Web Scraping 12 months ago
How to handle CAPTCHA challenges in web scraping projects?
I usually avoid sites with CAPTCHAs unless absolutely necessary. It’s easier to find alternative sources of data than to deal with the added complexity.
-
Nekesa Wioletta started the discussion How to scrape ticket details from SeatGeek.com using JavaScript? in the forum General Web Scraping 12 months ago
How to scrape ticket details from SeatGeek.com using JavaScript?
Scraping ticket details from SeatGeek.com using JavaScript can help you collect information like event names, ticket prices, and locations. Using Node.js with Puppeteer, you can automate browser interactions to handle dynamic content and extract the required data. Below is a sample script for scraping ticket information from…
-
Hirune Islam replied to the discussion Extracting property images and prices with PHP and DOMDocument in the forum General Web Scraping 12 months ago
Extracting property images and prices with PHP and DOMDocument
I use PHP’s filter_var function to validate and sanitize image URLs, so malformed URLs are caught before I try to download the images later.
-
Hirune Islam replied to the discussion Scraping book titles and authors from an online bookstore using Java in the forum General Web Scraping 12 months ago
Scraping book titles and authors from an online bookstore using Java
When dealing with dynamic content, I use Selenium WebDriver with Java and wait until the elements I need have loaded before scraping. It’s slower than Jsoup, but it handles JavaScript-rendered content, which Jsoup does not execute.
-
Hirune Islam started the discussion What data can I scrape from StockX.com sneaker listings using Ruby? in the forum General Web Scraping 12 months ago
What data can I scrape from StockX.com sneaker listings using Ruby?
Scraping sneaker listings from StockX.com using Ruby allows you to extract details such as sneaker names, prices, and popularity. With Ruby’s open-uri library for HTTP requests and Nokogiri for parsing HTML, you can fetch a listing page and extract these fields from its markup. Below is an example script for scraping sneaker information from StockX.
require 'open-uri'
require…
-
Umeda Domenica replied to the discussion How to scrape rental property data from Trulia.com using Ruby? in the forum General Web Scraping 12 months ago
How to scrape rental property data from Trulia.com using Ruby?
One major enhancement to the scraper would be to add pagination handling for gathering data across multiple pages. Trulia organizes property listings over several pages, and scraping only the first page limits the completeness of the data. By programmatically following the “Next” button and looping through all available pages, the scraper can…
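For anyone who wants to try the pagination idea, here is a minimal sketch of the loop in Ruby, using only the standard library. Everything named here is an assumption made for illustration: the Page struct, next_page_url, crawl_pages, and above all the rel="next" link pattern, since the thread does not show Trulia's actual markup. A real scraper would more likely locate the link with Nokogiri and a selector taken from the live page.

```ruby
require 'uri'

# One fetched page: the URL it came from and its raw HTML.
Page = Struct.new(:url, :html)

# ASSUMPTION: the "Next" control is an <a> tag carrying rel="next".
# Adjust both patterns after inspecting the real listing page.
NEXT_LINK = /<a\b[^>]*\brel=["']next["'][^>]*>/i
HREF_ATTR = /\bhref=["']([^"']+)["']/i

# Returns the absolute URL of the next page, or nil when no next link is found.
def next_page_url(html, current_url)
  tag = html[NEXT_LINK]
  return nil unless tag

  href = tag[HREF_ATTR, 1]
  return nil unless href

  # Resolve relative links such as "/rentals?page=2" against the current URL.
  URI.join(current_url, href).to_s
end

# Follows next links from start_url and returns the pages visited, in order.
# `fetch` is any callable that takes a URL and returns that page's HTML.
# Stops when there is no next link, when a URL repeats, or at max_pages.
def crawl_pages(start_url, fetch:, max_pages: 50, delay: 0)
  pages = []
  seen = {}
  url = start_url

  while url && !seen[url] && pages.size < max_pages
    seen[url] = true
    page = Page.new(url, fetch.call(url))
    pages << page
    yield page if block_given?

    url = next_page_url(page.html, url)
    # Optional pause between requests to avoid hammering the site.
    sleep(delay) if url && delay.positive?
  end

  pages
end
```

With open-uri loaded, the fetcher could be something like ->(url) { URI.open(url).read }, and the existing per-page parsing would go inside the block passed to crawl_pages. Passing the fetcher in as an argument also makes it possible to test the loop against canned HTML without touching the network.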