-
Ikaika Kapono became a registered member a year ago
-
Chico Cleisthenes replied to the discussion How do I extract data from a PDF using web scraping tools? in the forum General Web Scraping a year ago
How do I extract data from a PDF using web scraping tools?
Selenium can download the PDF, and then you can extract content using libraries like PyMuPDF.
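A minimal sketch of the extraction step, assuming the PDF has already been saved to disk (by Selenium or any other downloader) and that PyMuPDF is installed (`pip install pymupdf`). The helper names `is_pdf` and `extract_pdf_text` are made up for illustration. The header check is there because a failed download is often an HTML error page saved under a `.pdf` name.

```python
def is_pdf(data: bytes) -> bool:
    """Return True if the bytes start with the PDF magic header."""
    return data.startswith(b"%PDF-")


def extract_pdf_text(pdf_path: str) -> str:
    """Return the text of every page in the PDF, joined by newlines."""
    # Check the first bytes before handing the file to the PDF library.
    with open(pdf_path, "rb") as f:
        header = f.read(5)
    if not is_pdf(header):
        raise ValueError(f"{pdf_path} does not look like a PDF")

    import fitz  # PyMuPDF; imported here so is_pdf works without it

    with fitz.open(pdf_path) as doc:
        return "\n".join(page.get_text() for page in doc)
```

Calling `extract_pdf_text("report.pdf")` (a placeholder file name) would return the document's plain text. Scanned PDFs that contain only images would likely come back empty and need OCR instead.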
-
Chico Cleisthenes started the discussion Best techniques to rotate proxies when scraping websites. in the forum General Web Scraping a year ago
Best techniques to rotate proxies when scraping websites.
Use proxy pools or paid proxy services that provide rotating IPs automatically.
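For a self-managed pool, rotation could look roughly like the sketch below. It uses only the standard library; the `ProxyPool` class, the `fetch` helper, and the proxy addresses in the usage note are illustrative assumptions, not part of any particular proxy service.

```python
import itertools
import urllib.request


class ProxyPool:
    """Round-robin pool that skips proxies marked as bad."""

    def __init__(self, proxies):
        self._proxies = list(proxies)
        self._bad = set()
        self._cycle = itertools.cycle(self._proxies)

    def next_proxy(self):
        # Look at each proxy at most once per call so an all-bad pool fails fast.
        for _ in range(len(self._proxies)):
            proxy = next(self._cycle)
            if proxy not in self._bad:
                return proxy
        raise RuntimeError("no working proxies left in the pool")

    def mark_bad(self, proxy):
        self._bad.add(proxy)


def fetch(url, pool, timeout=10):
    """Fetch url through the next proxy; retire the proxy if the request fails."""
    proxy = pool.next_proxy()
    opener = urllib.request.build_opener(
        urllib.request.ProxyHandler({"http": proxy, "https": proxy})
    )
    try:
        with opener.open(url, timeout=timeout) as resp:
            return resp.read()
    except OSError:  # URLError and socket timeouts are OSError subclasses
        pool.mark_bad(proxy)
        raise
```

Usage would be along the lines of `pool = ProxyPool(["http://proxy1.example:8080", "http://proxy2.example:8080"])` followed by `fetch(url, pool)` inside a retry loop. A paid rotating service usually replaces all of this with a single gateway address.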
-
lokesh28j became a registered member a year ago
-
Chico Cleisthenes changed their photo a year ago
-
Chico Cleisthenes became a registered member a year ago
-
Alby Adalberto replied to the discussion How to scrape data from a website protected by Cloudflare? in the forum General Web Scraping a year ago
How to scrape data from a website protected by Cloudflare?
If the website’s protection is advanced, you may need to use Playwright or Puppeteer with a headless browser to mimic real user behavior.
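A rough sketch of the Playwright route, assuming Playwright for Python and its Chromium build are installed (`pip install playwright`, then `playwright install chromium`). The function names, the pause range, and the scroll distance are arbitrary illustrative choices, and nothing here guarantees that a given protection will let the request through.

```python
import random


def human_pause(min_s=0.5, max_s=1.5):
    """Pick a random delay in seconds to place between two actions."""
    return random.uniform(min_s, max_s)


def fetch_rendered_html(url):
    """Load url in headless Chromium and return the rendered HTML."""
    from playwright.sync_api import sync_playwright  # third-party dependency

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            page = browser.new_page()
            page.goto(url, wait_until="domcontentloaded")
            # Pause, scroll a little, and pause again before reading the page.
            page.wait_for_timeout(human_pause() * 1000)
            page.mouse.wheel(0, 600)
            page.wait_for_timeout(human_pause() * 1000)
            return page.content()
        finally:
            browser.close()
```

The randomized pauses and the scroll are only one way to approximate a real visitor. Sites with stricter checks may still block headless sessions.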
-
Alby Adalberto started the discussion What are the limitations of web scraping with headless browsers? in the forum General Web Scraping a year ago
What are the limitations of web scraping with headless browsers?
Headless browsers driven by tools like Selenium and Puppeteer consume more memory and are slower than simple HTTP requests.
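One way to limit that cost is to try a plain HTTP request first and start a browser only when the response looks like an empty JavaScript shell. The heuristic below is an assumption for illustration, not an established rule: it counts visible text outside `<script>` and `<style>` tags and compares it with an arbitrary threshold (`min_text_chars`).

```python
from html.parser import HTMLParser


class _TextCounter(HTMLParser):
    """Count visible text characters, ignoring script and style bodies."""

    def __init__(self):
        super().__init__()
        self.chars = 0
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth:
            self.chars += len(data.strip())


def needs_browser(html, min_text_chars=200):
    """Guess whether a page needs JavaScript rendering to show its content."""
    counter = _TextCounter()
    counter.feed(html)
    counter.close()
    return counter.chars < min_text_chars
```

If `needs_browser(html)` returns False for the body of a plain HTTP response, the page can be parsed directly and the browser is skipped for that URL.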
-
Alby Adalberto changed their photo a year ago
-
Alby Adalberto became a registered member a year ago