  • How do I scrape data from sites using custom fonts or icons?

    Posted by Tapiwa Evgeni on 11/16/2024 at 5:53 am

    Tools like Tesseract OCR are useful when the custom fonts are rendered as images. Tesseract reads the text inside an image and converts it back to plain text.
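
    A minimal sketch of the OCR route, assuming the pytesseract and Pillow packages and a local Tesseract install. The function names are hypothetical, and the 2x upscale and single-line page mode are guesses that usually need tuning per site.

```python
import re


def ocr_image(path: str, lang: str = "eng") -> str:
    """Run Tesseract on an image file and return the raw recognized text."""
    # Imported lazily so the helper below works without these packages.
    from PIL import Image
    import pytesseract

    # Grayscale plus upscaling often helps with small rendered glyphs.
    img = Image.open(path).convert("L")
    img = img.resize((img.width * 2, img.height * 2))
    # --psm 7 asks Tesseract to treat the image as a single line of text.
    return pytesseract.image_to_string(img, lang=lang, config="--psm 7")


def clean_ocr_text(raw: str) -> str:
    """Collapse whitespace, including the form feed Tesseract may append."""
    return re.sub(r"\s+", " ", raw.replace("\x0c", " ")).strip()
```

    Typical use would be clean_ocr_text(ocr_image("price.png")), where price.png is a screenshot or downloaded image of the element.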

  • 7 Replies
  • Lana Sneferu

    Member
    11/18/2024 at 5:35 am

    Checking the CSS or JavaScript files often reveals the Unicode mappings for custom icons. I translate these codes manually within my script.
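
    Pulling those mappings out of a stylesheet might look like this. It is only a sketch: the regex assumes the common icon-font pattern of a class with a :before rule whose content is a single hex escape, and the sample CSS is made up.

```python
import re

# Matches rules such as: .icon-phone:before { content: "\f095"; }
ICON_RULE = re.compile(
    r"\.([\w-]+)::?before\s*\{[^}]*?content\s*:\s*[\"']\\([0-9a-fA-F]{1,6})[\"']"
)


def icon_map_from_css(css: str) -> dict[str, str]:
    """Return {rendered character: class name} for each icon rule found."""
    mapping = {}
    for class_name, hex_code in ICON_RULE.findall(css):
        mapping[chr(int(hex_code, 16))] = class_name
    return mapping


# Made-up stylesheet fragment for illustration.
SAMPLE_CSS = r"""
.icon-phone:before { content: "\f095"; }
.icon-mail::before { content: '\f0e0'; }
"""

ICONS = icon_map_from_css(SAMPLE_CSS)
```

    With that dictionary, any private-use character found in the scraped text can be looked up to see which icon it stands for.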

  • Suhaila Kiyoshi

    Member
    11/18/2024 at 5:46 am

    Icons from Font Awesome and similar libraries can often be identified directly from their CSS class names, with no image recognition needed.
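
    A small standard-library sketch of reading the classes directly. The "fa-" prefix and the choice of i and span tags are assumptions based on typical Font Awesome markup, so adjust both for other libraries.

```python
from html.parser import HTMLParser


class IconClassParser(HTMLParser):
    """Collect icon class names (e.g. fa-phone) from <i> and <span> tags."""

    def __init__(self, prefix: str = "fa-"):
        super().__init__()
        self.prefix = prefix
        self.icons: list[str] = []

    def handle_starttag(self, tag, attrs):
        if tag not in ("i", "span"):
            return
        # A class attribute without a value is reported as None.
        classes = (dict(attrs).get("class") or "").split()
        self.icons.extend(c for c in classes if c.startswith(self.prefix))


def extract_icons(html: str, prefix: str = "fa-") -> list[str]:
    """Return icon class names in document order."""
    parser = IconClassParser(prefix)
    parser.feed(html)
    return parser.icons
```

    Modifier classes that share the prefix (size or animation helpers, for example) are returned too, so they may need filtering out afterwards.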

  • Keith Marwin

    Member
    11/18/2024 at 5:55 am

    I sometimes find that simply copying and pasting into a text editor can reveal hidden font or icon text that doesn’t render normally in browsers.

  • Goutam Victor

    Member
    11/18/2024 at 7:04 am

    Selenium is useful when fonts are loaded dynamically, since it lets me capture the content after the page has finished rendering.

  • Rhouth Vilma

    Member
    11/18/2024 at 7:12 am

    Extracting the text with Playwright and then checking for any special CSS or JavaScript manipulations often reveals additional data.
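
    One CSS manipulation worth checking is text injected through a ::before pseudo-element, which never shows up in the element's own text. A rough sketch, assuming Playwright for Python with Chromium installed; the function names are hypothetical.

```python
def unquote_css_string(value: str) -> str:
    """Strip the quotes that getComputedStyle puts around a content string."""
    if len(value) >= 2 and value[0] == value[-1] and value[0] in "\"'":
        return value[1:-1]
    return value


def pseudo_content(url: str, selector: str) -> list[str]:
    """Return the computed ::before content of every element matching selector."""
    # Imported lazily; needs `pip install playwright` and `playwright install`.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        page.goto(url, wait_until="networkidle")
        # Wait until web fonts have finished loading before reading styles.
        page.evaluate("() => document.fonts.ready.then(() => true)")
        values = page.eval_on_selector_all(
            selector,
            "els => els.map(e => getComputedStyle(e, '::before').content)",
        )
        browser.close()
    return [unquote_css_string(v) for v in values]
```

    Elements without generated content typically report the keyword none, which passes through unchanged.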

  • Baltassar Igor

    Member
    11/18/2024 at 7:23 am

    A custom-built dictionary for each icon set or font helps translate the glyphs into readable text, especially on sites that use their own unique icons.
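
    For illustration, such a dictionary can be applied with str.translate. The code points and their meanings below are invented; a real table has to be built per site and per font.

```python
# Hypothetical mapping from private-use code points to what they draw.
GLYPH_TO_CHAR = {
    "\ue001": "0",
    "\ue002": "1",
    "\ue003": "5",
    "\uf095": "[phone]",
}

# str.maketrans accepts single-character keys and arbitrary string values.
TRANSLATION = str.maketrans(GLYPH_TO_CHAR)


def decode(text: str) -> str:
    """Replace every mapped glyph; unmapped characters pass through unchanged."""
    return text.translate(TRANSLATION)
```

    Keeping one table per font file matters because some sites reshuffle the mapping between fonts.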

  • Abioye Blaga

    Member
    11/18/2024 at 8:34 am

    Browser dev tools often have font viewers that show what each custom character represents, making it easier to match characters.
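
    The same lookup can be scripted once the font file is downloaded. This sketch assumes the fontTools package, and it only helps when the font keeps descriptive glyph names; the name table here is a hypothetical subset.

```python
def font_cmap(path: str) -> dict[str, str]:
    """Return {character: glyph name} from a font file's best Unicode cmap."""
    # Imported lazily; needs `pip install fonttools` (plus brotli for .woff2).
    from fontTools.ttLib import TTFont

    font = TTFont(path)
    return {chr(cp): name for cp, name in font.getBestCmap().items()}


# Hypothetical subset of descriptive glyph names; fonts may use opaque names.
NAME_TO_CHAR = {"zero": "0", "one": "1", "two": "2", "five": "5", "period": "."}


def mapping_from_glyph_names(cmap: dict[str, str]) -> dict[str, str]:
    """Keep only entries whose glyph name is a recognizable character name."""
    return {
        ch: NAME_TO_CHAR[name] for ch, name in cmap.items() if name in NAME_TO_CHAR
    }
```

    When the glyph names are opaque, comparing glyph outlines or falling back to OCR is usually the next step.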
