
I don't get it, I can just curl the svg and grep for mailto?


Yes, but these scraper bots aren't that sophisticated.


Crawling every link, now including SVGs, and grepping for 'mailto:' doesn't sound super sophisticated?

    wget --recursive --quiet "$BASE_URL" && grep -roh 'mailto:\([^"]*\)' .
works on the example and just prints the email


I think the idea is that email scraper bots typically don't bother downloading images referenced by <img> tags.
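
To make that concrete, here is a rough sketch of the difference, not a claim about how any real bot works. The in-memory `SITE` dict, the file names, and the address `someone@example.com` are all made up for illustration; the only thing taken from the thread is that the SVG contains a `mailto:` link and the page pulls it in through an `<img>` tag.

```python
import re
from html.parser import HTMLParser

# Hypothetical in-memory "site" (assumption): the address appears only
# inside an SVG that the page references through an <img> tag.
SITE = {
    "/index.html": '<html><body><p>Contact me:</p>'
                   '<img src="/contact.svg" alt="email"></body></html>',
    "/contact.svg": '<svg xmlns="http://www.w3.org/2000/svg">'
                    '<a href="mailto:someone@example.com"><text>mail me</text></a></svg>',
}

# Capture everything after "mailto:" up to a quote, '>' or whitespace.
MAILTO = re.compile(r'mailto:([^"\'>\s]+)')


def naive_scrape(path):
    """Regex over the raw HTML only; referenced images are never fetched."""
    return MAILTO.findall(SITE[path])


class ImgCollector(HTMLParser):
    """Collects the src attribute of every <img> tag."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                self.sources.append(src)


def image_following_scrape(path):
    """Also 'downloads' each <img> source and searches it for mailto: links."""
    html = SITE[path]
    found = MAILTO.findall(html)
    parser = ImgCollector()
    parser.feed(html)
    for src in parser.sources:
        if src in SITE:  # stand-in for an HTTP fetch
            found += MAILTO.findall(SITE[src])
    return found


print(naive_scrape("/index.html"))            # []
print(image_following_scrape("/index.html"))  # ['someone@example.com']
```

The second scraper is only a few lines longer, but it has to parse tags and make an extra request per image, which is the step the SVG trick counts on bots skipping.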


But they will be as soon as this sees widespread use.


It won't be widespread IMHO, not when you share your email address with other parties that then lose or sell your details. Fastmail-style 'temporary' email addresses could help, however.


Querying DOM nodes is inherently more complicated than a regex on unparsed HTML.



