This is a case where I wish for a personal archiving tool to scoop up and index everything I read on the web (and everything I write, for that matter). I know there are projects out there to do that. I really should look into one. I always get hung up on perfect being the enemy of the good and don't do anything.
The Microsoft Recall product sounded vaguely interesting, though I'd want a local-only version.
I've also wondered if accessibility interfaces for screen readers could be used for bulk capture of on-screen data for scraping into this mythical index. That would make the index application-independent, since the text is being captured at a different "layer".
Absolutely. I tried to search-engine it when I posted and couldn't come up with it. Your search-engine skills are superior!