@brecht @floppy That sounds like a good line of thought; however, storing the contents is also needed for full-text search, and that's precisely what search engines have been doing for this whole millennium without much legal fuss.
The difficulty came up only recently with snippets, which are all about showing the contents to some party other than the one that did the scraping.
I think my setup for search and saving using #Recoll-we and #singlefileZ is legally safe.