Christoffer S. · @nopatience
1393 followers · 445 posts · Server swecyb.com

Stumbled upon Trafilatura just now. An amazingly efficient Python lib/tool to extract text from HTML-based pages.

Especially welcomed since Newspaper3k have been abandoned since almost 3 years ago.

#python #textextraction #trafilatura

Last updated 1 year ago

Bloggo e sto · @Blogsdaseguire
6 followers · 313 posts · Server mastodon.cloud

Per la fatta a casa con le macchine trafilatrici si deve usare la di duro o , che ha le proprietà giuste per resistere al calore della .
Solo con il grano duro ottieni una pasta fresca fatta in casa perfetta. mangiocongusto.it/perche-usare

#pastafresca #farina #grano #semola #trafilatura

Last updated 1 year ago

Marian :openbsd: :gentoo: · @marian_mizik
102 followers · 182 posts · Server fosstodon.org

I am my own implementation of client/server for years. I also have a simple client. Since the beginning I struggled with stable and functioning solution for scraping full article content, so it can be shown in console and/or used for offline reading. I started with , later used and now, after 5 years I finally found library that is working 100% for every feed I am subscribed to:

Great job Adrien Barbaresi!

#selfhosting #python #rss #tui #html2text #newspaper3k #goose3 #trafilatura

Last updated 2 years ago