Elias Dabbas :verified: · @elias
57 followers · 98 posts · Server seocommunity.social

Get all meta tags from a page

🔵 Use to explore which meta tags exist on a certain URL
🔵 Get all tags, all attributes, and all their values
🔵 Discover some great "best practices" as I did in one of the screenshots ;)
🔵 Instructions, code, and a few example outputs can be found in the link

bit.ly/3OyBAvI

#seo #datascience #python #scrapy

Last updated 1 year ago

Laboratório Hacker de Campinas · @lhc
128 followers · 91 posts · Server mastodon.com.br

As inscrições GRATUITAS para o tutorial "Raspando Dados da Internet com Python" com @rennerocha na manhã do sábado 01 de julho já estão abertas!

Mais informações em: eventbrite.com.br/e/tutorial-r

#scrapy #python #webscraping #hackerspaces

Last updated 1 year ago

Rami Krispin :unverified: · @ramikrispin
889 followers · 406 posts · Server mstdn.social

Web scraping with Scrapy course 🚀🚀🚀

A new course by freeCodeCamp for web scrapping with Python and scrapy by Joe Kearney. The course is for beginners level.

Resources 📚
➡️ Course: youtube.com/watch?v=mBoX_JCKZT
➡️ Course website: thepythonscrapyplaybook.com/fr
➡️ Scrapy: docs.scrapy.org/en/latest/

#webscrapping #scrapy #Python

Last updated 1 year ago

Dave Mackey · @davidshq
802 followers · 1237 posts · Server hachyderm.io
octplane · @octplane
97 followers · 240 posts · Server mastodon.xyz

Pendant ce temps, là, on scrappait des sites accessibles via l'offre BnF pour extraire des PDFs plutôt que d'utiliser un site mal foutu pour lire un journal...

#scrapy #bnf #pdf

Last updated 2 years ago

Frederik Elwert · @felwert
754 followers · 1068 posts · Server mstdn.social

@grvsmth In this case from . But other scrapers like can also output CSV. Is this practice so uncommon? I always thought it a practical way to keep metadata and text together. But maybe I've been doing things wrong the whole time. 😅

#scrapy #openrefine

Last updated 2 years ago

Matt Layman · @mblayman
89 followers · 348 posts · Server mastodon.social

🐍 How can you scrape data from webpages using ? In this talk, you'll see how this is possible with . youtube.com/watch?v=tdA1cl6LiC

#python #scrapy

Last updated 2 years ago

Tahsin J · @knyghtmare
2 followers · 19 posts · Server mas.to

Hey, all!
Here's some must-knows if you're wanting to get into

Mathematics:
, ,

Data Mining:
or

EDA & DataVisualisation:
, , ,

Machine Learning & Deep Learning:

, , , , PyCaret

#pytorch #keras #scikitlearn #tensorflow #seaborn #plotly #matplotlib #pandas #scrapy #beautifulsoup #statmodels #scipy #numpy #datascience #Python

Last updated 2 years ago

Tahsin J · @knyghtmare
3 followers · 43 posts · Server mas.to

Hey, all!
Here's some must-knows if you're wanting to get into

Mathematics:
, ,

Data Mining:
or

EDA & DataVisualisation:
, , ,

Machine Learning & Deep Learning:

, , , , PyCaret

#pytorch #keras #scikitlearn #tensorflow #seaborn #plotly #matplotlib #pandas #scrapy #beautifulsoup #statmodels #scipy #numpy #datascience #Python

Last updated 2 years ago

Ryan He · @ryanhe
40 followers · 165 posts · Server moe.pastwind.top

抓網頁內容要比用 簡單多了
scrapy.org/

#scrapy #beautifulsoup

Last updated 2 years ago

Ryan He · @ryanhe
40 followers · 164 posts · Server moe.pastwind.top

抓網站內容要比用 簡單多了
scrapy.org/

#scrapy #beautifulsoup

Last updated 2 years ago

Oriol Piera :python: · @cortsenc
111 followers · 239 posts · Server mastodont.cat
DeaDSouL :fedora: :fediverse: · @DeaDSouL
103 followers · 290 posts · Server fosstodon.org
Jose Marichal · @JoseMarichal
284 followers · 68 posts · Server mastodon.social

is a pain in the rear for a baby data scientist/coder like me!

#scrapy

Last updated 2 years ago

Riverfount :verified_alt:​ · @Riverfount
18 followers · 173 posts · Server mstdn.social

Escrevendo um web scraping em Python usando a lib Scrapy, vou te contar a parte do Python e do Scrapy é tranquila, mas traçar as variantes que um mesmo web site tem com as mudanças de conteúdo são bem complicadas!!!

#WebDev #scrapy #Python_3

Last updated 2 years ago

Phel · @phel
139 followers · 2433 posts · Server toot.cafe

@ajroach42 You could give a shot - pretty easy to configure and scrape the relevant data. Maybe you need to feed the login cookie manually, but otherwise it could work.

TBH, this is probably already overengineering...

#scrapy

Last updated 4 years ago

Ell · @cincodenada
49 followers · 2233 posts · Server cybre.space

Having a real grumpy old man moment here trying to write a bog-standard crawler to log in to a website and download a couple dozen links.

I made the mistake of trying to use , which is apparently now an aggressively asynchronous crawler ecosystem that seems hostile to my goal of visiting a set sequence of pages, one after the other.

I'm not writing a search engine here, I don't need this to be massively parallel, I don't care about your Cloud Service, I literally want you to just log in at page A and then use that session to visit page B, and then go down a list of links on that page one by one and save them to my disk, but apparently that's too blase these days.

Ironically, the last time I did this using Node and it was way less hassle. Guess I'll dust off that project and use that instead.

#scrapy

Last updated 5 years ago

Cécile Le Chevalier · @CLC
194 followers · 1175 posts · Server framapiaf.org

#python #scrapy

Last updated 6 years ago