Gotta say, working with #openxml (#ooxml) is scary. The Open XML SDK is so low-level, with no guard rails. You have to understand all of the underlying XML to do anything meaningful, so what’s the point? And none of the libraries I’ve looked at support what I want to do.
At this point, I’m taking a swing at processing the XML by hand with #beautifulsoup4. Seems to do the trick 😅
#openxml #OOXML #beautifulsoup4
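For anyone curious what "processing the XML by hand" can look like, here is a minimal sketch. It assumes a WordprocessingML (.docx) file, which the post does not actually specify, and it builds a tiny hypothetical stand-in document in memory so it runs on its own. The names `DOCUMENT_XML`, `make_docx_bytes`, and `paragraph_texts` are made up for illustration. It uses Python's built-in `html.parser` backend only to avoid an extra dependency; that backend lowercases tag names, so BeautifulSoup's `"xml"` feature (which needs lxml) is probably the better fit for real files.

```python
import io
import zipfile

from bs4 import BeautifulSoup

# Hypothetical, heavily trimmed stand-in for word/document.xml.
DOCUMENT_XML = (
    '<?xml version="1.0" encoding="UTF-8"?>'
    '<w:document xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main">'
    "<w:body>"
    "<w:p><w:r><w:t>Hello, </w:t></w:r><w:r><w:t>world</w:t></w:r></w:p>"
    "<w:p><w:r><w:t>Second paragraph</w:t></w:r></w:p>"
    "</w:body></w:document>"
)


def make_docx_bytes(document_xml):
    """Pack the XML into an in-memory zip, mimicking the .docx container."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("word/document.xml", document_xml)
    return buf.getvalue()


def paragraph_texts(docx_bytes):
    """Return the text of each <w:p> paragraph by joining its <w:t> runs."""
    with zipfile.ZipFile(io.BytesIO(docx_bytes)) as zf:
        xml = zf.read("word/document.xml")
    # Assumption: html.parser is good enough here because the tags we look
    # for (w:p, w:t) are already lowercase.
    soup = BeautifulSoup(xml, "html.parser")
    return [
        "".join(t.get_text() for t in p.find_all("w:t"))
        for p in soup.find_all("w:p")
    ]


print(paragraph_texts(make_docx_bytes(DOCUMENT_XML)))
```

Joining the `w:t` runs per paragraph matters because Word tends to split one visible sentence across several runs.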
Spent a bit of time writing a web scraper for the League of Comics website, and totally remembered why I never spend my time on web scrapers. #python #beautifulsoup4
So I've been playing with #Python again and finally worked out how to fix a problem with conflicting versions of #beautifulsoup4. Not bs4's fault; the problem was actually #pycaption requiring an older version of it.
Turned out all I needed to do was tweak the setup.py for pycaption to correct the dependency versions, then clean, rebuild, and re-install locally.
Now I don't have to be careful how I run #Gnomecast (:*
#python #beautifulsoup4 #pycaption #gnomecast
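A rough sketch of the kind of dependency tweak described above, not the actual change made to pycaption. The helper `relax_pin`, the requirement list, and every version number here are hypothetical placeholders. The idea is simply to swap a strict pin for a lower bound so a newer beautifulsoup4 is accepted.

```python
import re


def relax_pin(requirements, package, minimum):
    """Replace any version specifier on `package` with a '>=minimum' bound."""
    pattern = re.compile(
        r"^\s*" + re.escape(package) + r"\s*([<>=!~].*)?$", re.IGNORECASE
    )
    return [
        f"{package}>={minimum}" if pattern.match(req) else req
        for req in requirements
    ]


# Hypothetical install_requires list; the versions are placeholders,
# not pycaption's real pins.
install_requires = ["beautifulsoup4==1.0", "examplelib>=1.0", "anotherlib"]

print(relax_pin(install_requires, "beautifulsoup4", "2.0"))
```

In practice the same edit can be made by hand in the `install_requires` list of setup.py, followed by something like `pip install .` from the project directory to re-install locally.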