All this is an example of stuff I learn while assisting students on projects--they didn't really need the XSLT to be bundled into the python script, but it was a good opportunity for me to learn how to do it, at last! #python #xslt #saxonica #saxonc
#saxonc #saxonica #xslt #python
Pipeline concluded: Afterwards must clean up new spaCy mess b/c the autotagging matches over its own tags--gross! Regex could clean this up but XSLT is better b/c simpler: reads element nodes natively, thank you very much! So we run XSLT (in Python). Yay for a pipeline roundtrip! (3/3) #xslt #python #saxonica #saxonc
#saxonc #saxonica #python #xslt
Pipeline (in stages): Python reads XML nodes (retrieved by SaxonC XPath), the spaCy LLM goes looking for named entities, parts of speech, whatever, we run it thru the NER ruler to filter/change/fix the spaCy LLM mess, then search and replace matching strings in the XML with new XML elements. #xslt #python #saxonica #saxonc
#saxonc #saxonica #python #xslt