The CorCenCC Welsh semantic tagger in PyMUSAS is now included in the #wmatrix version 6 beta tag wizard. The first sentence I tagged was "Eisteddodd y gath ar y mat." Thanks to Steve Morris for the translation. Let me know if you want to be a beta tester: https://ucrel-wmatrix6.lancaster.ac.uk
Had a bit of a breakthrough when thinking about my #corpuslinguistic analysis today. Needed to find keywords to look for prototypical texts, but realised that comparing two sets of my own data wasn't as useful in this context as comparing all my data with the brilliant #BNC. Between #Wmatrix and #ProtoAnt I am thoroughly excited about what #Linguists are capable of!
#wmatrix version 6 beta testers have been getting some early xmas presents with new features: contextual sorting options for concordances, multiple files per corpus, lemma frequency lists and concordances, plus range (file dispersion) counting. These new features have been tested by the Quo VaDis and CorCenCC project teams. If you'd like to beta test Wmatrix6 in January then please get in touch with me.
Many thanks to Dawn Knight for the invitation to run a #wmatrix software training workshop for colleagues and students in Cardiff today. We also tried out the shiny new corpus library feature for the very first time. Details are in the online tutorials: https://ucrel.lancs.ac.uk/wmatrix/tutorial/