Andreas Wagner · @anwagnerdreas
771 followers · 2009 posts · Server hcommons.social

Reading of recently (👋🏻 @felwert ), would you prefer to have a corpus with un-normalized historical spelling variants or rather one with only the lemmatized tokens? We have a mechanism for lemmatizing, but not for "just" normalizing, so this option is not viable for us in the salamanca.school project.

Perhaps @dta_cthomas can you share some experiences with offering both?

Second question: do you know of alternative "distant reading" visualization tools/libraries/platforms to integrate into a (headless) corpus/collection website? (Without trying, I suppose this excludes some visualization-capable corpus analysis apps like or , but I'd be happy to be proven wrong.)

#voyanttools #txm #corpusexplorer

Last updated 1 year ago