DHQuarterly · @DHQuarterly
234 followers · 5 posts · Server hcommons.social

Check out Tobias Englmeier et al.’s work “Using an Advanced Text Index Structure for Corpus Exploration in Digital Humanities”, which shows ways to explore #corpuses through symmetric compacted directed acyclic word graphs (SCDAWGs), offering ways to answer many of the questions raised in #dh research:
digitalhumanities.org:8081/dhq

Exploring #microhistories of the #holocaust? See “Algorithmic Close Reading: Using Semantic Triplets to Index and Analyze Agency in Holocaust Testimonies” by Lizhou Fan & Todd Presner, which uses #text #analysis #methods to search #testimonies:
digitalhumanities.org:8081/dhq

#corpuses #dh #microhistories #holocaust #text #analysis #methods #testimonies

Last updated 2 years ago

DHQuarterly · @DHQuarterly
233 followers · 4 posts · Server hcommons.social

With #chatgpt still making news, we decided to highlight some of the pieces from our vault on language models.

Check out “Digital Humanities and Natural Language Processing: Je t’aime... Moi non plus” by Barbara McGillivray, Thierry Poibeau & Pablo Ruiz Fabo which focuses on more collaboration between datasets and tools:
digitalhumanities.org:8081/dhq

See Diego Jiménez–Badillo et al.’s work titled “Developing Geographically Oriented NLP Approaches to Sixteenth–Century Historical Documents: Digging into Early Colonial Mexico”, exploring how #nlp and other #computational approaches can be applied to understand large #historical #corpuses:
digitalhumanities.org:8081/dhq

#chatgpt #dh #nlp #computational #historical #corpuses

Last updated 2 years ago

Arseny Khakhalin · @ampanmdagaba
221 followers · 503 posts · Server sigmoid.social

Cross-boost (excerpts) from
v buckenham (@v21@🐦.com)

there is a real fear among researchers that the last big #corpuses of human written #text have already been captured. all future scrapes of the internet for text to learn from will be contaminated by machine-speak.
...
funny to think of a time when generated text is recognizable due to its use of typically 2020-ish patterns and references. a cultural fixed point new models start from...

Link: twitter.com/v21/status/1490297

#ai #corpuses #text #speech

Last updated 2 years ago