A long time ago, I wanted to do a PhD project on the text reuse & text production strategies of the early modern author Georg Greflinger. I estimated the corpus at ca. 30,000 pages of printed text. It was 2009/10, and large-scale digitisation of older books had just started. OCR was a mess, & HTR wasn't a thing yet. Eventually, I abandoned the project, since manually transcribing 30,000 pp. & then running a computational analysis of text similarity & reuse was unfeasible.
Imagine I wanted to do that now!

#phd #earlymodern #author #georggreflinger #ocr #htr

Last updated 2 years ago

A blog post continuing my series of posts about the Ethica Complementoria digital scholarly edition project on the Georg Greflinger weblog: greflinger.hypotheses.org/716
Today's topic: HTR and re-using Transkribus models.

#ethicacomplementoria #DigitalScholarlyEdition #georggreflinger #htr #Transkribus

Day 3 of research leave continues with a blog post on the Georg Greflinger weblog: greflinger.hypotheses.org/586
Today's digression was filenames (research data management). However, there was also the kick-off meeting w/ the new research assistant, who will work on the automatic transcription of the Danish translation of the Ethica Complementoria from 1678!

#day #georggreflinger #rdm #researchdatamanagement #ethicacomplementoria

Day 1 of my summer research leave!
Read about the exciting things I have been up to today on the Georg Greflinger weblog: greflinger.hypotheses.org/568
[Hint: "exciting" is a euphemism for being mad at my past self for not properly managing data...]

#day1 #georggreflinger #ethicacomplementoria #researchdatamanagement #histodons #bookhistory #earlymodernhistory
