A long time ago, I wanted to do a #PhD project on text reuse & text production strategies of #EarlyModern #Author #GeorgGreflinger. I estimated ca. 30,000 pages of printed text. It was 2009/10, and large-scale digitisation of older books had just started. #OCR was a mess, & #HTR hadn't been a thing yet. Eventually, I abandoned the project, since manually transcribing 30,000 pp. & then doing computational analysis for text similarity & re-use was unfeasible.
Imagine I wanted to do that now!
#phd #earlymodern #author #georggreflinger #ocr #htr
A blog post continuing my series of posts about the #EthicaComplementoria #DigitalScholarlyEdition project on the #GeorgGreflinger weblog: https://greflinger.hypotheses.org/716
Today about #HTR #Transkribus and re-using models.
#ethicacomplementoria #DigitalScholarlyEdition #georggreflinger #htr #Transkribus
#Day 3 of research leave continues with a blog post on the #GeorgGreflinger weblog: https://greflinger.hypotheses.org/586
Today's digression was filenames #RDM #ResearchDataManagement. However, there was also the starting meeting w/ the new research assistant who will work on the automatic transcription of the Danish translation of the #EthicaComplementoria from 1678!
#day #georggreflinger #rdm #researchdatamanagement #ethicacomplementoria
#Day1 of my summer research leave!
Read about the exciting things I have been up to today on the #GeorgGreflinger weblog: https://greflinger.hypotheses.org/568 #EthicaComplementoria #ResearchDataManagement #Histodons #BookHistory #EarlyModernHistory
[Hint: "exciting" is a euphemism for being mad at my past self for not properly managing data...]
#day1 #georggreflinger #ethicacomplementoria #researchdatamanagement #histodons #bookhistory #earlymodernhistory