So far, so impressive! It worked very well, with a few expected issues, such as misread e/o superscripts and some uppercase letters that look different between the two prints. Correcting these won't take much time. I'll check the remaining four pages and then decide whether to improve the current model for this particular print! #EthicaComplementoria #Transkribus #DigitalScholarlyEdition
After successfully training and improving my #Transkribus #HTR model for the #EthicaComplementoria prints, I used it on the most recent Ethica, from 1728. Typographically, it is still very similar to the editions from the 17th century, so that should not be a big problem. I'll quality-check the first eight pages today and see whether it is worthwhile to improve the model for this particular print.
So, I had the UiO-GPT create 2 scripts: one for creating 2 txt files, concatenating the contents of 10 individual files in a particular order each, and then the other for comparing these 2 files line by line and creating a CSV file containing only those lines that had differences in them.
Tweaking and fixing stuff still took time, but I am happy with the progress and found a sensible way of using #GPT. This concludes my #ResearchLeave: I progressed quite a bit w/ the #EthicaComplementoria!
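For readers who want to try something similar, here is a minimal sketch of what two such scripts might look like in Python. It is not the code UiO-GPT produced; the function names, the CSV column names, and the choice to pad the shorter file with empty lines are all assumptions made for illustration.

```python
import csv
from itertools import zip_longest
from pathlib import Path


def concatenate(parts, target):
    """Write the contents of the files in `parts`, in the given order, to `target`."""
    text = "\n".join(
        Path(part).read_text(encoding="utf-8").rstrip("\n") for part in parts
    )
    Path(target).write_text(text + "\n", encoding="utf-8")


def differing_lines(file_a, file_b):
    """Return (line_number, line_a, line_b) for every pair of lines that differs."""
    lines_a = Path(file_a).read_text(encoding="utf-8").splitlines()
    lines_b = Path(file_b).read_text(encoding="utf-8").splitlines()
    # Assumption: if one file is shorter, its missing lines count as empty strings.
    pairs = zip_longest(lines_a, lines_b, fillvalue="")
    return [
        (number, a, b)
        for number, (a, b) in enumerate(pairs, start=1)
        if a != b
    ]


def write_diff_csv(file_a, file_b, csv_path):
    """Write only the differing lines to a CSV file and return how many there were."""
    rows = differing_lines(file_a, file_b)
    with open(csv_path, "w", newline="", encoding="utf-8") as handle:
        writer = csv.writer(handle)
        writer.writerow(["line", "text_a", "text_b"])  # hypothetical column names
        writer.writerows(rows)
    return len(rows)
```

A strict line-by-line comparison like this only works if both transcriptions have the same line breaks; a single inserted or dropped line shifts everything after it, which is one reason some manual tweaking can still be needed.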
And that's it! I sent off the new training set, which will hopefully fix the issues I have encountered!
I will also use this model for the 1728 print. It's a different printing press, but overall, the two prints are very much alike, and so is the #Fraktur type they use. #Transkribus #EthicaComplementoria #HTR #EarlyModern #PrintHistory
Moving back to the #HTR issues, we have phenomena like these (shown in the image): The layout detection model draws 'short' lines when the text is warped in the book fold. This leads to especially slim letters and punctuation not getting recognized. When I do the corrections, I will also extend the lines to include these characters. Hopefully, it will improve the recognition! #Transkribus #EthicaComplementoria
While proofreading, I observed 2 distinct ways #Typography is realised in the 1674 #EthicaComplementoria print from Copenhagen. This happens when more than one worker is typesetting the print. Let's call them Villads & Emil. Villads has been doing this job for a while now; he routinely typesets all #Latin words w/ a beautiful cursive #Antiqua font. Emil, who's already having difficulty reading the #German text, uses #Fraktur for everything & only sets Latin proverbs in cursive. #BookHistory
#FrüheNeuzeit #Humor #Hofkultur #Komplimentierliteratur
"Must es nicht machen wie jener vom Adel / welcher vom Churfürsten zu Sachsen das schöne Gut Alt-Sattel genennet / begerete / weil aber dem Fürsten solches Gut sehr lieb und nutzbar war / sagte er zum Edelmann: Lieber / du bist ein Narr / was wilst du mit einen alten Sattel machen / ich will dir lassen fünff Thaler geben / kauff dir einen neu."
#EthicaComplementoria, Kopenhagen 1674, p. 34.
Exciting! Overnight, the text recognition job was completed and now I get to see how good the transcription is! #EthicaComplementoria #Transkribus #HTR
It's #Wednesday, and I'm continuing my work on transcribing the #EthicaComplementoria from 1674 using #Transkribus. My "job" ran overnight, and the training results were impressive! So, I did a final check on the layout recognition of the remaining pages and ran a #TextRecognition job on the entire Ethica print. The book contains both the Ethica Complementoria and the Tranchierbuch. The latter has many illustrations and tables, so I've decided to run it separately.
A blog post continuing my series of posts about the #EthicaComplementoria #DigitalScholarlyEdition project on the #GeorgGreflinger weblog: https://greflinger.hypotheses.org/716
Today about #HTR #Transkribus and re-using models.
I haven't talked much about my current #ResearchLeave this time. After a turbulent, travel-intense, and health-depleting summer, there was too much on my ordinary To-Do list, and it didn't feel right to hide away in my research cave. So, last week was largely spent checking off things on my list that had deadlines or needed others' input. This week, I am trying to focus on tying up some threads from my #EthicaComplementoria research sprint in June and seeing how far I get without overdoing it.
Officially, my second 2-week #ResearchLeave this year has started, but I have some cannot-postpone tasks that need doing. So, begrudgingly, I will check them off my to-do list today.
On a different note: This leave was planned for working on the Edvard Munch correspondence metadata paper. However, due to scheduling issues with my co-author, Munch is postponed to November. I will instead work on proofing the #EthicaComplementoria edition for the #GermanTextArchive and work on its Danish translation.
🧵 3/ #ResearchSupportPartnershipUiO continues after lunch w/ my "adopted" colleagues from the #History section. I had a coordinating & getting-to-know meeting to set up some presentations and show-and-tell sessions for the Depts. #DigitalMeetup that happens every #Tuesday. I offered to do short sessions on #TaDiRAH, #Transkribus, #Tropy and #Tesseract during the fall. I'll use the 1678 #Danish translation of the #EthicaComplementoria as an example for #AutomatedTextRecognition of C17th books!
#Day10 concludes my summer research leave. I wrote the final blog post on my private website this time, summing up my experiences from the last ten days and the last ten years with a personal story and a statement of commitment to a project which has become a part of me: https://www.annikarockenberger.com/?p=1382 #EthicaComplementoria #DigitalScholarlyEdition #DigitalHumanities #AltAc #Academe #Ethics
#Day8 of my research leave ends late, but successfully, after proofreading the 1643 text of the #EthicaComplementoria edition with the #GermanTextArchive #DTA. Read about this and related thoughts on my blog https://greflinger.hypotheses.org/686. Don't forget: life doesn't necessarily go the way one plans. Prioritize. Take care!
#Day8 Already 3 hours into proofreading the transcription of the 1643 print of the #EthicaComplementoria, which will be available through the German Text Archive #DTA http://www.deutschestextarchiv.de/ shortly.
The transcription was done manually, with several checks of the physical book in Bamberg.
However, converting a formatted MS Word file into a #DTA base format #XML file didn't go as smoothly as I had hoped. This was done in 2016, but the results were not optimal & I noticed only later.
#Day8 of my summer research leave: I started the day following up on the #Transkribus automatic transcription of the #Danish print of the #EthicaComplementoria from 1678. We have transcribed 25 example pages and are now training with the #NorFraktur model from the National Library of Norway. Our goal: a character error rate (CER) below 2.0%. Let's see if we can reach that! The book is 300 pages and includes the #Tranchierbuch and the #Leberreime, a form of popular limericks.
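As a rough illustration of what that target means, here is a minimal sketch of the textbook definition of CER: the edit distance between the model output and the manually corrected text, divided by the length of the corrected text. It assumes the goal is expressed in percent, and it is not how Transkribus computes its figure internally; the function names are made up for this example.

```python
def levenshtein(reference, hypothesis):
    """Count the minimum number of character insertions, deletions, and substitutions."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            current.append(
                min(
                    previous[j] + 1,         # deletion
                    current[j - 1] + 1,      # insertion
                    previous[j - 1] + cost,  # substitution or match
                )
            )
        previous = current
    return previous[-1]


def cer_percent(reference, hypothesis):
    """Character error rate in percent, relative to the length of the reference text."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return 100 * levenshtein(reference, hypothesis) / len(reference)
```

Under this definition, a CER of 2% corresponds to roughly one wrong character in every fifty, so on a page with 1,500 characters there would still be about 30 corrections to make by hand.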
#Day7 finishing with a blog post in minor key about my views on the value of collaboration and communication and the printed study edition of the 1643 #EthicaComplementoria print at Harrassowitz Verlag: https://greflinger.hypotheses.org/681 #BookHistory #DigitalScholarlyEdition #Academe #Demotivation
#Day7 continues, but I had to focus on something else not to get completely demotivated w/ the #EthicaComplementoria project.
So instead, I prepared my #DH2023 #workshop talk about the Norwegian translation of the @tadirah taxonomy. Slides are done; only minor - aesthetic - changes will be made. See you in Graz!
#Day7 starts w/ a shock. Turns out that my working on a digital scholarly edition of the #EthicaComplementoria doesn't stop others from publishing a print edition, without making contact or asking about the status of my project. I mean: do a Google search, and you will find my website and the project website with contact details, and you can ask about the current status. But I guess being a professor frees you from such considerations.