Daniil Skorinkin · @skorinkin
97 followers · 37 posts · Server hcommons.social

Can you guess which of these Chekhovs, Gogols, and Ostrovskys are FAKE (ChatGPT-generated in the style of each writer) looking at the similarity visualisations? Its an easy task but I find it somewhat educational😉 Inspired by recent work by @rebsim at the
@aiucd conf
NB: it is not about size, the sizes of generated texts are in the same range here (I used iterative chain prompting to generate long-enough stuff)

#stylometry

Last updated 1 year ago

Christof Schöch · @christof
1573 followers · 2361 posts · Server fedihum.org

Nice thread, with a punchline for attendees interested in ... read till the end: mastodon.social/@mhoye/1107075

#DH2023 #stylometry

Last updated 1 year ago

Nanette Rissler-Pipka · @NanetteRissler
426 followers · 183 posts · Server fedihum.org

yesterday's first day with so many parallel and interesting sessions, I was a bit overwhelmed and forgot to share here for people who can't be in : highlights of the day, apart from the : the session with @rabeakleymann , Jennifer Edmond, @nabsiddiqui and more

#DH2023 #graz #stylometry #dh #theory

Last updated 1 year ago

Nanette Rissler-Pipka · @NanetteRissler
426 followers · 183 posts · Server fedihum.org

Another very intriguing subject in the session at : are there different sources to the Thora, and more importantly can this can be proved statistically. Very interesting approach, I just don't quite agree with the last comment that the mathematical method is completely objective and without biases - hmm think about the starting point which is the text and the assumptions made on it... Yes, statistical methods can create evidence but they are not objective

#stylometry #DH2023

Last updated 1 year ago

Nanette Rissler-Pipka · @NanetteRissler
426 followers · 183 posts · Server fedihum.org

the morning starts right away with highlights in : Jan Rybicki spoke about literature translations and it turns out that deepl copies the author's style better than humans... well and Marcel can't be translated at all. Think about what said about translating himself in Spanish and French: "si je pense dans une langue..."

#DH2023 #stylometry #proust #picasso

Last updated 1 year ago

JCLS · @jcls
277 followers · 153 posts · Server fedihum.org

At , we're now having the pleasure of attending a (as always) very inspiring and colorful by Jan , on and distant reading applied to 10,005 novels in (translated or original).

#ccls2023 #keynote #rybicki #stylometry #polish

Last updated 1 year ago

IDHN · @idhn
51 followers · 298 posts · Server fedihum.org

Estrella Samba-Campos is presenting at our 9th IDHN conference her latest research on kutub al-Ê¿ilm and muá¹£annaf collections using Join us tinyurl.com/idhn9conf or check out her profile at the Universidad Complutense de Madrid.

#stylometry

Last updated 1 year ago

Till Grallert · @tillgrallert
527 followers · 1180 posts · Server digitalcourage.social

I thoroughly enjoyed presenting my data-driven research on late Ottoman at . The focus of my paper was stylometric authorship attribution, which relied on earlier collaborative work with Maxim Romanov on establishing parameters for reliable authorship attribution in Arabic for the `stylo()` package in ().

Slides are available at tinyurl.com/dighis23-grallert

#stylometry #PeriodicalStudies #digitalhistory #digitalhumanities #MultilingualDH #rstats #r #dighis23 #Periodicals #arabic

Last updated 1 year ago

Christian Wachter · @ChristianWachter
288 followers · 88 posts · Server fedihum.org

The continues with an inspirational presentation by @tillgrallert: He identifies anonymous authors of Arabic periodical articles through stylometric authorship attribution.

#dighis23 #stylometry #dh

Last updated 1 year ago

Josh Bressers · @joshbressers
814 followers · 533 posts · Server mastodon.social

This week on @kurtseifried and I chat about

There's a tool to look at authors and see if their writing is similar to another user (sock puppets anyone?)

This of course leads to larger discussions about , , , and of course,

opensourcesecurity.io/2022/12/

#OSSPodcast #stylometry #hackernews #privacy #cybersecurity #impersonation #shakespeare

Last updated 2 years ago

Johannes Hentschel · @johentsch
9 followers · 3 posts · Server hostux.social

Hi Fediverse,
Currently I'm spending a lot of my time on the computer researching into in order to finish my @ by the end of 2023. My main subject is and I'm trying to measure stylistic differences between tonal languages of the last four centuries through on ().
I'm here to connect with people who are interested in

#introduction #music #corpora #phd #epfl #musictheory #statistics #harmony #stylometry #dh #DataScience #machinelearning #opendata #dataset #foss #privacy #musicianship #funk #techno

Last updated 2 years ago

Jonathan Reeve · @JonathanReeve
201 followers · 13 posts · Server hcommons.social

Hi! Here's an . I'm Jonathan Reeve, and I work in computational approaches to literary study, using , , , , and methods of , in languages like and .

I maintain open-editions.org, which collects XML editions of James and other writers; corpus-db.org, an API for literary corpora; and text-matcher, a text reuse detection engine. More of my projects are up at github.com/JonathanReeve.

I'm a PhD student in English and Comparative Literature, in my final year at Columbia University, writing a dissertation which models visuality in British .

As a long-term Mastodon user, I'm happy to see a taking place, and even happier that it's happening through my old employer (in a previous iteration), HCommons! I was j0_0n on Twitter: twitter.com/j0_0n. Check out my blog at jonreeve.com.

#introduction #nlp #ai #ml #stylometry #digitalhumanities #python #haskell #tei #joyce #modernism #twittermigration

Last updated 2 years ago

Christof Schöch · @christof
861 followers · 266 posts · Server toot.io

This looks brilliant! Preprint on "Boosting word frequencies in authorship attribution" by Maciej Eder. Instead of relative frequencies, frequency normalisation against a background of semantically similar words was performed. Significant performance gains shown via fascinating heatmaps. See: arxiv.org/abs/2211.01289

#stylometry #AuthorshipAttribution #stylo #kraków #CHR2022 #wordembeddings #Heatmaps #BurrowsDelta #CosineDelta

Last updated 2 years ago

Till Grallert · @tillgrallert
139 followers · 31 posts · Server digitalcourage.social