Really like this #paper on automatic #recognition of #direct #speech in #French #narrative texts. The authors do many cool things, among them building on some of our old sentence-level #annotations and using this #corpus with relatively messy line breaks as a "noisy" test set. It's worth putting your data out there, folks! – Of course, paper and data + code repository are #open #access: https://arxiv.org/abs/2306.15634 and https://github.com/deezer/aads_french -- Our old data: https://github.com/cligs/projects/tree/master/2016/dh #openscience
#paper #recognition #direct #speech #French #narrative #annotations #Corpus #open #access #openscience
This article provides a methodology used to develop a full-text journal article corpus using the R fulltext package:
:doi: https://doi.org/10.1177/01655515231171362
The paper also provides supplementary codes, so feel free to use :rstats: to curate #corpus based on PLoS, Scopus, arXiv, bioXriv, Crossref, Entrez, Europe PMC, Biomed Central data.
#Corpus #fulltext #journal #articles
What #corpus #metadata do #Computational #Literary #Studies need?
A survey on criteria for the creation of literary text corpora or collections wants to find out from you: https://survey.academiccloud.de/index.php/916873?lang=en
Created by the NFDI consortium @Textplus, the DFG priority program "Computational Literary Studies", and the EU project @CLSinfra have jointly created this survey. #CLS
#Corpus #metadata #computational #literary #studies #cls
Ok #corpus people who use #ELAN, how do you store #metadata on the participants you're #transcribing? Seems the devs suggest CMDI files, but that seems overly complicated (unless there's some benefit for exporting afterward).
#linguistics #corpuslinguistics #sociolinguistics #languagevariation #psycholinguistics #transcription
#Corpus #elan #metadata #transcribing #linguistics #corpuslinguistics #sociolinguistics #languagevariation #psycholinguistics #transcription
RT @congabonga
LAST CALL FOR PAPERS: #Corpus Approaches to #Lexicogrammar (LxGr2023). Abstract submission closes *tomorrow*, 15 April.
https://sites.edgehill.ac.uk/lxgr/lxgr2023 #corpuslinguistics #lxgr @EHU_Research @edgehill
#Corpus #lexicogrammar #corpuslinguistics #lxgr
Ha sido una gran alegría editar con Claudia Sánchez-Gutiérrez y Nicole Tracy-Ventura este número temático
"Corpus en español: investigación, diseño y aplicabilidad a la enseñanza" en Journal of Spanish Language Teaching https://www.tandfonline.com/toc/rslt20/current
7 artículos sobre #corpus, #didáctica, aprendizaje, enseñanza #ELE, formación de docentes...
Are you aware of a text (paper, book chapter, etc.) that summarizes the dos and don'ts in keyword selection when compiling a text #corpus from sources other than academic publication databases? Somehow I can only find texts concerned about systematic literature reviews. 🤔 Can't believe no one from the #textasdata #CommunicationResearch community has gone down this rabbit hole yet? 😄
#Corpus #TextAsData #communicationresearch
Join us tomorrow for our 1st event of the year with Prof Deignan, with whom we will discuss the 'The Potential of #Corpus #Linguistic' for #SocialResearch.
https://socialresearchmethods.leeds.ac.uk/events/the-potential-of-corpus-linguistics/
#Corpus #linguistic #socialresearch
The living arrangements for a historian of late antiquity aren't always so grand, but I did enjoy seeing the interiors of the Corpus Christi College, Cambridge, Master's Lodge in a recent issue of The English Home magazine. (The current Master is Christopher Kelly.)
#InteriorDesign #CorpusChristiCollegeCambridge #Corpus #Cambridge #LateAntiquity #CambridgeUniversity
#interiordesign #corpuschristicollegecambridge #Corpus #Cambridge #lateantiquity #CambridgeUniversity
Acaba de publicarse en acceso abierto
"Corpus de español y sus usos pedagógicos: desafíos y oportunidades", por Claudia Sánchez-Gutiérrez, Nicole Tracy Ventura y su servidora ➡️bit.ly/3WBun0F
Es la introducción de un número temático de Journal of Spanish Language Teaching, que hemos coeditado con Claudia Sánchez-Gutiérrez y Nicole Tracy Ventura, sobre #corpus y enseñanza #ELE. Entonces, llegarán muchos artículos interesantes en las semanas que vienen 📝 📚.
Muchas gracias a tod@s l@s autor@s
@noam Basically, I'm looking for a politically/socially relevant topic around which enough #discourse was generated on #SocialMedia (for instance, #COP27).
The idea would be collecting and analysing a #corpus to look at framing techniques by #media, or at user-generated discourse (something between sentiment and thematic analyis).
But I am open to other ideas!
#discourse #SocialMedia #cop27 #Corpus #media
Looking for #climatechange / #environment / #sustainability related topics in the #UK context for a #corpus-based #DiscourseAnalysis.
Suggestions welcome!
#ClimateChange #environment #sustainability #uk #Corpus #DiscourseAnalysis
RT @RudyLoock
For #corpus lovers out there, a new version of #BootCaT (1.55) is available at https://bootcat.dipintra.it/
Looking for a basic introduction to computational analysis of literary genre in #CLS? Look no further! => https://dragonfly.hypotheses.org/1219 It covers some basics of #corpus building for #genre analysis and a bit of contrastive analysis using #keywords and #topic modeling.
#cls #Corpus #genre #keywords #Topic
New server, new #introduction!
Hi everyone! I'm very excited to join an explicitly #queer server. I'm a #bisexual, #polyamorous he/they who likes #singing and keeps insisting he's going to start #running again.
I work in #ComputationalLinguistics doing #NaturalLanguageGeneration with an interest in #Psycholinguistics, #Corpus creation, and human #Evaluation of #NLG systems.
You can find me at various #cafe/s in #Edinburgh #Scotland
Nice to meet you! :D
#introduction #queer #bisexual #polyamorous #singing #running #computationallinguistics #NaturalLanguageGeneration #psycholinguistics #Corpus #evaluation #nlg #cafe #edinburgh #scotland
Coming from #computerscience, I now work as a researcher in #corpuslinguistics. at Humboldt-Universität zu Berlin together with a fantastic crowd of #linguistic researchers. As a #RSE (research software engineer) I maintain several #opensource software like the #ANNIS #corpus search or the #Hexatomic #annotation editor.
#researchdata management and other issues related to #OpenScience are also daily part of my job.
#OpenScience #researchdata #annotation #Hexatomic #Corpus #ANNIS #opensource #RSE #Linguistic #corpuslinguistics #computerscience #introductions
It's a fair corpus, guv! Next Tuesday (8 Nov) I have the honour to present to the Surrey Linguistics Circle how the DGS Corpus implements CARE and FAIR principles, showing how ethics and open data must go hand in hand in sign language research.
The presentation is open to the public, starting at 11.30 UTC, so join us at https://www.smg.surrey.ac.uk/events/
#opendata #signlanguage #corpus
#Corpus #signlanguage #opendata
RT @MaiteMNa: Un nuevo corte de luz deja a oscuras #Norte #Granada, mientras el ferial #corpus deslumbraba.
Barrios rechazados y oscuros:la vida se entreteje de pobreza, de pasos sin futuro, de sálvese quien pueda. ¡Y con tiniebla y cortes de luz que a nadie importan!😡 https://www.elindependientedegranada.es/ciudadania/nuevo-corte-luz-deja-oscuras-zona-norte-granada-mientras-ferial-deslumbraba