Heaviness makes a tool less interoperable, less tinkerable, and less understandable, and using it drains more resources.
Developing high-resource tools requires more funding. Maintaining and further developing one is at least a full-time job.
Much of the problem is that today's operating systems are designed around high-resource tools. Low-resource tools are often seen as technical.
We need low-resource tools that aim to be part of a toolbox.
I wrote about my first steps moving forward in my Balochi language modelling project. Training a custom tokenizer is my initial short-term goal, but to do that I first needed to put together a small dataset to work with. I describe some of the things I did to that end, along with a list of resources I'm maintaining as I continue on this journey.
https://mlops.systems/posts/2023-05-29-balochi-language-dataset.html #balochi #nlp #lowresource
When your COVID lasts a week and screws up your end-of-year plans as well as attending #GEM at #EMNLP2022.
I will be connecting to the virtual-only poster session to discuss our paper on #LowResource #NLG in about an hour.
Great fun today giving an invited talk for the NLP seminar series at Dublin City University :D
I'll put the slides up online sometime soon for anyone who wants to hear about our work on #LowResource #NLG at #EdinburghNapierUniversity and building up a dataset for #ScottishGaelic.
In my own work, I'm adapting simpler, less data-hungry models to perform data-to-text #NLG in #LowResource settings. Part of this effort will involve exploring #MultitaskLearning and #Pipeline approaches to neural #NaturalLanguageGeneration.
Follow me if you want more content related to these topics, though be advised that this is a whole-person account and I will talk about things other than work as well.