Matt Coler · @Mattcoler
464 followers · 109 posts · Server fediscience.org

Discover the Future of Speech Technology!

🤖 Speech Tech Summer School: 🤖 robots, 👾 video games, 🗯 voice cloning ... and more!

🗓 May 14-17
📍 Campus Fryslân - University of Groningen @universityofgroningen , the Netherlands

⏰ Few spots left!

rug.nl/education/summer-winter

Please share w interested students of all backgrounds!

#voiceai #speechrecognition #summerschool #speechtech #SpeechTechnology #VoiceTechnology

Last updated 2 years ago

Matt Coler · @Mattcoler
460 followers · 105 posts · Server fediscience.org

Discover the Future of Speech Technology!

🤖 Speech Tech Summer School: robots, video games, voice cloning ... and more!
🗓 May 14-17
📍 Campus Fryslân - University of Groningen, the Netherlands

From robots 🤖 to video games 👾 to speech recognition 🗯, our hands-on summer school will feature workshops, plus speakers from Google, Respeecher, and more.

⏰ Space is limited!
rug.nl/education/summer-winter

#voiceai #speechrecognition #summerschool #speechtech #SpeechTechnology #VoiceTechnology

Last updated 2 years ago

Matt Coler · @Mattcoler
449 followers · 92 posts · Server fediscience.org

A week ago today I was speaking at the 1st Dutch Speech Tech Day in Hilversum. Focusing on what the University of Groningen team was working on and introducing the new MSc Voice Tech. Plus, promoted the upcoming

#summerschool #speechtech #voicetech

Last updated 2 years ago

Matt Coler · @Mattcoler
443 followers · 89 posts · Server fediscience.org

Congrats to Leminh Nguyen from the MSc Voice Technology who was nominated for the prestigious Ben Feringa Impact award because of his groundbreaking work on Luxembourgish Speech Recognition! 💻🗯️ Freely available, for 👏🇱🇺 well done!!

rug.nl/news/2023/01/nominees-f

#luxembourgish #speechtech #openaccess

Last updated 2 years ago

Matt Coler · @Mattcoler
436 followers · 82 posts · Server fediscience.org

Our MSc Voice Tech grad Leminh Nguyen presented his poster at the 2022 IEEE Spoken Language Technology workshop: "improving speech recognition with cross-lingual speech representations"

#speechtech #speechrecognition #luxembourgish

Last updated 2 years ago

Matt Coler · @Mattcoler
436 followers · 82 posts · Server fediscience.org

The IEEE Workshop on “Self-supervision in Audio, Speech, and Beyond” will be held jointly with ICASSP.

📍 Greece 🇬🇷
🗓️ June 4 -- 9
📨 Submissions accepted until Feb 24th.

Accepted papers will appear in the official IEEE ICASSP proceedings and IEEE Xplore.

ℹ️ More info:

sites.google.com/view/icassp-s

#voicetech #speechtech #audio #cfp

Last updated 2 years ago

Vaclav Hanzl · @vaclavh
15 followers · 24 posts · Server sigmoid.social

@nickfisherau Nice. Did you publish any details about your alignment code?
For post-edit - maybe is easier for ?
I recently did phone alignment tool as a gift to Czech and minimizing dependencies (and install hassle), I finally went without anything like Kaldi, using just very basic for NN AM training and then alignment. I guess going zero-up was really easier than decomposing some big thing like Whisper.
github.com/vaclavhanzl/prak

#Praat #speechtech #phonetics #pytorch

Last updated 2 years ago

Nick Fisher · @nickfisherau
28 followers · 295 posts · Server mstdn.social

I still need to manually check (and occasionally correct) the alignments though.

I wrote a couple of Python scripts/extensions for Audacity that load up the audio/labels so you can manually drag the handles to adjust the alignments. Works pretty well!

I know there are various web interfaces for this but seems you need to use them for your entire pipeline (ingest/labelling/export/indexing/etc) or not at all.

#MachineLearning #speechrecognition #speechtech #asr

Last updated 2 years ago

Marieke van Vugt · @mvugt
387 followers · 461 posts · Server akademienl.social

RT @martijnwieling
We have 2 @FacultyofArtsUG @GroNlp vacancies for tenured (!) Assist. Profs in #CompLing/#SpeechTech. If you are interested in applying to minority languages, contact me, as more research time (up to 70%) may then be possible: rug.nl/about-ug/work-with-us/j. RT=nice!

#CompLing #speechtech

Last updated 2 years ago

Phonetics Lab · @phoneticslab
98 followers · 6 posts · Server mastodonapp.uk

Hi everyone we are the Phonetics Lab at Lancaster University! We'll be tooting about fun phonetics stuff as well as news from our lab and collaborators. @LAEL

#introduction #phonetics #linguistics #phonology #labphon #sociophonetics #articulation #forensicphonetics #speechtech #bilingualism #ultrasound #ema

Last updated 2 years ago

Maaike · @Maaike
430 followers · 173 posts · Server mastodon.design

💡 Interesting read on how one of the biggest commercial players out there plans to use Mozilla Common Voice data to make speech AI more inclusive and open to more languages.

💬 Sounds idealistic, but Common Voice datasets are created by unpaid volunteers who donate hours and hours of their speech. Not sure whether I feel comfortable with that, tbh.

💭 Thoughts?

venturebeat.com/ai/nvidia-ente

#speechAI #speechtech #voicetech #speech #voice #transparentai #ethicalai

Last updated 2 years ago

Abi Aryan :verified: · @goabiaryan
777 followers · 171 posts · Server mstdn.social

We'll dig deeper into OpenAI Whisper (openai.com/blog/whisper/) this weekend. I'll announce a DateTime for the YT live stream here later.

Note: It's not a presentation but a session. You can also join in on the task/Livestream via audio/video through Google Meet.

YT Link: youtube.com/@datadrivenbabe

How much do you know about the sub-field already?

#studywithme #MLops #ml #AI #speechtech #nlp #voicetech #Data

Last updated 2 years ago

Matt Coler · @Mattcoler
328 followers · 53 posts · Server fediscience.org

Maybe an interesting opportunity for students working in #speechtech: Internship (6 months) at NAVER LABS Europe to contribute to the development of a “single TTS model with the ability to control speaker identity, emotions, prosodic focus, etc” europe.naverlabs.com/job/fine-

#speechtech

Last updated 2 years ago

Matt Coler · @Mattcoler
328 followers · 53 posts · Server fediscience.org

Tune in for Dr Shekhar Nayak’s lecture at LITHME 2nd int’l conference, streaming now! Zero-resource speech processing: m3.jyu.fi/jyumv/ohjelmat/hum/k

#speechtech

Last updated 3 years ago

Matt Coler · @Mattcoler
328 followers · 53 posts · Server fediscience.org

Today at () our team will present research on phoneme mapping and source-language selection in transfer learning for TTS in under-resourced languages. Full paper here lrec-conf.org/proceedings/lrec

#speechsynthesis #text2speech #speechtech #lrec2022 #sigul2022

Last updated 3 years ago

Matt Coler · @Mattcoler
328 followers · 53 posts · Server fediscience.org