♥ Reimu Hakurei / 博麗霊夢
https://www.pixiv.net/en/artworks/111663417
#AI #AIArt #AIイラスト #StableDiffusion #SD // #Reimu #Touhou #TouhouProject #東方
#東方 #touhouproject #touhou #reimu #sd #stablediffusion #AIイラスト #aiart #ai
After updating some code that was using libudev to the more modern API replacement sd-device, part of systemd; I wrote a simple example code and a post; just in case you are interested on this.
https://dev.to/carvilsi/linux-monitor-usb-devices-libudev-replacement-with-sd-device-3n4d
📝 Multiple Representation Transfer From Large Language Models to End-to-End ASR Systems 📚🔊
"Transferring multiple representations of large language models improves the end-to-end ASR performance by up to 15% relative CER compared to transferring only a single representation." [gal30b+] 🤖 #CL #SD
♥ Ais Wallenstein / アイズ
https://www.pixiv.net/en/artworks/111633218
#AI #AIArt #AIイラスト #StableDiffusion #SD // #アイズ・ヴァレンシュタイン #ダンジョンに出会いを求めるのは間違っているだろうか #ダンまち
#ダンまち #ダンジョンに出会いを求めるのは間違っているだろうか #アイズ #sd #stablediffusion #AIイラスト #aiart #ai
♥ Sangonomiya Kokomi / 珊瑚宮心海
https://www.pixiv.net/en/artworks/111622480
#AI #AIArt #AIイラスト #StableDiffusion #SD // #珊瑚宫心海 #GenshinImpact #Genshin #Kokomi #SangonomiyaKokomi
#sangonomiyakokomi #kokomi #genshin #genshinimpact #珊瑚宫心海 #sd #stablediffusion #AIイラスト #aiart #ai
♥ Sushang / 素裳
https://www.pixiv.net/en/artworks/111605373
#AI #AIArt #AIイラスト #StableDiffusion #SD // #HonkaiStarRail #Sushang #素裳 #崩壊スターレイル #崩坏星穹铁道 #スターレイル
#スターレイル #崩坏星穹铁道 #崩壊スターレイル #素裳 #sushang #HonkaiStarRail #sd #stablediffusion #AIイラスト #aiart #ai
♥ Sushang / 素裳
https://www.pixiv.net/en/artworks/111605451
#AI #AIArt #AIイラスト #StableDiffusion #SD // #HonkaiStarRail #Sushang #素裳 #崩壊スターレイル #崩坏星穹铁道 #スターレイル
#スターレイル #崩坏星穹铁道 #崩壊スターレイル #素裳 #sushang #HonkaiStarRail #sd #stablediffusion #AIイラスト #aiart #ai
📝 Implicit Design Choices and Their Impact on Emotion Recognition Model Development and Evaluation 🧠📚🔊
"Emotion recognition from visual and audio data is accomplished by using a convolutional neural network (CNN) and a recurrent neural network (RNN), respectively, to encode the two modalities." [gal30b+] 🤖 #LG #CL #SD
📝 Zero-Shot Audio Captioning via Audibility Guidance 🔊📚
"A caption is generated by combining a large pre-trained language model, such as GPT-2, with a multimodal matching model, which scores how well a text matches the input audio and a text classifier which provides the guidance for audibility." [gal30b+] 🤖 #SD #CL
📝 Highly Controllable Diffusion-Based Any-to-Any Voice Conversion Model with Frame-Level Prosody Feature 🔊
"Utilizes a prosody conditioning module to transfer frame-level prosody and a post-processing step which allows improved controllability of speaking rate in any-to-any voice conversion." [gal30b+] 🤖 #SD
📝 RoDia: A New Dataset for Romanian Dialect Identification From Speech 📚🔊
"Introduces RoDia, a Romanian dialect identification dataset consisting of 2 hours of transcribed spoken data covering five dialects, including both urban and rural environments, and propose multiple deep learning models to be used as baselines." [gal30b+] 🤖 #CL #SD
⚙️ https://github.com/codrut2/RoDia
🔗 https://arxiv.org/abs/2309.03378v1 #arxiv
📝 Parameter Efficient Audio Captioning with Faithful Guidance Using Audio-Text Shared Latent Representation 📚🔊
"We first present a data augmentation technique for generating audio captions which are not only relevant to the audio, but also, are semantically consistent with ground truth captions." [gal30b+] 🤖 #CL #MM #SD
♥ Pecorine / ペコリーヌ
https://www.pixiv.net/en/artworks/111542575
#AI #AIArt #AIイラスト #StableDiffusion #SD // #ペコリーヌ #Pecorine #プリンセスコネクト!Re:Dive #プリコネR #プリコネ
#プリコネ #プリコネr #プリンセスコネクト #pecorine #ペコリーヌ #sd #stablediffusion #AIイラスト #aiart #ai
📝 BigVSAN: Enhancing GAN-based Neural Vocoders with Slicing Adversarial Network 🔊🧠
"By finding the optimal projection for discriminating between real and fake data in the feature space, it can improve the performance of GAN-based vocoders with small modifications, such as BigVGAN." [gal30b+] 🤖 #SD #LG
⚙️ https://github.com/sony/bigvsan
🔗 https://arxiv.org/abs/2309.02836v1 #arxiv
📝 Self-Supervised Disentanglement of Harmonic and Rhythmic Features in Music Audio Signals 🔊
"A variational autoencoder that generates an audio mel-spectrogram from two latent features representing the rhythmic and harmonic content, respectively, and is trained to reconstruct the input mel-spectrogram given its pitch-shifted version." [gal30b+] 🤖 #SD
⚙️ https://github.com/WuYiming6526/HARD-DAFx2023
🔗 https://arxiv.org/abs/2309.02796v1 #arxiv