Wout Bittremieux · @wout
140 followers · 73 posts · Server sigmoid.social

We had an excellent session at the conference last week.

Many thanks to keynote speakers @lgoracci01@twitter.com, @RenardLab@twitter.com, and @tomas_pluskal@twitter.com; all selected speakers; and poster presenters for showcasing the latest computational advances in mass spectrometry, with applications across , , , and more.

#compms #ismbeccb2023 #proteomics #metabolomics #lipidomics

Last updated 2 years ago

PastelBio · @pastelbio
164 followers · 1616 posts · Server mstdn.science

RT @: Keep calm, Pfam is still running!But now it's hosted on the InterPro website! At , we had the opportunity to learn more about @PfamDB and its integration with @InterProDB website. We even won these really cool t-shirts,Thanks!

#ismbeccb2023

Last updated 2 years ago

Chloé Azencott · @cazencott
704 followers · 336 posts · Server lipn.info

Mark Gerstein at : Deep learning is exciting, but let's not forget about the physical and biological models underlying the science we're interested in. Let's make biomedical data science more like weather forecasting.

#ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
104 followers · 168 posts · Server fediscience.org

Névéol: What can we do?
Understand the stakes better.
Facilitate levers like data sharing, shared tasks, and policy.
Write more documentation, for protocols, etc.; elicit audits.

See Cohen-Boulakia et al 2017 Future Gen Comput Syst


#textmining #ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
104 followers · 167 posts · Server fediscience.org

Aurélie Névéol:
How can we make clinical NLP more reproducible? Can NLP also help with reproducibility? Even word or sentence tokenization can be inconsistent. Most NLP folks have, at least once, failed to repeat someone else's experiment, or even their own. Sometimes it's due to differences in preprocessing, software versions, training vs test splits, or other boring things. Availability issues, page limits, and the bias toward novelty don't help either.


#textmining #ismbeccb2023

Last updated 2 years ago

Kirt in Queensland · @kirt
294 followers · 242 posts · Server aus.social

@BOSC

I've think worked out the confusion, the partly overlapping hashtags made me think that it was a satellite meeting for

But it's actually completely separate?

or are and also separate to each other?

some of it was recoded? how long is that available for?

is recording access registered only?

cc other mes @kirt@mastodon.social @kirt@ecoevo.social @kirt@genomic.social

and friends @quinsibell @TashTaylor

now I need to work out what I'm registered for … I might have registered for clashing things because I neglected to put in my calendar…

#bosc2023 #ismbeccb2023 #smbe2023

Last updated 2 years ago

Chloé Azencott · @cazencott
704 followers · 336 posts · Server lipn.info

One perk of attending virtually: watching the recording of a keynote I missed instead of the talk I had planned to watch but turned out not to be interested in.

(I guess you could also plug in your headphones and do the same if you're there in person, but that's noticeably ruder.)

#ismbeccb2023

Last updated 2 years ago

Kirt in Queensland · @kirt
294 followers · 242 posts · Server aus.social

note to self, people and topics to follow from my @kirt@ecoevo.social profile

@biocrusoe

@BOSC

@gedankenstuecke

@OpenBio

@biocrusoe

@OpenBio

@openbioeconomy@bird.makeup

@bgruening

#ismbeccb2023 #bosc2023

Last updated 2 years ago

Madelaine · @mmarchin
71 followers · 102 posts · Server genomic.social

KB: cell type matching across species github.com/kbiharie/TACTiCS

#ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
104 followers · 163 posts · Server fediscience.org

Sylwia Szymanska: Word embeddings capture functions of low complexity regions: scientific literature analysis using a transformer-based language model

Low-complexity regions in proteins are biologically important. But there isn't a database or even a list of these relationships. So let's extract them with a language model.

#textmining #ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
104 followers · 162 posts · Server fediscience.org

Brett Beaulieu-Jones: Can we use large language models with clinical notes to estimate likelihood of seizure recurrence? Yes - and even with good results - but models are difficult to interpret. So can we build a model that includes things we really care about, then add an instructable layer? Yes! Use note metadata as weak supervision -> instructions for the model. A tuned T5-Flan model does really well.


#textmining #ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
103 followers · 158 posts · Server fediscience.org

Krallinger: Organizing shared tasks. Some processes can take years. Examples - CANTEMIST, CodiEsp, MESINESP, MEDDOCAN, MEDDOPROF, ClinSpEn, DisTEMIST. Most recently MEDDOPLACE, PharmaCoNER

#textmining #ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
103 followers · 157 posts · Server fediscience.org

Krallinger: It's important to engage clinical experts from the beginning. That includes their considerations on the content sources.

Annotation guidelines are necessary. See the guides at zenodo.org/communities/medical
Translating these to languages beyond English helps the community.


#textmining #ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
103 followers · 157 posts · Server fediscience.org

Krallinger: Developing language models for clinical data in Spanish. Since clinical text varies so much in structure and content, you need a balance between general language and domain-specific optimization. Need some clear annotation guidelines too.

Really need a set of clear clinical use cases, too.

#textmining #ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
103 followers · 155 posts · Server fediscience.org

Hi .
I'm in Text Mining today.

Martin Krallinger: Unstructured text from clinical narratives is still underused. There are many other text sources too, like patient forums or drug leaflets, but clinical narratives are especially difficult. No out of the box NLP solution works. Need data, infrastructure, and reproducible benchmarks.

#ismbeccb2023

Last updated 2 years ago

Lars Juhl Jensen · @larsjuhljensen
172 followers · 44 posts · Server mas.to

Day 4 recap from : gene regulation, single-cell data, and visualization of spatial transcriptomics. Papers/preprints/links for highlights are in the description.
youtube.com/shorts/TkKDmY6lmZU

#ismbeccb2023

Last updated 2 years ago

António Domingues · @keyboardpipette
224 followers · 2116 posts · Server genomic.social

Oh today I saw more alternative splicing goodies at

#ismbeccb2023 #ismb2023

Last updated 2 years ago

Harry Caufield · @jhc
103 followers · 153 posts · Server fediscience.org

* Zachary Flamholz: Unannotated *viral* proteins. There are many of them, and annotation is usually done by homology. See the PHROGs database of phage genomes - representations of these sequences can accurately identify functional category. Also enables identifying some novel protein families.

See researchsquare.com/article/rs-

#ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
103 followers · 152 posts · Server fediscience.org

* Miguel Fernández Martín: Comparing bacterial protein interactomes to find antibiotic resistance genes. (Back In My Day, we did this with a lot of Y2H). An adaptation of ContextMirror that takes coevolutionary context into account should work. Spoiler: it does. Likely a good way to assemble experimental interactomes with better guidance.

#ismbeccb2023

Last updated 2 years ago

Harry Caufield · @jhc
103 followers · 151 posts · Server fediscience.org

Back to Function!
* Aysun Urhan: What to do with proteins of unknown function? A new species -> new genes. We can make protein sequence embeddings to try to infer homology, though most embedding approaches so far haven't focused on bacteria. Use what we know about operons (including predicting if they haven't been confirmed) and combine with protein embeddings. Then assign GO terms w/ cosine similarity. This does work better than using AA's alone.

See github.com/AbeelLab/sap

#ismbeccb2023

Last updated 2 years ago