YuutaW 🌸 · @yuuta
369 followers · 1627 posts · Server mastodon.yuuta.moe

RISC-V Vector Basics and Development Environment

blog.yuuta.moe/2023/08/08/rvv-

#blog #riscv #simd

Last updated 1 year ago

NLnet Labs · @nlnetlabs
1836 followers · 666 posts · Server fosstodon.org

Our capable zone file parser is still getting faster! It can now parse the .com zone in 16.344s on Jeroen’s laptop. That being said, optimising for the remaining record types is now getting seriously complicated. github.com/NLnetLabs/simdzone/

#simd #dns

Last updated 1 year ago

NLnet Labs · @nlnetlabs
1761 followers · 650 posts · Server fosstodon.org

#ipv4 #simd

Last updated 1 year ago

GStreamer · @GStreamer
194 followers · 95 posts · Server fosstodon.org

📢 New ORC 0.4.34 release of the Optimised inner loop Runtime Compiler!

New parser API with better error reporting, thread-safety improvements around the orc codemem allocator and many smaller improvements, bug fixes and build fixes.

Enjoy!

lists.freedesktop.org/archives

#gstreamer #release #opensource #freesoftware #multimedia #simd #assembly

Last updated 1 year ago

NLnet Labs · @nlnetlabs
1709 followers · 608 posts · Server fosstodon.org

Most of the NLnet Labs crew is going to be at #ripe86! Here's who to find for our projects:
🧑‍🔬 Interested in our #openstandards and #research work? Talk to @willem or @benno.
🏎️ Want to know how our super fast zonefile parser is coming along? Chat with Jeroen.
🧷 Curious about our focus on #memorysafety and DNS implementations in #rustlang? Watch Philip's presentation in dns-wg to learn more...

#ripe86 #opensource #dns #openstandards #research #simd #memorysafety #rustlang

Last updated 1 year ago

NLnet Labs · @nlnetlabs
1640 followers · 558 posts · Server fosstodon.org

We’re very excited that Daniel Lemire is contributing to our blazingly fast, #simd enabled zone file parser. lemire.me/en/ github.com/NLnetLabs/simdzone

#simd #dns #opensource

Last updated 2 years ago

· @dcz
263 followers · 1602 posts · Server fosstodon.org

Ever wanted to use #rust to manipulate images like iterators?

image
.map(to_rgb)
.filter(|r, g, b| b > 100)
.collect();

Well, I don't quite have that, but I do have a #monadic-ish library called #obraztam.

framagit.org/dcz/obraztam

It's #immutable-first, it has .map() and .zip() and it optimizes to #simd instructions. Can be easily parallelized even more.

I used it to detect #laser beams in the picture!
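
The iterator style described in this post can be approximated with plain standard-library iterators over a slice of pixels. The sketch below is not the obraztam API; the `Rgb` type and the `blue_mask` and `lighten` helpers are hypothetical names made up for illustration, and whether the compiler turns these loops into SIMD instructions depends on the target and optimization level.

```rust
/// One RGB pixel, 8 bits per channel (hypothetical type, for illustration only).
#[derive(Clone, Copy, Debug, PartialEq)]
pub struct Rgb {
    pub r: u8,
    pub g: u8,
    pub b: u8,
}

/// Map every pixel to a mask value: 255 where the blue channel
/// exceeds `threshold`, 0 otherwise. The input is left untouched.
pub fn blue_mask(image: &[Rgb], threshold: u8) -> Vec<u8> {
    image
        .iter()
        .map(|p| if p.b > threshold { 255 } else { 0 })
        .collect()
}

/// Zip two images pixel by pixel and keep the larger value of each channel.
/// `zip` stops at the shorter input, so the result has the shorter length.
pub fn lighten(a: &[Rgb], b: &[Rgb]) -> Vec<Rgb> {
    a.iter()
        .zip(b.iter())
        .map(|(x, y)| Rgb {
            r: x.r.max(y.r),
            g: x.g.max(y.g),
            b: x.b.max(y.b),
        })
        .collect()
}
```

Both helpers return new buffers instead of mutating their inputs, which mirrors the immutable-first approach the post mentions. Because each output pixel depends only on the matching input pixels, loops of this shape are also straightforward candidates for auto-vectorization.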

#rust #monadic #obraztam #immutable #simd #laser #graphics #monad #iterator

Last updated 2 years ago

Felix LeClair (Wants a job😊) · @fclc
379 followers · 868 posts · Server mast.hpc.social

As of now the closest to what I want is the instruction vpmadd231d zmm, zmm, zmm which can deal with EPI32

Idea is lowering power requirements while also taking advantage of the fp32 pipe's extremely high performance, which also happens to have lower latency than the integer unit (both CPI 0.5, but the FP unit is lat 4 vs the integer unit being lat 10)

I need more than EPI16 native, but EPI32/64 is a waste of power and precision.

I'd also rather not do horrible things in the FP unit...

#knc #simd

Last updated 2 years ago

NLnet Labs · @nlnetlabs
1579 followers · 510 posts · Server fosstodon.org

Our superfast zone file parser now has fallback functionality to support architectures for which no #simd implementation is available yet. github.com/NLnetLabs/simdzone/

#dns #simd #opensource #clang

Last updated 2 years ago

Rick Wong – Tech & code  · @ryguw
147 followers · 120 posts · Server fosstodon.org

With just ONE line, decorate a function with ‘@njit’ to parallelize it automatically. Multi-core (CPU and GPU) and SIMD vectorization supported. *jaw drops* 😲

numba.pydata.org/

#python #simd #parallelism #numba #jit #decorators

Last updated 2 years ago

Felix LeClair (Wants a job😊) · @fclc
377 followers · 843 posts · Server mast.hpc.social

Hi everyone!

Was chatting with a few students and enthusiasts about the "how" part of why CPUs are so fast. The write-up covers SIMD, multiple ALUs and Out of Order Execution.

Converted it into a GitHub Gist and added some more practical examples that should be approachable to anyone who's written a `for` or `while` loop at some point.

Please find it here: gist.github.com/FCLC/8322c7e0f
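
A rough sketch of the kind of loop-level example such a write-up might use; it is not taken from the gist. The function names `sum_scalar` and `sum_four_lanes` are hypothetical. The first version has a single dependency chain, so every addition waits for the previous one. The second keeps four independent accumulators, which gives an out-of-order core with multiple ALUs (or a SIMD unit) independent work to overlap.

```rust
/// Straightforward sum: one accumulator, so each add depends on the last.
pub fn sum_scalar(xs: &[f32]) -> f32 {
    let mut acc = 0.0f32;
    for &x in xs {
        acc += x;
    }
    acc
}

/// Sum with four independent accumulators. The four running sums do not
/// depend on each other, so the additions in one iteration can execute in
/// parallel, and the compiler may map them onto a single SIMD register.
pub fn sum_four_lanes(xs: &[f32]) -> f32 {
    let mut acc = [0.0f32; 4];
    let chunks = xs.chunks_exact(4);
    // Elements left over when the length is not a multiple of four.
    let tail = chunks.remainder();
    for c in chunks {
        for i in 0..4 {
            acc[i] += c[i];
        }
    }
    // Combine the four partial sums, then add the leftover elements.
    let mut total = (acc[0] + acc[1]) + (acc[2] + acc[3]);
    for &x in tail {
        total += x;
    }
    total
}
```

Floating-point addition is not associative, so the two functions can differ in the last bits for arbitrary inputs. That is also why a compiler will usually not perform this transformation on `sum_scalar` by itself without relaxed floating-point settings.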

#cpu #simd #hpc #x86 #arm #c #ooo #ooe

Last updated 2 years ago

NLnet Labs · @nlnetlabs
1468 followers · 447 posts · Server fosstodon.org

Want to hear how we are targeting 1GB/s to parse a zone file? Jeroen's talk is now live! Lots of details in the Q&A at the end. fosdem.org/2023/schedule/event

#dns #fosdem23 #fosdem #opensource #simd #softwaredevelopment

Last updated 2 years ago

NLnet Labs · @nlnetlabs
1411 followers · 420 posts · Server fosstodon.org

This Saturday in the #dns Devroom at @fosdem Jeroen will be presenting his work on simdzone, a fast and standards compliant DNS zone file parser. 🚀

Running simdzone on an Intel Core i7-1065G7 against an older .com zone file of 12482791271 bytes under Linux (Fedora 37).

GCC 12.2.1, release mode:

$ time ./parser ../../zones/com.zone
parsed 341535548 records

real 0m17.755s
user 0m16.602s
sys 0m1.105s

github.com/NLnetLabs/simdzone
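
The figures quoted in this post can be turned into a throughput number. The sketch below uses hypothetical helper names and assumes 1 MB = 10^6 bytes and the wall-clock ("real") time of 17.755 s. With those assumptions the run works out to roughly 703 MB/s and about 19.2 million records per second.

```rust
/// Throughput in megabytes per second, assuming 1 MB = 1_000_000 bytes.
pub fn throughput_mb_per_s(bytes: u64, seconds: f64) -> f64 {
    bytes as f64 / 1_000_000.0 / seconds
}

/// Number of records parsed per second of wall-clock time.
pub fn records_per_s(records: u64, seconds: f64) -> f64 {
    records as f64 / seconds
}
```

Calling `throughput_mb_per_s(12_482_791_271, 17.755)` and `records_per_s(341_535_548, 17.755)` reproduces the two numbers above. Using the "user" time instead of the "real" time would give a slightly higher figure, since it excludes time spent in the kernel and waiting on I/O.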

#dns #clang #fosdem #fosdem23 #softwaredevelopment #simd

Last updated 2 years ago

NLnet Labs · @nlnetlabs
1403 followers · 417 posts · Server fosstodon.org

This Saturday at #fosdem we'll be presenting in the #dns Devroom twice!

First up will be Jeroen, talking about parsing zone files really fast. Later in the session Philip will be presenting on Connectbyname and the Proxy Control option. fosdem.org/2023/schedule/track

#fosdem #dns #opensource #softwaredevelopment #openstandards #simd

Last updated 2 years ago

dkorunic · @dkorunic
38 followers · 97 posts · Server mastodon.cloud

Testing SIMD instructions on ARM with Rust on Android buff.ly/402KQND

#android #simd #programming

Last updated 2 years ago

jbz · @jbzfn
255 followers · 1770 posts · Server mastodon.social

📍 Are GPUs FASTER than CPUs? (not exactly...)
— Low Level Learning

youtu.be/xi-wTlVUZsQ

#gpu #cpu #simd #computation

Last updated 2 years ago

CK's Technology News · @CKsTechNews
1768 followers · 2342 posts · Server cktn.todon.de

#simd instructions on ARM with Rust on #android tested

Blog post
gendignoux.com/blog/2023/01/05

#simd #android

Last updated 2 years ago

René Dudfield · @renedudfield
43 followers · 55 posts · Server fosstodon.org

A while ago I kept a mini 4-day blog whilst hacking about on sort with AVX-512 github.com/illume/renesortfun

Learnt some AVX-512 for the first time! Read a lot, including some good books and papers on optimization. Hoping to take it further some time. Some good speedups to be had. I love doing low level stuff occasionally. Whilst it is simpler, it also requires creativity, and a different sort of zen than other types of programming.

#simd

Last updated 2 years ago

Stephen Whittle · @mov_eax_rgb
92 followers · 4 posts · Server mastodon.gamedev.place

What do people use to analyze code these days? I've got some code targeting Zen2 that I want to profile or statically analyze. AMD's uProf only samples at 1ms intervals, and I don't think microbenching is going to be much help.

#avx2 #simd

Last updated 2 years ago

Felix LeClair (Wants a job😊) · @fclc
330 followers · 469 posts · Server mast.hpc.social

Damn! The legendary @lynne of FFMPEG fame is here!

For those unaware, Lynne handles a lot of the incredible #simd acceleration found in #ffmpeg, including in x86_64 and Arm/aarch!

#simd #ffmpeg

Last updated 2 years ago