Felix LeClair (waiting on HR) · @fclc
473 followers · 1625 posts · Server mast.hpc.social

Anyone happen to have historical data comparing the Extended implementation and performance across different packages?

I'm back on grid, re-reading the spec (netlib.org/blas/blast-forum/ch) and I *think* there's nothing stopping me from having a routine using the same underlying techniques as @enp1s0 pointed out in their (his?) recent paper.

Partially because @steve gave a subtle nod of "it's not insane", I think it might workout well?

#blas #egemm #hpc #hpl

Last updated 1 year ago

Felix LeClair (waiting on HR) · @fclc
445 followers · 1596 posts · Server mast.hpc.social

In the context of ? It’s

In the context of Raytracing? It’s 😩LAS and 😈LAS

#hpc #blas

Last updated 1 year ago

Gretl · @gretl
23 followers · 66 posts · Server econtwitter.net

🖥️💪 Do you struggle with numerical efficiency in your work? 🤔👨‍💼 Marcin Błażejowski has the answer! 💡👨‍🏫

Presented at the conference 2023 work on "Which C compiler and / library should I use?" to learn expert insights on configuration that boosts your work efficiency.

#gretl #blas #lapack #economics #econometrics #gc2023 #econtwitter

Last updated 1 year ago

Gretl · @gretl
21 followers · 51 posts · Server econtwitter.net

🖥️💪 Do you struggle with numerical efficiency in your work? 🤔👨‍💼 Marcin Błażejowski has the answer! 💡👨‍🏫

Presented at the conference 2023 work on "Which C compiler and / library should I use?" to learn expert insights on configuration that boosts your work efficiency.

#gretl #blas #lapack #numeralanalysis #economics #econometrics #gc2023 #econtwitter

Last updated 1 year ago

Scalable Analyses · @scalable
2 followers · 42 posts · Server fosstodon.org

Matrix-matrix multiplications in BLIS. James H. Wilkinson Prize for Numerical Software at

#siamcse23 #gemm #hpc #blas #arm

Last updated 2 years ago

Felix LeClair (Wants a job😊) · @fclc
363 followers · 767 posts · Server mast.hpc.social

Mixed precision things:

foundation gave us conversion instruction for fp16->fp32->fp16 instructions.

Overtime, these instructions got faster and faster. That's great!

From SkylakeX to ICL, we went from Latency7, CPI 1 to Latency7 CPI 0.5!

Great.

But with SPR having dedicated FP16 math extensions, the importance of datatype conversion was "downgraded"

from 7 and 0.5 to 8 and 1

#blas #avx512

Last updated 2 years ago

Felix LeClair (Wants a job😊) · @fclc
363 followers · 762 posts · Server mast.hpc.social

@sri @hipsterelectron

@karolherbst is already working towards SYCL support (don't remember if it's 2020 or 1.2.1?) with the RustICL project within mesa for opensource, OpenCL drivers.

I'd be looking at first towards a Rust written , compatible library that also provides classical Bindings.

As of now there are BLAS bindings for rust, and rust written libraries that provide BLAS functionality, but there isn't a rust BLAS project that caters to a general audience (AFAIK)

#sycl #blas #fortran

Last updated 2 years ago

n-gons · @ngons
972 followers · 1062 posts · Server mathstodon.xyz
Ben Fulton · @benfulton
88 followers · 155 posts · Server fosstodon.org

Just compiling up a version of NCBI Blast for use on a large node. Anyone know why it looks for Boost/LaPack/GSL but doesn't require them? Will it be faster if I make them available?

#bioinformatics #hpc #ncbi #blast #blas

Last updated 2 years ago

La ilustración navideña que dibujé en 2018 con los personajes de Barrio Sesamo.
Sigue siendo una de mis favoritas (sino la que más) y mira que ya tiene tiempo.

¡Feliz Navidad a todos!

#art #arte #christmas #holiday #dibujante #cartoon #comicartist #sesamestreet #elmo #epi #blas #coco #groomer #cookiemonster #cartoonart #fanart #artwork

Last updated 2 years ago

Felix LeClair (Wants a job😊) · @fclc
260 followers · 185 posts · Server mast.hpc.social

Time for an !
I'm a young Canuck with interests/experience in , , , , , , , heterogeneous compute & other such things.

Currently my personal projects are bringing to the library, working to standardize what Complex domain BLAS FP16 kernels/implementations should look like, and making sure is available everywhere.

I also write every now and again. Here's the tail of AVX512 FP16 on Alderlake
gist.github.com/FCLC/56e4b3f4a

#introduction #hpc #linux #blas #sycl #c #avx512 #rust #fp16 #openblas

Last updated 2 years ago

Felix LeClair (Wants a job😊) · @fclc
254 followers · 136 posts · Server mast.hpc.social

Was going through the Risc-V Vector ISA spec (as you do) and noticed this little gem:

Specifically the line "When 16-bit and 128-bit element widths are added, they will be also be treated as IEEE-754/2008-compatible values. "

Unless I'm miss interpreting this, is Risc-V indicating future *native* support for 128 bit integer and floating point?

On the other hand, because I'm that guy: GOSH DARN IT, WHY NOT SHIP FP16 AS PART OF V.1 😭
github.com/riscv/riscv-v-spec/

#hpc #blas #riscv #fp16 #asm

Last updated 2 years ago

FelixCLC · @FelixCLC
130 followers · 136 posts · Server mastodon.social
ijliao · @ijliao
298 followers · 6174 posts · Server g0v.social
Gerald Leppert :verified: · @gerald_leppert
1249 followers · 595 posts · Server bonn.social
Debian · @debian
6531 followers · 674 posts · Server framapiaf.org

Good news especially for Debian scientific computing users: BLAS/LAPACK Ecosys massive update lists.debian.org/debian-scienc

#lapack #blas #linearalgebra

Last updated 5 years ago