FedSearch - Federated network search engine

FelixCLC (16 weeks of 🐌 HR) · @fclc

502 followers · 1935 posts · Server mast.hpc.social

Linux 6.6 Unconditionally Enables x86 CPU Microcode Loading Support

Well shit: https://www.phoronix.com/news/Linux-6.6-x86-microcode

Looks to me like this may screw over those of use relying on disabling kernel MC (fallsback to bios MC) for #AVX512 on AlderLake

Need to look deeper and ping some folks

#avx512

Last updated 1 year ago

Original post

FelixCLC (still waiting on HR) · @fclc

489 followers · 1780 posts · Server mast.hpc.social

Something doesn't fit in the register article covering AVX10.

An #Intel fellow comments that AVX10 requires 256 bit or larger implementations, yet there's an *explicit* carve out in the spec for 128 bit implementations, & that carve out means that you *must* plan for AVX10.N/128, instead of having a true baseline of AVX10.N/256

the specific passage is "[... ] 256 bit instructions will be the minimum width required by the AVX10 instruction set."

https://www.theregister.com/2023/08/15/avx10_intel_interviews/
#HPC #AVX512 #avx10

#intel #hpc #avx512 #avx10

Last updated 1 year ago

Original post

Linh Pham · @qlp

499 followers · 2801 posts · Server linh.social

Double yikes in CPU vulnerabilities! Both articles are from #ServeTheHome

#Intel DOWNFALL Ultra-Scary #AVX2 and #AVX512 Side channel Attack Discovered

https://www.servethehome.com/intel-downfall-ultra-scary-avx2-and-avx-512-side-channel-attack-discovered/

New Inception Vulnerability Impacts ALL #AMD Zen CPUs Yikes

https://www.servethehome.com/new-inception-vulnerability-impacts-all-amd-zen-cpus-yikes-phantom/

#Security

#servethehome #intel #avx2 #avx512 #amd #security

Last updated 1 year ago

Original post

Benjamin Carr, Ph.D. 👨🏻‍💻🧬 · @BenjaminHCCarr

967 followers · 2482 posts · Server hachyderm.io

Intel AVX10: Taking AVX-512 With More Features & Supporting It Across P/E Cores

Intel #AVX10 is a new #ISA that includes "all the richness" of #AVX512 and additional features/capabilities while being able to work for both P/E cores. Intel says AVX10 will be "the vector ISA of choice" moving forward.Very exciting from hardware perspective with Intel's #opensource track record around new ISA support it means we'll likely start seeing software enablement preparations begin soon so that everything is upstream and ready by time supported processors ship.
https://www.phoronix.com/news/Intel-AVX10

#avx10 #isa #avx512 #opensource

Last updated 1 year ago

Original post

Jason Pester (GameDev) · @jay

335 followers · 536 posts · Server mastodon.gamedev.place

💡 Interesting news... Intel AVX-512 becomes AVX10

I remember a year or two ago, Pixar had worked with Intel to take advantage of AVX-512 in its XPU tech, but then Intel seemed to limit AVX-512 support to only a few of its CPUs. AVX10 seems like good news 👍

Intel Is Making Big Changes To The x86 ISA With APX And AVX10 Extensions
https://hothardware.com/news/intel-apx-and-avx10-extensions

#HotHardware #Intel #AVX512 #AVX10 #CPU #Vector #Tensor #Math #Pixar #XPU #CGI #ComputerGraphics #3D #Rendering #GameDev #DCC #AI #ML

#hothardware #intel #avx512 #avx10 #cpu #vector #Tensor #math #pixar #xpu #cgi #computergraphics #3d #rendering #gamedev #dcc #ai #ml

Last updated 1 year ago

Original post

Qiita - 人気の記事 · @qiita

23 followers · 764 posts · Server rss-mstdn.studiofreesia.com

今アツイ𝕏といえば… ISA e𝕏tension! AV𝕏10とAP𝕏で夏を乗りこえよう
https://qiita.com/tanakmura/items/dfd99fa2359d7f42bb21?utm_campaign=popular_items&utm_medium=feed&utm_source=popular_items
#qiita #x86 #AVX #avx512 #AVX2 #AVX_512

#qiita #x86 #avx #avx512 #avx2 #avx_512

Last updated 1 year ago

Original post

Felix LeClair (waiting on HR) · @fclc

473 followers · 1625 posts · Server mast.hpc.social

@enp1s0 Specifically this would be using AVX512IFMA, which uses a 52 bit signed integer borrowed from the FP64 SIMD unit that internally accumulates to int104.

Based off the paper, I think this means we can 2 cycle over what Tsuki did with Alg3/4 and exceed whats needed to compete with cursed 2xfp64 setups.

Also relevant is that next gen Sierra Forest, Grand Ridge, Arrow Lake, Lunar Lake are all getting AVX-IFMA (a weaker, but usable version of the AVX512IFMA extension)

#AVX512

#avx512

Last updated 1 year ago

Original post

Hanno Rein · @hannorein

445 followers · 481 posts · Server botsin.space

Open media

The code is of course freely available in the REBOUND package (https://github.com/hannorein/rebound). To use the new fast WHFast512 integrator you need a CPU with #AVX512 instructions. If you don't have one, you should at least check out the paper to see what we do. You'll get rewarded with some fancy #latex #tikz illustrations of the algorithm!

#avx512 #latex #tikz

Last updated 1 year ago

Original post

Hanno Rein · @hannorein

441 followers · 470 posts · Server botsin.space

🚨 New paper!

We re-implement the symplectic WHFast integrator for planetary systems using #AVX512 instructions. We get an almost 5x speedup for a typical integration of the Solar system.

This is a big deal because no matter how much money you spend buying a cluster or GPUs, you just cannot accelerate small N-body integrations. They are inherently sequential. But now a 5 GYr simulation that used to take a week to finish only takes 1.4 days.

#hpc #astrodon #nbody
https://arxiv.org/abs/2307.05683

#avx512 #hpc #Astrodon #nbody

Last updated 1 year ago

Original post

Felix LeClair(received offers) · @fclc

411 followers · 1322 posts · Server mast.hpc.social

The feature I miss the most from Twitter on Mastodon is quote tweets, explicitly of my own for the use case of related but divergent technical threads.

Working on the #GPGPU && #AVX512 accelerated build of #PETSc to enable faster execution of #OpenFOAM as of now and all the relevant threads would have to be independent threads and hard to cross reference as new threads/tangents pop up.

#gpgpu #avx512 #petsc #openfoam

Last updated 1 year ago

Original post

Phoronix · @phoronix

2759 followers · 2502 posts · Server noc.social

Open media

#GCC @gnutools Lands #AVX512 Fully-Masked Vectorization

https://www.phoronix.com/news/GCC-AVX-512-Fully-Masked-Vector

Original tweet : https://twitter.com/phoronix/status/1670741762535563264

#avx512 #gcc

Last updated 1 year ago

Original post

Felix LeClair(received offers) · @fclc

406 followers · 1277 posts · Server mast.hpc.social

Bit of a meme, but this is a little like "Aurora/Sunspot" at home.

Single node with the host CPU being Goldencove #AVX512 (same as SPR) and using an Intel GPU (specifically Arc A770) and reading off of flash.

#avx512

Last updated 1 year ago

Original post

Felix LeClair(received offers) · @fclc

405 followers · 1273 posts · Server mast.hpc.social

Sanity check, but looks to me like LLVM (and by extension ICX) are slightly borked in this context?

Code is a version of the intel AMX examples

https://godbolt.org/z/MdxEK7jsb

#AMX #AVX512

#amx #avx512

Last updated 2 years ago

Original post

Felix LeClair (Wants a job😊) · @fclc

379 followers · 866 posts · Server mast.hpc.social

#AVX512 extension request: IFMA-52 but lower precision integers.

IFMA-52 is nice because of it's high intermediate precision as well as great throughput (but high latency). I suspect 52 is convenient because of the FP64 FMA unit.

Perhaps an FP32 based IFMA-22 could be doable?

#intel #HPC #ai #x86 #YetAnotherISAExtension

#avx512 #intel #hpc #ai #x86 #yetanotherisaextension

Last updated 2 years ago

Original post

Phoronix · @phoronix

2118 followers · 1656 posts · Server noc.social

Open media

.@IntelSoftware @IntelDevTools Releases x86-simd-sort v1.0 Library For High Performance #AVX512 Sorting

-- This is the AVX-512 code that a few weeks back was integrated into #Python #Numpy for 10~17x speed-ups for this quicksort implementation.

https://www.phoronix.com/news/x86-simd-sort-1.0

Original tweet : https://twitter.com/phoronix/status/1633250284825636864

#numpy #python #avx512

Last updated 2 years ago

Original post

Felix LeClair (Wants a job😊) · @fclc

365 followers · 774 posts · Server mast.hpc.social

If someone wants to learn assembly using the most up to date ISA's on real hardware that they own, what's actually available for a reasonable cost? Looking at x86, Arm and RISC-V
Vectors ISA: AVX512, SVE2, RISC-V V
Matrix ISA: AMX, SME, RISC-V M

x86_64:
#AVX512 You're in relatively good shape. IceLake is a bargain, as is Tigerlake. Zen4 is pricier, but still accessible

#AMX As of now, it's "SoonTM", mainly depending on what boards will cost for SPR-W. Chip, board and Ram, hopefully sub 1200

#avx512 #amx

Last updated 2 years ago

Original post