Giuseppe Bilotta · @giuseppebilotta
155 followers · 289 posts · Server fediscience.org

So, one of the reasons why we could implement the backend easily in is that provides drop-in replacement for much of the ‌ libraries, including , which (as I mentioned in the other thread) is a fork of with a /‌ backend.
This is good as it reduces porting effort, *but* it also means you have to trust the quality of the provided implementation.

#thrust #rocthrust #CUDA #nvidia #ROCm #amd #GPUSPH #hip

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 283 posts · Server fediscience.org

Anyway, as I mentioned recently, I have a new workstation that finally allows me to test our code using all three backends (, /‌ and w/ ) thanks to having an processor with an integrated in addition to a discrete ‌ GPU.
Of course the iGPU is massively underpowered compared to the high-end dGPU workhorse, but I would expect to outperform the CPU on most workloads.
And this is where things get interesting.

#nvidia #gpu #ryzen #amd #opemp #cpu #hip #ROCm #CUDA

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 281 posts · Server fediscience.org

So obviously I took the opportunity to install both and ‌/‌ and make sure our software still built and ran correctly.
And honestly, it's unpleasant that in 2023 you still have to do some hoop jumping for either platform.
With CUDA, the issue is always making sure that you have a supported gcc version.
With ROCm, it's easy to trip on unsupported/untested hardware to find the right combination of env vars and define to make it go.

#ROCm #hip #CUDA

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
147 followers · 248 posts · Server fediscience.org

Corporate at its worst: controls the library and its , and backend. provides rocThrust, that is just Thrust with the CUDA part stripped an a new backend for / . Nobody* is working on a backend for
provides its own alternative as , which is NOT a drop-in replacement.

This is why we can't have nice things.

*there's a dead project here
github.com/wdmapp/syclthrust

#onedpl #oneAPI #intel #sycl #hip #ROCm #amd #tbb #OpenMP #CUDA #thrust #nvidia #floss

Last updated 2 years ago

Giuseppe Bilotta · @giuseppebilotta
144 followers · 229 posts · Server fediscience.org

Making progress with on the for . One does need to install the @archlinux community repo, to get the packages, but that's relatively painless. However:

Auto-detected GCN arch gfx1033 with flag 0x97ff (AMD Custom GPU 0405)
error: Illegal instruction detected: Invalid dpp_ctrl value: broadcasts are not supported on GFX10+
renamable $vgpr39 = V_MOV_B32_dpp undef $vgpr39(tied-def 0), killed $vgpr52, 322, 15, 15, 0, implicit $exec

I'll have to look into this.

#GPUSPH #SteamDeck #ROCm #amd

Last updated 2 years ago

Giuseppe Bilotta · @giuseppebilotta
144 followers · 227 posts · Server fediscience.org

Does anybody know if the stack works on he ?

Asking for a friend

#askfedi #SteamDeck #valve #ROCm #amd

Last updated 2 years ago

Oblomov · @oblomov
473 followers · 13894 posts · Server sociale.network

Oh, seems to be finally transitioning more aggressively to , which on the one hand, nice, on the other, it's breaking all my oldish python2-based versions. No more testing on those, it seems.

#ROCm #python3 #sid #debian

Last updated 2 years ago

Oblomov · @oblomov
473 followers · 13894 posts · Server sociale.network

Oh, seems to be finally transitioning more aggressively to , which on the one hand, nice, on the other, it's breaking all my oldish python2-based versions. No more testing on those, it seems.

#ROCm #python3 #side #debian

Last updated 2 years ago

Giuseppe Bilotta · @giuseppebilotta
122 followers · 88 posts · Server fediscience.org

@ashwinvis The stack might or might not work. is more likely to work than in my experience, unless you build the stack yourself, because the -shipped packages tend to disable OpenCL for older archs (even if it runs, it's not supported).
I found the only way to check if it works or not is to install the packages and try.

@anteru may have more information or may be able to direct you to it.

#amd #opencl #hip #ROCm

Last updated 2 years ago

ƒ(t) = ∮ ashwinvis ㎗ · @ashwinvis
166 followers · 1196 posts · Server fediscience.org

Do you know if it is possible to do computing with AMD integrated graphics? Say on:

AMD Radeon™ RX Vega 6 Graphics (AMD Ryzen 5 4600H)

I read through here, but it is not very clear to me.

github.com/RadeonOpenCompute/R


#gpu #opencl #ROCm #hip #gpgpu #help #fedihelp

Last updated 3 years ago