Giuseppe Bilotta · @giuseppebilotta
155 followers · 290 posts · Server fediscience.org

Turns out, the ecosystem is less mature than the one it emulates (unsurprising, giving how much more recent it is), and has *obviously* been tested much less in more exotic hardware configurations and with the wide variety of software and developers the CUDA libraries have had interact with.
In the few days in which I've had the opportunity to play with it, I've already discovered two issues with it:

#CUDA #nvidia #hip #amd

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 289 posts · Server fediscience.org

So, one of the reasons why we could implement the backend easily in is that provides drop-in replacement for much of the ‌ libraries, including , which (as I mentioned in the other thread) is a fork of with a /‌ backend.
This is good as it reduces porting effort, *but* it also means you have to trust the quality of the provided implementation.

#thrust #rocthrust #CUDA #nvidia #ROCm #amd #GPUSPH #hip

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 284 posts · Server fediscience.org

The CA model is something of a weird beast. It's actually (the descendant of) my first work done converting an existing C++ code base to . It also sports some support for multi-GPU that was developed by a student of ours when NVIDIA didn't have much support for that in the first place (andof course, by the time this student finished their work, NVIDIA‌ released CUDA 4 which obsoleted most of it.)

#phd #CUDA

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 283 posts · Server fediscience.org

Anyway, as I mentioned recently, I have a new workstation that finally allows me to test our code using all three backends (, /‌ and w/ ) thanks to having an processor with an integrated in addition to a discrete ‌ GPU.
Of course the iGPU is massively underpowered compared to the high-end dGPU workhorse, but I would expect to outperform the CPU on most workloads.
And this is where things get interesting.

#nvidia #gpu #ryzen #amd #opemp #cpu #hip #ROCm #CUDA

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 281 posts · Server fediscience.org

So obviously I took the opportunity to install both and ‌/‌ and make sure our software still built and ran correctly.
And honestly, it's unpleasant that in 2023 you still have to do some hoop jumping for either platform.
With CUDA, the issue is always making sure that you have a supported gcc version.
With ROCm, it's easy to trip on unsupported/untested hardware to find the right combination of env vars and define to make it go.

#ROCm #hip #CUDA

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
147 followers · 248 posts · Server fediscience.org

Corporate at its worst: controls the library and its , and backend. provides rocThrust, that is just Thrust with the CUDA part stripped an a new backend for / . Nobody* is working on a backend for
provides its own alternative as , which is NOT a drop-in replacement.

This is why we can't have nice things.

*there's a dead project here
github.com/wdmapp/syclthrust

#onedpl #oneAPI #intel #sycl #hip #ROCm #amd #tbb #OpenMP #CUDA #thrust #nvidia #floss

Last updated 2 years ago

Giuseppe Bilotta · @giuseppebilotta
140 followers · 189 posts · Server fediscience.org

The most important is definitely our dependency on in a couple of places, but the extensive use of constant and shared memory is also a problem, since the syntax for their use in is very different from the one in , which would pose a problem for our plans to keep the device source code unified across APIs.

#CUDA #sycl #thrust

Last updated 2 years ago

Giuseppe Bilotta · @giuseppebilotta
140 followers · 188 posts · Server fediscience.org

Here's another interesting thing about on : currently, it uses the backend, because the only backends available in are for and through . I've actually looked at adding support too, but there are a few structural issues that currently prevent a solution as straightforward as the one used to implement CPU and HIP support.

#sycl #hip #amd #CUDA #nvidia #gpu #cpu #android #GPUSPH

Last updated 2 years ago

Quantum Brilliance Launches Software Suite Enabling Exploration of Applications with Miniature Quantum Computers thequantuminsider.com/?p=23509 Insider Brief Quantum Brilliance announced the launch of its Qristal software suite, enabling R&D teams to explore integrating quantum systems in real-world applications. Developers and researchers can use Qristal, d develop and test novel quantu

#Quantum_Computing_Business #CUDA #Mark_Luo #NVIDIA #Qristal_SDK #Quantum_Brilliance #quantumdaily

Last updated 2 years ago

Tech Feeder · @techfeeder
1 followers · 510 posts · Server social.dytrych.cloud

How CUDA programming works @nVidia blog.adafruit.com/?p=535489

#CUDA #graphics #gpu

Last updated 2 years ago

2877133136 · @2877133136
1 followers · 13801 posts · Server 285936586.masto.host

Tech demo video for the latest version of the simulator alien. It's open source and available at t.co/E1Un9uGxBc. twitter.com/chrx_h/status/1401

#CUDA #ArtificialLife

Last updated 4 years ago

ƒ(t) = ∮ ashwinvis ㎗ · @ashwinvis
166 followers · 1197 posts · Server fediscience.org

Last week I attended a course on computing. Specifically for devices and its API. Here are the recorded lectures:

invidious.site/playlist?list=P

And the course material:

git.cs.umu.se/mirkom/gpu_cours

It covers different levels, with hands on exercises in .

#gpu #CUDA #c

Last updated 4 years ago