Giuseppe Bilotta · @giuseppebilotta
155 followers · 291 posts · Server fediscience.org

one is a build failure against my GPU (already reported, with a fix ready and pending release), and the other is … slow performance in one of the API calls that we use!

Turns out, `sort_by_key`, at least in the way we use it, is somewhere between 25% and 30% slower on my iGPU when using the latest (from the 5.6.0 software stack) than it is on the *CPU* when using the latest with the OpenMP backend!

#rocthrust #amd #thrust

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 289 posts · Server fediscience.org

So, one of the reasons why we could implement the backend easily in is that provides drop-in replacement for much of the ‌ libraries, including , which (as I mentioned in the other thread) is a fork of with a /‌ backend.
This is good as it reduces porting effort, *but* it also means you have to trust the quality of the provided implementation.

#thrust #rocthrust #CUDA #nvidia #ROCm #amd #GPUSPH #hip

Last updated 1 year ago