Giuseppe Bilotta · @giuseppebilotta
161 followers · 319 posts · Server fediscience.org

Part of the issue is that the presentation is more of a showcase of what #GPUSPH can do and how, so it's hard to find a classic bullet-point synthesis of the thing. I could go with «… and this is why it's awesome» but I'm not sure the audience has the sense of humor to take that the right way.

#GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
161 followers · 319 posts · Server fediscience.org

I'm sitting here trying to finish the presentation on #GPUSPH to be presented at #simai2023 next week, and while the thing is “done” overall, I can't think of anything to put on the #conclusions (final slide). I'm stymied.

#conclusions #simai2023 #GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 295 posts · Server fediscience.org

(That being said, if anyone wants to implement a sort-by-key and segmented reduction that don't depend on Thrust, and contribute it to #GPUSPH, I'm not going to complain.)
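For readers unfamiliar with the two primitives, here is a minimal sequential sketch in standard C++, with no Thrust dependency. The names `sort_by_key` and `reduce_by_key` are borrowed from Thrust's algorithms of the same name; the signatures, the choice of summation as the reduction, and the assumption that `keys` and `values` have the same length are illustrative only. This is neither GPUSPH code nor a GPU implementation; it only pins down the expected behavior.

```cpp
#include <algorithm>
#include <cstddef>
#include <numeric>
#include <utility>
#include <vector>

// Stable sort of keys, applying the same permutation to values.
// Keys and values stay in separate arrays, as they would on a GPU.
// Assumes keys.size() == values.size().
template <typename K, typename V>
void sort_by_key(std::vector<K>& keys, std::vector<V>& values)
{
    // Sort an index permutation instead of the data itself.
    std::vector<std::size_t> perm(keys.size());
    std::iota(perm.begin(), perm.end(), std::size_t{0});
    std::stable_sort(perm.begin(), perm.end(),
        [&](std::size_t a, std::size_t b) { return keys[a] < keys[b]; });

    // Gather keys and values through the sorted permutation.
    std::vector<K> sorted_keys;
    std::vector<V> sorted_values;
    sorted_keys.reserve(keys.size());
    sorted_values.reserve(values.size());
    for (std::size_t i : perm) {
        sorted_keys.push_back(keys[i]);
        sorted_values.push_back(values[i]);
    }
    keys.swap(sorted_keys);
    values.swap(sorted_values);
}

// Segmented reduction: sum each run of consecutive equal keys.
// Returns one (key, sum) entry per run, in order of appearance.
template <typename K, typename V>
std::pair<std::vector<K>, std::vector<V>>
reduce_by_key(const std::vector<K>& keys, const std::vector<V>& values)
{
    std::vector<K> out_keys;
    std::vector<V> out_sums;
    for (std::size_t i = 0; i < keys.size(); ++i) {
        if (out_keys.empty() || !(out_keys.back() == keys[i])) {
            // A new segment starts here.
            out_keys.push_back(keys[i]);
            out_sums.push_back(values[i]);
        } else {
            // Same segment: accumulate.
            out_sums.back() += values[i];
        }
    }
    return {out_keys, out_sums};
}
```

Sorting by key first and then reducing by key gives one total per distinct key. A parallel version would replace both loops, but should produce the same results as this reference.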

#GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 289 posts · Server fediscience.org

So, one of the reasons why we could implement the #HIP backend easily in #GPUSPH is that #AMD provides drop-in replacements for much of the #NVIDIA #CUDA libraries, including #rocThrust, which (as I mentioned in the other thread) is a fork of #Thrust with a #HIP/#ROCm backend.
This is good as it reduces porting effort, *but* it also means you have to trust the quality of the provided implementation.

#thrust #rocthrust #CUDA #nvidia #ROCm #amd #GPUSPH #hip

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 286 posts · Server fediscience.org

For comparison, #GPUSPH with the discrete GPU is over 50× faster than the CPU, and that's on the *low* side of things, actually, due to many kernels being memory-bound rather than compute-bound, and no optimization attempts having been made yet to run on this hardware.

But, and this is where things get surprising, the performance of the iGPU *drops*, failing to even get 2× over the CPU.

Why would something more *intense* have a lower performance ratio?

#GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
155 followers · 283 posts · Server fediscience.org

For reference, I'm testing this hardware primarily with two pieces of software: one is an internal cellular automaton model that we use for the assessment of lava flow invasion hazard, and the other is the #FLOSS #GPUSPH I've already talked about. These two codebases are *very* different, and it's interesting to see how their differences impact the performance ratios I'm observing across the available hardware.

#GPUSPH #floss

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
145 followers · 262 posts · Server fediscience.org

I've been horribly busy these days with lots of trivial but time-consuming bureaucratic stuff, to the point I've been unable to work on #GPUSPH at all. Worse, I haven't even started working on my presentation for #spheric2023 (the material is ready, since the article for the proceedings has been submitted already, so it's really just a matter of building the presentation).

spheric2023.com/

#spheric2023 #GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
144 followers · 231 posts · Server fediscience.org

OK now I need a way to take a video of #GPUSPH in action on the #SteamDeck. I should probably ask @gamingonlinux for recommendations, but my understanding from his YT channel is that he uses an external camera rather than some kind of built-in screen recording capability.

#SteamDeck #GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
144 followers · 229 posts · Server fediscience.org

Making progress with #AMD #ROCm on the #SteamDeck for #GPUSPH. One does need to install the @archlinux community repo to get the packages, but that's relatively painless. However:

Auto-detected GCN arch gfx1033 with flag 0x97ff (AMD Custom GPU 0405)
error: Illegal instruction detected: Invalid dpp_ctrl value: broadcasts are not supported on GFX10+
renamable $vgpr39 = V_MOV_B32_dpp undef $vgpr39(tied-def 0), killed $vgpr52, 322, 15, 15, 0, implicit $exec

I'll have to look into this.

#GPUSPH #SteamDeck #ROCm #amd

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
145 followers · 225 posts · Server fediscience.org

(This minithread brought to you by the need to add more validation test cases for #GPUSPH.)

#GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
143 followers · 214 posts · Server fediscience.org

Linearized models go a long way to describe the behavior of the wave, but some nonlinear effects can only be captured by a full 3D model, which in our case is #GPUSPH (I'm sure nobody that follows me is surprised by that ;-)).
If anybody is interested in discussing the findings, we've opened a topic on the GPUSPH Discourse forum at
gpusph.discourse.group/t/new-p

#GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
140 followers · 190 posts · Server fediscience.org

Why am I so interested in #SYCL for #GPUSPH? (See also my nudge nudge wink wink at @sri <fediscience.org/@giuseppebilot>)
Because SYCL is today the best bet we have at a unified API, and introducing such a backend would have great potential for our aims of “universal hardware support”.

#gpgpu #GPUSPH #sycl

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
140 followers · 188 posts · Server fediscience.org

Here's another interesting thing about #GPUSPH on #Android: currently, it uses the #CPU backend, because the only #GPU backends available in #GPUSPH are for #NVIDIA #CUDA and #AMD through #HIP. I've actually looked at adding #SYCL support too, but there are a few structural issues that currently prevent a solution as straightforward as the one used to implement CPU and HIP support.

#sycl #hip #amd #CUDA #nvidia #gpu #cpu #android #GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
140 followers · 186 posts · Server fediscience.org

I think I should create an actual account for the #GPUSPH project on the Fediverse.

#GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
140 followers · 181 posts · Server fediscience.org

I'm so ridiculously happy about this #GPUSPH on #Android thing, even if it's of no real use: nobody is going to run a serious simulation on their cellphone's CPU and get meaningful results in a reasonable time. It's really just for #nerdcred, even if not as as support
fediscience.org/@giuseppebilot

#hurd #gnu #nerdcred #cfd #android #GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
138 followers · 156 posts · Server fediscience.org

between changes at the operating system level, changes in the Termux toolchain and changes in the code itself, I cannot really claim support for #Android in #GPUSPH: the program _does_ compile and run, but the results are completely bogus!
Worse, trying to debug the issue in _any_ way results in the weirdest undebuggable segmentation faults I've ever seen in my life. Now, it's definitely possible that this is just a matter of unreliability of the stuff in Termux, but it's still quite frustrating.

#GPUSPH #android

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
138 followers · 155 posts · Server fediscience.org

One of the first things I tried after implementing support for the CPU backend in #GPUSPH was to try and port the software to #Android, “cheating” by setting up a build environment inside Termux to build and run it as a command-line application there.
It actually worked, which had me seriously thrilled about the thing.

However, the situation has changed now, and not for the better:

#android #GPUSPH

Last updated 1 year ago

Giuseppe Bilotta · @giuseppebilotta
122 followers · 88 posts · Server fediscience.org

@anteru relevant issues concerning the (lack of) information are
github.com/RadeonOpenCompute/R
and
github.com/RadeonOpenCompute/R

This information is important to have, as accurate as possible, also to support developers that wish to advertise support. For example, we recently introduced #HIP support in #GPUSPH on a private branch, but we're not sure if we can announce it on the next public release without hitting significant support issues.

@ashwinvis

#GPUSPH #hip #amd

Last updated 2 years ago

Giuseppe Bilotta · @giuseppebilotta
117 followers · 76 posts · Server fediscience.org

LOL, I was even able to make #GPUSPH buildable and runnable on #GNU #Hurd (just need to fence off a couple of unavailable headers and functions). Not sure how useful an #HPC program is on an OS that doesn't even fully support 64-bit processors though. I confess to having done this purely for #nerd credits.
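As a generic illustration of what “fencing off” an unavailable header can look like, here is a small sketch using the C++17 `__has_include` preprocessor check. The header `<sys/sysinfo.h>`, the `get_nprocs()` call and the `available_cpus()` helper are hypothetical stand-ins chosen for the example; they are not the headers or functions that actually needed guarding in the port.

```cpp
#include <thread>

// Only include the platform-specific header if the toolchain provides it.
// (The header chosen here is an arbitrary example, not the one from the port.)
#if defined(__has_include)
#  if __has_include(<sys/sysinfo.h>)
#    include <sys/sysinfo.h>
#    define HAVE_SYS_SYSINFO_H 1
#  endif
#endif

// Hypothetical helper: number of usable CPUs, with a portable fallback.
unsigned available_cpus()
{
#ifdef HAVE_SYS_SYSINFO_H
    // Platform-specific path, compiled only where the header exists.
    int n = get_nprocs();
    if (n > 0)
        return static_cast<unsigned>(n);
#endif
    // Portable path: the standard library may report 0 if it cannot tell.
    unsigned n_std = std::thread::hardware_concurrency();
    return n_std > 0 ? n_std : 1u;
}
```

The point of the pattern is that the rest of the code calls only the wrapper, so the platform-specific part stays confined to one place.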

#nerd #hpc #hurd #gnu #GPUSPH

Last updated 2 years ago

Giuseppe Bilotta · @giuseppebilotta
117 followers · 76 posts · Server fediscience.org

The reason I decided to do this isn't so much to expand OS support in #GPUSPH (although that's always a nice bonus), but as a #learning opportunity for myself. I may do a write-up on this some time in the future.
(FWIW, so far the experience “feels” way more friendly than that of other OSes.)

#bsd #freebsd #learning #GPUSPH

Last updated 2 years ago