Alan · @alan
6 followers · 55 posts · Server piaille.fr

Thinking about publishing some data in fluid dynamics in an open format that could be a standard, and trying not to reproduce this piece of art
xkcd.com/927/
NetCDF looks interesting, or HDF5, but my experimental data are not so standardized.

#experimental #hdf5 #netcdf

Last updated 1 year ago

gauteh · @gauteh
24 followers · 45 posts · Server hachyderm.io

Fixed a long-standing bug in hidefix (github.com/gauteh/hidefix) where files with unaligned chunking failed. 0.7.0 is out. As fast as ever compared to the native HDF5 library.

#hdf5 #rust

Last updated 1 year ago

nialov · @nialov
3 followers · 37 posts · Server fosstodon.org

Looking for help with hdf5 and fortran packaging as a complete newbie: github.com/NixOS/nixpkgs/issue

#nixos #nix #fortran #hdf5

Last updated 1 year ago

Markus Osterhoff · @mosterh1
87 followers · 499 posts · Server academiccloud.social

And after … a couple of days, here is the bugfix for my memory leak.
It turns out that using HDF5 needs some cleanup routines – that's totally okay, but I have plenty of terabytes that worked without 🤔

#terabytes #hdf5 #memoryleak #bugfix

Last updated 1 year ago

Markus Osterhoff · @mosterh1
81 followers · 486 posts · Server academiccloud.social

The HDF User Group will hold their annual European meeting on 19-21 September at DESY, Hamburg, Germany.

The special topic of this meeting will be plugins and data compression, and the meeting is hosted in collaboration with the LEAPS-INNOV EU project. The HDF Group will give an update on the latest developments within HDF5 and present the roadmap for the future.

indico.desy.de/e/ehug2023

#compression #data #hdf5 #desy #hdf

Last updated 1 year ago

Markus Osterhoff · @sci_photos
235 followers · 3309 posts · Server troet.cafe

Another program (let's call it kleinhirn) displayed more sensible values. But the first one (let's call it zeiger) is supposed to count and send the values to grosshirn, where SPEC then picks up the numbers for the alignment.

The data arrive as an lz4-compressed HDF5 stream; zeiger unpacks and counts, then writes the original lz4 blob to the file; kleinhirn, on another machine, opens the file, counts as well, and displays the image.
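
The pass-through pattern described here (decompress a copy for counting, but store the compressed blob unmodified) might look roughly like the sketch below. It rests on several assumptions: zlib from the Python standard library stands in for lz4, a buffer of little-endian 16-bit pixel values stands in for the detector frame, "counting" is taken to mean summing those values, and a plain list stands in for the output file. `handle_frame` is a hypothetical name, not a function from zeiger.

```python
import struct
import zlib
from typing import List


def handle_frame(blob: bytes, store: List[bytes]) -> int:
    """Sum the pixel values of one compressed frame and store the blob untouched."""
    raw = zlib.decompress(blob)  # assumption: zlib stands in for lz4
    # Assumption: the frame is a flat array of little-endian uint16 values.
    pixels = struct.unpack(f"<{len(raw) // 2}H", raw)
    store.append(blob)  # keep the original compressed bytes, no re-compression
    return sum(pixels)
```

With a real HDF5 file, the analogous step would presumably be a direct chunk write (`H5Dwrite_chunk` in the C API), which stores already-compressed bytes without running them through the filter pipeline again.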

#hdf5 #lz4

Last updated 1 year ago

gauteh · @gauteh
18 followers · 26 posts · Server hachyderm.io

New release of hidefix (github.com/gauteh/hidefix), updated to work with xarray's plugin registration. pip install hidefix should do the trick; a conda package is on the way: github.com/conda-forge/staged-

#hidefix #rust #hdf5 #netcdf

Last updated 1 year ago

Christopher J Burke · @CuriousTerran
101 followers · 196 posts · Server universeodon.com

@ExoHugh I really hate getting an undocumented or poorly documented file. It is very challenging to parse its contents. Having the human-readable header metadata at the beginning of fits/asdf files is a godsend for simply running cat/less on a file to figure out what's in it.

#hdf5

Last updated 1 year ago

Warrick Ball · @warrickball
75 followers · 347 posts · Server mas.to

@ExoHugh

I vote for whatever will be most easily queried by user tools like astroquery (or the equivalent of Lightkurve, if one is made).

That said, I lean towards HDF5. I wish I'd started using it more and sooner.

#hdf5

Last updated 1 year ago

gauteh · @gauteh
10 followers · 12 posts · Server hachyderm.io

On a machine with 64 CPUs reading with hidefix is between 10x and 16x faster than native HDF5!!

And that is _including_ generating the index.

Going from 20 seconds to 1.8 seconds is pretty significant when reading a bunch of variables! Going from 7 s to 380 ms is the difference between waiting and processing immediately!

This is for a file on a network disk.

The ultimate goal: Make this into an xarray backend!

#xarray #rust #hidefix #hdf5 #data #met

Last updated 2 years ago

gauteh · @gauteh
10 followers · 12 posts · Server hachyderm.io

Since the read can now be grouped by chunk and read directly to destination slice, we can read a slice in parallel: We're now at 2.72 times faster reading than the native HDF5 library (w/ 4 CPUs)!

#rust #hdf5 #data

Last updated 2 years ago

gauteh · @gauteh
9 followers · 6 posts · Server hachyderm.io

The adventure continues in speedy HDF5 reading: reading directly into a target slice requires sorting the segments by chunk so that each chunk is only read once. However, it turns out that grouping the segments within each chunk in a Vec of Vecs becomes very slow to deallocate! 2-3 seconds on a big variable. Better with a flat vector: github.com/gauteh/hidefix/pull
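
The idea of one flat, chunk-sorted list of segments can be illustrated with a small Python sketch. This is not hidefix's Rust code: the names `Segment`, `plan` and `read` are hypothetical, the dataset is assumed to be 2-D with row-major chunks, the ranges are assumed to have step 1, and a dict lookup stands in for reading and decompressing a chunk.

```python
from itertools import groupby
from typing import Dict, List, NamedTuple, Tuple

ChunkId = Tuple[int, int]


class Segment(NamedTuple):
    chunk: ChunkId  # (chunk row, chunk column)
    src: int        # offset of the first element inside the chunk
    dst: int        # offset of the first element in the output buffer
    length: int     # number of contiguous elements to copy


def plan(rows: range, cols: range, chunk_shape: Tuple[int, int]) -> List[Segment]:
    """Return one flat list of segments for a 2-D request, sorted by chunk."""
    cr, cc = chunk_shape
    width = len(cols)
    segments = []
    for r in rows:
        c = cols.start
        while c < cols.stop:
            block = c // cc
            end = min(cols.stop, (block + 1) * cc)
            segments.append(Segment(
                chunk=(r // cr, block),
                src=(r % cr) * cc + (c - block * cc),
                dst=(r - rows.start) * width + (c - cols.start),
                length=end - c,
            ))
            c = end
    # A single sort on a flat list replaces the nested list-of-lists grouping.
    segments.sort(key=lambda s: s.chunk)
    return segments


def read(chunks: Dict[ChunkId, List[int]], segments: List[Segment], size: int):
    """Copy every segment into a preallocated buffer, loading each chunk once."""
    out = [0] * size
    loads = 0
    for chunk_id, group in groupby(segments, key=lambda s: s.chunk):
        data = chunks[chunk_id]  # stands in for one read + decompress
        loads += 1
        for seg in group:
            out[seg.dst:seg.dst + seg.length] = data[seg.src:seg.src + seg.length]
    return out, loads
```

Because the segments are already sorted, `groupby` walks consecutive runs that belong to the same chunk without building any per-chunk containers, which is the property the flat vector is after.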

#rust #optimize #hdf5 #data

Last updated 2 years ago

Mark Kittisopikul · @markkitti
229 followers · 98 posts · Server fosstodon.org

I'm preparing for an H5Dchunk_iter sprint to meet the HDF5 1.14 merge window for consistency. This function allows you to efficiently iterate through chunk information such as the chunk address and compression filter state.

PR approved. Backporting to 1.12 required.
github.com/HDFGroup/hdf5/pull/

I might even try backporting to 1.10.
forum.hdfgroup.org/t/backporti

#hdf5 #h5dchunk_iter

Last updated 2 years ago

openPMD · @openPMD
0 followers · 1 posts · Server mast.hpc.social

Hey there 👋
Did someone say , and ?
Check out our community work on productive data , leveraging leading HPC file formats such as ADIOS and HDF5!

openPMD.org
github.com/openPMD

#standardization #fairdata #hpc #compatibility #adios #hdf5

Last updated 2 years ago

Kostas Andreadis · @kandreadis
5 followers · 12 posts · Server fediscience.org

Trying to reproject some SMAP global soil moisture data using GDAL: the one solution that seems to work consistently requires setting the extent when extracting from the HDF5 file gis.stackexchange.com/a/253959

#hdf5 #gdal #smap #nasa

Last updated 2 years ago

Fred · @fred
85 followers · 57 posts · Server m.baldhead.be

I still miss some people.

Anyone out there? :)

#netcdf #zarr #geotiff #cog #geoparquet #hdf5 #ogc #geospatial

Last updated 2 years ago

Yann :python: · @nobodyinperson
86 followers · 316 posts · Server fosstodon.org

@thfriedrich I benchmarked the different compression algorithms in once if you're interested: gitlab.com/-/snippets/2043808

With the metric I use there (distance to optimum 'fast and small'), blosc:lz4 is the best compromise.
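
A "distance to optimum" score of this kind could be computed as in the sketch below. The exact metric in the linked snippet is not reproduced here; this version assumes that time and size are each min-max scaled to [0, 1] and that the score is the Euclidean distance to the ideal point (fastest, smallest). `distance_to_optimum` is a hypothetical name.

```python
import math
from typing import Dict, Tuple


def distance_to_optimum(results: Dict[str, Tuple[float, float]]) -> Dict[str, float]:
    """Map codec name -> distance from the ideal point (fastest, smallest).

    `results` maps a codec name to (seconds, bytes). Both axes are min-max
    scaled to [0, 1] so that neither unit dominates the distance.
    """
    times = [t for t, _ in results.values()]
    sizes = [s for _, s in results.values()]

    def scale(value, values):
        lo, hi = min(values), max(values)
        return 0.0 if hi == lo else (value - lo) / (hi - lo)

    return {
        name: math.hypot(scale(t, times), scale(s, sizes))
        for name, (t, s) in results.items()
    }


# Made-up placeholder measurements, NOT numbers from the linked benchmark.
example = {
    "fast_but_big": (1.0, 100.0),
    "slow_but_small": (5.0, 20.0),
    "balanced": (2.0, 40.0),
}
scores = distance_to_optimum(example)
best = min(scores, key=scores.get)  # the entry closest to 'fast and small'
```

A lower score is better under this assumption; a codec that is neither the fastest nor the smallest can still win if it is close to the best on both axes.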

I still hit a wall with at some point though, I guess the compression prevented something from being done, I don't remember...

Also, NetCDF4 is a subtype of HDF5, so you'll feel familiar.

#hdf5 #NetCDF4

Last updated 2 years ago

Thomas Friedrich · @thfriedrich
47 followers · 84 posts · Server fosstodon.org

@nobodyinperson thanks for the insight. 🙂 I’ll have a look at MQTT and NetCDF4, which are both new to me. Working in science, so far I have mostly used HDF5 for larger data, which I think is a plus for open data & open science since there are easy interfaces for most programming languages. Compression is decent, I believe.

#mqtt #NetCDF4 #hdf5 #opendata #openscience

Last updated 2 years ago

Arne Babenhauserheide · @ArneBab
397 followers · 9174 posts · Server rollenspiel.social

HDF5 and Feather performance: storing and reading columnar float
github.com/fizban99/hdf_vs_fea


Feather writes faster, HDF5 reads faster at large sizes.

#hdf5 #feather #Benchmark

Last updated 2 years ago