Comments (12)
@jreadey, I realize this is more of an HSDS issue, right?
Any ETA on supporting dimension scales there?
from h5pyd.
Right the create_scale method in h5py just turns around and calls the HDF5 library.
Looking at the dimension scale spec: https://support.hdfgroup.org/HDF5/doc/HL/H5DS_Spec.pdf, I think I just need to implement object references in HSDS to get this working.
That's on schedule for August so I'd hope we can get it in h5pyd/h5netcdf soon.
from h5pyd.
@jreadey, awesome, thanks!
cc @rabernat
from h5pyd.
@jreadey , just curious: any update here?
Would love to see this working with xarray/h5netcdf...
from h5pyd.
@rsignell-usgs - I got pulled into some other work last week - promise to get to work on it this week.
from h5pyd.
@jreadey , I'm at the Unidata Users Committee Meeting on Monday. Anything progress/roadblocks to report here?
from h5pyd.
@rsignell-usgs - sorry, we've had some other items come up. Will target having dimension scales working by end of month.
from h5pyd.
@jreadey , any updates?
from h5pyd.
@rsignell-usgs - we have dimscales implemented in h5pyd thanks to @ajelenak-thg. It is still fairly "fresh" so please try it out and let us know if you run into any issues. I haven't created a release yet, so you'll need to do a "git pull" and "python setup.py install" to get the latest.
Also, I've been tinkering with the hsload util to do the right thing with files that include dimension scales. I'll try this out with your sandy NetCDF file.
from h5pyd.
@jreadey , okay, I'll check that out when I get a chance. I'm at a joint USGS/NOAA meeting this week at the National Water Center in Tuscaloosa. Talking about NWS National Water Model that produces 1TB forecast data in NetCDF each day.
http://water.noaa.gov/about/nwm
Folks I've spoken to here seem very interested in HSDS/h5pyd.
from h5pyd.
Dimension scales are implement in v0.2.7 release and HSDS. I'll close this issue and request users create new issues for any specific issues with the dimscale implementation.
from h5pyd.
I was able to get this going using tips from @ajelenak-thg
Here's the hypyd_env.yml
environment file I used:
name: h5pyd
channels:
- conda-forge
- defaults
dependencies:
- python=3.6
- h5py
- jupyter
- pytz
- requests
- matplotlib
- xarray
- pip:
- https://github.com/HDFGroup/h5pyd.git@master
- https://github.com/ajelenak-thg/h5netcdf.git@h5pyd
and then in this h5pyd
environment:
import xarray as xr
import h5pyd
I wasn't able to use @ajelenak-thg HSDS endpoint because I didn't have permission, so this didn't work:
f = xr.open_dataset('http://35.166.11.248:5101/home/ajelenak/MERRA2_400.statD_2d_slv_Nx.20171031.nc4',
engine='h5netcdf')
but I was able to open one of my data endpoints on @jreadey HSDS on XSEDE:
f = xr.open_dataset('http://149.165.156.174:5101/home/rsignell/Sandy_ocean_his_nc4.nc', engine='h5netcdf')
Took 90 seconds to open, which as @ajelenak-thg explained:
Expect a delay when h5netcdf/xarray are opening HSDS files. HSDS is still not optimized for metadata-type requests and these happen a lot on file open. I counted 184 requests for the above MERRA-2 file. The number of requests is directly related to the number of variables and global/variable attributes in a file.
from h5pyd.
Related Issues (20)
- hsload fails with datasets using scale offset filter HOT 1
- apply source compression filter in hsload HOT 1
- Show filters applied to any datasets in hsls HOT 1
- h5pyd not evaluating environment variables HOT 1
- An error related to hsrm HOT 12
- Error with 1D chunk sizes HOT 2
- `logging.info` call forces downstream package loggers to emit messages twice
- Recursive Domain Deletion Flag for `hsrm`
- git tags missing HOT 1
- hsload doesn't allow linkpath and fastlink options to be used together
- Update build process to use toml HOT 1
- CI Testing in Github HOT 1
- Support numpy-style broadcasting
- Support field selection from compound types HOT 1
- Support ordering links/attribute by creation index/name HOT 1
- Attributes in root group not displayed by hsls --showattrs HOT 1
- Support for H5Dread_multi/H5Dwrite_multi
- Missing v0.18.0 tag HOT 2
- PEP 632: Python 3.12 Incompatible with Distutils HOT 1
- Failures during vlen writes in test_dataset
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from h5pyd.