-
Notifications
You must be signed in to change notification settings - Fork 54
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Strange Interactions with Transposition and StoreToZarr #710
Comments
Related to #709 and #715 and h5py/h5py#2019 Doing an issue dump of what we've learned and a thread with great detail from the past that is related ✨ MRE:Multiple different jobs (beyond the one in the this issue) seem to hang in Getting an MRE was hard and instead we decided to let things hang and take the opportunity to inspect thread traces and dumps to see what we can learn. The environment we were working on didn't give us privilege to install Investigation:
|
Next Steps and Ideas:
|
Could try with |
With my new
|
Also, look into using this: https://www.hdfgroup.org/solutions/cloud-amazon-s3-storage-hdf5-connector/ instead of any repurposed synchronous tooling |
Versions:
pangeo-forge-runner==0.10.2
recipe.py
recipe versions
Problem:
Putting up the bat signal on this one 🦇 📡 b/c it's kept us confused for days. On the
LocalDirectRunner
andFlink
we've noticed that this recipe with transposing coordinates will either hang/stall or dump zero useful tracebacks about where it's failing.Looking for ideas about the finicky nature of this beast if you have any 🙇
Unfortunately, the source data is in a protected s3 bucket 😞 and the recipe is written to leverage the implied
AssumeRole
behind the scenes but there's a JH cluster you can be added to if you want to test it outThe text was updated successfully, but these errors were encountered: