Comparing performance between access multi-file patterns #987
Replies: 6 comments 1 reply
This comment has been hidden.
This comment has been hidden.
-
A new notebook (in a discussion-987/comparing-access-performance branch) shows initial test results in the second markdown cell, titled "Test Report (so far)". |
Beta Was this translation helpful? Give feedback.
-
This has been on the list forever, I'm happy we are moving along! |
Beta Was this translation helpful? Give feedback.
-
This is great. I will look onto replicating these access workflows with xr.open_dataset(opendap_dap4url, engine='pydap') and xr.open_mfdataset(list_opendap_dap4urls, engine='pydap', **open_params) and will share a notebook later today. |
Beta Was this translation helpful? Give feedback.
-
Opened draft PR #989 for this effort. |
Beta Was this translation helpful? Give feedback.
-
Quick chart showing the results from the initial tests: |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Perhaps we can create a blog post, white paper, or some other artifact, explaining the results of our performance assessment.
Access methods to compare:
earthaccess.open_virtual_dataset()
earthaccess.open_virtual_mfdataset()
pydap
earthaccess.download()
xarray.open_mfdataset()
Tentatively, we want to access as Zarr v3, and our "moon shot" would be Icechunk.
(This came up in a discussion during the earthaccess hackday on 15 April 2025, with @battistowx, @betolink, @Mikejmnez, @rwegener2)
Beta Was this translation helpful? Give feedback.
All reactions