Comparing performance between access multi-file patterns #987

danielfromearth · 2025-04-15T18:58:57Z

danielfromearth
Apr 15, 2025
Maintainer

Perhaps we can create a blog post, white paper, or some other artifact, explaining the results of our performance assessment.

Access methods to compare:

earthaccess.open_virtual_dataset()
earthaccess.open_virtual_mfdataset()
pydap
earthaccess.download()
xarray.open_mfdataset()

Tentatively, we want to access as Zarr v3, and our "moon shot" would be Icechunk.

(This came up in a discussion during the earthaccess hackday on 15 April 2025, with @battistowx, @betolink, @Mikejmnez, @rwegener2)

danielfromearth · 2025-04-16T15:44:04Z

danielfromearth
Apr 16, 2025
Maintainer Author

A new notebook (in a discussion-987/comparing-access-performance branch) shows initial test results in the second markdown cell, titled "Test Report (so far)".

0 replies

betolink · 2025-04-16T17:24:09Z

betolink
Apr 16, 2025
Maintainer

This has been on the list forever, I'm happy we are moving along!

0 replies

Mikejmnez · 2025-04-16T17:32:22Z

Mikejmnez
Apr 16, 2025

This is great. I will look onto replicating these access workflows with

xr.open_dataset(opendap_dap4url, engine='pydap')

and

xr.open_mfdataset(list_opendap_dap4urls, engine='pydap', **open_params)

and will share a notebook later today.

0 replies

danielfromearth · 2025-04-18T19:32:29Z

danielfromearth
Apr 18, 2025
Maintainer Author

Opened draft PR #989 for this effort.

0 replies

This comment has been hidden.

Sign in to view

This comment has been hidden.

Sign in to view

This comment has been hidden.

Sign in to view

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comparing performance between access multi-file patterns #987

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Replies: 5 comments 1 reply

This comment has been hidden.

This comment has been hidden.

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{title}}

{{editor}}'s edit

{{editor}}'s edit

{{title}}

{{editor}}'s edit

{{editor}}'s edit

Select a reply

Comparing performance between access multi-file patterns #987

danielfromearth Apr 15, 2025 Maintainer

Replies: 5 comments · 1 reply

This comment has been hidden.

This comment has been hidden.

danielfromearth Apr 16, 2025 Maintainer Author

betolink Apr 16, 2025 Maintainer

Mikejmnez Apr 16, 2025

danielfromearth Apr 18, 2025 Maintainer Author

danielfromearth
Apr 15, 2025
Maintainer

Replies: 5 comments 1 reply

danielfromearth
Apr 16, 2025
Maintainer Author

betolink
Apr 16, 2025
Maintainer

Mikejmnez
Apr 16, 2025

danielfromearth
Apr 18, 2025
Maintainer Author