Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added docs/_static/geojupyter_cloud_data.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
10 changes: 10 additions & 0 deletions docs/related.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
# Related guides

Check out these related guides and cookbooks!

- [NASA Earthdata Cloud Cookbook](https://nasa-openscapes.github.io/earthdata-cloud-cookbook/) - Openscapes cookbook to support NASA Earth Science Team's Migration to the Cloud
- [Scaling Science](https://developmentseed.org/scaling_science/docs/index.html) - Development Seed guide on science applied in industry
- [Zarr Visualization Report](https://github.com/NASA-IMPACT/zarr-visualization-report) - NASA VEDA guide on Zarr visualization approaches
- [Warp resample profiling](https://github.com/developmentseed/warp-resample-profiling) - Development Seed overview of all the Python resampling/reprojection options, built with support from NASA ODSI
- [Cloud-native geospatial formats guide](https://guide.cloudnativegeo.org/) - CNG resource on cloud-native formats, built with support of NASA, CNG, and Development Seed
- [PO.DAAC Cookbook](https://podaac.github.io/tutorials/) - Physical Oceanography Distributed Active Archive Center (PO.DAAC) coobook to make NASA’s ocean, climate, and surface water data universally accessible and meaningful
4 changes: 4 additions & 0 deletions docs/worst-practices/fsspec-caching-defaults.md
Original file line number Diff line number Diff line change
@@ -1,3 +1,7 @@
# Default FSSpec caching arguments

FSSpec's default caching behavior is designed for tabular data and performs poorly for datacubes. Recommendations for better better arguments can be found in the [Xarray's tutorial](https://tutorial.xarray.dev/intermediate/remote_data/remote-data.html#remote-access-and-file-caching).

For example, see the figure below from the [Open Source Geospatial Workflows in the Cloud workshop](https://agu2025.workshops.geojupyter.org/modules/data-in-the-cloud/) showing how much slower it is to open a datacube via `xarray.open_mfdataset()` when using the default readahead fsspec caching mechanism vs. a block cache. For a bonus, you can provide even faster access by virtualizing your data using VirtualiZarr and Icechunk!

![Figure from Open Source Geospatial Workflows in the Cloud workshop showing data loading performance](../_static/geojupyter_cloud_data.png)
5 changes: 3 additions & 2 deletions mkdocs.yml
Original file line number Diff line number Diff line change
Expand Up @@ -40,8 +40,8 @@ nav:
- Xpublish-tiles: "visualization/xpublish-tiles.md"
- Rendering layers:
- Carbonplan-maps: "visualization/carbonplan-maps.md"
- Deck.gl-raster-layer: "visualization/deck.gl-raster.md"
- Numeric-data-layer: "visualization/zarr-layer.md"
- Deck.gl-raster: "visualization/deck.gl-raster.md"
- Numeric-data-layer: "visualization/numeric-data-layer.md"
- Zarr-cesium: "visualization/zarr-cesium.md"
- Zarr-gl: "visualization/zarr-gl.md"
- Zarr-layer: "visualization/zarr-layer.md"
Expand All @@ -51,6 +51,7 @@ nav:
- API Reference:
- Benchmarking:
- "api-reference/datacube-benchmark.md"
- Related Guides: "related.md"

watch:
- packages
Expand Down