Related Packages
There are a number of other packages that combine Scikit-Learn with Xarray and/or use estimators for mapping. The table below attempts to compare their features (to the best of our understanding) with sklearn-raster so you can choose the right tool for your application.
Supported features are described in detail below.
| Supported features | sklearn-raster | sklearn-xarray | dask-ml | scikit-map | pyimpute | pyspatialml |
|---|---|---|---|---|---|---|
| 2D outputs | ✅ | ❌ | ❌ | ✅ | ✅ | ✅ |
| 3D outputs | ✅ | ❌ | ❌ | ✅ | ❌ | ❌ |
| n-D outputs | ✅ | ❌ | ❌ | ❌ | ❌ | ❌ |
| Preserves metadata | ✅ | ❌ | ❌ | ✅1 | ✅1 | ✅1 |
| Parallel computation | ✅ | ✅2 | ✅ | ✅ | ❌ | ❌ |
| Lazy computation | ✅ | ✅3 | ✅ | ❌ | ❌ | ❌ |
| Xarray support | ✅ | ✅ | ✅ | ❌ | ❌ | ❌ |
| Parallel fitting | ❌ | ✅ | ✅ | ❌ | ❌ | ❌ |
| Custom estimators | ❌ | ✅ | ✅ | ❌ | ❌ | ❌ |
| Raster processing tools | ❌ | ❌ | ❌ | ✅ | ❌ | ❌ |
Supported Features¶
- 2D outputs: Package is capable of generating 2D model outputs from 2D feature inputs, e.g. predicting georeferenced land cover maps or transforming geospatial imagery.
- 3D outputs: Package is capable of generating 3D model outputs from 3D feature inputs, e.g. predicting a time series of georeferenced maps.
- n-D outputs: Package is capable of generating model outputs with arbitrary dimensionality from n-D feature inputs, e.g. predicting a time series of climate variables at various pressure levels.
- Preserves metadata: Model outputs retain the metadata of the input data, such as spatial references, band names, and NoData masks.
- Parallel computation: Package computes model outputs in parallel.
- Lazy computation: Package computes model outputs lazily, deferring computation until necessary.
- Xarray support: Package supports Xarray data structures as model inputs and outputs.
- Parallel fitting: Package supports fitting estimators in parallel to reduce training time.
- Custom estimators: Package implements additional estimators.
- Raster processing tools: Package includes additional functionality for processing raster data beyond ML modeling.
-
scikit-map,pyimpute, andpyspatialmlonly preserve metadata when writing outputs directly to disk. ↩↩↩ -
sklearn-xarraysupports parallel operations by wrappingdask-mlestimators. ↩ -
sklearn-xarraysupports lazy evaluation by deferring computation, but requires that the entire dataset is loaded into memory. ↩