Glossary#

BenchDataset#

Abstract base class implemented by every dataset wrapper in torchgeo_bench.datasets. Declares static metadata (bands, number of classes, task type, default split sizes) and exposes a get_dataset() factory returning a PyTorch Dataset for a given split.

BenchModel#

Abstract base class implemented by every backbone in torchgeo_bench.models. Sealed forward_patch_features method guarantees that input normalisation is always applied before the subclass-specific _forward_patch_features.

bootstrap#

Resampling technique used to estimate a confidence interval around a metric. eval.bootstrap controls the number of resamples; per-dataset CIs are reported as (ci_lower, ci_upper) in the results CSV.

GeoBench V1#

The original GeoBench benchmark released as a set of HDF5 files. Datasets in V1 use the m- prefix in the CLI (e.g. m-eurosat) and live under data/classification_v1.0/.

GeoBench V2#

The second-generation GeoBench distribution, packaged as one directory per dataset under data/geobenchv2/. V2 datasets use no prefix (benv2, treesatai, pastis, …).

Hydra#

The configuration framework used to compose model and run configs. See Configuration for the override syntax and https://hydra.cc for the full documentation.

intrinsic dimension#

The geometric / statistical dimension of a manifold of feature embeddings, estimated by methods such as TwoNN, MLE, or lPCA. Optional in torchgeo-bench via eval.intrinsic_dim and the [id] extra; see torchgeo_bench.intrinsic_dim.

KNN-5#

5-nearest-neighbour classifier evaluated on backbone-extracted features. Reported as method="knn5" in the results CSV.

linear probe#

Logistic regression trained on frozen backbone features. We sweep C over eval.c_range and report the best test-set performance with best_c. Method label: linear.

mIoU#

Mean Intersection-over-Union, the primary metric for segmentation datasets. Computed by evaluate_segmentation().

resume mode#

When resume=true, the runner skips any (dataset, method, model, config) combination already present in the output CSV. See Results format for the exact key.