torchgeo_bench.models#

This module provides the abstract BenchModel interface and a collection of concrete backbones that can be benchmarked across the torchgeo_bench.datasets registry.

Interface#

class torchgeo_bench.models.BenchModel(bands, normalization=NormalizationStrategy.BANDSPEC_ZSCORE, **_)[source][source]#

Bases: Module, ABC

Abstract base interface for benchmarkable models.

Parameters:

bands (list[BandSpec]) – Ordered list of BandSpec describing the input channels. Length determines num_channels.
normalization (NormalizationStrategy | str) – Input-normalisation strategy name (one of bandspec_zscore / model_native / minmax / minmax_zscore / identity). Defaults to "bandspec_zscore".

Subclasses may declare:

expected_input_unit — what scale the pretrained backbone was fed at training (e.g. s2_dn, reflectance_0_1, uint8). Used by the model_native strategy.
pretrain_mean / pretrain_std — per-channel normalisation applied after unit conversion under model_native.

normalize_inputs(images)[source][source]#

Apply the configured normalisation strategy.

forward_patch_features(images)[source][source]#

Return a batch of vector embeddings (B, K) from raw inputs.

Sealed: applies normalize_inputs() then dispatches to _forward_patch_features(). Override normalize_inputs() to change the normalization policy and _forward_patch_features() to change the backbone forward.

forward(images)[source][source]#

Alias for forward_patch_features().

Backbones#

Random Convolutional Features#

class torchgeo_bench.models.RCFBench(bands, features=512, kernel_size=3, mode='gaussian', stats_mode='mean', seed=None, dataset=None, **_kwargs)[source][source]#

Bases: BenchModel

Wrapper for the existing RCF implementation.

Modes:

mode="gaussian": filters are drawn from a Gaussian; default BenchModel.normalize_inputs() (per-channel z-score) is applied to inference inputs.
mode="empirical": filters are sampled from dataset. To keep the filter bank and inference inputs in the same distribution, the passed dataset is wrapped so its samples are pre-normalized with the same per-channel z-score this RCFBench will use at inference.

Image statistics baseline#

class torchgeo_bench.models.ImageStatsBench(bands, normalization=NormalizationStrategy.BANDSPEC_ZSCORE, **_)[source][source]#

Bases: BenchModel

BenchModel that returns per-image statistics (mean, std, min, max).

Returns raw sensor statistics: normalize_inputs() is overridden to identity so the per-band magnitudes are preserved. Downstream KNN distances and the LogisticRegression sweep see large, unscaled per-channel values; widen eval.c_range if the default sweep saturates.

normalize_inputs(images)[source][source]#

Identity — this model intentionally exposes raw sensor statistics.

timm encoders#

class torchgeo_bench.models.TimmPatchBenchModel(bands, *, model_name, pretrained=True, normalize=False, global_pool='avg', auto_resize=False, target_size=None, use_cls_token=False, input_normalization='bands_zscore', **_kwargs)[source][source]#

Bases: BenchModel

BenchModel wrapper for any timm backbone.

Parameters:

bands (list[BandSpec]) – Ordered BandSpec list (channel count = len(bands)).
model_name (str) – Any timm model name (e.g. "resnet50", "convnext_small", "vit_base_patch16_224").
pretrained (bool) – Load pretrained weights when available.
normalize (bool) – If True, L2-normalize the output embedding.
global_pool (str | None) – Global pooling strategy for timm headless models.
use_cls_token (bool) – For ViT-family models, use the CLS token instead of averaging spatial tokens.
auto_resize (bool) – If True, bilinearly resize each batch to target_size.
target_size (int | None) – Square target size; auto-inferred from backbone.default_cfg["input_size"] when not given.
input_normalization (str) – One of "bands_zscore" (default; per-channel z-score using BandSpec statistics — correct for raw remote-sensing data of any channel count), "imagenet" (per-channel rescale to [0, 1] using BandSpec.{min, max}, then (x - mean) / std with ImageNet RGB stats; refuses to instantiate when len(bands) != 3), "timm_default" (same shape as "imagenet" but reads backbone.default_cfg’s mean/std; also requires len(bands) == 3), or "none" (identity).

normalize_inputs(images)[source][source]#

Apply the configured normalization policy.

torchgeo encoders#

class torchgeo_bench.models.TorchGeoResNetBench(bands, *, factory='resnet50', weights_class='ResNet50_Weights', weights_member='SENTINEL2_RGB_MOCO', auto_resize=False, target_size=224, input_unit_check='warn', **_kwargs)[source][source]#