Attentive Analog Ensemble (A2E)

Attentive Analog Ensemble (A2E) is a TensorFlow-based framework for probabilistic weather postprocessing. It rethinks the traditional Analog Ensemble (AnEn) by replacing hard analog retrieval with a differentiable cross-attention mechanism. This enables end-to-end learning of analog similarity directly from probabilistic scoring rules such as the continuous ranked probability score (CRPS) and its scaled variant (SCRPS).

The framework includes three model variants:

A2E – single-location attentive analog retrieval
CA2E – conditional multi-location variant with shared model weights and location-specific embeddings
SA2E – spatial extension for spatiotemporal modeling across multiple locations

Motivation

The classical Analog Ensemble is effective for generating probabilistic forecasts, but it has several limitations:

similarity is based on manually weighted linear predictor combinations
weight optimization becomes expensive in higher dimensions
locations are typically modeled independently
only a short temporal context is used when comparing forecasts
analog retrieval is non-differentiable and cannot be trained end-to-end with backpropagation

A2E addresses these limitations by treating:

the current forecast as the query
historical forecasts as keys
historical observations as values

Cross-attention then acts as a soft analog retrieval mechanism, with attention weights defining how strongly each historical observation contributes to the predictive distribution. Because this retrieval is differentiable, the model can learn nonlinear similarity representations in latent space.
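The query/key/value reading above can be sketched in a few lines. This is a minimal illustration in plain Python, not the repository's TensorFlow layer. It assumes scaled dot-product similarity followed by a softmax, and the function names and toy numbers are made up for the example.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def soft_analog_retrieval(query, keys, values):
    """Return attention weights and the weighted mean of historical observations.

    query  : embedding of the current forecast (list of floats)
    keys   : embeddings of historical forecasts (list of lists)
    values : historical observations, one per key
    Assumption: scaled dot-product similarity; other metrics are possible.
    """
    scale = math.sqrt(len(query))
    scores = [sum(q * k for q, k in zip(query, key)) / scale for key in keys]
    weights = softmax(scores)
    mean = sum(w * v for w, v in zip(weights, values))
    return weights, mean


# Toy data: the first historical forecast matches the query best.
query = [1.0, 0.0]
keys = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
values = [10.0, 20.0, 30.0]

weights, mean = soft_analog_retrieval(query, keys, values)
print(weights, mean)
```

Every historical observation receives a non-zero weight, so gradients can flow to all keys. A hard nearest-neighbour lookup would instead select a subset and block them.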

Core idea

A2E combines two main components:

1. Embedding Network

A Siamese WaveNet-inspired encoder maps forecast sequences into a shared latent space. The encoder is designed to capture:

nonlinear feature interactions
long temporal context
efficient sequence processing through dilated causal convolutions
optional conditioning or spatial structure depending on the model variant
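To make the "long temporal context" point concrete, the sketch below implements a single dilated causal convolution in plain Python and stacks four of them. The kernel size, the dilation schedule `[1, 2, 4, 8]`, and the function names are assumptions chosen for illustration. The repository's encoder may use different settings.

```python
def causal_dilated_conv1d(x, kernel, dilation):
    """1-D causal convolution with left zero-padding.

    kernel[0] multiplies the current step and kernel[j] the step
    j * dilation in the past, so output[t] never sees future inputs.
    """
    out = []
    for t in range(len(x)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = t - j * dilation
            if idx >= 0:  # positions before the sequence start count as zero
                acc += w * x[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilations):
    """Number of past-and-present steps seen by a stack of dilated causal layers."""
    return 1 + (kernel_size - 1) * sum(dilations)


dilations = [1, 2, 4, 8]  # assumed doubling schedule
rf = receptive_field(kernel_size=2, dilations=dilations)

# Push a unit impulse through the stack to see how far its influence reaches.
impulse = [1.0] + [0.0] * 19
hidden = impulse
for d in dilations:
    hidden = causal_dilated_conv1d(hidden, kernel=[1.0, 1.0], dilation=d)

print(rf, hidden)
```

With doubling dilations the receptive field grows exponentially with depth while the parameter count grows only linearly, which is the usual argument for this design.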

2. Cross-Attention Retrieval

The query embedding attends to historical forecast embeddings. The resulting weights are used to construct a weighted ensemble of historical observations, which defines the predictive distribution for the target timestep.
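A weighted ensemble of this kind can be scored directly. The sketch below computes the CRPS of a discrete predictive distribution with members `x_i` and weights `w_i`, using the standard energy form `E|X - y| - 0.5 * E|X - X'|`. The function name and the toy numbers are illustrative, and the repository's loss classes may be implemented differently.

```python
def weighted_crps(members, weights, observation):
    """CRPS of the discrete distribution that puts mass weights[i] on members[i].

    Assumes the weights are non-negative and sum to one.
    """
    # Expected absolute error between the ensemble and the observation.
    term_obs = sum(w * abs(x - observation) for x, w in zip(members, weights))
    # Expected absolute difference between two independent draws (spread term).
    term_spread = sum(
        wi * wj * abs(xi - xj)
        for xi, wi in zip(members, weights)
        for xj, wj in zip(members, weights)
    )
    return term_obs - 0.5 * term_spread


# Two equally weighted members at 0 and 2, observation at 1.
score = weighted_crps([0.0, 2.0], [0.5, 0.5], 1.0)

# A point mass on the observation is a perfect forecast.
perfect = weighted_crps([5.0], [1.0], 5.0)

print(score, perfect)
```

Because the score is a smooth function of the weights almost everywhere, it can serve as a training signal for the attention weights.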

Model variants

A2E

The base model for single-location probabilistic forecasting. It uses a Siamese encoder and a differentiable cross-attention retrieval layer.

CA2E

Conditional A2E extends the embedding network with location embeddings, allowing a single model to generalize across multiple locations. This supports transfer learning across sites and makes it easier to add new locations later by learning only their latent representation.

SA2E

Spatial A2E extends the temporal encoder with spatial convolutions so that the latent state of one location can incorporate information from neighboring or jointly modeled locations.

Repository structure

The package is organized into modular subpackages:

A2E/
    callbacks/
    factory/
    io/
    layer/
    loss/
    metrics/
    model/
    pipeline/

These modules cover model definitions, neural layers, training pipelines, callbacks, metrics, loss functions, and high-level workflow orchestration.

Adding a custom encoder

A2E is designed to be extensible. To add a custom encoder, you need to register it across the model, pipeline, factory, and configuration layers so that it can be instantiated through the standard API.

In general, adding a new encoder requires the following steps:

1. Implement the encoder layer in layer/. Add the encoder implementation as a reusable neural network layer. This layer should define how forecast inputs are mapped into the latent representation used for attention-based retrieval.
2. Create the model definition in model/. Add the model class that uses your new encoder as part of the A2E architecture.
3. Create the corresponding pipeline in pipeline/. Implement the data-preparation pipeline required by your encoder. This includes any model-specific preprocessing, reshaping, or dataset construction.
4. Register the encoder in the factory in factory/factory.py. Extend the factory logic so the framework can instantiate your model and pipeline from the configuration.
5. Register the configuration in io/config.py. Add the required configuration fields and make sure the new encoder or model type can be selected through ModelConfig.

After these steps, the encoder should be available through the regular training and inference workflow exposed by the API.

As a rule of thumb:

use layer/ for the reusable encoder implementation
use model/ for the architecture that integrates the encoder
use pipeline/ for data preparation
use factory/factory.py for object creation and dispatch
use io/config.py for user-facing configuration and registration

This design keeps custom extensions aligned with the existing A2E structure and ensures that new encoder types can be used consistently across training, embedding generation, and retrieval workflows.
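The division of labour between configuration and factory can be pictured with a generic registry pattern. Everything in the sketch below is hypothetical: `ENCODER_REGISTRY`, `register_encoder`, `SketchConfig`, `MyEncoder`, and `build_encoder` are stand-ins invented for illustration and are not the names used in factory/factory.py or io/config.py.

```python
from dataclasses import dataclass
from typing import Callable, Dict

# Hypothetical registry mapping a configured encoder name to a builder.
ENCODER_REGISTRY: Dict[str, Callable[..., object]] = {}


def register_encoder(name):
    """Decorator that makes an encoder class selectable by name."""
    def decorator(builder):
        if name in ENCODER_REGISTRY:
            raise ValueError(f"encoder {name!r} is already registered")
        ENCODER_REGISTRY[name] = builder
        return builder
    return decorator


@dataclass
class SketchConfig:
    """Stand-in for a configuration object; not the repository's ModelConfig."""
    encoder_type: str
    latent_dim: int = 32


@register_encoder("my_encoder")
class MyEncoder:
    """Placeholder for a custom encoder layer."""
    def __init__(self, latent_dim):
        self.latent_dim = latent_dim


def build_encoder(config):
    """Factory dispatch: look up the configured encoder and instantiate it."""
    try:
        builder = ENCODER_REGISTRY[config.encoder_type]
    except KeyError:
        raise ValueError(f"unknown encoder type: {config.encoder_type!r}") from None
    return builder(latent_dim=config.latent_dim)


encoder = build_encoder(SketchConfig(encoder_type="my_encoder", latent_dim=16))
print(type(encoder).__name__, encoder.latent_dim)
```

The point of the pattern is that training code only ever sees the configuration, so a new encoder becomes selectable without touching the training loop.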

Key components

`model/`

Contains the model implementations for A2E, CA2E, SA2E, and the classical Analog Ensemble baseline. The core A2E model:

encodes current and historical forecasts with a shared encoder
computes similarity-based attention weights
returns aligned historical observations and weights for training against probabilistic losses

`layer/`

Contains the neural building blocks:

cross_attention.py – differentiable retrieval with configurable similarity metrics
encoder.py – WaveNet-style temporal encoder
spatial_encoder.py – spatiotemporal extension for SA2E

The cross-attention layer supports multiple similarity metrics, including cosine similarity, Pearson correlation, scaled dot product, and Euclidean distance. It also supports optional top-k selection during retrieval.
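The four metrics and the top-k option can be illustrated with textbook definitions. The sketch below is plain Python and makes two assumptions that may differ from cross_attention.py: Euclidean distance is negated so that larger always means more similar, and top-k is applied by restricting the softmax to the k highest scores. All function names are illustrative.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cosine(a, b, eps=1e-12):
    """Cosine of the angle between a and b (eps guards against zero vectors)."""
    return dot(a, b) / (math.sqrt(dot(a, a)) * math.sqrt(dot(b, b)) + eps)


def pearson(a, b):
    """Pearson correlation: cosine similarity of the mean-centred vectors."""
    mean_a, mean_b = sum(a) / len(a), sum(b) / len(b)
    return cosine([x - mean_a for x in a], [y - mean_b for y in b])


def scaled_dot(a, b):
    """Dot product divided by the square root of the embedding dimension."""
    return dot(a, b) / math.sqrt(len(a))


def neg_euclidean(a, b):
    """Negated Euclidean distance (assumed sign convention)."""
    return -math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def top_k_softmax(scores, k):
    """Softmax over the k highest scores; every other weight is exactly zero."""
    keep = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
    peak = max(scores[i] for i in keep)
    exps = {i: math.exp(scores[i] - peak) for i in keep}
    total = sum(exps.values())
    return [exps[i] / total if i in exps else 0.0 for i in range(len(scores))]


query = [1.0, 2.0, 3.0]
keys = [[1.0, 2.0, 3.0], [3.0, 2.0, 1.0], [2.0, 4.0, 6.0]]

scores = [neg_euclidean(query, key) for key in keys]
weights = top_k_softmax(scores, k=2)
print(weights)
```

Note that the metrics disagree on what counts as similar: `keys[2]` is a perfect match under cosine similarity but the farthest key under Euclidean distance, so the choice of metric changes which analogs are retrieved.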

`pipeline/`

Builds training datasets and preprocessing workflows for the supported model types. The pipeline layer handles train/test splitting, normalization, and model-specific data preparation.
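The interplay of splitting and normalization is easiest to see in a small example. The sketch below assumes a chronological split and z-score normalization, with statistics fitted on the training part only. Whether the repository's pipeline splits chronologically or uses a different scaler is not stated here, and all function names are illustrative.

```python
def chronological_split(samples, test_size):
    """Hold out the last `test_size` fraction of a time-ordered sequence."""
    n_test = int(round(len(samples) * test_size))
    split = len(samples) - n_test
    return samples[:split], samples[split:]


def fit_standardizer(train):
    """Return mean and standard deviation computed on the training data only."""
    mean = sum(train) / len(train)
    variance = sum((x - mean) ** 2 for x in train) / len(train)
    std = variance ** 0.5
    if std == 0.0:  # constant feature: avoid division by zero
        std = 1.0
    return mean, std


def standardize(values, mean, std):
    return [(x - mean) / std for x in values]


series = [float(i) for i in range(10)]
train, test = chronological_split(series, test_size=0.3)

mean, std = fit_standardizer(train)
train_norm = standardize(train, mean, std)
test_norm = standardize(test, mean, std)  # reuse the training statistics
print(mean, std, test_norm)
```

Fitting the statistics on the training part only avoids leaking information from the held-out period. The same parameters must be stored with the model so that inference uses identical scaling.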

`io/`

Provides the main workflow interface through an Api class that covers:

training
embedding generation
embedding-based retrieval

It also includes model configuration serialization and plotting utilities.

Training workflow

The repository exposes a high-level API for training. The general workflow is:

create a ModelConfig
initialize the Api
prepare forecast and observation tensors
provide optimizer, loss, metrics, and training parameters
call train(...)

The training API supports:

model creation or loading from disk
resuming training
checkpointing and epoch tracking
saving model config and normalization parameters
training-history plotting after completion

Example outline

import tensorflow as tf
from A2E.io.config import ModelConfig
from A2E.io.api import Api
from A2E.loss import SCRPS
from A2E.metrics.keras import CRPSMetric, EntropyMetric

config = ModelConfig(
    # fill in your model configuration here
)

api = Api(config=config)

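# forecasts, observations, and save_path must be defined by the caller beforehand.
model, history = api.train(
    forecasts=forecasts,
    observations=observations,
    save_path=save_path,
    epochs=100,
    batch_size=64,
    test_size=0.3,
    optimizer=tf.keras.optimizers.AdamW(),
    loss=SCRPS(),
    metrics=[CRPSMetric(), EntropyMetric()],  # class name matches the import above
)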

Note: adjust imports and configuration fields to match the exact implementation in the repository.

Evaluation

The framework supports evaluation with:

CRPS
SCRPS
RMSE
Bias
Rank histograms for calibration analysis

These metrics allow assessment of both probabilistic forecast quality and deterministic point accuracy.
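For readers unfamiliar with the deterministic and calibration diagnostics, the sketch below shows textbook versions of bias, RMSE, and a rank histogram in plain Python. It assumes equally weighted ensemble members, ignores ties, and defines bias as forecast minus observation. The repository's Evaluation code may handle attention-weighted ensembles differently.

```python
import math


def bias(predictions, observations):
    """Mean error, with the assumed sign convention forecast minus observation."""
    return sum(p - o for p, o in zip(predictions, observations)) / len(observations)


def rmse(predictions, observations):
    """Root mean squared error of point forecasts."""
    squared = [(p - o) ** 2 for p, o in zip(predictions, observations)]
    return math.sqrt(sum(squared) / len(squared))


def rank_histogram(ensembles, observations):
    """Count how often the observation falls into each rank of the ensemble.

    The rank is the number of members strictly below the observation, so an
    ensemble with n members yields n + 1 bins. A flat histogram indicates
    good calibration.
    """
    n_members = len(ensembles[0])
    counts = [0] * (n_members + 1)
    for members, obs in zip(ensembles, observations):
        rank = sum(1 for m in members if m < obs)
        counts[rank] += 1
    return counts


# Toy data: four cases, each with the same three-member ensemble.
ensembles = [[1.0, 2.0, 3.0]] * 4
observations = [0.5, 1.5, 2.5, 3.5]
ensemble_means = [sum(m) / len(m) for m in ensembles]

histogram = rank_histogram(ensembles, observations)
print(histogram, bias(ensemble_means, observations), rmse(ensemble_means, observations))
```

A U-shaped histogram would suggest an underdispersive ensemble and a dome-shaped one an overdispersive ensemble. Bias and RMSE only describe the ensemble mean and say nothing about spread.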

Data

This repository uses Weather data by Open-Meteo.com.

Why this repository is useful

This repository is intended for researchers and practitioners interested in:

postprocessing deterministic weather forecasts
analog ensemble methods
differentiable retrieval
similarity learning in latent space

Citation

If you use this repository in academic work, please cite the associated article:

Phillip Schlicht, Ralf Schemm
Attentive Analog Ensemble (A2E): End-To-End Learning of Analog Similarity by Optimizing Probabilistic Scoring Rules via Differentiable Retrieval

The article has not been published yet. A BibTeX entry will be added once it is.
