This repo contains the code for our 4th place solution to the Embed2Scale Challenge at the 2025 CVPR EarthVision Workshop. Our team:
- Isaac Corley (@isaaccorley) - Wherobots
- Burak Ekim (@burakekim) - University of the Bundeswehr Munich
- Caleb Robinson (@calebrob6) - Microsoft AI for Good Lab
- Esther Rolf (@estherrolf) - University of Colorado Boulder
Our goal for this challenge was to find a compute-friendly method that produces remote sensing image embeddings which serve as adequate representations for downstream tasks, without any training or pretraining. Our approach is as follows:
- First, we stack the Sentinel-1 and Sentinel-2 imagery along the channel dimension, creating a per-sample datacube of size (4, 27, 264, 264) (see the stacking sketch after this list).
- We then feed this to MOSAIKS, initialized with 4,096 3x3 kernels randomly sampled from the dataset, which produces a time series of feature vectors of shape (4, 4096).
- We then mean-pool across the temporal dimension to reduce this to a single vector of shape (4096,).
- Finally, to reduce this to the required 1,024-dimensional vector, we apply simple PCA dimensionality reduction.
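For the first step, the per-sample datacube can be assembled by concatenating the Sentinel-1 and Sentinel-2 time series along the channel dimension. A minimal sketch (the `s1`/`s2` tensors and their band split are placeholders, not part of the challenge data loader):

```python
import torch

# Hypothetical per-sample tensors: 4 timesteps, 264 x 264 pixels.
# The split of the 27 channels between Sentinel-1 and Sentinel-2 is assumed;
# only the total of 27 channels comes from the pipeline description above.
s1 = torch.randn(4, 2, 264, 264)   # e.g. Sentinel-1 bands
s2 = torch.randn(4, 25, 264, 264)  # e.g. Sentinel-2 bands

# Concatenate along the channel dimension to form the datacube
datacube = torch.cat([s1, s2], dim=1)
assert datacube.shape == (4, 27, 264, 264)
```

The full pipeline, from MOSAIKS features to the final 1,024-dimensional embeddings, then looks like this: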
```python
import torch
from einops import rearrange
from sklearn.decomposition import PCA
from torchgeo.models import RCF

# Each sample in a batch is a time series of images, e.g.
# batch["image"] = torch.randn(8, 4, 27, 264, 264)  # (b, t, c, h, w)
dataset = ...  # a torch dataset that returns dict(image=...)
dataloader = torch.utils.data.DataLoader(dataset, batch_size=2, shuffle=False)

# Initialize the model. mode="gaussian" draws random kernels; torchgeo's RCF
# also supports mode="empirical" with a dataset argument to sample kernels
# directly from the data, as described above.
model = RCF(in_channels=27, features=4096, kernel_size=3, mode="gaussian")
model.eval()

# Loop over your images and collect all embeddings
embeddings = []
for batch in dataloader:
    images = batch["image"]  # (b, t, c, h, w)
    b, t = images.shape[0], images.shape[1]
    with torch.inference_mode():
        # Run MOSAIKS across all timesteps independently
        x = rearrange(images, "b t c h w -> (b t) c h w")
        emb = model(x)
        # Average pool over the time dimension
        emb = rearrange(emb, "(b t) c -> b t c", b=b, t=t)
        emb = emb.mean(dim=1)
        embeddings.append(emb)

# Reduce the 4,096-dimensional embeddings to 1,024 dimensions with PCA
embeddings = torch.cat(embeddings, dim=0).numpy()
embeddings = PCA(n_components=1024).fit_transform(embeddings)
```
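The snippet above fits and applies PCA in one call; note that `n_components=1024` requires at least 1,024 samples. If embeddings computed later (e.g. a held-out split) need to be projected into the same space, the fitted PCA object can be kept and reused. A sketch, where `train_features` and `new_features` are hypothetical arrays of 4,096-dimensional RCF features:

```python
# Fit PCA once, then reuse it so later feature matrices land in the same space
pca = PCA(n_components=1024).fit(train_features)
train_embeddings = pca.transform(train_features)  # (n_train, 1024)
new_embeddings = pca.transform(new_features)      # (n_new, 1024)
```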