[WIP] cuda backend prefetch data blob before .so loading #11816

Annotations

1 warning

export-model-cuda-artifact (mistralai, Voxtral-Mini-4B-Realtime-2602, quantized-int4-tile-packed) / linux-job

succeeded Mar 17, 2026 in 41m 42s