🎨 Image Studio (beta): local image generation + chat on your 3090s, testers wanted #348
Replies: 1 comment
This is a monumental release for multi-GPU consumer nodes, @noonghunna! Testing cross-rig efficiency on dense configurations often exposes brutal VRAM fragmentation boundaries, especially when checking the single-GPU fallback UX or tighter profiles like an active-display 4090. When concurrent context passes occur, intermediate normalization weights and layer variances materialize back to global memory (VRAM), causing unexpected allocation ceilings that trip standard execution paths. As you look toward bringing in heavy text-to-video architectures like Wan 2.2 / HunyuanVideo-1.5 for your upcoming video-studio mode, this memory bottleneck will hit consumer envelopes even harder.
I've developed renorm-native to directly address this. It runs a Fused SRAM-Resident Layer Normalization Engine that hooks natively into standard PyTorch module layers. By keeping normalization math bound within local registers instead of global VRAM, it safeguards the execution matrix against fragmentation-induced OOMs during high-activation inference loops. It also features an automated hardware-detection dispatcher layer: if it hits compilation fences or missing driver wrappers, it handles the exception gracefully and drops back to optimized native PyTorch tensor loops so the pipeline never crashes with unhandled execution errors.
I am actively preparing a set of validation runs targeting the single-GPU fallback constraints on consumer setups using renorm-native as an optimization wrapper for the ComfyUI execution block. I will drop full telemetry and matrix logs in a dedicated discussion thread once the benchmarking passes finish!
Repository Reference: https://github.com/Tobi-Adesoye/renorm-native
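For anyone wondering what the "try the fused path, drop back to a native path" dispatch described in the reply might look like in general, here is a minimal sketch. It is not renorm-native's API, which this thread does not show. Every name below (`layer_norm_reference`, `fused_layer_norm_unavailable`, `NormDispatcher`) is hypothetical, and it uses plain Python instead of PyTorch so it runs without a GPU. The fused kernel is simulated by a function that always fails, which stands in for a missing driver or a failed compile.

```python
import math
from typing import Callable, List, Sequence

NormFn = Callable[[Sequence[float], float], List[float]]


def layer_norm_reference(x: Sequence[float], eps: float = 1e-5) -> List[float]:
    """Plain layer normalization over one vector (no learned scale or shift)."""
    n = len(x)
    mean = sum(x) / n
    # Biased variance (divide by n), the usual choice for layer norm.
    var = sum((v - mean) ** 2 for v in x) / n
    inv_std = 1.0 / math.sqrt(var + eps)
    return [(v - mean) * inv_std for v in x]


def fused_layer_norm_unavailable(x: Sequence[float], eps: float = 1e-5) -> List[float]:
    """Hypothetical stand-in for a fused kernel that cannot load on this machine."""
    raise RuntimeError("fused kernel not available")


class NormDispatcher:
    """Try the fast path; on the first failure, switch to the fallback for good."""

    def __init__(self, fast: NormFn, fallback: NormFn) -> None:
        self._fast = fast
        self._fallback = fallback
        self.using_fallback = False

    def __call__(self, x: Sequence[float], eps: float = 1e-5) -> List[float]:
        if not self.using_fallback:
            try:
                return self._fast(x, eps)
            except (RuntimeError, ImportError, OSError):
                # Remember the failure so later calls skip the broken path.
                self.using_fallback = True
        return self._fallback(x, eps)


norm = NormDispatcher(fused_layer_norm_unavailable, layer_norm_reference)
out = norm([1.0, 2.0, 3.0, 4.0])
```

Catching only a narrow set of exception types is a deliberate choice in this sketch: a bare `except` would also hide genuine bugs in the fast path, which is exactly what a tester would want surfaced in a bug report.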
We just merged Image Studio, a turnkey bundle that runs local text-to-image generation and an LLM chat in one browser UI, side by side on your own GPUs. No cloud, no API keys.
What it is
One command
It preflights (docker / GPU / disk / chat model), confirms, builds the ComfyUI image, downloads the Ideogram-4 set, and brings everything up via gpu-mode image-studio. Full guide: docs/ai-studio/ (architecture diagram, first-run steps, troubleshooting).
What we measured (2× RTX 3090)
Calling for beta testers: what would help most
setup-image-studio.sh → create your admin account → generate an image. Did anything trip you up? (Especially the Open WebUI image-gen wiring.)
report.sh --full output if you hit one.
Known caveats (call them out so you're not surprised)
COMFYUI_REF=HEAD to float).
Coming next
Video (HunyuanVideo-1.5 / Wan 2.2 / others) and audio (ACE-Step music, TTS) as video-studio / audio-studio modes, same coexist-on-your-cards idea.
Please drop feedback / numbers in this thread, or open a discussion with the numbers-from-your-rig template. Thanks for kicking the tyres!