-
Notifications
You must be signed in to change notification settings - Fork 603
Pull requests: EricLBuehler/mistral.rs
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
docs(bench): trim outdated mistralrs-bench README
#2157
opened May 20, 2026 by
fiorelorenzo
Loading…
feat(qwen3.5,gguf): add quantized Qwen3.5 model - CPU + Metal (Cuda WIP)
#2129
opened May 13, 2026 by
Rushit
Loading…
fix(core): restrict interprocess dependency to desktop targets
#2120
opened Apr 25, 2026 by
setoelkahfi
Contributor
Loading…
Pluggable KvCacheCodec hook on SingleCache / RotatingCache
#2116
opened Apr 21, 2026 by
ravituringworks
Loading…
3 tasks done
ci: fix Clippy, Rustfmt, and Typos failures for Rust 1.95 stable
#2115
opened Apr 17, 2026 by
glaziermag
Contributor
Loading…
fix(responses): drain streaming chunks in background task handler
#2114
opened Apr 16, 2026 by
glaziermag
Contributor
Loading…
fix(bench): require a successful KV flush probe barrier
#2112
opened Apr 16, 2026 by
glaziermag
Contributor
•
Draft
fix(scheduler): treat Error state sequences as finished in PagedAttention
#2111
opened Apr 16, 2026 by
glaziermag
Contributor
•
Draft
fix(gguf): safely propagate runtime errors for unknown architectures
#2106
opened Apr 14, 2026 by
glaziermag
Contributor
Loading…
fix(quant): prevent device mismatch in GEMV guard and UnquantLinear forward
#2089
opened Apr 9, 2026 by
Jamesrobertsonldn
Loading…
fix(qwen-vl): make chunked MRoPE slicing offset-aware
#2083
opened Apr 9, 2026 by
glaziermag
Contributor
Loading…
fix: Idefics3 encoder cache panic when do_image_splitting is enabled
#2074
opened Apr 8, 2026 by
romnn
Loading…
fix(metal): pass --sdk and -std to air-to-metallib link step in build scripts
#2067
opened Apr 6, 2026 by
setoelkahfi
Contributor
Loading…
Add tensor parallelism support for GDN layers + fix UQFF artifact count
#2054
opened Apr 4, 2026 by
ormandj
Loading…
feat(gguf): add Qwen3.5 (qwen3-next) hybrid MoE GGUF loader
#2049
opened Apr 2, 2026 by
emanueleDiVizio
Loading…
feat(metal): fused MoE expert dispatch with Q4K kernels for Metal
#2048
opened Apr 2, 2026 by
emanueleDiVizio
Loading…
fix(metal): GDN bfloat16, PA scheduler, error handling, MLX SDPA fixes
#2047
opened Apr 2, 2026 by
emanueleDiVizio
Loading…
fix(scheduler): preserve FCFS priority across paged-attention buckets
#2034
opened Mar 28, 2026 by
glaziermag
Contributor
Loading…
fix(server): return /re_isq errors instead of panicking
#2025
opened Mar 25, 2026 by
glaziermag
Contributor
Loading…
fix(docker): align GLIBC versions and add mistralrs binary
#1976
opened Mar 10, 2026 by
glaziermag
Contributor
Loading…
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.