10 min read
Day-0 Support for the Qwen3.5 Family
What we found inside Qwen3.5's hybrid Mamba-Transformer weights and what it took to make the gated delta rule fast on GH200 — from matrix-valued recurrences to mixed batch state corruption.
inferencemambaqwen3.5+3
Read article