Wafer Blog Posts

Ian Ye

Achieving Heterogeneous Compute One Kernel at a Time

How custom kernels pushed our AMD MI355X deployment from a tuned baseline to leading Qwen3.5 397B throughput.

8 min read