| model | test | t/s | peak t/s | ttfr (ms) | est_ppt (ms) | e2e_ttft (ms) |
|:---------------------------------------|-------:|---------------:|---------------:|-----------------:|-----------------:|-----------------:|
| nvidia/diffusiongemma-26B-A4B-it-NVFP4 | pp4096 | 661.05 ± 72.99 | | 6287.72 ± 748.86 | 6280.38 ± 748.86 | 6287.72 ± 748.86 |
| nvidia/diffusiongemma-26B-A4B-it-NVFP4 | tg2048 | 120.66 ± 35.30 | 569.67 ± 37.04 | | | |

Not bad, I guess.
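For a rough sanity check of those numbers, here is a small Python sketch. It assumes `est_ppt` is the estimated prompt-processing time, `ttfr` is the time to first response, and that the test names `pp4096` and `tg2048` mean 4096 prompt tokens and 2048 generated tokens; none of that is stated in the table, so treat the variable names and interpretations as my assumptions. It uses only the mean values and ignores the ± spread.

```python
# Figures copied from the table above (means only).
# Column meanings are assumed, not confirmed by the benchmark output.
PROMPT_TOKENS = 4096   # assumed from the test name "pp4096"
GEN_TOKENS = 2048      # assumed from the test name "tg2048"
EST_PPT_MS = 6280.38   # mean est_ppt for pp4096
TTFR_MS = 6287.72      # mean ttfr for pp4096
TG_TPS = 120.66        # mean t/s for tg2048


def tokens_per_second(tokens: int, elapsed_ms: float) -> float:
    """Throughput implied by processing `tokens` in `elapsed_ms` milliseconds."""
    return tokens / (elapsed_ms / 1000.0)


def seconds_for(tokens: int, tps: float) -> float:
    """Time in seconds to produce `tokens` at a steady `tps` tokens per second."""
    return tokens / tps


# Prefill throughput implied by the mean est_ppt (not the same as the mean t/s,
# since a mean of rates and a rate of means generally differ).
implied_pp_tps = tokens_per_second(PROMPT_TOKENS, EST_PPT_MS)

# Gap between time to first response and estimated prompt-processing time.
overhead_ms = TTFR_MS - EST_PPT_MS

# Wall time for the full generation at the mean generation rate.
gen_seconds = seconds_for(GEN_TOKENS, TG_TPS)

print(f"implied prefill throughput: {implied_pp_tps:.1f} t/s")
print(f"ttfr minus est_ppt: {overhead_ms:.2f} ms")
print(f"time for {GEN_TOKENS} tokens at mean rate: {gen_seconds:.1f} s")
```

The implied prefill rate comes out a bit below the reported 661.05 t/s, which is expected if the reported figure is an average of per-run rates rather than total tokens over average time.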