Made by @freQuensy23 · channel t.me/mlphys
⚡ LLM Inference Cost Calculator
Mixture-of-Experts (MoE)
FP16: 2 B/p
FP8: 1 B/p
FP4: 0.5 B/p
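The precision options above can be turned into a rough weight-memory estimate. The sketch below assumes "B/p" means bytes per parameter and counts model weights only (no KV cache, activations, or runtime overhead). The function name `weight_memory_gb` and the 70B parameter count are hypothetical, chosen for illustration; this is not necessarily the formula the calculator itself uses.

```python
# Bytes per parameter for each precision option (assumed reading of "B/p").
BYTES_PER_PARAM = {"FP16": 2.0, "FP8": 1.0, "FP4": 0.5}


def weight_memory_gb(n_params: float, precision: str) -> float:
    """Approximate memory for model weights only, in GB (1 GB = 1e9 bytes)."""
    return n_params * BYTES_PER_PARAM[precision] / 1e9


if __name__ == "__main__":
    n_params = 70e9  # hypothetical 70B-parameter model
    for precision in BYTES_PER_PARAM:
        print(f"{precision}: {weight_memory_gb(n_params, precision):.1f} GB")
```

Halving the bytes per parameter halves the weight footprint, which is why the precision choice matters so much when estimating how many accelerators a model needs.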
1× · 2× · 4× · 8× · 16×
Prompt cache hit rate