770 Experiments to Squeeze 30 tok/s Out of a 35B MoE Model on a $500 GPU
📰 Dev.to · AlexChen
770 Experiments to Squeeze 30 tok/s Out of a 35B MoE Model on a $500 GPU 29.899 tokens per...
770 Experiments to Squeeze 30 tok/s Out of a 35B MoE Model on a $500 GPU 29.899 tokens per...