Most capable model, best for complex tasks
Fastest 70B model with speculative decoding
Ultra-fast responses, good for simple queries
Balanced performance with a long context window
Efficient and versatile