F5 and Nvidia Enhance AI Inference
Analysis based on 7 articles · First reported Mar 17, 2026 · Last updated Mar 23, 2026
The collaboration between F5, Inc. and Nvidia is expected to positively impact the AI infrastructure market by providing more efficient and scalable solutions. This will likely lead to increased adoption of their technologies, benefiting both companies' stock performance and market share in the AI sector.
F5, Inc. and Nvidia have expanded their collaboration to accelerate and optimize AI inference infrastructures. The integration combines F5, Inc.'s BIG-IP Next for Kubernetes with Nvidia BlueField-3 DPUs, creating an intelligent, telemetry-aware infrastructure layer. This solution aims to increase token throughput by up to 40%, reduce latency by 34%, and achieve a 61% faster time to first token (TTFT), ultimately improving GPU utilization and reducing the cost per token. The enhanced capabilities support agent-driven AI workflows and secure multi-tenant AI platforms, enabling enterprises and NeoCloud providers to monetize AI services more efficiently. The Tolly Group validated these performance gains, confirming the structural uplift in infrastructure efficiency without requiring model modifications. This partnership positions BIG-IP Next for Kubernetes as a strategic control plane for AI factory economics, maximizing return on investment for GPU infrastructure.
Set up alerts, explore entity relationships, search across thousands of events, and build custom intelligence feeds.
Open Dashboard