MangoBoost Sets New Benchmark with Record-Breaking DPU-Based AI Training Storage Solution in MLPerf Storage v2.0

MangoBoost announced that its Mango StorageBoost™ solution delivered record-breaking performance in the latest MLPerf Storage v2.0 benchmark. The result marks a major milestone for DPU-accelerated NVMe/TCP storage systems, showcasing industry-leading performance, efficiency, and scalability for AI training workloads.

MangoBoost’s submission, comprising the Mango StorageBoost™ NVMe/TCP Initiator (NTI) and Target (NTT), demonstrated line-rate throughput over a 400G Ethernet fabric, providing near-local SSD performance for distributed AI workloads such as 3D-UNet on both NVIDIA A100 and H100 GPUs.

Best-in-Class Results Across the Board

In MLPerf Storage v2.0’s Fabric-attached Block Storage category, Mango StorageBoost™ delivered:
Unlocking New Possibilities in AI Storage Architecture

MangoBoost’s submission deployed its NTI on the host and its NTT on the storage server, connected via a 400G Ethernet switch. This configuration allowed the system to emulate demanding AI workloads across multiple GPUs with near-zero CPU overhead and maximum bandwidth utilization.

Furthermore, MangoBoost’s performance outpaced even BlueField-3 systems running both NVMe/TCP and NVMe/RDMA in equivalent test conditions. The Mango StorageBoost™ DPU architecture not only delivered higher throughput but also significantly reduced cost of ownership as systems scale.

The Technology Behind the Results

Mango StorageBoost™ consists of three tightly integrated solutions:
Designed for Real-World Deployment

Mango StorageBoost™ offers seamless integration with standard server platforms and GPUs. Its DPU-based architecture ensures maximum performance while reducing CPU utilization and total infrastructure cost. The solution is available today and can be deployed in existing data center environments without modification to hardware or software stacks.

Source: MangoBoost media announcement