Huawei's CloudMatrix AI vs NVIDIA Grace AI Servers
Introduction
In the rapidly evolving world of artificial intelligence, the competition between tech giants like NVIDIA and Huawei is heating up, with significant implications for the global AI landscape. Recently, NVIDIA's CEO acknowledged that Huawei's new CloudMatrix AI cluster is a formidable competitor, not just to NVIDIA's existing offerings but also to its upcoming Grace Blackwell AI servers. This marks a pivotal moment in the AI hardware race, especially as Huawei seeks to fill the gap left by U.S. export restrictions on advanced AI technologies.
Let's dive into the details of Huawei's CloudMatrix 384 and its comparison with NVIDIA's offerings.
Huawei CloudMatrix 384: A Game-Changer in AI Computing
Huawei's CloudMatrix 384 is an AI chip cluster designed to challenge NVIDIA's dominance in AI hardware. The system links 384 Ascend 910C processors through an advanced 'super node' architecture that uses optical interconnects to boost communication bandwidth and minimize latency[2][3]. The CloudMatrix 384 delivers 67% more compute power than NVIDIA's NVL72 cluster, making it a significant player in the AI computing market[2]. That extra performance, however, comes with higher energy consumption and operational costs, and a steeper price: roughly $8.2 million versus about $3 million for the NVL72[2].
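To make the price/performance trade-off concrete, here is a minimal back-of-the-envelope sketch in Python using only the figures quoted above (67% more compute, roughly $8.2 million versus $3 million). The relative-compute units are illustrative ratios derived from those statements, not vendor-published benchmarks.

```python
# Back-of-the-envelope comparison using only the figures cited in this article.
# Compute is expressed relative to the NVL72 baseline (1.0), not in FLOPS.

nvl72_compute = 1.0            # baseline
nvl72_price_musd = 3.0         # ~$3 million

cloudmatrix_compute = 1.67     # "67% more compute power" than NVL72
cloudmatrix_price_musd = 8.2   # ~$8.2 million

# Relative compute per million dollars spent
nvl72_perf_per_dollar = nvl72_compute / nvl72_price_musd
cm_perf_per_dollar = cloudmatrix_compute / cloudmatrix_price_musd

print(f"CloudMatrix compute per dollar vs NVL72: "
      f"{cm_perf_per_dollar / nvl72_perf_per_dollar:.2f}x")
# ~0.61x: more absolute compute, but noticeably less compute per dollar.
```

In other words, on these quoted figures the CloudMatrix buys scale by spending more, which is consistent with the article's point about higher operational costs.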
Architecture and Performance
A full CloudMatrix system spans 16 racks, 12 of which are compute racks holding 32 Ascend 910C processors each. The system requires an extensive network fabric, including thousands of optical transceivers, similar in spirit to NVIDIA's discontinued DGX H100 NVL256 "Ranger" platform[1]. This architecture enables massive scale-up networking, allowing the CloudMatrix to deliver 300 PFLOPs of dense BF16 compute, nearly double that of the GB200 NVL72[1].
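The rack and FLOPS figures above are internally consistent, which is worth checking explicitly. The short Python sketch below back-calculates the per-chip throughput and the implied NVL72 baseline purely from the numbers quoted in this section; these are derived estimates, not official specifications.

```python
# Sanity-check of the scale-up numbers quoted in this section.

compute_racks = 12
chips_per_rack = 32
total_chips = compute_racks * chips_per_rack      # 384 Ascend 910C processors

cluster_bf16_pflops = 300                         # dense BF16, per the article
per_chip_tflops = cluster_bf16_pflops / total_chips * 1000

# "67% more compute than NVL72" implies the NVL72 baseline:
nvl72_bf16_pflops = cluster_bf16_pflops / 1.67

print(f"Total chips: {total_chips}")                                   # 384
print(f"Implied dense BF16 per Ascend 910C: ~{per_chip_tflops:.0f} TFLOPs")  # ~781
print(f"Implied GB200 NVL72 dense BF16: ~{nvl72_bf16_pflops:.0f} PFLOPs")    # ~180
```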
Ascend SuperNode Architecture
At the heart of the CloudMatrix 384 is the Ascend SuperNode architecture, unveiled at the Ascend AI Developer Summit in May 2025. This architecture is designed to maximize computational efficiency by housing 384 Ascend 910C processors across 12 compute cabinets[3]. The SuperNode is built to handle large-scale AI models and datasets, offering over three times the memory capacity of NVIDIA's NVL72[2].
NVIDIA's Response: Grace Blackwell AI Servers
NVIDIA's Grace Blackwell AI servers are part of the company's strategy to maintain its leadership in AI computing. While specific details are still emerging, they are expected to deliver high-performance capabilities tailored for AI workloads. The fact that NVIDIA's CEO has acknowledged the CloudMatrix as a competitor suggests these new servers will be designed to outperform existing offerings, including Huawei's.
Comparison of CloudMatrix 384 and NVIDIA's Offerings
| Feature | Huawei CloudMatrix 384 | NVIDIA NVL72 | NVIDIA Grace Blackwell |
|---|---|---|---|
| Processors | 384 Ascend 910C | 72 NVIDIA GPUs | Details emerging |
| Compute power | 67% more than NVL72 | Baseline | Expected to be high |
| Memory capacity | Over 3x NVL72 | Baseline | Expected to be high |
| Energy consumption | Higher than NVL72 | Lower than CloudMatrix | Details emerging |
| Price | ~$8.2 million | ~$3 million | Not disclosed |
Historical Context and Future Implications
The rise of Huawei's CloudMatrix 384 is partly a response to U.S. export restrictions on advanced AI technologies, which have limited China's access to NVIDIA's high-performance AI chips[2]. This has created an opportunity for Huawei to develop domestic AI solutions, marking a significant shift in the global AI landscape.
As we look to the future, the competition between Huawei and NVIDIA will likely intensify, with both companies pushing the boundaries of AI computing. NVIDIA's Grace Blackwell servers are poised to be a major player in this race, but Huawei's CloudMatrix 384 has already shown its potential as a powerful alternative.
Conclusion
Huawei's CloudMatrix 384 represents a significant advancement in AI computing, offering substantial compute power and memory capacity, though it faces challenges in energy efficiency and operational cost. NVIDIA's acknowledgment of the CloudMatrix as a competitor underscores the intensity of the AI hardware race, with both companies vying for leadership in this rapidly evolving field.