Qwen3.7-Max, upgraded cloud infrastructure and model services, and new T-Head chips announced at Alibaba Cloud Summit
Alibaba today announced a comprehensive upgrade
of its full AI stack—spanning cloud infrastructure and model services,
AI chips, and foundation models —to empower customers in building,
deploying, and scaling AI agents with greater
efficiency, reliability, and performance.
Unveiled at the Alibaba Cloud Summit, Qwen3.7-Max is Alibaba's
latest large language model, engineered for advanced agentic coding,
complex reasoning, and long-horizon task execution. Qwen3.7-Max will be
available soon for developers and enterprises
worldwide.
To address surging compute and AI workload demands in the agentic era,
Alibaba Cloud has also upgraded its infrastructure and model services.
Key launches include the
Panjiu AL128 Supernode Server, designed to empower scalable agent
inference and large-scale model training, and an optimization update
within Alibaba's model service platform that continuously refines model
performance.
Additionally, T-Head, Alibaba's semiconductor design subsidiary, introduced the
Zhenwu M890, its latest AI training and inference processor,
featuring high-capacity memory, robust inter-chip bandwidth, and native
FP4 precision support.
Qwen 3.7-Max: A Versatile Foundation Model for the Agent Era
Designed as a robust foundation for AI agents,
Qwen
3.7-Max seamlessly handles
code generation and debugging, office workflow automation, and complex
multi-step tasks requiring hundreds or thousands of actions.
The model delivers exceptional agent capabilities across diverse
domains. As a frontier-level coding assistant, it supports coding tasks
from rapid frontend prototyping to complex, multi-file software
engineering. To enhance office work productivity, it reliably
orchestrates multi-agent workflows to tackle sophisticated operations.
Notably,
Qwen 3.7-Max can autonomously execute long-horizon agentic
tasks—sustaining continuous operation for up to 35 hours and managing
over 1,000 tool calls without performance degradation.
Deeply optimized for leading agent frameworks including OpenClaw, Hermes
Agent, Claude Code, Qwen Paw and Qoder, it serves as a reliable
backbone for different agent systems. The model achieves top-tier
results across major benchmarks in coding, general-purpose
agents, general capabilities and multilingualism, making it competitive
with leading frontier models. It will be soon accessible through
Alibaba's model service platform Model Studio for global developers.
Next-Generation Intelligent Computing and Enhanced Model Services
To empower scalable AI Agent inference and large-scale model training, Alibaba Cloud has launched the
Panjiu AL128 Supernode Server, powered by the Zhenwu M890 AI
processor and ICN Switch 1.0 networking chip. By tightly integrating 128
AI accelerators within a single rack, the system delivers single-rack
bandwidth at the petabyte-per second (PB/s) scale,
dramatically improving the handling of large-scale concurrent requests
from agents.
The Panjiu AL128 is now available on Model Studio for the China market
(or "Bailian"), enabling Chinese enterprises to efficiently address
training and inference demands across sectors.
To optimize performance, Bailian has introduced Agentic RL, a
reinforcement learning mechanism powered by agent execution feedback, to
drive continuous model iteration. Bailian also features built-in safety
governance capabilities, ensuring that autonomously
operating agents always remain within defined boundaries.
T-Head’s Latest Chips and Software Stack for AI Training and Inferencing
T-Head’s latest AI accelerator, the Zhenwu M890, delivers three times the performance of its predecessor Zhenwu 810E. Zhenwu M890 features
144 gigabytes (GB) of GPU memory and 800 GB per second of inter-chip bandwidth.
The chip natively supports multiple data precision formats, ranging from FP32 (32-bit floating-point) down to FP4 (4-bit floating-point), supporting
both high-precision model training and ultra-low-precision model
inference. These capabilities
make it exceptionally well-suited for complex agentic AI workloads,
which demand extensive working memory for context retention, high-speed
communication for multi-agent coordination, and low-precision computing
to maintain rapid execution while reducing cost.
The chip is built on T-Head’s proprietary parallel computing
architecture and utilizes its custom ICN (Inter-Chip Network)
interconnect protocol.
Alongside the accelerator, T-Head unveiled the ICN Switch 1.0, a dedicated switching chip designed to create high-bandwidth, low-latency scale-up networks for compute clusters. It delivers up to
25.6 Tbps of aggregate bandwidth and achieves extreme low latency and congestion-free communication. By
pairing the Zhenwu M890 with the ICN Switch 1.0 chip, it enables
full-bandwidth interconnection across 64 accelerators, significantly
boosting
the computational efficiency and stability of large-scale intelligent
computing. T-Head also unveiled its proprietary software stack,
T-Head SAIL™, to unleash the full computational potential for its chips.
T-Head has achieved widespread industrial adoption of its proprietary AI chips, with over
560,000 Zhenwu units delivered to date. More than 400 external
customers across 20 industries, including leading
automakers and financial services companies, have deployed the chips to
power intelligent operations.
Tags
AI agents
Alibaba Cloud Summit
Alibaba Qwen 3.7-Max
Model Studio.
OpenClaw AI
Panjiu AL128 Supernode Server
T-Head Chips
Zhenwu M890 Processor