DeepSeek V4 Launches with 1M Context Window; Huawei Ascend and Cambricon Chips Achieve Full Compatibility

Gate News message, April 24 — DeepSeek V4-Pro and DeepSeek V4-Flash were officially released and open-sourced on April 24, with context processing length significantly expanded from 128K to 1M, representing nearly a 10-fold capacity increase. Huawei Computing announced that its Ascend supernode products fully support DeepSeek V4 series models through close collaboration between chip and model technologies.

Huawei Ascend 950 achieves high-throughput, low-latency DeepSeek V4 model inference deployment through fused kernel and multi-stream parallelism techniques to reduce Attention computation and memory access overhead. For DeepSeek V4-Pro with 8K input, Ascend 950 achieves approximately 20ms TPOT with 4,700 TPS single-card Decode throughput; for DeepSeek V4-Flash under 8K input, it reaches approximately 10ms TPOT with 1,600 TPS throughput. Ascend A3 supernode series also achieves full compatibility, with training reference implementations provided for rapid fine-tuning. Based on Ascend A3 64-card supernode with large EP mode, DeepSeek V4-Flash achieves over 2,000 TPS single-card Decode throughput in 8K/1K input-output scenarios using vLLM inference engine. Huawei’s full Ascend A2, A3, and 950 product lines support both DeepSeek V4-Flash and V4-Pro.

Huawei Cloud announced first-mover compatibility with DeepSeek V4, providing developers with one-click API token services through its MaaS platform. Huawei Cloud optimized system layer, operator layer, and cluster layer capabilities to ensure rapid model adaptation and high-performance deployment. Enterprises including Kingsoft WPS and 360 have already integrated DeepSeek’s new model via Huawei Cloud.

Cambricon also announced Day 0 compatibility with DeepSeek V4-Flash and V4-Pro based on the vLLM inference framework, with adaptation code open-sourced to the GitHub community. Cambricon previously achieved first-mover adaptation when DeepSeek V3.2 was released last year, having conducted deep software-hardware collaborative performance optimization on DeepSeek series models.

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

Ransomware Cases Jump 389% in 2025 to 7,831, Fortinet Reports

According to Fortinet, global ransomware cases rose 389% year on year to 7,831 in 2025 as cybercriminals leveraged AI and accelerated attacks on software vulnerabilities. Manufacturing was the most targeted sector with 1,284 cases, followed by business services with 824 and retail with 682. Fortine

GateNewsJust Now

Blackstone, KKR, EQT in Talks with Alphabet on AI Portfolio Deals

According to Bloomberg, Blackstone, KKR, and Sweden-based EQT are in talks with Alphabet to provide their portfolio companies with access to Google's AI models through portfolio-wide contracts. The discussions are non-exclusive and may not result in deals. The arrangement would give Google broader a

GateNews10m ago

Finnish AI Lab QuTwo Completes $29M Seed Round at $380M Valuation; Founder's Prior Company Silo AI Sold to AMD for $665M

According to Beating, Finnish AI lab QuTwo completed a 25 million euro (approximately $29 million) seed round with a post-valuation of 325 million euros (approximately $380 million). Founder and Executive Chairman Peter Sarlin previously founded Silo AI, which AMD acquired for $665 million in 2024.

GateNews40m ago

DeepSeek Valued at $45B as China's State Semiconductor Fund Eyes Lead Investment

According to ChainCatcher, China's state-backed semiconductor investment fund is in talks to lead DeepSeek's Series A funding round, potentially valuing the AI lab at approximately $45 billion. The funding negotiations are ongoing, according to four people familiar with the

GateNews1h ago

Microsoft survey: Only 13% of employees who are incentivized to drive AI-powered workplace innovation fail

According to Microsoft’s annual Work Trend Index report released on May 5, the report analyzed trillions of anonymous Microsoft 365 productivity signals and surveyed 20,000 employees across multiple markets including the United States, the United Kingdom, India, and Japan. The report data shows that only 13% of employees say their employers provide incentives when attempts to improve work with AI do not deliver the expected results.

MarketWhisper2h ago

Meta is developing an AI assistant named Hatch to rival OpenClaw, completing internal testing by the end of June

According to a May 5 report by the Financial Times, Meta is developing an AI assistant (Hatch) for mainstream consumers, inspired by OpenClaw from OpenAI, with the goal of completing internal testing by the end of June; at the same time, Meta plans to integrate an independent agentic shopping tool into its Instagram service before the fourth quarter of this year.

MarketWhisper2h ago
Comment
0/400
No comments