AI Daily: Volcanic Engine Unveils Doubao Seedance 2.5 Suite; Shengshu's Vidu Q3 Deploys on Huawei Cloud; Baichuan Intelligence Ships M4 Model

Reports 2026-06-25 05:07:13

Welcome to AI Daily — your daily briefing on the artificial intelligence landscape. Each edition curates top AI developments for developers, offering a clear lens on technology trends and emerging AI-powered products and applications.

1. Volcanic Engine Unveils Doubao Seedance 2.5 Model Family, Ushering in a One-Stop Large Model Development Paradigm

Volcanic Engine has rolled out Seedance 2.5 and a suite of AI model upgrades, delivering an end-to-end development pipeline from model selection through application integration. The move streamlines deployment while broadening multimodal use cases.

[AiBase Summary:] 🔥 Seedance 2.5 delivers gains in instruction adherence, pose control, and cinematic-grade visual quality. 🛠️ Volcanic Engine offers an end-to-end development experience that accelerates AI application construction. 🎯 Flexible access is provided for AIGC creators, developers, and enterprises.

2. Shengshu Technology's Vidu Q3 Deploys on Huawei Cloud, a Milestone in Bringing Domestic Video Generation Models to the Commercial Cloud

Shengshu Technology's Vidu Q3 has officially landed on Huawei Cloud, granting developers and enterprises streamlined cloud access to video generation capabilities. The launch represents a significant milestone in the commercialization of domestic video generation models.

[AiBase Summary:] 🤖 Vidu Q3 is a video generation model that produces dynamic footage from text prompts or source images. ☁️ Listing on Huawei Cloud reduces adoption friction, speeding commercialization and ecosystem build-out. 🚀 Domestic multimodal large-model commercialization is accelerating amid rising competitive pressure.

3. Baichuan Intelligence Ships M4: MoE Architecture with 200K Context Window, Built for Complex Agent Workloads

Baichuan Intelligence has unveiled M4, a large model built on a Mixture-of-Experts (MoE) architecture with a 200K-token context window, purpose-built for complex Agent workflows. Reported inference costs drop by as much as 90%.

[AiBase Summary:] 🧠 M4 is built on an MoE architecture with 1T total parameters, of which 200B are activated. 📚 Its 200K-token context window suits long-document processing and extended multi-turn dialogues. 💰 The Exploit-and-Explore training regimen drives inference costs down by up to 90%.
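
As a quick check on the parameter figures above, the short Python sketch below computes the share of M4's parameters that is activated, using only the two reported numbers. The function name is hypothetical. The result says nothing on its own about the reported 90% cost reduction, which the item attributes to the training regimen.

```python
def active_fraction(total_params: float, active_params: float) -> float:
    """Return the share of a Mixture-of-Experts model's parameters that is activated."""
    if total_params <= 0 or not 0 <= active_params <= total_params:
        raise ValueError("expected 0 <= active_params <= total_params and total_params > 0")
    return active_params / total_params


TOTAL_PARAMS = 1e12     # 1T total parameters, as reported in the item
ACTIVE_PARAMS = 200e9   # 200B activated parameters, as reported in the item

fraction = active_fraction(TOTAL_PARAMS, ACTIVE_PARAMS)
print(f"Activated share: {fraction:.0%}")  # prints "Activated share: 20%"
```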

4. DeepSeek Open-Sources Multimodal In-Distribution Detector: DeepSeek-HawkEye

DeepSeek has open-sourced DeepSeek-HawkEye, a multimodal in-distribution detector that evaluates whether inputs lie within the distribution of a model's training data. The tool enables developers to flag out-of-distribution inputs prior to deployment, bolstering AI system reliability.

[AiBase Summary:] 👁️ DeepSeek-HawkEye identifies whether inputs reside inside the training data distribution. 🔧 The open-source tool flags out-of-distribution inputs, enhancing system robustness. 🧪 It adds a new layer of technical assurance for safely deploying multimodal AI systems.
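
To illustrate the general idea of in-distribution checking, here is a minimal Python sketch. It is not DeepSeek-HawkEye's method or API, which the item does not describe. It assumes inputs have already been turned into fixed-length feature vectors by some encoder, and it uses a simple per-dimension z-score distance. The class name, the toy data, and the 95% threshold are all illustrative assumptions.

```python
import math
from statistics import mean, pstdev


class ZScoreDetector:
    """Toy in-distribution check on fixed-length feature vectors (illustrative only)."""

    def fit(self, train_vectors, quantile=0.95):
        columns = list(zip(*train_vectors))
        self.means = [mean(c) for c in columns]
        # Fall back to 1.0 when a dimension has no spread, so division stays defined.
        self.stds = [pstdev(c) or 1.0 for c in columns]
        scores = sorted(self.score(v) for v in train_vectors)
        # Use the training score at the chosen quantile as the cutoff.
        index = min(len(scores) - 1, int(quantile * len(scores)))
        self.threshold = scores[index]
        return self

    def score(self, vector):
        # Euclidean norm of the per-dimension z-scores: larger means farther from training data.
        return math.sqrt(
            sum(((x - m) / s) ** 2 for x, m, s in zip(vector, self.means, self.stds))
        )

    def is_in_distribution(self, vector):
        return self.score(vector) <= self.threshold


# Hypothetical 2-D "embeddings" standing in for encoder outputs.
train = [(1.0, 2.0), (1.2, 1.8), (0.9, 2.1), (1.1, 2.2), (1.0, 1.9)]
detector = ZScoreDetector().fit(train)

print(detector.is_in_distribution((1.05, 2.0)))   # close to the training data
print(detector.is_in_distribution((8.0, -3.0)))   # far from the training data
```

A production detector would typically work on learned embeddings and calibrate its threshold on held-out data. The sketch only shows the fit, score, and threshold pattern.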

5. Alibaba International Debuts Multimodal AI Shopping Assistant 'SearchMind-2.0'

Alibaba International has introduced SearchMind-2.0, a multimodal AI shopping assistant that accepts image, text, and voice inputs so that consumers worldwide can search for and discover products more intuitively. The tool improves both the experience and the efficiency of cross-border shopping.

[AiBase Summary:] 🛍️ SearchMind-2.0 enables multimodal search across image, text, and voice modalities. 🌍 Built for a global consumer base, it elevates the cross-border shopping journey. 🤖 Through AI-powered e-commerce, Alibaba International is advancing intelligent retail.

6. Microsoft Ships Phi-4-Sequor: A 4-Billion-Parameter Generative Speech Model Optimized for Mobile

Microsoft has unveiled Phi-4-Sequor — a 4-billion-parameter generative speech model purpose-built for mobile deployment — extending the frontier of on-device AI speech interaction.

[AiBase Summary:] 📱 The 4-billion-parameter speech model runs locally on-device, delivering low latency and preserving privacy. 🔊 It handles text-to-speech (TTS), automatic speech recognition (ASR), and other speech tasks. 🔬 FP6 quantization and related techniques shrink the model footprint while preserving high accuracy.
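
For a rough sense of what 6-bit weights mean for a model of this size, the Python sketch below estimates weight storage at two precisions. The 16-bit baseline is an assumption for comparison and is not stated in the item. The estimate covers weights only and ignores activations, caches, and any quantization metadata. The function name is hypothetical.

```python
def weight_footprint_gb(num_params: int, bits_per_weight: float) -> float:
    """Approximate storage for model weights alone, in gigabytes (1 GB = 1e9 bytes)."""
    return num_params * bits_per_weight / 8 / 1e9


PARAMS = 4_000_000_000  # 4 billion parameters, as reported in the item

fp16_gb = weight_footprint_gb(PARAMS, 16)  # assumed 16-bit baseline
fp6_gb = weight_footprint_gb(PARAMS, 6)    # 6-bit weights, per the FP6 mention

print(f"16-bit weights: {fp16_gb:.1f} GB")  # prints "16-bit weights: 8.0 GB"
print(f"6-bit weights:  {fp6_gb:.1f} GB")   # prints "6-bit weights:  3.0 GB"
```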

7. OpenAI Rushes GPT-5.5 Image Generation Back Online After Quality Instability Forced a Shutdown

OpenAI has restored GPT-5.5's image generation capability after pulling it offline over quality inconsistencies. The swift restart underscores both the technical challenges inherent in generative AI for creative use cases and the rapid-iteration tempo required to stay competitive.

[AiBase Summary:] 🔄 GPT-5.5 image generation has been restored with quality improvements. 🎨 Intense competition in the image generation space keeps OpenAI iterating to defend its edge. ⚙️ Output instability in AI-generated imagery persists as an industry-wide challenge.