Infinigence Unveils Next-Gen AI Infrastructure Suite, Aims to Lead China’s AI Deployment Revolution

今日霍州(www.jrhz.info)©️

Xia Lixue, Co-founder and CEO of Infinigence

AsianFin -- Infinigence, an AI infrastructure startup backed by Tsinghua University, introduced a sweeping portfolio of performance-optimized computing platforms targeting the full spectrum of AI deployment at this year’s World Artificial Intelligence Conference (WAIC 2025) .

The company officially launched three flagship products under its integrated solution suite: Infinicloud, a global-scale AI cloud platform for clusters of up to 100,000 GPUs; InfiniCore, a high-performance intelligent computing platform designed for multi-thousand-GPU clusters; and InfiniEdge, a lean, edge computing solution optimized for terminal deployments with as few as one GPU.

Together, the platforms represent what CEO Xia Lixue calls a “software-hardware co-designed infrastructure system for the AI 2.0 era.” Built for compatibility across heterogeneous computing environments, the Infinigence stack offers full lifecycle support—from model scheduling and performance optimization to large-scale application deployment.

“We’re addressing a core bottleneck in China’s AI industry: fragmentation in compute infrastructure,” Xia said. “With InfiniCloud, InfiniCore, and InfiniEdge, we’re enabling AI developers to move seamlessly between different chips, architectures, and workloads—unlocking intelligent performance at scale.”

In a fast-evolving AI landscape dominated by open-source large language models such as 『DeepSeek』, GLM-4.5, and MiniMax M1, Chinese infra startups are racing to build the backbone that powers model deployment and inference.

Early on July 29, Infinigence announced that InfiniCloud now supports Zhipu AI’s latest GLM-4.5 and GLM-4.5-air models, which currently rank third globally in performance. The move signals Infinigence’s ambition to anchor the growing synergy between Chinese model developers and domestic chipmakers.

Xia likened the trio of newly launched platforms to “three bundled boxes” that can be matched to AI workloads of any scale. “From a single smartphone to clusters of 100,000 GPUs—our system is designed to ensure resource efficiency and intelligent elasticity,” he said.

Infinigence’s platforms are already powering Shanghai ModelSpeed Space, the world’s largest AI incubator. The facility sees daily token call volumes exceed 10 billion, supports over 100 AI use cases, and reaches tens of millions of monthly active users across its applications.

A key challenge for China’s AI infrastructure sector is hardware heterogeneity. With dozens of domestic chip vendors and proprietary architectures, developers often struggle to port models across systems.

Xia emphasized that Infinigence has developed a “universal compute language” that bridges chips with disparate instruction sets. “We treat computing resources like supermarket goods—plug-and-play, interoperable, and composable,” he said.

The company’s infrastructure has already achieved full-stack adaptation for more than a dozen domestic chips, delivering 50%–200% performance gains through algorithm and compiler optimization. It also supports unified scheduling and mixed-precision computing, enabling cost-performance ratios that beat many international offerings.

“What’s missing in China’s ecosystem is a feedback loop,” Xia said. “In the U.S., NVIDIA and OpenAI form a tight cycle: model developers know what chips are coming, and chipmakers know what models are being built. We’re building that loop domestically.”

Infinigence is also targeting AI democratization with a first-of-its-kind cross-regional federated reinforcement learning system. The system links idle GPU resources from different regional AIDC centers into a unified compute cluster—allowing SMEs to build and fine-tune domain-specific inference models using consumer-grade cards.

To support this, Infinigence launched the “AIDC Joint Operations Innovation Ecosystem Initiative” in partnership with China’s three major telecom providers and 20+ AIDC institutions.

Xia noted that while training still depends heavily on NVIDIA hardware, inference workloads are rapidly migrating to domestic accelerators. “Users often start with international chips on our platform, but we help them transition to Chinese cards—many of which now deliver strong commercial value,” he said.

Infinigence has also rolled out a series of on-device and edge inference engines under its Infini-Ask line. These include:

  • Infini-Megrez2.0, co-developed with the Shanghai Institute of Creative Intelligence, the world’s first on-device intrinsic model.

  • Infini-Mizar2.0, built with Lenovo, which enables heterogeneous computing across AI PCs, boosting local model capacity from 7B to 30B parameters.

  • A low-cost FPGA-based large model inference engine, jointly developed with Suzhou Yige Technology.

Founded in May 2023, Infinigence has raised more than RMB 1 billion in just two years, including a record-setting RMB 500 million Series A round in 2024—the largest to date in China’s AI infrastructure sector.

Its product portfolio now spans everything from model hosting and cloud management to edge optimization and model migration—serving clients across intelligent computing centers, model providers, and industrial sectors.

The company’s broader mission, Xia said, is to balance scale, performance, and resource availability. “Our vision is to deliver ‘boundless intelligence and flawless computing’—wherever there's compute, we want Infinigence to be the intelligence that flows through it.”

特别声明:[Infinigence Unveils Next-Gen AI Infrastructure Suite, Aims to Lead China’s AI Deployment Revolution] 该文观点仅代表作者本人,今日霍州系信息发布平台,霍州网仅提供信息存储空间服务。

猜你喜欢

35岁『谭松韵』反差魅力,少女元气御姐气场她全有!(『谭松韵』的真实)

与此同时,许多85后的女星已经在古装剧里扮演母亲,她却依然在新剧中和1999年出生的男主角🎭️谈恋爱,弹幕满屏都是毫无违和。这场景切换得如此迅速,让我一下子明白了,真正打动人心的并非她看似幼嫩的脸庞,而是她在少女…

35岁『谭松韵』反差魅力,少女元气御姐气场她全有!(『谭松韵』的真实)

请教专家隐形眼镜👓一般多少钱呢(隐形眼睛怎么样)

隐形眼镜👓的价格通常在100至1000元之间,具体取决于镜片类型、使用周期和个人需求。普通软性隐形眼镜👓适合日常矫正近视或远视,价格一般在100到300元

请教专家隐形眼镜👓一般多少钱呢(隐形眼睛怎么样)

被『设计师』怒怼仅1天,杨颖与『黄晓明』再同框,仅存的体面也被撕碎(『设计师』被骂)

结果『设计师』忍不住跳出来表示,我们可没同意这种做法,你这是在拉低高定艺术品的档次。杨颖可不是第一次这样做了,之前她也穿过这个品牌的衣服在『直播间』卖东西,圈内不少『明星』️都这样做过。那些优质的电影和剧本她拿不到了,综艺…

被『设计师』怒怼仅1天,杨颖与『黄晓明』再同框,仅存的体面也被撕碎(『设计师』被骂)

InfoQ:2025年火山引擎智能视频云实践精选集(2025年火山爆发是哪个国家)

行业实践上,方案已广泛落地多领域。 未来,火山引擎将持续深化“LLM×视频云”融合,推进全息直播等场景规模化落地,通过技术迭代重塑社交、办公与远程协作体验,为AI时代企业提供坚实的视频基建支撑,共建全球智能…

InfoQ:2025年火山引擎智能视频云实践精选集(2025年火山爆发是哪个国家)

黑龙江籍游客到剑门关景区永久免票 黑龙江籍老乡:原来剑门关一直记得!(黑龙江旅游知乎)

定居成都的原黑龙江籍游客邵先生,近日在剑门关景区收获了一份跨越地域的感动……2月3日下午,第一次来到剑门关的游客邵先生在售票处递上身份证🪪,准备购买门票挑战惊险的猿猱道。工作人员在查看证件后告诉他:“黑龙江的游客,到我们景区是永久免票的。”邵

黑龙江籍游客到剑门关景区永久免票 黑龙江籍老乡:原来剑门关一直记得!(黑龙江旅游知乎)