Scaling Law for Large Models Not to See Perpetual Growth, Says Founder of Axera Tech

It has achieved large-scale production in the fields of smart cities and assisted driving and can be applied to various model scenarios such as text-to-image search, general detection, text generation, and AI agents.

TMTPOST--Axera Technology, a developer of computer vision processors, has launched the “Axera Tongyuan AI Processor” designed for AI edge model inference at the 2024 World Artificial Intelligence Conference (WAIC 2024).

The Axera Tongyuan AI Processor, unveiled last Friday, features an operator instruction set and data flow microarchitecture at its core, using a programmable data flow microarchitecture. It offers three levels of computing power and supports native transformer models.

It has achieved large-scale production in the fields of smart cities and assisted driving and can be applied to various model scenarios such as text-to-image search, general detection, text generation, and AI agents.

Qiu Xiaoxin, the founder and chairman of Axera Technology, noted that the true large-scale implementation of large models requires a tight integration of cloud-edge-terminal systems. The key to the integration of edge and terminal lies in AI computing and perception.

Axera Technology, by leveraging its self-developed core technologies of Axera Zhimou AI-ISP and Axera Tongyuan hybrid precision NPU, has established a strategic route focusing on "AIoT+ADAS" and is advancing into the edge computing and AI inference fields. This aims to accelerate the application of smart cities and intelligent driving.

Qiu pointed out that the team is closely monitoring industry peers' layouts for edge AI model applications. She asserted that the Scaling Law for large models is not a “hard and fast rule” and won’t continue to grow linearly. There will inevitably be periods of stable and gradual optimization, she added.

“People have realized that Moore’s Law has reached its limit. From a single-chip perspective, Moore's Law is still evolving but at a slowing pace. Currently, the semiconductor industry enhances overall performance through system-level solutions. This year’s GTC AI Conference highlighted system-level optimization, not just single chips. The entire system’s scheduling, optimization, and management are critical. The future trajectory of Scaling Law will involve further optimization in some form. Nothing can grow linearly forever; it will reach a point of nonlinear growth. When that inflection point arrives, whether the current optimization methods remain viable needs reevaluation,” Qiu elaborated.

Axera Technology, founded in May 2019, provides foundational AI computing platforms for various industries. The company has successfully developed and mass-produced high, medium, and low-end edge AI chips, focusing on smart cities, intelligent driving, robotics, and innovative business scenarios.

As of now, Axera Technology has completed its Series B financing, with investors including Tencent, Qiming Venture Partners, Meituan, V Hall Venture Capital, Lenovo Capital, and Glory Ventures.

The newly unveiled “Axera Tongyuan AI Processor” is primarily aimed at AI model inference. It optimizes computing power through model algorithm and chip design collaboration and model miniaturization, accelerating the large-scale application of large models.

Qiu said that intelligent computing centers, which other AI chip companies are focusing on, are not Axera’s priority at the current stage. The focus now is on the vast market of edge and terminal applications. According to Qiu, the large-scale application of AI models must involve a cloud-edge-terminal triad.

“The foundational large models of generative AI will definitely be cloud-based. However, whether these large models can be refined or optimized to become industry-specific models and move to the edge, instead of having trillions of parameters, is still possible,” said Qiu. The current stage of large model application is still very primitive, or “broke force,” she added.

Qiu pointed out that the first scenarios for deploying large models at the edge are likely to be in vehicles, followed by smartphones and AI PCs, because vehicles require real-time responses, making them an ideal application for edge large models, such as in intelligent driving, smart cockpits, human-machine interaction, and AI agents.

“Application scenarios are very diverse. A 3.2T small chip integrated into a phone chip can enable many local applications without needing cloud support,” Qiu suggested. She also envisioned that a potential future application for generative AI could be “smart homes,” where a home AI server hub acts as a computing center, with entry points possibly being phones, robots, and "embodied intelligence," camera, and voice control devices.

Regarding the business model for implementing Axera’s AI capabilities, Qiu mentioned two main approaches: One is for clients whose SoC computing power is insufficient or cannot natively support large models, integrating Axera’s NPU IP into their chips to provide efficient NPU capabilities. The other is to achieve large-scale deployment solutions through Axera’s chips and software stack.

Qiu emphasized that the semiconductor industry has long cycles and requires a mutual selection process with investors. Fast-paced, short-term investment firms are unsuitable for this sector.

“Choosing the right investors is crucial. First, investors must understand the entire logic and cycle of the semiconductor industry. Second, as a chip company, achieving a commercial closed-loop is essential. This is why we aim for large-scale production and ecosystem formation. A healthy chip company can achieve breakeven and profitability within seven to ten years on average. We hope to develop steadily and quickly enter a positive cycle,” Qiu remarked.

转载请注明出处、作者和本文链接
声明:文章内容仅供参考、交流、学习、不构成投资建议。
想和千万钛媒体用户分享你的新奇观点和发现,点击这里投稿 。创业或融资寻求报道,点击这里

敬原创,有钛度,得赞赏

赞赏支持
发表评论
0 / 300

根据《网络安全法》实名制要求,请绑定手机号后发表评论

登录后输入评论内容

快报

更多

2024-10-06 22:50

2024国庆档预测票房超21.2亿

2024-10-06 22:27

上交所最新时间安排被误读为”取消集合竞价“ 求证:就是为方便新开户的指定交易

2024-10-06 22:02

中信建投:工商业储能市场国内外发展迅速

2024-10-06 21:39

中国学者实现跨越7公里的分布式光量子计算

2024-10-06 21:38

中信建投:AIC将成为科技金融市场中重要的耐心资本

2024-10-06 21:37

十一返程客流7日将达峰值,部分航线十一返程中转机票比高铁票便宜

2024-10-06 21:27

国庆假期接近尾声,南铁返程客流持续增长将迎最高峰

2024-10-06 21:06

2024年中网门票总收入超8000万,创中网门票销售历史新高

2024-10-06 21:05

浙江宁波:购首套房公积金最高贷款额度提高至130万元

2024-10-06 20:58

宁波:首次申请住房公积金贷款最高额度由100万元/户提高至130万元/户

2024-10-06 20:49

美军基地“毒废水”疑外泄,东京都政府被瞒月余

2024-10-06 20:41

上交所发布延长接受指定交易申报指令时间的通知

2024-10-06 20:34

印度与美国签协议加强电池关键矿物供应链

2024-10-06 20:09

10月6日新闻联播速览17条

2024-10-06 19:47

2024国庆档电影票房破20亿

2024-10-06 19:42

下周(10月7日-13日)市场大事预告

2024-10-06 19:21

陈茂波:香港将引进新一批重点企业

2024-10-06 18:37

2024国庆档电影票房破19亿

2024-10-06 18:24

开平热度大涨220% ,“宝藏小城”为年轻人十一的出游新目标

2024-10-06 18:12

自然资源部与中国气象局10月6日18时联合发布地质灾害气象风险预警

扫描下载App