DeepSeek’s latest technical paper, co-authored by the firm’s founder and CEO Liang Wenfeng, has been cited as a potential game changer in developing artificial intelligence models, as it could translate into improvements in the fundamental architecture of machine learning.
The paper’s theme of Manifold-Constrained Hyper-Connections (mHC) marks an improvement to conventional hyper-connections in residual networks (ResNet), a fundamental mechanism underlying large language models (LLMs),...
DeepSeek proposes shift in AI model development with ‘mHC’ architecture to upgrade ResNet
Published 1 hour ago
Source: scmp.com

Related Articles from scmp.com
28 minutes ago
SpaceX will move more than 4,400 satellites to a lower orbit after China cited safety risk
37 minutes ago
Swiss investigators rush to identify victims of New Year’s fire
57 minutes ago
Did a PLA stealth fighter approach a key Taiwan airbase? New video sparks debate
1 hour ago
Beijing pledges with ‘utmost sincerity’ to continue push to reunite peacefully with Taiwan
1 hour ago
Trump and Iran official exchange threats after protests turn deadly in the Islamic Republic
1 hour ago