DeepSeek’s latest technical paper, co-authored by the firm’s founder and CEO Liang Wenfeng, has been cited as a potential game changer in developing artificial intelligence models, as it could translate into improvements in the fundamental architecture of machine learning.
The paper’s theme of Manifold-Constrained Hyper-Connections (mHC) marks an improvement to conventional hyper-connections in residual networks (ResNet), a fundamental mechanism underlying large language models (LLMs),...
DeepSeek proposes shift in AI model development with ‘mHC’ architecture to upgrade ResNet
Published 4 hours ago
Source: scmp.com

Related Articles from scmp.com
14 minutes ago
2 cats found at Tai Po inferno site a month after Hong Kong blaze, SPCA says
31 minutes ago
Chinese public warms to US while strongly backing Beijing’s trade stance: survey
54 minutes ago
Hong Kong’s John Lee has sciatica. Here’s what you need to know about the condition
57 minutes ago
7 dead as Saudi jets strike UAE-backed separatists in ‘peaceful’ Yemen operation
1 hour ago
As the US struggles with affordability, consumer confidence eludes China
1 hour ago