-+ 0.00%
-+ 0.00%
-+ 0.00%

DeepSeek published a new paper on New Year's Day proposing a new architecture called MHC. The study aims to solve the instability problems of traditional hyperconnections in large-scale model training while maintaining their significant performance gains. There are three first authors of this paper: Zhenda Xie, Yixuan Wei, and Huanqi Cao. Notably, DeepSeek founder & CEO Liang Wenfeng is also on the list of authors.

智通財經·01/01/2026 08:49:00
語音播報
DeepSeek published a new paper on New Year's Day proposing a new architecture called MHC. The study aims to solve the instability problems of traditional hyperconnections in large-scale model training while maintaining their significant performance gains. There are three first authors of this paper: Zhenda Xie, Yixuan Wei, and Huanqi Cao. Notably, DeepSeek founder & CEO Liang Wenfeng is also on the list of authors.