DeepSeek Founder Unveils New Training Method Aimed at Overcoming GPU Memory Limits
A new research paper co-authored by Liang Wenfeng, founder of Chinese artificial intelligence start-up DeepSeek, is drawing attention within the global AI community for proposing an alternative approach to training large models under hardware constraints. The paper, produced in collaboration with researchers from Peking University, outlines a technique designed to bypass graphics processing unit memory limits […]
