News and In-Depth Articles
News coverage | Venture Beat
Microsoft’s Differential Transformer cancels attention noise in LLMs
Improving LLMs’ ability to retrieve in-p…
News coverage | IEEE Spectrum
1-bit LLMs Could Solve AI’s Energy Demands
“Imprecise” language models are smaller,…
| Zinan Lin, Jinyu Li, Bhaskar Mitra, Siân Lindley, Liang Wang, Nan Yang, and Furu Wei
Mixture-of-linear-experts for long-term …