Large language models can do jaw-dropping things. But nobody knows exactly why.

https://www.technologyreview.com/2024/03/04/1089403/large-language-models-amazing-but-nobody-knows-why

Abstract

The largest models, and large language models in particular, seem to behave in ways textbook math says they shouldn’t. This highlights a remarkable fact about deep learning, the fundamental technology behind today’s AI boom: for all its runaway success, nobody knows exactly how—or why—it works.

Source

Large language models can do jaw-dropping things. But nobody knows exactly why. (2024, March 4). MIT Technology Review. https://www.technologyreview.com/2024/03/04/1089403/large-language-models-amazing-but-nobody-knows-why/