There's so much to be grateful for, words are poor things.
Marilynne Robinson
Trill News
STEM

How Large Language Models Develop Unexpected Skills

Via Topics in Cognitive Science

Summary

Large language models (LLMs) have been observed to develop unexpected and emergent capabilities that were not explicitly programmed and did not appear in smaller versions of the same architectures. These emergent abilities — including in-context learning, multi-step arithmetic reasoning, instruction following, and rudimentary theory of mind — tend to appear suddenly at certain scales of model size, a phenomenon researchers have described as a phase transition rather than a gradual improvement.

The unpredictability of these emergent skills poses both scientific and safety challenges. Because researchers cannot reliably forecast which new capabilities will appear at what scale, models may develop unintended or potentially dangerous behaviors without warning. The debate over whether these abilities represent genuine reasoning or sophisticated pattern matching remains unresolved, but the implications — for education, security, and AI governance — are already reshaping how developers and policymakers think about increasingly powerful language models.

FIND A BOOK ON BOOKSHOP.ORG