Understanding LLMs through Statistical Learning

Title:

Speaker:

Jingzhao Zhang

Start Time:

2025-03-18 10:10

End Time:

Place:

Tencent Conference: 145 224 735 https://meeting.tencent.com/dm/gUEWEDTRtFpZ

Organization:

School of Information Science and Technology

Co-organizer:

Brief Introduction:

Statistical learning has been a foundational framework for understanding machine learning and deep learning models, offering key insights into generalization and optimization. However, the pretraining–alignment paradigm of Large Language Models (LLMs) introduces new challenges. Specifically, (a) their error rates do not fit conventional parametric or nonparametric regimes and exhibit dataset-size dependence, and (b) the training and testing tasks can differ significantly, complicating generalization. In this talk, we propose new learning frameworks to address these challenges. Our analysis highlights three key insights: the necessity of data-dependent generalization analysis, the role of sparse sequential dependence in language learning, and the importance of autoregressive compositionality in enabling LLMs to generalize to unseen tasks.
