Machine Learning at the Flatiron Institute: Clément Hongler

Name: Machine Learning at the Flatiron Institute: Clément Hongler
Start: 2025-04-01T16:00:00-04:00
End: 2025-04-01T18:00:00-04:00
Location: Ingrid Daubechies Auditorium (IDA)

Tuesday April 1, 2025

Title: Arrows of Time for Large Language Models

Abstract: Large Language Models famously predict the next token in a text. What happens if we teach them to predict the next word? It turns out that some subtle differences emerge. I will discuss some empirical and theoretical results about this, and also some (hopefully exciting) consequences and perspectives suggested by our results.

About the Speaker

Clément Hongler has worked on statistical mechanics, quantum field theory, deep learning theory, and a few other things. He enjoys talking with people from various horizons.