Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability.

Published in NeurIPS, 2024
