Mesa-Optimization in Autoregressively Trained Transformers: Emergence and Capability.Published in NeurIPS, 2024Download paper hereShare on Twitter Facebook LinkedIn Previous Next