February 6, 2024
2024
We preprint a paper: Superiority of Multi-Head Attention in In-Context Linear Regression