A neural network architecture that uses attention mechanisms, fundamental to modern language models like GPT and Claude. Revolutionized natural language processing.