Generative modeling with sparse transformers
OpenAI Blog
April 23, 2019
We’ve developed the Sparse Transformer, a deep neural network which sets new records at predicting what comes next in a sequence—whether text, images, or sound. It uses an algorithmic improvement of the attention mechanism to extract patterns from sequences 30x longer than possible previously.
Verticals
airesearch
Originally published on OpenAI Blog on 4/23/2019