WikiText
The WikiText language modeling dataset is a collection of over 100 million tokens extracted from the set of verified Good and Featured articles on Wikipedia.
cc-by-sa-4.0
1M<n<10M
Text Generation
Fill-Mask
English
Total downloads
2
Created: July 12, 2024