W3AI - WikiText

WikiText

The WikiText language modeling dataset is a collection of over 100 million tokens extracted from the set of verified Good and Featured articles on Wikipedia.

cc-by-sa-4.0

1M<n<10M

Text Generation

Fill-Mask

English

by @AIOZNetwork

•

Last updated: 4 months ago

Details

Files

Discussions

Total downloads

Created: July 12, 2024