Introduction
Get Started
Prepare your data
Cerebras PyTorch API
cerebras_pytorch
cerebras_pytorch.amp
cerebras_pytorch.optim
cerebras_pytorch.sparse
cerebras_pytorch.metrics
Cerebras Model Zoo
Cerebras Guides
Fundamentals
Support
Get urls for downloading files for tokenization.
A dictionary containing urls for original GPT2 tokenizaiton and GPT-NeoX tokenization schemes
previous
modelzoo.transformers.data_processing.scripts.pile.download.download_tokenizer_files
next
modelzoo.transformers.data_processing.scripts.pile.download.get_urls_from_split