Introduction
Get Started
Prepare your data
Cerebras PyTorch API
cerebras_pytorch
cerebras_pytorch.amp
cerebras_pytorch.optim
cerebras_pytorch.sparse
cerebras_pytorch.metrics
Cerebras Model Zoo
Cerebras Guides
Fundamentals
Support
Get urls given split of dataset.
split (str) – Split of dataset to get urls for.
List of urls, containing jsonl.zst file names for downloading.
previous
modelzoo.transformers.data_processing.scripts.pile.download.get_urls_for_tokenizer_files
next
modelzoo.transformers.data_processing.scripts.pile.download.main