convert_dataset_to_HDF5
create_hdf5_dataset
Script that generates a dataset in HDF5 format for GPT models.
hdf5_base_preprocessor
hdf5_curation_corpus_preprocessor
hdf5_dataset_preprocessors
hdf5_nlg_preprocessor
utils