modelzoo.transformers.data_processing.scripts.chunk_preprocessing#
This module implements a generic data preprocessor called ChunkDataPreprocessor. |
|
Script to generate an HDF5 dataset for GPT Models. |
|
This module contains helper functions and classes to read data from different formats, process them, and save in HDF5 format. |
|
FIMTokenGenerator Module |
|
LMDataTokenGenerator Module |
|
SummarizationTokenGenerator Module |
|