modelzoo.transformers.pytorch.bert.input.scripts#

create_csv

Preprocessed CSV data generator for BERT pretraining from raw text documents.

create_csv_mlm_only

Preprocessed CSV data generator for BERT pretraining from raw text documents.

create_csv_mlm_only_static_masking

Preprocessed CSV data generator for BERT pretraining from raw text documents.

create_csv_static_masking

Preprocessed CSV data generator for BERT pretraining from raw text documents.

create_hdf5_files

Script to write HDF5 files for MLM_only and MLM + NSP datasets.

parser_utils