cerebras.modelzoo.data_preparation.nlp.slimpajama.preprocessing.filter#

Functions

clean

filter_dataset

get_short_documents