NeMo Common Collection API
The common collection contains components that can be used across all NeMo collections.
- Callbacks
- Losses
- Metrics
- Tokenizers
  - AutoTokenizer
    - AutoTokenizer.__init__()
    - AutoTokenizer.add_special_tokens()
    - AutoTokenizer.additional_special_tokens_ids
    - AutoTokenizer.apply_chat_template()
    - AutoTokenizer.bos_id
    - AutoTokenizer.cls_id
    - AutoTokenizer.eod
    - AutoTokenizer.eos_id
    - AutoTokenizer.ids_to_text()
    - AutoTokenizer.ids_to_tokens()
    - AutoTokenizer.inv_vocab
    - AutoTokenizer.mask_id
    - AutoTokenizer.name
    - AutoTokenizer.pad_id
    - AutoTokenizer.save_pretrained()
    - AutoTokenizer.save_vocabulary()
    - AutoTokenizer.sep_id
    - AutoTokenizer.text_to_ids()
    - AutoTokenizer.text_to_tokens()
    - AutoTokenizer.token_to_id()
    - AutoTokenizer.tokens_to_ids()
    - AutoTokenizer.tokens_to_text()
    - AutoTokenizer.unk_id
    - AutoTokenizer.vocab
    - AutoTokenizer.vocab_size
  - SentencePieceTokenizer
    - SentencePieceTokenizer.__init__()
    - SentencePieceTokenizer.add_special_tokens()
    - SentencePieceTokenizer.additional_special_tokens_ids
    - SentencePieceTokenizer.bos_id
    - SentencePieceTokenizer.cls_id
    - SentencePieceTokenizer.eos_id
    - SentencePieceTokenizer.ids_to_text()
    - SentencePieceTokenizer.ids_to_tokens()
    - SentencePieceTokenizer.mask_id
    - SentencePieceTokenizer.pad_id
    - SentencePieceTokenizer.sep_id
    - SentencePieceTokenizer.text_to_ids()
    - SentencePieceTokenizer.text_to_tokens()
    - SentencePieceTokenizer.token_to_id()
    - SentencePieceTokenizer.tokens_to_ids()
    - SentencePieceTokenizer.tokens_to_text()
    - SentencePieceTokenizer.unk_id
    - SentencePieceTokenizer.vocab
  - TokenizerSpec
    - TokenizerSpec.add_special_tokens()
    - TokenizerSpec.apply_chat_template()
    - TokenizerSpec.bos
    - TokenizerSpec.cls
    - TokenizerSpec.eod
    - TokenizerSpec.eos
    - TokenizerSpec.ids_to_text()
    - TokenizerSpec.ids_to_tokens()
    - TokenizerSpec.mask
    - TokenizerSpec.name
    - TokenizerSpec.pad
    - TokenizerSpec.sep
    - TokenizerSpec.text_to_ids()
    - TokenizerSpec.text_to_tokens()
    - TokenizerSpec.tokens_to_ids()
    - TokenizerSpec.tokens_to_text()
    - TokenizerSpec.unique_identifiers
- Data
- S3 Checkpointing
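
The tokenizer entries listed above share a family of conversion methods (text_to_tokens(), tokens_to_ids(), ids_to_text(), and so on). As a rough illustration of how those methods relate to each other, here is a minimal sketch of a toy whitespace tokenizer that reuses the same method names. It does not import NeMo, and the class name `WhitespaceTokenizer`, the `<unk>` token, the constructor argument, and all signatures are assumptions made for this example rather than the library's actual interface.

```python
from typing import Dict, List


class WhitespaceTokenizer:
    """Hypothetical toy tokenizer; only the method names mirror the listing above."""

    def __init__(self, corpus: List[str]) -> None:
        # Assumption: id 0 is reserved for unknown tokens, spelled "<unk>".
        self.unk_token = "<unk>"
        self.vocab: Dict[str, int] = {self.unk_token: 0}
        # Assign ids to the remaining words in sorted order for determinism.
        for word in sorted({w for line in corpus for w in line.split()}):
            self.vocab[word] = len(self.vocab)
        # Reverse mapping from id back to token.
        self.inv_vocab: Dict[int, str] = {i: t for t, i in self.vocab.items()}

    def text_to_tokens(self, text: str) -> List[str]:
        return text.split()

    def tokens_to_text(self, tokens: List[str]) -> str:
        return " ".join(tokens)

    def tokens_to_ids(self, tokens: List[str]) -> List[int]:
        # Unknown tokens fall back to the <unk> id.
        unk_id = self.vocab[self.unk_token]
        return [self.vocab.get(token, unk_id) for token in tokens]

    def ids_to_tokens(self, ids: List[int]) -> List[str]:
        return [self.inv_vocab[i] for i in ids]

    def text_to_ids(self, text: str) -> List[int]:
        # Composition of the two single-step conversions.
        return self.tokens_to_ids(self.text_to_tokens(text))

    def ids_to_text(self, ids: List[int]) -> str:
        return self.tokens_to_text(self.ids_to_tokens(ids))


tokenizer = WhitespaceTokenizer(["hello world"])
ids = tokenizer.text_to_ids("hello world")
round_trip = tokenizer.ids_to_text(ids)
```

In this sketch the text-level methods are simple compositions of the token-level ones, so `ids_to_text(text_to_ids(s))` returns `s` for any whitespace-normalized string whose words are all in the vocabulary. Real subword tokenizers generally do not guarantee an exact round trip.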