Vocabularies

ffp.vocab

ffp.vocab.load_vocab(file)

Load a vocabulary from a finalfusion file.

ffp.vocab.subword.load_finalfusion_bucket_vocab(file)

Load a FinalfusionBucketVocab from the given finalfusion file.

ffp.vocab.subword.load_fasttext_vocab(file)

Load a FastTextVocab from the given finalfusion file.

ffp.vocab.subword.load_explicit_vocab(file)

Load a ExplicitVocab from the given finalfusion file.

ffp.vocab.simple_vocab.load_simple_vocab(file)

Load a SimpleVocab from the given finalfusion file.

ffp.vocab.vocab.Vocab

Finalfusion vocabulary interface.

ffp.vocab.simple_vocab.SimpleVocab(words[, …])

Simple vocabulary.

ffp.vocab.subword.SubwordVocab

Interface for vocabularies with subword lookups.

ffp.vocab.subword.FinalfusionBucketVocab(words)

Finalfusion Bucket Vocabulary.

ffp.vocab.subword.FastTextVocab(words[, …])

FastText vocabulary

ffp.vocab.subword.ExplicitVocab(words, indexer)

A vocabulary with explicitly stored n-grams.

ffp.vocab.cutoff.Cutoff(cutoff[, mode])

Frequency Cutoff

ffp.vocab.load_vocab(file: Union[str, bytes, int, os.PathLike])ffp.vocab.vocab.Vocab[source]

Load a vocabulary from a finalfusion file.

Loads the first known vocabulary from a finalfusion file.

Parameters

file (str, bytes, int, PathLike) – Path to file containing a finalfusion vocab chunk.

Returns

vocab – First Vocab in the file.

Return type

SimpleVocab, FastTextVocab, FinalfusionBucketVocab, ExplicitVocab

Raises

ValueError – If the file did not contain a vocabulary.