| Modifier and Type | Field and Description |
|---|---|
| protected TokenizerFactory | BagOfWordsVectorizer.Builder.tokenizerFactory |
| protected TokenizerFactory | TfidfVectorizer.Builder.tokenizerFactory |
| protected TokenizerFactory | BaseTextVectorizer.tokenizerFactory |
| protected TokenizerFactory | Builder.tokenizerFactory: Deprecated. |

| Modifier and Type | Method and Description |
|---|---|
| BagOfWordsVectorizer.Builder | BagOfWordsVectorizer.Builder.setTokenizerFactory(TokenizerFactory tokenizerFactory) |
| TfidfVectorizer.Builder | TfidfVectorizer.Builder.setTokenizerFactory(TokenizerFactory tokenizerFactory) |
| Builder | Builder.tokenize(TokenizerFactory tokenizerFactory): Deprecated. |

| Modifier and Type | Method and Description |
|---|---|
| CnnSentenceDataSetIterator.Builder | CnnSentenceDataSetIterator.Builder.tokenizerFactory(TokenizerFactory tokenizerFactory): The TokenizerFactory that should be used. |

| Modifier and Type | Method and Description |
|---|---|
| protected static TokenizerFactory | WordVectorSerializer.getTokenizerFactory(VectorsConfiguration configuration) |

| Modifier and Type | Field and Description |
|---|---|
| protected TokenizerFactory | Glove.Builder.tokenFactory |

| Modifier and Type | Method and Description |
|---|---|
| Glove.Builder | Glove.Builder.tokenizerFactory(TokenizerFactory tokenizerFactory): Sets the TokenizerFactory to be used for training. |

| Modifier and Type | Method and Description |
|---|---|
| ParagraphVectors.Builder | ParagraphVectors.Builder.tokenizerFactory(TokenizerFactory tokenizerFactory): Sets the TokenizerFactory used to tokenize strings during training. Note: if an external VocabCache is used, the same TokenizerFactory should be used so that the derived tokens stay identical. |

| Constructor and Description |
|---|
| BlindInferenceCallable(VocabCache<VocabWord> vocabCache, TokenizerFactory tokenizerFactory, String document) |
| BlindInferenceCallable(VocabCache<VocabWord> vocabCache, TokenizerFactory tokenizerFactory, String document, AtomicLong flag) |
| InferenceCallable(VocabCache<VocabWord> vocabCache, TokenizerFactory tokenizerFactory, LabelledDocument document) |
| InferenceCallable(VocabCache<VocabWord> vocabCache, TokenizerFactory tokenizerFactory, LabelledDocument document, AtomicLong flag) |

| Modifier and Type | Field and Description |
|---|---|
| protected TokenizerFactory | SentenceTransformer.tokenizerFactory |
| protected TokenizerFactory | SentenceTransformer.Builder.tokenizerFactory |

| Modifier and Type | Method and Description |
|---|---|
| SentenceTransformer.Builder | SentenceTransformer.Builder.tokenizerFactory(TokenizerFactory tokenizerFactory) |

| Modifier and Type | Field and Description |
|---|---|
| protected TokenizerFactory | Word2Vec.tokenizerFactory |
| protected TokenizerFactory | Word2Vec.Builder.tokenizerFactory |

| Modifier and Type | Method and Description |
|---|---|
| void | Word2Vec.setTokenizerFactory(TokenizerFactory tokenizerFactory): Sets the TokenizerFactory instance to be used during model building. |
| Word2Vec.Builder | Word2Vec.Builder.tokenizerFactory(TokenizerFactory tokenizerFactory): Sets the TokenizerFactory used to tokenize strings during training. Note: if an external VocabCache is used, the same TokenizerFactory should be used so that the derived tokens stay identical. |
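
The note on `Word2Vec.Builder.tokenizerFactory` (and on `ParagraphVectors.Builder.tokenizerFactory`) about reusing the same TokenizerFactory with an external VocabCache can be illustrated without the library. The sketch below is plain Java and does not use the Deeplearning4j API: the class `TokenizerConsistencySketch`, both tokenizer functions, and the `Set<String>` standing in for a vocabulary cache are hypothetical names chosen for illustration. It shows that tokens produced by a different tokenizer may simply not be found in a vocabulary built with another one.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;
import java.util.function.Function;

// Hypothetical illustration only; not part of Deeplearning4j.
final class TokenizerConsistencySketch {

    // Tokenizer A (assumption): split on whitespace, keep case and punctuation.
    static List<String> whitespaceTokens(String text) {
        return Arrays.asList(text.trim().split("\\s+"));
    }

    // Tokenizer B (assumption): lower-case each token and drop non-alphanumerics.
    static List<String> normalizedTokens(String text) {
        List<String> out = new ArrayList<>();
        for (String raw : text.trim().split("\\s+")) {
            String token = raw.toLowerCase(Locale.ROOT).replaceAll("[^a-z0-9]", "");
            if (!token.isEmpty()) {
                out.add(token);
            }
        }
        return out;
    }

    // Stand-in for an external vocabulary: the set of tokens seen in the corpus.
    static Set<String> buildVocab(String corpus, Function<String, List<String>> tokenizer) {
        return new HashSet<>(tokenizer.apply(corpus));
    }

    // Counts how many tokens of the text are present in the vocabulary.
    static long countKnown(String text, Function<String, List<String>> tokenizer, Set<String> vocab) {
        return tokenizer.apply(text).stream().filter(vocab::contains).count();
    }

    public static void main(String[] args) {
        Set<String> vocab = buildVocab("Hello, World! Hello again.",
                TokenizerConsistencySketch::normalizedTokens);

        // Same tokenizer as the vocabulary: both tokens are found. Prints 2.
        System.out.println(countKnown("Hello, World!",
                TokenizerConsistencySketch::normalizedTokens, vocab));

        // Different tokenizer: "Hello," and "World!" are not in the vocabulary. Prints 0.
        System.out.println(countKnown("Hello, World!",
                TokenizerConsistencySketch::whitespaceTokens, vocab));
    }
}
```

In this sketch the mismatch makes every token unknown; with a real model the effect would depend on the tokenizers and preprocessors involved.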

| Modifier and Type | Method and Description |
|---|---|
| static org.nd4j.linalg.primitives.Pair<String,MultiDimensionalMap<Integer,Integer,String>> | ContextLabelRetriever.stringWithLabels(String sentence, TokenizerFactory tokenizerFactory): Returns a stripped sentence with the indices of words with certain kinds of labels. |
| static List<Window> | Windows.windows(InputStream words, TokenizerFactory tokenizerFactory, int windowSize): Constructs a list of windows of size windowSize. |
| static List<Window> | Windows.windows(String words, TokenizerFactory tokenizerFactory): Constructs a list of windows of size windowSize. |
| static List<Window> | Windows.windows(String words, TokenizerFactory tokenizerFactory, int windowSize, WordVectors vectors): Constructs a list of windows of size windowSize. |
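
To make "a list of windows of size windowSize" concrete, here is a minimal plain-Java sketch of fixed-size sliding windows over a token list. It is not the `Windows.windows` implementation: the class `WindowSketch`, the requirement that the size be odd, and the `<s>` / `</s>` padding markers at the sentence edges are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not part of Deeplearning4j.
final class WindowSketch {

    // Builds one window per token, centered on that token.
    // Assumption: positions outside the sentence are padded with "<s>" / "</s>".
    static List<List<String>> windows(List<String> tokens, int windowSize) {
        if (windowSize < 1 || windowSize % 2 == 0) {
            throw new IllegalArgumentException("windowSize must be a positive odd number");
        }
        int half = windowSize / 2;
        List<List<String>> out = new ArrayList<>();
        for (int center = 0; center < tokens.size(); center++) {
            List<String> window = new ArrayList<>(windowSize);
            for (int i = center - half; i <= center + half; i++) {
                if (i < 0) {
                    window.add("<s>");      // padding before the first token
                } else if (i >= tokens.size()) {
                    window.add("</s>");     // padding after the last token
                } else {
                    window.add(tokens.get(i));
                }
            }
            out.add(window);
        }
        return out;
    }

    public static void main(String[] args) {
        // Prints [[<s>, a, b], [a, b, c], [b, c, </s>]]
        System.out.println(windows(Arrays.asList("a", "b", "c"), 3));
    }
}
```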

| Modifier and Type | Class and Description |
|---|---|
| class | DefaultTokenizerFactory: Default tokenizer based on string tokenizer or stream tokenizer |
| class | NGramTokenizerFactory |

| Constructor and Description |
|---|
| NGramTokenizerFactory(TokenizerFactory tokenizerFactory, Integer minN, Integer maxN) |
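
The `NGramTokenizerFactory` constructor takes a base TokenizerFactory plus `minN` and `maxN` bounds. As a rough idea of what n-gram tokenization over base tokens means, the plain-Java sketch below joins every run of n consecutive tokens for each n between the two bounds. It is an assumption-laden illustration, not the library's code: the class `NGramSketch`, the single-space separator, and the ordering of the output are hypothetical, and the actual factory may differ in all three.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical illustration only; not part of Deeplearning4j.
final class NGramSketch {

    // Returns all n-grams of the base tokens for n in [minN, maxN],
    // shortest n first, each n-gram joined with a single space.
    static List<String> ngrams(List<String> tokens, int minN, int maxN) {
        if (minN < 1 || maxN < minN) {
            throw new IllegalArgumentException("require 1 <= minN <= maxN");
        }
        List<String> out = new ArrayList<>();
        for (int n = minN; n <= maxN; n++) {
            for (int start = 0; start + n <= tokens.size(); start++) {
                out.add(String.join(" ", tokens.subList(start, start + n)));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Prints [the, quick, fox, the quick, quick fox]
        System.out.println(ngrams(Arrays.asList("the", "quick", "fox"), 1, 2));
    }
}
```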
Copyright © 2017. All rights reserved.