public interface WordVectors extends Serializable
| Modifier and Type | Method and Description |
|---|---|
Map<String,Double> |
accuracy(List<String> questions)
Accuracy based on questions which are a space separated list of strings
where the first word is the query word, the next 2 words are negative,
and the last word is the predicted word to be nearest
|
String |
getUNK() |
double[] |
getWordVector(String word)
Get the word vector for a given matrix
|
org.nd4j.linalg.api.ndarray.INDArray |
getWordVectorMatrix(String word)
Get the word vector for a given matrix
|
org.nd4j.linalg.api.ndarray.INDArray |
getWordVectorMatrixNormalized(String word)
Returns the word vector divided by the norm2 of the array
|
org.nd4j.linalg.api.ndarray.INDArray |
getWordVectors(Collection<String> labels)
This method returns 2D array, where each row represents corresponding word/label
|
org.nd4j.linalg.api.ndarray.INDArray |
getWordVectorsMean(Collection<String> labels)
This method returns mean vector, built from words/labels passed in
|
boolean |
hasWord(String word)
Returns true if the model has this word in the vocab
|
int |
indexOf(String word) |
WeightLookupTable |
lookupTable()
Lookup table for the vectors
|
void |
setModelUtils(ModelUtils utils)
Specifies ModelUtils to be used to access model
|
void |
setUNK(String newUNK) |
double |
similarity(String word,
String word2)
Returns the similarity of 2 words
|
List<String> |
similarWordsInVocabTo(String word,
double accuracy)
Find all words with a similar characters
in the vocab
|
VocabCache |
vocab()
Vocab for the vectors
|
Collection<String> |
wordsNearest(Collection<String> positive,
Collection<String> negative,
int top)
Words nearest based on positive and negative words
|
Collection<String> |
wordsNearest(org.nd4j.linalg.api.ndarray.INDArray words,
int top) |
Collection<String> |
wordsNearest(String word,
int n)
Get the top n words most similar to the given word
|
Collection<String> |
wordsNearestSum(Collection<String> positive,
Collection<String> negative,
int top)
Words nearest based on positive and negative words
|
Collection<String> |
wordsNearestSum(org.nd4j.linalg.api.ndarray.INDArray words,
int top) |
Collection<String> |
wordsNearestSum(String word,
int n)
Get the top n words most similar to the given word
|
String getUNK()
void setUNK(String newUNK)
boolean hasWord(String word)
word - the word to test forCollection<String> wordsNearest(org.nd4j.linalg.api.ndarray.INDArray words, int top)
Collection<String> wordsNearestSum(org.nd4j.linalg.api.ndarray.INDArray words, int top)
Collection<String> wordsNearestSum(String word, int n)
word - the word to comparen - the n to getCollection<String> wordsNearestSum(Collection<String> positive, Collection<String> negative, int top)
positive - the positive wordsnegative - the negative wordstop - the top n wordsMap<String,Double> accuracy(List<String> questions)
questions - the questions to askint indexOf(String word)
List<String> similarWordsInVocabTo(String word, double accuracy)
word - the word to compareaccuracy - the accuracy: 0 to 1double[] getWordVector(String word)
word - the word to get the matrix fororg.nd4j.linalg.api.ndarray.INDArray getWordVectorMatrixNormalized(String word)
word - the word to get the matrix fororg.nd4j.linalg.api.ndarray.INDArray getWordVectorMatrix(String word)
word - the word to get the matrix fororg.nd4j.linalg.api.ndarray.INDArray getWordVectors(Collection<String> labels)
labels - org.nd4j.linalg.api.ndarray.INDArray getWordVectorsMean(Collection<String> labels)
labels - Collection<String> wordsNearest(Collection<String> positive, Collection<String> negative, int top)
positive - the positive wordsnegative - the negative wordstop - the top n wordsCollection<String> wordsNearest(String word, int n)
word - the word to comparen - the n to getdouble similarity(String word, String word2)
word - the first wordword2 - the second wordVocabCache vocab()
WeightLookupTable lookupTable()
void setModelUtils(ModelUtils utils)
utils - Copyright © 2017. All rights reserved.