|
||||||||||
PREV NEXT | FRAMES NO FRAMES |
Packages that use SimpleTokenizer | |
---|---|
cc.mallet.pipe | Classes for processing arbitrary data into instances. |
cc.mallet.util | Miscellaneous utilities including command line processing, math functions, lexing, logging. |
Uses of SimpleTokenizer in cc.mallet.pipe |
---|
Methods in cc.mallet.pipe that return SimpleTokenizer | |
---|---|
SimpleTokenizer |
SimpleTokenizer.deepClone()
|
Methods in cc.mallet.pipe with parameters of type SimpleTokenizer | |
---|---|
void |
FeatureDocFreqPipe.addPrunedWordsToStoplist(SimpleTokenizer tokenizer,
double docFrequencyCutoff)
Add all pruned words to the internal stoplist of a SimpleTokenizer. |
void |
FeatureCountPipe.addPrunedWordsToStoplist(SimpleTokenizer tokenizer,
int minimumCount)
Add all pruned words to the internal stoplist of a SimpleTokenizer. |
Uses of SimpleTokenizer in cc.mallet.util |
---|
Methods in cc.mallet.util with parameters of type SimpleTokenizer | |
---|---|
static void |
BulkLoader.generateStoplist(SimpleTokenizer prunedTokenizer)
Read the data from inputFile, then write all the words that do not occur pruneCount.value times or more to the pruned word file. |
static void |
BulkLoader.writeInstanceList(SimpleTokenizer prunedTokenizer)
|
|
||||||||||
PREV NEXT | FRAMES NO FRAMES |