|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectcc.mallet.util.BulkLoader
public class BulkLoader
This class reads through a single file, breaking each line into data and (optional) name and label fields.
| Constructor Summary | |
|---|---|
BulkLoader()
|
|
| Method Summary | |
|---|---|
static void |
generateStoplist(SimpleTokenizer prunedTokenizer)
Read the data from inputFile, then write all the words that do not occur pruneCount.value times or more to the pruned word file. |
static void |
main(java.lang.String[] args)
|
static void |
writeInstanceList(SimpleTokenizer prunedTokenizer)
|
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public BulkLoader()
| Method Detail |
|---|
public static void generateStoplist(SimpleTokenizer prunedTokenizer)
throws java.io.IOException
prunedTokenizer - the tokenizer that will be used to write instances
java.io.IOException
public static void writeInstanceList(SimpleTokenizer prunedTokenizer)
throws java.io.IOException
java.io.IOException
public static void main(java.lang.String[] args)
throws java.io.IOException
java.io.IOException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||