java.lang.Object
  edu.umass.cs.mallet.base.extract.CRFExtractor
Created: Oct 12, 2004
| Constructor Summary | |
CRFExtractor(CRF4 crf)
|
|
CRFExtractor(CRF4 crf,
Pipe tokpipe)
|
|
CRFExtractor(CRF4 crf,
Pipe tokpipe,
TokenizationFilter filter)
|
|
CRFExtractor(CRF4 crf,
Pipe tokpipe,
TokenizationFilter filter,
java.lang.String backgroundTag)
|
|
CRFExtractor(java.io.File crfFile)
|
|
| Method Summary | |
Extraction |
extract(java.lang.Object o)
Performs extraction given a raw object. |
Extraction |
extract(PipeInputIterator source)
Performs extraction on a set of raw documents. |
Extraction |
extract(Tokenization spans)
Performs extraction from an object that has already been tokenized. |
CRF4 |
getCrf()
|
Pipe |
getFeaturePipe()
Returns the pipe used by this extractor for feature extraction. |
Alphabet |
getInputAlphabet()
Returns an alphabet of the features used by the extractor. |
LabelAlphabet |
getTargetAlphabet()
Returns an alphabet of the labels used by the extractor. |
Pipe |
getTokenizationPipe()
Returns the pipe used by this extractor to tokenize the input. |
Sequence |
pipeInput(java.lang.Object input)
|
InstanceList |
pipeInstances(PipeInputIterator source)
|
void |
setFeaturePipe(Pipe featurePipe)
|
void |
setTokenizationPipe(Pipe tokenizationPipe)
Sets the pipe used by this extractor for tokenization. |
void |
slicePipes(int num)
Transfer some Pipes from the feature pipe to the tokenization pipe. |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
public CRFExtractor(CRF4 crf)
public CRFExtractor(java.io.File crfFile)
throws java.io.IOException
public CRFExtractor(CRF4 crf,
Pipe tokpipe)
public CRFExtractor(CRF4 crf,
Pipe tokpipe,
TokenizationFilter filter)
public CRFExtractor(CRF4 crf,
Pipe tokpipe,
TokenizationFilter filter,
java.lang.String backgroundTag)
| Method Detail |
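public Extraction extract(java.lang.Object o)
Description copied from interface: Extractor
Performs extraction given a raw object.
Specified by: extract in interface Extractor
Parameters: o - The document to extract from (often a String).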
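public Extraction extract(Tokenization spans)
Description copied from interface: Extractor
Performs extraction from an object that has already been tokenized.
Specified by: extract in interface Extractor
Parameters: spans - A tokenized document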
public InstanceList pipeInstances(PipeInputIterator source)
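public Extraction extract(PipeInputIterator source)
Description copied from interface: Extractor
Performs extraction on a set of raw documents.
Specified by: extract in interface Extractor
Parameters: source - A source of raw documents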
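public Pipe getTokenizationPipe()
Description copied from interface: Extractor
Returns the pipe used by this extractor to tokenize the input.
Specified by: getTokenizationPipe in interface Extractor
public void setTokenizationPipe(Pipe tokenizationPipe)
Description copied from interface: Extractor
Sets the pipe used by this extractor for tokenization. The pipe edu.umass.cs.mallet.base.pipe.CharSequence2TokenSequence is an example of a pipe that could be used here.
Specified by: setTokenizationPipe in interface Extractor
public Pipe getFeaturePipe()
Description copied from interface: Extractor
Returns the pipe used by this extractor for feature extraction.
Specified by: getFeaturePipe in interface Extractor
public void setFeaturePipe(Pipe featurePipe)
public Alphabet getInputAlphabet()
Description copied from interface: Extractor
Returns an alphabet of the features used by the extractor.
Specified by: getInputAlphabet in interface Extractor
public LabelAlphabet getTargetAlphabet()
Description copied from interface: Extractor
Returns an alphabet of the labels used by the extractor.
Specified by: getTargetAlphabet in interface Extractor
public CRF4 getCrf()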
public void slicePipes(int num)
public Sequence pipeInput(java.lang.Object input)
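The slicePipes(int num) entry says only that it transfers some pipes from the feature pipe to the tokenization pipe. The following is a minimal, self-contained sketch of one plausible reading: the first num stages of the feature pipe are moved, in order, onto the tokenization pipe. It does not use MALLET. The class PipeSliceSketch, its fields, and the stage names are hypothetical stand-ins, and whether the real method appends to or replaces the existing tokenization pipe is not stated on this page.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical model, not MALLET code: each pipe is an ordered list of named stages.
class PipeSliceSketch {
    final List<String> tokenizationPipe = new ArrayList<>();
    final List<String> featurePipe = new ArrayList<>();

    PipeSliceSketch(List<String> featureStages) {
        featurePipe.addAll(featureStages);
    }

    // Assumed behavior: move the first `num` feature stages to the end of
    // the tokenization pipe, preserving their order.
    void slicePipes(int num) {
        if (num < 0 || num > featurePipe.size()) {
            throw new IllegalArgumentException("num out of range: " + num);
        }
        List<String> head = featurePipe.subList(0, num); // view onto featurePipe
        tokenizationPipe.addAll(head);                   // copy the stages across
        head.clear();                                    // then drop them from featurePipe
    }

    public static void main(String[] args) {
        // Stage names are illustrative only.
        PipeSliceSketch ex = new PipeSliceSketch(
                Arrays.asList("tokenize", "lowercase", "featurize"));
        ex.slicePipes(2);
        System.out.println("tokenization: " + ex.tokenizationPipe);
        System.out.println("feature:      " + ex.featurePipe);
    }
}
```

Under this reading, slicing is useful when a single trained pipe bundles tokenization and feature steps together and the extractor needs them separated so that extract(Tokenization spans) can start from already-tokenized input.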