java.lang.Object
  cc.mallet.extract.CRFExtractor
public class CRFExtractor
Created: Oct 12, 2004
| Constructor Summary |
|---|
| CRFExtractor(CRF crf) |
| CRFExtractor(CRF crf, Pipe tokpipe) |
| CRFExtractor(CRF crf, Pipe tokpipe, TokenizationFilter filter) |
| CRFExtractor(CRF crf, Pipe tokpipe, TokenizationFilter filter, java.lang.String backgroundTag) |
| CRFExtractor(java.io.File crfFile) |
Method Summary

| Return type | Method | Description |
|---|---|---|
| Extraction | extract(InstanceList ilist) | Assumes Instance.source contains the Tokenization object. |
| Extraction | extract(java.util.Iterator<Instance> source) | Performs extraction on a set of raw documents. |
| Extraction | extract(java.lang.Object o) | Performs extraction given a raw object. |
| Extraction | extract(Tokenization spans) | Performs extraction from an object that has already been tokenized. |
| java.lang.String | getBackgroundTag() | |
| CRF | getCrf() | |
| Pipe | getFeaturePipe() | Returns the feature pipe used by this extractor. |
| Alphabet | getInputAlphabet() | Returns an alphabet of the features used by the extractor. |
| LabelAlphabet | getTargetAlphabet() | Returns an alphabet of the labels used by the extractor. |
| TokenizationFilter | getTokenizationFilter() | |
| Pipe | getTokenizationPipe() | Returns the pipe used by this extractor to tokenize the input. |
| Sequence | pipeInput(java.lang.Object input) | |
| InstanceList | pipeInstances(java.util.Iterator<Instance> source) | |
| void | setFeaturePipe(Pipe featurePipe) | |
| void | setTokenizationPipe(Pipe tokenizationPipe) | Sets the pipe used by this extractor for tokenization. |
| void | slicePipes(int num) | Transfer some Pipes from the feature pipe to the tokenization pipe. |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Constructor Detail |
---|
public CRFExtractor(CRF crf)
public CRFExtractor(java.io.File crfFile) throws java.io.IOException
Throws:
java.io.IOException
public CRFExtractor(CRF crf, Pipe tokpipe)
public CRFExtractor(CRF crf, Pipe tokpipe, TokenizationFilter filter)
public CRFExtractor(CRF crf, Pipe tokpipe, TokenizationFilter filter, java.lang.String backgroundTag)
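The four-argument constructor accepts a backgroundTag, but this page does not say how the tag is used. A common convention in sequence labeling, assumed here, is that tokens carrying the background label belong to no field, while contiguous runs of any other label form one extracted field. The sketch below illustrates that convention only. BackgroundTagSketch, its fields method, and the tag value "O" are hypothetical and are not part of MALLET.

```java
import java.util.ArrayList;
import java.util.List;

class BackgroundTagSketch {
    // Groups contiguous tokens that share a non-background label into "LABEL: text" strings.
    // Assumption: tokens labeled with backgroundTag are simply skipped.
    static List<String> fields(String[] tokens, String[] labels, String backgroundTag) {
        if (tokens.length != labels.length) {
            throw new IllegalArgumentException("tokens and labels must have the same length");
        }
        List<String> out = new ArrayList<>();
        StringBuilder text = new StringBuilder();
        String current = backgroundTag;
        for (int i = 0; i < tokens.length; i++) {
            if (!labels[i].equals(current)) {
                // The label changed: close the open field, if there is one.
                if (!current.equals(backgroundTag)) {
                    out.add(current + ": " + text);
                }
                text.setLength(0);
                current = labels[i];
            }
            if (!current.equals(backgroundTag)) {
                if (text.length() > 0) {
                    text.append(' ');
                }
                text.append(tokens[i]);
            }
        }
        // Close a field that runs to the end of the input.
        if (!current.equals(backgroundTag)) {
            out.add(current + ": " + text);
        }
        return out;
    }

    public static void main(String[] args) {
        String[] tokens = {"Call", "John", "Smith", "in", "Boston"};
        String[] labels = {"O", "PER", "PER", "O", "LOC"};
        System.out.println(fields(tokens, labels, "O"));
    }
}
```

Passing a different backgroundTag to the sketch changes which label is treated as "not a field", which is the role the constructor parameter is assumed to play.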
Method Detail |
---|
public Extraction extract(java.lang.Object o)
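Description copied from interface: Extractor
Performs extraction given a raw object.
Specified by:
extract in interface Extractor
Parameters:
o - The document to extract from (often a String).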
public Extraction extract(Tokenization spans)
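Description copied from interface: Extractor
Performs extraction from an object that has already been tokenized.
Specified by:
extract in interface Extractor
Parameters:
spans - A tokenized document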
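The extract(java.lang.Object) and extract(Tokenization) overloads differ in where the input enters the pipeline: a raw object must be tokenized first, while an already tokenized document skips that step. The sketch below shows that relationship with hypothetical stand-in types. Plain Function objects and string lists replace MALLET's Pipe, Tokenization, and Extraction, and the capitalization-based labeler is invented purely for illustration.

```java
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;
import java.util.stream.Collectors;

class TwoStageExtractSketch {
    final Function<String, List<String>> tokenizer;      // stands in for the tokenization pipe
    final Function<List<String>, List<String>> labeler;  // stands in for the feature pipe plus CRF

    TwoStageExtractSketch(Function<String, List<String>> tokenizer,
                          Function<List<String>, List<String>> labeler) {
        this.tokenizer = tokenizer;
        this.labeler = labeler;
    }

    // Raw entry point: tokenize, then delegate to the tokenized entry point.
    List<String> extractRaw(String doc) {
        return extractTokens(tokenizer.apply(doc));
    }

    // Tokenized entry point: the tokenizer is never invoked.
    List<String> extractTokens(List<String> tokens) {
        return labeler.apply(tokens);
    }

    // Toy configuration: whitespace tokenizer, and a labeler that tags capitalized tokens.
    static TwoStageExtractSketch demo() {
        return new TwoStageExtractSketch(
            doc -> Arrays.asList(doc.trim().split("\\s+")),
            toks -> toks.stream()
                .map(t -> !t.isEmpty() && Character.isUpperCase(t.charAt(0)) ? "NAME" : "O")
                .collect(Collectors.toList()));
    }

    public static void main(String[] args) {
        TwoStageExtractSketch x = demo();
        System.out.println(x.extractRaw("visit Boston today"));
        System.out.println(x.extractTokens(Arrays.asList("Ann", "runs")));
    }
}
```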
public InstanceList pipeInstances(java.util.Iterator<Instance> source)
public Extraction extract(InstanceList ilist)
public Extraction extract(java.util.Iterator<Instance> source)
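Description copied from interface: Extractor
Performs extraction on a set of raw documents.
Specified by:
extract in interface Extractor
Parameters:
source - A source of raw documents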
public TokenizationFilter getTokenizationFilter()
public java.lang.String getBackgroundTag()
public Pipe getTokenizationPipe()
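Description copied from interface: Extractor
Returns the pipe used by this extractor to tokenize the input.
Specified by:
getTokenizationPipe in interface Extractor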
public void setTokenizationPipe(Pipe tokenizationPipe)
Description copied from interface: Extractor
Sets the pipe used by this extractor for tokenization. The pipe CharSequence2TokenSequence is an example of a pipe that could be used here.
Specified by:
setTokenizationPipe in interface Extractor
public Pipe getFeaturePipe()
Description copied from interface: Extractor
Returns the feature pipe used by this extractor.
Specified by:
getFeaturePipe in interface Extractor
public void setFeaturePipe(Pipe featurePipe)
public Alphabet getInputAlphabet()
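Description copied from interface: Extractor
Returns an alphabet of the features used by the extractor.
Specified by:
getInputAlphabet in interface Extractor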
public LabelAlphabet getTargetAlphabet()
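Description copied from interface: Extractor
Returns an alphabet of the labels used by the extractor.
Specified by:
getTargetAlphabet in interface Extractor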
public CRF getCrf()
public void slicePipes(int num)
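The method summary describes slicePipes as transferring some pipes from the feature pipe to the tokenization pipe. One way to picture this is to treat each pipeline as an ordered list of stages and move the first num feature stages onto the end of the tokenization list. The sketch below does exactly that and nothing more. PipeSliceSketch and its string stage names are hypothetical stand-ins, not MALLET types, and moving stages from the front of the feature list is an assumption about the intended behavior.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

class PipeSliceSketch {
    // Hypothetical stand-ins: each pipeline is an ordered list of stage names.
    final List<String> tokenizationStages = new ArrayList<>();
    final List<String> featureStages = new ArrayList<>();

    // Moves the first num feature stages to the end of the tokenization stages.
    void slice(int num) {
        if (num < 0 || num > featureStages.size()) {
            throw new IllegalArgumentException("num out of range: " + num);
        }
        List<String> head = featureStages.subList(0, num);
        tokenizationStages.addAll(head);
        head.clear(); // clearing the subList view removes those stages from featureStages
    }

    public static void main(String[] args) {
        PipeSliceSketch s = new PipeSliceSketch();
        s.featureStages.addAll(Arrays.asList("CharsToTokens", "Lowercase", "TokensToFeatures"));
        s.slice(1);
        System.out.println("tokenization: " + s.tokenizationStages);
        System.out.println("feature:      " + s.featureStages);
    }
}
```

A split like this would matter when a model's feature pipeline begins with stages that really perform tokenization, since those stages need to run before an already tokenized document enters the pipeline.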
public Sequence pipeInput(java.lang.Object input)