cc.mallet.share.casutton.ner
Class ConllNer2003Sentence2TokenSequence

java.lang.Object
  extended by cc.mallet.pipe.Pipe
      extended by cc.mallet.share.casutton.ner.ConllNer2003Sentence2TokenSequence
All Implemented Interfaces:
AlphabetCarrying, java.io.Serializable

public class ConllNer2003Sentence2TokenSequence
extends Pipe

Reads a data file in CoNLL 2003 format and makes some simple transformations. Unlike the version in mccallum.ner, it does not expect fields in the data file for tags and phrases if those features are off. It does not look for a target field if isTargetProcessing() is false.
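
The field handling described above can be illustrated with a small standalone sketch. It does not use MALLET and is not this class's implementation. It assumes the usual CoNLL 2003 column order (word, part-of-speech tag, chunk/phrase tag, entity label) and whitespace-separated fields. The names ConllSentenceSketch, Token, and parse are hypothetical and exist only for this illustration.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative sketch only: NOT the MALLET implementation.
// Assumes columns appear in the order: word [tag] [phrase] [target].
public class ConllSentenceSketch {

    /** One parsed token; fields that were not read stay null. */
    static final class Token {
        final String word;
        final String tag;
        final String phrase;
        final String target;

        Token(String word, String tag, String phrase, String target) {
            this.word = word;
            this.tag = tag;
            this.phrase = phrase;
            this.target = target;
        }
    }

    /**
     * Parses one sentence (one token per line). A column is expected in the
     * input only when the corresponding flag is true, mirroring the idea that
     * tag and phrase fields may be absent when those features are off and that
     * the target field is not looked for when target processing is off.
     */
    static List<Token> parse(String sentence, boolean useTags,
                             boolean usePhrases, boolean targetProcessing) {
        int expected = 1 + (useTags ? 1 : 0) + (usePhrases ? 1 : 0)
                         + (targetProcessing ? 1 : 0);
        List<Token> tokens = new ArrayList<>();
        for (String rawLine : sentence.split("\n")) {
            String line = rawLine.trim();
            if (line.isEmpty()) {
                continue; // skip blank lines inside the sentence block
            }
            String[] fields = line.split("\\s+");
            if (fields.length < expected) {
                throw new IllegalArgumentException(
                    "Expected at least " + expected + " fields: " + line);
            }
            int i = 0;
            String word = fields[i++];
            String tag = useTags ? fields[i++] : null;
            String phrase = usePhrases ? fields[i++] : null;
            String target = targetProcessing ? fields[i++] : null;
            tokens.add(new Token(word, tag, phrase, target));
        }
        return tokens;
    }

    public static void main(String[] args) {
        // All four columns present.
        String full = "EU NNP I-NP I-ORG\nrejects VBZ I-VP O";
        for (Token t : parse(full, true, true, true)) {
            System.out.println(t.word + " " + t.tag + " " + t.phrase + " " + t.target);
        }
        // Tags and phrases off: the input carries only word and label.
        String slim = "EU I-ORG\nrejects O";
        for (Token t : parse(slim, false, false, true)) {
            System.out.println(t.word + " " + t.target);
        }
    }
}
```

In this sketch, unlabeled input with one word per line would be parsed with all three flags set to false. How the real class treats document-boundary markers or extra trailing fields is not covered here.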

See Also:
Serialized Form

Constructor Summary
ConllNer2003Sentence2TokenSequence()
           
ConllNer2003Sentence2TokenSequence(boolean useTags, boolean usePhrases)
           
 
Method Summary
 Instance pipe(Instance carrier)
          Really this should be 'protected', but isn't for historical reasons.
 
Methods inherited from class cc.mallet.pipe.Pipe
alphabetsMatch, getAlphabet, getAlphabets, getDataAlphabet, getInstanceId, getTargetAlphabet, instanceFrom, instancesFrom, instancesFrom, isDataAlphabetSet, isTargetProcessing, newIteratorFrom, preceedingPipeDataAlphabetNotification, preceedingPipeTargetAlphabetNotification, precondition, readResolve, setDataAlphabet, setOrCheckDataAlphabet, setOrCheckTargetAlphabet, setTargetAlphabet, setTargetProcessing
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

ConllNer2003Sentence2TokenSequence

public ConllNer2003Sentence2TokenSequence()

ConllNer2003Sentence2TokenSequence

public ConllNer2003Sentence2TokenSequence(boolean useTags,
                                          boolean usePhrases)
Method Detail

pipe

public Instance pipe(Instance carrier)
Description copied from class: Pipe
Really this should be 'protected', but isn't for historical reasons.

Overrides:
pipe in class Pipe