cc.mallet.share.upenn.ner
Class LengthBins

java.lang.Object
  extended by cc.mallet.pipe.Pipe
      extended by cc.mallet.share.upenn.ner.LengthBins
All Implemented Interfaces:
AlphabetCarrying, java.io.Serializable

public class LengthBins
extends Pipe
implements java.io.Serializable

A feature that approximates string length by assigning each string to a length bin.

See Also:
Serialized Form

Constructor Summary
LengthBins(java.lang.String featureName, int[] binMaxes)
          binMaxes contains the maximum string length for each bin.
 
Method Summary
 Instance pipe(Instance carrier)
          Really this should be 'protected', but isn't for historical reasons.
 
Methods inherited from class cc.mallet.pipe.Pipe
alphabetsMatch, getAlphabet, getAlphabets, getDataAlphabet, getInstanceId, getTargetAlphabet, instanceFrom, instancesFrom, instancesFrom, isDataAlphabetSet, isTargetProcessing, newIteratorFrom, preceedingPipeDataAlphabetNotification, preceedingPipeTargetAlphabetNotification, precondition, readResolve, setDataAlphabet, setOrCheckDataAlphabet, setOrCheckTargetAlphabet, setTargetAlphabet, setTargetProcessing
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

LengthBins

public LengthBins(java.lang.String featureName,
                  int[] binMaxes)

binMaxes contains the maximum string length for each bin.

For example, passing in {1,3,7} produces 4 bins: one each for strings of length 1, 2-3, 4-7, and 8 or more.
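
The bin boundaries in this example can be illustrated with a minimal, standalone sketch. It is not the MALLET implementation: the class LengthBinSketch and the helper binIndex are hypothetical names introduced here, and the sketch assumes binMaxes is sorted in ascending order and that bins are numbered from 0.

```java
// Illustrative sketch only; not MALLET's implementation of LengthBins.
final class LengthBinSketch {

    // Hypothetical helper: returns the index of the first bin whose maximum
    // is >= length, or binMaxes.length for the final "overflow" bin.
    // Assumes binMaxes is sorted in ascending order.
    // Note: in this sketch a length of 0 also lands in bin 0; how the real
    // class treats empty strings is not stated in its documentation.
    static int binIndex(int length, int[] binMaxes) {
        for (int i = 0; i < binMaxes.length; i++) {
            if (length <= binMaxes[i]) {
                return i;
            }
        }
        return binMaxes.length;
    }

    public static void main(String[] args) {
        int[] binMaxes = {1, 3, 7};
        String[] samples = {"a", "abc", "abcdefg", "abcdefgh"};
        for (String s : samples) {
            // Lengths 1, 3, 7, 8 fall into bins 0, 1, 2, 3 respectively.
            System.out.println(s + " -> bin " + binIndex(s.length(), binMaxes));
        }
    }
}
```

With three maxima the sketch distinguishes four bins, matching the example above: the last bin has no upper bound and catches every string longer than the largest maximum.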

Method Detail

pipe

public Instance pipe(Instance carrier)
Description copied from class: Pipe
Really this should be 'protected', but isn't for historical reasons.

Overrides:
pipe in class Pipe