cc.mallet.pipe
Class CharSequence2CharNGrams

java.lang.Object
  extended by cc.mallet.pipe.Pipe
      extended by cc.mallet.pipe.CharSequence2CharNGrams
All Implemented Interfaces:
AlphabetCarrying, java.io.Serializable

public class CharSequence2CharNGrams
extends Pipe
implements java.io.Serializable

Transform a character sequence into a token sequence of character N grams.

Author:
Andrew McCallum mccallum@cs.umass.edu
See Also:
Serialized Form

Constructor Summary
CharSequence2CharNGrams(int n, boolean distinguishBorders)
           
 
Method Summary
protected  java.lang.String[] ngramify(java.lang.CharSequence s)
           
 Instance pipe(Instance carrier)
          Really this should be 'protected', but isn't for historical reasons.
 
Methods inherited from class cc.mallet.pipe.Pipe
alphabetsMatch, getAlphabet, getAlphabets, getDataAlphabet, getInstanceId, getTargetAlphabet, instanceFrom, instancesFrom, instancesFrom, isDataAlphabetSet, isTargetProcessing, newIteratorFrom, preceedingPipeDataAlphabetNotification, preceedingPipeTargetAlphabetNotification, precondition, readResolve, setDataAlphabet, setOrCheckDataAlphabet, setOrCheckTargetAlphabet, setTargetAlphabet, setTargetProcessing
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

CharSequence2CharNGrams

public CharSequence2CharNGrams(int n,
                               boolean distinguishBorders)
Method Detail

ngramify

protected java.lang.String[] ngramify(java.lang.CharSequence s)

pipe

public Instance pipe(Instance carrier)
Description copied from class: Pipe
Really this should be 'protected', but isn't for historical reasons.

Overrides:
pipe in class Pipe