Class CharSequenceRemoveUUEncodedBlocks

  extended by cc.mallet.pipe.Pipe
      extended by cc.mallet.pipe.CharSequenceRemoveUUEncodedBlocks
All Implemented Interfaces:

public class CharSequenceRemoveUUEncodedBlocks
extends Pipe

See Also:
Serialized Form

Field Summary
static java.util.regex.Pattern UU_ENCODED_LINE
          Given a string, remove lines that begin with M and are 61 characters long.
Constructor Summary
Method Summary
 Instance pipe(Instance carrier)
          Really this should be 'protected', but isn't for historical reasons.
Methods inherited from class cc.mallet.pipe.Pipe
alphabetsMatch, getAlphabet, getAlphabets, getDataAlphabet, getInstanceId, getTargetAlphabet, instanceFrom, instancesFrom, instancesFrom, isDataAlphabetSet, isTargetProcessing, newIteratorFrom, preceedingPipeDataAlphabetNotification, preceedingPipeTargetAlphabetNotification, precondition, readResolve, setDataAlphabet, setOrCheckDataAlphabet, setOrCheckTargetAlphabet, setTargetAlphabet, setTargetProcessing
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Field Detail


public static final java.util.regex.Pattern UU_ENCODED_LINE
Given a string, remove lines that begin with M and are 61 characters long. Note that there are some UUEncoded blocks that do not match this. I have seen some that are 64 characters long, and have no regular prefix character, but this filter gets most of them in 20 Newsgroups.

Constructor Detail


public CharSequenceRemoveUUEncodedBlocks()
Method Detail


public Instance pipe(Instance carrier)
Description copied from class: Pipe
Really this should be 'protected', but isn't for historical reasons.

pipe in class Pipe