cc.mallet.pipe
Class CharSequenceRemoveUUEncodedBlocks

java.lang.Object
  extended by cc.mallet.pipe.Pipe
      extended by cc.mallet.pipe.CharSequenceRemoveUUEncodedBlocks
All Implemented Interfaces:
AlphabetCarrying, java.io.Serializable

public class CharSequenceRemoveUUEncodedBlocks
extends Pipe

See Also:
Serialized Form

Field Summary
static java.util.regex.Pattern UU_ENCODED_LINE
          Matches lines that begin with M and are 61 characters long, which this pipe removes from the input string.
 
Constructor Summary
CharSequenceRemoveUUEncodedBlocks()
           
 
Method Summary
 Instance pipe(Instance carrier)
          This method should really be 'protected', but it is public for historical reasons.
 
Methods inherited from class cc.mallet.pipe.Pipe
alphabetsMatch, getAlphabet, getAlphabets, getDataAlphabet, getInstanceId, getTargetAlphabet, instanceFrom, instancesFrom, instancesFrom, isDataAlphabetSet, isTargetProcessing, newIteratorFrom, preceedingPipeDataAlphabetNotification, preceedingPipeTargetAlphabetNotification, precondition, readResolve, setDataAlphabet, setOrCheckDataAlphabet, setOrCheckTargetAlphabet, setTargetAlphabet, setTargetProcessing
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Field Detail

UU_ENCODED_LINE

public static final java.util.regex.Pattern UU_ENCODED_LINE
Matches lines that begin with M and are 61 characters long, which this pipe removes from the input string. (A full uuencoded line carries 45 bytes: the length character M followed by 60 encoded characters.) Not every UUEncoded block matches this pattern: some have lines that are 64 characters long with no regular prefix character. Even so, this filter catches most of the UUEncoded blocks in 20 Newsgroups.
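
The matching rule described above can be sketched with plain java.util.regex. This is a minimal illustration, not the MALLET source: the class name UuLineFilterSketch, the helper methods, and the regular expression itself are assumptions, since the actual value of UU_ENCODED_LINE is not shown on this page.

```java
import java.util.regex.Pattern;

// Hypothetical stand-in for the filtering step; not part of MALLET.
class UuLineFilterSketch {

    // Assumed pattern: 'M' followed by exactly 60 more characters on one line.
    // MULTILINE lets ^ and $ match at the start and end of each line.
    public static final Pattern UU_LINE_SKETCH =
            Pattern.compile("^M.{60}$", Pattern.MULTILINE);

    // Returns the text with every matching line's characters removed.
    public static String removeUuLines(CharSequence text) {
        return UU_LINE_SKETCH.matcher(text).replaceAll("");
    }

    // Builds a string of n copies of c (avoids String.repeat, which needs Java 11).
    public static String repeat(char c, int n) {
        StringBuilder sb = new StringBuilder(n);
        for (int i = 0; i < n; i++) {
            sb.append(c);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String encoded = "M" + repeat('A', 60);   // 61 characters, starts with M
        String tooLong = "M" + repeat('A', 63);   // 64 characters, not matched
        String text = "header\n" + encoded + "\n" + tooLong + "\nbody";
        System.out.println(removeUuLines(text));
    }
}
```

In this sketch only the characters of a matching line are removed; the line terminator stays, so an empty line is left where the encoded line was. The 64-character line passes through unchanged, which mirrors the limitation noted above.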

Constructor Detail

CharSequenceRemoveUUEncodedBlocks

public CharSequenceRemoveUUEncodedBlocks()

Method Detail

pipe

public Instance pipe(Instance carrier)
Description copied from class: Pipe
This method should really be 'protected', but it is public for historical reasons.

Overrides:
pipe in class Pipe