cc.mallet.extract
Interface TokenizationFilter

All Known Implementing Classes:
BIOTokenizationFilter, BIOTokenizationFilterWithTokenIndices, ConfidenceTokenizationFilter, DefaultTokenizationFilter, HierarchicalTokenizationFilter

public interface TokenizationFilter

Created: Nov 12, 2004

Version:
$Id: TokenizationFilter.java,v 1.1 2007/10/22 21:37:44 mccallum Exp $
Author:
Method Summary
 LabeledSpans constructLabeledSpans(LabelAlphabet dict, java.lang.Object document, Label backgroundTag, Tokenization input, Sequence seq)
          Converts a the sequence of labels into a set of labeled spans.
 

Method Detail

constructLabeledSpans

LabeledSpans constructLabeledSpans(LabelAlphabet dict,
                                   java.lang.Object document,
                                   Label backgroundTag,
                                   Tokenization input,
                                   Sequence seq)
Converts a the sequence of labels into a set of labeled spans. Essentially, this converts the output of sequence labeling into an extraction output.

Parameters:
dict -
document -
backgroundTag -
input -
seq -
Returns: