Package com.googlecode.whatswrong.io

The io package contains classes for loading NLP instances from a variety of corpus file formats.


Interface Summary
CorpusFormat The CorpusFormat interface describes objects that can load a list of NLPInstances from a file.
CorpusFormat.Monitor A Monitor monitors the progress of the CorpusFormat.load(java.io.File, int, int) method.
TabProcessor A TabProcessor takes a table of string values and returns an NLPInstance.
 

Class Summary
BioNLP2009SharedTaskFormat The BioNLP2009SharedTaskFormat loads files in the format of the BioNLP 2009 Shared Task.
CoNLL2000 Loads CoNLL 2000 chunk data.
CoNLL2002 Loads CoNLL 2002 NER data.
CoNLL2003 Loads CoNLL 2003 chunk and NER data.
CoNLL2004 Loads CoNLL 2004 SRL data.
CoNLL2005 Loads CoNLL 2005 SRL data.
CoNLL2006 Loads CoNLL 2006 dependency data.
CoNLL2008 Loads CoNLL 2008 joint SRL and dependency data.
GaleAlignmentFormat The GaleAlignmentFormat reads bilingual alignment data in an XML-like format.
LispSExprFormat Loads Lisp S-Expression data.
MaltTab Loads Malt-TAB dependencies.
TabFormat A TabFormat loads data from text files where token properties are represented as whitespace- or tab-separated values.
TheBeastFormat Loads Markov thebeast data.
 

Package com.googlecode.whatswrong.io Description

The io package contains classes for loading NLP instances from a variety of corpus file formats. The main interface of interest is CorpusFormat, which is implemented for several NLP corpus formats, such as the CoNLL shared task formats. To add support for a new format, implement this interface.
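
The following is a minimal sketch of what adding a format might look like. It does not use the real CorpusFormat or NLPInstance types: Format and Instance below are hypothetical, simplified stand-ins so that the example compiles on its own. Only the load(java.io.File, int, int) parameter list is taken from the interface summary above. The return type, the checked exception, and the reading of the two int arguments as a half-open range of instance indices (from inclusive, to exclusive) are assumptions made for illustration.

```java
import java.io.BufferedReader;
import java.io.File;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Sketch only: every type in here is a hypothetical stand-in, not the real API.
class CorpusFormatSketch {

    // Stand-in for NLPInstance: an instance is just a list of tokens.
    static final class Instance {
        final List<String> tokens;

        Instance(List<String> tokens) {
            this.tokens = tokens;
        }
    }

    // Stand-in for CorpusFormat, reduced to a single load method.
    // Assumption: 'from' is inclusive and 'to' is exclusive.
    interface Format {
        List<Instance> load(File file, int from, int to) throws IOException;
    }

    // A toy format: one sentence per line, tokens separated by whitespace.
    static final class OneSentencePerLineFormat implements Format {
        @Override
        public List<Instance> load(File file, int from, int to) throws IOException {
            List<Instance> result = new ArrayList<>();
            try (BufferedReader reader =
                     Files.newBufferedReader(file.toPath(), StandardCharsets.UTF_8)) {
                String line;
                int index = 0; // index of the next non-blank line
                while ((line = reader.readLine()) != null) {
                    String trimmed = line.trim();
                    if (trimmed.isEmpty()) {
                        continue; // blank lines are skipped and not counted
                    }
                    if (index >= to) {
                        break; // past the requested range
                    }
                    if (index >= from) {
                        String[] tokens = trimmed.split("\\s+");
                        result.add(new Instance(new ArrayList<>(Arrays.asList(tokens))));
                    }
                    index++;
                }
            }
            return result;
        }
    }

    public static void main(String[] args) throws IOException {
        // Write a small temporary corpus and load instances 1 and 2 from it.
        File file = File.createTempFile("corpus", ".txt");
        file.deleteOnExit();
        Files.write(file.toPath(),
                Arrays.asList("the cat sleeps", "", "dogs bark", "birds sing"),
                StandardCharsets.UTF_8);

        Format format = new OneSentencePerLineFormat();
        for (Instance instance : format.load(file, 1, 3)) {
            System.out.println(instance.tokens);
        }
    }
}
```

A real implementation would implement CorpusFormat itself and build NLPInstance objects; consult the CorpusFormat documentation for the methods it actually requires.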

See Also:
CorpusFormat


Copyright © 2009. All Rights Reserved.