java.lang.Object
  com.googlecode.whatswrong.io.TabFormat
public class TabFormat
extends java.lang.Object
implements CorpusFormat
A TabFormat loads data from text files in which token properties are represented as whitespace- or tab-separated values.
This includes formats such as the CoNLL shared task formats and the MALT-Tab format. This class provides the generic
framework for processing such tab-separated data. To implement a concrete format, clients have to implement the TabProcessor
interface.
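To make the input shape concrete, here is a minimal sketch of reading CoNLL-style text, where each line holds one token, columns are separated by whitespace or tabs, and a blank line ends a sentence. The class and method names (`TabRowsSketch`, `readSentences`) are hypothetical and this is not how TabFormat itself is implemented; the nested list it produces merely mirrors the shape of the `rows` parameter seen in the method signatures on this page.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Illustrative sketch only; hypothetical names, not part of the whatswrong API.
class TabRowsSketch {

    // Groups token lines into sentences. Each token line becomes a list of
    // column values; a blank line closes the current sentence.
    static List<List<List<String>>> readSentences(BufferedReader reader) throws IOException {
        List<List<List<String>>> sentences = new ArrayList<>();
        List<List<String>> rows = new ArrayList<>();
        String line;
        while ((line = reader.readLine()) != null) {
            line = line.trim();
            if (line.isEmpty()) {
                // Blank line: finish the sentence if it has any tokens.
                if (!rows.isEmpty()) {
                    sentences.add(rows);
                    rows = new ArrayList<>();
                }
            } else {
                // Split on any run of whitespace (spaces or tabs).
                rows.add(Arrays.asList(line.split("\\s+")));
            }
        }
        // The last sentence may not be followed by a blank line.
        if (!rows.isEmpty()) {
            sentences.add(rows);
        }
        return sentences;
    }

    public static void main(String[] args) throws IOException {
        String text = "1\tThe\tDT\n2\tcat\tNN\n\n1\tSleeps\tVBZ\n";
        List<List<List<String>>> sentences =
                readSentences(new BufferedReader(new StringReader(text)));
        System.out.println(sentences.size() + " sentences");
        System.out.println("first row: " + sentences.get(0).get(0));
    }
}
```

Because the sketch splits on runs of whitespace, empty columns would collapse; a strict tab-separated reader would split on a single tab instead.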
Nested Class Summary |
---|
Nested classes/interfaces inherited from interface com.googlecode.whatswrong.io.CorpusFormat |
---|
CorpusFormat.Monitor |
Constructor Summary |
---|
TabFormat() |
Method Summary | | |
---|---|---|
void | addProcessor(java.lang.String name, TabProcessor processor) | |
void | addProcessor(TabProcessor processor) | |
static void | extractSpan00(java.util.List<? extends java.util.List<java.lang.String>> rows, int column, java.lang.String type, NLPInstance instance) | |
static void | extractSpan03(java.util.List<? extends java.util.List<java.lang.String>> rows, int column, java.lang.String type, NLPInstance instance) | |
static void | extractSpan05(java.util.List<? extends java.util.List<java.lang.String>> rows, int column, java.lang.String type, java.lang.String prefix, NLPInstance instance) | |
javax.swing.JComponent | getAccessory() | Returns the GUI element that controls how this format is to be loaded. |
java.lang.String | getLongName() | Returns a longer name that may contain information about the configuration of this format. |
java.lang.String | getName() | Returns the name of this format. |
java.util.List<NLPInstance> | load(java.io.File file, int from, int to) | Loads a corpus from a file, starting at instance from and ending at instance to (exclusive). |
void | loadProperties(java.util.Properties properties, java.lang.String prefix) | Loads a configuration for this format from the given Properties object. |
void | saveProperties(java.util.Properties properties, java.lang.String prefix) | Saves the configuration of this format to a Properties object. |
void | setMonitor(CorpusFormat.Monitor monitor) | Sets the object that monitors the progress of this format when loading a file. |
java.lang.String | toString() | |
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
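The loadProperties and saveProperties methods both take a prefix so that several formats can share one Properties object. The following is a rough sketch of that pattern using only `java.util.Properties`. The class name, the `wordColumn` setting, and the `prefix + ".wordColumn"` key layout are all assumptions made for illustration; this page does not say which keys TabFormat actually reads or writes.

```java
import java.util.Properties;

// Illustrative sketch only; the setting and key layout are hypothetical.
class PrefixedPropertiesSketch {

    // Hypothetical configuration value of a format.
    private int wordColumn = 1;

    int getWordColumn() {
        return wordColumn;
    }

    void setWordColumn(int wordColumn) {
        this.wordColumn = wordColumn;
    }

    // Writes this configuration under keys that start with the given prefix.
    void saveProperties(Properties properties, String prefix) {
        properties.setProperty(prefix + ".wordColumn", Integer.toString(wordColumn));
    }

    // Reads this configuration back; keeps the current value if the key is absent.
    void loadProperties(Properties properties, String prefix) {
        String value = properties.getProperty(prefix + ".wordColumn",
                Integer.toString(wordColumn));
        wordColumn = Integer.parseInt(value);
    }

    public static void main(String[] args) {
        Properties properties = new Properties();

        PrefixedPropertiesSketch saved = new PrefixedPropertiesSketch();
        saved.setWordColumn(3);
        saved.saveProperties(properties, "tab");

        PrefixedPropertiesSketch restored = new PrefixedPropertiesSketch();
        restored.loadProperties(properties, "tab");
        System.out.println("restored wordColumn = " + restored.getWordColumn());
    }
}
```

Using the same prefix for saving and loading is what ties the two calls together; a different prefix would simply leave the defaults in place.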
Constructor Detail |
---|
public TabFormat()
Method Detail |
---|
public void addProcessor(java.lang.String name, TabProcessor processor)
public void addProcessor(TabProcessor processor)
public java.lang.String toString()
Overrides:
toString in class java.lang.Object
public java.lang.String getName()
Description copied from interface: CorpusFormat
Returns the name of this format.
Specified by:
getName in interface CorpusFormat
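public java.lang.String getLongName()
Description copied from interface: CorpusFormat
Returns a longer name that may contain information about the configuration of this format.
Specified by:
getLongName in interface CorpusFormat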
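public javax.swing.JComponent getAccessory()
Description copied from interface: CorpusFormat
Returns the GUI element that controls how this format is to be loaded.
Specified by:
getAccessory in interface CorpusFormat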
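public void setMonitor(CorpusFormat.Monitor monitor)
Description copied from interface: CorpusFormat
Sets the object that monitors the progress of this format when loading a file.
Specified by:
setMonitor in interface CorpusFormat
Parameters:
monitor - the monitor for this format.
public void loadProperties(java.util.Properties properties, java.lang.String prefix)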
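Description copied from interface: CorpusFormat
Loads a configuration for this format from the given Properties object.
Specified by:
loadProperties in interface CorpusFormat
Parameters:
properties - the Properties object to load from.
prefix - the prefix that properties for this format have in the Properties object.
public void saveProperties(java.util.Properties properties, java.lang.String prefix)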
Description copied from interface: CorpusFormat
Saves the configuration of this format to a Properties object.
Specified by:
saveProperties in interface CorpusFormat
Parameters:
properties - the Properties object to store the configuration of this format in.
prefix - the prefix that the properties should have.
public java.util.List<NLPInstance> load(java.io.File file, int from, int to) throws java.io.IOException
Description copied from interface: CorpusFormat
Loads a corpus from a file, starting at instance from and ending at instance to (exclusive). This method is required
to call CorpusFormat.Monitor.progressed(int) after each instance that was processed.
Specified by:
load in interface CorpusFormat
Parameters:
file - the file to load the corpus from.
from - the starting instance index.
to - the end instance index (exclusive).
Throws:
java.io.IOException - if I/O goes wrong.
public static void extractSpan03(java.util.List<? extends java.util.List<java.lang.String>> rows, int column, java.lang.String type, NLPInstance instance)
public static void extractSpan00(java.util.List<? extends java.util.List<java.lang.String>> rows, int column, java.lang.String type, NLPInstance instance)
public static void extractSpan05(java.util.List<? extends java.util.List<java.lang.String>> rows, int column, java.lang.String type, java.lang.String prefix, NLPInstance instance)
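The contract described for load above (start at instance from, stop before instance to, and report progress after each processed instance) can be sketched as follows. This is only an illustration under stated assumptions: `RangeLoadSketch` and `loadRange` are hypothetical names, an `IntConsumer` stands in for CorpusFormat.Monitor, and passing the index of the instance just processed to the callback is a guess, since this page does not say what the int argument of progressed(int) means.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.IntConsumer;

// Illustrative sketch only; hypothetical names, not the TabFormat implementation.
class RangeLoadSketch {

    // Returns the instances whose index lies in [from, to) and calls the
    // progress callback once after each instance that was processed.
    static <T> List<T> loadRange(List<T> allInstances, int from, int to, IntConsumer progressed) {
        List<T> result = new ArrayList<>();
        int start = Math.max(from, 0);
        int end = Math.min(to, allInstances.size());
        for (int index = start; index < end; index++) {
            result.add(allInstances.get(index));
            // Assumption: report the index of the instance just processed.
            progressed.accept(index);
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> corpus = Arrays.asList("s0", "s1", "s2", "s3");
        List<String> loaded = loadRange(corpus, 1, 3,
                index -> System.out.println("processed instance " + index));
        System.out.println("loaded " + loaded);
    }
}
```

Clamping the range to the available instances is a design choice of this sketch; a real loader could equally stop reading the file as soon as index to is reached.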