UIMA MUC7 Collection Reader
The JULIE Lab MUC7 Collection Reader (a UIMA Collection Reader) reads MUC7 data that can be purchased from the Linguistic Data Consortium (LDC, http://www.ldc.upenn.edu/). The MUC7 data must be transformed in valid XML format (instead of SGML). The reader reads sections, paragraphs, all named entities as well as coreferences. The MUC7 templates that are only available in BNF and that describe events are not processed (yet). The extracted information is stored in the type system (see our UIMA type system).