data.MASC-1.0.3.data.written.20000415_apw_eng-NEW.anc Maven / Gradle / Ivy
20000415_apw-NEW
Language Understanding Annotation Corpus
Linguistic Data Consortium
LDC2009T10"
Associated Press
unknown
Eliminated XML introduced in the LU corpus version.
English (United States)
8-bit UCS/Unicode Transformation Format
Logical structure
Sentence boundaries
FrameNet
FrameNet tokens and part of speech tags
Noun chunks
Penn part of speech tags
Penn Tree Bank
Penn Tree Bank tokens and part of speech tags
Base segmentation (quarks)
Verb chunks
Committed Belief
Events
Multi-Perspective Question Answering opinion corpus
Named Entities
Document content
Keith Suderman
- Generated standoff from original files.
Keith Suderman
- Added fn, fntok, nc, NE, penn, ptb, ptbtok, seg, vc annotations.
Nancy Ide
- Added fileDesc information to the header
2010-09-19
KBS
- Added cb, event, mpqa, ne, content annotations.