All Downloads are FREE. Search and download functionalities are using the official Maven repository.

data.MASC-1.0.3.data.written.20000415_apw_eng-NEW.anc Maven / Gradle / Ivy

There is a newer version: 0.6.3
Show newest version


   
      
         20000415_apw-NEW
      
      
      
         Language Understanding Annotation Corpus
         Linguistic Data Consortium
         LDC2009T10"
         Associated Press
         unknown
      
   
   
      Eliminated XML introduced in the LU corpus version.
   
   
      
         English (United States)
      
      
         8-bit UCS/Unicode Transformation Format
      
      
         
         
         
      
      
      
         Logical structure
         Sentence boundaries
         FrameNet
         FrameNet tokens and part of speech tags
         Noun chunks
         Penn part of speech tags
         Penn Tree Bank
         Penn Tree Bank tokens and part of speech tags
         Base segmentation (quarks)
         Verb chunks
         Committed Belief
         Events
         Multi-Perspective Question Answering opinion corpus
         Named Entities
         Document content
      
   
   
      
         
         Keith Suderman
         Generated standoff from original files.
      
      
         
         Keith Suderman
         Added fn, fntok, nc, NE, penn, ptb, ptbtok, seg, vc annotations.
      
      
         
         Nancy Ide
         Added fileDesc information to the header
      
      
         2010-09-19
         KBS
         Added cb, event, mpqa, ne, content annotations.
      
   




© 2015 - 2024 Weber Informatics LLC | Privacy Policy