Standards for construction of corpora based on Korean language analysis

Detail 1

Morphemes analysis, named entity recognition, and parsing have been selected as the Telecommunication Technology Association (TTA) standards through validation by various experts under direction of the ETRI language intelligence research group. The standards for thematic role recognition and question analysis are submitted and currently subject to validation. You can download the standards below

Move
ETRI Corpora

Detail 1

Exobrain QA datasets (ETRI), language analysis corpora (ETRI), Korean TimeBank and SpaceBank, Morphology/Semantics corpora provided by University of Ulsan, corpora service in open API services (http://aiopen.etri.re.kr) provided by ETRI

Move
Current status of development and plan for improvement of Exobrain’s Korean language analysis and question answering technology

Detail 1

Journal of Society for Information Science and Technology, Vol. 35, No. 8, Aug. 2017

Download
irel.kaist.ac.kr data list

Detail 3

  • English concept embedding
  • English context embedding
  • Morphological-semantic annotation corpus
  • Reliance-thematic role annotation corpus
  • Object name annotation corpus
Move