TY - GEN
T1 - Distant supervision for relation extraction with an incomplete knowledge base
AU - Min, Bonan
AU - Grishman, Ralph
AU - Wan, Li
AU - Wang, Chang
AU - Gondek, David
N1 - Publisher Copyright:
© 2013 Association for Computational Linguistics.
PY - 2013
Y1 - 2013
N2 - Distant supervision, heuristically labeling a corpus using a knowledge base, has emerged as a popular choice for training relation extractors. In this paper, we show that a significant number of "negative" examples generated by the labeling process are false negatives because the knowledge base is incomplete. Therefore the heuristic for generating negative examples has a serious flaw. Building on a state-of-The-Art distantly-supervised extraction algorithm, we proposed an algorithm that learns from only positive and unlabeled labels at the pair-of-entity level. Experimental results demonstrate its advantage over existing algorithms.
AB - Distant supervision, heuristically labeling a corpus using a knowledge base, has emerged as a popular choice for training relation extractors. In this paper, we show that a significant number of "negative" examples generated by the labeling process are false negatives because the knowledge base is incomplete. Therefore the heuristic for generating negative examples has a serious flaw. Building on a state-of-The-Art distantly-supervised extraction algorithm, we proposed an algorithm that learns from only positive and unlabeled labels at the pair-of-entity level. Experimental results demonstrate its advantage over existing algorithms.
UR - http://www.scopus.com/inward/record.url?scp=84926224568&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84926224568&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:84926224568
T3 - NAACL HLT 2013 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Main Conference
SP - 777
EP - 782
BT - Proceedings of the 2013 Conference of the North American Chapter of the Association for Computational Linguistics
PB - Association for Computational Linguistics (ACL)
T2 - 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL HLT 2013
Y2 - 9 June 2013 through 14 June 2013
ER -