TY - JOUR
T1 - Distributed representation learning for knowledge graphs with entity descriptions
AU - Fan, Miao
AU - Zhou, Qiang
AU - Zheng, Thomas Fang
AU - Grishman, Ralph
N1 - Funding Information:
This work is supported by the China Scholarship Council, the National Program on Key Basic Research Project (973 Program) under Grant 2013CB329304, the National Natural Science Foundation of China (NSFC) under Grant nos. 61433018 and 61373075, and the Proteus Project of NYU. The first author conducted this research while he was a jointly supervised Ph.D. student of Tsinghua University and New York University.
Publisher Copyright:
© 2016 Elsevier B.V.
PY - 2017/7/1
Y1 - 2017/7/1
N2 - Recent studies of knowledge representation attempt to project both entities and relations, which originally compose a high-dimensional and sparse knowledge graph, into a continuous low-dimensional space. One canonical approach, TransE [2], represents entities and relations with vectors (embeddings) and achieves leading performance solely with triplets, i.e. (head_entity, relation, tail_entity), in a knowledge base. The cutting-edge method DKRL [23] extends TransE by enhancing the embeddings with entity descriptions through deep neural network models. However, DKRL requires extra space to store the parameters of its inner layers, and relies on more hyperparameters to be tuned. We therefore propose a single-layer model that requires far fewer parameters. The model measures the probability of each triplet along with the corresponding entity descriptions, and simultaneously learns contextual embeddings of entities, relations, and the words in descriptions by maximizing the log-likelihood of the observed knowledge. We evaluate our model on the tasks of knowledge graph completion and entity type classification with two benchmark datasets, FB500K and EN15K, respectively. Experimental results demonstrate that the proposed model outperforms both TransE and DKRL, indicating that it is both efficient and effective in learning better distributed representations for knowledge bases.
AB - Recent studies of knowledge representation attempt to project both entities and relations, which originally compose a high-dimensional and sparse knowledge graph, into a continuous low-dimensional space. One canonical approach, TransE [2], represents entities and relations with vectors (embeddings) and achieves leading performance solely with triplets, i.e. (head_entity, relation, tail_entity), in a knowledge base. The cutting-edge method DKRL [23] extends TransE by enhancing the embeddings with entity descriptions through deep neural network models. However, DKRL requires extra space to store the parameters of its inner layers, and relies on more hyperparameters to be tuned. We therefore propose a single-layer model that requires far fewer parameters. The model measures the probability of each triplet along with the corresponding entity descriptions, and simultaneously learns contextual embeddings of entities, relations, and the words in descriptions by maximizing the log-likelihood of the observed knowledge. We evaluate our model on the tasks of knowledge graph completion and entity type classification with two benchmark datasets, FB500K and EN15K, respectively. Experimental results demonstrate that the proposed model outperforms both TransE and DKRL, indicating that it is both efficient and effective in learning better distributed representations for knowledge bases.
KW - Entity description
KW - Entity type classification
KW - Knowledge graph
KW - Knowledge graph completion
KW - Representation learning
UR - http://www.scopus.com/inward/record.url?scp=85000839169&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85000839169&partnerID=8YFLogxK
U2 - 10.1016/j.patrec.2016.09.005
DO - 10.1016/j.patrec.2016.09.005
M3 - Article
AN - SCOPUS:85000839169
SN - 0167-8655
VL - 93
SP - 31
EP - 37
JO - Pattern Recognition Letters
JF - Pattern Recognition Letters
ER -