TY - GEN
T1 - Updating a name tagger using contemporary unlabeled data
AU - Mota, Cristina
AU - Grishman, Ralph
N1 - Funding Information:
The first author’s research work was funded by Fundação para a Ciência e a Tecnologia through a doctoral scholarship (ref.: SFRH/BD/3237/2000).
PY - 2009
Y1 - 2009
AB - For many NLP tasks, including named entity tagging, semi-supervised learning has been proposed as a reasonable alternative to methods that require annotating large amounts of training data. In this paper, we address the problem of analyzing new data given a semi-supervised NE tagger trained on data from an earlier time period. We will show that updating the unlabeled data is sufficient to maintain quality over time, and outperforms updating the labeled data. Furthermore, we will also show that augmenting the unlabeled data with older data in most cases does not result in better performance than simply using a smaller amount of current unlabeled data.
UR - http://www.scopus.com/inward/record.url?scp=79952416941&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79952416941&partnerID=8YFLogxK
U2 - 10.3115/1667583.1667693
DO - 10.3115/1667583.1667693
M3 - Conference contribution
AN - SCOPUS:79952416941
SN - 9781617382581
T3 - ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.
SP - 353
EP - 356
BT - ACL-IJCNLP 2009 - Joint Conf. of the 47th Annual Meeting of the Association for Computational Linguistics and 4th Int. Joint Conf. on Natural Language Processing of the AFNLP, Proceedings of the Conf.
PB - Association for Computational Linguistics (ACL)
T2 - Joint Conference of the 47th Annual Meeting of the Association for Computational Linguistics and 4th International Joint Conference on Natural Language Processing of the AFNLP, ACL-IJCNLP 2009
Y2 - 2 August 2009 through 7 August 2009
ER -