Entity linking for tweets
Abstract
Named Entity Linking (NEL) is the task of semantically annotating entity mentions in a portion of text with links to a knowledge base. The automatic annotation, which requires the recognition and disambiguation of the entity mention, usually exploits contextual clues like the context of usage and the coherence with respect to other entities. In Twitter, the limits of 140 characters originates very short and noisy text messages that pose new challenges to the entity linking task. We propose an overview of NEL methods focusing on approaches specifically developed to deal with short messages, like tweets. NEL is a fundamental task for the extraction and annotation of concepts in tweets, which is necessary for making the Twitter’s huge amount of interconnected user-generated contents machine readable and enable the intelligent information access.
References
- 1. , Kore: Keyphrase overlap relatedness for entity disambiguation, Proc. 21st ACM Int. Conf. Information and Knowledge Management (2012), pp. 545–554. Google Scholar
- 2. D. Weissenborn, L. Hennig, F. Xu and H. Uszkoreit, Multi-objective optimization for the joint disambiguation of nouns and named entities, Proc. 53rd Annual Meeting of the Association for Computational Linguistics and the 7th Int. Joint Conf. Natural Language Processing (Volume 1: Long Papers) (Association for Computational Linguistics, Beijing, China, July 2015), pp. 596–605. Google Scholar
- 3. , Robust disambiguation of named entities in text, Proc. Conf. on Empirical Methods in Natural Language Processing (2011), pp. 782–792. Google Scholar
- 4. , Adding semantics to microblog posts, Proc. Fifth ACM Int. Conf. Web Search and Data Mining (2012), pp. 563–572. Google Scholar
- 5. , Analysis of named entity recognition and linking for tweets, Inf. Process. Manage. 51, 32 (2015). Crossref, Google Scholar
- 6. , Introduction to the conll-2003 shared task: Language-independent named entity recognition, Proc. CoNLL-2003, eds. W. Daelemans and M. Osborne (Edmonton, Canada, 2003), pp. 142–147. Google Scholar
- 7. , Annotating named entities in twitter data with crowdsourcing, Proc. NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, CSLDAMT’10 (Association for Computational Linguistics, Stroudsburg, PA, USA, 2010), pp. 80–88. Google Scholar
- 8. , Named entity recognition in tweets: An experimental study, Proc. Conf. Empirical Methods in Natural Language Processing (2011), pp. 1524–1534. Google Scholar
- 9. A. E. Cano, M. Rowe, M. Stankovic and A. Dadzie (eds.), Proc. Concept Extraction Challenge at the Workshop on ’Making Sense of Microposts’, Rio de Janeiro, Brazil, May 13, 2013, CEUR Workshop Proc. Vol. 1019 (CEUR-WS.org, 2013). Google Scholar
- 10. X. Liu, M. Zhou, F. Wei, Z. Fu and X. Zhou, Joint inference of named entity recognition and normalization for tweets, Proc. 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1 (2012), pp. 526–535. Google Scholar
- 11. , Making sense of social media streams through semantics: A survey, Sem. Web 5, 373 (2014). Crossref, Google Scholar
- 12. , Yago: A core of semantic knowledge unifying wordnet and wikipedia, 16th Int. World Wide Web Conf. (WWW 2007) (2007), pp. 697–706. Google Scholar
- 13. , Wordnet: A lexical database for english, Commun. ACM 38, 39 (1995). Crossref, Google Scholar
- 14. , BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network, Artif. Intell. 193, 217 (2012). Crossref, Google Scholar
- 15. , Dbpedia: A nucleus for a web of open data, 6th Int. Semantic Web Conf. (ISWC 2007) (Springer, 2007), pp. 722–735. Google Scholar
- 16. , Freebase: A collaboratively created graph database for structuring human knowledge, Proc. 2008 ACM SIGMOD Int. Conf. Management of data (2008), pp. 1247–1250. Google Scholar
- 17. , Entity linking with a knowledge base: Issues, techniques, and solutions, IEEE Trans. Knowl. Data Eng. 27, 443 (2015). Crossref, Google Scholar
- 18. , Entity linking with effective acronym expansion, instance selection, and topic modeling., IJCAI (2011), pp. 1909–1914. Google Scholar
- 19. , Nlpr_kbp in tac 2009 kbp track: A two-stage method to entity linking, Proc. Test Analysis Conf. 2009 (TAC 09) (2009). Google Scholar
- 20. , Entity disambiguation for knowledge base population, Proc. 23rd Int. Conf. Computational Linguistics (2010), pp. 277–285. Google Scholar
- 21. , Entity linking leveraging: Automatically generated annotation, Proc. 23rd Int. Conf. Computational Linguistics (2010), pp. 1290–1298. Google Scholar
- 22. , To link or not to link? a study on end-to-end tweet entity linking, HLT-NAACL (2013), pp. 1020–1030. Google Scholar
- 23. , Entity extraction, linking, classification, and tagging for social media: A wikipedia-based approach, Proc. VLDB Endowment 6, 1126 (2013). Crossref, Google Scholar
- 24. , Large-scale named entity disambiguation based on wikipedia data, EMNLP-CoNLL (2007), pp. 708–716. Google Scholar
- 25. , Learning to link with wikipedia, Proc. 17th ACM Conf. Information and Knowledge Management (2008), pp. 509–518. Google Scholar
- 26. , An open-source toolkit for mining wikipedia, Artif. Intell. 194, 222 (2013). Crossref, Google Scholar
- 27. , The google similarity distance, IEEE Trans. Knowl. Data Eng. 19, 370 (2007). Crossref, Google Scholar
- 28. , Local and global algorithms for disambiguation to wikipedia, Proc. 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1 (2011), pp. 1375–1384. Google Scholar
- 29. , Learning relatedness measures for entity linking, Proc. 22nd ACM Int. Conf. Information & Knowledge Management (2013), pp. 139–148. Google Scholar
- 30. , Learning to rank entity relatedness through embedding-based features, Int. Conf. Applications of Natural Language to Information Systems (2016), pp. 471–477. Google Scholar
- 31. , Collective entity linking in web text: A graph-based method, Proc. 34th Int. ACM SIGIR Conf. Research and Development in Information Retrieval (2011), pp. 765–774. Google Scholar
- 32. , Topic-sensitive pagerank, Proc. 11th Int. Conf. World Wide Web (2002), pp. 517–526. Google Scholar
- 33. , Entity linking meets word sense disambiguation: A unified approach, Trans. Assoc. Comput. Linguist. (TACL) 2, 231 (2014). Crossref, Google Scholar
- 34. , Modeling mention, context and entity with neural networks for entity disambiguation, Proc. Int. Joint Conf. Artificial Intelligence (IJCAI) (2015), pp. 1333–1339. Google Scholar
- 35. , Learning entity representation for entity disambiguation., 51st Annual Meeting of the Association for Computational Linguistics (2013), pp. 30–34. Google Scholar
- 36. H. Huang, L. Heck and H. Ji, Leveraging deep neural networks and knowledge graphs for entity disambiguation, arXiv: 1504.07678. Google Scholar
- 37. M. Francis-Landau, G. Durrett and D. Klein, Capturing semantic similarity for entity linking with convolutional neural networks, arXiv:1604.00734. Google Scholar
- 38. M. A. Yosef, J. Hoffart, Y. Ibrahim, A. Boldyrev and G. Weikum, Adapting aida for tweets, Making Sense of Microposts (# Microposts2014) (2014). Google Scholar
- 39. , Aida: An online tool for accurate disambiguation of named entities in text and tables, Proc. VLDB Endowment 4, 1450 (2011). Crossref, Google Scholar
- 40. U. Scaiella, M. Barbera, S. Parmesan, G. Prestia, E. Del Tessandoro and M. Verı, Datatxt at# microposts2014 challenge, Making Sense of Microposts (# Microposts2014) (2014), pp. 1–15. Google Scholar
- 41. , Fast and accurate annotation of short texts with wikipedia pages, IEEE Softw. 1, 70 (2012). Crossref, Google Scholar
- 42. H. Barathi Ganesh, N. Abinaya, M. Anand Kumar, R. Vinayakumar and K. Soman, Amrita-cen@ neel: Identification and linking of twitter entities, Making Sense of Microposts (# Microposts2015) (2015). Google Scholar
- 43. , Robust entity linking via random walks, Proc. 23rd ACM Int. Conf. Conf. Information and Knowledge Management (2014), pp. 499–508. Google Scholar
- 44. Z. Guo and D. Barbosa, Entity recognition and linking on tweets with random walks, Making Sense of Microposts (# Microposts2015) (2015). Google Scholar
- 45. C. Gârbacea, D. Odijk, D. Graus, I. Sijaranamual and M. de Rijke, Combining multiple signals for semanticizing tweets: University of amsterdam at# microposts2015, Making Sense of Microposts (# Microposts2015) (2015), pp. 59–60. Google Scholar
- 46. , Semanticizing search engine queries: The university of amsterdam at the erd 2014 challenge, Proc. first Int. Workshop on Entity Recognition & Disambiguation (2014), pp. 69–74. Google Scholar
- 47. J. Waitelonis and H. Sack, Named entity linking in# tweets with kea. Google Scholar
- 48. , The journey is the reward-towards new paradigms in web search, Int. Conf. Business Information Systems (2015), pp. 15–26. Google Scholar
- 49. D. R. K. W. Amparo E. Cano, Daniel PreoÂÿtiuc-Pietro and A.-S. Dadzie, 6th Workshop on Making Sense of Microposts (# microposts2016), Word Wide Web Conf. (WWW16) Companion (ACM). Google Scholar
- 50. P. Basile, A. Caputo, G. Semeraro and F. Narducci, Uniba: Exploiting a distributional semantic model for disambiguating and linking entities in tweets Making Sense of Microposts (# Microposts2015) (2015). Google Scholar
- 51. I. Yamada, H. Takeda and Y. Takefuji, An end-to-end entity linking approach for tweets, Making Sense of Microposts (# Microposts2015) (2015). Google Scholar
- 52. M. B. Habib, M. Van Keulen and Z. Zhu, Named entity extraction and linking challenge: University of twente at# microposts2014, Making Sense of Microposts (# Microposts2014), (2014). Google Scholar
- 53. , Twiner: named entity recognition in targeted twitter stream, Proc. 35th Int. ACM SIGIR Conf. Research and Development in Information Retrieval (2012), pp. 721–730. Google Scholar
- 54. R. Bansal, S. Panem, P. Radhakrishnan, M. Gupta and V. Varma, Linking entities in# microposts, Making Sense of Microposts (# Microposts2014) (2014). Google Scholar
- 55. , Adapting boosting for information retrieval measures, Inf. Ret. 13, 254 (2010). Crossref, Google Scholar
- 56. M.-W. Chang, B.-J. Hsu, H. Ma, R. Loynd and K. Wang, E2e: An end-to-end entity linking system for short and noisy text, Making Sense of Microposts (# Microposts2014) (2014). Google Scholar
- 57. , Making sense of microposts:(# microposts2014) named entity extraction & linking challenge, CEUR Workshop Proc (2014), pp. 54–60. Google Scholar
- 58. , Entity linking for tweets, 51st Annual Meeting of the Association for Computational Linguistics (ACL, 2013), pp. 1304–1311. Google Scholar
- 59. , S-mart: Novel tree-based structured learning algorithms applied to tweet entity linking, Proc. Association for Computational Linguistics (2015), pp. 504–513. Google Scholar
- 60. , Linking named entities in tweets with knowledge base via user interest modeling, Proc. 19th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining (2013), pp. 68–76. Google Scholar
- 61. , Microblog entity linking by leveraging extra posts, Proc. 2013 Conf. Empirical Methods in Natural Language Processing (2013), pp. 863–868. Google Scholar
- 62. , Improved entity linking with user history and news articles, 9th Pacific Asia Conf. on Language, Information and Computation (2015), pp. 19–26. Google Scholar
- 63. , Entity linking on microblogs with spatial and temporal signals, Trans. Assoc. Comput. Linguist. 2, 259 (2014). Crossref, Google Scholar
- 64. , Making sense of microposts (# microposts2015) named entity recognition and linking (neel) challenge, 5th Workshop on Making Sense of Microposts (# Microposts2015) (2015), pp. 44–53. Google Scholar
- 65. , On coreference resolution performance metrics, Proc. Conf. Human Language Technology and Empirical Methods in Natural Language Processing (2005), pp. 25–32. Google Scholar
- 66. , Erd’14: entity recognition and disambiguation challenge, ACM SIGIR Forum (2), 63 (2014). Crossref, Google Scholar
- 67. , Entity linking for italian tweets, Proc. Second Italian Conf. Computational Linguistics CLiC-it 2015, eds. C. Bosco, S. Tonelli and F. M. Zanzotto (Accademia University Press, 2015), pp. 36–40. Google Scholar
- 68. ,
Linked data-the story so far , Semantic Services, Interoperability and Web Applications: Emerging Concepts, 205 (2009). Crossref, Google Scholar - 69. , Twitter power: Tweets as electronic word of mouth, J. Am. Soc. Inf. Sci. Technol. 60, 2169 (2009). Crossref, Google Scholar
- 70. , Entity based sentiment analysis on twitter, Science 9, 1 (2010). Google Scholar
- 71. A. Tumasjan, T. Sprenger, P. Sandner and I. Welpe, Predicting elections with twitter: What 140 characters reveal about political sentiment (2010). Google Scholar
- 72. , (how) will the revolution be retweeted?: Information diffusion and the 2011 egyptian uprising, Proc. ACM 2012 Conf. Computer Supported Cooperative Work, CSCW ’12 (ACM, New York, NY, USA, 2012), pp. 7–16. Google Scholar
- 73. , You are what you tweet: Analyzing twitter for public health, Proc. Fifth Int. AAAI Conf. Weblogs and Social Media (2011), pp. 265–272. Google Scholar
- 74. , Predicting the future with social media, Proc. 2010 IEEE/WIC/ACM Int. Conf. Web Intelligence and Intelligent Agent Technology — Volume 01, WI-IAT ’10 (IEEE Computer Society, Washington, DC, USA, 2010), pp. 492–499. Google Scholar
Remember to check out the Most Cited Articles! |
---|
Notable titles in semantic computing |