World Scientific
  • Search
  •   
Skip main navigation

Cookies Notification

We use cookies on this site to enhance your user experience. By continuing to browse the site, you consent to the use of our cookies. Learn More
×

System Upgrade on Tue, May 28th, 2024 at 2am (EDT)

Existing users will be able to log into the site and access content. However, E-commerce and registration of new users may not be available for up to 12 hours.
For online purchase, please visit us again. Contact us at [email protected] for any enquiries.

Entity linking for tweets

    https://doi.org/10.1142/S2425038416300202Cited by:5 (Source: Crossref)

    Named Entity Linking (NEL) is the task of semantically annotating entity mentions in a portion of text with links to a knowledge base. The automatic annotation, which requires the recognition and disambiguation of the entity mention, usually exploits contextual clues like the context of usage and the coherence with respect to other entities. In Twitter, the limits of 140 characters originates very short and noisy text messages that pose new challenges to the entity linking task. We propose an overview of NEL methods focusing on approaches specifically developed to deal with short messages, like tweets. NEL is a fundamental task for the extraction and annotation of concepts in tweets, which is necessary for making the Twitter’s huge amount of interconnected user-generated contents machine readable and enable the intelligent information access.

    References

    • 1. J. Hoffart, S. Seufert, D. B. Nguyen, M. Theobald and G. Weikum, Kore: Keyphrase overlap relatedness for entity disambiguation, Proc. 21st ACM Int. Conf. Information and Knowledge Management (2012), pp. 545–554. Google Scholar
    • 2. D. Weissenborn, L. Hennig, F. Xu and H. Uszkoreit, Multi-objective optimization for the joint disambiguation of nouns and named entities, Proc. 53rd Annual Meeting of the Association for Computational Linguistics and the 7th Int. Joint Conf. Natural Language Processing (Volume 1: Long Papers) (Association for Computational Linguistics, Beijing, China, July 2015), pp. 596–605. Google Scholar
    • 3. J. Hoffart, M. A. Yosef, I. Bordino, H. Fürstenau, M. Pinkal, M. Spaniol, B. Taneva, S. Thater and G. Weikum, Robust disambiguation of named entities in text, Proc. Conf. on Empirical Methods in Natural Language Processing (2011), pp. 782–792. Google Scholar
    • 4. E. Meij, W. Weerkamp and M. de Rijke, Adding semantics to microblog posts, Proc. Fifth ACM Int. Conf. Web Search and Data Mining (2012), pp. 563–572. Google Scholar
    • 5. L. Derczynski, D. Maynard, G. Rizzo, M. van Erp, G. Gorrell, R. Troncy, J. Petrak and K. Bontcheva, Analysis of named entity recognition and linking for tweets, Inf. Process. Manage. 51, 32 (2015). CrossrefGoogle Scholar
    • 6. E. F. Tjong Kim Sang and F. De Meulder, Introduction to the conll-2003 shared task: Language-independent named entity recognition, Proc. CoNLL-2003, eds. W. Daelemans and M. Osborne (Edmonton, Canada, 2003), pp. 142–147. Google Scholar
    • 7. T. Finin, W. Murnane, A. Karandikar, N. Keller, J. Martineau and M. Dredze, Annotating named entities in twitter data with crowdsourcing, Proc. NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk, CSLDAMT’10 (Association for Computational Linguistics, Stroudsburg, PA, USA, 2010), pp. 80–88. Google Scholar
    • 8. A. Ritter, S. Clark, O. Etzioni et al., Named entity recognition in tweets: An experimental study, Proc. Conf. Empirical Methods in Natural Language Processing (2011), pp. 1524–1534. Google Scholar
    • 9. A. E. Cano, M. Rowe, M. Stankovic and A. Dadzie (eds.), Proc. Concept Extraction Challenge at the Workshop on ’Making Sense of Microposts’, Rio de Janeiro, Brazil, May 13, 2013, CEUR Workshop Proc. Vol. 1019 (CEUR-WS.org, 2013). Google Scholar
    • 10. X. Liu, M. Zhou, F. Wei, Z. Fu and X. Zhou, Joint inference of named entity recognition and normalization for tweets, Proc. 50th Annual Meeting of the Association for Computational Linguistics: Long Papers-Volume 1 (2012), pp. 526–535. Google Scholar
    • 11. K. Bontcheva and D. Rout, Making sense of social media streams through semantics: A survey, Sem. Web 5, 373 (2014). CrossrefGoogle Scholar
    • 12. G. K. Fabian, M. Suchanek and G. Weikum, Yago: A core of semantic knowledge unifying wordnet and wikipedia, 16th Int. World Wide Web Conf. (WWW 2007) (2007), pp. 697–706. Google Scholar
    • 13. G. A. Miller, Wordnet: A lexical database for english, Commun. ACM 38, 39 (1995). CrossrefGoogle Scholar
    • 14. R. Navigli and S. P. Ponzetto, BabelNet: The automatic construction, evaluation and application of a wide-coverage multilingual semantic network, Artif. Intell. 193, 217 (2012). CrossrefGoogle Scholar
    • 15. S. Auer, C. Bizer, G. Kobilarov, J. Lehmann, R. Cyganiak and Z. Ives, Dbpedia: A nucleus for a web of open data, 6th Int. Semantic Web Conf. (ISWC 2007) (Springer, 2007), pp. 722–735. Google Scholar
    • 16. K. Bollacker, C. Evans, P. Paritosh, T. Sturge and J. Taylor, Freebase: A collaboratively created graph database for structuring human knowledge, Proc. 2008 ACM SIGMOD Int. Conf. Management of data (2008), pp. 1247–1250. Google Scholar
    • 17. W. Shen, J. Wang and J. Han, Entity linking with a knowledge base: Issues, techniques, and solutions, IEEE Trans. Knowl. Data Eng. 27, 443 (2015). CrossrefGoogle Scholar
    • 18. W. Zhang, Y. C. Sim, J. Su and C. L. Tan, Entity linking with effective acronym expansion, instance selection, and topic modeling., IJCAI (2011), pp. 1909–1914. Google Scholar
    • 19. X. Han and J. Zhao, Nlpr_kbp in tac 2009 kbp track: A two-stage method to entity linking, Proc. Test Analysis Conf. 2009 (TAC 09) (2009). Google Scholar
    • 20. M. Dredze, P. McNamee, D. Rao, A. Gerber and T. Finin, Entity disambiguation for knowledge base population, Proc. 23rd Int. Conf. Computational Linguistics (2010), pp. 277–285. Google Scholar
    • 21. W. Zhang, J. Su, C. L. Tan and W. T. Wang, Entity linking leveraging: Automatically generated annotation, Proc. 23rd Int. Conf. Computational Linguistics (2010), pp. 1290–1298. Google Scholar
    • 22. S. Guo, M.-W. Chang and E. Kiciman, To link or not to link? a study on end-to-end tweet entity linking, HLT-NAACL (2013), pp. 1020–1030. Google Scholar
    • 23. A. Gattani, D. S. Lamba, N. Garera, M. Tiwari, X. Chai, S. Das, S. Subramaniam, A. Rajaraman, V. Harinarayan and A. Doan, Entity extraction, linking, classification, and tagging for social media: A wikipedia-based approach, Proc. VLDB Endowment 6, 1126 (2013). CrossrefGoogle Scholar
    • 24. S. Cucerzan, Large-scale named entity disambiguation based on wikipedia data, EMNLP-CoNLL (2007), pp. 708–716. Google Scholar
    • 25. D. Milne and I. H. Witten, Learning to link with wikipedia, Proc. 17th ACM Conf. Information and Knowledge Management (2008), pp. 509–518. Google Scholar
    • 26. D. Milne and I. H. Witten, An open-source toolkit for mining wikipedia, Artif. Intell. 194, 222 (2013). CrossrefGoogle Scholar
    • 27. R. L. Cilibrasi and P. M. Vitanyi, The google similarity distance, IEEE Trans. Knowl. Data Eng. 19, 370 (2007). CrossrefGoogle Scholar
    • 28. L. Ratinov, D. Roth, D. Downey and M. Anderson, Local and global algorithms for disambiguation to wikipedia, Proc. 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies-Volume 1 (2011), pp. 1375–1384. Google Scholar
    • 29. D. Ceccarelli, C. Lucchese, S. Orlando, R. Perego and S. Trani, Learning relatedness measures for entity linking, Proc. 22nd ACM Int. Conf. Information & Knowledge Management (2013), pp. 139–148. Google Scholar
    • 30. P. Basile, A. Caputo, G. Rossiello and G. Semeraro, Learning to rank entity relatedness through embedding-based features, Int. Conf. Applications of Natural Language to Information Systems (2016), pp. 471–477. Google Scholar
    • 31. X. Han, L. Sun and J. Zhao, Collective entity linking in web text: A graph-based method, Proc. 34th Int. ACM SIGIR Conf. Research and Development in Information Retrieval (2011), pp. 765–774. Google Scholar
    • 32. T. H. Haveliwala, Topic-sensitive pagerank, Proc. 11th Int. Conf. World Wide Web (2002), pp. 517–526. Google Scholar
    • 33. A. Moro, A. Raganato and R. Navigli, Entity linking meets word sense disambiguation: A unified approach, Trans. Assoc. Comput. Linguist. (TACL) 2, 231 (2014). CrossrefGoogle Scholar
    • 34. Y. Sun, L. Lin, D. Tang, N. Yang, Z. Ji and X. Wang, Modeling mention, context and entity with neural networks for entity disambiguation, Proc. Int. Joint Conf. Artificial Intelligence (IJCAI) (2015), pp. 1333–1339. Google Scholar
    • 35. Z. He, S. Liu, M. Li, M. Zhou, L. Zhang and H. Wang, Learning entity representation for entity disambiguation., 51st Annual Meeting of the Association for Computational Linguistics (2013), pp. 30–34. Google Scholar
    • 36. H. Huang, L. Heck and H. Ji, Leveraging deep neural networks and knowledge graphs for entity disambiguation, arXiv: 1504.07678. Google Scholar
    • 37. M. Francis-Landau, G. Durrett and D. Klein, Capturing semantic similarity for entity linking with convolutional neural networks, arXiv:1604.00734. Google Scholar
    • 38. M. A. Yosef, J. Hoffart, Y. Ibrahim, A. Boldyrev and G. Weikum, Adapting aida for tweets, Making Sense of Microposts (# Microposts2014) (2014). Google Scholar
    • 39. M. A. Yosef, J. Hoffart, I. Bordino, M. Spaniol and G. Weikum, Aida: An online tool for accurate disambiguation of named entities in text and tables, Proc. VLDB Endowment 4, 1450 (2011). CrossrefGoogle Scholar
    • 40. U. Scaiella, M. Barbera, S. Parmesan, G. Prestia, E. Del Tessandoro and M. Verı, Datatxt at# microposts2014 challenge, Making Sense of Microposts (# Microposts2014) (2014), pp. 1–15. Google Scholar
    • 41. P. Ferragina and U. Scaiella, Fast and accurate annotation of short texts with wikipedia pages, IEEE Softw. 1, 70 (2012). CrossrefGoogle Scholar
    • 42. H. Barathi Ganesh, N. Abinaya, M. Anand Kumar, R. Vinayakumar and K. Soman, Amrita-cen@ neel: Identification and linking of twitter entities, Making Sense of Microposts (# Microposts2015) (2015). Google Scholar
    • 43. Z. Guo and D. Barbosa, Robust entity linking via random walks, Proc. 23rd ACM Int. Conf. Conf. Information and Knowledge Management (2014), pp. 499–508. Google Scholar
    • 44. Z. Guo and D. Barbosa, Entity recognition and linking on tweets with random walks, Making Sense of Microposts (# Microposts2015) (2015). Google Scholar
    • 45. C. Gârbacea, D. Odijk, D. Graus, I. Sijaranamual and M. de Rijke, Combining multiple signals for semanticizing tweets: University of amsterdam at# microposts2015, Making Sense of Microposts (# Microposts2015) (2015), pp. 59–60. Google Scholar
    • 46. D. Graus, D. Odijk, M. Tsagkias, W. Weerkamp and M. De Rijke, Semanticizing search engine queries: The university of amsterdam at the erd 2014 challenge, Proc. first Int. Workshop on Entity Recognition & Disambiguation (2014), pp. 69–74. Google Scholar
    • 47. J. Waitelonis and H. Sack, Named entity linking in# tweets with kea. Google Scholar
    • 48. H. Sack, The journey is the reward-towards new paradigms in web search, Int. Conf. Business Information Systems (2015), pp. 15–26. Google Scholar
    • 49. D. R. K. W. Amparo E. Cano, Daniel PreoÂÿtiuc-Pietro and A.-S. Dadzie, 6th Workshop on Making Sense of Microposts (# microposts2016), Word Wide Web Conf. (WWWâĂŹ16) Companion (ACM). Google Scholar
    • 50. P. Basile, A. Caputo, G. Semeraro and F. Narducci, Uniba: Exploiting a distributional semantic model for disambiguating and linking entities in tweets Making Sense of Microposts (# Microposts2015) (2015). Google Scholar
    • 51. I. Yamada, H. Takeda and Y. Takefuji, An end-to-end entity linking approach for tweets, Making Sense of Microposts (# Microposts2015) (2015). Google Scholar
    • 52. M. B. Habib, M. Van Keulen and Z. Zhu, Named entity extraction and linking challenge: University of twente at# microposts2014, Making Sense of Microposts (# Microposts2014), (2014). Google Scholar
    • 53. C. Li, J. Weng, Q. He, Y. Yao, A. Datta, A. Sun and B.-S. Lee, Twiner: named entity recognition in targeted twitter stream, Proc. 35th Int. ACM SIGIR Conf. Research and Development in Information Retrieval (2012), pp. 721–730. Google Scholar
    • 54. R. Bansal, S. Panem, P. Radhakrishnan, M. Gupta and V. Varma, Linking entities in# microposts, Making Sense of Microposts (# Microposts2014) (2014). Google Scholar
    • 55. Q. Wu, C. J. Burges, K. M. Svore and J. Gao, Adapting boosting for information retrieval measures, Inf. Ret. 13, 254 (2010). CrossrefGoogle Scholar
    • 56. M.-W. Chang, B.-J. Hsu, H. Ma, R. Loynd and K. Wang, E2e: An end-to-end entity linking system for short and noisy text, Making Sense of Microposts (# Microposts2014) (2014). Google Scholar
    • 57. A. E. Cano, G. Rizzo, A. Varga, M. Rowe, M. Stankovic and A.-S. Dadzie, Making sense of microposts:(# microposts2014) named entity extraction & linking challenge, CEUR Workshop Proc (2014), pp. 54–60. Google Scholar
    • 58. X. Liu, Y. Li, H. Wu, M. Zhou, F. Wei and Y. Lu, Entity linking for tweets, 51st Annual Meeting of the Association for Computational Linguistics (ACL, 2013), pp. 1304–1311. Google Scholar
    • 59. Y. Yang and M.-W. Chang, S-mart: Novel tree-based structured learning algorithms applied to tweet entity linking, Proc. Association for Computational Linguistics (2015), pp. 504–513. Google Scholar
    • 60. W. Shen, J. Wang, P. Luo and M. Wang, Linking named entities in tweets with knowledge base via user interest modeling, Proc. 19th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining (2013), pp. 68–76. Google Scholar
    • 61. Y. Guo, B. Qin, T. Liu and S. Li, Microblog entity linking by leveraging extra posts, Proc. 2013 Conf. Empirical Methods in Natural Language Processing (2013), pp. 863–868. Google Scholar
    • 62. S. Jeong, Y. Park, S. Kang and J. Seo, Improved entity linking with user history and news articles, 9th Pacific Asia Conf. on Language, Information and Computation (2015), pp. 19–26. Google Scholar
    • 63. Y. Fang and M.-W. Chang, Entity linking on microblogs with spatial and temporal signals, Trans. Assoc. Comput. Linguist. 2, 259 (2014). CrossrefGoogle Scholar
    • 64. G. Rizzo, A. C. Basave, B. Pereira, A. Varga, M. Rowe, M. Stankovic and A. Dadzie, Making sense of microposts (# microposts2015) named entity recognition and linking (neel) challenge, 5th Workshop on Making Sense of Microposts (# Microposts2015) (2015), pp. 44–53. Google Scholar
    • 65. X. Luo, On coreference resolution performance metrics, Proc. Conf. Human Language Technology and Empirical Methods in Natural Language Processing (2005), pp. 25–32. Google Scholar
    • 66. D. Carmel, M.-W. Chang, E. Gabrilovich, B.-J. P. Hsu and K. Wang, Erd’14: entity recognition and disambiguation challenge, ACM SIGIR Forum (2), 63 (2014). CrossrefGoogle Scholar
    • 67. P. Basile, A. Caputo and G. Semeraro, Entity linking for italian tweets, Proc. Second Italian Conf. Computational Linguistics CLiC-it 2015, eds. C. Bosco, S. Tonelli and F. M. Zanzotto (Accademia University Press, 2015), pp. 36–40. Google Scholar
    • 68. C. Bizer, T. Heath and T. Berners-Lee, Linked data-the story so far, Semantic Services, Interoperability and Web Applications: Emerging Concepts, 205 (2009). CrossrefGoogle Scholar
    • 69. B. J. Jansen, M. Zhang, K. Sobel and A. Chowdury, Twitter power: Tweets as electronic word of mouth, J. Am. Soc. Inf. Sci. Technol. 60, 2169 (2009). CrossrefGoogle Scholar
    • 70. S. Batra and D. Rao, Entity based sentiment analysis on twitter, Science 9, 1 (2010). Google Scholar
    • 71. A. Tumasjan, T. Sprenger, P. Sandner and I. Welpe, Predicting elections with twitter: What 140 characters reveal about political sentiment (2010). Google Scholar
    • 72. K. Starbird and L. Palen, (how) will the revolution be retweeted?: Information diffusion and the 2011 egyptian uprising, Proc. ACM 2012 Conf. Computer Supported Cooperative Work, CSCW ’12 (ACM, New York, NY, USA, 2012), pp. 7–16. Google Scholar
    • 73. M. D. Michael and J. Paul, You are what you tweet: Analyzing twitter for public health, Proc. Fifth Int. AAAI Conf. Weblogs and Social Media (2011), pp. 265–272. Google Scholar
    • 74. S. Asur and B. A. Huberman, Predicting the future with social media, Proc. 2010 IEEE/WIC/ACM Int. Conf. Web Intelligence and Intelligent Agent Technology — Volume 01, WI-IAT ’10 (IEEE Computer Society, Washington, DC, USA, 2010), pp. 492–499. Google Scholar
    Remember to check out the Most Cited Articles!

    Notable titles in semantic computing