PARAMETRIZED EVENT ANALYSIS FROM SOCIAL NETWORKS
DOI:
https://doi.org/10.37943/TSYV3590Keywords:
event-detection with multi-topic semi-supervision, SEDTWik, social media, dictionary, Events2012.Abstract
The growth of data in social networks facilitate demand for data analysis. The field of event detection is of increasing interest to researchers. Events from real life are actively discussed in the virtual space. Event detection results can be used in a variety of applications, from digital marketing to collecting data about natural disasters. Thereby, researchers face the emergence of new algorithms along with the improvement of existing solutions in the event detection field. This paper proposes improvements to the SEDTWik (Segment-based Event Detection from Tweets using Wikipedia) algorithm. The SEDTWik algorithm is designed to detect events without contextual guidance. The overall SEDTWik detection process excludes the perspective of a topic, or multi-topic, guided (or semi-supervised) event detection approach. As a result, some interesting narrowly focused events are not detected as they are weakly relevant in a broader context (e.g., Wikipedia) although acquiring relevance within a conditioned context. Therefore, there is a need for an adaptive perspective where data is to be analysed against a set of narrower topics of interest. This paper shows that SEDTWik gains expressive power after being extended with multi-topic semi-supervision. The evaluation of the current proposal uses the well-known corpora with labeled events, Events2012. In the Events2012 dataset used notation category for events, meaning that events are combined by a certain topic. SEDTWik with topic dictionaries was checked across all categories. In the main part of the article, it is also explained the process of topic dictionary construction from Events2012 labeled tweets. At this stage of the research, in all tasks unigrams were used. SEDTWik with dictionaries showed improved accuracy, and more events were found within a certain category.
References
Mussina, A. B., Aubakirov, S. S., & Trigo, P. (2021). An Architecture for Real-Time Massive Data Extraction from Social Media. Communications in Computer and Information Science, 138–145. https://doi.org/10.1007/978-3-030-78759-2_11
Morabia, K., Bhanu Murthy, N. L., Malapati, A., & Samant, S. (2019). SEDTWik: segmentation-based event detection from tweets using Wikipedia. Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Student Research Workshop, 77–85. https://doi.org/10.18653/v1/n19-3011
Li, C., Sun, A., & Datta, A. (2012). Twevent: segment-based event detection from tweets. Proceedings of the 21st ACM International Conference on Information and Knowledge Management - CIKM ’12, 155–164. https://doi.org/10.1145/2396761.2396785
McMinn, A. J., Moshfeghi, Y., & Jose, J. M. (2013). Building a large-scale corpus for evaluating event detection on twitter. Proceedings of the 22nd ACM International Conference on Conference on Information & Knowledge Management - CIKM ’13, 409–418. https://doi.org/10.1145/2505515.2505695
Bekoulis, G., Deleu, J., Demeester, T. & Develder, C. (2019). Sub-event detection from twitter streams as a sequence labeling problem. arXiv preprint arXiv:1903.05396
Chen, X., Zhou, X., Sellis, T., & Li, X. (2018). Social event detection with retweeting behavior correlation. Expert Systems with Applications, 114, 516–523. https://doi.org/10.1016/j.eswa.2018.08.022
Lu, X. S., Zhou, M., Qi, L., & Liu, H. (2019). Clustering-Algorithm-Based Rare-Event Evolution Analysis via Social Media Data. IEEE Transactions on Computational Social Systems, 6(2), 301–310. https://doi.org/10.1109/tcss.2019.2898774
Goswami, A., & Kumar, A. (2016). A survey of event detection techniques in online social networks. Social Network Analysis and Mining, 6(1). https://doi.org/10.1007/s13278-016-0414-1
Cui, W., Wang, P., Du, Y., Chen, X., Guo, D., Li, J., & Zhou, Y. (2017). An algorithm for event detection based on social media data. Neurocomputing, 254, 53–58. https://doi.org/10.1016/j.neucom.2016.09.127
Papers with Code - The latest in Machine Learning. (2021, August 25). Papers with Code. Retrieved August 25, 2021, from https://paperswithcode.com/
Hamborg, F., Breitinger, C. & Gipp, B. (2019). Giveme5w1h: A universal system for extracting main events from news articles. arXiv preprint arXiv:1909.02766
Du, X. & Cardie, C. (2020). Event extraction by answering (almost) natural questions. arXiv preprint arXiv:2004.13625
Liu, X., Luo, Z. & Huang, H. (2018). Jointly multiple events extraction via attention-based graph information aggregation. arXiv preprint arXiv:1809.09078.
ENwiki-latest-all-titles. (2021). Wikimedia Downloads. Retrieved August 26, 2021, from http://dumps.wikimedia.org/enwiki/latest/enwiki-latest-all-titles-in-ns0.gz
Wikipedia Keyphraseness. (2021). Aixin’s Homepage. Retrieved August 26, 2021, from https://personal.ntu.edu.sg/axsun/datasets.html
Mussina, A. & Aubakirov, S. (2017) Dictionary extraction based on statistical data. KazNU Bulletin. Mathematics, Mechanics, Computer Science Series, 94(2), 72–82.
Barr, I. (2016, April 20). Heavy Metal and Natural Language Processing - Part 1. Degenerate State. Retrieved September 20, 2016, from http://www.degeneratestate.org/posts/2016/Apr/20/heavy-metal-and-natural-language-processing-part-1/
SEDTWik-Event-Detection-from-Tweets. (2020, July 13). Github. Retrieved August 26, 2021, from https://github.com/kevalmorabia97/SEDTWik-Event-Detection-from-Tweets
Downloads
Published
How to Cite
Issue
Section
License
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who publish a manuscript in this journal agree to the following terms:
- The authors reserve the right to authorship of their work and transfer to the journal the right of first publication under the terms of the Creative Commons Attribution License, which allows others to freely distribute the published work with a mandatory link to the the original work and the first publication of the work in this journal.
- Authors have the right to conclude independent additional agreements that relate to the non-exclusive distribution of the work in the form in which it was published by this journal (for example, to post the work in the electronic repository of the institution or publish as part of a monograph), providing the link to the first publication of the work in this journal.
- Other terms stated in the Copyright Agreement.