Developing Insights from Social Media using Semantic Lexical Chains to Mine Short Text Structures
Abstract
Social media is increasingly being used for communication by individuals and organizations. Social media stores vast amounts of publicly available data that provides a rich source of information and insights. Often, social media users can easily infer meaning from short text such as microblogs and Facebook posts because they understand the context and terminology used. Although automated data-mining can be effective for gaining insights from text data, a significant challenge is to accurately infer meaning from social media text derived from a single social media account. This is difficult because social media communication uses very short, or sparse, text, which yields a relatively small sample of usable words for analysis. Furthermore, interpreting the contextual meaning from a relatively small set of words is challenging. This research proposes a methodology for extracting semantic lexical chains from frequently occurring words in a single social media account and using these chains to mine short text structures to infer the overall themes of the user. The methodology is based on a proposed clustering algorithm and illustrated with examples from Facebook posts. The algorithm is tested and illustrated by comparing it to existing work and further applying it to a variety of news posts. This methodology could be useful for gaining decision-making insights from social media, or other online forms with short or sparse text.
Recommended Citation
Chua, C. E., Storey, V. C., Li, X., & Kaul, M. (2019). Developing Insights from Social Media using Semantic Lexical Chains to Mine Short Text Structures. Decision Support Systems, 127 Elsevier B.V..
The definitive version is available at https://doi.org/10.1016/j.dss.2019.113142
Department(s)
Business and Information Technology
Keywords and Phrases
Lexical chain; Semantic clustering; Short text; Social media; Text mining; Word sense disambiguation
International Standard Serial Number (ISSN)
0167-9236
Document Type
Article - Journal
Document Version
Citation
File Type
text
Language(s)
English
Rights
© 2019 Elsevier B.V., All rights reserved.
Publication Date
01 Dec 2019