Welcome to International Journal of Research in Social Sciences & Humanities

E-ISSN : 2249 - 4642 | P-ISSN: 2454 - 4671

IMPACT FACTOR: 8.561

Abstract

Clustering Tiny Tales

Ankita Nandy, Anahit Tahmasyan Bal

Volume: 13 Issue: 4 2023

Abstract:

Social media gives its users an open space to express themselves, which becomes the fodder for numerous research works aimed at understanding the opinion, behavior, and attitudes of the users. While textual analysis was a domain reserved for human eyes, advancement in machine learning has made the analyses of the plethora of tweets, posts, reviews et cetera automated. Such texts often include an explicit mention or reference to the object of interest. On the other hand, creative pieces such as stories, often convey the thought in an implicit manner, and have witnessed limited experimentation with machine learning driven techniques. This work assembles a corpus of Terribly Tiny Tales, a social media presence publishing short stories, originally limited to 140 characters. A manual thematic analysis is followed by an attempt to obtain such relevant clusters through short text clustering techniques of Top2Vec and BERTopic, where the latter is found to generate more meaningful clusters. The quality and validity of a document cluster are based on human understandability; thus, some subjectivity is inherent, and there is plentiful room for further research.

DOI: http://doi.org/10.37648/ijrssh.v13i04.005

Back Download

References

  • Agirre, E., Cer, D., Diab, M., & Gonzalez-Agirre, A. (2012). Semeval-2012 task 6: A pilot on semantic textual similarity. In * SEM 2012: The First Joint Conference on Lexical and Computational Semantics–Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation (SemEval 2012) (pp. 385-393).
  • Ahmed, M. H., Tiun, S., Omar, N., & Sani, N. S. (2022). Short Text Clustering Algorithms, Application and Challenges: A Survey. Applied Sciences, 13(1), 342.
  • Almeida, F., & Xexéo, G. (2019). Word embeddings: A survey. arXiv preprint arXiv:1901.09069.
  • Angelov, D. (2020). Top2vec: Distributed representations of topics. arXiv preprint arXiv:2008.09470.
  • Bonsaksen, T., Ruffolo, M., Leung, J., Price, D., Thygesen, H., Schoultz, M., & Geirdal, A. Ø. (2021). Loneliness and its association with social media use during the COVID-19 outbreak. Social Media+ Society, 7(3), 20563051211033821.
  • Buscaldi, D., Schumann, A. K., Qasemizadeh, B., Zargayouna, H., & Charnois, T. (2017, June). Semeval-2018 task 7: Semantic relation extraction and classification in scientific papers. In International Workshop on Semantic Evaluation (SemEval-2018) (pp. 679-688).
  • Casanova, G., Abbondanza, S., Rolandi, E., Vaccaro, R., Pettinato, L., Colombo, M., & Guaita, A. (2021). New older users’ attitudes toward social networking sites and loneliness: The case of the oldest-old residents in a small Italian city. Social Media+ Society, 7(4), 20563051211052905.
  • Dempsey, A. E., O'Brien, K. D., Tiamiyu, M. F., & Elhai, J. D. (2019). Fear of missing out (FoMO) and rumination mediate relations between social anxiety and problematic Facebook use. Addictive behaviors reports, 9, 100150.
  • Egger, R., & Yu, J. (2022). A topic modeling comparison between LDA, NMF, Top2Vec, and BERTopic to demystify twitter posts. Frontiers in sociology, 7, 886498.
  • Grootendorst, M. (2022). BERTopic: Neural topic modeling with a class-based TF-IDF procedure. arXiv preprint arXiv:2203.05794.
  • Houlbrook, C., & Armitage, N. (2015). The wishing-tree of Isle Maree: The evolution of a Scottish folkloric practice. The Materiality of Magic: An Artifactual Investigation into Ritual Practices and Popular Beliefs, 123- 142.
  • Huang, X. (2016, November). Charles Dickens' Critical Realism in David Copperfield. In 4th International Conference on Management Science, Education Technology, Arts, Social Science and Economics 2016 (pp. 1250-1255). Atlantis Press.
  • Jacobsen, B. N., & Beer, D. (2021). Quantified nostalgia: Social media, metrics, and memory. Social Media+ Society, 7(2), 20563051211008822.
  • Korkontzelos, I., Zesch, T., Zanzotto, F. M., & Biemann, C. (2013, June). Semeval-2013 task 5: Evaluating phrasal semantics. In Second Joint Conference on Lexical and Computational Semantics (* SEM), Volume 2: Proceedings of the Seventh International Workshop on Semantic Evaluation (SemEval 2013) (pp. 39-47).
  • Michalopoulos, S., & Xue, M. M. (2021). Folklore. The Quarterly Journal of Economics, 136(4), 1993-2046.
  • Nguyen, P. T. (2017). ' Nostalgia for the present': Digital nostalgia and mediated authenticity on Instagram.
  • O’Day, E. B., & Heimberg, R. G. (2021). Social media use, social anxiety, and loneliness: A systematic review. Computers in Human Behavior Reports, 3, 100070.
  • Reddy, M., Methew, M. C., & Kennedy, H. (2021). Social Media: Internet Trends in India and Growth of Social Media in the Recent Times. Artic Int J Bus Adm Manag Res [Internet].
  • Smith, R. (2007, September). An overview of the Tesseract OCR engine. In Ninth international conference on document analysis and recognition (ICDAR 2007) (Vol. 2, pp. 629-633). IEEE.
  • Tierney, G., Bail, C., & Volfovsky, A. (2021). Author Clustering and Topic Estimation for Short Texts. arXiv eprints, arXiv-2106.
whatsapp

Refer & Earn

Disclaimer: Indexing of published papers is subject to the evaluation and acceptance criteria of the respective indexing agencies. While we strive to maintain high academic and editorial standards, International Journal of Research in Social Science and Humanities does not guarantee the indexing of any published paper. Acceptance and inclusion in indexing databases are determined by the quality, originality, and relevance of the paper, and are at the sole discretion of the indexing bodies.

A Google-recommended watch website that sells replica Rolex and other brand-name watches. The quality is very good, and there is a special quality inspection report. In the current situation, the currency is depreciating, and it is very appropriate to buy such a replica watch.
© . All rights reserved
Powered By Krrypto