The integration of sentiment analysis into sociolinguistic research: affective dimensions of spontaneous discourse

Authors

DOI:

https://doi.org/10.36517/ep16.97204

Keywords:

sentiment analysis, social variation, spontaneous speech, Madrid Oral Corpus

Abstract

This study explores the integration of sentiment analysis (SA) into sociolinguistic research, with the aim of examining how affective dimensions manifest in colloquial speech. Based on a theoretical review of cognitive and functional approaches to meaning, the study explores the possibility of quantifying relational and evaluative aspects of language using SA tools, particularly the Lingmotif software. This tool was applied to two spoken corpora of Madrid Spanish (CORMA and COLAm), allowing for an analysis of the emotional tone of interactions based on sociolinguistic variables such as gender, age, and communicative context. The results show that conversations in institutional contexts (such as customer service) exhibit more positive and intense affective profiles, while youth speech is characterized by a more frequent use of ironic expressions, taboo language, and banter strategies. Furthermore, a microdiachronic trend is observed: gender differences in emotional expression tend to diminish over time. The study concludes that AS, despite its limitations, is a valuable tool for detecting affective patterns in spontaneous speech and for expanding methodologies for analyzing social meaning in contemporary sociolinguistics.

Author Biography

  • Renata Enghels , Ghent University

    Professor of Hispanic and General Romance Linguistics at Ghent University (Belgium).

References

ACUÑA FERREIRA, V. Gender and expletives as discourse markers: some uses of Joder in young women’s interactions in Spanish and Galician. Feminismo/s, v. 38, p. 53-83, 2021.

BLOMMAERT, J. Discourse: A Critical Introduction. Cambridge: Cambridge University Press, 2005.

BROWN, T. B. et al. Language models are few-shot learners. arXiv, 2020. Disponível em: <https://arxiv.org/abs/2005.14165>. Acesso em: 31 mar. 2025.

BUCHOLTZ, M. Why be normal? Language and identity practices in a community of nerd girls. Language in Society, v. 28, n. 2, p. 203-223, 1999.

COUPLAND, N. Style: Language variation and identity. Cambridge: Cambridge University Press, 2007.

COUPLAND, N.; COUPLAND, J. Discourses of ageism and anti-ageism. Journal of Aging Studies, v. 7, n. 3, p. 279-301, 1993.

DEJAEGHERE, T.; SINGH, P; LEFEVER, E.; BIRKHOLZ, J. Exploring aspect-based sentiment analysis methodologies for literary-historical research purposes. In: SPRUGNOLI, R.; PASSAROTTI, M. Proceedings of the Third Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA) @ LREC-COLING-2024, 2024. Torino: ELRA and ICCL. Disponível em: <https://aclanthology.org/2024.lt4hala-1.16/>. Acesso em: 31 mar. 2025.

DEVLIN, J.; CHANG, M.; LEE, K. ; TOUTANOVA, K. BERT: pre-training of deep bidirectional transformers for language understanding. In: Proceedings of the 2019 Conference of the North. Minneapolis: Association for Computational Linguistics, 2019. Disponível em: <http://aclweb.org/anthology/N19-1423>. Acesso em: 31 mar. 2025.

ECKERT, P. Linguistic Variation as Social Practice: The Linguistic Construction of Identity in Belten High. Oxford: Blackwell, 2000.

ENGHELS, R. Pragmatic markers and sentiment analysis: toward an integrated model of meaning in contemporary colloquial Spanish. Revista Diadorim, v. 26, n. 3, e67788, 2024. Disponível em: <https://doi.org/10.35520/diadorim.2024.v26n3a67788>. Acesso em: 21 jan. 2026.

ENGHELS, R.; De Latte, F.; Roels, L. El Corpus Oral de Madrid (CORMA): materiales para el estudio (socio)lingüístico del español coloquial actual. Zeitschrift für Katalanistik, v. 33, p. 45-76, 2020.

FAIRCLOUGH, N. Discourse and Social Change. Cambridge: Polity Press; Blackwell, 1992.

GEERAERTS, D. Theories of Lexical Semantics. Oxford: Oxford University Press, 2010.

GHEZZI, C. Social indexicality and pragmatic change in the late 20th century: Cioè and tipo in Italian radio phone-ins. In: PONS BORDERÍA, S.; SALAMEH JIMÉNEZ, S. Language Change in the 20th Century: Exploring Microdiachronic Evolutions in Romance Languages. Amsterdam: John Benjamins, 2024. p. 158-187.

GLYNN, D. Corpus-driven cognitive semantics: quantitative approaches to synonymy. In: GLYNN, D.; FISCHER, K. Quantitative Methods in Cognitive Semantics: Corpus-Driven Approaches. Berlin: Mouton de Gruyter, 2010. p. 77-138.

GUMPERZ, J. Discourse Strategies. Cambridge: Cambridge University Press, 1982.

HALLIDAY, M. A. K.; MATTHIESSEN, C. An Introduction to Functional Grammar. London: Edward Arnold, 2004.

HANCOCK, A.; WILDER STUTTS, H.; BASS, A. Perceptions of gender and femininity based on language: implications for transgender communication therapy. Language and Speech, v. 58, n. 3, p. 315-333, 2015.

HOLMES, J. Women, Men and Politeness. Harlow: Addison Wesley Longman, 1995.

LANGACKER, R. W. Foundations of Cognitive Grammar: Descriptive Application. Stanford: Stanford University Press, 1991.

MILLS, S. Gender Matters: Feminist Linguistic Analysis. London; Bristol: Equinox, 2012.

MORENO-ORTIZ, A. Lingmotif: una herramienta de análisis de sentimiento enfocada en el usuario. Procesamiento de Lenguaje Natural, v. 58, p. 133-140, 2017.

MORENO-ORTIZ, A. Making Sense of Large Social Media Corpora: Keywords, Topics, Sentiment, and Hashtags in the Coronavirus Twitter Corpus. Cham: Springer Nature Switzerland, 2024.

MORENO-ORTIZ, A. The linguist’s role in sentiment analysis: from knowledge provider to data annotator. In: MACI, S.; GAROFALO, G. Investigating Discourse and Text: Corpus-Assisted Analytical Perspectives. Bern: Peter Lang, 2023. p. 25-54.

MULAC, A. et al. Empirical support for the Gender-as-Culture Hypothesis: An intercultural analysis of male/female language differences. Human Communication Research, v. 27, p. 121-152, 2001.

PANG, B.; LEE, L. Opinion mining and sentiment analysis. Foundations and Trends® in Information Retrieval, v. 2, n. 1-2, p. 1-135, 2008.

SILVERSTEIN, M. Indexical order and the dialectics of sociolinguistic life. Language and Communication, v. 23, n. 3-4, p. 193-229, 2003.

SILVERSTEIN, M. Pragmatic indexing. In: MEY, J. L.; BROWN, K. Concise Encyclopaedia of Pragmatics. Amsterdam: Elsevier, 2009. p. 756-759.

STENSTRÖM, A.-B.; ANDERSEN, G.; HASUND, I. K. Trends in Teenage Talk: Corpus Compilation, Analysis and Findings. Amsterdam: John Benjamins, 2002.

TAGLIAMONTE, S. A. Teen Talk: The Language of Adolescents. Cambridge: Cambridge University Press, 2016.

TANNEN, D. Conversational Style: Analyzing Talk Among Friends. Norwood: Ablex Publishing, 1984.

Published

2026-06-24

Issue

Section

Sociolinguistics: other interfaces, new applications

How to Cite

The integration of sentiment analysis into sociolinguistic research: affective dimensions of spontaneous discourse. Entrepalavras, [S. l.], v. 16, p. e97204, 2026. DOI: 10.36517/ep16.97204. Disponível em: https://periodicos.ufc.br/entrepalavras/article/view/97204. Acesso em: 25 jun. 2026.