Can WAR and CRISIS be predicted from linguistic data? Preliminary proposal for a linguometric procedure

Michał Szczyszek


In this article, I discuss a linguometric procedure that I am developing, intended to forecast upcoming changes in extra-linguistic reality. The procedure is modelled on econometric analyses. It rests on a cognitive assumption about the linguistic picture of the world (more precisely here: the textual worldview) of a given communicative community (here: the Polish one). On this basis, I conducted corpus-based lexical frequency analyses of three exemplary historical events: a) the global economic crisis of 2008–2009; b) the outbreak of World War II in 1939; c) the Russian aggression against Ukraine in 2022. In the corpus data, I tracked the lexical frequency symptoms of the approach of these events. The statistical data (frequency and textual frequency of synonymous words from the thematic fields CRISIS and WAR) indicate that language ‘foretells’ an imminent change in reality 2 to 4 years before it occurs, which suggests that future events can be predicted on the basis of linguistic data. The proposed linguometric procedure therefore seems able to complement econometric, political science, sociological, and many other analyses. I also think that the method can be implemented as a digital linguometric tool operating on language corpora, which, for the purposes of linguometrics, should be continuously expanded. Such a tool could be developed by a joint computer science and linguistics team.
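The kind of frequency tracking described in the abstract can be illustrated with a minimal sketch. It is not the author's procedure: the word list `CRISIS_FIELD`, the helper names, the three-year baseline window, and the 1.5 ratio threshold are all assumptions made for illustration, and the toy corpus is synthetic rather than drawn from NKJP, KDP, or Monco.

```python
from collections import Counter

# Hypothetical thematic field: a few Polish words related to CRISIS.
# The actual synonym set used in the article is not given here.
CRISIS_FIELD = {"kryzys", "recesja", "zapaść", "krach"}


def relative_frequency(tokens_by_year, field):
    """Return {year: occurrences of field words per million tokens}."""
    result = {}
    for year, tokens in tokens_by_year.items():
        counts = Counter(token.lower() for token in tokens)
        hits = sum(counts[word] for word in field)
        result[year] = 1_000_000 * hits / len(tokens) if tokens else 0.0
    return result


def flag_rises(freq_by_year, window=3, ratio=1.5):
    """Flag years whose frequency exceeds `ratio` times the mean
    frequency of the preceding `window` years (an assumed heuristic)."""
    years = sorted(freq_by_year)
    flagged = []
    for i in range(window, len(years)):
        previous = years[i - window:i]
        baseline = sum(freq_by_year[y] for y in previous) / window
        if baseline > 0 and freq_by_year[years[i]] > ratio * baseline:
            flagged.append(years[i])
    return flagged


def toy_year(hits, size=1000):
    """Build a synthetic year of `size` tokens containing `hits` field words."""
    return ["kryzys"] * hits + ["słowo"] * (size - hits)


# Synthetic data only: the counts below are invented for the example.
toy_corpus = {
    2003: toy_year(2),
    2004: toy_year(2),
    2005: toy_year(2),
    2006: toy_year(5),
    2007: toy_year(6),
}

freqs = relative_frequency(toy_corpus, CRISIS_FIELD)
signal_years = flag_rises(freqs)
print(freqs)
print(signal_years)
```

On real corpora the tokens would need lemmatisation, since Polish is highly inflected, and the threshold would have to be calibrated against known events; both steps are left out of this sketch.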


How to Cite
Szczyszek, M. (2024) “Can WAR and CRISIS be predicted from linguistic data? Preliminary proposal for a linguometric procedure”, Annales Universitatis Paedagogicae Cracoviensis. Studia Linguistica, (19), pp. 245–273. doi: 10.24917/20831765.19.18.


List of abbreviations

KDP – Korpus Dyskursu Parlamentarnego (accessed: 1 March 2024).

Monco – Wyszukiwarka korpusowa Monco (accessed: 1 March 2024).

NKJP – Narodowy Korpus Języka Polskiego (accessed: 1 March 2024).

Słownik synonimów – Słownik synonimów (accessed: 1 March 2024).

Bartmiński J., 2006, Językowe podstawy obrazu świata, Lublin.

Batko‑Tokarz B., 2019, Tematyczny podział słownictwa współczesnego języka polskiego – teoria, praktyka, leksykografia, Kraków.

Borawski S., 2005, Podstawy idei poznawczej studiów nad dziejami używania języka. Esej o diachronii, [in:] Rozprawy o historii języka polskiego, ed. S. Borawski, Zielona Góra, pp. 13–61.

Graliński F., 2019, Against the arrow of time. Theory and practice of mining massive corpora of Polish historical texts for linguistic and historical research, Poznań.

Lakoff G., Johnson M., 1980, Metaphors we live by, Chicago.

Lewandowska‑Tomaszczyk B., 2009, Corpus linguistics. Computer tools, and applications – state of the art, Frankfurt am Main.

Ogrodniczuk M., 2018, Polish Parliamentary Corpus, [in:] Proceedings of the LREC 2018 Workshop ParlaCLARIN: Creating and Using Parliamentary Corpora, ed. D. Fišer, M. Eskevich, F. de Jong, Paris, pp. 15–19.

Pęzik P., 2020, Budowa i zastosowania korpusu monitorującego MoncoPL, „Forum Lingwistyczne”, 7, pp. 133–150.

Przepiórkowski A., Bańko M., Górski R.L., Lewandowska‑Tomaszczyk B. (eds.), 2012, Narodowy Korpus Języka Polskiego, Warszawa.

Zabrocki L., 1963, Wspólnoty komunikatywne w genezie i rozwoju języka niemieckiego, cz. I: Prehistoria języka niemieckiego, Wrocław–Warszawa–Kraków.