Computer-assisted multilingual news discourse analysis with contextual embeddings
Code | Science | Field | Subfield
6.05.02 | Humanities | Linguistics | Theoretical and applied linguistics

Code | Science | Field
6.02 | Humanities | Languages and Literature
Keywords: Natural language processing, critical discourse analysis, news analysis, word embeddings, deep learning, diachronic news analysis, comparative news analysis, metaphor
Organisations (3), Researchers (23)
0106 Jožef Stefan Institute
0582 University of Ljubljana, Faculty of Social Sciences
No. | Code | Name and surname | Research area | Role | Period | No. of publications
1. | 52077 | Jan Kostanjevec | Political science | Young researcher | 2020 - 2021 | 20
2. | 50488 | PhD Primož Medved | Sociology | Researcher | 2021 | 47
3. | 38127 | PhD Nina Perger | Sociology | Researcher | 2020 - 2023 | 167
4. | 22331 | PhD Maruša Pušnik | Political science | Researcher | 2020 - 2021 | 396
5. | 27578 | PhD Andreja Vezovnik | Culturology | Researcher | 2020 - 2023 | 266
1539 University of Ljubljana, Faculty of Computer and Information Science
No. | Code | Name and surname | Research area | Role | Period | No. of publications
1. | 55352 | Matic Kavaš | | Technical associate | 2021 | 0
2. | 15295 | PhD Marko Robnik Šikonja | Computer science and informatics | Researcher | 2020 - 2023 | 473
3. | 50769 | PhD Tadej Škvorc | Computer science and informatics | Researcher | 2022 - 2023 | 18
4. | 56007 | Aleš Žagar | Computer science and informatics | Researcher | 2021 - 2023 | 35
Abstract
The ability to analyse and understand the news media has never been more important. Mass online content surrounds us, with misinformation and bias rife, leading to widespread distrust in broadcast media and increasing reliance on unattested sources and social media. Qualitative discourse analysis techniques are designed precisely to uncover and understand the biases, viewpoints and framing techniques involved, but they are limited by their reliance on complex and time-consuming manual analysis. At the same time, computational natural language processing (NLP) techniques have been developed that can accurately process and categorise text in a range of ways, but they are rarely applied to critical discourse analysis tasks or to less-resourced languages such as Slovene. This project will bridge the gap between the two by adapting recent NLP tools to Slovene, developing new methods to help identify important discourse analysis phenomena, supporting multilingual news analysis, and working with discourse analysts to produce tools and visualisation methods that they can use to augment and improve their work.