Projects / Programmes
Speech copora and tools for the Slovenian language
Code |
Science |
Field |
Subfield |
2.06.05 |
Engineering sciences and technologies |
Systems and cybernetics |
Application areas |
Code |
Science |
Field |
T121 |
Technological sciences |
Signal processing |
H351 |
Humanities |
Phonetics, phonology |
speech corpora, spontaneous speech, diphone speech data, lexicon, speech segmentation
Organisations (2)
, Researchers (5)
0106 Jožef Stefan Institute
no. |
Code |
Name and surname |
Research area |
Role |
Period |
No. of publicationsNo. of publications |
1. |
05023 |
PhD Tomaž Erjavec |
Linguistics |
Researcher |
1998 - 2000 |
694 |
1538 University of Ljubljana, Faculty of Electrical Engineering
no. |
Code |
Name and surname |
Research area |
Role |
Period |
No. of publicationsNo. of publications |
1. |
11805 |
PhD Simon Dobrišek |
Computer science and informatics |
Researcher |
1998 - 2000 |
296 |
2. |
09580 |
PhD France Mihelič |
Computer science and informatics |
Head |
1998 - 2000 |
313 |
3. |
01938 |
PhD Nikola Pavešič |
Systems and cybernetics |
Researcher |
1998 - 2000 |
659 |
4. |
12000 |
PhD Jerneja Žganec Gros |
Computer science and informatics |
Researcher |
1998 - 2000 |
292 |
Abstract
Speech databases are essential for building language technology applications in a given language. The aim of the project is to set up a relevant speech database for the Slovenian language. The work on the project includes: speech data acquisition and documentation, development of user ineterfaces for viewing, editing and annotation of the speech data and lexicon building. The speech database will consist of the following spoken recordings: isolated words, read continuous speech, spontaneous speech and a diphone database.