Speech copora and tools for the Slovenian language

Code

T2-0409 (C) - included in ARIS records

Head

PhD France Mihelič

Period

7/1/1998 - 6/30/2000

Science

Engineering sciences and technologies (4)
Humanities (1)

Reseacher status

Researcher (5)
Junior expert or technical associate (0)

Education

Doctoral degree (5)

Sex

Woman (1)
Man (4)

Status

Employed at RO and RRD (3)
Retired (2)

No. of publications

100–999 (5)

Projects / Programmes source: ARIS

Speech copora and tools for the Slovenian language

Research activity

Code	Science	Field	Subfield
2.06.05	Engineering sciences and technologies	Systems and cybernetics	Application areas

Code	Science	Field
T121	Technological sciences	Signal processing
H351	Humanities	Phonetics, phonology

Keywords

speech corpora, spontaneous speech, diphone speech data, lexicon, speech segmentation

Evaluation (metodology)

Evaluation of bibliographic research performance indicators according to ARIS methodology

Citations Citations for bibliographic records in COBIB.SI that are linked to records in citation databases

Organisations (2) , Researchers (5)

0106 Jožef Stefan Institute

no.	Code	Name and surname	Research area	Role	Period	No. of publicationsNo. of publications
1.	05023	PhD Tomaž Erjavec	Linguistics	Researcher	1998 - 2000	694

1538 University of Ljubljana, Faculty of Electrical Engineering

no.	Code	Name and surname	Research area	Role	Period	No. of publicationsNo. of publications
1.	11805	PhD Simon Dobrišek	Computer science and informatics	Researcher	1998 - 2000	296
2.	09580	PhD France Mihelič	Computer science and informatics	Head	1998 - 2000	313
3.	01938	PhD Nikola Pavešič	Systems and cybernetics	Researcher	1998 - 2000	659
4.	12000	PhD Jerneja Žganec Gros	Computer science and informatics	Researcher	1998 - 2000	292

Abstract

Speech databases are essential for building language technology applications in a given language. The aim of the project is to set up a relevant speech database for the Slovenian language. The work on the project includes: speech data acquisition and documentation, development of user ineterfaces for viewing, editing and annotation of the speech data and lexicon building. The speech database will consist of the following spoken recordings: isolated words, read continuous speech, spontaneous speech and a diphone database.

Speech copora and tools for the Slovenian language

Views history

Favourite

Speech copora and tools for the Slovenian language

FRASCATI classification

CERIF classification

Confirmation required

Views history

Favourite