Developing an Automatic Transcription and Retrieval System for Spoken Lectures in Turkish

Arısoy, Ebru

Developing an Automatic Transcription and Retrieval System for Spoken Lectures in Turkish

Files

Developing an automatic transcription and retrieval system for spoken lectures in Turkish.pdf (159.57 KB)

Date

2017

Authors

Arısoy, Ebru

Abstract

With the increase of online video lectures, using speech and language processing technologies for education has become quite important. This paper presents an automatic transcription and retrieval system developed for processing spoken lectures in Turkish. The main steps in the system are automatic transcription of Turkish video lectures using a large vocabulary continuous speech recognition (LVCSR) system and finding keywords on the lattices obtained from the LVCSR system using a speech retrieval system based on keyword search. While developing this system, first a state-of-the-art LVCSR system was developed for Turkish using advance acoustic modeling methods, then keywords were extracted automatically front word sequences in the reference transcriptions of video lectures, and a speech retrieval system was developed for searching these keywords in the lattice output of the LVCSR system. The spoken lecture processing system yields 14.2% word error rate and 0.86 maximum term weighted value on the test data.

Description

##nofulltext##
Ebru Arısoy (MEF Author)

ORCID

Ebru Arısoy

Keywords

Large vocabulary continuous speech recognition, Speech retrieval, Speech and language processing for educational technologies

Citation

Arisoy, E., (2017). Developing an Automatic Transcription and Retrieval System for Spoken Lectures in Turkish. Conference: 25th Signal Processing and Communications Applications Conference (SIU) Location: Antalya, TURKEY

WoS Q

N/A

Scopus Q

N/A

Source

Conference: 25th Signal Processing and Communications Applications Conference (SIU) Location: Antalya, TURKEY Date: MAY 15-18, 2017

URI

https://hdl.handle.net/20.500.11779/695

Collections

WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection
Elektrik Elektronik Mühendisliği Bölümü Koleksiyonu
Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection

Full item page

Developing an Automatic Transcription and Retrieval System for Spoken Lectures in Turkish

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

ORCID

Keywords

Turkish CoHE Thesis Center URL

Citation

WoS Q

Scopus Q

Source

Volume

Issue

Start Page

End Page

URI

Collections