Communications - Scientific Letters of the University of Zilina 2013, 15(11):124-128 | DOI: 10.26552/com.C.2013.2A.124-128

Building of Broadcast News Database for Evaluation of the Automated Subtitling Service

Matus Pleva1, Jozef Juhar1
1 Department of Electronics and Multimedia Communications, Faculty of Electrical Engineering and Informatics, Technical University of Kosice, Slovakia

This paper describes the process of recording, annotation, correction and evaluation of the new Broadcast News (BN) speech database named KEMT-BN2, as an extension for our older KEMT-BN1 and COST-278 databases used for automatic Slovak continuous speech recognition development. The database utilisation and statistics are presented. This database was prepared for evaluation of the automated BN transcription system, developed in our laboratory, which is mainly used for subtitle generation for recorded BN shows. The speech database is the key part of the acoustic models training for specific domains and also for speaker and anchor adapted models creation.

Keywords: broadcast news, segmentation, speech recognition, transcriber

Published: July 31, 2013  Show citation

ACS AIP APA ASA Harvard Chicago Chicago Notes IEEE ISO690 MLA NLM Turabian Vancouver
Pleva, M., & Juhar, J. (2013). Building of Broadcast News Database for Evaluation of the Automated Subtitling Service. Communications - Scientific Letters of the University of Zilina15(2A), 124-128. doi: 10.26552/com.C.2013.2A.124-128
Download citation

References

  1. JUHAR, J., CIZMAR, A., RUSKO, M., TRNKA, M., ROZINAJ, G., JARINA, R.: Voice Operated Information System in Slovak, Computing and Informatics, vol. 26 (6), pp. 577-603, 2007.
  2. NOUZA, J., SILOVSKY, J., ZDANSKY, J., CERVA, P., KROUL, M., CHALOUPKA, J.: Czech-to-Slovak Adapted Broadcast News Transcription System, Proc. of INTERSPEECH 2008, pp. 2683-2686, 2008. Go to original source...
  3. PLEVA, M., JUHAR, J., CIZMAR, A.: About Development and Evaluation of Multilingual Database for Automatic Broadcast News Transcription Systems, Acta Electrotechnica et Informatica (AeI), vol. 4 (2), pp. 56-59, 2004.
  4. STAS, J., HLADEK, D., PLEVA, M., JUHAR, J.: Slovak Language Model from Internet Text Data, Lecture Notes in Computer Science, Vol. 6456 LNCS, pp. 340-346, 2011. Go to original source...
  5. PROCHAZKA, V., POLLAK, P., ZDANSKY, J., NOUZA, J.: Performance of Czech Speech Recognition with Language Models Created from Public Resources, Radioengineering, Vol. 20 (4), pp. 1002-1008, 2011.
  6. HLADEK, D., STAS, J.: Text mining and processing for corpora creation in Slovak language, Journal of Computer Science and Control Systems, vol. 3 (1), pp. 65-68, 2010.
  7. PLEVA, M., JUHAR, J., CIZMAR, A.: Slovak Broadcast News Speech Corpus for Automatic Speech Recognition, Proceedings of RTT 2007 conference, Zilina, p. 4, 2007.
  8. VANDECATSEYE, A. et al.: The COST278 pan-European Broadcast News Database, Proc. of LREC 2004, vol. 6, May 2004, Lisbon, pp. 873-876, 2004.
  9. PLEVA, M.: Building European Broadcast News Database, Proc. of 4. Doktorandska konferencia a SVOS TU v Kosiciach - SCYR 2004, Kosice, pp. 85-86, 2004.
  10. PLEVA, M., CIZMAR, A., JUHAR, J., ONDAS, S., MIRILOVIC, M.: Towards Slovak Broadcast News Automatic Recording and Transcribing Service, Lecture Notes in Computer Science: Verbal and Nonverbal Features of Human-Human and Human-Machine Interaction, vol. 5042 LNCS, p. 158-168, 2008. Go to original source...
  11. JARINA, R., KUBA, M.: Speech Recognition Using Hidden Markov Model with Low Redundancy in the Observation Space, Komunikacie (Communications), Vol. 6 (4), pp. 17-21, 2004. Go to original source...
  12. NOUZA, J. et al.: Making Czech Historical Radio Archive Accessible and Searchable for Wide Public, Journal of Multimedia, Vol. 7 (2), pp. 159-169, 2012. Go to original source...
  13. HRIC, M., CHMULIK, M., JARINA, R.: Comparison of Selected Classification Methods in Automatic Speaker Identification, Komunikacie (Communications), Vol. 13 (4), pp. 20-24, 2011. Go to original source...
  14. CERVA, P., NOUZA, J., SILOVSKY, J.: Study on Cross-lingual Adaptation of a Czech LVCSR System Towards Slovak, Lecture Notes in Computer Science. Vol. 6800 LNCS, pp. 81-87, 2011. Go to original source...
  15. NOUZA, J., SILOVSKY, J.: Adapting Lexical and Language Models for Transcription of Highly Spontaneous Spoken Czech, Lecture Notes in Computer Science, vol. 6231 LNAI, pp. 377-384, 2010. Go to original source...
  16. JUHAR, J., STAS, J., HLADEK, D.: Recent Progress in Development of Language Model for Slovak Large Vocabulary Continuous Speech Recognition, New Technologies: Trends, Innovations and Research, Rijeka: InTech, pp. 261-276, 2012.
  17. PLEVA, M., JUHAR, J., CIZMAR, A.: Speech Detection in the Broadcast News Processing, Proc. of DSP-MCOM 2005. Kosice, pp. 84-85, 2005.
  18. VAVREK, J.: Audio Content Classification using SVM Binary Decision Trees, Proc. of SCYR 2012: 12th Scientific Conference of Young Researchers, May, Herlany, pp. 80-83, 2012. Go to original source...
  19. http://www.technisat.com does not present the end-of-life product, see spec.: http://www.digitalnow.com.au/product_pages/airstar2.html
  20. http://neuron2.net/dgmpgdec/dgmpgdec.html - developer site
  21. http://trans.sourceforge.netTranscriber 1.5.1 developer site
  22. http://www.foobar2000.org/ - developer site
  23. PLEVA, M., JUHAR, J., CIZMAR, A.: Multimedia Database Management for Annotators of the Metadata Content, Proc. of RTT 2009, Praha : CVUT, p. 3, 2009.
  24. http://www.ldc.upenn.edu/Projects/Corpus_Cookbook/transcription/broadcast_speech/english/index.html - LDC recommendation
  25. NIST SCLITE scoring toolkit: http://www.itl.nist.gov/iad/mig/tools/
  26. POLLAK, P. et al.: SpeechDat(E)-Eastern Speech Databases, Proc. of LREC 2000, XLDB satellite workshop, Athens, Greece, pp. 20-25, 2000.
  27. ZGANK, A. et al.: The COST 278 Initiative - Crosslingual Speech Recognition with Large Telephone Database, Proc. of LREC 2004, Lisbon, May 2004, pp. 2107-2110.
  28. PLEVA, M.: Automatic Processing of Speech Data in Multimedia Databases (Automaticke Spracovanie Recovych Dat v Multimedialnych Databazach), PhD Thesis (in Slovak), FEI TU of Kosice, p. 93, 2009.
  29. LINDBERG, B. et al.: A Noise Robust Multilingual Reference Recogniser Based on Speechdat (II), Proc. of INTERSPEECH 2000, Beijing, China, October 16-20, 2000, pp. 370-373, 2000. Go to original source...
  30. IVANECKY, J., NABELKOVA, M.: Phonetic transcription SAMPA and Slovak language (in Slovak), Jazykovedny casopis, vol. 53, pp. 81-95, 2002.

This is an open access article distributed under the terms of the Creative Commons Attribution 4.0 International License (CC BY 4.0), which permits use, distribution, and reproduction in any medium, provided the original publication is properly cited. No use, distribution or reproduction is permitted which does not comply with these terms.