PAPER SUBMISSION FOR 2022

Who submitted the latest research and development

Exploration of Semantic Information of Previous Sentences for Automatic Speech Recognition

Abstract

A recent study showed that semantic information from the current sentence helps improve automatic speech recognition (ASR) performance in noisy environments. This work aims to improve ASR in noisy conditions by exploiting semantic information from previously recognized sentences to re-evaluate the N-best hypothesis list. The semantic probability score used for this re-evaluation is obtained by two approaches. The first uses a deep neural network (DNN) semantic model based on bidirectional encoder representations from transformers (BERT), named P-BERT, to compare sentence hypotheses pairwise and choose the hypothesis with better semantic consistency. The second exploits the Universal Sentence Encoder, a pre-trained sentence encoding model based on the transformer architecture: we represent the previously recognized sentence and the current sentence hypotheses as high-dimensional vectors and compute the semantic distance between them. We perform experiments on the publicly available TED-LIUM corpus at different noise levels and evaluate both approaches using different context lengths. The proposed methods improve on the baseline method, which uses semantic information from the current sentence only. Most of the best results are obtained with the P-BERT rescoring method.
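The second approach in the abstract, scoring each hypothesis by its semantic closeness to the previously recognized text, can be illustrated with a minimal sketch. It is not the authors' implementation. The `embed` function is a hypothetical bag-of-words stand-in for a real sentence encoder such as the Universal Sentence Encoder. The linear interpolation weight `alpha` and the example ASR scores are assumptions, since the abstract does not say how the semantic score is combined with the recognizer's score.

```python
import math
from collections import Counter


def embed(sentence):
    """Hypothetical stand-in for a sentence encoder: bag-of-words counts."""
    return Counter(sentence.lower().split())


def cosine(u, v):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(u[w] * v[w] for w in u if w in v)
    norm_u = math.sqrt(sum(c * c for c in u.values()))
    norm_v = math.sqrt(sum(c * c for c in v.values()))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def rescore(context, hypotheses, alpha=0.5):
    """Re-rank an N-best list using similarity to previous sentences.

    context: list of previously recognized sentences.
    hypotheses: list of (text, asr_score) pairs for the current sentence.
    alpha: assumed interpolation weight for the semantic score.
    Returns hypothesis texts sorted from best to worst combined score.
    """
    ctx_vec = embed(" ".join(context))
    scored = []
    for text, asr_score in hypotheses:
        semantic = cosine(ctx_vec, embed(text))
        combined = (1 - alpha) * asr_score + alpha * semantic
        scored.append((combined, text))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [text for _, text in scored]


# Illustrative data only: the recognizer slightly prefers the wrong word.
context = ["the ship left the harbor at dawn"]
hypotheses = [("the sheep sailed slowly", 0.52), ("the ship sailed slowly", 0.50)]
print(rescore(context, hypotheses))
```

In this toy case the hypothesis that shares "ship" with the context moves to the top even though its ASR score is slightly lower. Passing more sentences in `context` corresponds to the longer context lengths mentioned in the abstract.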

2022 Papers

Local and Global Orientation Correction for Oriented Human (Pose) Detection

Preliminary Study on SSCF-derived Polar Coordinate for ASR

Text Recognition on the Khmer Identification Cards and Its Application in Electronic Know Your Customer (e-KYC)

Cambodia Distributed Ledger – CamDL

Job Trends Analysis Using Power BI

Students’ Sentiment and Feedback Analysis on Online Learning System during COVID-19

Temperature Forecasting in Phnom Penh Using Time Series Models

Evaluation of Regularization-based Continual Learning Algorithm in the Context of Human Activity Recognition

Implementation of Deep Learning for Smart City Application: Lessons Learned

Intelligent Control in SDN/NFV-Empowered IoT System for Smart City Application

ENI-ETSI Meets the Proactive Network Solutions for Multi-tier Networking
