Research

No items found.

Improving Non-autoregressive ASR with Autoregressive Pretraining

Yanjia Li, Lahiru Samarakoon, Ivan Fung,ICASSP 2023, June 2023

Abstract

Autoregressive (AR) automatic speech recognition (ASR) models predict each output token conditioning on the previous ones, which slows down their inference speed. On the other hand, nonautoregressive (NAR) models predict tokens independently and simultaneously within a constant number of decoding iterations, which brings high inference speed. However, NAR models generally have lower accuracy than AR models. In this work, we propose AR pretraining to the NAR encoder to reduce the accuracy gap between AR and NAR models. The experiment results show that our AR-pretrained MaskCTC reaches the same accuracy as AR Conformer on Aishell-1 (both 4.9% CER) and reduce the performance gap with AR Conformer on LibriSpeech by relatively 50%. Moreover, our AR-pretrained MaskCTC only needs single decoding iteration, which reduces inference time by 50%. We also investigate multiple masking strategies in training the masked language model of MaskCTC.

Research

Improving Non-autoregressive ASR with Autoregressive Pretraining

Abstract

Untied Positional Encodings For Efficient Transformer-based Speech Recognition

Abstract

Fine-tuning Pre-trained Language Models for Few-shot Intent Detection: Supervised Pre-training and Isotropization

Abstract

Conformer-based Speech Recognition with Linear Nystrom Attention and Rotary Position Embedding

Abstract

Two-Stage Auction Mechanism for Long-Term Participation in Crowdsourcing

Abstract

Robust End-to-end Speaker Diarization with Conformer and Additive Margin Penalty

Abstract

Unknown Intent Detection Using Gaussian Mixture Model with an Application to Zero-shot Intent Classification

Abstract

Deep-AIR: A Hybrid CNN-LSTM Framework forFine-Grained Air Pollution Forecast

Abstract

Incorporating Prior Knowledge Into Speaker Diarization and Linking for Identifying Common Speaker

Abstract

A five-layer architecture for big data processing and analytics

Abstract

Go From the General to the Particular: Multi-Domain Translation with Domain Transformation Networks

Abstract

Reconstructing Capsule Networks for Zero-shot Intent Classification

Abstract

Public Transport Waiting Time Estimation Using Semi-Supervised Graph Convolutional Networks

Abstract

Synchrophasor Recovery and Prediction: A Graph-Based Deep Learning Approach

Abstract

Improved Zero-shot Neural Machine Translation via Ignoring Spurious Correlations

Abstract

Deep Multi-Scale Convolutional LSTM Network for Travel Demand and Origin-Destination Predictions

Abstract

Domain Adaptation of End-to-end Speech Recognition in Low-resource Settings

Abstract

Subspace Based Sequence Discriminative Training of LSTM Acoustic Models with Feed-Forward Layers

Abstract

Travel Demand Prediction using Deep Multi-Scale Convolutional LSTM Network

Abstract

Delay Aware Power System Synchrophasor Recovery and Prediction Framework

Abstract

Delay Aware Transient Stability Assessment with Synchrophasor Recovery and Prediction Framework

Abstract

Intelligent Time-Adaptive Transient Stability Assessment System

Abstract

Neural Machine Translation with Gumbel-Greedy Decoding

Abstract

Non-Autoregressive Neural Machine Translation

Abstract

Universal Neural Machine Translation for Extremely Low Resource Languages

Abstract

Delay Aware Intelligent Transient Stability Assessment System

Abstract

An Extended Spatio-temporal Granger Causality Model for Air Quality Estimation with Heterogeneous

Abstract

Search Engine Guided Non-Parametric Neural Machine Translation

Abstract

A Teacher-Student Framework for Zero-Resource Neural Machine Translation

Abstract

Intelligent Fault Detection Scheme for Microgrids with Wavelet-based Deep Neural Networks

Abstract

Trainable Greedy Decoding for the Neural Machine Translation

Abstract

A Four-Layer Architecture for Online and Historical Big Data Analytics

Abstract

Incorporating Copying Mechanism in Sequence-to-Sequence Learning

Abstract

A Gaussian Bayesian Model to Identify Spatio-temporal Causalities for Air Pollution Based on Urban Big Data

Abstract

Learning to Translate in Real-time with Neural Machine Translation

Abstract

Pg-Causality: Identifying Spatiotemporal Causal Pathways for Air Pollutants with Urban Big Data

Abstract

Efficient Learning for Undirected Topic Models

Abstract

Granger-Causality-Based Air Quality Estimation with Spatio-Temporal (S-T) Heterogeneous Big Data

Abstract

Spatio-temporal (S-T) similarity model for constructing WIFI-based RSSI fingerprinting map for indoor localization

Abstract

Performance Models of Access Latency in Cloud Storage Systems