IEEE-ACM Transactions on Audio Speech and Language Processing

Papers
(The H4-Index of IEEE-ACM Transactions on Audio Speech and Language Processing is 30. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
Improving the Adversarial Robustness for Speaker Verification by Self-Supervised Learning214
Low-Latency Active Noise Control Using Attentive Recurrent Network183
CL-XABSA: Contrastive Learning for Cross-Lingual Aspect-Based Sentiment Analysis151
Convolutive Transfer Function-Based Multichannel Nonnegative Matrix Factorization for Overdetermined Blind Source Separation88
List of Reviewers81
Dual Microphone Speech Enhancement Based on Statistical Modeling of Interchannel Phase Difference73
Towards Maximizing a Perceptual Sweet Spot for Spatial Sound With Loudspeakers68
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification58
CET2: Modelling Topic Transitions for Coherent and Engaging Knowledge-Grounded Conversations52
Multi-Cue Guided Semi-Supervised Learning Toward Target Speaker Separation in Real Environments51
Lightweight Speaker Verification Using Transformation Module With Feature Partition and Fusion47
One General Teacher for Multi-Data Multi-Task: A New Knowledge Distillation Framework for Discourse Relation Analysis45
SANet: A Compressed Speech Encoder and Steganography Algorithm Independent Steganalysis Deep Neural Network45
Cross-Domain Aspect-Based Sentiment Classification With Tripartite Graph Modeling44
Computation and Parameter Efficient Multi-Modal Fusion Transformer for Cued Speech Recognition41
DropAttack: A Random Dropped Weight Attack Adversarial Training for Natural Language Understanding40
On Ambisonic Source Separation With Spatially Informed Non-Negative Tensor Factorization39
Statistical Analysis for Speaker Recognition Evaluation With Data Dependence and Three Score Distributions39
Operation-Augmented Numerical Reasoning for Question Answering38
JMS-QA: A Joint Hierarchical Architecture for Mental Health Question Answering36
High-Fidelity and Pitch-Controllable Neural Vocoder Based on Unified Source-Filter Networks36
Handover QG: Question Generation by Decoder Fusion and Reinforcement Learning35
Review of Methods for Automatic Speaker Verification34
Principled Comparisons for End-to-End Speech Recognition: Attention vs Hybrid at the 1000-Hour Scale33
Spatial Analysis and Synthesis Methods: Subjective and Objective Evaluations Using Various Microphone Arrays in the Auralization of a Critical Listening Room33
The VoxCeleb Speaker Recognition Challenge: A Retrospective33
Multi-Level Interaction Based Knowledge Graph Completion32
Dynamic Convolutional Neural Networks as Efficient Pre-Trained Audio Models31
Cacophony: An Improved Contrastive Audio-Text Model31
Dynamic Prompt-Driven Zero-Shot Relation Extraction31
0.068820953369141