Speech Communication

Papers
(The TQCC of Speech Communication is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)
ArticleCitations
Editorial Board122
Psychoacoustic features explain creakiness classifications made by naive and non-naive listeners89
Phase unwrapping based packet loss concealment using deep neural networks85
Subband fusion of complex spectrogram for fake speech detection59
Facemask occlusion's impact on L2 listening comprehension58
A comprehensive study on supervised single-channel noisy speech separation with multi-task learning55
Editorial Board53
Editorial Board47
Data augmentation for speech separation45
A novel distortion-tolerant speech encryption scheme for secure voice communication43
Assessing child communication engagement and statistical speech patterns for American English via speech recognition in naturalistic active learning spaces40
Editorial Board37
Perceptual asymmetry between pitch peaks and valleys35
Fixed frequency range empirical wavelet transform based acoustic and entropy features for speech emotion recognition34
A robust temporal map of speech monitoring from planning to articulation33
An introduction to pluricentric languages in speech science and technology29
Automatic classification of vocal intensity categories from amplitude-normalized speech signals by comparing acoustic features and classifier models29
A corpus of audio-visual recordings of linguistically balanced, Danish sentences for speech-in-noise experiments26
Self-Supervised Learning for Speaker Recognition: A study and review24
Vocal emotion perception in Mandarin-speaking older adults with hearing loss24
Investigating a neural all pass warp in modern TTS applications23
Editorial Board22
Editorial Board22
The prosody of theme, rheme and focus in Egyptian Arabic: A quantitative investigation of tunes, configurations and speaker variability21
Blood pressure monitoring from naturally recorded speech sounds: advancements and future prospects21
Speech intelligibility deterioration for normal hearing and hearing impaired patients with different types of tinnitus20
Editorial Board20
HC-APNet: Harmonic Compensation Auditory Perception Network for low-complexity speech enhancement20
The influence of task engagement on phonetic convergence19
Efficient acoustic feature transformation in mismatched environments using a Guided-GAN18
"I said simPle, not symBol!"Is clear speech tailored to the listener's feedback17
Deletion and insertion tampering detection for speech authentication based on fluctuating super vector of electrical network frequency16
Two-stage UNet with channel and temporal-frequency attention for multi-channel speech enhancement16
Investigating prosodic entrainment from global conversations to local turns and tones in Mandarin conversations16
Expectation of speech style improves audio-visual perception of English vowels16
Evaluating the effects of continuous pitch and speech tempo modifications on perceptual speaker verification performance by familiar and unfamiliar listeners15
Automatic Speech Recognition and Pronunciation Error Detection of Dutch Non-native Speech: cumulating speech resources in a pluricentric language15
Frequent-words analysis for forensic speaker comparison15
Neural speech-rate conversion with multispeaker WaveNet vocoder15
Real-time intelligibility affects the realization of French word-final schwa15
Unsupervised Automatic Speech Recognition: A review15
Editorial Board14
Paradigm fusion learning from overt and silent chinese speech based on pseudo-siamese multiscale capsule neural network14
Blind Speech Separation and Dereverberation using neural beamforming13
Learning and controlling the source-filter representation of speech with a variational autoencoder13
Vocal characteristics of accuracy in eyewitness testimony13
A study of correlation between physiological process of articulation and emotions on Mandarin Chinese12
Towards unsupervised speech recognition without pronunciation models11
Hand gesture realisation of contrastive focus in real-time whisper-to-speech synthesis: Investigating the transfer from implicit to explicit control of intonation11
Multilingual speech recognition for GlobalPhone languages11
A formant modification method for improved ASR of children’s speech11
Sequential perception of tone and focus in parallel–A computational simulation11
Dynamic graph learning with gated convolutions for single-channel speech separation11
Effects of voice onset time and place of articulation on perception of dichotic Turkish syllables11
Prosody in narratives: An exploratory study with children with sex chromosomes trisomies10
Speechformer-CTC: Sequential modeling of depression detection with speech temporal classification10
The effect of fluency strategy training on interpreter trainees’ speech fluency: Does content familiarity matter?10
A new universal camouflage attack algorithm for intelligent speech system10
Yanbian Korean speakers tend to merge /e/ and /ɛ/ when exposed to Seoul Korean10
The Lombard intelligibility benefit of native and non-native speech for native and non-native listeners10
Multi-modal co-learning for silent speech recognition based on ultrasound tongue images10
Differences between listeners with early and late immersion age in spatial release from masking in various acoustic environments10
Adaptive weighting in a transformer framework for multimodal emotion recognition9
Editorial Board9
GM-TCNet: Gated Multi-scale Temporal Convolutional Network using Emotion Causality for Speech Emotion Recognition9
Progressive channel fusion for more efficient TDNN on speaker verification9
Disordered speech recognition considering low resources and abnormal articulation9
Prosody and fluency of Finland Swedish as a second language: Investigating global parameters for automated speaking assessment9
Exploiting Locality Sensitive Hashing - Clustering and gloss feature for sign language production9
Recognition of vocoded speech in English by Mandarin-speaking English-learners9
Using iterative adaptation and dynamic mask for child speech extraction under real-world multilingual conditions9
Perceptual effects of interpolated Austrian and German standard varieties9
Coarse-to-fine speech separation method in the time-frequency domain9
Deep ad-hoc beamforming based on speaker extraction for target-dependent speech separation8
Role of language familiarity in understanding speech in noise under various acoustic environments8
Differential constant-beamwidth beamforming with cube arrays8
Efficient time-domain speech separation using short encoded sequence network8
Pathological voice classification using MEEL features and SVM-TabNet model8
Speech pause distribution as an early marker for Alzheimer’s disease8
Bangladeshi Bangla speech corpus for automatic speech recognition research8
Cross-modal information fusion for voice spoofing detection8
A cross-modal attention model with contextual enhancements for speech emotion recognition8
Editorial Board8
Tone-syllable synchrony in Mandarin: New evidence and implications8
Modulation spectral features for speech emotion recognition using deep neural networks8
FinnAffect: An affective speech corpus for spontaneous Finnish8
Nasal coarticulation in Lombard speech8
Enhancing bone-conducted speech with spectrum similarity metric in adversarial learning8
Combined approach to dysarthric speaker verification using data augmentation and feature fusion8
One-shot emotional voice conversion based on feature separation7
Arabic Automatic Speech Recognition: Challenges and Progress7
Controllable speech synthesis by learning discrete phoneme-level prosodic representations7
The Second-Language Productivity of Two Mandarin Tone Sandhi Patterns7
Automatic speech recognition technology to evaluate an audiometric word recognition test: A preliminary investigation7
Addressing the semi-open set dialect recognition problem under resource-efficient considerations7
Space-and-speaker-aware acoustic modeling with effective data augmentation for recognition of multi-array conversational speech7
Speakers’ vocal expression of sexual orientation depends on experimenter gender7
CSLNSpeech: Solving the extended speech separation problem with the help of Chinese sign language7
Editorial Board7
Learning transfer from singing to speech: Insights from vowel analyses in aging amateur singers and non-singers6
Editorial Board6
Prosodic characteristics of deceptive picture descriptions in Finnish: Acoustics, beliefs, self-evaluations, and deception theories6
Editorial Board6
Categorization of patients affected with neurogenerative dysarthria among Hindi-speaking population and analyzing factors causing reduced speech intelligibility at the human-machine interface6
MC-Mamba: Cross-modal target speaker extraction model based on multiple consistency6
Robust prosody modeling for synthetic speech detection6
Accurate synthesis of dysarthric Speech for ASR data augmentation6
Comparing the nativeness vs. intelligibility approach in prosody instruction for developing speaking skills by interpreter trainees: An experimental study6
Assessing Cancer-Related Cognitive Impairment for breast cancer survivors with speech analysis6
The perception of intonational peaks and valleys: The effects of plateaux, declination and experimental task6
1.6657330989838