EURASIP Journal on Audio Speech and Music Processing

Papers
(The TQCC of EURASIP Journal on Audio Speech and Music Processing is 4. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
Article | Citations
Learning domain-heterogeneous speaker recognition systems with personalized continual federated learning | 26
MIRACLE—a microphone array impulse response dataset for acoustic learning | 22
Hybrid lightweight temporal-frequency analysis network for multi-channel speech enhancement | 19
AAM: a dataset of Artificial Audio Multitracks for diverse music information retrieval tasks | 19
Supervised Attention Multi-Scale Temporal Convolutional Network for monaural speech enhancement | 18
Generating chord progression from melody with flexible harmonic rhythm and controllable harmonic density | 18
Domain-weighted transfer learning and discriminative embeddings for low-resource speaker verification | 17
A simplified and controllable model of mode coupling for addressing nonlinear phenomena in sound synthesis processes | 16
Compression of room impulse responses for compact storage and fast low-latency convolution | 14
Parameter-efficient adaptation with multi-channel adversarial training for far-field speech recognition | 13
Benefits of pre-trained mono- and cross-lingual speech representations for spoken language understanding of Dutch dysarthric speech | 12
Sound recurrence analysis for acoustic scene classification | 12
Investigations on higher-order spherical harmonic input features for deep learning-based multiple speaker detection and localization | 12
Estimation of playable piano fingering by pitch-difference fingering match model | 12
Attention mechanism combined with residual recurrent neural network for sound event detection and localization | 11
Three-stage training and orthogonality regularization for spoken language recognition | 11
Silent speech recognition using visual cascading fusion of tongue-lip movements based on pre-trained and fine-tuned model | 10
Feature compensation based on independent noise estimation for robust speech recognition | 10
Multi-rate modulation encoding via unsupervised learning for audio event detection | 10
Enhancing Speaker Recognition with CRET Model: a fusion of CONV2D, RESNET and ECAPA-TDNN | 10
Sound field reconstruction using neural processes with dynamic kernels | 10
Anchor voiceprint recognition in live streaming via RawNet-SA and gated recurrent unit | 8
The whole is greater than the sum of its parts: improving music source separation by bridging networks | 8
Neural electric bass guitar synthesis framework enabling attack-sustain-representation-based technique control | 8
Parallel processing of distributed beamforming and multichannel linear prediction for speech denoising and dereverberation in wireless acoustic sensor networks | 7
Dance2Music-Diffusion: leveraging latent diffusion models for music generation from dance videos | 7
W2VC: WavLM representation based one-shot voice conversion with gradient reversal distillation and CTC supervision | 7
Pronunciation augmentation for Mandarin-English code-switching speech recognition | 7
Timestamp-aligning and keyword-biasing end-to-end ASR front-end for a KWS system | 7
Comparative performance analysis of end-to-end ASR models on Indo-Aryan and Dravidian languages within India’s linguistic landscape | 7
Variational Autoencoders for chord sequence generation conditioned on Western harmonic music complexity | 7
Vulnerability issues in Automatic Speaker Verification (ASV) systems | 7
Correction: N-dimensional N-microphone sound source localization | 6
Automatic detection of attachment style in married couples through conversation analysis | 6
Auxiliary function-based algorithm for blind extraction of a moving speaker | 6
An overview of machine learning and other data-based methods for spatial audio capture, processing, and reproduction | 6
Paralinguistic singing attribute recognition using supervised machine learning for describing the classical tenor solo singing voice in vocal pedagogy | 6
Data-based spatial audio processing | 5
dEchorate: a calibrated room impulse response dataset for echo-aware signal processing | 5
Training audio transformers for cover song identification | 5
Masked multi-center angular margin loss for language recognition | 5
Text-to-speech system for low-resource language using cross-lingual transfer learning and data augmentation | 5
Dual-branch attention module-based network with parameter sharing for joint sound event detection and localization | 5
DOA-informed switching independent vector extraction and beamforming for speech enhancement in underdetermined situations | 5
Performance evaluation of perceptible impulsive noise detection methods based on auditory models | 4
Multi-scale Information Aggregation for Spoofing Detection | 4
Guest editorial: AI for computational audition—sound and music processing | 4
AI-based Chinese-style music generation from video content: a study on cross-modal analysis and generation methods | 4
Robust and early howling detection based on a sparsity measure | 4
Optimal sensor placement for the spatial reconstruction of sound fields | 4
Significance of relative phase features for shouted and normal speech classification | 4
A survey of technologies for automatic Dysarthric speech recognition | 4
Data-driven room acoustic modeling via differentiable feedback delay networks with learnable delay lines | 4
Recognition of target domain Japanese speech using language model replacement | 4
Fake speech detection using VGGish with attention block | 4
Single-microphone speaker separation and voice activity detection in noisy and reverberant environments | 4
Automatic dysarthria detection and severity level assessment using CWT-layered CNN model | 4