Computer Speech and Language

Papers
(The TQCC of Computer Speech and Language is 6. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
Building a text retrieval system for the Sanskrit language: Exploring indexing, stemming, and searching issues175
Speech enhancement approach for body-conducted unvoiced speech based on Taylor–Boltzmann machines trained DNN119
Towards inclusive automatic speech recognition113
Towards a unified assessment framework of speech pseudonymisation90
FE-CFNER: Feature Enhancement-based approach for Chinese Few-shot Named Entity Recognition90
Corpus and unsupervised benchmark: Towards Tagalog grammatical error correction68
A new speech corpus of super-elderly Japanese for acoustic modeling60
Addressing subjectivity in paralinguistic data labeling for improved classification performance: A case study with Spanish-speaking Mexican children using data balancing and semi-supervised learning56
Seq2Seq dynamic planning network for progressive text generation54
Stochastic Data-to-Text Generation Using Syntactic Dependency Information53
Optimizing pipeline task-oriented dialogue systems using post-processing networks52
Assessing language models’ task and language transfer capabilities for sentiment analysis in dialog data50
Automatic detection of pharyngeal fricatives in cleft palate speech using acoustic features based on the vocal tract area spectrum49
Improved relation extraction through key phrase identification using community detection on dependency trees49
Adaptive line enhancer for nonstationary harmonic noise reduction47
Sentence transition matrix: An efficient approach that preserves sentence semantics45
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis37
Automatic detection of behavioural codes in team interactions37
Glottal features for classification of phonation type from speech and neck surface accelerometer signals37
Learning to extract from multiple perspectives for neural keyphrase extraction30
Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement29
A methodological approach to enable natural language interaction in an Intelligent Tutoring System29
Editorial Board26
Editorial Board25
Effective infant cry signal analysis and reasoning using IARO based leaky Bi-LSTM model25
KddRES: A Multi-level Knowledge-driven Dialogue Dataset for Restaurant Towards Customized Dialogue System24
RepSum: A general abstractive summarization framework with dynamic word embedding representation correction23
Two evaluations on Ontology-style relation annotations22
GEPC: Global embeddings with PID control22
Talking-heads attention-based knowledge representation for link prediction22
PLDA inspired Siamese networks for speaker verification21
Automatic offline annotation of turn-taking transitions in task-oriented dialogue21
Neural multi-task learning for end-to-end Arabic aspect-based sentiment analysis19
Document-level relation extraction with entity mentions deep attention19
A benchmark dataset for Turkish data-to-text generation18
Editorial Board17
Towards better Chinese-centric neural machine translation for low-resource languages17
An effective approach for identifying keywords as high-quality filters to get emergency-implicated Twitter Spanish data16
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network15
Exploring the ability of LLMs to classify written proficiency levels15
FinD: Fine-grained discrepancy-based fake news detection enhanced by event abstract generation15
Tamil Handwritten Character Recognition System using Statistical Algorithmic Approaches15
Taking relations as known conditions: A tagging based method for relational triple extraction14
A language-agnostic model of child language acquisition14
Non-negative matrix factorization-based time-frequency feature extraction of voice signal for Parkinson's disease prediction13
Effectiveness of energy separation-based instantaneous frequency estimation for cochlear cepstral features for synthetic and voice-converted spoofed speech detection12
An investigation of neural uncertainty estimation for target speaker extraction equipped RNN transducer12
Direct enhancement of pre-trained speech embeddings for speech processing in noisy conditions12
A flexible BERT model enabling width- and depth-dynamic inference12
Incorporating external knowledge for text matching model12
Room impulse response reshaping-based expectation–maximization in an underdetermined reverberant environment12
BERT-hLSTMs: BERT and hierarchical LSTMs for visual storytelling12
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer11
C-KGE: Curriculum learning-based Knowledge Graph Embedding11
A generalized decoding method for neural text generation11
A potential relation trigger method for entity-relation quintuple extraction in text with excessive entities11
Deep ad-hoc beamforming11
A computational analysis of transcribed speech of people living with dementia: The Anchise 2022 Corpus10
ECDG-DST: A dialogue state tracking model based on efficient context and domain guidance for smart dialogue systems10
Hate speech detection on Twitter using transfer learning10
The effect of preference elicitation methods on the user experience in conversational recommender systems10
Exploiting spatial information and target speaker phoneme loss for multichannel directional speech enhancement and recognition10
Turn-taking in Conversational Systems and Human-Robot Interaction: A Review10
MS-Transformer: Introduce multiple structural priors into a unified transformer for encoding sentences9
Monotonic Gaussian regularization of attention for robust automatic speech recognition9
A knowledge-Aware NLP-Driven conversational model to detect deceptive contents on social media posts9
Editorial Board9
Prosodic event detection in children’s read speech9
UniKDD: A Unified Generative model for Knowledge-driven Dialogue9
A tag-based methodology for the detection of user repair strategies in task-oriented conversational agents9
An optimal approach for text feature selection9
Exploring intrinsic information content models for addressing the issues of traditional semantic measures to evaluate verb similarity8
Self-segmentation of pass-phrase utterances for deep feature learning in text-dependent speaker verification8
A novel approach to unsupervised pattern discovery in speech using Convolutional Neural Network8
Rep-MCA-former: An efficient multi-scale convolution attention encoder for text-independent speaker verification8
On significance of constant-Q transform for pop noise detection8
Cross-lingual multi-speaker speech synthesis with limited bilingual training data8
Detection of vowel transition regions from Hindi language8
Misogynistic attitude detection in YouTube comments and replies: A high-quality dataset and algorithmic models8
Editorial Board8
A study of vowel nasalization using instantaneous spectra8
Editorial Board8
Multi-branch feature aggregation based on multiple weighting for speaker verification7
Speech recognition using Taylor-gradient Descent political optimization based Deep residual network7
Intelligibility assessment of impaired speech using Regularized self-representation based compact supervectors7
Contextual emotion detection using ensemble deep learning7
Natural language processing for under-resourced languages: Developing a Welsh natural language toolkit7
Prototypical networks relation classification model based on entity convolution7
Uncertainty-aware non-autoregressive neural machine translation7
Conversation Initiation of Mothers, Fathers, and Toddlers in their Natural Home Environment6
Improving BERT with local context comprehension for multi-turn response selection in retrieval-based dialogue systems6
Training RNN language models on uncertain ASR hypotheses in limited data scenarios6
A method of phonemic annotation for Chinese dialects based on a deep learning model with adaptive temporal attention and a feature disentangling structure6
Towards detecting the level of trust in the skills of a virtual assistant from the user’s speech6
Replay spoof detection using energy separation based instantaneous frequency estimation from quadrature and in-phase components6
Scale-aware dual-branch complex convolutional recurrent network for monaural speech enhancement6
Generating identities with mixture models for speaker anonymization6
CLIPMulti: Explore the performance of multimodal enhanced CLIP for zero-shot text classification6
Language-independent extractive automatic text summarization based on automatic keyword extraction6
A high-performance speech BioHashing retrieval algorithm based on audio segmentation6
Bayesian active summarization6
PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition6
A closer look at reinforcement learning-based automatic speech recognition6
Joint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances6
Speaking to remember: Model-based adaptive vocabulary learning using automatic speech recognition6
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge6
Prediction of speech intelligibility with DNN-based performance measures6
0.06545090675354