Computer Speech and Language

Papers
(The median citation count of Computer Speech and Language is 2. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
Article	Citations
A language-agnostic model of child language acquisition	188
Stochastic Data-to-Text Generation Using Syntactic Dependency Information	120
Corpus and unsupervised benchmark: Towards Tagalog grammatical error correction	100
Automatic detection of behavioural codes in team interactions	94
Seq2Seq dynamic planning network for progressive text generation	63
Speech enhancement approach for body-conducted unvoiced speech based on Taylor–Boltzmann machines trained DNN	60
KddRES: A Multi-level Knowledge-driven Dialogue Dataset for Restaurant Towards Customized Dialogue System	59
Room impulse response reshaping-based expectation–maximization in an underdetermined reverberant environment	58
GEPC: Global embeddings with PID control	58
Towards privacy-preserving conversation analysis in everyday life: Exploring the privacy-utility trade-off	53
Editorial Board	52
Identifying offensive memes in low-resource languages: A multi-modal multi-task approach using valence and arousal	52
Unsupervised question-retrieval approach based on topic keywords filtering and multi-task learning	50
A method of phonemic annotation for Chinese dialects based on a deep learning model with adaptive temporal attention and a feature disentangling structure	47
Monotonic Gaussian regularization of attention for robust automatic speech recognition	42
Contextual emotion detection using ensemble deep learning	40
Misogynistic attitude detection in YouTube comments and replies: A high-quality dataset and algorithmic models	38
Complementary regional energy features for spoofed speech detection	37
Multi-branch feature aggregation based on multiple weighting for speaker verification	33
PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition	30
Improving low-resource machine transliteration by using 3-way transfer learning	29
Verbal fluency in normal aging and cognitive decline: Results of a longitudinal study	29
Editorial Board	27
Unsupervised sign language validation process based on hand-motion parameter clustering	26
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning	25
Maximal activation weighted memory for aspect based sentiment analysis	24
A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages	24
Perceptions and reactions to conversational privacy initiated by a conversational user interface	24
Editorial Board	23
Adversarial subsequences for unconditional text generation	21
Augmentative and alternative speech communication (AASC) aid for people with dysarthria	21
Enhancing analysis of diadochokinetic speech using deep neural networks	21
A hybrid approach to Natural Language Inference for the SICK dataset	21
Improving self-supervised learning model for audio spoofing detection with layer-conditioned embedding fusion	19
Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source	19
Exploring accidental triggers of smart speakers	19
Combining replay and LoRA for continual learning in natural language understanding	19
An unsupervised approach to detect review spam using duplicates of images, videos and Chinese texts	18
Symbolic and Statistical Learning Approaches to Speech Summarization: A Scoping Review	18
The use of Active Learning systems for stimulus selection and response modelling in perception experiments	17
Loanword identification based on web resources: A case study on wikipedia	16
Enhancing Arabic aspect-based sentiment analysis using deep learning models	16
A mobile application using automatic speech analysis for classifying Alzheimer's disease and mild cognitive impairment	15
English–Assamese neural machine translation using prior alignment and pre-trained language model	15
A lightweight approach based on prompt for few-shot relation extraction	14
Representation learning strategies to model pathological speech: Effect of multiple spectral resolutions	14
Editorial Board	14
Conversations in the wild: Data collection, automatic generation and evaluation	14
Unsupervised induction of inflectional families	14
Meta adversarial learning improves low-resource speech recognition	13
Effects of cross-cultural language differences on social cognition during human-agent interaction in cooperative game environments	13
Dialect Identification using Chroma-Spectral Shape Features with Ensemble Technique	13
A novel channel estimate for noise robust speech recognition	13
Adjustable deterministic pseudonymization of speech	12
Editorial Board	12
Evidence and Axial Attention Guided Document-level Relation Extraction	12
Editorial Board	12
Zero-Shot Strike: Testing the generalisation capabilities of out-of-the-box LLM models for depression detection	11
A multi-label emoji classification method using balanced pointwise mutual information-based feature selection	11
MPSA-DenseNet: A novel deep learning model for English accent classification	11
SecNLP: An NLP classification model watermarking framework based on multi-task learning	11
Named entity recognition using neural language model and CRF for Hindi language	11
A computational analysis of transcribed speech of people living with dementia: The Anchise 2022 Corpus	11
Neural multi-task learning for end-to-end Arabic aspect-based sentiment analysis	10
Improved relation extraction through key phrase identification using community detection on dependency trees	10
Effective infant cry signal analysis and reasoning using IARO based leaky Bi-LSTM model	10
Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement	10
Addressing subjectivity in paralinguistic data labeling for improved classification performance: A case study with Spanish-speaking Mexican children using data balancing and semi-supervised learning	10
Towards a unified assessment framework of speech pseudonymisation	10
A flexible BERT model enabling width- and depth-dynamic inference	10
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer	9
Towards inclusive automatic speech recognition	9
A closer look at reinforcement learning-based automatic speech recognition	9
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network	9
Detection of vowel transition regions from Hindi language	9
Prosodic event detection in children’s read speech	9
Towards detecting the level of trust in the skills of a virtual assistant from the user’s speech	9
FinD: Fine-grained discrepancy-based fake news detection enhanced by event abstract generation	9
Editorial Board	9
A tag-based methodology for the detection of user repair strategies in task-oriented conversational agents	9
Improving BERT with local context comprehension for multi-turn response selection in retrieval-based dialogue systems	9
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge	8
A review of speaker diarization: Recent advances with deep learning	8
Towards lifelong human assisted speaker diarization	8
Multiple time-instances features based approach for reference-free speech quality measurement	8
Generating identities with mixture models for speaker anonymization	8
Arabic speech recognition by end-to-end, modular systems and human	8
Evaluating voice-assistant commands for dementia detection	8
Conversation Initiation of Mothers, Fathers, and Toddlers in their Natural Home Environment	8
Prototypical networks relation classification model based on entity convolution	8
Automated grapheme-to-phoneme conversion for Central Kurdish based on optimality theory	8
Test-retest reliability of acoustic and linguistic measures of speech tasks	8
Adversarial attack and defense strategies for deep speaker recognition systems	8
Refining the evaluation of speech synthesis: A summary of the Blizzard Challenge 2023	7
An intention multiple-representation model with expanded information	7
Automatic speaker independent dysarthric speech intelligibility assessment system	7
Unsupervised speech representation learning for behavior modeling using triplet enhanced contextualized networks	7
Hate speech and offensive language detection in Dravidian languages using deep ensemble framework	7
Investigations on speech recognition systems for low-resource dialectal Arabic–English code-switching speech	7
Empirical Mode Decomposition articulation feature extraction on Parkinson’s Diadochokinesia	7
Novel textual entailment technique for the Arabic language using genetic algorithm	7
End-to-End Speech-to-Text Translation: A Survey	7
A neural network approach for speech enhancement and noise-robust bandwidth extension	7
Adaptive feature extraction for entity relation extraction	7
Generative adversarial networks for speech processing: A review	7
A physical exertion inspired multi-task learning framework for detecting out-of-breath speech	7
A cross-attention augmented model for event-triggered context-aware story generation	6
Classification of stuttering – The ComParE challenge and beyond	6
An automated quality evaluation framework of psychotherapy conversations with local quality estimates	6
Two in One: A multi-task framework for politeness turn identification and phrase extraction in goal-oriented conversations	6
Improving named entity correctness of abstractive summarization by generative negative sampling	6
Spoofing countermeasure for fake speech detection using brute force features	6
Lightweight and irreversible speech pseudonymization based on data-driven optimization of cascaded voice modification modules	6
Significance of chirp MFCC as a feature in speech and audio applications	6
Morse wavelet transform-based features for voice liveness detection	6
SEBGM: Sentence Embedding Based on Generation Model with multi-task learning	6
Channel and channel subband selection for speaker diarization	6
Discovering phonetic inventories with crosslingual automatic speech recognition	6
Automatic screening of mild cognitive impairment and Alzheimer’s disease by means of posterior-thresholding hesitation representation	6
Goal-oriented conditional variational autoencoders for proactive and knowledge-aware conversational recommender system	6
Speech self-supervised representations benchmarking: A case for larger probing heads	6
GTSO: Gradient tangent search optimization enabled voice transformer with speech intelligibility for aphasia	6
Measuring and implementing lexical alignment: A systematic literature review	6
Multilingual non-intrusive binaural intelligibility prediction based on phone classification	6
Analysis and classification of speech sounds of children with autism spectrum disorder using acoustic features	6
Hierarchical state recurrent neural network for social emotion ranking	5
A new speech corpus of super-elderly Japanese for acoustic modeling	5
Assessing language models’ task and language transfer capabilities for sentiment analysis in dialog data	5
Editorial Board	5
Scale-aware dual-branch complex convolutional recurrent network for monaural speech enhancement	5
Accurate speaker counting, diarization and separation for advanced recognition of multichannel multispeaker conversations	5
FE-CFNER: Feature Enhancement-based approach for Chinese Few-shot Named Entity Recognition	5
C-KGE: Curriculum learning-based Knowledge Graph Embedding	5
Optimizing pipeline task-oriented dialogue systems using post-processing networks	5
A study of vowel nasalization using instantaneous spectra	5
A potential relation trigger method for entity-relation quintuple extraction in text with excessive entities	5
Building a text retrieval system for the Sanskrit language: Exploring indexing, stemming, and searching issues	5
Talking-heads attention-based knowledge representation for link prediction	5
A knowledge-augmented heterogeneous graph convolutional network for aspect-level multimodal sentiment analysis	5
Towards better Chinese-centric neural machine translation for low-resource languages	5
Two evaluations on Ontology-style relation annotations	5
UniKDD: A Unified Generative model for Knowledge-driven Dialogue	5
Direct enhancement of pre-trained speech embeddings for speech processing in noisy conditions	5
An optimal approach for text feature selection	4
Uncertainty-aware non-autoregressive neural machine translation	4
Rep-MCA-former: An efficient multi-scale convolution attention encoder for text-independent speaker verification	4
EMGVox-GAN: A transformative approach to EMG-based speech synthesis, enhancing clarity, and efficiency via extensive dataset utilization	4
How to make embeddings suitable for PLDA	4
Editorial Board	4
Cross-lingual multi-speaker speech synthesis with limited bilingual training data	4
A novel word sense disambiguation approach using WordNet knowledge graph	4
On significance of constant-Q transform for pop noise detection	4
Copiously Quote Classics: Improving Chinese Poetry Generation with historical allusion knowledge	4
A semi-supervised high-quality pseudo labels algorithm based on multi-constraint optimization for speech deception detection	4
Speaking to remember: Model-based adaptive vocabulary learning using automatic speech recognition	4
Exploring intrinsic information content models for addressing the issues of traditional semantic measures to evaluate verb similarity	4
Language-independent extractive automatic text summarization based on automatic keyword extraction	4
Neural referential form selection: Generalisability and interpretability	4
Predicting children’s perceived reading proficiency with prosody modeling	4
A code-mixed task-oriented dialog dataset for medical domain	4
Editorial Board	3
Editorial Board	3
Editorial Board	3
Spoken language interaction with robots: Recommendations for future research	3
Self-feeding training method for semi-supervised grammatical error correction	3
Knowledge-grounded dialogue modelling with dialogue-state tracking, domain tracking, and entity extraction	3
Multi-task learning neural framework for categorizing sexism	3
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition	3
What’s so complex about conversational speech? A comparison of HMM-based and transformer-based ASR architectures	3
Enhancing Turkish Coreference Resolution: Insights from deep learning, dropped pronouns, and multilingual transfer learning	3
An analysis of machine learning models for sentiment analysis of Tamil code-mixed data	3
Multi-level context features extraction for named entity recognition	3
Editorial Board	3
Knowledge-enhanced meta-prompt for few-shot relation extraction	3
Sequential routing framework: Fully capsule network-based speech recognition	3
LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech	3
Overlapped Speech Detection and speaker counting using distant microphone arrays	3
Modelling child comprehension: A case of suffixal passive construction in Korean	3
Demystifying large language models in second language development research	3
Dereverberation of autoregressive envelopes for far-field speech recognition	3
An experimental review of speaker diarization methods with application to two-speaker conversational telephone speech recordings	3
Spectral–temporal saliency masks and modulation tensorgrams for generalizable COVID-19 detection	3
Train from scratch: Single-stage joint training of speech separation and recognition	3
TadaStride: Using time adaptive strides in audio data for effective downsampling	3
Discriminating speech traits of Alzheimer's disease assessed through a corpus of reading task for Spanish language	3
M-Sim: Multi-level Semantic Inference Model for Chinese short answer scoring in low-resource scenarios	3
New research on monaural speech segregation based on quality assessment	3
Analysis of Instantaneous Frequency Components of Speech Signals for Epoch Extraction	3
Single-channel speech enhancement using colored spectrograms	3
Deep learning-based speaker-adaptive postfiltering with limited adaptation data for embedded text-to-speech synthesis systems	3
Combining context-relevant features with multi-stage attention network for short text classification	3
Character expression for spoken dialogue systems with semi-supervised learning using Variational Auto-Encoder	3
Speaker anonymization by modifying fundamental frequency and x-vector singular value	3
COMPASS: A creative support system that alerts novelists to the unnoticed missing contents	3
An automatic Alzheimer’s disease classifier based on spontaneous spoken English	3
Cross-lingual transfer learning for relation extraction using Universal Dependencies	3
A CBR-based conversational architecture for situational data management	2
Generation of Coherent Multi-Sentence Texts with a Coherence Mechanism	2
Feature learning for efficient ASR-free keyword spotting in low-resource languages	2
Prediction of speech intelligibility with DNN-based performance measures	2
The limits of the Mean Opinion Score for speech synthesis evaluation	2
A generalized decoding method for neural text generation	2
Editorial Board	2
RepSum: A general abstractive summarization framework with dynamic word embedding representation correction	2
Editorial Board	2
Supervised speech separation combined with adaptive beamforming	2
Replay spoof detection using energy separation based instantaneous frequency estimation from quadrature and in-phase components	2
Bispectral feature speech intelligibility assessment metric based on auditory model	2
The management of mental health in a smart medical dialogue system based on a two-stage attention speech enhancement module	2
Knowledge-aware audio-grounded generative slot filling for limited annotated data	2
Editorial Board	2
End-to-end neural systems for automatic children speech recognition: An empirical study	2
Glottal features for classification of phonation type from speech and neck surface accelerometer signals	2
CLIPMulti: Explore the performance of multimodal enhanced CLIP for zero-shot text classification	2
Hate speech detection on Twitter using transfer learning	2
Automatic detection of pharyngeal fricatives in cleft palate speech using acoustic features based on the vocal tract area spectrum	2
Exploring the ability of LLMs to classify written proficiency levels	2
HOTTEST: Hate and Offensive content identification in Tamil using Transformers and Enhanced STemming	2
Effectiveness of energy separation-based instantaneous frequency estimation for cochlear cepstral features for synthetic and voice-converted spoofed speech detection	2
Joint emotion label space modeling for affect lexica	2
Comparison of rule-based and data-driven approaches for syllabification of simple syllable languages and the effect of orthography	2
A novel and secured email classification using deep neural network with bidirectional long short-term memory	2
Local and non-local dependency learning and emergence of rule-like representations in speech data by deep convolutional generative adversarial networks	2
Multipath-guided heterogeneous graph neural networks for sequential recommendation	2
Speech intelligibility assessment of dysarthria using Fisher vector encoding	2
Automatic offline annotation of turn-taking transitions in task-oriented dialogue	2
Deep ad-hoc beamforming	2
Corrigendum to ‘Unsupervised sign language validation process based on hand-motion parameter clustering’ <Computer Speech & Language Volume 71, January 2022, 101256>	2
Incorporating external knowledge for text matching model	2
Editorial Board	2
Joint speaker diarization and speech recognition based on region proposal networks	2
Taking relations as known conditions: A tagging based method for relational triple extraction	2
Multi-level embeddings for processing Arabic social media contents	2