Computer Speech and Language

Papers
(The TQCC of Computer Speech and Language is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2020-05-01 to 2024-05-01.)
ArticleCitations
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech139
A review of speaker diarization: Recent advances with deep learning99
Attention-based BiLSTM fused CNN with gating mechanism model for Chinese long text classification84
Turn-taking in Conversational Systems and Human-Robot Interaction: A Review67
Bayesian HMM clustering of x-vector sequences (VBx) in speaker diarization: Theory, implementation and analysis on standard tasks61
Deep reinforcement and transfer learning for abstractive text summarization: A review55
Transfer learning from adult to children for speech recognition: Evaluation, analysis and recommendations50
Hate speech detection on Twitter using transfer learning46
Human evaluation of automatically generated text: Current trends and best practice guidelines41
Combining context-relevant features with multi-stage attention network for short text classification40
Spoken language interaction with robots: Recommendations for future research39
Deep learning based multi-source localization with source splitting and its effectiveness in multi-talker speech recognition39
Non-negative matrix factorization-based time-frequency feature extraction of voice signal for Parkinson's disease prediction38
Generalized end-to-end detection of spoofing attacks to automatic speaker recognizers37
Enhancing Arabic aspect-based sentiment analysis using deep learning models37
Linguistic features and automatic classifiers for identifying mild cognitive impairment and dementia36
Multilingual stance detection in social media political debates36
Adversarial attack and defense strategies for deep speaker recognition systems35
Part-of-speech tagging for Arabic tweets using CRF and Bi-LSTM32
Hate speech and offensive language detection in Dravidian languages using deep ensemble framework31
Generative adversarial networks for speech processing: A review31
MuST-C: A multilingual corpus for end-to-end speech translation29
Emotion recognition in low-resource settings: An evaluation of automatic feature selection methods28
tax2vec: Constructing Interpretable Features from Taxonomies for Short Text Classification27
Trajectory-based recognition of dynamic Persian sign language using hidden Markov model26
An automatic Alzheimer’s disease classifier based on spontaneous spoken English26
BERT syntactic transfer: A computational experiment on Italian, French and English languages26
Advances in subword-based HMM-DNN speech recognition across languages26
The VoicePrivacy 2020 Challenge: Results and findings25
Automatic assessment of intelligibility in speakers with dysarthria from coded telephone speech using glottal features24
Investigations on speech recognition systems for low-resource dialectal Arabic–English code-switching speech23
Arabic speech recognition by end-to-end, modular systems and human23
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer22
A survey on automatic speech recognition systems for Portuguese language and its variations20
Detection of replay spoof speech using teager energy feature cues19
Optimization of the area under the ROC curve using neural network supervectors for text-dependent speaker verification18
TOP-Rank: A TopicalPostionRank for Extraction and Classification of Keyphrases in Text18
BERT-hLSTMs: BERT and hierarchical LSTMs for visual storytelling17
Voice spoofing detection corpus for single and multi-order audio replays17
Comprehensive analysis of aspect term extraction methods using various text embeddings16
A Korean named entity recognition method using Bi-LSTM-CRF and masked self-attention16
Verbal fluency in normal aging and cognitive decline: Results of a longitudinal study16
Analysis and classification of speech sounds of children with autism spectrum disorder using acoustic features16
Vocal tract shaping of emotional speech16
Transfer fine-tuning of BERT with phrasal paraphrases16
Replay spoofing countermeasure using autoencoder and siamese networks on ASVspoof 2019 challenge16
Analysis of gender and identity issues in depression detection on de-identified speech16
Sequence labeling to detect stuttering events in read speech15
Evaluating voice-assistant commands for dementia detection15
Named entity recognition using neural language model and CRF for Hindi language15
Improving the potential of Enhanced Teager Energy Cepstral Coefficients (ETECC) for replay attack detection14
Cluster-based beam search for pointer-generator chatbot grounded by knowledge14
On the effect of dropping layers of pre-trained transformer models14
Towards the first Maithili part of speech tagger: Resource creation and system development14
Representation transfer learning from deep end-to-end speech recognition networks for the classification of health states from speech14
NEC-TT System for Mixed-Bandwidth and Multi-Domain Speaker Recognition14
A Bayesian end-to-end model with estimated uncertainties for simple question answering over knowledge bases13
The automatic detection of heart failure using speech signals13
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network13
X-vector anonymization using autoencoders and adversarial training for preserving speech privacy13
Assessing Parkinson's disease severity using speech analysis in non-native speakers13
Recurrent neural network language generation for spoken dialogue systems13
Overview of the seventh Dialog System Technology Challenge: DSTC713
A novel word sense disambiguation approach using WordNet knowledge graph13
A multi-label emoji classification method using balanced pointwise mutual information-based feature selection11
Discriminating speech traits of Alzheimer's disease assessed through a corpus of reading task for Spanish language11
Low resource end-to-end spoken language understanding with capsule networks11
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning11
Low-resource text classification using domain-adversarial learning11
A speaker verification backend with robust performance across conditions11
Deep generative variational autoencoding for replay spoof detection in automatic speaker verification11
HOTTEST: Hate and Offensive content identification in Tamil using Transformers and Enhanced STemming10
End-to-end neural systems for automatic children speech recognition: An empirical study10
Multilingual and unsupervised subword modeling for zero-resource languages10
Hybrid-task learning for robust automatic speech recognition10
An analysis of machine learning models for sentiment analysis of Tamil code-mixed data10
Siamese networks for large-scale author identification9
13 years of speaker recognition research at BUT, with longitudinal analysis of NIST SRE9
Language-independent extractive automatic text summarization based on automatic keyword extraction9
QBSUM: A large-scale query-based document summarization dataset from real-world applications9
Towards a speech therapy support system based on phonological processes early detection9
Leveraging Linguistic Context in Dyadic Interactions to Improve Automatic Speech Recognition for Children8
Dialect Identification using Chroma-Spectral Shape Features with Ensemble Technique8
Replay spoof detection using energy separation based instantaneous frequency estimation from quadrature and in-phase components8
Towards a unified assessment framework of speech pseudonymisation8
Acoustic and articulatory analysis and synthesis of shouted vowels8
A classification benchmark for Arabic alphabet phonemes with diacritics in deep neural networks8
Natural language processing for under-resourced languages: Developing a Welsh natural language toolkit8
An online multi-source summarization algorithm for text readability in topic-based search8
Joint emotion label space modeling for affect lexica8
Automatic speaker independent dysarthric speech intelligibility assessment system8
Perceptions and reactions to conversational privacy initiated by a conversational user interface8
Exploring neural models for predicting dementia from language8
Prediction of speech intelligibility with DNN-based performance measures8
Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis8
0.040491819381714