OOIR: Observatory of International Research

Papers

(The TQCC of Computer Speech and Language is 8. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)

Article	Citations
A language-agnostic model of child language acquisition	201
Stochastic Data-to-Text Generation Using Syntactic Dependency Information	106
Corpus and unsupervised benchmark: Towards Tagalog grammatical error correction	105
Room impulse response reshaping-based expectation–maximization in an underdetermined reverberant environment	70
Seq2Seq dynamic planning network for progressive text generation	64
Speech enhancement approach for body-conducted unvoiced speech based on Taylor–Boltzmann machines trained DNN	64
Automatic detection of behavioural codes in team interactions	62
KddRES: A Multi-level Knowledge-driven Dialogue Dataset for Restaurant Towards Customized Dialogue System	62
Identifying offensive memes in low-resource languages: A multi-modal multi-task approach using valence and arousal	56
Editorial Board	55
A method of phonemic annotation for Chinese dialects based on a deep learning model with adaptive temporal attention and a feature disentangling structure	54
Monotonic Gaussian regularization of attention for robust automatic speech recognition	53
Contextual emotion detection using ensemble deep learning	46
Complementary regional energy features for spoofed speech detection	45
PaSCoNT - Parallel Speech Corpus of Northern-central Thai for automatic speech recognition	41
Multi-branch feature aggregation based on multiple weighting for speaker verification	41
Unsupervised question-retrieval approach based on topic keywords filtering and multi-task learning	38
Improving low-resource machine transliteration by using 3-way transfer learning	33
Misogynistic attitude detection in YouTube comments and replies: A high-quality dataset and algorithmic models	32
Editorial Board	29
Maximal activation weighted memory for aspect based sentiment analysis	27
Editorial Board	27
Unsupervised sign language validation process based on hand-motion parameter clustering	27
Perceptions and reactions to conversational privacy initiated by a conversational user interface	27
Augmentative and alternative speech communication (AASC) aid for people with dysarthria	26

Adversarial subsequences for unconditional text generation	25
A hybrid approach to Natural Language Inference for the SICK dataset	24
Enhancing analysis of diadochokinetic speech using deep neural networks	24
Exploring accidental triggers of smart speakers	23
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning	21
Combining replay and LoRA for continual learning in natural language understanding	21
A transformer-based spelling error correction framework for Bangla and resource scarce Indic languages	21
Improving self-supervised learning model for audio spoofing detection with layer-conditioned embedding fusion	21
Preserving the beamforming effect for spatial cue-based pseudo-binaural dereverberation of a single source	19
Loanword identification based on web resources: A case study on wikipedia	18
The use of Active Learning systems for stimulus selection and response modelling in perception experiments	18
Symbolic and Statistical Learning Approaches to Speech Summarization: A Scoping Review	18
A lightweight approach based on prompt for few-shot relation extraction	18
A mobile application using automatic speech analysis for classifying Alzheimer's disease and mild cognitive impairment	17
Enhancing Arabic aspect-based sentiment analysis using deep learning models	16
Representation learning strategies to model pathological speech: Effect of multiple spectral resolutions	16
Conversations in the wild: Data collection, automatic generation and evaluation	15
Editorial Board	15
English–Assamese neural machine translation using prior alignment and pre-trained language model	15
Unsupervised induction of inflectional families	15
Effects of cross-cultural language differences on social cognition during human-agent interaction in cooperative game environments	14
A novel channel estimate for noise robust speech recognition	14
Dialect Identification using Chroma-Spectral Shape Features with Ensemble Technique	14
Evidence and Axial Attention Guided Document-level Relation Extraction	13
Editorial Board	13
Adjustable deterministic pseudonymization of speech	13
Meta adversarial learning improves low-resource speech recognition	13
Zero-Shot Strike: Testing the generalisation capabilities of out-of-the-box LLM models for depression detection	12
SecNLP: An NLP classification model watermarking framework based on multi-task learning	12
A multi-label emoji classification method using balanced pointwise mutual information-based feature selection	12
A bias evaluation solution for multiple sensitive attribute speech recognition	12
MPSA-DenseNet: A novel deep learning model for English accent classification	12
Editorial Board	12
Editorial Board	12
A flexible BERT model enabling width- and depth-dynamic inference	11
Improved relation extraction through key phrase identification using community detection on dependency trees	11
Named entity recognition using neural language model and CRF for Hindi language	11
Addressing subjectivity in paralinguistic data labeling for improved classification performance: A case study with Spanish-speaking Mexican children using data balancing and semi-supervised learning	11
Phase sensitive masking-based single channel speech enhancement using conditional generative adversarial network	10
Towards inclusive automatic speech recognition	10
FinD: Fine-grained discrepancy-based fake news detection enhanced by event abstract generation	10
Effective infant cry signal analysis and reasoning using IARO based leaky Bi-LSTM model	10
Neural multi-task learning for end-to-end Arabic aspect-based sentiment analysis	10
Towards a unified assessment framework of speech pseudonymisation	10
Offensive language detection in Tamil YouTube comments by adapters and cross-domain knowledge transfer	10
A computational analysis of transcribed speech of people living with dementia: The Anchise 2022 Corpus	10
Editorial Board	9
Improving BERT with local context comprehension for multi-turn response selection in retrieval-based dialogue systems	9
Prototypical networks relation classification model based on entity convolution	9
Towards detecting the level of trust in the skills of a virtual assistant from the user’s speech	9

Enhancing accuracy and privacy in speech-based depression detection through speaker disentanglement	9
GenCeption: Evaluate vision LLMs with unlabeled unimodal data	9
Generating identities with mixture models for speaker anonymization	9
Objective and subjective evaluation of speech enhancement methods in the UDASE task of the 7th CHiME challenge	9
Detection of vowel transition regions from Hindi language	9
Conversation Initiation of Mothers, Fathers, and Toddlers in their Natural Home Environment	9
A tag-based methodology for the detection of user repair strategies in task-oriented conversational agents	9
A review of speaker diarization: Recent advances with deep learning	9
Test-retest reliability of acoustic and linguistic measures of speech tasks	8
End-to-End Speech-to-Text Translation: A Survey	8
A neural network approach for speech enhancement and noise-robust bandwidth extension	8
Arabic speech recognition by end-to-end, modular systems and human	8
Multiple time-instances features based approach for reference-free speech quality measurement	8
A closer look at reinforcement learning-based automatic speech recognition	8
Investigations on speech recognition systems for low-resource dialectal Arabic–English code-switching speech	8
Generative adversarial networks for speech processing: A review	8
Automated grapheme-to-phoneme conversion for Central Kurdish based on optimality theory	8
Hate speech and offensive language detection in Dravidian languages using deep ensemble framework	8
Refining the evaluation of speech synthesis: A summary of the Blizzard Challenge 2023	8
Adaptive feature extraction for entity relation extraction	8
Empirical Mode Decomposition articulation feature extraction on Parkinson’s Diadochokinesia	8
Automatic speaker independent dysarthric speech intelligibility assessment system	8
Towards lifelong human assisted speaker diarization	8
Evaluating voice-assistant commands for dementia detection	8