Transactions of the Association for Computational Linguistics

Papers
(The median citation count of Transactions of the Association for Computational Linguistics is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-08-01 to 2025-08-01.)
ArticleCitations
The Ethics of Automating Legal Actors309
Cross-functional Analysis of Generalization in Behavioral Learning197
From Robustness to Improved Generalization and Calibration in Pre-trained Language Models147
Segmentation-Free Streaming Machine Translation136
Transformers for Tabular Data Representation: A Survey of Models and Applications130
Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection100
State of What Art? A Call for Multi-Prompt LLM Evaluation84
T 2 -NER: A Two-Stage Span-Based Framework for Unified Named Entity Recognition with Templates82
Erasure of Unaligned Attributes from Neural Representations78
Revisiting Meta-evaluation for Grammatical Error Correction74
A Survey of Text Games for Reinforcement Learning Informed by Natural Language72
The Flores-101 Evaluation Benchmark for Low-Resource and Multilingual Machine Translation67
Aggretriever: A Simple Approach to Aggregate Textual Representations for Robust Dense Passage Retrieval64
A Survey on Automated Fact-Checking47
DEAR: Disentangled Event-Agnostic Representation Learning for Early Fake News Detection46
Uncertainty Estimation and Reduction of Pre-trained Models for Text Regression45
Time-Aware Language Models as Temporal Knowledge Bases42
Bridging the Gap between Synthetic and Natural Questions via Sentence Decomposition for Semantic Parsing41
Do Multi-Document Summarization Models Synthesize?40
Learning More from Mixed Emotions: A Label Refinement Method for Emotion Recognition in Conversations40
Context-Aware Machine Translation with Source Coreference Explanation39
Retrieval-Pretrained Transformer: Long-range Language Modeling with Self-retrieval39
Learning English with Peppa Pig39
Benchmarking the Generation of Fact Checking Explanations37
To Diverge or Not to Diverge: A Morphosyntactic Perspective on Machine Translation vs Human Translation33
Federated Learning for Exploiting Annotators’ Disagreements in Natural Language Processing33
Compositional Evaluation on Japanese Textual Entailment and Similarity32
How to Dissect a Muppet: The Structure of Transformer Embedding Spaces31
Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation29
Culturally Aware and Adapted NLP: A Taxonomy and a Survey of the State of the Art28
Few-Shot Multilingual Open-Domain QA from Five Examples28
Quality at a Glance: An Audit of Web-Crawled Multilingual Datasets26
An Energy-based Model for Word-level AutoCompletion in Computer-aided Translation25
Scientia Potentia Est—On the Role of Knowledge in Computational Argumentation25
Communication Drives the Emergence of Language Universals in Neural Agents: Evidence from the Word-order/Case-marking Trade-off25
Morphology Without Borders: Clause-Level Morphology25
True Few-Shot Learning with Prompts—A Real-World Perspective25
ProoFVer: Natural Logic Theorem Proving for Fact Verification24
Conformal Prediction for Natural Language Processing: A Survey23
Questions Are All You Need to Train a Dense Passage Retriever22
PADA: Example-based Prompt Learning for on-the-fly Adaptation to Unseen Domains20
Toward Robust RALMs: Revealing the Impact of Imperfect Retrieval on Retrieval-Augmented Language Models20
Break, Perturb, Build: Automatic Perturbation of Reasoning Paths Through Question Decomposition19
Template-based Abstractive Microblog Opinion Summarization19
Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale19
Navigating the Landscape of Hint Generation Research: From the Past to the Future18
InSCIt: Information-Seeking Conversations with Mixed-Initiative Interactions18
Adapting to the Long Tail: A Meta-Analysis of Transfer Learning Research for Language Understanding Tasks18
Improving Probability-based Prompt Selection Through Unified Evaluation and Analysis18
Navigating Cultural Chasms: Exploring and Unlocking the Cultural POV of Text-To-Image Models18
Are Character-level Translations Worth the Wait? Comparing ByT5 and mT5 for Machine Translation17
Canine: Pre-training an Efficient Tokenization-Free Encoder for Language Representation17
Retrieve What You Need: A Mutual Learning Framework for Open-domain Question Answering17
Interactive Machine Teaching by Labeling Rules and Instances16
OpenFact: Factuality Enhanced Open Knowledge Extraction14
Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models13
Sense-specific Historical Word Usage Generation13
Neuron-level Interpretation of Deep NLP Models: A Survey13
Robust Pronoun Fidelity with English LLMs: Are they Reasoning, Repeating, or Just Biased?13
Efficient Long-Text Understanding with Short-Text Models13
Addressing the Binning Problem in Calibration Assessment through Scalar Annotations13
MENLI: Robust Evaluation Metrics from Natural Language Inference12
Structural Persistence in Language Models: Priming as a Window into Abstract Language Representations12
ABNIRML: Analyzing the Behavior of Neural IR Models12
A Confidence-based Acquisition Model for Self-supervised Active Learning and Label Correction12
Pre-train, Prompt, and Recommendation: A Comprehensive Survey of Language Modeling Paradigm Adaptations in Recommender Systems11
Investigating Critical Period Effects in Language Acquisition through Neural Language Models11
Explainable Abuse Detection as Intent Classification and Slot Filling11
Modeling Emotion Dynamics in Song Lyrics with State Space Models11
Learning Fair Representations via Rate-Distortion Maximization11
Is My Model Using the Right Evidence? Systematic Probes for Examining Evidence-Based Tabular Reasoning11
NLP Security and Ethics, in the Wild10
Time-and-Space-Efficient Weighted Deduction10
How “Real” is Your Real-Time Simultaneous Speech-to-Text Translation System?10
TaxoPro: A Plug-In LoRA-based Cross-Domain Method for Low-Resource Taxonomy Completion10
Self-Rationalization in the Wild: A Large-scale Out-of-Distribution Evaluation on NLI-related tasks9
Rescue Conversations from Dead-ends: Efficient Exploration for Task-oriented Dialogue Policy Optimization9
Sub-Character Tokenization for Chinese Pretrained Language Models9
Evaluating Attribution in Dialogue Systems: The BEGIN Benchmark9
PaniniQA: Enhancing Patient Education Through Interactive Question Answering9
Data-to-text Generation with Variational Sequential Planning9
xcomet : Transparent Machine Translation Evaluation through Fine-grained Error Detection9
TANQ: An Open Domain Dataset of Table Answered Questions9
Helpful Neighbors: Leveraging Neighbors in Geographic Feature Pronunciation9
Patchwise Cooperative Game-based Interpretability Method for Large Vision-language Models9
Data-driven Parsing Evaluation for Child-Parent Interactions9
FeTaQA: Free-form Table Question Answering8
Bridging the Gap: A Survey on Integrating (Human) Feedback for Natural Language Generation8
End-to-end Argument Mining with Cross-corpora Multi-task Learning8
Assessing the Capacity of Transformer to Abstract Syntactic Representations: A Contrastive Analysis Based on Long-distance Agreement8
Benchmarking Large Language Models for News Summarization8
Not Eliminate but Aggregate: Post-Hoc Control over Mixture-of-Experts to Address Shortcut Shifts in Natural Language Understanding8
Know Your Limits: A Survey of Abstention in Large Language Models8
Large Language Models Enable Few-Shot Clustering8
Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages8
Visual Spatial Reasoning8
Decomposing and Recomposing Event Structure8
Direct Speech Translation for Automatic Subtitling7
Diff-Explainer: Differentiable Convex Optimization for Explainable Multi-hop Inference7
Abstractive Meeting Summarization: A Survey7
Speak, Read and Prompt: High-Fidelity Text-to-Speech with Minimal Supervision7
Visually Grounded Speech Models Have a Mutual Exclusivity Bias7
A Cross-Linguistic Pressure for Uniform Information Density in Word Order7
QAmeleon: Multilingual QA with Only 5 Examples7
Scope Ambiguities in Large Language Models7
How Abstract Is Linguistic Generalization in Large Language Models? Experiments with Argument Structure7
Conformalizing Machine Translation Evaluation7
Evaluating Transformer Models and Human Behaviors on Chinese Character Naming7
CreoleVal: Multilingual Multitask Benchmarks for Creoles7
Hallucinations in Large Multilingual Translation Models7
Meta-Learning the Difference: Preparing Large Language Models for Efficient Adaptation7
Expectations over Unspoken Alternatives Predict Pragmatic Inferences6
Can Authorship Representation Learning Capture Stylistic Features?6
The Emergence of Argument Structure in Artificial Languages6
Causal Inference in Natural Language Processing: Estimation, Prediction, Interpretation and Beyond6
The Parallelism Tradeoff: Limitations of Log-Precision Transformers6
A Multi-Level Optimization Framework for End-to-End Text Augmentation6
Chinese Idiom Paraphrasing6
♫ MuSiQue: Multihop Questions via Single-hop Question Composition6
mGPT: Few-Shot Learners Go Multilingual5
Collective Human Opinions in Semantic Textual Similarity5
Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences5
Meta-Learning a Cross-lingual Manifold for Semantic Parsing5
Cultural Adaptation of Recipes5
Compositional Generalization in Multilingual Semantic Parsing over Wikidata5
Improving the Domain Adaptation of Retrieval Augmented Generation (RAG) Models for Open Domain Question Answering5
Robust Dialogue State Tracking with Weak Supervision and Sparse Data5
A Comparative Approach for Auditing Multilingual Phonetic Transcript Archives5
Lost in the Middle: How Language Models Use Long Contexts4
AfriSpeech-200: Pan-African Accented Speech Dataset for Clinical and General Domain ASR4
Comparing Humans and Large Language Models on an Experimental Protocol Inventory for Theory of Mind Evaluation (EPITOME)4
Ultra-fine Entity Typing with Indirect Supervision from Natural Language Inference4
How Much Semantic Information is Available in Large Language Model Tokens?4
Hierarchical Indexing for Retrieval-Augmented Opinion Summarization4
Exploring Contrast Consistency of Open-Domain Question Answering Systems on Minimally Edited Questions4
KoBBQ: Korean Bias Benchmark for Question Answering4
Document Summarization with Latent Queries4
Less is More: Mitigate Spurious Correlations for Open-Domain Dialogue Response Generation Models by Causal Discovery4
Shared Lexical Items as Triggers of Code Switching4
ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation4
FaithDial: A Faithful Benchmark for Information-Seeking Dialogue3
Unleashing the True Potential of Sequence-to-Sequence Models for Sequence Tagging and Structure Parsing3
Can Authorship Attribution Models Distinguish Speakers in Speech Transcripts?3
Sentence Similarity Based on Contexts3
Self-supervised Topic Taxonomy Discovery in the Box Embedding Space3
Saturated Transformers are Constant-Depth Threshold Circuits3
Naturalistic Causal Probing for Morpho-Syntax3
Do Text Simplification Systems Preserve Meaning? A Human Evaluation via Reading Comprehension3
Decision-Oriented Dialogue for Human-AI Collaboration3
Heterogeneous Supervised Topic Models3
Investigating Reasons for Disagreement in Natural Language Inference3
Optimal Transport Posterior Alignment for Cross-lingual Semantic Parsing3
Relational Memory-Augmented Language Models3
FRMT: A Benchmark for Few-Shot Region-Aware Machine Translation2
A Survey on Cross-Lingual Summarization2
ByT5: Towards a Token-Free Future with Pre-trained Byte-to-Byte Models2
Holmes ⌕ A Benchmark to Assess the Linguistic Competence of Language Models2
Multi-task Active Learning for Pre-trained Transformer-based Models2
Diverse AI Feedback For Large Language Model Alignment2
Data-driven Model Generalizability in Crosslinguistic Low-resource Morphological Segmentation2
A Closer Look at Classification Evaluation Metrics and a Critical Reflection of Common Evaluation Practice2
Why Does Surprisal From Larger Transformer-Based Language Models Provide a Poorer Fit to Human Reading Times?2
Calibrated Interpretation: Confidence Estimation in Semantic Parsing2
Automatically Correcting Large Language Models: Surveying the Landscape of Diverse Automated Correction Strategies2
Word Acquisition in Neural Language Models2
PASTA: A Dataset for Modeling PArticipant STAtes in Narratives2
TabVer: Tabular Fact Verification with Natural Logic2
Assessing the Role of Context in Chat Translation Evaluation: Is Context Helpful and Under What Conditions?2
Modeling Non-Cooperative Dialogue: Theoretical and Empirical Insights2
Eliciting the Translation Ability of Large Language Models via Multilingual Finetuning with Translation Instructions2
FINCH: Prompt-guided Key-Value Cache Compression for Large Language Models2
Retrieval-style In-context Learning for Few-shot Hierarchical Text Classification2
The Impact of Word Splitting on the Semantic Content of Contextualized Word Representations2
Discontinuous Combinatory Constituency Parsing2
Tracking Brand-Associated Polarity-Bearing Topics in User Reviews2
Fact Checking with Insufficient Evidence2
Do LLMs Exhibit Human-like Response Biases? A Case Study in Survey Design2
Neuro-symbolic Natural Logic with Introspective Revision for Natural Language Inference2
An Efficient Self-Supervised Cross-View Training For Sentence Embedding2
What Do Self-Supervised Speech Models Know About Words?2
Rank-Aware Negative Training for Semi-Supervised Text Classification2
Design Choices for Crowdsourcing Implicit Discourse Relations: Revealing the Biases Introduced by Task Design2
Explicitly Representing Syntax Improves Sentence-to-Layout Prediction of Unexpected Situations2
A Survey on Model Compression for Large Language Models2
On the Robustness of Dialogue History Representation in Conversational Question Answering: A Comprehensive Study and a New Prompt-based Method2
Reasoning over Public and Private Data in Retrieval-Based Systems2
Getting BART to Ride the Idiomatic Train: Learning to Represent Idiomatic Expressions2
Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval2
Temporal Effects on Pre-trained Models for Language Processing Tasks2
How Often Are Errors in Natural Language Reasoning Due to Paraphrastic Variability?2
Transformers as Transducers2
Hate Speech Classifiers Learn Normative Social Stereotypes2
General then Personal: Decoupling and Pre-training for Personalized Headline Generation2
MACSum: Controllable Summarization with Mixed Attributes2
L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models2
0.04304313659668