IEEE Transactions on Software Engineering

Papers
(The TQCC of IEEE Transactions on Software Engineering is 15. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-06-01 to 2026-06-01.)
ArticleCitations
50 Years of Transactions on Software Engineering588
Question Selection for Multimodal Code Search Synthesis Using Probabilistic Version Spaces369
Computation Tree Logic Guided Program Repair259
Confirmation Bias and Time Pressure: A Family of Experiments in Software Testing140
Shield Broken: Black-Box Adversarial Attacks on LLM-Based Vulnerability Detectors135
Towards Scalable Model Checking of Reflective Systems via Labeled Transition Systems130
Can We Trust the Phone Vendors? Comprehensive Security Measurements on the Android Firmware Ecosystem130
Influence of the 1990 IEEE TSE Paper “Automated Software Test Data Generation” on Software Engineering127
Just-in-Time Prediction of Software Architectural Changes Through Commit-Level Analyses127
Prevent: An Unsupervised Approach to Predict Software Failures in Production114
Enhancing Mobile App Bug Reporting via Real-Time Understanding of Reproduction Steps114
The Why, When, What, and How About Predictive Continuous Integration: A Simulation-Based Investigation111
A Retrospective on Whole Test Suite Generation: On the Role of SBST in the Age of LLMs107
Enhancing Project-Specific Code Completion by Inferring Internal API Information100
Deobfuscation of Control Flow Flattening Based on Abstract Interpretation99
Combining Genetic Programming and Model Checking to Generate Environment Assumptions87
Are Your Dependencies Code Reviewed?: Measuring Code Review Coverage in Dependency Updates86
To Do or Not to Do: Semantics and Patterns for Do Activities in UML PSSM State Machines85
Answering Uncertain, Under-Specified API Queries Assisted by Knowledge-Aware Human-AI Dialogue84
Recommending API Function Calls and Code Snippets to Support Software Development80
Mission Specification Patterns for Mobile Robots: Providing Support for Quantitative Properties79
Efficiently Testing Distributed Systems via Abstract State Space Prioritization77
A Declarative Metamorphic Testing Framework for Autonomous Driving76
Automatic Fairness Testing of Neural Classifiers Through Adversarial Sampling75
Visibility of Domain Elements in the Elicitation Process Interviews: A Family of Empirical Studies74
Do as You Say: Consistency Detection of Data Practice in Program Code and Privacy Policy in Mini-App74
DSSDPP: Data Selection and Sampling Based Domain Programming Predictor for Cross-Project Defect Prediction73
Advanced Smart Contract Vulnerability Detection via LLM-Powered Multi-Agent Systems72
How Composite Metamorphic Relations Enhance Test Effectiveness of DNN Testing: An Empirical Study72
Tackling Expressive Feature-Modeling Constructs with Pseudo-Boolean d-DNNF Compilation67
Multi-Granularity Detector for Vulnerability Fixes66
Socio-Technical Grounded Theory for Software Engineering66
MalElves: Reinforcement Learning-Driven Adversarial Example Generation for Evading Cross-Platform ELF Malware Detection66
Enhancing Protocol Fuzzing via Diverse Seed Corpus Generation65
2023 Reviewers List62
Towards a Cognitive Model of Dynamic Debugging: Does Identifier Construction Matter?62
Mutation Testing in Practice: Insights From Open-Source Software Developers61
Esale: Enhancing Code-Summary Alignment Learning for Source Code Summarization57
Mole: Efficient Crash Reproduction in Android Applications With Enforcing Necessary UI Events56
T-Evos: A Large-Scale Longitudinal Study on CI Test Execution and Failure56
A Wizard of Oz Study Simulating API Usage Dialogues With a Virtual Assistant56
Detecting Malicious Packages in PyPI and NPM by Clustering Installation Scripts55
Trace Diagnostics for Signal-Based Temporal Properties54
P-NPR: Practical Neural Program Repair via Learning to Ensemble54
MASTER: Multi-Source Transfer Weighted Ensemble Learning for Multiple Sources Cross-Project Defect Prediction54
Measuring the Fidelity of a Physical and a Digital Twin Using Trace Alignments54
Robust Test Selection for Deep Neural Networks53
Boosting Compiler Fault Localization: Getting the Best of Both Worlds by Fusing Dynamic and Historical Data52
A Theory of Pending Schemas in Combinatorial Testing52
An Empirical Study of Refactoring Rhythms and Tactics in the Software Development Process52
Mask–Mediator–Wrapper Architecture as a Data Mesh Driver51
An Empirical Study of Software Refactorings in Real-World Open-Source Java Projects50
Efficient State Identification for Finite State Machine-Based Testing49
Multi-Objective Software Defect Prediction via Multi-Source Uncertain Information Fusion and Multi-Task Multi-View Learning49
Neural Library Recommendation by Embedding Project-Library Knowledge Graph47
Systematic Evaluation and Usability Analysis of Formal Methods Tools for Railway Signaling System Design47
GenMorph: Automatically Generating Metamorphic Relations via Genetic Programming47
Multimodal Fusion for Android Malware Detection Based on Large Pre-Trained Models47
Automated Code Editing With Search-Generate-Modify45
A Systematic Review of IoT Systems Testing: Objectives, Approaches, Tools, and Challenges44
δ-SCALPEL: Docker Image Slimming Based on Source Code Static Analysis44
Mitigating False Positive Static Analysis Warnings: Progress, Challenges, and Opportunities43
Program Synthesis for Cyber-Resilience43
Towards Automated Discovery of Asymmetric Mempool DoS in Blockchains43
MBL-CPDP: A Multi-Objective Bilevel Method for Cross-Project Defect Prediction42
An Empirical Study of Parameter-Efficient Fine-Tuning in Code Change Learning and Beyond42
Evolutionary generation of test suites for multi-path coverage of MPI programs with non-determinism42
AC2Next: A Novel Model That Can Predict the Next Animation API by Fusing the Animation API Context and the UI Animation Task41
Generalized Coverage Criteria for Combinatorial Sequence Testing41
Triple Peak Day: Work Rhythms of Software Developers in Hybrid Work41
A Faceted Taxonomy of Requirements Changes in Agile Contexts40
Evaluating and Improving GPT-Based Expansion of Abbreviations40
TransformCode: A Contrastive Learning Framework for Code Embedding via Subtree Transformation40
Annotative Software Product Line Analysis Using Variability-Aware Datalog40
Context-Aware Personalized Crowdtesting Task Recommendation40
API2Vec++: Boosting API Sequence Representation for Malware Detection and Classification39
An Experience Report on Producing Verifiable Builds for Large-Scale Commercial Systems39
Formal Equivalence Checking for Mobile Malware Detection and Family Classification39
Weighted Community Division for Automated Software Architecture Refactoring39
Decision Support for Selecting Blockchain-Based Application Design Patterns With Layered Taxonomy and Quality Attributes38
On the Understandability of MLOps System Architectures38
Leveraging Large Language Model for Automatic Patch Correctness Assessment37
Human-in-the-Loop Automatic Program Repair37
Discovering Reusable Functional Features in Legacy Object-Oriented Systems37
How Should Software Engineering Secondary Studies Include Grey Material?37
Pull Request Decisions Explained: An Empirical Overview37
Pathidea: Improving Information Retrieval-Based Bug Localization by Re-Constructing Execution Paths Using Logs37
LLMorpheus: Mutation Testing Using Large Language Models36
EpiTESTER: Testing Autonomous Vehicles With Epigenetic Algorithm and Attention Mechanism35
Legion: Massively Composing Rankers for Improved Bug Localization at Adobe35
Specializing Neural Networks for Cryptographic Code Completion Applications34
DiffGAN: A Test Generation Approach for Differential Testing of Deep Neural Networks for Image Analysis34
What Drives and Sustains Self-Assignment in Agile Teams34
From Tea Leaves to System Maps: A Survey and Framework on Context-Aware Machine Learning Monitoring34
Test Flakiness Across Programming Languages34
Continuously Managing NFRs: Opportunities and Challenges in Practice33
When Voice Meets Touch: Conflict Analysis in Mobile Applications33
Evaluating and Improving Unified Debugging33
Self-Admitted GenAI Usage in Open-Source Software33
Exploring and Analyzing Software Architecture Refactoring in Practice33
“Estimating Software Project Effort Using Analogies”: Reflections After 28 Years33
Automated Refactoring of Non-Idiomatic Python Code With Pythonic Idioms33
Microservice Extraction Based on a Comprehensive Evaluation of Logical Independence and Performance32
Beyond Functional Correctness: Exploring Hallucinations in LLM-Generated Code32
Revisiting Test Impact Analysis in Continuous Testing From the Perspective of Code Dependencies32
Typestate-Based Fault Localization of API Usage Violations in a Deep Learning Program32
Just-In-Time Obsolete Comment Detection and Update32
Automated Commit Message Generation With Large Language Models: An Empirical Study and Beyond31
How Do Developers Structure Unit Test Cases? An Empirical Analysis of the AAA Pattern in Open Source Projects31
Empirical Validation of Automated Vulnerability Curation and Characterization31
Nighthawk: Fully Automated Localizing UI Display Issues via Visual Understanding30
Practitioners’ Expectations on Log Anomaly Detection30
Data Quality Matters: A Case Study on Data Label Correctness for Security Bug Report Prediction30
Detecting Continuous Integration Skip Commits Using Multi-Objective Evolutionary Search30
Cost-Effective Adversarial Attacks Against Code LLM With Model Attention30
Increasing the Confidence of Deep Neural Networks by Coverage Analysis29
A Study About the Knowledge and Use of Requirements Engineering Standards in Industry29
SigRec: Automatic Recovery of Function Signatures in Smart Contracts29
Mind the Gap! A Study on the Transferability of Virtual Versus Physical-World Testing of Autonomous Driving Systems29
Evaluation of Static Vulnerability Detection Tools With Java Cryptographic API Benchmarks29
Retrieval-Augmented Fine-Tuning for Improving Retrieve-and-Edit Based Assertion Generation29
Towards Exploring Developers’ Struggles in Developing Upgradeable Smart Contracts28
The Analysis of Safety Critical Software Systems28
DAppSCAN: Building Large-Scale Datasets for Smart Contract Weaknesses in DApp Projects28
Understanding the Robustness of Transformer-Based Code Intelligence via Code Transformation: Challenges and Opportunities27
Improving Cross-Language Code Clone Detection via Code Representation Learning and Graph Neural Networks27
Predictive Comment Updating With Heuristics and AST-Path-Based Neural Learning: A Two-Phase Approach26
A Systematic Study on Real-World Android App Bundles26
Are You Still Working on This? An Empirical Study on Pull Request Abandonment26
Reaching Software Quality for Bioinformatics Applications: How Far Are We?26
Large-Scale Empirical Analysis of Continuous Fuzzing: Insights from 1 Million Fuzzing Sessions26
From Executable Specifications to Hard-to-Specify Requirements: Challenges in Describing Reactive System Behavior26
The Impact of Prompt Programming on Function-Level Code Generation26
Deconstructing the Nature of Collaboration in Organizations Open Source Software Development: The Impact of Developer and Task Characteristics25
AdaptGen: A Problem-Adaptive Solution Template Generation Technique for Online Programming Platforms25
STRE: An Automated Approach to Suggesting App Developers When to Stop Reading Reviews25
Does AI Code Review Lead to Code Changes? A Case Study of GitHub Actions25
Causality-Aware Safety Testing for Autonomous Driving Systems25
On the Validity of Pre-Trained Transformers for Natural Language Processing in the Software Engineering Domain24
FlakyFix: Using Large Language Models for Predicting Flaky Test Fix Categories and Test Code Repair24
Provably Valid and Diverse Mutations of Real-World Media Data for DNN Testing24
Cross-Language Taint Analysis: Generating Caller-Sensitive Native Code Specification for Java24
CRPWarner: Warning the Risk of Contract-Related Rug Pull in DeFi Smart Contracts23
Diversity-Oriented Testing for Competitive Game Agent via Constraint-Guided Adversarial Agent Training23
Causes and Canonicalization of Unreproducible Builds in Java23
iTCRL: Causal-Intervention-Based Trace Contrastive Representation Learning for Microservice Systems23
Effect of Requirements Analyst Experience on Elicitation Effectiveness: A Family of Quasi-Experiments23
Line-Level Defect Prediction by Capturing Code Contexts With Graph Convolutional Networks23
A Variability Fault Localization Approach for Software Product Lines23
The Power of Small LLMs: A Multi-Agent for Code Generation via Dynamic Precaution Tuning23
Parameterized Verification of Leader/Follower Systems via Arithmetic Constraints23
Assessing Evaluation Metrics for Neural Test Oracle Generation23
NumScout: Unveiling Numerical Defects in Smart Contracts Using LLM-Pruning Symbolic Execution23
Misactivation-Aware Stealthy Backdoor Attacks on Neural Code Understanding Models23
Subgraph-Oriented Testing for Deep Learning Libraries23
Domain-Driven Design for Microservices: An Evidence-Based Investigation22
A Grounded Theory of Cross-Community SECOs: Feedback Diversity Versus Synchronization22
Beyond the Sum of Parts: Leveraging Entanglement for Bug Inducing Commit Localization22
Beyond Literal Meaning: Uncover and Explain Implicit Knowledge in Code Through Wikipedia-Based Concept Linking22
Automated Use-After-Free Detection and Exploit Mitigation: How Far Have We Gone?22
Towards Robust Detection for Malicious Injection Variants22
FCGHunter: Towards Evaluating Robustness of Graph-Based Android Malware Detection22
Forecasting the Principal of Code Technical Debt in JavaScript Applications22
Mithra: Anomaly Detection as an Oracle for Cyberphysical Systems22
Do Pretrained Language Models Indeed Understand Software Engineering Tasks?22
CodeS+: Towards Assessing the Generalization Ability of Code Models Under Distribution Shift21
Unearthing Gas-Wasting Code Smells in Smart Contracts With Large Language Models21
SmartOracle: Generating Smart Contract Oracle via Fine-Grained Invariant Detection21
Syntactic Versus Semantic Similarity of Artificial and Real Faults in Mutation Testing Studies21
How Templated Requirements Specifications Inhibit Creativity in Software Engineering21
ArchHypo: Managing Software Architecture Uncertainty Using Hypotheses Engineering21
Range Specification Bug Detection in Flight Control System Through Fuzzing21
Let’s Talk With Developers, Not About Developers: A Review of Automatic Program Repair Research21
Hashing Fuzzing: Introducing Input Diversity to Improve Crash Detection21
A Comparison of Natural Language Understanding Platforms for Chatbots in Software Engineering21
Automated Infrastructure as Code Program Testing21
DyCITO+: Scalable Deep Reinforcement Learning for Generating Class Integration Test Orders of Java Programs20
Boosting Generalizable Fairness With Mahalanobis Distances Guided Boltzmann Exploratory Testing20
A Little Help Goes a Long Way: Tutoring LLMs in Solving Competitive Programming Through Hints20
Practical Mutation Testing at Scale: A view from Google20
Stakeholder Preference Extraction From Scenarios20
Clopper-Pearson Algorithms for Efficient Statistical Model Checking Estimation20
Retrospective on: Constraint-Based Automatic Test Data Generation20
PopArt: Ranked Testing Efficiency20
Translating to a Low-Resource Language with Compiler Feedback: A Case Study on Cangjie20
Does Treatment Adherence Impact Experiment Results in TDD?20
Studying the Influence and Distribution of the Human Effort in a Hybrid Fitness Function for Search-Based Model-Driven Engineering19
The “Question Neighbourhood” Approach for Systematic Evaluation of Code-Generating LLMs19
Active Code Learning: Benchmarking Sample-Efficient Training of Code Models19
Accelerating Finite State Machine-Based Testing Using Reinforcement Learning19
PATEN: Identifying Unpatched Third-Party APIs via Fine-Grained Patch-Enhanced AST-Level Signature19
Engineering Within Boundaries When Software Has None19
SCAnoGenerator: Automatic Anomaly Injection for Ethereum Smart Contracts19
Do Chase Your Tail! Missing Key Aspects Augmentation in Textual Vulnerability Descriptions of Long-Tail Software Through Feature Inference19
Verification of Fuzzy Decision Trees19
Evaluating SZZ Implementations: An Empirical Study on the Linux Kernel19
Emerging App Issue Identification via Online Joint Sentiment-Topic Tracing19
Runtime Evolution of Bitcoin's Consensus Rules19
Learning to Predict User-Defined Types19
Generating Structurally Realistic Models With Deep Autoregressive Networks18
Stealthy Backdoor Attack for Code Models18
Concretization of Abstract Traffic Scene Specifications Using Metaheuristic Search18
An Assessment of Rules of Thumb for Software Phase Management, and the Relationship Between Phase Effort and Schedule Success18
Examiner-Pro: Testing Arm Emulators Across Different Privileges18
Bridging Bug Localization and Issue Fixing: A Hierarchical Localization Framework Leveraging Large Language Models18
Onboarding Software Professionals in a Hybrid World18
A Retrospective of Proving the Correctness of Multiprocess Programs18
AddressWatcher: Sanitizer-Based Localization of Memory Leak Fixes18
DaNuoYi: Evolutionary Multitask Injection Testing on Web Application Firewalls18
Dealing With Data Challenges When Delivering Data-Intensive Software Solutions18
Towards More Precise Coincidental Correctness Detection With Deep Semantic Learning18
The Human Side of Software Engineering Teams: An Investigation of Contemporary Challenges18
Software Testing With Large Language Models: Survey, Landscape, and Vision18
Enforcing Correctness of Collaborative Business Processes Using Plans18
Multitask-Based Evaluation of Open-Source LLM on Software Vulnerability18
Finding Trends in Software Research17
Static Profiling of Alloy Models17
A Framework for Evaluating GenAI Adoption and Use in Software Engineering17
Active Learning of Discriminative Subgraph Patterns for API Misuse Detection17
Investigating the Feasibility of Conducting Webcam-Based Eye-Tracking Studies in Code Comprehension17
How Toxic Can You Get? Search-Based Toxicity Testing for Large Language Models17
Malo in the Code Jungle: Explainable Fault Localization for Decentralized Applications17
A Procedure to Continuously Evaluate Predictive Performance of Just-In-Time Software Defect Prediction Models During Software Development17
A Search-Based Testing Approach for Deep Reinforcement Learning Agents17
On the Workflows and Smells of Leaderboard Operations (LBOps): An Exploratory Study of Foundation Model Leaderboards17
Fast and Precise Static Null Exception Analysis With Synergistic Preprocessing17
Isolating Compiler Faults Through Differentiated Compilation Configurations17
Darcy: Automatic Architectural Inconsistency Resolution in Java17
RNN-Test: Towards Adversarial Testing for Recurrent Neural Network Systems17
How Software Developers Mitigate Their Errors When Developing Code17
A Framework for Emotion-Oriented Requirements Change Handling in Agile Software Engineering17
MultiPL-E: A Scalable and Polyglot Approach to Benchmarking Neural Code Generation16
PackHunter: Recovering Missing Packages for C/C++ Projects16
State of the Journal16
Safety and Performance, Why Not Both? Bi-Objective Optimized Model Compression Against Heterogeneous Attacks Toward AI Software Deployment16
Vulnerability Detection via Multiple-Graph-Based Code Representation16
Neural Transfer Learning for Repairing Security Vulnerabilities in C Code16
FlexFL: Flexible and Effective Fault Localization With Open-Source Large Language Models16
Let's Go to the Whiteboard (Again): Perceptions From Software Architects on Whiteboard Architecture Meetings16
DT4LM: Differential Testing for Reliable Language Model Updates in Classification Tasks16
Consistent Local-First Software: Enforcing Safety and Invariants for Local-First Applications16
Distilling Quality Enhancing Comments From Code Reviews to Underpin Reviewer Recommendation16
Factors Affecting On-Time Delivery in Large-Scale Agile Software Development16
Using Symbolic States to Infer Numerical Invariants16
A Retrospective on Mining Version Histories to Guide Software Changes16
OpCodeBERT: A Method for Python Code Representation Learning by BERT With Opcode16
Software Architecture Description Revisited16
A Systematical Study on Application Performance Management Libraries for Apps16
Pride: Prioritizing Documentation Effort Based on a PageRank-Like Algorithm and Simple Filtering Rules15
Retrospective: Data Mining Static Code Attributes to Learn Defect Predictors15
0.06608510017395