ACM Transactions on Software Engineering and Methodology

Papers
(The median citation count of ACM Transactions on Software Engineering and Methodology is 3. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-10-01 to 2025-10-01.)
ArticleCitations
An empirical study on vulnerability disclosure management of open source software systems292
Assessing the Robustness of Test Selection Methods for Deep Neural Networks189
Preference-wise Testing of Android Apps via Test Amplification126
Mutant Reduction Evaluation: What is There and What is Missing?125
Test Generation Strategies for Building Failure Models and Explaining Spurious Failures98
Automatic Identification of Game Stuttering via Gameplay Videos Analysis89
Deceiving Humans and Machines Alike: Search-based Test Input Generation for DNNs Using Variational Autoencoders88
Understanding the OSS Communities of Deep Learning Frameworks: A Comparative Case Study of P y T orch and T ensor77
I Depended on You and You Broke Me: An Empirical Study of Manifesting Breaking Changes in Client Packages74
Reusing d-DNNFs for Efficient Feature-Model Counting72
Neuron Semantic-Guided Test Generation for Deep Neural Networks Fuzzing71
Securing the Ethereum from Smart Ponzi Schemes: Identification Using Static Features70
Horus : Accelerating Kernel Fuzzing through Efficient Host-VM Memory Access Procedures69
KAPE: k NN-based Performance Testing for Deep Code Search69
SPENCER: Self-Adaptive Model Distillation for Efficient Code Retrieval63
Finding Information Leaks with Information Flow Fuzzing—RCR Report62
TestLoop: A Process Model Describing Human-in-the-Loop Software Test Suite Generation62
FairGenerate: Enhancing Fairness Through Synthetic Data Generation and Two-Fold Biased Labels Removal61
Enhancing Android Malware Detection: The Influence of ChatGPT on Decision-centric Task53
Better Supporting Human Aspects in Mobile eHealth Apps: Development and Validation of Enhanced Guidelines52
FoC: Figure out the Cryptographic Functions in Stripped Binaries with LLMs51
A Survey on Failure Analysis and Fault Injection in AI Systems51
Bounded Verification of Atomicity Violations for Interrupt-Driven Programs via Lazy Sequentialization50
History-Driven Fuzzing for Deep Learning Libraries50
An Empirical Study of the Non-Determinism of ChatGPT in Code Generation49
Communicating Study Design Trade-offs in Software Engineering48
Model Transformation Development Using Automated Requirements Analysis, Metamodel Matching, and Transformation by Example47
Deep API Sequence Generation via Golden Solution Samples and API Seeds46
Introducing Interactions in Multi-Objective Optimization of Software Architectures46
Estimating Uncertainty in Labeled Changes by SZZ Tools on Just-In-Time Defect Prediction44
PVDetector: Pretrained Vulnerability Detection on Vulnerability-enriched Code Semantic Graph43
Fine-Tuning Large Language Models to Improve Accuracy and Comprehensibility of Automated Code Review40
Try with Simpler - An Evaluation of Improved Principal Component Analysis in Log-based Anomaly Detection39
FormatFuzzer : Effective Fuzzing of Binary File Formats39
JavaScript SBST Heuristics to Enable Effective Fuzzing of NodeJS Web APIs38
Toward Interpretable Graph Tensor Convolution Neural Network for Code Semantics Embedding36
Why Do Developers Reject Refactorings in Open-Source Projects?36
Do Current Language Models Support Code Intelligence for R Programming Language?35
Adopting Two Supervisors for Efficient Use of Large-Scale Remote Deep Neural Networks - RCR Report35
Assessing and Analyzing the Correctness of GitHub Copilot’s Code Suggestions35
Storage State Analysis and Extraction of Ethereum Blockchain Smart Contracts34
An Accurate Identifier Renaming Prediction and Suggestion Approach34
A Survey of Learning-based Automated Program Repair33
Help Them Understand: Testing and Improving Voice User Interfaces31
An Empirical Study on Governance in Bitcoin’s Consensus Evolution31
FAVDisco: Modeling and Discovering File Access Vulnerabilities30
A Comprehensive View on TD Prevention Practices and Reasons for Not Preventing It30
Towards Automating Domain-Specific Data Generation for Text-to-SQL: A Comprehensive Approach30
HeMiRCA: Fine-Grained Root Cause Analysis for Microservices with Heterogeneous Data Sources29
Enhancing Security and Acuity of Smart Contract Vulnerability Detection based on Federated Learning and BiLSTM-Attention29
Single and Multi-objective Test Cases Prioritization for Self-driving Cars in Virtual Environments29
Systematic Literature Review on Software Security Vulnerability Information Extraction29
I Know What You Are Searching for: Code Snippet Recommendation from Stack Overflow Posts29
Characterizing Deep Learning Package Supply Chains in PyPI: Domains, Clusters, and Disengagement28
A Systematic Literature Review of Multi-Label Learning in Software Engineering27
Supporting Emotional Intelligence, Productivity and Team Goals while Handling Software Requirements Changes27
Vulnerability Repair via Concolic Execution and Code Mutations27
Revisiting the Identification of the Co-evolution of Production and Test Code26
Why Do GitHub Actions Workflows Fail? An Empirical Study26
When Fine-Tuning LLMs Meets Data Privacy: An Empirical Study of Federated Learning in LLM-Based Program Repair26
AutoRIC: Automated Neural Network Repairing Based on Constrained Optimization26
An Empirical Study of Retrieval-Augmented Code Generation: Challenges and Opportunities26
Towards On-The-Fly Code Performance Profiling26
Editorial: Toward the Future with Eight Issues Per Year25
APIRO: A Framework for Automated Security Tools API Recommendation25
Reinforcement Learning Informed Evolutionary Search for Autonomous Systems Testing23
An Empirical Study on GitHub Pull Requests’ Reactions23
Editorial: ICSE and the Incredible Contradictions of Software Engineering23
Contemporary Software Modernization: Strategies, Driving Forces, and Research Opportunities23
Leveraging Reviewer Experience in Code Review Comment Generation23
GIST : Generated Inputs Sets Transferability in Deep Learning23
SimClone: Detecting Tabular Data Clones Using Value Similarity23
Assessing and Improving an Evaluation Dataset for Detecting Semantic Code Clones via Deep Learning23
Industry–Academia Research Collaboration and Knowledge Co-creation: Patterns and Anti-patterns22
Assessing the Early Bird Heuristic (for Predicting Project Quality)22
SimADFuzz: Simulation-Feedback Fuzz Testing for Autonomous Driving Systems22
SCOPE : Performance Testing for Serverless Computing22
PatchCensor: Patch Robustness Certification for Transformers via Exhaustive Testing22
On-the-fly Generation-Quality Enhancement of Deep Code Models via Model Collaboration22
Cleaning Up Confounding: Accounting for Endogeneity Using Instrumental Variables and Two-Stage Models22
Commit Messages Generation Based on Core Changes21
Security of Language Models for Code: A Systematic Literature Review21
Characterizing Installation- and Run-Time Compatibility Issues in Android Benign Apps and Malware21
A Characterization Study of Merge Conflicts in Java Projects21
You Don’t Have to Say Where to Edit! jLED—Joint Learning to Localize and Edit Source Code21
Demo2Test: Transfer Testing of Agent in Competitive Environment with Failure Demonstrations21
Feature Matching-based Approaches to Improve the Robustness of Android Visual GUI Testing21
SourcererJBF: A Java Build Framework For Large-Scale Compilation20
Towards Learning Generalizable Code Embeddings Using Task-agnostic Graph Convolutional Networks20
Simulator-based Explanation and Debugging of Hazard-triggering Events in DNN-based Safety-critical Systems20
Graphuzz: Data-driven Seed Scheduling for Coverage-guided Greybox Fuzzing20
Learning from Very Little Data: On the Value of Landscape Analysis for Predicting Software Project Health20
Demystifying Hidden Sensitive Operations in Android Apps19
Efficient Multivariate Time Series Anomaly Detection through Transfer Learning for Large-Scale Software Systems19
MR-Scout: Automated Synthesis of Metamorphic Relations from Existing Test Cases19
Exploring Fine-Grained Bug Report Categorization with Large Language Models and Prompt Engineering: An Empirical Study18
Exploring the Capabilities of LLMs for Code-Change-Related Tasks18
Test Input Prioritization for 3D Point Clouds18
Monitoring data for Anomaly Detection in Cloud-Based Systems: A Systematic Mapping Study18
Actor-Driven Decomposition of Microservices through Multi-level Scalability Assessment18
Beyond Fidelity: Explaining Vulnerability Localization of Learning-Based Detectors18
A Comprehensive Empirical Study of Bias Mitigation Methods for Machine Learning Classifiers18
Generation-based Differential Fuzzing for Deep Learning Libraries18
Improving Deep Assertion Generation via Fine-Tuning Retrieval-Augmented Pre-trained Language Models17
A Roadmap for Integrating Sustainability into Software Engineering Education17
Efficient Management of Containers for Software Defined Vehicles17
Developer Perspectives on Licensing and Copyright Issues Arising from Generative AI for Software Development17
Variable Renaming-Based Adversarial Test Generation for Code Model: Benchmark and Enhancement17
Coverage-directed Differential Testing of X.509 Certificate Validation in SSL/TLS Implementations17
Bypassing Guardrails: Lessons Learned from Red Teaming ChatGPT17
Autonomous Driving System Testing via Diversity-Oriented Driving Scenario Exploration17
Is It Hard to Generate Holistic Commit Message?16
Adaptive Modelling Languages: Abstract Syntax and Model Migration16
All in One: Design, Verification, and Implementation of SNOW-optimal Read Atomic Transactions16
Testing Causality in Scientific Modelling Software16
Evolution-Aware Constraint Derivation Approach for Software Remodularization16
Automating TODO-missed Methods Detection and Patching16
Interpreting Deep Neural Networks via Relative Activation-Deactivation Abstractions16
Fairness Concerns in App Reviews: A Study on AI-Based Mobile Apps16
Certified Cost Bounds for Abstract Programs16
Battling against Protocol Fuzzing: Protecting Networked Embedded Devices from Dynamic Fuzzers16
Adopting Two Supervisors for Efficient Use of Large-Scale Remote Deep Neural Networks16
Stress Testing Control Loops in Cyber-Physical Systems - RCR Report15
Time-travel Investigation: Toward Building a Scalable Attack Detection Framework on Ethereum15
PonziHunter: Hunting Ethereum Ponzi Contract via Static Analysis and Contrastive Learning on the Bytecode Level15
SPOLRE: Semantic Preserving Object Layout Reconstruction for Image Captioning System Testing15
MeDeT: Medical Device Digital Twins Creation with Few-shot Meta-learning15
Programming Smart Playtesting15
Enhancing Task In-Progress Time Predictions through Affective and Personality Factors15
Measuring and Clustering Heterogeneous Chatbot Designs15
Duplicate Bug Report Detection: How Far Are We?14
AdaptiveLog: An Adaptive Log Analysis Framework with the Collaboration of Large and Small Language Model14
OSS Effort Estimation Using Software Features Similarity and Developer Activity-Based Metrics14
Refactoring in Computational Notebooks14
AI for DevSecOps: A Landscape and Future Opportunities14
Fold2Vec: Towards a Statement-Based Representation of Code for Code Comprehension14
Differentiable Quantum Programming with Unbounded Loops14
An In-depth Study of Java Deserialization Remote-Code Execution Exploits and Vulnerabilities14
Automatic Core-Developer Identification on GitHub: A Validation Study14
Survey of Code Search Based on Deep Learning14
An Interleaving Guided Metamorphic Testing Approach for Concurrent Programs14
On the Impact of Lower Recall and Precision in Defect Prediction for Guiding Search-based Software Testing13
Preparation and Utilization of Mixed States for Testing Quantum Programs13
Visualization Task Taxonomy to Understand the Fuzzing Internals13
Can GitHub Issues Help in App Review Classifications?13
On the Significance of Category Prediction for Code-Comment Synchronization13
MORepair: Teaching LLMs to Repair Code via Multi-Objective Fine-Tuning13
Reputation Gaming in Crowd Technical Knowledge Sharing13
LogUpdater: Automated Detection and Repair of Specific Defects in Logging Statements13
CITYWALK : Enhancing LLM-Based C++ Unit Test Generation via Project-Dependency Awareness and Language-Specific Knowledge12
Let’s Discover More API Relations: A Large Language Model-Based AI Chain for Unsupervised API Relation Inference12
Studying the Impact of TensorFlow and PyTorch Bindings on Machine Learning Software Quality12
DiPri : Distance-Based Seed Prioritization for Greybox Fuzzing12
A Comparative Study on Method Comment and Inline Comment12
Exploring Development Methods for Reactive Synthesis Specifications12
Data Complexity: A New Perspective for Analyzing the Difficulty of Defect Prediction Tasks12
The Influence of Human Aspects on Requirements Engineering-related Activities: Software Practitioners’ Perspective12
Input Distribution Coverage: Measuring Feature Interaction Adequacy in Neural Network Testing12
Mitigating Regression Faults Induced by Feature Evolution in Deep Learning Systems12
Rise of Distributed Deep Learning Training in the Big Model Era: From a Software Engineering Perspective11
Obfuscated Clone Search in JavaScript based on Reinforcement Subsequence Learning11
Can Coverage Criteria Guide Failure Discovery for Image Classifiers? An Empirical Study11
A Systematic Literature Review on the Use of Deep Learning in Software Engineering Research11
NSFuzz: Towards Efficient and State-Aware Network Service Fuzzing11
How Do Successful and Failed Projects Differ? A Socio-Technical Analysis11
A Hypothesis Testing-based Framework for Software Cross-modal Retrieval in Heterogeneous Semantic Spaces11
Testing RESTful APIs: A Survey11
Large Language Models for Cyber Security: A Systematic Literature Review11
Understanding Real-Time Collaborative Programming: A Study of Visual Studio Live Share11
Automated Abstract Transformer Synthesis for Reduced Product Domains11
The IDEA of Us: An Identity-Aware Architecture for Autonomous Systems11
Software Engineering by and for Humans in an AI Era11
Towards Robustness of Deep Program Processing Models—Detection, Estimation, and Enhancement11
Identifying and Explaining Safety-critical Scenarios for Autonomous Vehicles via Key Features11
Towards Practical Binary Code Similarity Detection: Vulnerability Verification via Patch Semantic Analysis11
Guided Feature Identification and Removal for Resource-constrained Firmware11
Representation Learning for Stack Overflow Posts: How Far Are We?10
Grammar Mutation for Testing Input Parsers10
Fairness Testing of Machine Translation Systems10
Simulating Software Evolution to Evaluate the Reliability of Early Decision-making among Design Alternatives toward Maintainability10
Less Is More: Unlocking Semi-Supervised Deep Learning for Vulnerability Detection10
Exploring JVM Garbage Collector Testing with Event-Coverage10
Revisiting Sentiment Analysis for Software Engineering in the Era of Large Language Models10
Open Problems in Fuzzing RESTful APIs: A Comparison of Tools10
Toward Better Comprehension of Breaking Changes in the NPM Ecosystem10
BiRD: Race Detection in Software Binaries under Relaxed Memory Models10
Identifying Performance Issues in Cloud Service Systems Based on Relational-Temporal Features10
Revealing the Unseen: AI Chain on LLMs for Predicting Implicit Dataflows to Generate Dataflow Graphs in Dynamically Typed Code10
An Empirical Study on the Relationship Between Defects and Source Code’s Unnaturalness10
Test Oracle Generation for REST APIs10
Addressing OSS Community Managers’ Challenges in Contributor Retention10
The Havoc Paradox in Generator-Based Fuzzing — RCR Report10
Verification Witnesses10
Divide-and-Conquer: Automating Code Revisions via Localization-and-Revision10
Mobile Application Online Cross-Project Just-in-Time Software Defect Prediction Framework10
What Constitutes the Deployment and Runtime Configuration System? An Empirical Study on OpenStack Projects10
Decision Support Model for Selecting the Optimal Blockchain Oracle Platform: An Evaluation of Key Factors10
Large Language Model for Vulnerability Detection and Repair: Literature Review and the Road Ahead9
CCIHunter: Enhancing Smart Contract Code-Comment Inconsistencies Detection via Two-Stage Pre-training9
Model Driven Engineering, Artificial Intelligence, and DevOps for Software and Systems Engineering: A Systematic Mapping Study of Synergies and Challenges9
Making Software Development More Diverse and Inclusive: Key Themes, Challenges, and Future Directions9
Identifying Affected Third-Party Java Libraries from Textual Descriptions of Vulnerabilities and Libraries9
V ex IR2V ec : An Architecture-Neutral Embedding Framework for Binary Similarity9
Understanding Vulnerability Inducing Commits of the Linux Kernel9
Microservice Security Metrics for Secure Communication, Identity Management, and Observability9
SemMT: A Semantic-Based Testing Approach for Machine Translation Systems9
Editorial: The End of the Journey9
A Review of Learning-based Smart Contract Vulnerability Detection: A Perspective on Code Representation9
Sustainability in the Field of Software Engineering: A Tertiary Study9
Digital Twin-based Anomaly Detection with Curriculum Learning in Cyber-physical Systems9
From Triumph to Uncertainty: The Journey of Software Engineering in the AI Era9
What You See is What it Means! Semantic Representation Learning of Code based on Visualization and Transfer Learning9
Some Seeds Are Strong: Seeding Strategies for Search-based Test Case Selection9
Recommending Variable Names for Extract Local Variable Refactorings9
Finding Information Leaks with Information Flow Fuzzing9
How the Quality of Maintenance Tasks is Affected by Criteria for Selecting Engineers for Collaboration9
DRIVE: Dockerfile Rule Mining and Violation Detection9
Benchmarking and Categorizing the Performance of Neural Program Repair Systems for Java9
Influential Global and Local Contexts Guided Trace Representation for Fault Localization9
Prompt-based Code Completion via Multi-Retrieval Augmented Generation8
FQN Inference in Partial Code by Prompt-tuned Language Model of Code8
Large Language Model-Aware In-Context Learning for Code Generation8
Just-in-Time Detection of Silent Security Patches8
Software Security Analysis in 2030 and Beyond: A Research Roadmap8
Accountability in Code Review: The Role of Intrinsic Drivers and the Impact of LLMs8
ReCDroid+: Automated End-to-End Crash Reproduction from Bug Reports for Android Apps8
µOpTime: Statically Reducing the Execution Time of Microbenchmark Suites Using Stability Metrics8
A Road-Map to Readily Available Early Validation and Verification of System Behaviour in Model-Based Systems Engineering using Software Engineering Best Practices8
AceCoder : An Effective Prompting Technique Specialized in Code Generation8
Automated Identification of Toxic Code Reviews Using ToxiCR8
Rise of the Planet of Serverless Computing: A Systematic Review8
Enumerating Valid Non-Alpha-Equivalent Programs for Interpreter Testing8
The Good, the Bad, and the Missing: Neural Code Generation for Machine Learning Tasks8
Snippet Comment Generation Based on Code Context Expansion8
My Fuzzers Won’t Build: An Empirical Study of Fuzzing Build Failures8
Challenges of Working from Home in Software Development During Covid-19 Lockdowns8
Learning Software Bug Reports: A Systematic Literature Review8
AcTracer: Active Testing of Large Language Model via Multi-Stage Sampling8
Learning-based Relaxation of Completeness Requirements for Data Entry Forms8
Improving Automated Program Repair with Domain Adaptation8
Poracle: Testing Patches under Preservation Conditions to Combat the Overfitting Problem of Program Repair8
Improving Code Reviewer Recommendation: Accuracy, Latency, Workload, and Bystanders8
Bias Testing and Mitigation in LLM-based Code Generation7
ConE: A Concurrent Edit Detection Tool for Large-scale Software Development7
Requirements-Driven Automated Software Testing: A Systematic Review7
Privacy in Chatbot Conversation-Driven Development: A Comprehensive Review and Requirements Proposal7
Using Personality Detection Tools for Software Engineering Research: How Far Can We Go?7
FairFLRep: Fairness aware fault localization and repair of Deep Neural Networks7
deGraphCS : Embedding Variable-based Flow Graph for Neural Code Search7
Weakly Supervised Vulnerability Localization via Multiple Instance Learning7
1.1783759593964