OOIR: Observatory of International Research

Papers

(The median citation count of Journal of Educational and Behavioral Statistics is 1. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-01-01 to 2026-01-01.)

Article	Citations
Acknowledgments	42
Bayesian Change-Point Analysis Approach to Detecting Aberrant Test-Taking Behavior Using Response Times	25
Handling Missing Data in Cross-Classified Multilevel Analyses: An Evaluation of Different Multiple Imputation Approaches	22
It’s Up for Debate: A Review of Seminal Ideas and Controversies in Statistics by Roderick J. A. Little LittleR. J. A.Seminal Ideas and Controversies in S	21
A Causal Latent Transition Model With Multivariate Outcomes and Unobserved Heterogeneity: Application to Human Capital Development	17
Using MLP-F in Three Different Aberrant Behaviors in Education	17
Measurement and Uncertainty Preserving Parametric Modeling for Continuous Latent Variables With Discrete Indicators and External Variables	15
Chance-Constrained Automated Test Assembly	14
Analyzing Polytomous Test Data: A Comparison Between an Information-Based IRT Model and the Generalized Partial Credit Model	11
Multiple Imputation to Estimate Hierarchical Models From Data Missing at Random: Latent Covariates, Random Coefficients, and Statistical Interactions	10
Statistical Power for Estimating Treatment Effects Using Difference-in-Differences and Comparative Interrupted Time Series Estimators With Variation in Treatment Timing	10
Using Response Times for Joint Modeling of Careless Responding and Attentive Response Styles	9
A General Mixture Model for Cognitive Diagnosis	9
Power Analyses for Estimation of Complier Average Causal Effects Under Random Encouragement Designs in Education Research: Theory and Guidance	8
Speed–Accuracy Trade-Off? Not So Fast: Marginal Changes in Speed Have Inconsistent Relationships With Accuracy in Real-World Settings	8
Using Extant Data to Improve Estimation of the Standardized Mean Difference	8
Commentary on “Obtaining Interpretable Parameters From Reparameterized Longitudinal Models: Transformation Matrices Between Growth Factors in Two Parameter Spaces”	7
Latent Transition Cognitive Diagnosis Model With Covariates: A Three-Step Approach	7
Using Robust Estimation Method to Improve Person Traits Assessment	7
Introduction to the JEBS Special Section on Artificial Intelligence in Educational Statistics	7
Sample Size Calculation and Optimal Design for Multivariate Regression-Based Norming	6
A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement	6
Identifying Informative Predictor Variables With Random Forests	6
Nonparametric Classification Method for Multiple-Choice Items in Cognitive Diagnosis	5
IRT Models for Learning With Item-Specific Learning Parameters	5

Utilizing Real-Time Test Data to Solve Attenuation Paradox in Computerized Adaptive Testing to Enhance Optimal Design	5
Using Item Scores and Distractors to Detect Item Compromise and Preknowledge	5
Assessing Inter-rater Reliability With Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables	5
A Two-Stage Regression Approach to Detecting Section Score Inconsistency	5
AI and Psychometrics: Epistemology, Process, and Politics	5
A Simple Technique Assessing Ordinal and Disordinal Interaction Effects	4
Jenss–Bayley Latent Change Score Model With Individual Ratio of the Growth Acceleration in the Framework of Individual Measurement Occasions	4
New Iterative Algorithms for Estimation of Item Functioning	4
Harnessing AI for Educational Measurement: Standards and Emerging Frontiers	4
A Two-Level Adaptive Test Battery	4
Improving Accuracy and Stability of Aggregate Student Growth Measures Using Empirical Best Linear Prediction	4
Inferring Individual Attributes Using Testlet-Based Visual Analogue Scaling and Beta Copula Diagnostic Classification Models	4
A Randomization P-Value Test for Detecting Copying on Multiple-Choice Exams	4
Combining Human and Automated Scoring Methods in Experimental Assessments of Writing: A Case Study Tutorial	4
Using Regularized Methods to Validate Q-Matrix in Cognitive Diagnostic Assessment	4
Using Permutation Tests to Identify Statistically Sound and Nonredundant Sequential Patterns in Educational Event Sequences	4
Predictive Performance of Bayesian Stacking in Multilevel Education Data	3
A Hybrid EM Algorithm for Linear Two-Way Interactions With Missing Data	3
Computational Strategies and Estimation Performance With Bayesian Semiparametric Item Response Theory Models	3
Maximum Information per Time Unit Designs for Calibrating Multidimensional Items Online	3
Smoothing of Bivariate Test Score Distributions: Model Selection Targeting Test Score Equating	3
How Do We Demonstrate AI Responsibility: The Devil Is in the Details	3
Two Statistical Tests for the Detection of Item Compromise	3
Three-Part Random Effect Models for Longitudinal Skewed Survey Data With “Not Applicable” Responses	3
Deep Reinforcement Learning for Adaptive Learning Systems	3
Using the Bayesian Network’s Structural Learning Algorithm to Estimate the Q-Matrix in Cognitive Diagnosis Models	3
A Bayesian Framework to Establish Validity Evidence for Multi-Unidimensional Instruments With Small Samples	3
A Generalized Forced-Choice Diagnostic Classification Model Based on Thurstone’s Law of Comparative Judgment in Noncognitive Test: Theory and Application	3
A Within-Group Approach to Ensemble Machine Learning Methods for Causal Inference in Multilevel Studies	3
Development of a High-Accuracy and Effective Online Calibration Method in CD-CAT Based on Gini Index	3
A Two-Parameter IRT Model With Ability-Based Guessing: Estimation Using the Stochastic EM Algorithm	3
An Improved Satterthwaite (1941, 1946) Effective df Approximation	3
Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models	3
Finding the Right Grain-Size for Measurement in the Classroom	3
Modeling Item-Level Heterogeneous Treatment Effects With the Explanatory Item Response Model: Leveraging Large-Scale Online Assessments to Pinpoint the Impact of Educational Interventions	2
Inspection-Guided Randomization: A Flexible and Transparent Restricted Randomization Framework for Better Experimental Design	2
Evaluating Intersectional Fairness in Algorithmic Decision Making Using Intersectional Differential Algorithmic Functioning	2
Plausible Values and Multilevel Models in Large-Scale Assessments	2
What Is Actually Equated in “Test Equating”? A Didactic Note	2
Jointly Modeling Omitted and Not-Reached Items in Time-Limit Tests: A Survival Analysis Approach	2
Reviewer Acknowledgments	2
Fuzzy Regression Discontinuity Designs With Multiple Control Groups Under One-Sided Noncompliance: Evaluating Extended Time Accommodations	2
Detecting Item Preknowledge Using Revisits With Speed and Accuracy	2
A Diagnostic Tree Model for Adaptive Assessment of Complex Cognitive Processes Using Multidimensional Response Options	2
Expertise on Offer: Why Isn’t Anyone Buying?	2
Using Ordering Theory to Learn Attribute Hierarchies From Examinees’ Attribute Profiles	2
Bayesian Q Matrix Estimation of Saturated Diagnostic Classification Models Using NIMBLE	2
Cognitive Diagnosis Modeling Incorporating Response Times and Fixation Counts: Providing Comprehensive Feedback and Accurate Diagnosis	2
Evaluating Psychometric Differences Between Fast Versus Slow Responses on Rating Scale Items	2
Regression Discontinuity Designs With an Ordinal Running Variable: Evaluating the Effects of Extended Time Accommodations for English-Language Learners	2

Extending the Cluster Approach to Differential Item Functioning in Polytomous Items	2
Editorial	1
Assessing Item Fit Using Expected Score Curve Under Restricted Recalibration	1
Diagnosing Primary Students’ Reading Progression: Is Cognitive Diagnostic Computerized Adaptive Testing the Way Forward?	1
Flexible Bayesian Slice-Sampling Algorithms: An Illustration Using the Hierarchical Piecewise Constant Proportional Hazards Latent Trait Model	1
Testing Differential Item Functioning Without Predefined Anchor Items Using Robust Regression	1
Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items	1
A Critical View on the NEAT Equating Design: Statistical Modeling and Identifiability Problems	1
Bayesian Analysis Methods for Two-Level Diagnosis Classification Models	1
Using Response Times in Answer Similarity Analysis	1
Automatic Text Classification With Large Language Models: A Review of `openai` for Zero- and Few-Shot Classification	1
Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores	1
An Empirical Bayesian Approach for Testing the Fairness of Automated Scoring	1
Generalizing Beyond the Test: Permutation-Based Profile Analysis for Explaining DIF Using Item Features	1
Exploiting Network Information to Disentangle Spillover Effects in a Field Experiment on Teens’ Museum Attendance	1
Alternatives to Weighted Item Fit Statistics for Establishing Measurement Invariance in Many Groups	1
Disentangling Person-Dependent and Item-Dependent Causal Effects: Applications of Item Response Theory to the Estimation of Treatment Effect Heterogeneity	1
Optimizing Diagnostic Classification Models Application Considering Real-Life Constraints	1
Acknowledgments	1