Journal of Educational and Behavioral Statistics

Papers
(The median citation count of the Journal of Educational and Behavioral Statistics is 1. The table below lists the papers at or above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2021-06-01 to 2025-06-01.)
Article | Citations
Acknowledgments | 33
Handling Missing Data in Cross-Classified Multilevel Analyses: An Evaluation of Different Multiple Imputation Approaches | 25
Bayesian Change-Point Analysis Approach to Detecting Aberrant Test-Taking Behavior Using Response Times | 16
Comparison of Within- and Between-Series Effect Estimates in the Meta-Analysis of Multiple Baseline Studies | 15
Using MLP-F in Three Different Aberrant Behaviors in Education | 14
Statistical Power for Estimating Treatment Effects Using Difference-in-Differences and Comparative Interrupted Time Series Estimators With Variation in Treatment Timing | 14
A Causal Latent Transition Model With Multivariate Outcomes and Unobserved Heterogeneity: Application to Human Capital Development | 13
Analyzing Polytomous Test Data: A Comparison Between an Information-Based IRT Model and the Generalized Partial Credit Model | 12
Chance-Constrained Automated Test Assembly | 12
Measurement and Uncertainty Preserving Parametric Modeling for Continuous Latent Variables With Discrete Indicators and External Variables | 10
Using Response Times for Joint Modeling of Careless Responding and Attentive Response Styles | 9
Analyzing Cross-Sectionally Clustered Data Using Generalized Estimating Equations | 9
A General Mixture Model for Cognitive Diagnosis | 9
Speed–Accuracy Trade-Off? Not So Fast: Marginal Changes in Speed Have Inconsistent Relationships With Accuracy in Real-World Settings | 9
A Comparison of Latent Semantic Analysis and Latent Dirichlet Allocation in Educational Measurement | 8
Using Extant Data to Improve Estimation of the Standardized Mean Difference | 8
Latent Transition Cognitive Diagnosis Model With Covariates: A Three-Step Approach | 7
Power Analyses for Estimation of Complier Average Causal Effects Under Random Encouragement Designs in Education Research: Theory and Guidance | 7
Sample Size Calculation and Optimal Design for Multivariate Regression-Based Norming | 7
Commentary on “Obtaining Interpretable Parameters From Reparameterized Longitudinal Models: Transformation Matrices Between Growth Factors in Two Parameter Spaces” | 7
Introduction to the JEBS Special Section on Artificial Intelligence in Educational Statistics | 7
IRT Models for Learning With Item-Specific Learning Parameters | 6
Using Item Scores and Distractors to Detect Item Compromise and Preknowledge | 6
Utilizing Real-Time Test Data to Solve Attenuation Paradox in Computerized Adaptive Testing to Enhance Optimal Design | 6
Identifying Informative Predictor Variables With Random Forests | 6
A Two-Stage Regression Approach to Detecting Section Score Inconsistency | 6
Nonparametric Classification Method for Multiple-Choice Items in Cognitive Diagnosis | 5
Assessing Inter-rater Reliability With Heterogeneous Variance Components Models: Flexible Approach Accounting for Contextual Variables | 5
AI and Psychometrics: Epistemology, Process, and Politics | 5
Jenss–Bayley Latent Change Score Model With Individual Ratio of the Growth Acceleration in the Framework of Individual Measurement Occasions | 4
A Two-Level Adaptive Test Battery | 4
New Iterative Algorithms for Estimation of Item Functioning | 4
A Randomization P-Value Test for Detecting Copying on Multiple-Choice Exams | 4
A Simple Technique Assessing Ordinal and Disordinal Interaction Effects | 4
Improving Accuracy and Stability of Aggregate Student Growth Measures Using Empirical Best Linear Prediction | 3
Inferring Individual Attributes Using Testlet-Based Visual Analogue Scaling and Beta Copula Diagnostic Classification Models | 3
Reporting Proficiency Levels for Examinees With Incomplete Data | 3
Harnessing AI for Educational Measurement: Standards and Emerging Frontiers | 3
Combining Human and Automated Scoring Methods in Experimental Assessments of Writing: A Case Study Tutorial | 3
Using Regularized Methods to Validate Q-Matrix in Cognitive Diagnostic Assessment | 3
Finding the Right Grain-Size for Measurement in the Classroom | 3
Obtaining Interpretable Parameters From Reparameterized Longitudinal Models: Transformation Matrices Between Growth Factors in Two Parameter Spaces | 3
Using Permutation Tests to Identify Statistically Sound and Nonredundant Sequential Patterns in Educational Event Sequences | 3
Smoothing of Bivariate Test Score Distributions: Model Selection Targeting Test Score Equating | 3
A Within-Group Approach to Ensemble Machine Learning Methods for Causal Inference in Multilevel Studies | 3
How Do We Demonstrate AI Responsibility: The Devil Is in the Details | 2
A Hybrid EM Algorithm for Linear Two-Way Interactions With Missing Data | 2
Evaluating Intersectional Fairness in Algorithmic Decision Making Using Intersectional Differential Algorithmic Functioning | 2
Using Ordering Theory to Learn Attribute Hierarchies From Examinees’ Attribute Profiles | 2
Modeling Item-Level Heterogeneous Treatment Effects With the Explanatory Item Response Model: Leveraging Large-Scale Online Assessments to Pinpoint the Impact of Educational Interventions | 2
Computational Strategies and Estimation Performance With Bayesian Semiparametric Item Response Theory Models | 2
Using the Bayesian Network’s Structural Learning Algorithm to Estimate the Q-Matrix in Cognitive Diagnosis Models | 2
Deep Reinforcement Learning for Adaptive Learning Systems | 2
Two Statistical Tests for the Detection of Item Compromise | 2
Three-Part Random Effect Models for Longitudinal Skewed Survey Data With “Not Applicable” Responses | 2
Item Pool Quality Control in Educational Testing: Change Point Model, Compound Risk, and Sequential Detection | 2
Bayesian Q Matrix Estimation of Saturated Diagnostic Classification Models Using NIMBLE | 2
Fuzzy Regression Discontinuity Designs With Multiple Control Groups Under One-Sided Noncompliance: Evaluating Extended Time Accommodations | 2
An Improved Satterthwaite (1941, 1946) Effective df Approximation | 2
Predictive Performance of Bayesian Stacking in Multilevel Education Data | 2
Development of a High-Accuracy and Effective Online Calibration Method in CD-CAT Based on Gini Index | 2
Mean Comparisons of Many Groups in the Presence of DIF: An Evaluation of Linking and Concurrent Scaling Approaches | 2
Detecting Item Preknowledge Using Revisits With Speed and Accuracy | 2
Reviewer Acknowledgments | 2
Bayesian Adaptive Lasso for the Detection of Differential Item Functioning in Graded Response Models | 2
Alternatives to Weighted Item Fit Statistics for Establishing Measurement Invariance in Many Groups | 1
A Diagnostic Tree Model for Adaptive Assessment of Complex Cognitive Processes Using Multidimensional Response Options | 1
Expertise on Offer: Why Isn’t Anyone Buying? | 1
Cognitive Diagnosis Modeling Incorporating Response Times and Fixation Counts: Providing Comprehensive Feedback and Accurate Diagnosis | 1
Disentangling Person-Dependent and Item-Dependent Causal Effects: Applications of Item Response Theory to the Estimation of Treatment Effect Heterogeneity | 1
Profiles in Research: Lawrence J. Hubert | 1
Erratum to Identifying Informative Predictor Variables With Random Forests | 1
Forced-Choice Ranking Models for Raters’ Ranking Data | 1
Testing Differential Item Functioning Without Predefined Anchor Items Using Robust Regression | 1
Diagnosing Primary Students’ Reading Progression: Is Cognitive Diagnostic Computerized Adaptive Testing the Way Forward? | 1
Assessing Item Fit Using Expected Score Curve Under Restricted Recalibration | 1
Extending the Cluster Approach to Differential Item Functioning in Polytomous Items | 1
Analyzing Longitudinal Social Relations Model Data Using the Social Relations Structural Equation Model | 1
Automatic Text Classification With Large Language Models: A Review of openai for Zero- and Few-Shot Classification | 1
Generalizing Beyond the Test: Permutation-Based Profile Analysis for Explaining DIF Using Item Features | 1
Improving the Estimation of Site-Specific Effects and Their Distribution in Multisite Trials | 1
Using Response Times in Answer Similarity Analysis | 1
Editorial | 1
Optimizing Diagnostic Classification Models Application Considering Real-Life Constraints | 1
Bayesian Analysis Methods for Two-Level Diagnosis Classification Models | 1
Evaluating Psychometric Differences Between Fast Versus Slow Responses on Rating Scale Items | 1
Deep Learning Imputation for Asymmetric and Incomplete Likert-Type Items | 1
Regression Discontinuity Designs With an Ordinal Running Variable: Evaluating the Effects of Extended Time Accommodations for English-Language Learners | 1
What Is Actually Equated in “Test Equating”? A Didactic Note | 1
Acknowledgments | 1
A Position-Sensitive Mixture Item Response Model | 1
Exploiting Network Information to Disentangle Spillover Effects in a Field Experiment on Teens’ Museum Attendance | 1
A Critical View on the NEAT Equating Design: Statistical Modeling and Identifiability Problems | 1
Model Misspecification and Robustness of Observed-Score Test Equating Using Propensity Scores | 1