Journal of Educational Measurement

Papers
(The median citation count of the Journal of Educational Measurement is 1. The table below lists the papers above that threshold, based on CrossRef citation counts [max. 250 papers]. It covers papers published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
Article | Citations
How Many Plausible Values? | 31
[no title] | 31
Leveraging Process Data and Variable Selection for Achievement Estimation in Large‐Scale Assessments | 19
Parameter Estimation in Comparative Judgment Under Random and Adaptive Scheduling Schemes | 19
Optimal Calibration of Items for Multidimensional Achievement Tests | 13
NCME Presidential Address 2022: Turning the Page to the Next Chapter of Educational Measurement | 11
A Statistical Test for the Detection of Item Compromise Combining Responses and Response Times | 10
Measuring the Uncertainty of Imputed Scores | 10
A Note on the Use of Categorical Subscores | 8
Comparing Data‐Driven Methods for Removing Options in Assessment Items | 8
The Precision and Bias of Cut Score Estimates from the Beuk Standard Setting Method | 8
Linking Error on Achievement Levels Accounting for Dependencies and Complex Sampling | 8
Using Linkage Sets to Improve Connectedness in Rater Response Model Estimation | 6
Historical Perspectives on Score Comparability Issues Raised by Innovations in Testing | 6
Automated Coding of Communications in Collaborative Problem‐Solving Tasks Using ChatGPT | 6
Issue Information | 6
Issue Information | 6
Briggs, Derek C. Historical and Conceptual Foundations of Measurement in the Human Sciences: Credos and Controversies | 5
A Quantitative Method for Evaluating the Predictive Utility of Linked Scores | 5
A Deterministic Gated Lognormal Response Time Model to Identify Examinees with Item Preknowledge | 5
Using Item Parameter Predictions for Reducing Calibration Sample Requirements—A Case Study Based on a High‐Stakes Admission Test | 5
Validity Arguments for AI‐Based Automated Scores: Essay Scoring as an Illustration | 5
[no title] | 4
Model Selection Posterior Predictive Model Checking via Limited‐Information Indices for Bayesian Diagnostic Classification Modeling | 4
Differential and Functional Response Time Item Analysis: An Application to Understanding Paper versus Digital Reading Processes | 4
Gender Bias in Test Item Formats: Evidence from PISA 2009, 2012, and 2015 Math and Reading Tests | 4
An Exponentially Weighted Moving Average Procedure for Detecting Back Random Responding Behavior | 4
Information Functions of Rank‐2PL Models for Forced‐Choice Questionnaires | 4
Issue Information | 4
Using Response Time in Multidimensional Computerized Adaptive Testing | 4
Parametric Bootstrap Mantel‐Haenszel Statistic for Aggregated Testlet Effects | 4
Likelihood‐Based Estimation of Model‐Derived Oral Reading Fluency | 4
DIF Detection for Multiple Groups: Comparing Three‐Level GLMMs and Multiple‐Group IRT Models | 4
Simultaneous Detection of Compromised Items and Examinees with Item Preknowledge in Online Assessments Using Response Time Data | 3
An Item Response Tree Model for Items with Multiple‐Choice and Constructed‐Response Parts | 3
Special Issue: Adaptive Testing in Large‐Scale Assessments | 3
A Generalized Objective Function for Computer Adaptive Item Selection | 3
Issue Information | 3
Utilizing Response Time for Item Selection in On‐the‐Fly Multistage Adaptive Testing for PISA Assessment | 3
Detecting Group Collaboration Using Multiple Correspondence Analysis | 3
Controlling the Speededness of Assembled Test Forms: A Generalization to the Three‐Parameter Lognormal Response Time Model | 3
[no title] | 3
Sensemaking of Process Data from Evaluation Studies of Educational Games: An Application of Cross‐Classified Item Response Theory Modeling | 3
Addressing Bias in Spoken Language Systems Used in the Development and Implementation of Automated Child Language‐Based Assessment | 3
Measuring the Impact of Peer Interaction in Group Oral Assessments with an Extended Many‐Facet Rasch Model | 2
Subscores: A Practical Guide to Their Production and Consumption. Shelby Haberman, Sandip Sinharay, Richard Feinberg, and Howard Wainer. Cambridge, Cambridge University Press, 2024, 176 pp. (paperback) | 2
Mapping out the Hexagon Measurement Framework as a Blueprint Underlying Measurement in the Human Sciences | 2
Cognitive Diagnostic Multistage Testing by Partitioning Hierarchically Structured Attributes | 2
[no title] | 2
Argument‐Based Approach to Validity: Developing a Living Document and Incorporating Preregistration | 2
[no title] | 2
Using Item Scores and Distractors in Person‐Fit Assessment | 2
Online Monitoring of Test‐Taking Behavior Based on Item Responses and Response Times | 2
Issue Information | 2
Detecting Multidimensional DIF in Polytomous Items with IRT Methods and Estimation Approaches | 2
MSAEM Estimation for Confirmatory Multidimensional Four‐Parameter Normal Ogive Models | 2
Generalizability Theory for Randomly Parallel Testing | 2
On the Choice of Parameters for the Lognormal Model for Response Times: Commentary on Becker et al. (2013) | 2
[no title] | 2
3PL with Ability‐Expression Gap: Modeling the Discrepancy between Latent and Expressed Ability | 2
Issue Information | 2
The Vulnerability of AI‐Based Scoring Systems to Gaming Strategies: A Case Study | 2
A Highly Adaptive Testing Design for PISA | 2
[no title] | 2
Betty Lanteigne, Christine Coombe, & James Dean Brown. 2021. Challenges in Language Testing around the World: Insights for language test users. Singapore: Springer, 2021, 129.99 € (hardcover), | 2
Comparing and Combining IRTree Models and Anchoring Vignettes in Addressing Response Styles | 2
Using GPT‐4 to Augment Imbalanced Data for Automatic Scoring | 2
Issue Information | 2
[no title] | 2
Evaluating the Consistency and Reliability of Attribution Methods in Automated Short Answer Grading (ASAG) Systems: Toward an Explainable Scoring System | 2
Issue Information | 2
Using Multilabel Neural Network to Score High‐Dimensional Assessments for Different Use Foci: An Example with College Major Preference Assessment | 2
Influence of Intersectional Routing Modules between Dimensions on Measurement Precision in Multidimensional Multistage Testing | 1
Using Keystroke Dynamics to Detect Nonoriginal Text | 1
A Topic Testlet Model for Calibrating Testlet Constructed Responses | 1
From Item Estimates to Test Operations: The Cascading Effect of Rapid Guessing | 1
Reckase, M. The Psychometrics of Standard Setting: Connecting Policy and Test Scores: First edition published 2023 by CRC Press, 6000 Broken Sound Parkway NW, Suite 300, Boca Raton, FL 33487‐274 | 1
Curvilinearity in the Reference Composite and Practical Implications for Measurement | 1
Exploring the Influence of Response Time Allocation on Item Revisiting: Implications for Test‐Taking Strategies in Cognitive Diagnostic Assessments | 1
A Dual‐Purpose Model for Binary Data: Estimating Ability and Misconceptions | 1
The Impact of Cheating on Score Comparability via Pool‐Based IRT Pre‐equating | 1
Validity Arguments Meet Artificial Intelligence in Innovative Educational Assessment: A Discussion and Look Forward | 1
Modeling the Intraindividual Relation of Ability and Speed within a Test | 1
Fully Gibbs Sampling Algorithms for Bayesian Variable Selection in Latent Regression Models | 1
Correction to “Using GPT‐4 to Augment Imbalanced Data for Automatic Scoring” | 1
[no title] | 1
Vertical Scaling with Moderated Nonlinear Factor Analysis | 1
Finding Words Associated with DIF: Predicting Differential Item Functioning Using LLMs and Explainable AI | 1
IRT Observed‐Score Equating for Rater‐Mediated Assessments Using a Hierarchical Rater Model | 1
Validity Arguments Meet Artificial Intelligence in Innovative Educational Assessment | 1
A Factor Mixture Model for Item Responses and Certainty of Response Indices to Identify Student Knowledge Profiles | 1
Robustness of Item Response Theory Models under the PISA Multistage Adaptive Testing Designs | 1
Modeling Hierarchical Attribute Structures in Diagnostic Classification Models with Multiple Attempts | 1
Using Exploratory Item Response Modeling to Identify the Hierarchical Structure of Anti‐Racism Value Stance Constructs among Teachers | 1
Simultaneous Detection of Cheaters and Compromised Items Using a Biclustering Approach | 1