Educational and Psychological Measurement

Papers
(The median citation count of Educational and Psychological Measurement is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2022-05-01 to 2026-05-01.)
ArticleCitations
Iterative Item Selection of Neighborhood Clusters: A Nonparametric and Non-IRT Method for Generating Miniature Computer Adaptive Questionnaires315
Collapsing Sparse Responses in Likert-Type Scale Data: Advantages and Disadvantages for Model Fit in CFA41
Assessing the Properties and Functioning of Model-Based Sum Scores in Multidimensional Measures With Local Item Dependencies: A Comprehensive Proposal28
Using Deep Reinforcement Learning to Decide Test Length26
Model Specification Searches in Structural Equation Modeling Using Bee Swarm Optimization19
Functional Approaches for Modeling Unfolding Data17
Detecting Preknowledge Cheating via Innovative Measures: A Mixture Hierarchical Model for Jointly Modeling Item Responses, Response Times, and Visual Fixation Counts16
Optimal Number of Replications for Obtaining Stable Dynamic Fit Index Cutoffs16
Using Item Scores and Response Times to Detect Item Compromise in Computerized Adaptive Testing16
Generalized Mantel–Haenszel Estimators for Simultaneous Differential Item Functioning Tests15
Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models14
Detecting Differential Item Functioning Using Response Time13
An Illustration of an IRTree Model for Disengagement13
An Omega-Hierarchical Extension Index for Second-Order Constructs With Hierarchical Measuring Instruments12
Reconceptualizing Scoring Reliability Through Linguistic Similarity12
Assessing the Speed–Accuracy Tradeoff in Psychological Testing Using Experimental Manipulations12
Improving the Use of Parallel Analysis by Accounting for Sampling Variability of the Observed Correlation Matrix10
An Explanatory Multidimensional Random Item Effects Rating Scale Model10
On the Benefits of Using Maximal Reliability in Educational and Behavioral Research10
Examination of ChatGPT’s Performance as a Data Analysis Tool10
Item Parameter Recovery: Sensitivity to Prior Distribution10
How to Improve the Regression Factor Score Predictor When Individuals Have Different Factor Loadings9
Rotation Local Solutions in Multidimensional Item Response Theory Models9
What Affects the Quality of Score Transformations? Potential Issues in True-Score Equating Using the Partial Credit Model9
Integrating Ensemble Clustering and Text Embeddings for Estimating the Factor Loadings of Self-Report Scales8
From Linear Geometry to Nonlinear and Information-Geometric Settings in Test Theory: Bregman Projections as a Unifying Framework8
Examining the Dynamic of Clustering Effects in Multilevel Designs: A Latent Variable Method Application8
The Impact and Detection of Uniform Differential Item Functioning for Continuous Item Response Models8
On the Complex Sources of Differential Item Functioning: A Comparison of Three Methods7
Measuring Unipolar Traits With Continuous Response Items: Some Methodological and Substantive Developments7
Agreement Lambda for Weighted Disagreement With Ordinal Scales: Correction for Category Prevalence7
Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates7
Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics6
Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items6
The One-Parameter Logistic Model Can Be True With Zero Probability for a Unidimensional Measuring Instrument: How One Could Go Wrong Removing Items Not Satisfying the Model6
A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning Within a Test6
Evaluating Model Fit of Measurement Models in Confirmatory Factor Analysis6
Examining the Instructional Sensitivity of Constructed-Response Achievement Test Item Scores6
Obtaining a Bayesian Estimate of Coefficient Alpha Using a Posterior Normal Distribution6
Detecting Rating Scale Malfunctioning With the Partial Credit Model and Generalized Partial Credit Model6
An Item Response Theory Model for Incorporating Response Times in Forced-Choice Measures5
Differential Item Functioning Effect Size Use for Validity Information5
Awareness Is Bliss: How Acquiescence Affects Exploratory Factor Analysis5
Investigating Heterogeneity in Response Strategies: A Mixture Multidimensional IRTree Approach5
An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models5
Discriminating Between Attribute, Item-Position, and Wording Effects by the Congeneric and Tau-Equivalent Confirmatory Factor Analysis Models5
Historical Measurement Information Can Be Used to Improve Estimation of Structural Parameters in Structural Equation Models With Small Samples5
Impacts of DIF Item Balance and Effect Size Incorporation With the Rasch Tree5
Using Multiple Imputation to Account for the Uncertainty Due to Missing Data in the Context of Factor Retention5
Modeling Misspecification as a Parameter in Bayesian Structural Equation Models5
A Small Sample Correction for Factor Score Regression4
Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?4
Evaluating the Performance of a Regularized Differential Item Functioning Method for Testlet-Based Polytomous Items4
Equidistant Response Options on Likert-Type Instruments: Testing the Interval Scaling Assumption Using Mplus4
Reducing Calibration Bias for Person Fit Assessment by Mixture Model Expansion4
Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment3
Is the Area Under Curve Appropriate for Evaluating the Fit of Psychometric Models?3
The Dominant Trait Profile Method of Scoring Multidimensional Forced-Choice Questionnaires3
Discriminant Validity of Interval Response Formats: Investigating the Dimensional Structure of Interval Widths3
From Agreement to Epistemic Alignment: A Signal Detection–Theoretic Model of Inter-Rater Reliability3
Two-Method Measurement Planned Missing Data With Purposefully Selected Samples3
Detecting Cheating in Large-Scale Assessment: The Transfer of Detectors to New Tests3
Overestimation of Internal Consistency by Coefficient Omega in Data Giving Rise to a Centroid-Like Factor Solution3
A Note on Evaluation of Polytomous Item Locations With the Rating Scale Model and Testing Its Fit3
Estimation of Conditional Standard Errors of Measurement for MLE Scores in MST3
Comparing Accuracy of Parallel Analysis and Fit Statistics for Estimating the Number of Factors With Ordered Categorical Data in Exploratory Factor Analysis3
On the Consistency of Automatic Scoring with Large Language Models3
Model-Based Person Fit Statistics Applied to the Wechsler Adult Intelligence Scale IV3
When Cluster-Robust Inferences Fail3
Field-Testing Multiple-Choice Questions With AI Examinees: English Grammar Items3
Scoring Graphical Responses in TIMSS 2019 Using Artificial Neural Networks2
Added Value of Subscores for Tests With Polytomous Items2
Investigating the Ordering Structure of Clustered Items Using Nonparametric Item Response Theory2
Dominance Analysis for Latent Variable Models: A Comparison of Methods With Categorical Indicators and Misspecified Models2
Evaluation of Residual-Based Fit Statistics for Item Response Theory Models in the Presence of Non-Responses2
Comparing RMSEA-Based Indices for Assessing Measurement Invariance in Confirmatory Factor Models2
Dimensionality Assessment in Forced-Choice Questionnaires: First Steps Toward an Exploratory Framework2
Why Forced-Choice and Likert Items Provide the Same Information on Personality, Including Social Desirability2
Evaluating Imputation-Based Fit Statistics in Structural Equation Modeling With Ordinal Data: The MI2S Approach2
The Impact of Sample Size and Various Other Factors on Estimation of Dichotomous Mixture IRT Models2
Estimating Trends With Differential Item Functioning: A Comparison of Five IRT-Based Approaches2
Supervised Classes, Unsupervised Mixing Proportions: Detection of Bots in a Likert-Type Questionnaire2
The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability2
On the Utility of Indirect Methods for Detecting Faking2
Interpretation of the Standardized Mean Difference Effect Size When Distributions Are Not Normal or Homoscedastic2
Disentangling Qualitatively Different Faking Strategies in High-Stakes Personality Assessments: A Mixture Extension of the Multidimensional Nominal Response Model2
A Comparison of the Next Eigenvalue Sufficiency Test to Other Stopping Rules for the Number of Factors in Factor Analysis2
On the Importance of Coefficient Alpha for Measurement Research: Loading Equality Is Not Necessary for Alpha’s Utility as a Scale Reliability Index2
Evaluating Equating Methods for Varying Levels of Form Difference2
Coefficient Lambda for Interrater Agreement Among Multiple Raters: Correction for Category Prevalence2
The Effect of Modeling Missing Data With IRTree Approach on Parameter Estimates Under Different Simulation Conditions2
Treating Noneffortful Responses as Missing2
0.044312000274658