Educational and Psychological Measurement

Papers
(The median citation count of Educational and Psychological Measurement is 2. The table below lists those papers that are above that threshold based on CrossRef citation counts [max. 250 papers]. The publications cover those that have been published in the past four years, i.e., from 2021-04-01 to 2025-04-01.)
ArticleCitations
Supervised Classes, Unsupervised Mixing Proportions: Detection of Bots in a Likert-Type Questionnaire109
On Latent Structure Examination of Behavioral Measuring Instruments in Complex Empirical Settings86
Functional Approaches for Modeling Unfolding Data42
Corrigendum to The Optimal Item Pool Design in Multistage Computerized Adaptive Tests with the p-Optimality Method27
On Modeling Missing Data in Structural Investigations Based on Tetrachoric Correlations With Free and Fixed Factor Loadings26
Reevaluating the SIBTEST Classification Heuristics for Dichotomous Differential Item Functioning23
Assessing Ability Recovery of the Sequential IRT Model With Unstructured Multiple-Attempt Data23
On Effect Size Measures for Nested Measurement Models17
The Impact of Measurement Model Misspecification on Coefficient Omega Estimates of Composite Reliability14
An Ensemble Learning Approach Based on TabNet and Machine Learning Models for Cheating Detection in Educational Tests13
Croon’s Bias-Corrected Estimation for Multilevel Structural Equation Models with Non-Normal Indicators and Model Misspecifications13
Multimodal Data Fusion to Detect Preknowledge Test-Taking Behavior Using Machine Learning13
Using Simulated Annealing to Investigate Sensitivity of SEM to External Model Misspecification11
Iterative Item Selection of Neighborhood Clusters: A Nonparametric and Non-IRT Method for Generating Miniature Computer Adaptive Questionnaires10
Can One Pool Over Site in a Multi-Site Study With Categorical Item Measuring Instruments?: A Multiple Testing Procedure9
Equating Oral Reading Fluency Scores: A Model-Based Approach9
A Note on Comparing the Bifactor and Second-Order Factor Models: Is the Bayesian Information Criterion a Routinely Dependable Index for Model Selection?9
Evaluation of Second- and Third-Level Variance Proportions in Multilevel Designs With Completely Observed Populations: A Note on a Latent Variable Modeling Procedure9
Separation of Traits and Extreme Response Style in IRTree Models: The Role of Mimicry Effects for the Meaningful Interpretation of Estimates8
Summary Intervals for Model-Based Classification Accuracy and Consistency Indices8
Latent Variable Forests for Latent Variable Score Estimation8
Measuring Unipolar Traits With Continuous Response Items: Some Methodological and Substantive Developments7
Model Specification Searches in Structural Equation Modeling Using Bee Swarm Optimization7
Resolving Dimensionality in a Child Assessment Tool: An Application of the Multilevel Bifactor Model6
Assessing the Properties and Functioning of Model-Based Sum Scores in Multidimensional Measures With Local Item Dependencies: A Comprehensive Proposal6
Detecting Differential Item Functioning Using Response Time6
A New Stopping Criterion for Rasch Trees Based on the Mantel–Haenszel Effect Size Measure for Differential Item Functioning6
The Effect of Latent and Error Non-Normality on Measures of Fit in Structural Equation Modeling6
Studying Factorial Invariance With Nominal Items: A Note on a Latent Variable Modeling Procedure6
A Comparison of Reliability Estimation Based on Confirmatory Factor Analysis and Exploratory Structural Equation Models6
Field-Testing Multiple-Choice Questions With AI Examinees: English Grammar Items6
A Monte Carlo Study of Confidence Interval Methods for Generalizability Coefficient6
Comparing Accuracy of Parallel Analysis and Fit Statistics for Estimating the Number of Factors With Ordered Categorical Data in Exploratory Factor Analysis5
Power Analysis for Moderator Effects in Longitudinal Cluster Randomized Designs5
The Sampling Ratio in Multilevel Structural Equation Models: Considerations to Inform Study Design5
Fused SDT/IRT Models for Mixed-Format Exams5
A Note on Statistical Hypothesis Testing: Probabilifying Modus Tollens Invalidates Its Force? Not True!5
Using Multiple Imputation to Account for the Uncertainty Due to Missing Data in the Context of Factor Retention5
Linear and Nonlinear Indices of Score Accuracy and Item Effectiveness for Measures That Contain Locally Dependent Items5
A Bayesian General Model to Account for Individual Differences in Operation-Specific Learning Within a Test5
The Accuracy of Bayesian Model Fit Indices in Selecting Among Multidimensional Item Response Theory Models5
Exploring the Influence of Response Styles on Continuous Scale Assessments: Insights From a Novel Modeling Approach4
Two-Method Measurement Planned Missing Data With Purposefully Selected Samples4
Detecting Rating Scale Malfunctioning With the Partial Credit Model and Generalized Partial Credit Model4
Generalized Mantel–Haenszel Estimators for Simultaneous Differential Item Functioning Tests4
An Illustration of an IRTree Model for Disengagement4
Examining the Instructional Sensitivity of Constructed-Response Achievement Test Item Scores4
Optimal Number of Replications for Obtaining Stable Dynamic Fit Index Cutoffs4
Design Effect in Multilevel Settings: A Commentary on a Latent Variable Modeling Procedure for Its Evaluation4
Linear Factor Analytic Thurstonian Forced-Choice Models: Current Status and Issues4
Detecting Cheating in Large-Scale Assessment: The Transfer of Detectors to New Tests4
Use of the Lagrange Multiplier Test for Assessing Measurement Invariance Under Model Misspecification4
Are Speeded Tests Unfair? Modeling the Impact of Time Limits on the Gender Gap in Mathematics4
DIF Detection With Zero-Inflation Under the Factor Mixture Modeling Framework4
Obtaining a Bayesian Estimate of Coefficient Alpha Using a Posterior Normal Distribution3
Detecting Differential Rater Functioning in Severity and Centrality: The Dual DRF Facets Model3
Detecting Preknowledge Cheating via Innovative Measures: A Mixture Hierarchical Model for Jointly Modeling Item Responses, Response Times, and Visual Fixation Counts3
Artificial Neural Networks for Short-Form Development of Psychometric Tests: A Study on Synthetic Populations Using Autoencoders3
Factor Retention in Exploratory Factor Analysis With Missing Data3
Application of Change Point Analysis of Response Time Data to Detect Test Speededness3
Fixed Effects or Mixed Effects Classifiers? Evidence From Simulated and Archival Data3
Item Classification by Difficulty Using Functional Principal Component Clustering and Neural Networks3
Treatments of Differential Item Functioning: A Comparison of Four Methods3
Evaluating the Effects of Missing Data Handling Methods on Scale Linking Accuracy3
Evaluating Model Fit of Measurement Models in Confirmatory Factor Analysis3
An Item Response Theory Model for Incorporating Response Times in Forced-Choice Measures3
Investigating Confidence Intervals of Item Parameters When Some Item Parameters Take Priors in the 2PL and 3PL Models3
Assessing Dimensionality of IRT Models Using Traditional and Revised Parallel Analyses3
The Importance of Thinking Multivariately When Setting Subscale Cutoff Scores3
Exploration of the Stacking Ensemble Machine Learning Algorithm for Cheating Detection in Large-Scale Assessment3
A Comparison of Response Time Threshold Scoring Procedures in Mitigating Bias From Rapid Guessing Behavior3
“What If Applicants Fake Their Responses?”: Modeling Faking and Response Styles in High-Stakes Assessments Using the Multidimensional Nominal Response Model3
The Impact of Sample Size and Various Other Factors on Estimation of Dichotomous Mixture IRT Models2
A Comparison of the Next Eigenvalue Sufficiency Test to Other Stopping Rules for the Number of Factors in Factor Analysis2
Factor Retention in Exploratory Multidimensional Item Response Theory2
An Evaluation of Fit Indices Used in Model Selection of Dichotomous Mixture IRT Models2
Item Parameter Recovery: Sensitivity to Prior Distribution2
Is the Area Under Curve Appropriate for Evaluating the Fit of Psychometric Models?2
Evaluating The Predictive Reliability of Neural Networks in Psychological Research With Random Datasets2
Semisupervised Learning Method to Adjust Biased Item Difficulty Estimates Caused by Nonignorable Missingness in a Virtual Learning Environment2
Position of Correct Option and Distractors Impacts Responses to Multiple-Choice Items: Evidence From a National Test2
Procedures for Analyzing Multidimensional Mixture Data2
The NEAT Equating Via Chaining Random Forests in the Context of Small Sample Sizes: A Machine-Learning Method2
An Explanatory Multidimensional Random Item Effects Rating Scale Model2
Conceptualizing Correlated Residuals as Item-Level Method Effects in Confirmatory Factor Analysis2
Correcting for Extreme Response Style: Model Choice Matters2
Improving the Use of Parallel Analysis by Accounting for Sampling Variability of the Observed Correlation Matrix2
The Response Vector for Mastery Method of Standard Setting2
Examination of ChatGPT’s Performance as a Data Analysis Tool2
Symptom Presence and Symptom Severity as Unique Indicators of Psychopathology: An Application of Multidimensional Zero-Inflated and Hurdle Graded Response Models2
Assessing Essential Unidimensionality of Scales and Structural Coefficient Bias2
Identifying Ability and Nonability Groups: Incorporating Response Times Using Mixture Modeling2
Evaluating the Quality of Classification in Mixture Model Simulations2
Implementing a Standardized Effect Size in the POLYSIBTEST Procedure2
0.042155981063843