- Joined
- Aug 20, 2013
- Messages
- 3,495
- Reaction score
- 3,083
Does anyone know why STEP uses a scaled scoring system rather than percentiles? It seems to me like it would make a lot more sense to receive you score back in the form of a percentile (i.e. 1-100, a 50 would mean you scored in the 50th percentile, which is a ~230 today).
You might be thinking that scaled scores are required because it provides a way to compare the difficulty level of different tests across years, but I don't think that's really necessary because the difficulty level isn't important (aside from passing/failing, which could be done internally w/ a simple P or F attached to the score) - what schools/residencies care about is percentile or how well you performed compared to your peers. The percentile/difficulty of each question could be kept on a rolling basis (with a large enough pool so that you couldn't exploit this by taking the test in off season) that constantly evolves so that it doesn't get outdated - probably similar to how they do it now.
Finally, there is one more aspect to the scaled score that makes absolutely no sense to me: it magnifies differences at the peaks of performance and gives the illusion that there's a large gap in performance when one doesn't exist. For example, a 260 (96th percentile) to 270 (100% percentile) is 4% whereas a 230 (48th) to 240 (67%) is 19%.
Anyway, the system makes no sense to me and I don't understand why USMLE doesn't use percentiles as the primary score. It seems like it would resolve a lot of headache for students, schools and residency programs. Was wondering if someone could fill me in on what I'm missing.
You might be thinking that scaled scores are required because it provides a way to compare the difficulty level of different tests across years, but I don't think that's really necessary because the difficulty level isn't important (aside from passing/failing, which could be done internally w/ a simple P or F attached to the score) - what schools/residencies care about is percentile or how well you performed compared to your peers. The percentile/difficulty of each question could be kept on a rolling basis (with a large enough pool so that you couldn't exploit this by taking the test in off season) that constantly evolves so that it doesn't get outdated - probably similar to how they do it now.
Finally, there is one more aspect to the scaled score that makes absolutely no sense to me: it magnifies differences at the peaks of performance and gives the illusion that there's a large gap in performance when one doesn't exist. For example, a 260 (96th percentile) to 270 (100% percentile) is 4% whereas a 230 (48th) to 240 (67%) is 19%.
Anyway, the system makes no sense to me and I don't understand why USMLE doesn't use percentiles as the primary score. It seems like it would resolve a lot of headache for students, schools and residency programs. Was wondering if someone could fill me in on what I'm missing.
Last edited: