Last edited:
So yeah this is insane that some people will just have a 2 hour shorter exam. But are they really telling me right now that 80 questions per step exam are unscored filler?
Pray tell....why is it ridiculous?
There are other students who feel differently. I talked to a student who tested at Brown. He thought it was easier to stay sharp for 5 blocks of 40 questions than for 7 blocks of 40 questions. That could be an advantage.
How are you people not getting this? Severely modifying a standardized test destroys the entire validity of a STANDARDIZED test within a year of applicants and beyond.
Maybe this is the right time to implement the p/f decision for next year.That is so crazy, because apparently the curve for the current year is set by the previous year's test takers.
Maybe this is the right time to implement the p/f decision for next year.
I'm not sure how the scoring works, but if what you said is true, and people do significantly better this year because they've had an extra 3 months to study or they take a neutered version of the exam in their school library, wouldn't it be more likely that scores would be scaled down for the following year? Meaning you could turn in a killer performance and get a score of 235. That would wreck your chances at a competitive specialty much more than a p/f.Whoa whoa whoa, lol. I would still 100% rather have it scored and be unfair than to have it P/F. They also said that they're definitely not moving up the timeline for it anyway.
I'm not sure how the scoring works, but if what you said is true, and people do significantly better this year because they've had an extra 3 months to study or they take a neutered version of the exam in their school library, wouldn't it be more likely that scores would be scaled down for the following year?
Answer the USMLE survey with a big NO
Email your deans and [email protected] to call out this unfair advantage
Tweet https://twitter.com/theusmle
Are program directors diligent enough to take into account covid and reduction to 200q? Or will they see the numbers as what they are superficially. Has anyone heard a PD’s opinion?
I'm not sure how the scoring works, but if what you said is true, and people do significantly better this year because they've had an extra 3 months to study or they take a neutered version of the exam in their school library, wouldn't it be more likely that scores would be scaled down for the following year? Meaning you could turn in a killer performance and get a score of 235. That would wreck your chances at a competitive specialty much more than a p/f.
P.S. In the back of my mind I'm thinking that there is something off about this—I feel like I've heard that the score is and always has been referenced directly to individual exam performance and not group metrics, like when people discussing score inflation say that a 250 today is the same as a 250 was 15 years ago. I know there are people on here who are very knowledgeable about this stuff.
I emailed them telling them this is ridiculous and to remove all experimental questions for the foreseeable future
What do you expect PDs to do anyway, subtract 10 points from the scores of students who took the 5 hour test instead of the 7 hour test?
Wouldn't be surprised if the MCAT was like this eitherOk so I know people are mad at the lack of standardization, but I can't get over how two entire block's worth of questions have never counted for the actual score. That's almost 30% of the exam!
It's not the same percentile-wise, but I do believe it represents the same performance on the exam. This is my understanding:I doubt that a 250 today is the same as a 250 15 years ago. The way they standardize the test results in a natural score creep.
It's not the same percentile-wise, but I do believe it represents the same performance on the exam. This is my understanding:
Because Step 1 is a criterion-referenced test (designed to decide yes/no for licensure) and not a norm-referenced test like the MCAT, SAT, etc., the score is reflective of the person's performance with respect to the questions, not others taking the test. The score creep is more likely due to an actual improvement in performance over the years as Step 1 has grown in importance and taken on the role of a sort of residency aptitude test. People have been much more deliberate and organized in their studying, and study materials have become much more advanced, which has resulted in the percentile curve shifting to the right as the absolute performance scale stays the same.
I don't think that's compatible with scaling scores each year based on the previous years' results. But again, I'm fairly ignorant on the topic. I would love to hear from some experts.
Good, thoughtful post. I can't refute your point about CK with any real evidence, but I would guess that CK scores have gone up because people are studying more for Step 1. Putting in that legwork for a more challenging exam really makes CK easier. Anecdotally, most of my friends and I did very well on Step 1, didn't study at all for CK per se, and scored quite well.I don’t think so since Step 2CK score averages have increased even faster than Step 1 averages, even though there is less emphasis placed on Step 2CK and few organized resources. Here’s my theory:
The NBME have just admitted that the experimental questions are validated and scored using performance on old questions. So if someone taking a test in 2017 performs at a 230 level on old questions and gets 80% right on experimental questions, 230 = 80% correct (averaged) for future examinees taking those questions. Then two things happen, (1) Some of the experimental questions get thrown out because they have no predictive value. The average score will tend to drift higher because the experimental questions have improved in quality compared to when they were taken by the old examinees. The NBME attempts to correct for this, but are imperfect. (2) Some of the information from the experimental part gets passed on to First Aid, UWorld, etc. Future test takers who study these sources then get a boost. Thus, examinees will technically “know more,” but what would have been critical thinking questions become simple memorization questions if sources are telling you the answer. These changes might be small enough to compare applicants year by year, but not over 5+ years.
I don’t think so since Step 2CK scores have increased even faster than Step 1, even though there is less emphasis placed on Step 2CK and few organized resources. Here’s my theory:
The NBME have just admitted that the experimental questions are validated and scored using performance on old questions. So if someone taking a test in 2017 performs at a 230 level on old questions and gets 80% right on experimental questions, 230 = 80% correct (averaged) for future examinees taking those questions. Then two things happen, (1) Some of the experimental questions get thrown out because they have no predictive value. The average score will tend to drift higher because the experimental questions have improved in quality compared to when they were taken by the old examinees. The NBME attempts to correct for this, but are imperfect. (2) Some of the information from the experimental part gets passed on to First Aid, UWorld, etc. Future test takers who study these sources then get a boost. Thus, examinees will technically “know more,” but what would have been critical thinking questions become simple memorization questions if sources are telling you the answer. These changes might be small enough to compare applicants year by year, but not over 5+ years.
Experimental questions are not scored. Hence the more proper name for them: unscored pretest items (see the first post in this thread). Unfortunately I think this renders your theory moot.
The inclusion of unscored pretest items on USMLE exams is done to gather statistical data on their performance, most notably whatever discrimination index the NBME uses. These indices basically tell you whether or not a given test question separates those who know the content from those who don't know the content. Items with good discrimination indices get called up to the big leagues and are used "live" on a future exam. When a question gets used for real the NBME is still collecting data on it to ensure the discrimination index remains acceptable. The other important determination is how difficult it is (how many people get it correct). The reason it takes 3+ weeks to get your score back is because of all the statistics that have to be run to evaluate discrimination and relative test difficulty between different takers.
To my knowledge the year-over-year comparisons have more to do with estimating percentiles than anything else, which change very slowly over time.
To get you out of the testing center faster?????Hope they roll this back. I know I was thinking a little slower by the last couple blocks on test day, it's mentally exhausting. Seems like a huge advantage to get two hours shaved off.
Not to mention that the head-scratcher questions that you have to read 5 times to understand what they're asking are more likely to be experimental than scored. Getting rid of all those stumbling blocks and taking only 200 high-quality Q's under the same time limit per block is yet another advantage.
Why can't they use 280 validated items and provide a much more accurate score under the original format? Does anyone know why they need it to be shorter?
"Let's mess with our validity to save a couple hours of proctoring costs."To get you out of the testing center faster?????
Yeah, F them for trying to reduce the chances of you catching or spreading SARS-CoV2."Let's mess with our validity to save a couple hours of proctoring costs."
"Agreed"
???
I hate to break this to you Goro, but if 2 hours socially distanced sitting in a test booth is enough to give a bunch of med students SARS2, then they're all going to immediately get it when they return to the wards anyways. There is no way to stay healthy for 2 years of rotations in a hospital, if a testing room is that dangerous.Yeah, F them for trying to reduce the chances of you catching or spreading SARS-CoV2.
It's all about reducing risk, in my view.I hate to break this to you Goro, but if 2 hours socially distanced sitting in a test booth is enough to give a bunch of med students SARS2, then they're all going to immediately get it when they return to the wards anyways. There is no way to stay healthy for 2 years of rotations in a hospital, if a testing room is that dangerous.
Yeah, F them for trying to reduce the chances of you catching or spreading SARS-CoV2.
You know what, I think I read that the LSAT or GRE (or both?) are also being abridged right now, and those are actually being remote proctored to people sitting at home.It's all about reducing risk, in my view.
Maybe it's fair...they def have some good karma coming for having to be in Dedicated for 4-6 months hahaJeeze, the hyperacheivers need to chill.
Then why are the people at Prometric centers taking the full length unabridged test? How is their exposure different? Heck if I take it at Prometric, I might be exposing people from outside of any healthcare career.Yeah, F them for trying to reduce the chances of you catching or spreading SARS-CoV2.
Didn't know this. Still waters clearly run deepThen why are the people at Prometric centers taking the full length unabridged test? How is their exposure different? Heck if I take it at Prometric, I might be exposing people from outside of any healthcare career.
Sorry but that doesn’t make sense. And when it doesn’t make sense, it probably makes money.
I was just thinking about this regarding COMLEX PE. It seems absurd at first for them to insist on keeping PE around when CS is cancelled...but not if PE could be the new way for IMGs to get ECFMG certified (previously required CS).And when it doesn’t make sense, it probably makes money.
Goro you really make less and less sense these days loolYeah, F them for trying to reduce the chances of you catching or spreading SARS-CoV2.
We do have a few people on our side, lemme plug Carmody one more time. I think he's also popping off on Twitter about the abridged step exam.Everything I've heard from this organization over the last 6 months, in addition to my interactions with my own admins, has convinced me that everyone in Med Ed is brain dead. Who else could have possibly green lighted this decision.