crookedletter
bendy
- Joined
- Jun 14, 2024
- Posts
- 632
I really thought "It's an average" would be about the beginning and end of this discussion. You guys are impressive.
Follow along with the video below to see how to install our site as a web app on your home screen.
Note: This feature may not be available in some browsers.
It's good for estimating, but can't nail down exact votes.This article explains one process for analyzing scores. It involves some math.
I'm not that mathematically inclined so I can't attest to its accuracy.
How to Analyze Your Scores
More precisely, early votes impact the early ratings more than any individual votes (early or late) impact the later ratings.Plus, early votes make more difference to the average than later ones, so trying to track beyond 25 or 30 is pointless.
Ah, the average Lit readerCoincidentally, after a long working day and two glasses of wine, my IQ is also exactly 4.18.
I can count to eleven, if I have my hands in my pockets.Ah, the average Lit reader
Coincidentally, after a long working day and two glasses of wine, my IQ is also exactly 4.18.
My humour is subtle like that hammer.Like a hammer on glass, you crack me up!
I can count to eleven, if I have my hands in my pockets.
The thread is here, and I've pasted the key graph below. It only compares Loving Wives with 'The Rest', but of course the categories are not uniform. The main point was that you can't really compare scores across categories, or even between now (when your story might have been 1-bombed) and the all time scores (where bombs have been cleaned up and voter behaviour may have changed). The median (50%) 'all stories, all time' score was 4.41, but the 12-month LW median was 3.75.Not really. Someone did a score data crunch a few months back and the mean score (50th percentile) was I think 4.42 if I recall, and my personal avg score of 3.8ish ranked around the 12th percentile, so a 4 would be somewhere around a dismal 20th percentile. No school that I know hands out a B grade for a score like that.
LW aside, anything below roughly 4.2 is a pretty bad score.

I think OP might not have known about being able to see the vote count, awareness of which answers the question "average of WHAT exactly".I really thought "It's an average" would be about the beginning and end of this discussion. You guys are impressive.
The main point was that you can't really compare scores across categories
I'm in full agreement with you.Agree. One cannot compare raw scores across categories. That is (just) one reason why the arbitrary 4.5 Red H threshold is folly. However, if we convert the scores of each separate category into percentiles, we actually can begin to compare stories across categories with some decent accuracy. We could then say that a 3.75 in LW is roughly the equivalent of a 4.41 in category X, etc.
Also, the Red H bar could be set at a meaningful level, such as 75th or 80th percentile. As it is now, in most categories it's somewhere around the 55th. That's preposterously low to indicate some sort of cut above. A second level could also be implemented, say a Red H at 90th percentile and a Pink H at 70th, say.
All or most of this was mentioned in the other thread but it's still very valid discussion worth continuing.
I also think that only the highest scoring chapter of a series should count for Red H (and 2nd highest for Pink say).
Agree. One cannot compare raw scores across categories. That is (just) one reason why the arbitrary 4.5 Red H threshold is folly. However, if we convert the scores of each separate category into percentiles, we actually can begin to compare stories across categories with some decent accuracy. We could then say that a 3.75 in LW is roughly the equivalent of a 4.41 in category X, etc.
I live in a cold climate.I heard ten and a half.
You (PSG and others) are still making the error of fetishizing the category distinctions, granting them undue power and meaning over the percentage bands that they create.
Considering how much pixelated ink has been spilled about the uncountable deficiencies of the category system, I find this very baffling.
You (PSG and others) are still making the error of fetishizing the category distinctions, granting them undue power and meaning over the percentage bands that they create.
Considering how much pixelated ink has been spilled about the uncountable deficiencies of the category system, I find this very baffling.
As a pragmatic question for the category specific H levels, does the percentile ever get recalculated?
I am way too new to carefully watching ratings to have a clue, but I would not be surprised if there is a drift over time, and it may be different between different categories. The graph that is posted above (and in a few other threads), might argue that we are seeing rating inflation. Or it might be that the individual scores decline over time. Some of you may have a good handle on that. I certainly don't.
If you recalibrate it over time, do stories keep their calibration from their era or get dragged along. Either seems to have odd behaviors.
If you do not recalibrate and their is a differential shift in the categories, we are introducing whole new unfairness in the H, speaking of something that is fetishized.
And this much harder for the site itself than the simple 4.5 rule.
All this said, I would still prefer the per-category percentile based approach.