Algorithm for how many students in a class did better than a student given this information

2023-02-18 22:59

I want to make a simple application that will take in:

number of students
class average (score/100)
median grade (score/100)
class standard deviation
the current grade of a student (score/100)

The output would be how many students did better than that student.

I'm interested in the best estimate possible with this information.

I'm just not sure how to go about calculating this.

The grades in my data set have the same average as the median, so please, simply explain how to do it this way.

The commenters above are correct that without more information you can't nail this down precisely. However, as Steve Jobs liked to say, "real artists ship", so here is what I would do if you need a ballpark estimate.

The two most straightforward ways to go about this are to assume the data is either normally distributed or drawn from a beta distribution (because the scores are bounded between 0 and 100). Because you said the mean and median are close in your data, I will give code to calculate the quantity assuming a normal distribution.

A normal distribution has two parameters: a mean and a variance. The best estimate of the mean you are going to get is the sample mean from the data, and the best estimate of the variance is the square of the sample standard deviation. So if you want to know how many students did worse than a particular score, what you need is the cumulative distribution function (CDF); the number who did better is the class size times one minus the CDF at that score.
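Written out, the estimate described here looks roughly like the following, where N is the number of students, x the student's score, and μ and σ the class mean and standard deviation. The numbers in the worked line are illustrative assumptions, not values from the question.

```latex
\[
  N_{\text{better}} \approx N\,\bigl(1 - \Phi(z)\bigr),
  \qquad z = \frac{x - \mu}{\sigma},
  \qquad \Phi(z) = \tfrac{1}{2}\Bigl[1 + \operatorname{erf}\Bigl(\frac{z}{\sqrt{2}}\Bigr)\Bigr]
\]
% Illustrative numbers (assumed, not from the question): N = 30, \mu = 70, \sigma = 10, x = 80
\[
  z = 1, \qquad \Phi(1) \approx 0.8413, \qquad
  N_{\text{better}} \approx 30 \times 0.1587 \approx 4.8
\]
```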

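#include <cmath>  // std::erf, std::sqrt (C++11)

// Inputs: mu = sample mean, sigma = sample standard deviation (must be > 0),
// numStudents = class size, score = the student's grade.
// Returns the estimated number of students who scored above `score`,
// assuming the grades are normally distributed.
int NumberBetterThan(double score, double mu, double sigma, int numStudents)
{
    // Normal CDF at `score`: the expected fraction of students at or below it.
    double cdf = 0.5 * (1.0 + std::erf((score - mu) / (sigma * std::sqrt(2.0))));
    // Truncates to int; return a double instead if a fractional number of students is acceptable.
    return static_cast<int>(numStudents * (1.0 - cdf));
}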
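A possible variant of the function above, sketched under the same normality assumption: because the quantity wanted is the upper tail (one minus the CDF), it can be computed directly with the complementary error function `std::erfc`, and rounded rather than truncated. The names `ExpectedBetterThan` and `RoundedBetterThan` are made up for this illustration.

```cpp
#include <cmath>  // std::erfc, std::sqrt, std::lround (C++11)

// Expected (possibly fractional) number of students scoring above `score`,
// assuming grades ~ Normal(mu, sigma^2). Assumes sigma > 0.
double ExpectedBetterThan(double score, double mu, double sigma, int numStudents)
{
    // Upper tail of the normal distribution: 1 - CDF(score) = 0.5 * erfc(z / sqrt(2)).
    const double z = (score - mu) / sigma;
    const double upperTail = 0.5 * std::erfc(z / std::sqrt(2.0));
    return numStudents * upperTail;
}

// Same estimate, rounded to the nearest whole student instead of truncated.
long RoundedBetterThan(double score, double mu, double sigma, int numStudents)
{
    return std::lround(ExpectedBetterThan(score, mu, sigma, numStudents));
}
```

For instance, `ExpectedBetterThan(80.0, 70.0, 10.0, 30)` evaluates to roughly 4.76, which `RoundedBetterThan` reports as 5, whereas truncating to `int` would give 4.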

erf is the error function from statistics. C++11 and later provide it as std::erf in <cmath>; for older compilers, you can find C++ code that implements it in many places on the web. One such place is here.
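The answer only gives code for the normal case. For the beta alternative it mentions, a rough sketch might fit the two beta parameters by the method of moments from the mean and standard deviation (rescaled to the 0-1 range) and integrate the density numerically. Everything here is an assumption made for illustration: the function names, the choice of Simpson's rule, the 2000 subintervals, and the restriction to fits where both parameters exceed 1 (otherwise the density is unbounded at an endpoint and this simple integration is not appropriate).

```cpp
#include <cmath>  // std::lgamma, std::log, std::exp (C++11)

// Beta(a, b) density at t in [0, 1]. This sketch assumes a > 1 and b > 1,
// so the density is 0 at both endpoints.
double BetaPdf(double t, double a, double b)
{
    if (t <= 0.0 || t >= 1.0) return 0.0;
    const double logNorm = std::lgamma(a + b) - std::lgamma(a) - std::lgamma(b);
    return std::exp(logNorm + (a - 1.0) * std::log(t) + (b - 1.0) * std::log(1.0 - t));
}

// Expected number of students above `score` (on a 0-100 scale), with the beta
// parameters fitted by the method of moments from the class mean and standard
// deviation. Returns -1.0 when the inputs fall outside what this sketch handles.
double ExpectedBetterThanBeta(double score, double mean, double stdDev, int numStudents)
{
    const double m = mean / 100.0;                        // rescale to [0, 1]
    const double v = (stdDev / 100.0) * (stdDev / 100.0); // variance on [0, 1] scale
    const double x = score / 100.0;
    if (m <= 0.0 || m >= 1.0 || v <= 0.0 || x < 0.0 || x > 1.0) return -1.0;

    const double k = m * (1.0 - m) / v - 1.0;             // equals a + b
    const double a = m * k;
    const double b = (1.0 - m) * k;
    if (a <= 1.0 || b <= 1.0) return -1.0;

    // Composite Simpson's rule for the upper tail: integral of the pdf over [x, 1].
    const int n = 2000;                                   // even number of subintervals
    const double h = (1.0 - x) / n;
    double sum = BetaPdf(x, a, b) + BetaPdf(1.0, a, b);
    for (int i = 1; i < n; ++i)
        sum += (i % 2 == 1 ? 4.0 : 2.0) * BetaPdf(x + i * h, a, b);
    return numStudents * sum * h / 3.0;
}
```

With a mean of 50 and a standard deviation of 10, the fit is symmetric (both parameters equal 12), so a score of 50 should come out at about half the class. A library routine for the regularized incomplete beta function would likely be a better choice than hand-rolled integration in real code.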

You need to know more than average, median, and standard deviation to have a probability distribution of the scores, and you need that distribution to figure out how many students did better.

If you assume a probability distribution (or know the distribution because the teacher graded on that curve), the number of students that did better would be (cdf(maximum possible score) - cdf(student's score)) * number of students, where cdf is the cumulative distribution function for that distribution.
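As a minimal sketch of that formula, the CDF can be passed in as a function so the same code works for whichever distribution is assumed. The names `CountAbove`, `UniformCdf`, and `MakeNormalCdf` are hypothetical, and the two example distributions are illustrations only.

```cpp
#include <cmath>       // std::erf, std::sqrt (C++11)
#include <functional>  // std::function

// Estimated count above `score` for any CDF, following the formula above:
// (cdf(maxScore) - cdf(score)) * numStudents.
double CountAbove(const std::function<double(double)>& cdf,
                  double score, double maxScore, int numStudents)
{
    return (cdf(maxScore) - cdf(score)) * numStudents;
}

// Example CDF 1: scores uniform on [0, 100].
double UniformCdf(double x)
{
    if (x <= 0.0) return 0.0;
    if (x >= 100.0) return 1.0;
    return x / 100.0;
}

// Example CDF 2: normal with a given mean and standard deviation (sigma > 0).
std::function<double(double)> MakeNormalCdf(double mu, double sigma)
{
    return [mu, sigma](double x) {
        return 0.5 * (1.0 + std::erf((x - mu) / (sigma * std::sqrt(2.0))));
    };
}
```

For example, `CountAbove(UniformCdf, 75.0, 100.0, 20)` gives 5. With an unbounded distribution such as the normal, cdf(maximum possible score) is slightly below 1, so the result is a little smaller than the plain upper-tail estimate.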
