Function File: CL = cl_multinom (x, N, b, calculation_type ) - Confidence level of multinomial portions

Returns confidence level of multinomial parameters estimated p = x / sum(x) with predefined confidence interval b. Finite population is also considered.

This function calculates the level of confidence at which the samples represent the true distribution given that there is a predefined tolerance (confidence interval). This is the upside down case of the typical excercises at which we want to get the confidence interval given the confidence level (and the estimated parameters of the underlying distribution). But once we accept (lets say at elections) that we have a standard predefined maximal acceptable error rate (e.g. b=0.02 ) in the estimation and we just want to know that how sure we can be that the measured proportions are the same as in the entire population (ie. the expected value and mean of the samples are roghly the same) we need to use this function.

Arguments

  • x : int vector : sample frequencies bins
  • N : int : Population size that was sampled by x. If N<sum(x), infinite number assumed
  • b : real, vector : confidence interval if vector, it should be the size of x containing confence interval for each cells if scalar, each cell will have the same value of b unless it is zero or -1 if value is 0, b=.02 is assumed which is standard choice at elections otherwise it is calculated in a way that one sample in a cell alteration defines the confidence interval
  • calculation_type : string : (Optional), described below "bromaghin" (default) - do not change it unless you have a good reason to do so "cochran" "agresti_cull" this is not exactly the solution at reference given below but an adjustment of the solutions above

Returns

Confidence level.

Example

CL = cl_multinom( [27;43;19;11], 10000, 0.05 ) returns 0.69 confidence level.

References

"bromaghin" calculation type (default) is based on is based on the article Jeffrey F. Bromaghin, "Sample Size Determination for Interval Estimation of Multinomial Probabilities", The American Statistician vol 47, 1993, pp 203-206.

"cochran" calculation type is based on article Robert T. Tortora, "A Note on Sample Size Estimation for Multinomial Populations", The American Statistician, , Vol 32. 1978, pp 100-102.

"agresti_cull" calculation type is based on article in which Quesenberry Hurst and Goodman result is combined A. Agresti and B.A. Coull, "Approximate is better than \"exact\" for interval estimation of binomial portions", The American Statistician, Vol. 52, 1998, pp 119-126

Package: statistics