Gas Metrology Group: CRM Document


APPENDIX A

Statistical Analysis of Data

Introduction

The data submitted to NIST by the producer and the auditor will be examined in detail to determine that the candidate lot of CRM's is of the concentration claimed and that the lot is stable and homogenous. The statistical treatment is illustrated in this appendix by means of an example based, for the most part, on real experimental data. However, it was necessary to synthe-size some data, particularly in describing the auditor results, to produce an example illustrative of the whole process. It should be noted that this example does not define the degree of measurement precision required of a CRM, and, in many cases, the precision shown may not necessarily be attainable, due to such factors as instrument sensitivity and the chemical and physical properties of the gases involved.

Illustrative Example and Statistical Analysis

Producer's Calibration Data:

The producer provided the following data on the calibration of his instrument:

Table 1
SRM No.Concentration*Signal or Sensitivity
16779.67 ± .09 ppm11.51 mV (signal)
167844.9 ± .5 ppm54.10 mV (signal)
167997.1 ± .9 ppm114.86 mV (signal)

*The ± values are the total uncertainties given in the SRM certificates.

A linear regression of signal vs. concentration provides the equation:

signal = .464 + 1.181 (concentration)

The fit to the straight line is good, and is only minimally affected by making the intercept zero. This leads to the modified equation:

signal = 1.187 (concentration)

The fit provided by this equation is shown in the following table where the fitted values in the last column should be compared with the observed values of signal:

Table 2
Signal
ObservedFitted
11.5111.48
54.1053.30
114.86115.26

We conclude that the instrument gives essentially a linear response and is in good state of calibration. However, since the samples to be analyzed have a concentration close to that of SRM 1678 (about 45 ppm), a sensitivity value can be adopted that is based on a calibration line going through the origin and through the point whose abscissa is the certified value of SRM 1678 and whose ordinate is the measured value for this SRM. This gives the calibration equation which will be used for all further analyses:

signal = 1.205 (concentration)

Analysis of Internal Reference Standard

One of the cylinders was randomly selected as the lot "Internal Reference Standard." The measurements,* in mV are shown in the second column of Table 3. The third column shows the concentration values, in ppm, obtained by using the sensitivity 1.205 mV/ppm for the conversion.

The average concentration for the Internal Reference Standard is:

X = 44.6139 ppm

with a standard deviation among single replicate measurements of

sx = 0.018 ppm

The standard error of the average value for this cylinder is:

sx = 0.018/SQRT(10) = 0.0057 ppm

Table 3
MeasurementSignal (mV)Concentration (ppm)Measurement No.Signal (mV)Concentration (ppm)
153.762344.616653.768344.621
253.738244.596753.745444.602
353.745444.602853.723744.584
453.794844.643953.770744.623
553.771944.6271053.776744.628

*Note: It is not required that either producer or auditor provide the actual measured value of the signal. However, the value of the concentration calculated from the signal should be expressed with sufficient digits to reflect the magnitude of the signal. In other words, don't round off the calculated concentration when submitting the data for evaluation.

Analysis of All Samples

The Internal Reference Standard is now used by the producer to analyze all the cylinders in the lot. This is generally accomplished by measuring the ratio of the signal for the sample to that for the internal standard. Table 4 shows the signal ratios, denoted as R, and the corresponding calculated con-centrations, denoted as C, using the value 44.6139 ppm for the internal standard. Thus, each concentration is obtained by the equation:

C = R (44.6139)

where

R = signal for sample/signal for internal standard

The last column of Table 4 is used for two purposes: a) to obtain an additional estimate of the standard deviation among replicates, and b) to test whether a significant systematic shift has occurred between the two sets ("first analysis" and "second analysis"). The estimate of the standard deviation, converted to a single measurement basis is 0.021 and is consistent with that obtained previously (s = 0.018) [see section 2, preceding]. As to a possible shift, there is no evidence for such an occurrence. A test of significance can be carried out as follows:

t = -0.0048/(0.030/SQRT(10)) = -0.50

This value is not significant, when compared with the critical value of Student's t, for 10-1 = 9 degrees of freedom.

The standard deviations, 0.021 and 0.022, for the two sets of values in Table 4 are mutually consistent. Moreover, since they are of the same order of magnitude as the measurement error (as derived from replicate measurements on the same sample), it may be concluded that no measurable heterogeneity exists between the cylinders of this lot.

13
Table 4
Calculated
Concentration
Difference
Between
Duplicates
First AnalysisSecond Analysis
SampleRCRC
51.00017744.6220.99960144.5960.026
61.00059244.592
111.00059244.640
1.00010244.6181.00040644.632-0.014
151.00039244.631
210.99948244.591
231.00090444.6540.99980744.6050.049
291.00043844.633
301.00036844.630
310.99990644.6101.00035344.630-0.020
330.99935044.5851.00030744.628-0.043
340.99952044.5921.00005044.616-0.024
351.00048044.635
390.99956344.5941.00021044.623-0.029
450.99951444.5921.00005044.616-0.024
460.99941544.588
480.99916844.577
490.99980044.605
500.99988144.6090.99930444.5830.026
510.99906044.572
520.99945244.5890.99932144.5840.005
Average44.606544.6099-0.0048
Std. dev.0.0210.0220.030*

*Since the numbers in this column are differences of two measurements, the standard deviation found for this column is SQRT(2) times that for single measurements. Consequently, the standard deviation for single measurements derived from the differences is (0.030)/SQRT(2) = 0.021. This values does not include possible variability between samples.

The best average value for the concentration of the lot is:

((44.6065 x 10) + (44.6099 x 21))/(10 + 21) = 44.6088

The standard error of this overall average is:

0.022/SQRT(31) = 0.0040

This standard error does not include calibration error, or errors in the value of the SRM and in the value of internal reference standard.

Auditor's Calibration Data

Table 5 shows the auditor's calibration results.
SRM No.Concentration*Signal (counts)
16779.79 ± .0919,688
167845.3 ± .591,642
167997.1 ± .8197,990
1680476. ± .4979,130

*The ± values are the total uncertainties given in the SRM certificates.

The sensitivities (signal/concentration) are successively:

2011, 2023, 2039, and 2057 counts/ppm.

These values indicate a trend due, either to the presence of a blank or to curvature, or to both.

A regression analysis shows a slight amount of curvature, but otherwise the calibration data appear satisfactory. Since the samples are of the order of magnitude of SRM 1678, the latter will be used for conversion of signal to concentration, through the equation:

signal = 2023 (concentration).

Auditor's Sample Measurements

The auditor made 10 replicate analysis on each of two cylinders, no. 48 and no. 29. The results are shown in Table 6.

Table 6
Cylinder 48Cylinder 29
SignalConcentrationSignalConcentration
8986644.4228978144.380
9004444.5109012144.548
8949544.2398987444.426
8957044.2769005044.270
9002444.5008955844.270
8999744.4878987444.426
8956844.2759022644.600
8970844.3448988844.433
8974244.3618998744.482
8985244.4158965544.318
Average44.382944.4396
Std. dev.0.0990.101

Evaluation of Uncertainties and Intercomparison of Results

Internal Comparison of Auditor's Results

The standard deviation of a single measurement made by the auditor is 0.10. The results for cylinders 48 and 29 may be compared by Student's t-test:

t =(44.3829 - 44.4396)/(0.100 SQRT(1/10 + 1/10)) = -1.27

The t-value is not significant. There is, therefore, no evidence of heterogeneity between the two cylinders.

Total Uncertainty of Producer's Values

The value of each sample obtained by the producer is obtained by a procedure represented by the following diagram:

                   Calibration    Internal Reference
   SRM                Factor           Standard            Sample
    |__________________| |_______________| |_________________|
        Calibration              A                  B

The total error in the sample value is composed of four parts:

  1. the uncertainty in the SRM value
  2. the uncertainty in the calibration experiment
  3. the uncertainty due to comparison A
  4. the uncertainty due to comparison B

We use the rule (derived from the law of propagation of errors) that the square of the relative error of the final value is equal to the sum of the squares of the relative errors of the components.

More specifically, if ci is the final concentration value obtained for a particular sample (denoted by the subscript i) in the lot, we have (from step B):

ci = Ri CRef (1)

where Ri is the ratio of signals for sample i to the reference sample, and CRef is the concentration attached to the reference sample. But CRef is obtained in step A by averaging ten values obtained each as

Signal/k (2)

where k is the calibration value derived from the calibration experiment. In our case, k = 1.205. The average of the ten measurements may be described by

CRef = S/k (3)

where S is the average of ten replicate signal values. Combining (1) and (3) gives

ci = Ri (S/k) (4)

The value of k is obtained experimentally from a single signal value divided by the concentration of the SRM. Thus we may write:

k = So/Co (5)

where Co is the concentration attached to the SRM by the SRM certificate and So is the signal corresponding to it.

Combining (4) and (5), we obtain finally:

ci = Ri S (Co/So (6)

The law of propagation of errors gives:

(sci/ci) = (sRi/Ri) + (sS/S) + (Ssig So/So) + (sCo/Co)2 (7)

The first term of the right side represents step B; the second term, step A; and the third term the uncertainty of the calibration experiment itself. The last term represents the uncertainty of the SRM used for calibration.

We now estimate these four components.

1) sRi/Ri (step B) is obtained from the last column of Table 4:

sRi/Ri = 0.021/44.61 = 4.71 x 10-4 (8)

2) Ss/S (step A) is obtained from the calculations derived from Table 3:

Ss/S = 0.0057/44.62 = 1.28 x 10-4 (9)

3) sSo/So (calibration experiment) has not been measured, but we can assume the same precision as in step A for a single measurement:

sRi/Ri = 0.018/44.62 = 4.03 x 10-4 (10)

4) sCo/Co (uncertainty of SRM) is derived from the uncertainty stated in the SRM certificate. This stated uncertainty is equal to two standard deviations. Thus:

sCo/Co = (1/2) * (0.5/44.9) = 55.68 x 10-4 (11)

Adding the squares we have:

(sCo/Co)2 = [(4.71)2 + (1.28)2 + (4.03)2 + (55.68)2] x 10-8 = 3140 x 10-8

Hence:

sci/ci = ±56 x 10-4

Since all ci values are approximately the same and equal to

ci = 44.62 ppm, we obtain

sci = 56 x 10-4 x 44.62 = ±0.25 ppm (12)

We see that the predominant component of uncertainty, in this case, is that of the SRM.

Comparison of Producer's and Auditor's Values

By a calculation similar to that above, we obtain, for the auditor's value for a particular cylinder, say c*i:

(sc*i/c*i)2 = (Sct/ct)2 + (Sct/ct)2 = (SC*o/C*o)2

Step A is not present for the auditor's data. The symbol ct represents a count, and an average of ten counts; Co* represents the value given by the certificate for the SRM used in the calculation of the calibration factor. We have:

(sc*i/c*i)2 = ((0.10/SQRT(10))/44.4)2 + (0.10/44.4)2 + (0.25/45.3)2

= (7.12x10-4)2 + (22.52x10-4)2 + (55.19x10-4)2

= 3604x10-8 = (60x10-4)2

Since c*i = 44.4 ppm for both samples analyzed by the auditor, we have:

sc*i = 60x10-4x44.4 = ±0.27 ppm

We now obtain the following summary results (Table 7):

Table 7*
Sample 48Sample 29
Producer44.58 ± .2544.63 ± .25
Auditor44.38 ± .2744.44 ± .27

*The ± values represent standard errors in this table.

It is apparent that the result obtained by the auditor for each sample is not significantly different from that of the producer for the same sample. Thus, the auditor's values in this case, substantiate those provided by the producer.

In general, it may be assumed that there is no significant difference between the concentration claimed by the producer and that found by the auditor if the following expression is satisfied:

|ci - c*i| <= 2 SQRT(sc2i + sc2*i)


goto Index

goto Home Page.