Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
Physical Address
304 North Cardinal St.
Dorchester Center, MA 02124
ASTM D4855-97 (Reapproved 2002) establishes a comprehensive statistical and experimental framework for evaluating and comparing test methods under strictly controlled conditions. Administered by Committee D13 on Textiles (Subcommittee D13.93 on Statistics), this practice is essential for analysts who need to determine if a new test method measures a property as well as, or better than, an existing standard method. The practice specifically covers how to obtain and compare estimates of precision, sensitivity, and bias by testing the same materials during the same time span.
The statistical design assumes a carefully planned experiment where the primary sources of variability are controlled to isolate the true differences between the methods. The scope explicitly covers topics ranging from the basic requirements for materials to specific procedures for comparing precision and evaluating bias.
Accurate application of this standard hinges on a firm understanding of specific statistical terms as defined within Section 3 of the document. The following table summarizes the critical definitions that form the backbone of the comparative analysis.
| 📖 Term | 🔍 Definition (per D4855) | 💡 Practical Significance |
|---|---|---|
| Accuracy | Degree of agreement between the true value (or accepted standard value) and the average of many observations. | Increased accuracy is associated with decreased bias. Two methods can have equal accuracy even if one is more precise, as long as their bias relative to the true value is equivalent. |
| Bias | A constant or systematic error in test results. | Can exist between a method and a reference value, between two methods, or between two operators within a single method. Identifying this is a primary goal of the practice. |
| Confidence Interval | The interval estimate of a population parameter. | Defines the range within which the true difference between methods is likely to fall. |
| Confidence Level | The stated proportion of times the interval is expected to include the parameter. | Typically set at 95% for general use but must be adjusted based on the consequences of an incorrect estimation. |
The core of the practice revolves around Sections 8 through 12, which outline the Basic Statistical Design and specific procedures. The practice heavily promotes the use of the TEX-PAC software adjunct (referenced in Note 1), a suite of PC programs available through ASTM Headquarters that automates the calculations for comparing precision, sensitivity, and bias.
| 📐 Section | ⚙️ Objective / Procedure | 🎯 Key Deliverable |
|---|---|---|
| 6 & 7 | Evaluating Test Methods & Sensitivity Criterion | Establish a baseline for the capability of each test method. |
| 8 & 9 | Basic Statistical Design & Experimental Procedure | Define the controlled test plan, including materials, operators, and laboratories. |
| 10 | Procedure for Comparing Precision | Analyze repeatability and reproducibility to determine which method is more consistent. |
| 11 | Evaluating the Bias Between Test Methods | Determine if a systematic error exists between the test methods under comparison. |
| 12 | Procedure for Comparing Sensitivities | Assess the relative ability of the methods to detect differences in the material property. |
This practice provides procedures for comparing the precision (consistency of results), sensitivity (ability to detect differences in the property), and bias (systematic error) of two or more test methods under controlled conditions using the same materials and time span.
While it is under the jurisdiction of ASTM Committee D13 on Textiles, the statistical principles and procedures described are applicable to comparing test methods in any field where controlled comparative experiments are required. It leverages standards like E456 (Terminology Relating to Quality and Statistics) to ensure broad statistical relevance.
The TEX-PAC software (referenced in Note 1 of the standard) automates the complex statistical calculations required by this practice. It processes experimental data to compare precision, sensitivity, and bias, and directly generates statements featured in the Report section (Section 13) on the relative merits of the test methods.
This condition is fundamental to the comparative design. Using the same materials eliminates material variability as a confounding factor. Testing them during the same time span controls for drift in laboratory conditions, operator skill, or environmental factors that could otherwise unfairly bias the comparison of the methods.