Mike Ulman

Mike Ulman

Technology Manager, Packaging and Delivery Systems

February 18, 2025

Sample Sizes for Performance Testing of Combination Products and Drug Packaging

When developing combination products or evaluating primary packaging for drug products one must evaluate the performance of the devices and the packaging to ensure that they are fit-for-purpose. Numerous international standardsi and pharmacopeial chaptersii focus on functional performance testing and they all recommend conducting tests that simulate real-world conditions based on the intended use of the device or packaging. This simulation includes matching the physical properties of the drug when using placebos, duplicating the number of punctures and the expected needle size that would be encountered in actual practice and testing samples that had been conditioned with the appropriate sterilization treatments, transportation simulations, storage conditions and thermal exposure to approximate actual expected sample environmental exposure which might affect performance.

Scientist testing combination products and drug packaging in lab

After determining the appropriate design verification tests that will be needed to establish that the combination product meets the design input requirements or that the drug packaging adequately protects the drug product, the next consideration is justifying the number of test samples required to demonstrate acceptable performance, a design verification requirementiii. Addressing this starts with deciding on the required reliability of the combination product or drug packaging failure mode being evaluated, which is integral to the assessment of risk.

A high-risk failure mode generally necessitates 99% reliability to ensure that the failure occurs no more than 1% of the time. Meanwhile, a medium-risk failure mode typically requires 97.5% reliability, whereas a low-risk failure mode is usually acceptable with 95% reliability (sometimes those reliability values are chosen to be 99%, 95% and 90%, respectively). For whatever failure mode the test is evaluating, consider the consequence of failing to meet the acceptance criteria and the severity of the resulting harm. Critical harms typically involve patient death and are high risk. Major harms typically result in permanent patient impairment or the loss of the device’s primary function (i.e., drug delivery to the appropriate location) and are medium risk. Minor harms usually result in patient discomfort, where the harm is reversible, and are low risk.

To determine the appropriate sample size for variable tests (those tests which generate a numerical value for a result) in design verification, a small sample size of about 10-20 samples could first be tested to determine how the product performs (alternatively, existing data from previous studies may be utilized). Statistical techniques can be used to determine the sample size for testing based on the confidence level (typically 95%), the desired reliability level, the average result of the above initial sample set, the standard deviation of the results from the above initial sample set and the acceptance criteria (or specification limits) of the test method. If the results for each member of the sample set are very similar (displaying a small standard deviation) and are far from the specification limit(s), then a small sample set would be sufficient in design verification to assure that the product would meet the specifications. However, if the results show a wide range of values and/or are close to the specification limit, it is necessary to use a larger sample size in design verification to assure that outliers do not exceed the specification. Alternatively, ISO 11608-1:2022 Annex Fiv simply recommends using 20 samples for 95% reliability and 30 samples for 97.5% reliability, for test result data that is normally distributed.

For tests that produce pass/fail results, known as attribute test methods, one will find tables, such as the one shown in ISO 11608-1:2022 Annex F and reproduced as Table 1 below, which show that for 99% reliability and 95% confidence, 299 samples (n) must be tested; for 95% reliability and 95% confidence, 59 samples (n) must be tested; while for 90% reliability and 95% confidence, 29 samples (n) must be tested. These values come from the calculation of an attribute sampling plan using the Bayes Success-Run Theorem,v which states that:

n=ln(1 Confidence)ln(Reliability)

Table 1 – Recommended attribute sample size, per ISO 11608-1:2024 Annex F

Notice that this table shows that successfully testing a single sample only produces assurance of 5% reliability while successfully testing 5 samples only produces assurance of 50% reliability. This is not to say that 95% and 50% of samples, respectively, will fail but that one can only be assured that at least 5% and 50% of samples, respectively, will pass.

For variable test methods, results can be close to the acceptance criteria (just passing) or far away (comfortably passing). For attribute tests, the results, by definition, either pass or fail with no indication of the margin of passing or failing and therefore a larger sample size is required. If one needs to establish 99% reliability at 95% confidence, attaining 299 passing test results might prove to be challenging because, for one reason or another, one may have a failure, which is acceptable because one is looking for 99% reliability and not 100% reliability. Using the binomial distribution function, one can calculate the number of samples that must be tested to establish reliability while observing zero, one or even two failures at the approximately 95% confidence level. For example, if one wants to show 99% reliability and tests 299 samples but achieves two failures, one could test additional samples, up to a total of 628, and as long as there are only the two failures, then one has still demonstrated 99% reliability. Of course, one should not “test into compliance” and, if anticipating a chance of failure, it is best to plan to test 628 samples from the start. These permissible failures to still achieve the desired level of reliability, at a minimum of a 95% confidence level, are shown in Table 2.

Table 2 – Permissible failures to still achieve the desired level of reliability, at a minimum of a 95% confidence level, based on the binomial distribution function.
Zero FailuresOne FailureTwo Failures
ReliabilitySamplesConfidenceSamplesConfidenceSamplesConfidence
99.0%2990.950464730.95026280.9502
97.5%1190.950851880.95012500.9503
95.0%590.95151930.95001240.9505
90.0%290.95290460.9520610.9509

Consider the testing in USP <382> for fragmentation, which is based on ISO 8871-5:2016 Annex B. The test consists of puncturing 12 vials, not-less-than four times each (as anticipated in the intended application), gathering all the generated elastomer or coating film fragments, and counting those ≥ 150 µm in their longest linear dimension. The acceptance criterion for this test is that not more than five fragments are formed from the total of the not-less-than 48 punctures. That test result is a single data point for an attribute test (the test does not worry about how many fragments are observed, as long as it is not more than five – a pass/fail criterion). If one were to evaluate the risk of having excessive particles in an injectable and if, hypothetically, one requires 90% reliability, one would need 29 data points of passing results. That means that one would need to evaluate 12 stoppers/test x 29 tests = 348 stoppers to demonstrate 90% reliability.

It is crucial to ensure that once test methods are selected for product design verification, the results are statistically meaningful. This can be accomplished by using an adequately large sample size, which is determined by the type of test method (variable or attribute) and the desired levels of reliability and confidence.

West Analytical Services has a laboratory in Exton, PA (US-FDA inspected) and one in Waterford, Ireland (an Irish-HPRAvi accredited facility located in an EU Member State) ready to help design and perform Design Verification, stability and release testingvii for combination products. Contact West to learn more about how we can help you with your testing needs.

References

Vials: International Organization for Standardization. (2022). Elastomeric parts for parenterals and for devices for pharmaceutical use - Part 5: Functional requirements and testing (ISO Standard No. 8871-5:2016). Available at: https://www.iso.org/standard/68560.html
Cartridges: International Organization for Standardization. (2011). Prefilled syringes Part 2: Plunger stoppers for dental local anaesthetic cartridges (ISO Standard No. 11040-2:2011). Available at: https://www.iso.org/standard/53947.html
International Organization for Standardization. (2012). Prefilled syringes Part 3: Seals for dental local anaesthetic cartridges (ISO Standard No. 11040-3:2012). Available at: https://www.iso.org/standard/53948.html
International Organization for Standardization. (2017). Pen systems - Part 2: Plunger stoppers for pen-injectors for medical use (ISO Standard No. 13926-2:2017). Available at: https://www.iso.org/standard/72576.html
International Organization for Standardization. (2019). Pen systems - Part 3: Seals for pen-injectors for medical use (ISO Standard No. 13926-3:2019). Available at: https://www.iso.org/standard/75131.html
International Organization for Standardization. (2022). Needle-based injection systems for medical use — Requirements and test methods — Part 3: Containers and integrated fluid paths (ISO Standard No. 11608-3:2022). https://www.iso.org/standard/76626.html
Autoinjectors: International Organization for Standardization. (2022). Needle-based injection systems for medical use — Requirements and test methods — Part 5: Automated functions (ISO Standard No. 11608-5:2022). Available at: https://www.iso.org/standard/76627.html
On-Body Delivery Systems: International Organization for Standardization. (2022). Needle-based injection systems for medical use — Requirements and test methods — Part 6: On-body delivery systems (ISO Standard No. 11608-6:2022). Available at: https://www.iso.org/standard/72451.html
Pre-filled Syringes (glass): International Organization for Standardization. (2024). Prefilled syringes Part 4: Glass barrels for injectables and sterilized subassembled syringes ready for filling (ISO Standard No. 11040-4:2024). Available at: https://www.iso.org/standard/82222.html
International Organization for Standardization. (2016). Prefilled syringes Part 8: Requirements and test methods for finished prefilled syringes (ISO Standard No. 11040-8:2016). Available at: https://www.iso.org/standard/66036.html
Single-use Syringes: International Organization for Standardization. (2017). Sterile hypodermic syringes for single use - Part 1: Syringes for manual use (ISO Standard No. 7886-1:2017). Available at: https://www.iso.org/standard/64790.html
ii USP <382> Elastomeric Component Functional Suitability in Parenteral Product Packaging/Delivery Systems; United States Pharmacopeia (2021)
Ph. Eur. 3.2.9 Rubber closures for containers for aqueous parenteral preparations, for powders and for freeze-dried powders; European Pharmacopoeia, Supplement 11.1 (2022).
iii International Organization for Standardization. (2016) Medical devices — Quality management systems — Requirements for regulatory purposes. (ISO Standard No. 13485:2016). https://www.iso.org/standard/59752.html
iv International Organization for Standardization. (2022). Needle-based injection systems for medical use — Requirements and test methods — Part 1: Needle-based injection systems (ISO Standard No. 11608-1:2022). Available at: https://www.iso.org/standard/70733.html
Mark Durivage, How To Establish Sample Sizes For Process Validation Using The Success-Run Theorem, Pharmaceutical Online, July 19, 2016 Available at: https://www.pharmaceuticalonline.com/doc/how-to-establish-sample-sizes-for-process-validation-using-the-success-run-theorem-0001
vi The Health Products Regulatory Authority (HPRA) is the Irish regulatory agency for medicines, medical devices and other health products.
vii Directive 2001/83/EC of the European Parliament and of the Council of 6 November 2001 on the Community Code Relating to Medicinal Products for Human Use, which states that medicines that are manufactured outside the EU, in countries that do not have a Mutual Recognition Agreement with the EU, must be retested within an EU Member State for release.

 

Recommended Blogs