A Framework for Pediatric HRQoL Selection

In pediatric drug development, selecting a health-related quality of life (HRQoL) measure has shifted from a secondary consideration to a regulatory expectation. The FDA's continued emphasis on patient-focused drug development ensures sponsors must demonstrate meaningful benefit to children and adolescents with the same rigor applied to adult populations.

Instrument selection in clinical trials is often influenced by precedent. Measures that have been widely used in prior studies or frequently cited in the literature are sometimes selected again in new trials. While historical use can provide useful context, it does not establish that an instrument is appropriate for a given study. The FDA's Patient-Focused Drug Development (PFDD) Guidance 3 (“Selecting, Developing, or Modifying Fit-for-Purpose Clinical Outcome Assessments”) is explicit on this point: fit-for-purpose selection begins with the concept of interest and the decision the data must support. Historical usage is context, not justification.

In pediatric trials, these decisions are further complicated by developmental differences across age groups, which can influence both the relevance of concepts measured and how outcomes are reported (e.g., child self-report versus caregiver proxy report).

This article outlines a practical framework for selecting a pediatric HRQoL instrument, and applies that framework to the Pediatric Quality of Life Inventory™ (PedsQL™).

Why pediatric HRQoL measurement is harder than it looks

Selecting a pediatric HRQoL measure is not simply a matter of adapting adult instruments for younger respondents. A valid pediatric COA must account for a wide developmental range of toddlers through adolescents, while producing data that remain interpretable across age groups. Additionally, it requires both child self-report and caregiver proxy-report formats, each validated independently. Items must be written at developmentally appropriate reading levels without sacrificing conceptual consistency across forms. And, for multinational trials, rigorous linguistic and cultural validation adds further scientific and operational complexity.

The FDA's final PFDD Guidance 3 (October 2025) reinforces that prior regulatory acceptance of an instrument does not establish fitness for purpose in a new context. Sponsors bear responsibility for demonstrating that their chosen COA captures the meaningful aspects of health (MAHs) relevant to the specific population and trial design. Below, we present a framework for meeting this responsibility.

A framework: Six criteria for pediatric HRQoL selection

1. Development Coverage

A pediatric COA should span the full age range relevant to the study population without requiring sponsors to switch instruments mid-program. Tools limited to narrow age bands create gaps in longitudinal data and complicate between-study comparisons.

Look for instruments that offer developmentally appropriate formats at each stage while maintaining construct consistency across age groups, so data collected from a 7 year old and a 15 year old within the same trial remains interpretable and comparable.

2. Psychometric and Clinical Validation

Validation data from a single study in a single population is a thin foundation. Regulators expect demonstrated reliability, validity, and responsiveness across a range of settings, conditions, and patient groups, particularly when an instrument is being applied in a context that differs from its original development sample. The strength of an instrument's evidence base is one of the most important factors in building a defensible regulatory submission.

3. Generic and Disease-Specific Architecture

Many pediatric trials require both a generic HRQoL measure for cross-condition comparisons and a disease-specific measure capturing what matters most in a given indication. However, administering two separate instruments increases respondent burden and complicates analysis.

Instruments with a modular architecture, where a generic core can be paired with validated condition-specific add-ons in a single administration, offer a meaningful practical and scientific advantage.

4. Linguistic Validation and Cross-Cultural Adaptation at Global Scale

Most pivotal pediatric trials are multinational. An instrument without rigorous cross-cultural adaptation for key global markets introduces measurement risk that is difficult to correct once a trial is underway.

The breadth of an instrument's linguistically validated translations, and the methodological rigor with which those translations were developed, is a practical indicator of how safely it can be deployed across international sites.

5. Interpretability Infrastructure

A score is only useful if it can be interpreted. Sponsors, clinical teams, and regulators must understand:

whether a change in score represents a clinically meaningful difference
how a study population compares to healthy children of the same age
what an absolute score implies about disease burden.

Instruments with established population norms and published minimum clinically important difference (MCID) estimates provide the interpretive infrastructure that regulatory submissions increasingly require.

6. Operational Feasibility

Scientific rigor means little if an instrument places unreasonable burden on young patients, caregivers, or site staff. In pediatric studies, completion burden is a genuine practical concern. Instruments that are brief, self-administrable by children across a wide age range, and feasible in community and clinical settings alike are far more likely to produce complete, high-quality data at scale.

Applying the framework to the PedsQL™

To demonstrate how this framework works in practice, we apply the six criteria to one widely used pediatric HRQoL instrument: Pediatric Quality of Life Inventory™ (PedsQL™), developed by Dr. James W. Varni.

Many pediatric HRQoL instruments perform well on one or two dimensions but show meaningful structural gaps on others -- limited age coverage, absence of child self-report in younger populations, fragmented disease-specific measurement requiring separate administrations, or insufficient normative and MCID data to support regulatory interpretation. In multinational trials, incomplete linguistic validation or inconsistent cross-cultural methodology can introduce additional measurement risk that is difficult to mitigate once a study is underway.

PedsQL™

The PedsQL™ 4.0 Generic Core Scales cover child self-reporting across three developmentally appropriate age bands (5–7, 8–12, and 13–18), with parent proxy-report forms spanning ages 2 through 18. Construct consistency is maintained across all forms, making the instrument suitable for trials that enroll across a broad pediatric age range.¹

Its psychometric evidence base is among the deepest available for any pediatric COA. The PedsQL™ was validated in over 25,000 children and adolescents (as of 2005) across hundreds of peer-reviewed publications, covering conditions including cancer, diabetes, asthma, cardiac disease, rheumatological disorders, cerebral palsy, and neuromuscular disease. In the inpatient setting, lower admission scores have been associated with longer hospital stays, 30-day readmissions, and emergency department returns, all of which are evidence of clinically meaningful predictive validity beyond psychometric benchmarks alone.³

The PedsQL™’s modular design allows the Generic Core Scales to be paired with validated disease-specific modules for cancer, diabetes, asthma, rheumatology, cardiac conditions, fatigue, pain, and others, with both generic and condition-specific scores derived from a single administration.⁴'⁵

Thousands of linguistically validated language versions are available, developed to established cross-cultural adaptation standards.¹ A normative database built from over 25,000 subjects supports population-level comparisons, and MCID estimates have been published across multiple therapeutic areas.⁶ At 23 items, the instrument typically takes under five minutes to complete.⁶

As with any COA, selection should be context-specific. However, when evaluated against the six criteria above, the PedsQL™ demonstrates strength across all domains in a way that is uncommon among pediatric HRQoL instruments The PedsQL™'s combination of developmental breadth, evidentiary depth, modular flexibility, global availability, and practical feasibility, however, makes it one of the few pediatric COAs that can be evaluated against all six criteria with confidence.

Making a defensible choice

In pediatric trials, instrument selection can no longer rely on habit or precedent. Regulators now expect sponsors to explain not just whether a measure is valid, but why it is appropriate for the specific population, indication, and decision the study is designed to support.

When that rationale is weak, the consequences surface late and at the worst possible time: additional validation requests, interpretability questions, or prolonged regulatory dialogue that slows submission timelines. A strong choice is one that can be clearly justified across age groups, across countries, and across therapeutic settings, without relying on historical precedent alone.

The PedsQL™ has been widely evaluated across multinational programs and diverse pediatric populations, generating a substantial body of evidence relevant to these expectations.

For sponsors evaluating candidate instruments against these criteria, the PedsQL™ frequently emerges as a strong option due to its developmental continuity, extensive evidence base, global readiness, and practical feasibility.

Checklist: Application of the framework to the PedsQL™

Assessment Against Gold-Standard Selection Criteria

Mapi Research Trust manages global distribution and licensing of the PedsQL™ on behalf of its developer, Dr. James W. Varni. Through ePROVIDE, sponsors and researchers can review available modules, linguistic validations, and supporting documentation to determine whether the PedsQL™ aligns with their study’s needs.

Learn more about the PedsQL™ and access its validation resources through ePROVIDE to inform your pediatric endpoint strategy.

References

1. Varni JW, Burwinkle TM, Seid M. The PedsQL as a pediatric patient-reported outcome: reliability and validity of the PedsQL Measurement Model in 25,000 children. Expert Review of Pharmacoeconomics & Outcomes Research. 2005;5(6):705–719.

2. Varni JW, Seid M, Kurtin PS. PedsQL 4.0: Reliability and Validity of the Pediatric Quality of Life Inventory Version 4.0 Generic Core Scales in Healthy and Patient Populations. Medical Care. 2001;39(8):800–812.

3. Desai AD, Zhou C, Stanford S, Haaland W, Varni JW, Mangione-Smith RM. Validity and responsiveness of the Pediatric Quality of Life Inventory (PedsQL) 4.0 Generic Core Scales in the pediatric inpatient setting. JAMA Pediatrics. 2014;168(12):1114–1121.

4. Varni JW, Seid M, Rode CA. The PedsQL: measurement model for the pediatric quality of life inventory. Medical Care. 1999;37(2):126–139.

5. Varni JW, Seid M, Smith Knight T, et al. The PedsQL in pediatric rheumatology: reliability, validity, and responsiveness of the Pediatric Quality of Life Inventory Generic Core Scales and Rheumatology Module. Arthritis & Rheumatism. 2002;46(3):714–725.

6. Varni JW, Burwinkle TM, Seid M, Skarr D. The PedsQL 4.0 as a pediatric population health measure: feasibility, reliability, and validity. Ambulatory Pediatrics. 2003;3(6):329–341.

7. U.S. Food and Drug Administration. Patient-Focused Drug Development: Selecting, Developing, or Modifying Fit-for-Purpose Clinical Outcome Assessments. Final Guidance for Industry. October 2025.

A Framework for Pediatric HRQoL Selection: Evaluating the PedsQL™

Page tools