There is growing acknowledgement among international stakeholders, including regulators, health technology assessment (HTA) bodies and professional societies, that the inclusion of patient‐reported outcome measures (PROMs) in clinical trials should be guided by clear rationales.1 In this regard, existing PROMs may overreach or fall short in measuring domains most relevant for a specific study's context, population, treatments and stakeholders.
In response to the need for flexibility in assessing specific health‐related quality of life (HRQoL) domains and regulatory recommendations,1 PROM developers and stakeholders have provided guidance on a modular approach.2,3,4 Although most guidance has focused on cancer clinical trials, the same principles are applicable to any context where PROMs are used (eg, clinical practice). In trials, the modular approach has been defined as the selection and assessment of specific patient‐relevant and clinically relevant domains of interest, purposefully selected from multidomain PROMs, and then independently scored and interpreted.3,4
Selection of the most relevant disease‐ and treatment‐specific domains can be based on previous research, consumer advice and clinical experience. Stakeholders have also recommended using conceptual frameworks and core patient‐reported outcome (PROs) sets to guide the selection of PROM domains.5 HRQoL concepts identified as primary or key secondary outcomes should ideally be assessed using the most valid, reliable and dedicated measures, which are often more in‐depth than domains in multidomain PROMs used to provide a broad overview. For example, if pain is a primary outcome in a trial investigating an anti‐cancer treatment, the Brief Pain Inventory might be a comprehensive substitution for the two‐item domain of pain from the widely used cancer‐specific HRQoL PROM, the European Organisation for Research and Treatment of Cancer Quality of Life Core Questionnaire (EORTC QLQ‐C30).6 Secondary outcomes, such as appetite loss and constipation, might be adequately assessed by the QLQ‐C30's single‐item domains for these outcomes. Thus, we propose three possible applications of the modular approach (Box 1): (i) using a study‐specific conceptual framework, incorporating dedicated PROMs or domains to measure relevant HRQoL concepts; (ii) using a full‐length PROM, removing domains less relevant to the study context or not covered by a core PRO set; or (iii) using a full‐length PROM, substituting domains that are primary or key secondary outcomes with dedicated PROMs or domains.
Box 2 presents the cases for and against the use of a modular approach. In this perspective article, we discuss some of these arguments in the context of assessing clinical efficacy and conducting economic evaluations.
The case for the modular approach
A modular approach can address respondent burden associated with lengthy PROMs and the potential for missing or poor quality PRO data in clinical trials7 (Box 3, Example 1). However, the aim of a modular approach is to prioritise assessment of the most important domains in each context, rather than to reduce PROM length. Indeed, patients may be willing to complete longer measures if items are relevant to their experience.10 Additionally, administering domains relevant to the clinical trial context can improve sensitivity to clinically important changes in PROs (Box 3, Example 2).
The modular approach also enables flexibility in assessing different domains at various timepoints. Domains that are conceptually proximal to the disease and treatment (eg, nausea severity) are more likely to change over a short period of time than more distal domains (eg, emotional function), and may therefore require more frequent assessment.11
To preserve the psychometric properties of domains resulting from a modular approach, working with PROM developers and following published recommendations is encouraged.2 If selected domains are obtained from PROMs validated in the target population, they are likely to retain psychometric properties, such as content and construct validity. As additional psychometric testing may not be required, this can be beneficial for novel treatments, given faster evaluation and approval times,12 or for patients with rare cancers, where developing or validating bespoke disease‐specific PROMs may be challenging.
The case against the modular approach
The assessment of a broad set of items promotes the integration of PROMs into early‐phase trials (phase 1 or phase 2) for capturing potential symptomatic adverse events and HRQoL problems. However, once these issues are identified, investigators must decide which domains to assess in late‐phase trials, and to what extent. It is unsurprising that many may default to using full‐length PROMs to maintain comparability with existing studies, avoid missing unexpected effects and meet HTA agencies’ requirements for economic evaluation.
To complicate matters, the modular approach necessitates greater scrutiny in the balance between the depth and breadth of issues and respondent burden (ie, time taken). Although this choice can be aided by international guidance and patient and public involvement,13,14 it becomes less clear for domains amidst an abundance of options from various PROMs for the same HRQoL concept. Studies comparing the psychometric properties of PROMs in various study contexts can assist in this selection15 but, traditionally, conclusions have been drawn by assessing full‐length PROMs instead of their domains. Psychometric evidence for specific combinations of subscales and items obtained from the EORTC suite are emerging,16 but evidence for the combination of subscales across different PROMs is lacking. This raises several questions requiring further conceptual and empirical consideration: Should only domains that have demonstrated the best psychometric evidence be selected and combined? How will a mixture of recall periods and response options across PROMs impact face validity and response distributions? What strategies can avoid biased selection of domains that may favour certain effects while downplaying others?
Guidance on the modular approach suggests that item order can affect the validity of subscales because responses may be influenced by the preceding questions.2,4 These potential biases are usually addressed during early development of a PROM, which supports the argument for keeping PROMs in their original format unless there is potential for retesting. However, item order effects are mostly theoretical, with limited research available that has tested these directly.17 Order effects have primarily been considered at the questionnaire level and conclusions regarding their presence have only been conservatively applied to the PROMs assessed.17
The use of the modular approach in economic evaluation
The evolution of PROM design should progress alongside other fields that rely on PROMs. In economic evaluations, preference‐based measures (PBMs) are commonly used to acquire utility values for calculating quality‐adjusted life years in cost–utility analyses. PBMs can be derived from PROMs using a subset of items from the existing measure (eg, the Short Form‐6 Dimension (version 2) [SF‐6Dv2] from the generic 36 Item Short Form Survey (version 2) [SF‐36v2];18 the EORTC Quality of Life Utility – Core 10 Dimensions [QLU‐C10D] from the EORTC QLQ‐C3019).
A key consideration in applying a modular approach is that the resulting measure may lack the full subset of items necessary for estimating utility values for economic evaluations. For instance, in response to the United States Food and Drug Administration's recommendation of a core PRO set, the EORTC developed the Core Function Questionnaire QLQ‐F17, which includes only the functional domains of the QLQ‐C3020 (ie, our proposed Approach B in Box 1). However, the QLQ‐F17 does not enable estimation of utility values using the QLU‐C10D algorithm.
It is uncertain if the same considerations and recommendations of the modular approach used to determine clinical efficacy can be applied to PBMs. Similar to domains in the modular approach, derived PBMs can be independently scored, interpreted and validated. Improving the flexibility of PBMs has also been explored by administering derived PBMs as standalone measures. This may reduce respondent burden when the full‐length PROM adds little additional information, and when the derived PBM has been shown to be valid in the population of interest. For instance, the SF‐6Dv2 can either be derived from responses to the SF‐36v2, or administered as standalone measures.21 These approaches have yielded different utility values in patients with breast cancer, with evidence supporting the use of the standalone versions for enhanced sensitivity and discriminative power.22,23 However, derived PBMs are typically not designed as standalone measures, nor do they serve the same purpose as modules intended to improve clinical relevance.
Future discussions of the modular approach should strive for further consensus on the definition, ensuring it is both clinically relevant and suitable for economic evaluations. Additionally, it is unclear if these novel approaches to PROM administration will be acceptable for economic evaluations submitted to HTA agencies (eg, the Pharmaceutical Benefits Advisory Committee). A key challenge for these HTA agencies will be balancing comparability across different health conditions, relevance to patient populations, and the validity and responsiveness of PBMs.
Conclusion
The modular approach involves replacing existing full‐length PROMs with a focused selection of HRQoL domains most relevant to a particular clinical trial context. Several considerations must be addressed before integrating the modular approach into clinical trials, including achieving broader acceptance from regulatory bodies (eg, HTA agencies). These considerations include expanding and clarifying the definition of what constitutes a module and how to apply the modular approach, while ensuring its suitability for economic evaluations. Further evidence that alleviates concerns associated with a modular approach, particularly evidence supporting its psychometric validity, would improve its acceptability in the field.
Box 1 – Suggested applications of the modular approach for patient‐reported outcome measures (PROM) inclusion in clinical trials
|
Example of application |
Specific considerations |
|||||||||||||
|
|||||||||||||||
Context |
Dr Smith was developing a randomised phase 2 study of drug X + Y versus drug X alone for the management of symptoms in a population with disease Z. |
|
|||||||||||||
Approach A: using a study‐specific conceptual framework, incorporating dedicated patient‐reported outcome measures (PROMs) and/or domains to measure relevant health‐related quality of life (HRQoL) concepts |
Dr Smith identified patient‐reported concepts that are relevant and important to patients with disease Z, and which are expected to be affected by drug X +/‐ Y.
|
Match the concepts of interest with the selected PROM domains and respective items to ensure they are conceptually similar. |
|||||||||||||
Approach B: using a full‐length PROM, removing domains less relevant to the study context |
Dr Smith planned to use PROM B in his study. PROM B is disease‐Z‐specific and consists of five multi‐item domains: fatigue, pain, body image, weight change and musculoskeletal symptoms. |
Consider if the subscale that was removed covers a concept that may arise at a different timepoint during the trial. Administering this subscale at specific timepoint(s) may be required. |
|||||||||||||
Approach C: using a full‐length PROM, substituting domains that are primary or key secondary outcomes with dedicated PROMs or domains. |
Dr Smith planned to use PROM C‐1 in his study. PROM C‐1 is disease‐Z‐specific and consists of ten domains, all of which cover the concepts of interest that are important to his study. |
Evidence showing that dedicated PROM/domain psychometrically performs better than the substituted domain may help to the strengthen the rationale for substitution. |
|||||||||||||
|
|||||||||||||||
|
Box 2 – Summary of the cases for and against use of the modular approach for patient‐reported outcome measures (PROMs) in clinical trials
Case for the modular approach |
Case against the modular approach |
||||||||||||||
|
|||||||||||||||
Promotion of scientific rigor |
|
||||||||||||||
|
|
||||||||||||||
Respondent burden |
|
||||||||||||||
|
|
||||||||||||||
Psychometric/measurement properties |
|
||||||||||||||
|
|
||||||||||||||
Trade‐off between flexibility and comparability |
|
||||||||||||||
|
|
||||||||||||||
|
|||||||||||||||
|
Box 3 – Examples of how a modular approach can improve patient‐reported outcome data quality
Example 1:
The Expanded Prostate Cancer Index Short Form (EPIC‐26) was developed based on a comprehensive set of symptoms associated with various treatment types for men with localised prostate cancer.
However, a content analysis of the EPIC‐26 revealed significant missing data in the urinary function and urinary bother domains for patients using urinary catheters, as the response options were not relevant.8 Excluding these subscales for use in this patient population would improve the instrument's precision while enhancing the relevance of its items.
Example 2:
An analysis of the United States Food and Drug Administration‐registered trials for new immune checkpoint inhibitors found that patient‐reported outcome (PRO) strategies did not assess all eight adverse events unique to these novel immunotherapy agents.9 Commonly used cancer‐specific patient‐reported outcome measures, such as the European Organisation for Research and Treatment of Cancer Quality of Life Core Questionnaire (EORTC QLQ‐C30), are supplemented by relevant PRO domains (eg, assessing rash and itching) to better inform treatment toxicity while minimising the potential for biased assessment of the patient experience.
Provenance: Not commissioned; externally peer reviewed.
Open access:
Open access publishing facilitated by University of Technology Sydney, as part of the Wiley – University of Technology Sydney agreement via the Council of Australian University Librarians.
No relevant disclosures.