Skip to main content

Patient-reported outcome measures in functional dyspepsia: a systematic review and COSMIN analysis

Abstract

Background

Functional dyspepsia (FD) as a type of disorders of brain-gut interaction (DBGI), patient self-reporting of its symptoms becomes an important component of clinical outcome assessment. We performed a systematic review using Consensus Based Standards for the Selection of Health Measurement Instruments (COSMIN) guidelines to identify the best available patient-reported outcome measure (PROM) of FD.

Methods

The study was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA). We searched four databases with no date limit, looking for previously confirmed PROMs for evaluating FD symptoms. An overall rating was then assigned based upon COSMIN guidelines, and the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) approach was used to assess the level of evidence for psychometric properties of included PROMs.

Results

Thirty articles covering outcome indicators of 24 patient reports were included. The Leuven Postprandial Distress Scale (LPDS) showed adequate content validity and moderate quality evidence of adequate internal consistency to generate an A recommendation.

Conclusion

LPDS is currently the most recommended PROM for patient self-reported FD symptoms. However, it fails to assess two important areas of cross-cultural validity/ measurement invariance and measurement error. Future research can be continuously improved on this basis.

Peer Review reports

Introduction

Functional gastrointestinal disease as a chronic disease is a very common clinical condition encountered in the clinical practice of gastroenterology, and the Rome IV redefines functional gastrointestinal disease as disorders of gut-brain interaction [1]. Functional dyspepsia (FD) is one of its most common types, divided into postprandial distress syndrome (PDS) and epigastric pain syndrome (EPS). Early satiation and postprandial fullness are the main symptoms of PDS, while epigastric pain and epigastric burning in the upper abdomen are the main symptoms of EPS. In addition, upper abdominal bloating, postprandial nausea and excessive belching are also considered important additional symptoms of FD [2, 3]. The prevalence of functional dyspepsia is 10–30% worldwide and varies between regions, with a higher average prevalence in Asia than in Europe and the Americas [4,5,6]. And under the influence of the COVID-19, the incidence and recurrence rates of the disease have increased significantly, consuming a large amount of medical resources and placing a heavy burden on the health care system [7]. The high incidence and persistence of FD seriously affects the quality of life of patients. However, the pathogenesis of FD has not been fully elucidated so far, and it is generally believed to be associated with multiple pathophysiological mechanisms, with gastrointestinal motility disorders, visceral hypersensitivity, immune dysfunction, altered gastrointestinal microbiota, abnormal gut-brain interactions, psychosocial factors, and genetic susceptibility as existing etiologies [8]. Because of its complex pathogenesis, the clinical use of pro-gastrointestinal drugs, antacids and antidepressants and other drug therapy has a positive effect. In addition, non-pharmacological treatments such as acupuncture and hypnosis may be effective, but the corresponding research is scarce [9,10,11,12].

Efficacy assessment as an important indicator to evaluate the effectiveness of treatment regimens has been an aspect of general interest in various research areas. However, up to now, there is no "gold standard" for assessing the effectiveness of FD treatment protocols, and the most appropriate assessment scale remains to be determined. FD has been suggested as a group of symptoms originating in the gastric and duodenal regions, without underlying organic, systemic or metabolic disease that might explain the symptoms [13]. This is why patients' self-reporting of their symptoms becomes an important element of efficacy assessment and is more authentic and credible. The Food and Drug Administration (FDA) Guidelines similarly recommend the use of well-defined patient-reported outcome measures (PROMs) to assess treatment outcomes [14]. A recent systematic evaluation summarized the prom of symptom assessment in patients with FD from 1970 to 2017, but has not assessed the risk of bias and level of evidence for psychometric properties [15]. According to COSMIN guidelines [16] and GRADE approach [17], this study attempts to comprehensively analyze the PROMs included in the study, and formulate its recommendation level, so as to select the best PROMs for evaluating FD symptoms and provide reference for clinicians to choose an appropriate curative effect evaluation scale.

Methods

Data sources and search strategy

One researcher conducted a literature search in PubMed, Cochrane, Embase, and Web of science databases on October 7, 2022, with no date restrictions and using English as a filter. The search formula for each database is shown in Supplementary Material 1. In addition, the researchers checked the reference list of the literature read in full for additional relevant citations (Fig. 1).

Fig. 1
figure 1

Flow diagram of the search and selection process

Inclusion and exclusion criteria

Studies were original literature from PROMs or psychometric literature, applicable to adult patients with a confirmed FD and published publicly in academic journals in English. Studies were excluded if they were reviews, commentaries or responses to the original study, or abstracts without the full text (presented at a conference). In addition, studies that were not symptom assessments (e.g., scales used to diagnose FD) or that assessed only quality of life, perceived knowledge, or psychology were excluded (Fig. 1). Titles and abstracts found in the literature search were judged independently by two researchers, and for the remaining full-text articles were also independently searched and judged eligible by two reviewers. Both researchers agreed on the inclusion of these articles, and if any inconsistencies arose, agreement was reached by consulting a third researcher. Full-text articles were screened if at least one researcher considered a study relevant based on the abstract, or if there was doubt.

Data analysis

After removing duplicates from four different databases using Endnote software, three researchers extracted data from the included studies using a standardized data collection tool. Two researchers independently evaluated the psychometric properties and methodological quality of the included PROMs using the COSMIN systematic review guidelines [16] and cross-checked the results. The GRADE approach was then used to synthesize the level of evidence for the inclusion of each PROMs and to form a final recommendation for the scale. In case of disagreement, it was resolved through consultation and agreement with a third researcher.

  1. (1)

    COSMIN Risk of Bias checklist [18]: COSMIN Bias Risk Checklist is a part of COSMIN Guidelines. The methodological quality of the included scales was assessed using the COSMIN Risk of Bias Inventory in five dimensions: very good (V), adequate (A), doubtful (D), insufficient (I), or not assessed (NA). If the risk of bias differs across entries in a measurement characteristic, the overall risk of bias rating (related to the method used) is assessed based on the "lowest score count" principle.

  2. (2)

    COSMIN quality criteria [19]: COSMIN quality criteria is another part of COSMIN guidelines. The psychometric characteristics of the included scales were assessed using the COSMIN quality criteria, and the included PROMs were rated as adequate ( +), inadequate (-), inconsistent ( ±), or uncertain (?) based on the results reported in the individual studies. If only one study assessed the psychometric properties of a scale, it was used as an overall quality assessment; if multiple studies jointly assessed a particular psychometric characteristic of a scale, their individual scores were aggregated to provide an overall quality assessment of the instrument.

  3. (3)

    GRADE approach [17]: the COSMIN modified GRADE was used to assess the level of evidence of the included scales into 4 dimensions: high (High), moderate (Moderate), low (Low) and very low (Very low).COSMIN assumes that each measured characteristic is of high quality and then downgrades them according to the following 4 components: (i) risk of bias (if there is a serious, very serious, or extremely serious risk of bias, then downgrade by one, two, or three levels, respectively); (ii) inconsistency (if the inconsistency is serious or very serious, then downgrade by one or two levels, respectively); (iii) imprecision (if the sample size is between 50 and 100, then downgrade by one level; if the sample size is < 50, then downgrade by two levels; not applicable to content validity, structural validity, and cross-cultural validity); (iv) indirectness (if it is serious of or very serious indirectness, then the grade was lowered by one or two levels, respectively). Evidence recommendations are divided into three categories: A, B, and C. Level A: has "sufficient ( +)" content validity (for any level of evidence) and "sufficient ( +)" internal consistency (for at least low quality levels of evidence); Level C: has high quality evidence of "inadequate (-)" measurement properties; Level B: Level B if it does not belong to Levels A and C.

Results

Seven thousand seven hundred seventy-six literatures were retrieved from four databases, and 5883 remained after excluding 1893 duplicate literatures; then, each paper’s title and abstract was read one by one, and finally, 69 articles met the inclusion criteria. After reading the full text and further screening the literature, 30 studies including 24 FD symptom assessment scales were finally included [20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49]. The details of the selection process can be seen in Fig. 1.

Summary of study data

The general information on the included PROMs is shown in Table 1. The included studies were divided into developmental, validation and intervention studies, the number of patients assessed ranged from 35–1633. Their publication countries were dispersed, with the most scales studied and published in the USA (DHSI, DSSI, SODA, GISSI and FDSD), followed by the UK (GDSS, CDQ, LDQ and SF-LDQ). The number of entries for each PROMs ranged from 5 to 40, with the NDI scale having the highest number of entries, which assessed both symptom and quality of life components; the longest recall period was 24 weeks (GDSS and LDQ), and the shortest recall period was the ESM-PROM which is a real-time assessment. Multiple translated versions are available for the following included PROMs: GSRS, DHSI and NDI (8 translations) and PAGI-SYM (7 translations). English translated version is the most common version in included PROMs (21 of 24), and 5 PROMs (GDSS, LDQ, GIS, FDSD, and ESM-PROM) reported the fill time, but the rest were not mentioned. 14 PROMs were scored using Likert scale (4–7 points), 2 PROMs used an 11-point numeric rating scale (NRS), and the remaining studies used Rasch model calibration scales, images, or smiley faces for outcome measures. More detailed information can be seen in Table 1.

Table 1 Details of the findings

Content validity assessment

Content validity includes relevance, comprehensiveness and comprehensibility. Whether PROMs assessed all symptoms of FD is one aspect of content comprehensiveness (Table 2). The included PROMs were evaluated based on FD symptoms as defined by the Rome IV. Seven PROMs (NDI, DSSI, LPDS, FDSD, ESM-PROM, FGI-Checklist and SEQ-DYSPEPSIA) scales can evaluate four core symptoms and three additional symptoms of FD, followed by GOS and GISSI, which can evaluate six discomfort symptoms. The remaining PROMs scales all assessed symptoms incompletely.

Table 2 Comprehensiveness of the content incorporated into the scale

The overall evaluation of the content effectiveness is shown in Table 3. The overall content validity of these studies was unsatisfactory for reasons including, but not limited to, insufficient relevance or comprehensiveness of unreported content. Based on the COSMIN risk of bias assessment checklist, the methodological quality of most of these studies was found to be reported as grade D (doubtful) for content validity, with HKDI, Severity Index of Bologna and GOS rated as grade I (insufficient). The comprehensiveness of the content was compromised because most scales did not fully assess the core and additional symptoms of functional dyspepsia, and the comprehensiveness of the final content comprehensiveness was mostly ± (inconsistent). Only 7 PROMs (NDI, DSSI, LPDS, FDSD, ESM-PROM, FGI checklist, and SEQ-DYSPEPSIA) comprehensively assessed all symptoms of FD with a content validity in comprehensiveness of + (adequate). However, the correlation and comprehensibility of the DSSI, FGI-Checklist and SEQ-DYSPEPSIA were insufficient or have not been reported, so the final content validity score was ± (inconsistent). Most of the included studies had a low level of evidence for content validity.

Table 3 Summary of findings. (M: Methodology reported as ‘very good(V)’, ‘adequate(A),’ ‘doubtful(D)’, ‘inadequate(I)’, or ‘not assessed (NA)’. R: Ratings for overall quality reported as sufficient ( +), insufficient (-), inconsistent ( ±), or indeterminate (?). L: Level of evidence using GRADE reported as: High, Moderate, Low, or Very low.)

Assessment of the remaining measurement attributes (structural validity, internal consistency, cultural validity/measurement invariance, stability, measurement error, hypothesis testing validity, and responsiveness)

The remaining psychometric attributes assessed are shown in Table 3. In total, the methodological quality of 192 measurement attributes was rated, 49 measurement properties (25.5%) were very good, 21 (10.9%) were adequate, 35 (18.2%) were doubtful and insufficient methodological quality, and the rest (45.3%) were not assessed.

Summary of findings

A summary of the survey results is presented in Table 3. NDI, LPDS, FDSD, and ESM-PROM had adequate content validity, but internal consistency is not reported in the NDI's symptom checklist, and FDSD and ESM-PROM did not report structural validity, so only LPDS could receive an A recommendation (adequate content validity and moderate quality evidence of adequate internal consistency). The DHSI and FGI-Checklist are given a Category C recommendation due to high-quality evidence of inadequate psychometric properties. The rest of the scales are recommended for category B. Based on the available evidence, the best PROMs for assessing functional dyspepsia symptoms are the LPDS, a specific scale for the assessment of functional dyspepsia symptoms. It was developed after conducting patient focus group interviews before development.

Discussion

A total of 24 PROMs from 30 documents were included in this systematic review to assess their psychometric properties according to COSMIN guidelines. LPDS received an A recommendation and can be considered the best tool available for patient self-assessment of functional dyspepsia symptoms. Additional measurement characteristics not currently evaluated should be further investigated in the future to refine the assessment content of the tools.

Summary of existing literature

FD has been considered a symptom-based disease, and patient self-assessment of its symptoms has important implications for the selection of clinical treatment options. However, there is no consensus on the outcome measures of symptoms. The purpose of this study is not to seek a scale to diagnose or identify FD, but to summarize a self-reporting tool for FD patients' symptoms and to assess symptom changes throughout treatment from a patient's perspective to understand treatment effects. We systematically assessed the psychometric properties of the 24 included PROMs according to the COSMIN guidelines, enriching the current study to some extent. The long recall periods for most tools do not meet the relevant FDA recommendations and are prone to recall bias, which is detrimental to the accuracy of the results. Of the study instruments included in this study, only the modified versions of the SODA, LPDS, FDSD, and PDS symptom scales had a one-day recall period, and the ESM-PROM is a real-time assessment that provides a more realistic response to patients' current symptoms. It is worth noting that the time required to complete most scales is still unclear. In addition, all except the ESM-PROM are assessed on paper, which may be influenced by patient compliance and the prevailing environment.

The NDI is a scale developed by an international group of gastroenterologists and methodologists, which includes components for both symptom assessment and quality of life assessment. Since its development it has been used many times as an outcome indicator in randomized controlled clinical trials and has been widely recognized. The current study concludes that a change in NDI score of at least 10 points may reflect a change in clinical patient symptoms [50]. Some studies have evaluated the psychological measurement characteristics of NDI quality of life subscale, but there is no single evaluation of symptom scale at present [51]. The symptom assessment scale, which provides information on the frequency and severity of patients' symptoms, is an important aspect of assessing the efficacy of treatment regimens and should be further tested for validity and credibility. In addition, the NDI short form SF-NDI was not included in this study because it only evaluates quality of life [52]. Designed and developed for patients with PDS, the LPDS is comprehensive and has good reliability and has been approved by the European Medicines Agency for the assessment of clinical symptoms in patients with PDS. In addition, the entries included in the LPDS also include EPS symptoms such as epigastric pain and epigastric burning, but the validity and reliability of the scale have not been validated in patients with EPS. Further studies should be conducted to understand the variability of the assessment in the two subgroups of the FD population. Both the FDSD and the ESM-PROM were developed under FDA guidance recommendations with very good content validity, and multiple interviews were conducted with patients at the beginning of development to understand patients' perception of symptoms and comprehension of the scale items. They all take less time to fill in the information and use an 11-point numeric rating scale for outcome measures, with the difference that the FDSD has a 1-day recall period but the ESM-PROM is a real-time assessment which is more useful for documenting fluctuations in FD symptoms. Unfortunately, the structural validity and responsiveness of these two instruments have not been reported in studies. Future studies may further investigate in this area.

Study implications

As a chronic disease, FD is persistent and has a high recurrence rate, which seriously affects the quality of life of patients and imposes an economic burden on patients and their families, as well as putting pressure on the health care system. At present, many scholars have devoted themselves to the research of effective ways to treat FD, but there are great differences in the tools chosen to test the treatment plan, and no consensus has been reached. The selection of sensitive and efficient FD-specific scales is of great importance in the field of clinical gastroenterology, as inappropriate scales are not conducive to detecting the ultimate efficacy of treatment protocols and can even be misleading [53]. To date no studies have used the COSMIN guidelines to systematically evaluate the FD symptom assessment scale. The present study underwent a rigorous screening of the literature to finally summarize 24 specific and non-specific scales that can be used to assess FD symptoms. NDI, PADYQ, LPDS, FDSD, ESM-PROM, a Scale for PDS and SEQ-DYSPEPSIA are all specific scales to assess FD symptoms, and the rest are general gastrointestinal symptom scales. We were guided by the COSMIN guidelines to assess their psychometric properties. According to the available evidence, LPDS is the best tools for self-symptom assessment of FD patients, covering all the core symptoms and additional symptoms of FD and being managed by patients themselves, which is more conducive to reflecting patients' cognition of their own symptoms. Currently unreported measurement attributes can be further explored in future studies to refine the scale assessment content. This study summarizes and evaluates the existing FD efficacy assessment tools from the patient's perspective in order to provide a reference for future clinical studies. Health care providers can choose the most appropriate tool for their study needs.

Suggestions for future development

Real-time symptom assessment

Long instrument recall periods may produce recall bias, and it is recommended to use real-time assessments to understand the dynamics of FD patients' symptoms. The empirical sampling method (ESM), a widely used method in the field of psychiatry, is also important for the real-time symptom assessment of FD patients. An ESM-based method for FD symptom assessment has been reported with significant results. Researchers can install the corresponding program on patients' cell phones, push the content and set reminders at regular intervals every day, and patients only need to complete the assessment content within the reminder time every day [54, 55]. This real-time dynamic assessment helps the investigator to understand the changes in symptoms at each time period and provide personalized treatment. It is less influenced by the environment and location, which results in better patient compliance compared to paper mass scales. In addition, ESM provides insight into the individual characteristics and behavioral patterns of each patient and reduces the cost of investigation. In the future, the relevant analysis of big data can be used to understand the onset of each treatment option and provide personalized treatment in the context of patient differences to help make the right clinical decisions [56]. It is important to note that the number of scales filled out daily should not be too many and the time required to fill them out should not be too long, as this may lead to a decrease in patient compliance and may affect the accuracy of the results. Moreover, investigators can use incentive policies to increase patient response rates and facilitate the smooth conduct of the study. We believe that real-time symptom assessment based on ESM may be an important future development direction, which needs to be thoroughly studied and further explored.

Use of pictograms

The validity of pictograms has been recognized in the expert consensus on FD published in Japan and Europe respectively [2, 12]. Some studies have shown that the use of pictograms enhances patients' understanding and memory of verbal representations of the scale, and improves the variability of patient and physician ratings of symptoms [57]. Four of the PROMs included in this study added pictograms with good results: the LPDS and PDS symptom assessment scales both used smiley faces as outcome measures, the FDSD added pictures of the stomach to help patients understand the range of symptoms, and the SEQ-DYSPEPSIA added cartoonish images of symptoms to facilitate patients' understanding of the corresponding symptoms. In addition, the ESM-PROM also suggested that the use of pictograms should be increased in the future. Clinical patients may have some differences in understanding the wording of the scale due to their education and environment. Adding pictures close to life in the scale can help patients better understand the contents of the items and increase the accuracy of the survey results. It should be noted that studies using pictograms are still limited, and even fewer PROMs use them as outcome measures. The validity of pictogram use should be further evaluated in future studies in large sample clinical randomized controlled trials to promote the use of pictograms in FD symptom assessment scales.

Strengths and limitations of this systematic review

As a non-organic disease, patient self-reporting of symptoms in FD is an important aspect of outcome assessment. However, no studies have systematically evaluated the measurement properties of PROMs in this area based on COSMIN guidelines. The purpose of this study was to systematically evaluate the measurement characteristics of these instruments to recommend the most appropriate tool for assessing symptoms in patients with FD. In this systematic review, we conducted a comprehensive search in four large databases (PubMed, Cochrane, Embase and Web of science), and conducted a search in the list of references included in the study to avoid missing the literature. Then, we use the applied predefined inclusion and exclusion criteria to screen the literature, and used the COSMIN risk of bias checklist, COSMIN quality criteria, COSMIN Modified GRADE ratings to assess the methodological quality, measurement characteristics, and level of evidence of the included studies. Two independent reviewers were evaluated and cross-checked separately, and discrepancies were discussed and agreed upon with a third reviewer. A potential limitation of the systematic review is that although we made detailed screening efforts not to miss any significant studies, omissions may occur. Secondly, the recommendations made by our review do not necessarily imply that PROMs with low recommendation ratings are of unacceptable quality, but only that the symptom content they assessed may be incomplete and require further studies for assessing their measurement characteristics. Finally, we only included PROMs for functional dyspeptic symptoms and did not evaluate other aspects such as quality of life. The clinical can choose the most suitable scale according to the actual needs.

Conclusion

This systematic review suggests that LPDS is currently the most recommended tool for patient self-assessment of functional dyspepsia symptoms. However, it fails to assess two important areas of cross-cultural validity/ measurement invariance and measurement error. Future studies can build on this foundation and further explore other measurement tools with adequate content validity to find the most appropriate patient self-report tool.

Availability of data and materials

All data generated or analysed during this study are included in this published article [and its supplementary information files].

References

  1. Drossman DA, Tack J, Ford AC, et al. Neuromodulators for Functional Gastrointestinal Disorders (Disorders of Gut-Brain Interaction): A Rome Foundation Working Team Report. Gastroenterology. 2018;154(4):1140-1171.e1.

    Article  PubMed  Google Scholar 

  2. Wauters L, Dickman R, Drug V, et al. United European Gastroenterology (UEG) and European Society for Neurogastroenterology and Motility (ESNM) consensus on functional dyspepsia. United European Gastroenterology J. 2021;9(3):307–31.

    Article  Google Scholar 

  3. Taylor F, Reasner DS, Carson RT, et al. Development of a symptom-based patient-reported outcome instrument for functional dyspepsia: a preliminary conceptual model and an evaluation of the adequacy of existing instruments. Patient. 2016;9(5):409–18.

    Article  PubMed  PubMed Central  Google Scholar 

  4. Ford AC, Mahadeva S, Carbone MF, et al. Functional dyspepsia. Lancet. 2020;396(10263):1689–702.

    Article  CAS  PubMed  Google Scholar 

  5. Mahadeva S, Ford AC. Clinical and epidemiological differences in functional dyspepsia between the East and the West. Neurogastroenterol Motil. 2016;28(2):167–74.

    Article  CAS  PubMed  Google Scholar 

  6. Aziz I, Palsson OS, Törnblom H, et al. Epidemiology, clinical characteristics, and associations for symptom-based Rome IV functional dyspepsia in adults in the USA, Canada, and the UK: a cross-sectional population-based study. Lancet Gastroenterol Hepatol. 2018;3(4):252–62.

    Article  PubMed  Google Scholar 

  7. Gubatan J, Zikos T, Spear Bishop E, et al. Gastrointestinal symptoms and healthcare utilization have increased among patients with functional gastrointestinal and motility disorders during the COVID-19 pandemic[J]. Neurogastroenterol Motil. 2022;34(4):e14243.

  8. Sayuk GS, Gyawali CP. Functional dyspepsia: diagnostic and therapeutic approaches. Drugs. 2020;80(13):1319–36.

    Article  PubMed  Google Scholar 

  9. Ford AC, Moayyedi P, Black CJ, et al. Systematic review and network meta-analysis: efficacy of drugs for functional dyspepsia. Aliment Pharmacol Ther. 2021;53(1):8–21.

    Article  PubMed  Google Scholar 

  10. Madisch A, Andresen V, Enck P, et al. The diagnosis and treatment of functional dyspepsia. Dtsch Arztebl Int. 2018;115(13):222–32.

    PubMed  PubMed Central  Google Scholar 

  11. Masuy I, Van Oudenhove L, Tack J. Review article: treatment options for functional dyspepsia. Aliment Pharmacol Ther. 2019;49(9):1134–72.

    Article  PubMed  Google Scholar 

  12. Miwa H, Nagahara A, Asakawa A, et al. Evidence-based clinical practice guidelines for functional dyspepsia 2021. J Gastroenterol. 2022;57(2):47–61.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Wauters L, Talley NJ, Walker MM, et al. Novel concepts in the pathophysiology and treatment of functional dyspepsia. Gut. 2020;69(3):591–600.

    Article  CAS  PubMed  Google Scholar 

  14. Guidance for industry. patient-reported outcome measures: use in medical product development to support labeling claims: draft guidance. Health Qual Life Outcomes. 2006;4:79.

    Article  Google Scholar 

  15. Smeets FGM, Masclee AAM, Conchillo JM, et al. Systematic review: disease‐specific instruments to assess gastrointestinal symptoms in functional dyspepsia[J]. Neurogastroenterol Motil. 2018;30(8):e13327.

  16. Prinsen CAC, Mokkink LB, Bouter LM, et al. COSMIN guideline for systematic reviews of patient-reported outcome measures[J]. Qual Life Res. 2018;27(5):1147–57.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Alhazzani W, Guyatt G. An overview of the GRADE approach and a peek at the future. Med J Aust. 2018;209(7):291–2.

    Article  PubMed  Google Scholar 

  18. Mokkink LB, De Vet HCW, Prinsen CAC, et al. COSMIN Risk of Bias checklist for systematic reviews of Patient-Reported Outcome Measures. Qual Life Res. 2018;27(5):1171–9.

    Article  CAS  PubMed  Google Scholar 

  19. Terwee CB, Bot SD, De Boer MR, et al. Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007;60(1):34–42.

    Article  PubMed  Google Scholar 

  20. Svedlund J, Sjödin I, Dotevall G. GSRS–a clinical rating scale for gastrointestinal symptoms in patients with irritable bowel syndrome and peptic ulcer disease. Dig Dis Sci. 1988;33(2):129–34.

    Article  CAS  PubMed  Google Scholar 

  21. Kulich KR, Madisch A, Pacini F, et al. Reliability and validity of the Gastrointestinal Symptom Rating Scale (GSRS) and Quality of Life in Reflux and Dyspepsia (QOLRAD) questionnaire in dyspepsia: A six-country study. Health Qual Life Outcomes. 2008;6:12.

    Article  PubMed  PubMed Central  Google Scholar 

  22. El-Omar EM, Banerjee S, Wirz A, et al. The glasgow dyspepsia severity score - a tool for the global measurement of dyspepsia. Eur J Gastroenterol Hepatol. 1996;8(10):967–71.

    Article  CAS  PubMed  Google Scholar 

  23. Garratt AM, Ruta DA, Russell I, et al. Developing a condition-specific measure of health for patients with dyspepsia and ulcer-related symptoms. J Clin Epidemiol. 1996;49(5):565–71.

    Article  CAS  PubMed  Google Scholar 

  24. Moayyedi P, Duffett S, Braunholtz D, et al. The Leeds Dyspepsia Questionnaire: a valid tool for measuring the presence and severity of dyspepsia. Am J Gastroenterol. 1998;12(12):1257–62.

    CAS  Google Scholar 

  25. Shaw M, Talley NJ, Adlis S, et al. Development of a digestive health status instrument: tests of scaling assumptions, structure and reliability in a primary care population. Aliment Pharmacol Ther. 1998;12(11):1067–78.

    Article  CAS  PubMed  Google Scholar 

  26. Shaw MJ, Beebe TJ, Adlis SA, et al. Reliability and validity of the digestive health status instrument in samples of community, primary care, and gastroenterology patients. Aliment Pharmacol Ther. 2001;15(7):981–7.

    Article  CAS  PubMed  Google Scholar 

  27. Talley NJ, Haque M, Wyeth JW, et al. Development of a new dyspepsia impact scale: the Nepean Dyspepsia Index. Aliment Pharmacol Ther. 1999;13(2):225–35.

    Article  CAS  PubMed  Google Scholar 

  28. Leidy NK, Farup C, Rentz AM, et al. Patient-based assessment in dyspepsia - Development and validation of Dyspepsia Symptom Severity Index (DSSI). Dig Dis Sci. 2000;45(6):1172–9.

    Article  CAS  PubMed  Google Scholar 

  29. Rabeneck L, Cook KF, Wristers K, et al. SODA (severity of dyspepsia assessment): a new effective outcome measure for dyspepsia-related health. J Clin Epidemiol. 2001;54(8):755–65.

    Article  CAS  PubMed  Google Scholar 

  30. Goldman J, Conrad DF, Ley C, et al. Validation of Spanish Language Dyspepsia Questionnaire. Dig Dis Sci. 2002;47(3):624–40.

    Article  PubMed  Google Scholar 

  31. Hu WHC, Lam KF, Wong YH, et al. The Hong Kong index of dyspepsia: A validated symptom severity questionnaire for patients with dyspepsia. J Gastroenterol Hepatol (Australia). 2002;17(5):545–51.

    Article  Google Scholar 

  32. Sander GB, Mazzoleni LE, Francesconi CFM, et al. Development and validation of a cross-cultural questionnaire to evaluate nonulcer dyspepsia: The Porto Alegre Dyspeptic Symptoms Questionnaire (PADYQ). Dig Dis Sci. 2004;49(11–12):1822–9.

    Article  PubMed  Google Scholar 

  33. Rentz AM, Kahrilas P, Stanghellini V, et al. Development and psychometric evaluation of the patient assessment of upper gastrointestinal symptom severity index (PAGI-SYM) in patients with upper gastrointestinal disorders. Qual Life Res. 2004;13(10):1737–49.

    Article  CAS  PubMed  Google Scholar 

  34. Revicki DA, Rentz AM, Tack J, et al. Responsiveness and interpretation of a symptom severity index specific to upper gastrointestinal disorders. Clin Gastroenterol Hepatol. 2004;2(9):769–77.

    Article  PubMed  Google Scholar 

  35. De Luca L, Zagari RM, Pozzato P, et al. Measuring dyspepsia: A new severity index validated in Bologna. Dig Liver Dis. 2004;36(12):806–10.

    Article  PubMed  Google Scholar 

  36. Adam B, Liebregts T, Saadat-Gilani K, et al. Validation of the gastrointestinal symptom score for the assessment of symptoms in patients with functional dyspepsia. Aliment Pharmacol Ther. 2005;22(4):357–63.

    Article  CAS  PubMed  Google Scholar 

  37. Veldhuyzen Van Zanten SJ, Chiba N, Armstrong D, et al. Validation of a 7-point Global Overall Symptom scale to measure the severity of dyspepsia symptoms in clinical trials. Aliment Pharmacol Ther. 2006;23(4):521–9.

    Article  CAS  PubMed  Google Scholar 

  38. Fraser A, Delaney BC, Ford AC, et al. The Short-Form Leeds Dyspepsia Questionnaire validation study. Aliment Pharmacol Ther. 2007;25(4):477–86.

    Article  CAS  PubMed  Google Scholar 

  39. Crowell MD, Umar SB, Lacy BE, et al. Multi-Dimensional Gastrointestinal Symptom Severity Index: Validation of a Brief GI Symptom Assessment Tool. Dig Dis Sci. 2015;60(8):2270–9.

    Article  PubMed  Google Scholar 

  40. Carbone F, Vandenberghe A, Holvoet L, et al. Validation of the Leuven Postprandial Distress Scale, a questionnaire for symptom assessment in the functional dyspepsia/postprandial distress syndrome. Aliment Pharmacol Ther. 2016;44(9):989–1001.

    Article  CAS  PubMed  Google Scholar 

  41. Carbone F, Holvoet L, Vandenberghe A, et al. Functional dyspepsia: outcome of focus groups for the development of a questionnaire for symptom assessment in patients suffering from postprandial distress syndrome (PDS). Neurogastroenterol Motil. 2014;26(9):1266–74.

    Article  CAS  PubMed  Google Scholar 

  42. Taylor F, Higgins S, Carson RT, et al. Development of a Symptom-Focused Patient-Reported Outcome Measure for Functional Dyspepsia: The Functional Dyspepsia Symptom Diary (FDSD). Am J Gastroenterol. 2018;113(1):39–48.

    Article  PubMed  Google Scholar 

  43. Shi X, Luo H, Wang X, et al. Functional dyspepsia symptom diary is correlated with other questionnaires and associated with severity in patients with functional dyspepsia: a multicenter, prospective observational study. J Gastroenterol Hepatol. 2022;37:1298–306.

    Article  PubMed  Google Scholar 

  44. Koloski NA, Jones M, Hammer J, et al. The Validity of a New Structured Assessment of Gastrointestinal Symptoms Scale (SAGIS) for Evaluating Symptoms in the Clinical Setting. Dig Dis Sci. 2017;62(8):1913–22.

    Article  CAS  PubMed  Google Scholar 

  45. Smeets FGM, Keszthelyi D, Vork L, et al. Development of a real-time patient-reported outcome measure for symptom assessment in patients with functional dyspepsia using the experience sampling method[J]. Neurogastroenterol Motil. 2019;31(2):e13496.

    Article  PubMed  Google Scholar 

  46. Klaassen T, Smeets FGM, Vork L, et al. Psychometric evaluation of an experience sampling method-based patient-reported outcome measure in functional dyspepsia[J]. Neurogastroenterol Motil. 2021;33(9):e14136.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Chan Y, Cheong PK, Fang FF, et al. A symptom severity questionnaire for patients suffering from functional gastrointestinal disorder: FGI-Checklist. J Gastroenterol Hepatol. 2020;35(7):1130–5.

    Article  PubMed  Google Scholar 

  48. Xiao M, Ying J, Zhao Y, et al. Developing a Scale for the Evaluation of People With Post-prandial Distress Syndrome. Front Public Health. 2021;9:695809.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Jung K, Jung H-K, Kwon JG, et al. Development and Validity Assessment of a Self-evaluation Questionnaire for Functional Dyspepsia: A Multicenter Prospective Study in Korea. J Neurogastroenterol Motil. 2022;28(1):111–20.

    Article  PubMed  PubMed Central  Google Scholar 

  50. Jones M, Talley NJ. Minimum clinically important difference for the Nepean Dyspepsia Index, a validated quality of life scale for functional dyspepsia. Am J Gastroenterol. 2009;104(6):1483–8.

    Article  PubMed  Google Scholar 

  51. Jones MP, Sato YA, Talley NJ. The Nepean Dyspepsia Index is a valid instrument for measuring quality-of-life in functional dyspepsia. Eur J Gastroenterol Hepatol. 2019;31(3):329–33.

    Article  PubMed  Google Scholar 

  52. Mahadeva S, Wee H-L, Goh K-L, et al. Quality of life in South East Asian patients who consult for dyspepsia: Validation of the short form Nepean Dyspepsia Index[J]. Health and Quality of Life Outcomes. 2009;7:45.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Ioannidis JP, Greenland S, Hlatky MA, et al. Increasing value and reducing waste in research design, conduct, and analysis. Lancet. 2014;383(9912):166–75.

    Article  PubMed  PubMed Central  Google Scholar 

  54. Bastiaansen JA, Meurs M, Stelwagen R, et al. Self-monitoring and personalized feedback based on the experiencing sampling method as a tool to boost depression treatment: a protocol of a pragmatic randomized controlled trial (ZELF-i). BMC Psychiatry. 2018;18(1):276.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Riese H, Von Klipstein L, Schoevers RA, et al. Personalized ESM monitoring and feedback to support psychological treatment for depression: a pragmatic randomized controlled trial (Therap-i). BMC Psychiatry. 2021;21(1):143.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Van Os J, Verhagen S, Marsman A, et al. The experience sampling method as an mHealth tool to support self-monitoring, self-insight, and personalized health care in clinical practice. Depress Anxiety. 2017;34(6):481–93.

    Article  PubMed  Google Scholar 

  57. Tack J, Carbone F, Holvoet L, et al. The use of pictograms improves symptom evaluation by patients with functional dyspepsia. Aliment Pharmacol Ther. 2014;40(5):523–30.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

The funders had no additional role in the design of the study, data collection and analysis, decision to publish or preparation of the manuscript.

Funding

This study was supported by the National Natural Science Foundation of China.

Author information

Authors and Affiliations

Authors

Contributions

Xinai Wang and Yan Fei: design this research and write the first draft. Hao Liu and Wenjing Li: reviewing the literature and prepared Table 1– 3. Hongling Xiao: rechecking the manuscript and putting forward suggestions for amendment. Yaxuan Wu and Chenqi Wang: revise the language and characters. All authors contributed to the article and approved the submitted version.

Authors’ information

Xinai Wang, Wenjing Li, Hao Liu, Yaxuan Wu and Chenqi Wang are all from the Graduate School of Tianjin University of Traditional Chinese Medicine, China. Yan Fei is from Shanghai Tenth People’s Hospital Chongming Branch, Shanghai, China. Hongling Xiao is a professor at the School of Nursing, Tianjin University of Traditional Chinese Medicine, China.

Corresponding author

Correspondence to Hongling Xiao.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, X., Fei, Y., Li, W. et al. Patient-reported outcome measures in functional dyspepsia: a systematic review and COSMIN analysis. BMC Gastroenterol 23, 316 (2023). https://0-doi-org.brum.beds.ac.uk/10.1186/s12876-023-02935-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://0-doi-org.brum.beds.ac.uk/10.1186/s12876-023-02935-9

Keywords