Carta Acesso aberto Revisado por pares

Measuring Quality of Life After Stroke Using the SF-36

2002; Lippincott Williams & Wilkins; Volume: 33; Issue: 5 Linguagem: Inglês

10.1161/01.str.0000015223.98452.97

ISSN

1524-4628

Autores

Rob J. de Haan,

Tópico(s)

Health Systems, Economic Evaluations, Quality of Life

Resumo

HomeStrokeVol. 33, No. 5Measuring Quality of Life After Stroke Using the SF-36 Free AccessEditorialPDF/EPUBAboutView PDFView EPUBSections ToolsAdd to favoritesDownload citationsTrack citationsPermissions ShareShare onFacebookTwitterLinked InMendeleyReddit Jump toFree AccessEditorialPDF/EPUBMeasuring Quality of Life After Stroke Using the SF-36 Rob J. de Haan, PhD Rob J. de HaanRob J. de Haan From the Department of Clinical Epidemiology and Biostatistics, Academic Medical Center, University of Amsterdam, the Netherlands. Originally published1 May 2002https://doi.org/10.1161/01.STR.0000015223.98452.97Stroke. 2002;33:1176–1177The SF-36 is the most widely used generic instrument for measuring quality of life (QOL). The instrument is translated into numerous languages, and the validity of the 8 subscales is confirmed in general populations and in a wide variety of patient groups in more than 2000 articles. In an article published in this issue of Stroke, Hobart et al1 report the psychometric properties of the SF-36 in a sample of ischemic stroke patients. The authors conclude that (1) some subscales, especially the scales for General Health (GH) and Social Functioning (SF), have limited reliability and validity; (2) half of the subscales suffer from floor and/or ceiling effects; and (3) the 2 summary scores inadequately reflect the patient's physical and mental health. In view of the overwhelming weight of evidence that the subscales of the SF-36 are psychometrically sound to measure QOL in a range of patient populations, the question arises how convincing the arguments of Hobart and his colleagues are.The authors argue that the GH and SF scales generate low reliability scores and have limited convergent and discriminant validity. However, these conclusions can be challenged. The reliability of only 1 scale (GH) was marginally less (Cronbach's alpha=0.68) than the authors' predefined criteria. Although it is often recommended that coefficient values should be above 0.80, values above 0.70 are generally regarded as acceptable for scales when assessing outcome on a group level. Moreover, it should be noticed that the alpha coefficient not only depends on the correlations of the items but is also related to the number of items in the scale. For example, the relatively low coefficient (0.70) of the SF scale may also be explained by the fact that this scale encompasses only 2 items. Since Cronbach's coefficients increase as the number of items is increased (or vice versa), one may wonder whether it is sensible to specify criteria for acceptable levels of the alpha coefficient without specifying the number of items in the scale.2 The authors' criticism with regard to the convergent and discriminant validity of both subscales is not convincing. In general, the item-total subscale correlation of the GH and SF is above 0.40, indicating that the items in each scale measure a common underlying trait. Moreover, in both scales, all item-own scale correlations are higher than the item-other scale correlations (although less than 2 SE),See article on page 1349indicating that the subscale items are best placed in the scales in which they already appear.In this study, a number of subscales of the SF-36 exhibit floor and/or ceiling effects. However, as the authors noted, the psychometric properties of an instrument are sample dependent. The narrow range of many health scales partly results from the traditional methods used in the development of scales, such as interitem correlation and factor analysis. Unfortunately, these techniques for item analyses are highly dependent on the average level of the patients in the samples used in the psychometric evaluation of a scale. This means that the resulting scales may exhibit considerable ceiling and floor effects in score distribution when they are used with groups of patients with a lower or higher average level (eg, mild to moderate stroke patients as in this study) of functional health.To simplify the statistical analysis and to enhance the interpretation of the SF-36, the developers of the instrument recently made available scoring algorithms for aggregating subscale scores in 2 distinct summary scores: Physical Component Summary and Mental Component Summary. An important finding of Hobart et al is that the study results do not support the computation of summarized scores in stroke. These findings are in line with recent studies,3–7 which also demonstrate shortcomings of the summary scores in accurately reflecting patients' physical and mental health on the basis of subscale scores. Taft et al showed that the discrepancies between subscale profile and component scores of the SF-36 are attributable to the way in which these summary scores are calculated.7 The main problem in the scoring algorithm derives from the use of negatively weighted subscale factor score coefficients, leading to inaccurately summarize profile scores and, sometimes, clinically counterintuitive study results.To summarize, when the results of the relatively small study of Hobart et al are taken in conjunction with the findings of previous research, there is at present insufficient evidence to question the reliability and validity of the SF-36 subscales in stroke. The finding that the SF-36 is suffering from floor and/or ceiling effects can largely be explained by the specific characteristics of the mild to moderate stroke patients studied. However, a point in case is their finding that the assumptions for generating 2 summary scores could not be supported. Until the current scoring method is statistically revised, it is advisable not to use the component scores in stroke research.In the light of the study results presented by Hobart et al, some additional remarks should be made with regard to the measurement of functional outcome using traditional multi-item instruments such as the SF-36. Typically, these scales calculate the total scale score for each patient using a (weighted) sum of the responses to each item. However, some serious problems are associated with this approach. Firstly, all items on a scale have to be presented to patients in order to obtain a summated score. This inefficiency has led researchers to shorten health instruments, resulting in more practical but less precise scales. Secondly, since summated scores are dependent on the number of, and precisely which, items are included in the instrument, it is impossible to compare scores obtained on different instruments, even if they measure the same health concept: 10 points on the Barthel are not the same as 10 points on the physical dimension of the Sickness Impact Profile.Thirdly, the clinical interpretation of summated scores is not as straightforward as it may seem. For example, in the study of Hobart et al, stroke patients had a mean score of 47.6 on the subscale Physical Functioning. The clinical meaning of this SF-36 score would be unclear for most neurologists. This problem is amplified by the ordinal nature of summated scores, meaning that a given difference in scores at one point on the scale does not necessarily represent the same amount of functional change as an identical difference at another point on the scale. Following growing dissatisfaction with the classical approach, an alternative method has been introduced: item response theory (IRT).8 This statistical paradigm uses a logistic regression-type analysis to model the responses of the patients to the individual items. Using this technique, both patients and items can be placed on the same hierarchical continuous scale. There are a number of advantages to the use of IRT techniques in clinical measurement. Firstly, not all items in an instrument have to be presented to all patients to assess their functional health. Thus, more difficult items (eg, vigorous activities) can be presented to less disabled patients and easier ones (eg, bathing or dressing) to more severely impaired patients. This approach leads to a more efficient data collection method known as adaptive testing and results in a considerable reduction of floor or ceiling effects. Secondly, even if different subsets of items are presented to subgroups of patients, the measurements of their functional level remain completely comparable. This is because the difficulty of each item (its position on the linear scale) has been estimated beforehand. Thirdly, the clinical interpretation of functional measurements is straightforward because the patients' level of functional health can be directly compared with the hierarchically ordered items on the linear scale. In spite of recent interest in IRT in clinical outcome measurement, these methods been have not yet been developed in stroke research. Perhaps IRT is not suitable for the development of scales measuring a subjective and multidimensional construct such as QOL. However, with regard to more direct and tangible manifestations of disease such as physical disability, IRT is probably a useful supplement to the traditional approach.The opinions expressed in this editorial are not necessarily those of the editors or of the American Stroke Association.FootnotesCorrespondence to Rob J. de Haan, PhD, Department of Clinical Epidemiology and Biostatistics, University of Amsterdam, Academic Medical Center, Room J2-203, Meibergdreef 9, PO Box 22660, 1105 AZ Amsterdam, the Netherlands. E-mail [email protected] References 1 Hobart JC, Williams LS, Moran K, Thompson AJ. Quality of life measurement after stroke: uses and abuses of the SF-36. Stroke. 2002;1349–1356.Google Scholar2 Fayers PM, Machin D. Quality of Life: Assessment, Analysis and Interpretation. London, England: John Wiley & Sons; 2000:85–87.Google Scholar3 Simon GE, Revicky DA, Grothaus L, Von Korff M. SF-36 summary scores: are physical and mental health truly distinct? Med Care. 1998; 36: 567–572.CrossrefMedlineGoogle Scholar4 Hurst NP, Ruta DA, Kind P. Comparison of the MOS short-form-12 (SF-12) health status questionnaire with the SF-36 in patients with rheumatoid arthritis. Br J Rheumatol. 1998; 37: 862–869.CrossrefMedlineGoogle Scholar5 Nortvedt MW, Riise T, Myhr KM, Nyland HI. Performance of the SF-36, SF-12, and RAND-36 summary scales in a multiple sclerosis population. Med Care. 2000; 38: 1022–1028.CrossrefMedlineGoogle Scholar6 Wilson D, Parsons J, Tucker G. The SF-36 summary scales: problems and solutions. Soz Praventiv Med. 2000; 45: 239–246.CrossrefGoogle Scholar7 Taft C, Karlsson J, Sullivan M. Do SF-36 summary component scores accurately summarize subscale scores? Qual Life Res. 2001; 10: 395–404.CrossrefMedlineGoogle Scholar8 Van der Linden WJ, Hambleton RK. Handbook of Modern Item Response Theory. New York, NY: Springer; 1997.Google Scholar Previous Back to top Next FiguresReferencesRelatedDetailsCited By Corrigan A, Carter B, Smith A, Pennington A and Hewitt J (2022) MORe PREcISE: a multicentre prospective study of patient reported outcome measures in stroke morbidity: a cross sectional study, BMC Neurology, 10.1186/s12883-022-02634-0, 22:1, Online publication date: 1-Dec-2022. Del Brutto V, Rundek T and Sacco R (2022) Prognosis After Stroke Stroke, 10.1016/B978-0-323-69424-7.00017-X, (207-220.e11), . Duret C, Breuckmann P, Louchart M, Kyereme F, Pepin M and Koeppel T (2020) Adapted physical activity in community-dwelling adults with neurological disorders: design and outcomes of a fitness-center based program, Disability and Rehabilitation, 10.1080/09638288.2020.1771439, 44:4, (536-541), Online publication date: 13-Feb-2022. Gagny M, Grenevald L, El-Hage W, Chrusciel J, Sanchez S, Schwan R, Klemina I, Biberon J, de Toffol B, Thiriaux A, Visseaux J, Martin M, Meyer M, Maillard L and Hingray C (2021) Explanatory factors of quality of life in psychogenic non-epileptic seizure, Seizure, 10.1016/j.seizure.2020.10.028, 84, (6-13), Online publication date: 1-Jan-2021. Atalan P, Bērziņa G and Sunnerhagen K (2021) Influence of mobility restrictions on post‐stroke pain, Brain and Behavior, 10.1002/brb3.2092, 11:5, Online publication date: 1-May-2021. Chandrasekhar D, Pradeep A, Geoji A, George A, Athira V and Thomas G (2018) Impact of intensified pharmaceutical care on health related quality of life in patients with stroke in a tertiary care hospital, Clinical Epidemiology and Global Health, 10.1016/j.cegh.2018.05.001, 6:4, (198-202), Online publication date: 1-Dec-2018. Rundek T and Sacco R (2016) Prognosis after Stroke Stroke, 10.1016/B978-0-323-29544-4.00016-5, (234-252.e10), . Quigley M and Dillon M (2016) Quality of life in persons with partial foot or transtibial amputation, Prosthetics & Orthotics International, 10.1177/0309364614546526, 40:1, (18-30), Online publication date: 1-Feb-2016. Davis D (2016) Patient-reported Outcome Measures—Framework, Psychometrics, and Uses Outcomes in Orthopaedic Physical Therapy Practice, 10.17832/isc.2016.26.1.1, (1-39), Online publication date: 1-Jan-2016. Faulkner J, McGonigal G, Woolley B, Stoner L, Wong L and Lambrick D (2014) A randomized controlled trial to assess the psychosocial effects of early exercise engagement in patients diagnosed with transient ischaemic attack and mild, non-disabling stroke, Clinical Rehabilitation, 10.1177/0269215514555729, 29:8, (783-794), Online publication date: 1-Aug-2015. Verheijde J, White F, Tompkins J, Dahl P, Hentz J, Lebec M and Cornwall M (2013) Reliability, Validity, and Sensitivity to Change of the Lower Extremity Functional Scale in Individuals Affected by Stroke, PM&R, 10.1016/j.pmrj.2013.07.001, 5:12, (1019-1025), Online publication date: 1-Dec-2013. Noll M, Candotti C and Vieira A (2013) Instrumentos de avaliação da postura dinâmica: aplicabilidade ao ambiente escolar, Fisioterapia em Movimento, 10.1590/S0103-51502013000100023, 26:1, (203-217), Online publication date: 1-Mar-2013. van der Sluijs Veer L, Kempers M, Maurice-Stam H, Last B, Vulsma T and Grootenhuis M (2012) Health- related quality of life and self-worth in 10-year old children with congenital hypothyroidism diagnosed by neonatal screening, Child and Adolescent Psychiatry and Mental Health, 10.1186/1753-2000-6-32, 6:1, (32), . Rundek T and Sacco R (2011) Prognosis after Stroke Stroke, 10.1016/B978-1-4160-5478-8.10015-6, (219-241), . Adams H (2011) Clinical Scales to Assess Patients with Stroke Stroke, 10.1016/B978-1-4160-5478-8.10021-1, (307-333), . Adams H and Lyden P (2008) Chapter 48 Assessment of a patient with stroke Stroke Part III: Investigation and Management, 10.1016/S0072-9752(08)94048-3, (971-1009), . Mota J and Nicolato R (2008) Qualidade de vida em sobreviventes de acidente vascular cerebral: instrumentos de avaliação e seus resultados, Jornal Brasileiro de Psiquiatria, 10.1590/S0047-20852008000200013, 57:2, (148-156), . Nagayoshi M, Iwata N and Hachisuka K (2009) Factors associated with life satisfaction in Japanese stroke outpatients, Disability and Rehabilitation, 10.1080/09638280701255266, 30:3, (222-230), Online publication date: 1-Jan-2008. Gargano J and Reeves M (2007) Sex Differences in Stroke Recovery and Stroke-Specific Quality of Life, Stroke, 38:9, (2541-2548), Online publication date: 1-Sep-2007.Gray L, Sprigg N, Bath P, Boysen G, De Deyn P, Leys D, O'Neill D and Ringelstein E (2007) Sex Differences in Quality of Life in Stroke Survivors, Stroke, 38:11, (2960-2964), Online publication date: 1-Nov-2007. Bramlett R, Bothe A and Franic D (2006) Using Preference-Based Measures to Assess Quality of Life in Stuttering, Journal of Speech, Language, and Hearing Research, 10.1044/1092-4388(2006/030), 49:2, (381-394), Online publication date: 1-Apr-2006. Krishnan G, Grant B, Muti P, Mishra A, Ochs-Balcom H, Freudenheim J, Trevisan M and Schünemann H (2006) Association between anemia and quality of life in a population sample of individuals with chronic obstructive pulmonary disease, BMC Pulmonary Medicine, 10.1186/1471-2466-6-23, 6:1, Online publication date: 1-Dec-2006. Paolucci S, Gandolfo C, Provinciali L, Torta R and Toso V (2006) The Italian multicenter observational study on post–stroke depression (DESTRO), Journal of Neurology, 10.1007/s00415-006-0058-6, 253:5, (556-562), Online publication date: 1-May-2006. Jönsson A, Lindgren I, Hallström B, Norrving B and Lindgren A (2005) Determinants of Quality of Life in Stroke Survivors and Their Informal Caregivers, Stroke, 36:4, (803-808), Online publication date: 1-Apr-2005. Paolucci S, Gandolfo C, Provinciali L, Torta R, Sommacal S and Toso V (2005) Quantification of the risk of poststroke depression: the Italian multicenter observational study DESTRO, Acta Psychiatrica Scandinavica, 10.1111/j.1600-0447.2005.00590.x, 112:4, (272-278), Online publication date: 1-Oct-2005. Rundek T and Sacco R (2004) Outcome following Stroke Stroke, 10.1016/B0-44-306600-0/50006-7, (35-57), . BELICE T, BÖLÜKBAŞI S and MANDIRACIOĞLU A (2020) Yaşlılarda Fiziksel İnaktivitenin Yaşam Kalitesi Üzerine Etkileri, Celal Bayar Üniversitesi Sağlık Bilimleri Enstitüsü Dergisi, 10.34087/cbusbed.758231 Aramaki A, Sampaio R, Reis A, Cavalcanti A and Dutra F (2019) Virtual reality in the rehabilitation of patients with stroke: an integrative review, Arquivos de Neuro-Psiquiatria, 10.1590/0004-282x20190025, 77:4, (268-278) May 2002Vol 33, Issue 5 Advertisement Article InformationMetrics https://doi.org/10.1161/01.STR.0000015223.98452.97PMID: 11988583 Originally publishedMay 1, 2002 PDF download Advertisement

Referência(s)
Altmetric
PlumX