Outcome Measures Table 3 Criteria for Rating Outcome Measures
Criterion |
Definition |
Standard |
Reliability |
- the reproducibility and internal consistency of the tool (synonyms include stability, repeatability, etc) |
- Internal consistency ratings are: excellent (≥0.80), adequate (0.70-0.79), or poor (≤0.69) (Andresen 2000). |
Validity |
Does the instrument measure what it purports to measure? Forms of validity include face, content, construct and criterion. Concurrent, convergent or discriminative and predictive validity are all considered to be forms of criterion validity. However, concurrent, convergent and discriminative validity all depend on the existence of a “gold standard†to provide a basis for comparison. If no gold standard exists, they represent a form of construct validity in which the relationship to another measure is hypothesized (Finch et al. 2002) |
Construct/convergent and concurrent correlations: |
Responsiveness |
Sensitivity to changes within patients over time (which might be indicative of therapeutic effects). |
Sensitivity to change: |
Interpretability |
How meaningful are the scores? Are there consistent definitions and classifications for results? Are there norms available for comparison? |
Jutai & Teasell (2003) point out these practical issues should not be separated from consideration of the values that underscore the selection of outcome measures. A brief assessment of practicality will accompany each summary evaluation. |
Acceptability |
How acceptable the scale is in terms of completion by the patient – does it represent a burden? Can the assessment be completed by proxy, if necessary? Are there different formats available? |
|
Feasibility |
Extent of effort, burden, expense & disruption to staff/clinical care arising from the administration of the instrument. Availability of the tool or representative version of the tool. Cost of the tool. |
|
Clinical |
Will the tool prove useful in clinical situations? What SCI sub groups is it suitable to use with? What type of information is generated (descriptive, predictive, and evaluative)? Will it help with discharge planning? Is the tool used as a component of an administrative data base? |
|