In statistics and research, reliability refers to the consistency or repeatability of a measurement. A reliable instrument or procedure produces similar results under consistent conditions. It is a key aspect of measurement quality, alongside validity.
In short: Does the same instrument, used on the same item or person, to measure the same trait, produce the same (or very similar) result?
Reliability is important in any field that involves measurement—such as psychology, education, medical testing, or survey research. A lack of reliability introduces random error, which can obscure true patterns in the data and reduce the power of statistical tests.
There are several common types of reliability, depending on what aspect of consistency is being evaluated:
- Test-retest: Does the instrument produce similar results when used at two different times?
- Inter-rater: Do different people (raters or observers) produce similar scores using the same tool?
- Internal consistency: Do the items within a single scale or test measure the same underlying concept?
- Parallel-forms: Do two versions of the same test yield similar results?
Researchers often calculate reliability using statistics such as correlation coefficients, Cronbach’s alpha, or a Gage R&R study, depending on the type they are assessing.
For example, a researcher creates a questionnaire to measure anxiety. If the same person takes the test twice a week apart and receives similar scores, the questionnaire shows test-retest reliability. If different clinicians score the same patient responses and assign similar ratings, the test shows inter-rater reliability. These forms of reliability support the conclusion that the test results are consistent and trustworthy.
« Back to Glossary Index