빅데이터를 분석할 경우, 나이를 고려하여 표준화 작업을 해주어야 하는 경우가 있다.
특히 다른 cohort나 연도 데이터들을 비교할 때, 나이에 따른 weight 를 곱하여 비교하여야 좀 더 정확한 결과를 얻을 수 있다.
이때 사용되는 weight를 정해 놓은 것이 standard population이고, 각 나이 구간의 crude rate에 곱하여 cohort의 motablity 를 구한다.
The standard population data files contain the following data:
- U.S. Standards (1940, 1950, 1960, 1970, 1980, 1990, 2000)
- Canadian Standards (1991, 1996, 2011)
- European (Scandinavian 1960) Standard2
- European (EU-27 plus EFTA 2011-2030) Standard
- World (Segi 1960) Standard2
- World (WHO 2000-2025) Standard2
Example. Age-standardized rate of Canadians with 1991 standard population weight per 100,000.
Age group | Characteristic | 2000 | 2010 |
0 to 39 years | Estimate of population | 17,068,876 | 17,191,850 |
Number of deaths | 1,345 | 1,004 | |
Crude rate | 7.9 | 5.8 | |
40 years and over | Estimate of population | 13,616,854 | 17,150,930 |
Number of deaths | 61,325 | 71,472 | |
Crude rate | 450.4 | 416.7 | |
Total all ages | Estimate of population | 30,685,730 | 34,342,780 |
Number of deaths | 62,672 | 72,476 | |
Crude rate | 204.2 | 211.0 | |
Weight | 61.6% | 38.4% | |
Adjusted rate* | 177.9 | 163.6 |
* Adjusted rate = Sum of (crude rate)*(weight)
Reference
- Standard Populations (Millions) for Age-Adjustment, https://seer.cancer.gov/stdpopulations/
- World (WHO 2000-2025) Standard, https://seer.cancer.gov/stdpopulations/world.who.html
- Age-standardized Rates,
https://www.statcan.gc.ca/en/dai/btd/asr
'Study' 카테고리의 다른 글
Signature matrix (0) | 2021.12.29 |
---|---|
RNA velocity (0) | 2021.11.02 |
scRNA-seq analysis (0) | 2021.08.19 |
ICGC database (0) | 2021.06.24 |
Nanopore (0) | 2021.06.22 |
댓글