The Simplified Acute Physiology Score (SAPS) 3 was recently proposed to reflect contemporary changes in intensive care practices. SAPS 3 features customized equations for the prediction of mortality in different geographic regions. However, the usefulness of SAPS 3 and its customized equation (Australasia SAPS 3) have never been externally validated in Korea. This study was designed to validate SAPS 3 and Australasia SAPS 3 for mortality prediction in Korea.
A retrospective analysis of the prospective intensive care unit (ICU) registry was conducted in the medical ICU of Samsung Medical Center. Calibration and discrimination were determined by the Hosmer-Lemeshow test and area under the receiver operating characteristic (aROC) curve from 633 patients.
The mortalities (%) predicted by SAPS 3, Australasia SAPS 3, and SAPS II were 42 ± 28, 39 ± 27 and 37 ± 31, respectively. The calibration of SAPS II was poor (
In Korea, mortality rates predicted using general SAPS 3 and Australasia SAPS 3 exhibited good calibration and modest discrimination. However, Australasia SAPS 3 did not improve the mortality prediction. To better predict mortality in Korean ICUs, a new equation may be needed specifically for Korea.
Several prognostic scoring systems have been developed to predict hospital mortality in intensive care unit (ICU) patients, the most famous being the Simplified Acute Physiology Score (SAPS), the Acute Physiologic and Chronic Health Evaluation (APACHE) Score, and the Mortality Prediction Model (MPM). These scores can be used as an aid to clinical management, resource allocation, and quality assessment.
Recently, SAPS 3 was developed through a worldwide prospective study. SAPS 3 features customized prognostic models for different geographic regions. However, to our knowledge, there have been no external validations of SAPS 3 or its Australasia prognostic model in a critical care population of Korea.
All patients admitted to the medical ICU of Samsung Medical Center, a 1,900-bed tertiary referral center in Seoul, South Korea, between March 1st, 2008 and February 28st, 2009, were eligible for this study. We excluded patients if they were less than 15 years old, or stayed in the ICU < 8 h. For patients with two or more admissions to the ICU during the same hospital stay, only the data from the first admission were included. A total of 660 patients were admitted to our medical ICU, and a total of 633 patients were included in the study. The study was approved by the Institutional Review Board and the requirement for informed consent was waived.
We drew data from a prospectively collected database. We used the worst data from the first 24 hours after admission to the ICU to calculate the SAPS II score, while data from the first hour after admission were used to calculate the SAPS 3 score. For the SAPS 3 score, data were recorded using a SAPS 3 admission score sheet downloaded from the SAPS 3 website (
Data are shown as mean ± standard deviation (SD). For statistical analysis, SAS 9.1 was used (SAS Institute Inc., Cary, NC, USA). Validation of the scoring system was performed using standard tests to measure calibration and discrimination. Calibration is defined as the agreement between individual probabilities and actual outcomes. The Hosmer-Lemeshow goodness of fit Ĉ statistic to evaluate calibration of each predictive models.
The baseline characteristics of the patients are reported in
The hospital mortality was 31% (193/633). The SAPS 3 score was 63 ± 18, while the SAPS II score was 44 ± 20 points. The predicted mortality rates were 42 ± 28%, 39 ± 27%, and 37 ± 31% for general SAPS 3, Australasia SAPS 3, and SAPS II, respectively.
The general SAPS 3 and Australasia SAPS 3 model exhibited good calibration (Ĉ = 3.174,
Although general SAPS 3 and Australasia SAPS 3 had a greater aROC, suggesting slightly better discriminative power than the SAPS II model, all predicted models had aROC values less than 0.8. Customization using the Australasia equation did not improve the discriminative ability of the original score and the SAPS II score had the lowest aROC [aROC 0.76, 95% confidence interval (CI) 0.73, 0.80] (
SAPS II is one of the most commonly used prognostic scoring systems in critically ill patients, but a revised scoring system, SAPS 3, was devised for several reasons. First, the SAPS II score was developed from a database collected in the early 1990s, and there have been significant changes in the prevalence of major diseases, diagnostic approaches, and therapeutic modalities since that time.
The SAPS 3 scoring system was developed to enhance prediction power by overcoming these deficiencies. The SAPS 3 has the following unique characteristics. First, SAPS 3 is composed of 20 variables gathered within one hour of ICU admission.
To our knowledge, this is the first study exploring the validation of general SAPS 3 or its customized equation for Australasia in patients of Korea. External validation is essential before routine application of any model in a group of patients different from the one originally used for model development. So far there have been only a handful of studies of the external validation of SAPS 3 and its customized scores, with mixed results. In an external validation study of a general intensive care population in Europe, SAPS 3 and its customized model for Central and Western Europe were more discriminative and had better calibration compared to Acute Physiology and Chronic Health Evaluation II (APACHE II), but were not significantly better than SAPS II.
In our cohort, the general SAPS 3 admission prognostic model and the Australasia SAPS 3 had good calibration. However, the SAPS II model exhibited poor calibration. Performance of SAPS II in our cohort was similar to other reports:
One of the most important findings of our study was that the customized equation for Australasia (Australasia SAPS 3) did not result in better predictive power than the general SAPS 3 equation. The regional equation was developed for more precise estimation from a more homogeneous case mix. However, in our cohort, Australasia SAPS 3 as a local or regional equation did not improve the predictive power of the original prediction model. Several factors may have potentially contributed to that fact. First, although the customized equation for Australasia was derived from patients in Australia, India, and Hong Kong, these patients may differ from our cohort in terms of genetics, disease distribution, or other factors. Australia, which contributed more than one-third of the patients in the original cohort, is a multi-ethnic country with a large population of European descent. Other than the fact that they are geographically close to each other, there is no intuitive reason to combine data from Australia, India, and Hong Kong to formulate a customized equation for patients of Asian descent due to differences in genetics and medical and social systems. Australia is more like European countries than other countries in the region. It would be interesting to see how the equation might have differed if only patients of East Asian descent were included.
Another explanation might be that our cohort was sicker and only involved medical patients, while the original cohort included less sick patients and more diverse patient samples. In the original cohort of SAPS 3, the median SAPS II score was 28, and ICU mortality was 12.7%, which is significantly different from our cohort (SAPS II mean of 44). In addition, 25% of our cohort was made up of cancer patients, while in the original SAPS 3 cohort this fraction was only 10%. Finally, different patterns and quality of ICU care might have resulted in these discrepancies. However, our observed mortality was similar with the predicted mortality from all three scores, which suggests that the results of this study did not result from poor intensive care quality.
Our study has several limitations. It is a retrospective analysis of prospectively collected data, but we made every effort to validate the data as thoroughly as possible. Second, the data are from a single center with a relatively limited number of patients, which limits the generalization of our findings.
In conclusion, the SAPS 3 admission prognostic model had good calibration and modest discriminative power when applied to ICU patients in Korea. However, its customized equation for Australasia did not improve predictive power. Therefore, a new prognostic model customized for Korean patients is deemed necessary.
The authors have no financial conflicts of interest.
Calibration curves for the general SAPS 3, Australasia SAPS 3, and SAPS II. Predicted risk of hospital death, observed hospital mortality rate, and the corresponding number of patients per decile are shown. Columns; number of patients; line with open circles; mean predicted mortality per decile; line with closed circles; mean observed mortality per decile.
Baseline Clinical Profiles of the Patients
SD, standard deviation; ER, emergency room; ICU, intensive care unit; SAPS, simplified acute physiology score.
Patients' characteristics and the main reasons for ICU admission are outlined.
Hosmer-Lemeshow Goodness of fit Ĉ Test and Area under the Receiver-Operating Characteristic Curves
SD, standard deviation; ROC, receiver operating characteristic; aROC, area under the receiver operating characteristic; SE, standard error; 95% CI, 95% confidence interval; SAPS, simplified acute physiology score.
The general SAPS 3 and Australasia SAPS 3 model exhibited good calibration (Ĉ = 3.174,