Background: Recent studies have shown that anticoagulant therapy has heterogeneous treatment effects on patients with sepsis-induced coagulopathy (SIC).
Aims: To identify the latent phenotypes of patients with SIC.
Study design: Retrospective cohort study.
Methods: We obtained data of patients with SIC from the Medical Information Mart for Intensive Care IV database. SIC subphenotypes were identified by latent class analysis (LCA) and K-means clustering. Clinical and laboratory variables were obtained in patients who met the diagnostic criteria for SIC. The baseline characteristics of the patients and the association between the heterogeneity of anticoagulant therapy and clinical outcomes (28-day and in-hospital mortality) were compared between the subphenotypes.
Results: We identified 4,993 patients with SIC. The LCA and K-means clustering analysis robustly identified three subphenotypes of SIC. Class 1 patients (n = 1,808) had the lowest blood cell counts (leukocytes, erythrocytes, and platelets). Class 2 patients (n = 1,157) had severe coagulopathy with a high prothrombin time and international normalized ratio, multiple-organ dysfunction, high lactate, sequential organ failure assessment score, and mortality. Class 3 (n = 2,028) were older, had more comorbidities, a higher fibrinogen concentration, and lower plasma and platelet infusion rates. After variable adjustments, heparin therapy reduced the 28-day mortality (odds ratio [OR] 0.39, 0.30-0.49, p < 0.001) and in-hospital mortality (OR 0.42, 0.33-0.53, p < 0.001) only in class 2.
Conclusion: Three SIC subphenotypes were defined using clinical findings and laboratory variables. The effects of heparin treatment differ between the subphenotypes. This finding will facilitate the identification of target patients with SIC who should receive anticoagulant therapy.
Sepsis is a leading cause of mortality and morbidity and remains a formidable challenge in the intensive care unit (ICU).1,2 Coagulation disorder is a major manifestation of sepsis and is induced by infection and an acute systemic inflammatory response that results in endothelial injury.3,4 Sepsis-induced coagulopathy (SIC) is the coagulation disturbance of sepsis and is defined by the prothrombin time (PT)/international normalized ratio (INR) and platelet count, together with the sequential organ failure assessment (SOFA) score.5 Correcting coagulation abnormalities in patients with SIC is important; however, current evidence on the effects of anticoagulation therapy is conflicting. Moreover, the Surviving Sepsis Campaign does not provide any specific anticoagulation recommendations.1
Sepsis is a highly heterogeneous syndrome with different etiologies and pathophysiologies.6 The effectiveness of anticoagulant therapy in patients with SIC is controversial. Some therapies may benefit certain phenotypes; however, other phenotypes might be adversely affected by the intervention, resulting in a neutral overall effect across all patients. Several studies have reported that anticoagulant therapy may improve outcomes in patients with SIC.7,8 Another study showed that only the high-risk subgroup benefited, whereas the low-to-moderate-risk subgroups did not.9 However, a phase III randomized controlled clinical trial revealed that recombinant human thrombomodulin (rhTM) did not significantly reduce the 28-day mortality rate in patients with sepsis.10 In a multicenter registry study, 3,195 patients with severe sepsis or septic shock were classified into four phenotypes, and rhTM was associated with lower 28-day and in-hospital mortality rates in only one of the phenotypes.11 Thus, identifying the distinct therapeutic phenotypes of SIC is essential for targeted anticoagulant therapy.
This study aimed to identify the subphenotypes of SIC based on clinical and laboratory variables. Thus, we applied unsupervised consensus clustering and examined which subphenotype of SIC would benefit most from anticoagulant therapy using retrospective data from a large public database.
Data Source
The Medical Information Mart for Intensive Care IV (MIMIC-IV) database was used to identify patients with SIC. MIMIC-IV is a longitudinal, single-center, freely accessible public database that includes data of more than 40,000 patients admitted to the ICU and 11,263 patients with sepsis (Sepsis-3 definition) at the Beth Israel Deaconess Medical Center from 2008 to 2019.12 All patients remained anonymous, and the informed consent procedure was approved by the original ethics committees (Massachusetts Institute of Technology, No. 0403000206; Beth Israel Deaconess Medical Center, 2001P001699).
Study Population
Initially, the study enrolled patients with sepsis who were admitted to the ICU. Sepsis was defined as suspected or confirmed infection plus an increase in the SOFA score of ≥ 2.13 Then, we enrolled patients with SIC by calculating the PT, platelet count, and SOFA score after ICU admission for each patient, and a total score of ≥ 4 was used to diagnose SIC (Supplemental Table S1).5 Patients who met the following criteria were excluded: pregnancy, age of < 18 years, ICU stay of < 24 h, and ≥ 20% missing values.
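The scoring rule described above can be sketched in a few lines. This is a minimal illustration, not the authors' code: the component cut-offs (platelets < 100 or < 150 × 10^9/L, INR > 1.4 or > 1.2, SOFA component capped at 2 points) are assumptions based on the commonly cited SIC scoring system, because Supplemental Table S1 is not reproduced here. The function names `sic_score` and `has_sic` are hypothetical.

```python
def sic_score(platelets, inr, sofa_points):
    """Return a SIC score from platelet count (x10^9/L), PT-INR, and SOFA points.

    All cut-offs below are assumed from the commonly cited SIC scoring
    system; they are not taken from the study's Supplemental Table S1.
    """
    # Platelet component: <100 -> 2 points, 100 to <150 -> 1 point.
    if platelets < 100:
        plt_pts = 2
    elif platelets < 150:
        plt_pts = 1
    else:
        plt_pts = 0

    # PT-INR component: >1.4 -> 2 points, >1.2 to 1.4 -> 1 point.
    if inr > 1.4:
        inr_pts = 2
    elif inr > 1.2:
        inr_pts = 1
    else:
        inr_pts = 0

    # SOFA component: contributes at most 2 points.
    sofa_pts = min(max(sofa_points, 0), 2)

    return plt_pts + inr_pts + sofa_pts


def has_sic(platelets, inr, sofa_points, threshold=4):
    """Apply the total-score threshold (>= 4) stated in the text."""
    return sic_score(platelets, inr, sofa_points) >= threshold


# Hypothetical patient: platelets 120, INR 1.3, SOFA points 2.
print(sic_score(120, 1.3, 2), has_sic(120, 1.3, 2))
```

Only the total-score threshold of ≥ 4 comes from the text; any additional condition on the coagulation components would need to be taken from the supplemental table.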
Variables
Clinical and laboratory variables were collected after the diagnosis of SIC. Baseline and demographic variables included age, sex, weight, comorbidities, time of ICU admission, and length of ICU stay. Vital signs including heart rate, body temperature, respiratory rate, blood pressure, and blood oxygen saturation were measured. Laboratory indicators included pH, PO2, PCO2, HCO3, PaCO2, base excess, lactate, hemoglobin, hematocrit (HCT), red blood cells (RBCs), white blood cells (WBCs), RBC distribution width, mean corpuscular hemoglobin (MCH), platelets, lymphocytes, albumin, alanine aminotransferase (ALT), creatinine, blood urea nitrogen (BUN), and electrolytes. Coagulation variables included fibrinogen, PT, INR, and partial thromboplastin time (PTT). Risk scores, including the SOFA score and the simplified acute physiology score (SAP III), were calculated the day after diagnosis of SIC. Anticoagulant therapy included anticoagulants, heparin, plasma infusion, and platelet infusion. Other treatment and prognosis data were also obtained from the database.
Missing values were imputed by first applying the next observation carried backward (NOCB) method, followed by the last observation carried forward (LOCF) method.14 Briefly, we preferentially used the first observation after the timepoint when SIC was diagnosed; if no such observation existed, the value was imputed with the last observation before that timepoint. If no observation was available in the database, the missing value was imputed by multiple imputation using the MICE package of R (Supplemental Table 2S).15
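The NOCB-then-LOCF rule can be illustrated with a short sketch. It assumes each variable is available as a list of (time, value) pairs per patient; the function name `impute_at_diagnosis` and the data layout are hypothetical and are not taken from the study's code. Values that remain missing after this step would be passed on to multiple imputation.

```python
def impute_at_diagnosis(observations, t_dx):
    """Pick the value to use at SIC diagnosis time t_dx.

    observations: list of (time, value) pairs; value may be None.
    Returns the chosen value, or None if nothing was ever recorded.
    """
    # Keep only recorded values, ordered by time.
    valid = sorted(
        ((t, v) for t, v in observations if v is not None),
        key=lambda tv: tv[0],
    )

    # NOCB: prefer the earliest recorded value at or after diagnosis.
    for t, v in valid:
        if t >= t_dx:
            return v

    # LOCF: otherwise fall back to the latest value before diagnosis.
    if valid:
        return valid[-1][1]

    # Still missing: left for multiple imputation.
    return None


# Hypothetical INR series (hours since ICU admission, value); diagnosis at hour 4.
print(impute_at_diagnosis([(0, 1.1), (5, None), (8, 1.6)], 4))  # NOCB case
print(impute_at_diagnosis([(0, 1.1), (2, 1.3), (5, None)], 4))  # LOCF case
```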
SIC Subphenotypes
SIC subphenotypes were explored by latent class analysis (LCA) and K-means clustering. Clinical and laboratory variables representing key pathophysiological domains were evaluated as class-defining variables, including baseline characteristics (age and heart rate), organ dysfunction severity (SOFA and SAP III scores), blood gas analysis (pH, PO2, and lactate), coagulation indicators (fibrinogen, INR, and PTT), hematology (WBCs, RBCs, hemoglobin, MCH, and platelets), and liver and renal functions (ALT, BUN, and creatinine). Correlations between variables were evaluated by Pearson’s correlation analysis, and highly correlated variables (correlation coefficient > 0.7) were excluded, including PT, aspartate aminotransferase (AST) concentration, and total bilirubin (TBIL) concentration (Supplemental Figure 1S).
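One way to apply such a correlation filter is sketched below. The greedy rule (keep a variable only if its absolute Pearson correlation with every previously kept variable is at most 0.7) and the use of the absolute value are assumptions; the text does not state how the retained member of a correlated pair was chosen. The function names and the toy values are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equal-length numeric lists.

    Assumes neither list is constant (non-zero variance).
    """
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def drop_correlated(columns, threshold=0.7):
    """Greedy filter over a dict of name -> values.

    Dict order sets priority: a variable is kept only if |r| with every
    already-kept variable does not exceed the threshold.
    """
    kept = []
    for name, values in columns.items():
        if all(abs(pearson(values, columns[k])) <= threshold for k in kept):
            kept.append(name)
    return kept


# Toy data: PT tracks INR almost linearly, lactate does not.
toy = {
    "INR": [1.0, 1.5, 2.0, 2.5, 3.0],
    "PT": [11, 16, 22, 27, 33],
    "lactate": [2.0, 1.0, 3.0, 1.5, 2.5],
}
print(drop_correlated(toy))
```

In this toy example PT is removed because it is nearly collinear with INR, which mirrors the exclusion of PT reported above.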
LCA is a probabilistic finite-mixture modeling algorithm that allows the determination of unmeasured or unobserved groups within the population.16 During model training, the parameters were estimated by maximum likelihood estimation. For the LCA, the basic approach was to select the model with the fewest classes that best fitted the data. A lower Akaike information criterion (AIC), a lower sample-size-adjusted Bayesian information criterion (SABIC), and a higher entropy were considered to indicate a good fit. In addition, the bootstrapped likelihood ratio test was conducted to assess whether the k-class model fitted better than the (k-1)-class model.17
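For readers unfamiliar with these fit indices, the following sketch shows how they are commonly computed from a fitted model's log-likelihood, number of free parameters, sample size, and posterior class-membership probabilities. The formulas are the standard textbook definitions and are assumed here; the study does not state which software-specific variants were used. Function names are hypothetical.

```python
import math


def aic(log_lik, n_params):
    """Akaike information criterion: -2*logL + 2*p."""
    return -2.0 * log_lik + 2.0 * n_params


def sabic(log_lik, n_params, n_obs):
    """Sample-size-adjusted BIC: -2*logL + p*ln((n + 2) / 24)."""
    return -2.0 * log_lik + n_params * math.log((n_obs + 2) / 24.0)


def relative_entropy(posteriors):
    """Relative entropy in [0, 1]; values near 1 mean crisp classification.

    posteriors: one list of class-membership probabilities per patient.
    """
    n = len(posteriors)
    k = len(posteriors[0])
    total = 0.0
    for row in posteriors:
        for p in row:
            if p > 0:  # treat 0 * ln(0) as 0
                total -= p * math.log(p)
    return 1.0 - total / (n * math.log(k))


# Hypothetical posteriors for three patients in a three-class model.
post = [[0.98, 0.01, 0.01], [0.05, 0.90, 0.05], [0.02, 0.03, 0.95]]
print(round(relative_entropy(post), 3))
```

Lower AIC and SABIC values favor a model, whereas entropy close to 1 indicates that patients are assigned to classes with little ambiguity.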
We also determined the optimal number of clusters using a consensus K-means clustering approach. With K-means clustering, the separation of consensus matrix heatmaps was evaluated using the cumulative distribution function, the elbow method, and cluster consensus plots. Statistical indices such as the Calinski-Harabasz (CH) index, Hartigan index, cubic clustering criterion (CCC), Scott index, Davies and Bouldin (DB) index, and the Rubin and Beale indices were reported using the NbClust package.18 Visual clustering was also performed using t-distributed stochastic neighbor embedding (t-SNE) to reduce the dimensions and visualize the data in the lower-dimensional space.19 The number of clusters was determined using the elbow method and matrix heatmaps.20
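The elbow method mentioned above can be illustrated with a plain K-means sketch. This is not the consensus clustering procedure used in the study: it is a single run of Lloyd's algorithm with a deterministic farthest-first initialization (an assumption made so that the example is reproducible), applied to hypothetical two-dimensional toy points.

```python
def sq_dist(a, b):
    """Squared Euclidean distance between two points."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def init_centroids(points, k):
    """Deterministic farthest-first initialization (assumed, for reproducibility)."""
    centroids = [points[0]]
    while len(centroids) < k:
        nxt = max(points, key=lambda p: min(sq_dist(p, c) for c in centroids))
        centroids.append(nxt)
    return centroids


def kmeans(points, k, n_iter=50):
    """Lloyd's algorithm; returns (labels, within-cluster sum of squares)."""
    centroids = init_centroids(points, k)
    for _ in range(n_iter):
        # Assignment step: each point goes to its nearest centroid.
        clusters = [[] for _ in range(k)]
        for p in points:
            j = min(range(k), key=lambda i: sq_dist(p, centroids[i]))
            clusters[j].append(p)
        # Update step: move each centroid to the mean of its cluster.
        new = [
            tuple(sum(col) / len(c) for col in zip(*c)) if c else centroids[i]
            for i, c in enumerate(clusters)
        ]
        if new == centroids:
            break
        centroids = new
    labels = [min(range(k), key=lambda i: sq_dist(p, centroids[i])) for p in points]
    wcss = sum(sq_dist(p, centroids[l]) for p, l in zip(points, labels))
    return labels, wcss


def elbow_curve(points, max_k):
    """Within-cluster sum of squares for k = 1..max_k."""
    return {k: kmeans(points, k)[1] for k in range(1, max_k + 1)}


# Toy data: three well-separated groups of four points each.
pts = [(x + dx, y + dy) for x, y in [(0, 0), (10, 10), (20, 0)]
       for dx in (0, 1) for dy in (0, 1)]
curve = elbow_curve(pts, 4)
print(curve)
```

On this toy data, the within-cluster sum of squares falls sharply up to k = 3 and changes little afterwards, which is the pattern the elbow method looks for.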
Statistical Analysis
We described and compared the frequency and clinical characteristics of each class using the analysis of variance or Kruskal-Wallis test for numeric variables and the chi-square test or Fisher’s exact test for categorical variables. Thereafter, in each class, the relationship between anticoagulant therapy (anticoagulants, heparin, plasma, and platelet infusion) and clinical outcomes (28-day and in-hospital mortality) was explored using logistic regression analysis. The adjusted variables were age, heart rate, systolic blood pressure, hypertension, diabetes mellitus, SOFA score, fibrinogen, INR, hemoglobin, platelets, WBCs, creatinine, and lactate.
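As a simple illustration of how an odds ratio and its confidence interval are obtained, the sketch below computes an unadjusted OR with a Wald 95% interval from a 2 × 2 table. It does not reproduce the adjusted estimates, which require multivariable logistic regression with the covariates listed above. The counts are hypothetical and the function name `odds_ratio_ci` is an assumption.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    a: treated and died      b: treated and survived
    c: untreated and died    d: untreated and survived
    All four cells must be non-zero.
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of ln(OR) from the four cell counts.
    se = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se)
    upper = math.exp(math.log(odds_ratio) + z * se)
    return odds_ratio, lower, upper


# Hypothetical counts within one class: heparin vs. no heparin, 28-day mortality.
or_, lo, hi = odds_ratio_ci(20, 80, 40, 60)
print(f"OR {or_:.2f}, 95% CI {lo:.2f}-{hi:.2f}")
```

An interval that lies entirely below 1 indicates an association with reduced mortality at the 5% significance level used in this study.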
A p-value of < 0.05 was considered statistically significant. All analyses were performed using Stata version 14.1 (StataCorp., College Station, TX, USA) and R version 3.6.2 (R Foundation, Vienna, Austria).
Patients’ Baseline Characteristics
Of the 11,263 patients with sepsis who were admitted to the ICU, SIC developed in 4,993, who were enrolled in this study. The mean age was 67 years, and 58.7% were male. The rates of hypertension, diabetes mellitus, chronic obstructive pulmonary disease (COPD), and cancer were 32.4%, 34.8%, 26.6%, and 19.4%, respectively. Overall, anticoagulants and heparin were administered to 14.1% and 51.5% of the patients, respectively, and the plasma and platelet infusion rates were 25.2% and 16.5%, respectively. The in-hospital mortality and 28-day mortality rates were 23.3% and 31.0%, respectively (Table 1).
SIC Subphenotypes
Overall, 18 features (age, heart rate, SOFA score, APS III score, fibrinogen, INR, PTT, platelets, hemoglobin, MCH, WBCs, RBCs, BUN, creatinine, ALT, pH, PO2, and lactate) were included in the subphenotype analysis (correlation analysis in Supplemental Figure 1S). Generally, the AIC and SABIC values declined from the two-class to the nine-class model; however, the three-class model had the highest entropy value (0.92) among all LCA models (Table 2). Similar results were observed for the K-means clustering analysis. The elbow method also showed that the change in the slope of the sum of squared errors was the greatest at three classes (Figure 1a). The matrix heatmaps of K-means clustering showed the overall samples divided into three classes (Figure 1b). The three-class solution was also supported by clustering indices such as the CH index, Hartigan index, CCC index, and DB index (Supplemental Figure 3S), as well as by hierarchical clustering (Supplemental Figure 1S). Thus, the three-class model was considered the best model by the LCA and K-means clustering. Then, we used t-SNE to reduce the dimensionality of the features and visualize the outputs. Each dot represents a patient, and the clusters are displayed within the dimensionally reduced and scaled-down feature space of the autoencoder embedding (Figure 2).
SIC Characteristics Among Subphenotypes
Figure 3 shows the characteristics of the three classes, and Table 1 presents the statistical comparisons. Class 1 (n = 1,808) had the lowest proportion of men (54.9%) and the highest rate of cancer (28.4%) and lowest body mass index, WBCs, RBCs, hemoglobin, HCT, platelets, INR, PT, TBIL, and creatinine. Class 2 (n = 1,157) was characterized by severe coagulopathy and multiple-organ dysfunction, had the highest INR, PT, PTT, TBIL, creatinine, lactate, and SOFA and SAP III scores, and had the highest rates of CRRT, vasopressin, and anticoagulant use; however, it still had the highest in-hospital mortality (48.1%) and 28-day mortality (55.9%). Class 3 was the largest (n = 2,028) and was characterized by older age and higher rates of comorbidities (hypertension, diabetes, and COPD), highest RBC and platelet counts, highest fibrinogen concentration, and lowest plasma and platelet infusion rates.
Effect of Anticoagulant Therapy in Subphenotypes with Outcomes
Class 2 had significantly higher 28-day mortality and in-hospital mortality rates than the other classes. Moreover, class 2 received anticoagulants and plasma and platelet infusion more often than the other classes. As shown in Table 3, in the unadjusted analysis, most anticoagulant therapies (anticoagulants, heparin, and plasma and platelet infusions) were risk factors for 28-day mortality and in-hospital mortality (odds ratio [OR] > 1, p < 0.05) in all three classes, except for heparin therapy in classes 1 and 2, which was associated with a reduced risk of 28-day and in-hospital mortality (OR < 1, p < 0.05). However, after variable adjustment, only class 2 benefited from heparin therapy, which reduced 28-day mortality (OR 0.39, 0.30-0.49, p < 0.001) and in-hospital mortality (OR 0.42, 0.33-0.53, p < 0.001).
In this study, we identified three SIC subphenotypes that showed distinct clinical and laboratory characteristics by the LCA, and the results were also confirmed by K-means clustering. The effects of anticoagulants varied by treatment and subphenotype. Only class 2 benefited from heparin therapy, which reduced 28-day mortality and in-hospital mortality. These findings have important implications for understanding the heterogeneity of SIC and can inform future work to promote optimal anticoagulant therapy across subphenotypes.
The present study confirms previous findings that specific therapies confer benefits only in patients with specific sepsis phenotypes.11,21,22 For example, Joseph et al. identified four sepsis phenotypes with different anti-inflammatory responses using 25 bedside variables. They analyzed heterogeneous treatment interactions and mortality risks among these phenotypes and found that one phenotype had a lower mortality rate than other phenotypes when treated with combined immunoglobulin G and methylprednisolone.23 Activated protein C, a toll-like receptor 3 antagonist, and fluid input had different effects on each phenotype.24 In clinical practice, the goal of precision medicine is to choose the optimal therapy for each patient, for which machine learning-based clustering for optimal therapy is an effective method.25,26 Although the present study does not fully address the biological or pathophysiological mechanism-defined endotype of coagulation in sepsis, the findings improve the understanding of SIC subphenotypes.
The classification appears to be stable in the present study because both the LCA and K-means clustering obtained the same optimal number of classes, and the minimum and maximum class membership probabilities were 0.89 and 0.98, respectively. Our results have some similarities and differences with those of a previous study showing that sepsis can be classified into four phenotypes only with coagulation features and that rhTM therapy is associated with better outcomes only in the phenotype characterized by a low platelet count, high fibrin degradation product and D-dimer concentrations, and severe dysfunction.11 This study also showed that only the phenotype with severe coagulopathy and organ dysfunction (Class 2) benefited from heparin therapy; however, we aimed to identify the key and common variables of the underlying latent phenotypes of SIC using clinical and laboratory variables because we believe that SIC classification does not rely solely on coagulation features. This supports the view that machine-learning clustering is effective in identifying the optimal subphenotypes for anticoagulant therapy.
Class 1 was characterized by lower blood cell counts (WBCs, RBCs, platelets, and hemoglobin) and can be clinically characterized by blood loss rather than by coagulation disorder. However, the proportion of class 1 patients undergoing anticoagulant therapy was no lower than that in the other two classes. Furthermore, after adjustment, anticoagulant therapy was associated with increased 28-day and in-hospital mortality in Class 1.
Class 2 was characterized by severe coagulopathy and multiple-organ dysfunction. Moreover, class 2 had the highest INR, PT, PTT, TBIL concentration, creatinine, lactate, and SOFA and SAP III scores and the highest rates of CRRT, vasopressin, and anticoagulant therapy; however, this class still had the highest mortality rate. This subphenotype resembles the cluster dA phenotype in the JSEPTIC-DIC trial11 and the δ phenotype in the SENECA trial,24 is more likely to have a severe coagulopathy status and organ dysfunction, and could benefit from anticoagulant therapy.
Class 3 was the largest class and was characterized by older age and a higher rate of comorbidities, similar to the β-phenotype in the SENECA trial.24 The fibrinogen concentration was the highest in Class 3. Fibrinogen is a positive acute-phase protein that increases in response to systemic inflammation, tissue damage, and various cancers.27 Hyperfibrinogenemia during sepsis is due to increased fibrinogen and has been recognized as the cause of thrombosis and vascular damage.28
Heparin is a mammalian polysaccharide widely used in the treatment of thrombotic disorders in patients with sepsis. Heparin exerts anticoagulant effects by binding to a lysine residue in antithrombin, resulting in an irreversible conformational change at the arginine-reactive site.29 Previous animal experiments and a meta-analysis of clinical trials have demonstrated that heparin decreases 28-day mortality when compared with placebo in sepsis.30,31 Another meta-analysis revealed that the risk ratio for death was 1.30 when heparin was compared with other anticoagulants. However, the overall effect of heparin remains uncertain.32 These studies were performed in patients with sepsis, but not consistently in patients with SIC. Our study showed that heparin therapy was better than other anticoagulant therapies and was associated with reduced mortality only in Class 2. This finding may facilitate the identification of patients with SIC who are optimal candidates for anticoagulant therapy.
This study has several limitations. First, the subphenotypes were derived from the large retrospective MIMIC-IV database; thus, the unsupervised clustering approach should be validated in an independent population. Second, other coagulation indicators (such as D-dimer and thrombin concentrations) and regimens (such as procoagulants and antiplatelet drugs) were excluded from the analysis because of the high rates of missing data in the database. In this study, missing data were also common for some of the variables; thus, NOCB, LOCF, and multiple imputation were used in the primary analysis. Third, although we classified patients with SIC into three subphenotypes, the granularity may not be sufficiently high to support individualized anticoagulant therapy. Sepsis is a highly heterogeneous syndrome, and specific phenotypes require different anticoagulation regimens. However, higher granularity would reduce interpretability and applicability in clinical practice.
We used data-driven unsupervised machine-learning approaches to identify three SIC subphenotypes. In this study, heparin therapy only benefited patients with severe coagulopathy and organ dysfunction. Thus, identifying distinct phenotypes and determining optimal treatments in future trials is warranted.
Ethics Committee Approval: The MIMIC-IV database was approved by the Massachusetts Institute of Technology (Cambridge, MA) and Beth Israel Deaconess Medical Center (Boston, MA).
Informed Consent: The consent of all patients was obtained for the original data collection.
Data Sharing Statement: The data that support the findings of this study are available from the corresponding author upon reasonable request.
Authorship Contributions: Design- Q.W., Y.C., M.G.; Writing- G.F.
Conflict of Interest: No conflict of interest was declared by the authors.
Funding: The work was supported by grants from the China National Key Research and Development Program (no. 2022YFC2504500 and 2020AAA0105005), Natural Science Foundation of Sichuan (2023NSFSC1471), and National Natural Science Foundation of China (81901998).
Acknowledgement: We appreciate the guidance from the ESICM mentorship program.
Supplementary: http://balkanmedicaljournal.org/uploads/pdf/2023-4-6-suplement.pdf