Upgrade to Pro — share decks privately, control downloads, hide ads and more …

How likely AKI-ICU patients need Dialysis? A machine learning approach

Miguel
January 18, 2019
100

How likely AKI-ICU patients need Dialysis? A machine learning approach

Webapp prototype to predict how likely is that Intensive Cure Unit (ICU) patients of an Acute Kidney Injury (AKI) may need Dyalisis with machine learning

Miguel

January 18, 2019
Tweet

Transcript

  1. Current AKI - ICU therapies in the US: • Continuous

    Renal Replacement Therapy (CRRT) $37,780 • Intermittent Renal Replacement Therapy (IRRT) $39,448 The problem Close to 50% of the expenditures are related to the provision of maintenance dialysis. AKI has emerged as a major public health concern with high human and financial costs, which were comparable to those for stroke, pancreatitis, and pneumonia $ Days AKI DIALYSIS AKI NO DIALYSIS
  2. The solution Dialysis pred Forecasting dialysis needs among AKI critical

    patients Early diagnosis of dialysis needs Cost reduction of AKI diagnosis and treatment Cost reduction and improvement of hospital resources management
  3. MIT eICU Google Cloud Engine VM Google Cloud Storage Big

    query Github eICU derived (temp) Google Cloud Storage Big query AKI critical patients Big query Python: sklearn, np, pd, plt, stats Dataset exploration Unbalanced data Attributes combination Correlations Dataset for ML Data cleaning (NAs) Feature scaling Handling Text and Categorical Attributes Look at the big picture -> Frame the problem -> Gain insights -> Prepare for ML Algorithms Data Workflow From eICU MIT’s database to Dialysis Pred attributes: Data gathering, mining and exploration. +30 GB 5.5 MB
  4. Models Binary classification problem in unbalanced dataset Training models Balancing

    set: oversampling? Undersampling? Adding/removing variables? Performance measures Is accuracy reliable? ROC curves Alternative metrics Fine tuning parameters Refining model KNN Naive bayes Logistic regression Random forest SVM AUC Kappa-factor Accuracy n, C, ...
  5. Results Binary classification problem in unbalanced dataset And the winner

    model is... And the most determinant variables are... importance ICU hours 0,09338 Urine output 0,069668 Creatinine Max 0,048844 Total output 0,043539 Creatinine Min 0,035437 Hemoglobin Min 0,029387 Age 0,028676 Hematocrite Min 0,028157 BUN Max 0,026927 Apache IV 0,026813
  6. Handle such vast amount of data Unbalanced dataset Medical knowledge

    Challenges we are still working on... Limitations
  7. Future perspectives Publish a paper Apply to Google’s AI for

    social good impact grant Give access to the tool for doctors around the world
  8. Uxue Lazkano Elisa G. de Lope Biochemist, Bioinformatician and Data

    Scientist at the Neurology department of Hospital del Mar Data scientist Biotech & Bioinformatics Data analyst at Capgemini Albert Sanchez Data scientist Industrial Engineer Fibre optic cable product engineer at OPTRAL The team We are machine learning enthusiasts, curious and fascinated by the potential of AI to transform the world we live in uxue-lazcano-dobao elisagdelope albertsl