• Unstructured • Lack of connection with clinical practice How does healthcare info spread on web? 2 effectively analyze and re-organize the rich source of personal health messages by applying advanced data mining, IR and NLP
drug outcomes (AMIA 2012) 5 Yahoo! Messages organized by Drug Classification News, Spam (eliminated) User comment (target) Comme nt units D Refine, extraction Outcome Integration & Separation Integrated Results Learn & Annotation Request, Search Audience s Gold Standard Judg es Make Insert Interactive system Interface Expert comments E
model • Effectively separate similar and opposite opinions • Find “unknown outcome” (e.g. Clonazepam may help to relief burning mouth syndrome) • A comprehensive evaluation framework with medical students
the effectiveness of different treatments are truly expressed by patients (patient opinion) • Personal messages: easy to achieve, scalable • We can conduct several CER hypothesis such as “Chemo is more effective than Hormonal therapy to treat breast cancer” – prove the existing CER conclusions – predict the unknown CER hypotheses 7
is more effective to younger group (age <50), compared with older group(age >50)” [Lancet] • “8 weeks of mindfulness meditation training was just as good as prolonged antidepressant treatment (SSRI, SNRI) over 18 months” [Archives of General Psychiatry – American Medical Association] • All consistent with our discoveries