relevant examples motivating
• These datasets illustrate data challenges they may face
• There are a few medical datasets in R packages, but widely scattered
• Many datasets are poorly documented, hard to understand
• Quite bare-bones
• It is really convenient to have datasets wrapped in one package
• Easier for students and for instructors
• Can re-use datasets across teaching concepts
trial on the HMS Salisbury by James Lind
• 1948 streptomycin for tuberculosis trial
• Five other RCTs
• Sulindac for polyps, indomethacin for post-ERCP pancreatitis
• Six Cohort & Case-Control studies
• COVID testing, esophageal cancer case-control, CMV after BMT
• Two pharmacokinetic studies
• Indomethacin and Theophylline
levels
• Background on each study
• Description of study design, intervention, measurements
• Specification of study outcomes
• Some suggestions for uses of each dataset
• Full help(dataset) files
• Linked codebooks and description documents on the {pkgdown} website & GitHub README
add examples as issues
• Used strep_tb dataset to teach table construction with {gtsummary}
• Attach code (reprex)
• Used scurvy dataset for categorical scatterplots of outcomes
• Attach code (reprex)
• Used the indo_rct dataset to make a covariate forest plot
• Attach code (reprex)
• Used theoph dataset for GAM modeling
• Attach code (reprex)
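A contributed example of the first kind might look like the sketch below. It assumes the {medicaldata} and {gtsummary} packages are installed, and that strep_tb has columns named arm, gender, baseline_condition, and improved; these column names are assumptions to check against help(strep_tb) before use.

```r
# Sketch of a table-construction example (column names are assumptions;
# confirm them with help(strep_tb))
library(medicaldata)
library(gtsummary)

# Summarize baseline characteristics and outcome by treatment arm
tbl <- tbl_summary(
  strep_tb,
  by = arm,
  include = c(gender, baseline_condition, improved)
)

tbl
```

Wrapping code like this with reprex::reprex() before posting is one way to give reviewers something they can run as-is.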
medical datasets?
• Randomized controlled trials
• Cohort studies
• Case-control studies
• Must be of reasonable size (5MB limit on CRAN)
• Must be anonymized
• Fake names, fake study IDs are helpful
• Need a reasonable level of documentation/codebook/publication
• Wide medical data that need pivot_longer()?
• Untidy medical data that need help from {tidyr}?
• separate(), unite()
• separate_rows()
• nest(), unnest()
• fill(), complete(), replace_na()
• Color-coded medical data that need {tidyxl}?
• Multiheaded medical data that need {unheadr}?
• Messy medical data that need {unpivotr}?
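To illustrate the first kind of dataset, here is a minimal sketch of wide data reshaped with pivot_longer(). The data frame and its column names (patient_id, crp_week0, and so on) are invented for illustration and do not come from any dataset in the package.

```r
library(tidyr)

# Hypothetical wide data: one row per patient, one column per visit
wide <- data.frame(
  patient_id = c("P01", "P02"),
  crp_week0  = c(12.1, 8.4),
  crp_week4  = c(6.3, NA),
  crp_week8  = c(3.0, 2.2)
)

# Reshape to one row per patient per visit
long <- pivot_longer(
  wide,
  cols = starts_with("crp_week"),
  names_to = "week",
  names_prefix = "crp_week",               # strip the prefix from the names
  names_transform = list(week = as.integer), # store the week as an integer
  values_to = "crp"
)

long
```

The missing week-4 value for the second patient stays in the long data as NA, which leaves room to practice fill(), complete(), or replace_na() on the same example.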
people who send an SASE
• SASE = self-addressed, stamped envelope
• Important: include sufficient postage on the SASE and the sending envelope!
[Envelope diagram: "STAMP here"; Sender (your name and address): 123 Data Street, Medical Center City, State, Country, Postal code]
Fold the SASE into thirds, and put the SASE into a 2nd envelope. Mail this outer envelope (with postage) with SASE enclosed to:
Peter Higgins
SPC 5682
1150 West Medical Center Drive
Michigan Medicine
Ann Arbor, Michigan, 48109 USA