1) automate part of the radiology workflow to support faster decisions 2) reduce the risk of the errors Our team • Backend/frontend • Sales & Marketing • Data Team • ML team (11 ML engineers)
acquisition protocols c) Human errors 2) Noisy y a) Inter-observer annotation variability b) Intra-observer annotation variability c) Unreliable data sources 3) Different label sets of y a) Different classes in the different datasets
Dataset 1 Bounding boxes: • Malignant mass • Benign mass • Malignant calcification • Benign calcification • Skin thickening Dataset 2 Bounding boxes: • Malignant mass • Benign mass We will penalize network for proposing non-annotated regions in Dataset 2 that contain objects!
noisy examples ◦ Usually doesn’t work well in the medical domain • Label smoothing ◦ Soft labels ◦ Smoothing for mask boundaries • Online noise correction ◦ Bounding box correction ◦ Mask refinement
update gold standard datasets: • Consensus labeling by at least 3 annotators • Preferably clinically verified • It must include all conditions that your system is supposed to detect But remember - every time you test your experiment on this dataset - you are one step closer to overfitting
want to work with the medical data • It is both challenging and rewarding • Read literature, but stay creative - every problem is unique • Invest time and money into creating gold standard dataset for pre-release testing of all your crazy ideas