Presentation given at the Paris-Saclay Center for Data Science (INRIA Saclay) on the topic of data challenges for astronomical high-contrast imaging and direct detection of exoplanets.
Space and Environmental Sciences ▪ WP2: Data Science for Life Sciences ▪ WP3: Massive and Rich Data for Humanities ▪ WP4: Data Science, Social Media and Social Sciences ▪ WP5: Data Governance, Data Protection and Privacy WP0: Coordination MSTIC - Mathematics, Information and Communication Sciences and Technologies CBS - Chemistry, Biology, Health PAGE - Particle physics, Astrophysics, Geosciences, Environment and ecology SHS - Humanities and Social Sciences PSS - Social Sciences USMB – Univ Savoie Mont Blanc 19 Labs involved that cover 5 research domains IDEX Cross disciplinary Program • 1.7 million euros • From 2017 to 2020
High-dimension Mediation Data Challenge, • audio-visual diarization, • cancer research, • home-made framework (codalab / jupyter)? • Data Science in the Alpes (March 20) • R in Grenoble group. • PySciDataGre group. • Data club. • Data Carpentry.
and astrophysics with CS & ML. • Integrating cutting-edge ML developments. • Ensuring the use of robust statistical approaches and well-suited metrics. • Open-source development. • Data challenges.
• Sky or thermal background subtraction • Bad pixel correction Raw astronomical images Detection on final residual image Image recentering Bad frames removal PSF modeling • Median • Pairwise, ANDROMEDA • LOCI • PCA, NMF • LLSG Image combination Model PSF subtraction De-rotation (ADI) or rescaling (mSDI) Characterization of detected companions Sequence of calibrated images
et al. 2007 A lgo- ZO O Marois et al. 2007 Soummer et al. 2012 Amara & Quanz 2012 Absil et al. 2013 Gomez Gonzalez et al. 2017 Gomez Gonzalez et al. 2016 Gomez Gonzalez et al. 2016 Marois et al. 2014 Marois et al. 2014 Hagelberg et al. 2015 Mugnier et al. 2009 Cantalloube et al. 2015
to astronomy: a tool, combining science and engineering. It lies outside the central field of principal competence among the researchers that rely on it. … it builds upon scientific progress and shapes our scientific vision.” Pradal et al. 2015
Box “…if the model is going to be wrong anyway, why not see if you can get the computer to ‘quickly’ learn a model from the data, rather than have a human laboriously derive a model from a lot of thought.” Peter Norvig
• Main organizer takes care of logistics/ leaderboard. • Main organizer writes a review-type paper. • Community effort. • Using a robust framework for data challenges creation. • Hands-on sessions. • Workshop for analyzing results and learning from different approaches. Old school Open science Data challenges
reality? • Several instruments/surveys with large databases. • New instruments coming online in the next years. • ~13 years of image processing techniques. • Discovering new techniques! Exoplanet DI challenge Motivation