Slide 3
Slide 3 text
■ Plugin/service-based architectures
□ One plugin/service per data source
□ Custom data schema
□ Alitheia-Core [Gousios et al., 2009], SOFAS [Ghezzi, 2012], Sonarqube
■ Graphical ETL-Tools
□ Plugin for each data source connection
□ Visual creation of ETL processes
□ RapidMiner, KNIME
■ Collections of Repository Data
□ Pre-collected, cleansed, and interlinked data sets
□ Boa [Dyer et al., 2013] with custom query language
□ GHTorrent [Gousios, 2013 and ongoing], StackExchange dumps
Christoph Matthies
Sep 5
DataRover
Related Work
Chart 3