Web Scale Collaborations

Arfon Smith
December 10, 2013

Transcript

  1. Web Scale Collaborations Arfon Smith @arfon

  2. Citizen Science

  3. Distributed Computing

  4. None
  5. Distributed Data Collection

  6. None
  7. None
  8. Distributed Analysis

  9. None
  10. None
  11. None
  12. None
  13. http://www.novacelestia.com

  14. None
  15. None
  16. None
  17. None
  18. None
  19. [Chart: scale from 0 to 1,000,000; labels: Professor, Paper, PhD, SDSS]

  20. [Chart: classifications per hour (0 to 70,000) against hours (0 to 48); annotations: "1 Kevin months", Fukugita et al. 2007]

  21. None
  22. None
  23. None
  24. None
  25. None
  26. None
  27. None
  28. None
  29. None
  30. [Image panels: SDSS, HST; Starforming pea, Narrow-line Seyfert pea]

  31. None
  32. None
  33. None
  34. None
  35. None
  36. None
  37. None
  38. Motivations

  39. None
  40. None
  41. None
  42. 1,000,000,000,000 hours / year

  43. Spectrum of cognitive surplus

  44. None
  45. None
  46. Begins with open data

  47. Open Source

  48. Not treating code and data as first class research objects

  49. GitHub

  50. What is a GitHub?

  51. None
  52. None
  53. None
  54. None
  55. Easier to work together than alone

  56. Open Source collaboration

  57. None
  58. None
  59. None
  60. None
  61. None
  62. None
  63. None
  64. None
  65. None
  66. None
  67. None
  68. None
  69. None
  70. None
  71. None
  72. None
  73. Open ≠ Public

  74. Open (within your team, department or institution)

  75. Electronic

  76. Available

  77. Asynchronous

  78. Lock-free

  79. None
  80. None
  81. None
  82. Low friction collaboration

  83. “open source is… reproducible by necessity” Fernando Perez http://blog.fperez.org/2013/11/an-ambitious-experiment-in-data-science.html

  84. Better at collaborating because they have to be

  85. Towards Collaborative Versioned Science

  86. How do we make this behaviour the norm?

  87. Incentive model (it’s broken)

  88. Credit

  89. http://dx.doi.org/10.6084/m9.figshare.828487

  90. http://dx.doi.org/10.6084/m9.figshare.828487

  91. None
  92. None
  93. Derive meaningful metrics from open contributions

  94. “Academic environments of today do not reward tool builders” Ed Lazowska, OSTP event http://lazowska.cs.washington.edu/MS/MS.OSTP.pdf

  95. A VISION AND STRATEGY FOR SOFTWARE FOR SCIENCE, ENGINEERING, AND EDUCATION

  96. What can we do today?

  97. Take data management plans seriously

  98. Try versioning your research

  99. Share more than just data

  100. If you’re going to share it then you better put a licence on it

  101. Thanks. arfon@github.com @arfon