Slide 1

Slide 1 text

CSIndexbr: A Brazilian Computer Science Index
csindexbr.org
Marco Tulio Valente, ASERG/DCC/UFMG

Slide 2

Slide 2 text

Scholarly Communication Tools

Kramer, Bianca; Bosman, Jeroen (2015): 101 Innovations in Scholarly Communication - the Changing Research Workflow. https://doi.org/10.6084/m9.figshare.1286826.v1

Slide 3

Slide 3 text

CSIndexbr is an experimental scholarly communication tool with two goals:
- Discovery, e.g. What are the "best" Brazilian papers in my area?
- Assessment, e.g. What are the "best" Brazilian CS depts in my area?

Slide 4

Slide 4 text

Index of recent papers published by Brazilian CS professors in selected conferences and journals in the last five years (2014-today)
- Transparent
- Open
- but "unofficial"

Slide 5

Slide 5 text

Primary data source: DBLP (dblp.org)
- High-quality metadata about CS papers
- Covers all relevant CS venues
- Open-license
- Very reliable API
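As a rough illustration of how DBLP metadata can be queried, the sketch below builds a URL for DBLP's public publication-search endpoint. The `q`, `format`, and `h` parameters belong to that endpoint; the helper name `dblp_search_url` and the example query are assumptions made for this example, and the slides do not say which DBLP interface CSIndexbr actually uses.

```python
from urllib.parse import urlencode

# Public DBLP publication-search endpoint.
DBLP_PUBL_API = "https://dblp.org/search/publ/api"


def dblp_search_url(query: str, max_hits: int = 30) -> str:
    """Build a DBLP search URL (hypothetical helper, not CSIndexbr code)."""
    params = {
        "q": query,        # free-text query
        "format": "json",  # ask for JSON instead of the default XML
        "h": max_hits,     # maximum number of hits to return
    }
    return f"{DBLP_PUBL_API}?{urlencode(params)}"


# Example query chosen only for illustration.
print(dblp_search_url("software visualization", max_hits=5))
```

The function only builds the URL, so it can be inspected or tested without network access; fetching and parsing the response is left out on purpose.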

Slide 6

Slide 6 text

Some numbers

Slide 7

Slide 7 text

3.2K users, 31K pageviews (in one year)

Slide 8

Slide 8 text

Statistics Page (Feb 2019): https://csindexbr.org/statistics.html

Slide 9

Slide 9 text

Key decision: organization by research areas

Slide 10

Slide 10 text

Research Areas (21)

Slide 11

Slide 11 text

Brazilian CS Professors
- 1,070 professors
- 799 with indexed papers

Slide 12

Slide 12 text

If a name is missing (or for any other question), use the form at https://goo.gl/forms/kz3F1fZIKtubWYiu1, which is also linked from csindexbr.org

Slide 13

Slide 13 text

Key contribution: Curated dataset of conferences and journals

Slide 14

Slide 14 text

Conferences
- 15 conferences / area (max)
- Only full, main-track papers (10 pages)
- Short papers, tool papers, workshop papers, etc. are not indexed
- Criteria:
  - submitted > 100 papers
  - acceptance rate < 30%
  - h5-index > 20
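The three conference criteria above can be read as a simple predicate. The sketch below is only an illustration: the `Conference` record, the function name `meets_criteria`, and the sample figures are assumptions, and the slides do not describe how the curated list was actually checked.

```python
from dataclasses import dataclass


@dataclass
class Conference:
    """Hypothetical record holding the statistics named on the slide."""
    name: str
    submitted: int  # number of submitted papers
    accepted: int   # number of accepted papers
    h5_index: int   # venue h5-index


def meets_criteria(conf: Conference,
                   min_submitted: int = 100,
                   max_acceptance: float = 0.30,
                   min_h5: int = 20) -> bool:
    """Return True if the venue passes all three thresholds."""
    if conf.submitted <= min_submitted:
        return False
    acceptance = conf.accepted / conf.submitted
    return acceptance < max_acceptance and conf.h5_index > min_h5


# Made-up venues, for illustration only.
print(meets_criteria(Conference("ExampleConf", 400, 88, 45)))
print(meets_criteria(Conference("SmallConf", 80, 20, 25)))
```

Keeping the thresholds as parameters makes it easy to record a documented exception for a specific area instead of changing the rule itself.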

Slide 15

Slide 15 text

Exceptions:
- Many areas: full papers < 10 pages
- Computer Networks: 18 confs
- Algorithms & Complexity: acceptance ~ 40%
- etc.

Slide 16

Slide 16 text

Exceptions are highlighted in yellow in Stats (C)

Slide 17

Slide 17 text

Top-Conferences (⭐)
- 3 top-conferences / area (max)
- submitted > 180 papers
- h5-index > 30

Slide 18

Slide 18 text

Journals
- 15 journals / area (max)
- Criteria:
  - Indexed by JCR
  - h5-index > 25

Slide 19

Slide 19 text

Top-Journals
- 3 top-journals / area (max)
- Criteria:
  - ACM Transactions or IEEE Transactions (or similar)

Slide 20

Slide 20 text

Goal 1: Discovery

Slide 21

Slide 21 text

Papers / Conference [always in the last 5 years]

Slide 22

Slide 22 text

Papers / Journal

Slide 23

Slide 23 text

Papers (Conferences & Journals)

Slide 24

Slide 24 text

Professors with Papers (in a Research Area)

Slide 25

Slide 25 text

Author Pages

Slide 26

Slide 26 text

Goal 2: Assessment

Slide 27

Slide 27 text

Department Rankings
- 1.0: paper in top-conference or top-journal
- 0.40: paper in journals
- 0.33: paper in
  - conferences
  - magazines
  - journals with short papers
  - mega-journals
  - journals with normalized-h5-index < 0.2
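The weights above suggest a score that sums one weight per paper. The sketch below shows that arithmetic under stated assumptions: the category labels (`top`, `journal`, `other`), the function name, and the sample data are invented for illustration, and each (department, paper) pair is counted once, which the slides do not confirm.

```python
from collections import defaultdict

# Weights taken from the slide; the category labels are assumptions.
WEIGHTS = {
    "top": 1.0,       # top-conference or top-journal
    "journal": 0.40,  # other indexed journals
    "other": 0.33,    # conferences, magazines, mega-journals, etc.
}


def department_scores(papers):
    """Sum the weight of each (department, category) paper entry."""
    scores = defaultdict(float)
    for dept, category in papers:
        scores[dept] += WEIGHTS[category]
    return dict(scores)


# Made-up data: one tuple per paper credited to a department.
papers = [
    ("Dept A", "top"),
    ("Dept A", "journal"),
    ("Dept B", "other"),
    ("Dept B", "other"),
    ("Dept B", "journal"),
]

scores = department_scores(papers)
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking)
```

With these weights, one top-venue paper plus one journal paper (1.4) outranks two lower-weight papers plus one journal paper (1.06), so volume alone does not decide the ranking.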

Slide 28

Slide 28 text

Dept Rankings: per Research Area

Slide 29

Slide 29 text

More details: FAQ https://csindexbr.org/faq.html

Slide 30

Slide 30 text

Beyond rankings: a repository for scientometrics studies on Brazilian scientific production in CS

Slide 31

Slide 31 text

Source code and data are public on GitHub: https://github.com/aserg-ufmg/CSIndex

Slide 32

Slide 32 text

Documentation (in progress): https://github.com/aserg-ufmg/CSIndex

Slide 33

Slide 33 text

First CSIndexbr-based study: Brazilian Workshop on Software Visualization, Evolution and Maintenance, 2018

Slide 34

Slide 34 text

Research Topics

Slide 35

Slide 35 text

This paper (a workshop paper, in Portuguese) is a good reminder that "not in CSIndexbr" ≠ "irrelevant"

Slide 36

Slide 36 text

Another example: most common words in paper titles (5-minute analysis)
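A quick analysis of this kind might look like the sketch below. It is not the script behind the slide: the stopword list, the function name `common_title_words`, and the sample titles are all made up for illustration.

```python
import re
from collections import Counter

# Small illustrative stopword list; a real analysis would use a fuller one.
STOPWORDS = {"a", "an", "and", "for", "in", "of", "on", "the", "to", "with"}


def common_title_words(titles, top_n=10):
    """Count lowercase words across titles, skipping stopwords."""
    counts = Counter()
    for title in titles:
        words = re.findall(r"[a-z]+", title.lower())
        counts.update(w for w in words if w not in STOPWORDS)
    return counts.most_common(top_n)


# Invented titles, not taken from the CSIndexbr dataset.
titles = [
    "Mining Software Repositories for Refactoring",
    "A Study of Refactoring in Open Source Software",
    "Deep Learning for Software Defect Prediction",
]

print(common_title_words(titles, top_n=2))
```

In practice the titles would come from the public dataset in the GitHub repository rather than from a hard-coded list.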

Slide 37

Slide 37 text

Other features: arXiv links & citations

Slide 38

Slide 38 text

Links to arXiv preprints (if available)

Slide 39

Slide 39 text

Only 5% of papers have preprints on arXiv

Slide 40

Slide 40 text

arXiv popularity (worldwide): 23%

Popularity of arXiv.org within Computer Science. Charles Sutton and Linan Gong, https://arxiv.org/pdf/1710.05225.pdf

Slide 41

Slide 41 text

Another feature: citations

Slide 42

Slide 42 text

Crossref Citations
- Crossref is an official DOI registration agency
- It maintains a database of citations, used by ACM DL, IEEE DL, Dimensions, etc.
- It has a public API (unlike Google Scholar)
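To make the public API concrete, the sketch below builds the Crossref REST URL for a DOI and reads the citation count from a response. The `works/{DOI}` route and the `is-referenced-by-count` field are part of the Crossref REST API; the helper names, the placeholder DOI, and the sample payload are assumptions, and the slides do not say which calls CSIndexbr makes.

```python
import json
from urllib.parse import quote

CROSSREF_WORKS = "https://api.crossref.org/works/"


def crossref_work_url(doi: str) -> str:
    """Build the metadata URL for one DOI (hypothetical helper)."""
    return CROSSREF_WORKS + quote(doi, safe="/")


def citation_count(response_text: str) -> int:
    """Extract the citation count from a Crossref works response."""
    message = json.loads(response_text)["message"]
    return message.get("is-referenced-by-count", 0)


# Made-up payload with a placeholder DOI, shaped like a Crossref response.
sample = json.dumps({
    "status": "ok",
    "message": {"DOI": "10.1000/example", "is-referenced-by-count": 12},
})

print(crossref_work_url("10.1000/example"))
print(citation_count(sample))
```

Parsing is kept separate from fetching so the extraction logic can be exercised offline with a canned response.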

Slide 43

Slide 43 text

Future Work

Slide 44

Slide 44 text

Future Work
1. Internal improvements, scripts, refactorings, etc.
2. Update conferences and journals statistics (2018)
3. Extend data collection to more than 5 years
4. Integration with CNPq: link to Lattes
5. (?) "Global" depts ranking (all areas)
6. (?) Adjust scores by number of authors
7. Other countries

Slide 45

Slide 45 text

Quiz [to encourage use of the system]
1. In which area does DCC have the most researchers?
2. In which area does DCC have the fewest researchers?
3. Which DCC paper has the most citations?
4. Which journal accepts the largest number of BR papers?
5. In which area does BR have the most researchers?
6. Which BR dept has the most researchers?
7. Which BR paper has the most citations?
8. What is the BR share of the papers in area X?

Slide 46

Slide 46 text

csindexbr.org
Thanks