Upgrade to Pro — share decks privately, control downloads, hide ads and more …

CSIndexbr: A Brazilian Computer Science Index (EVCOMP 2019)

CSIndexbr: A Brazilian Computer Science Index (EVCOMP 2019)

ASERG, DCC, UFMG

February 20, 2019
Tweet

More Decks by ASERG, DCC, UFMG

Other Decks in Research

Transcript

  1. CSIndexbr
    A Brazilian Computer Science Index
    csindexbr.org
    Marco Tulio Valente,
    ASERG/DCC/UFMG

    View Slide

  2. Scholarly Communication Tools
    2
    Kramer, Bianca; Bosman, Jeroen (2015): 101 Innovations in Scholarly Communication -
    the Changing Research Workflow. https://doi.org/10.6084/m9.figshare.1286826.v1

    View Slide

  3. CSIndexbr is a experimental scholarly
    communication tool, with two goals:
    - Discovery
    e.g. What are the "best" Brazilian papers in my area?
    - Assessment
    e.g. What are the "best" Brazilian CS depts in my area?
    3

    View Slide

  4. Index of recent papers published by Brazilian CS
    professors in selected conferences and journals
    in the last five years (2014-today)
    - Transparent
    - Open
    - but "unofficial"
    4

    View Slide

  5. Primary data source: DBLP (dblp.org)
    - High-quality metadata about CS papers
    - Covers all relevant CS venues
    - Open-license
    - Very reliable API
    5

    View Slide

  6. Some numbers
    6

    View Slide

  7. 3.2K users, 31K pageviews (in one year)
    7

    View Slide

  8. Statistics Page (Fev, 2019)
    https://csindexbr.org/statistics.html
    8

    View Slide

  9. Key decision:
    organization by research areas
    9

    View Slide

  10. Research Areas (21)
    10

    View Slide

  11. Brazilian CS Professors
    - 1,070 professors
    - 799 with indexed papers
    11

    View Slide

  12. If you miss a name (or for any other question):
    https://goo.gl/forms/kz3F1fZIKtubWYiu1
    or from csindexbr.org 12

    View Slide

  13. Key contribution:
    Curated dataset of conferences and journals
    13

    View Slide

  14. Conferences
    - 15 conferences / area (max)
    - Only full, main-track papers (10 pages)
    - short, tool, workshop etc papers are not indexed
    - Criteria:
    - submitted > 100 papers
    - acceptance < 30%
    - h5-index > 20
    14

    View Slide

  15. Exceptions:
    - Many areas: full papers < 10 pages
    - Computer Networks: 18 confs
    - Algorithms & Complexity: accept. ~ 40%
    - etc
    15

    View Slide

  16. Exceptions are highlighted in yellow in Stats (C)
    16

    View Slide

  17. Top-Conferences (⭐)
    - 3 top-conferences / area (max)
    - submitted > 180 papers
    - h5-index > 30
    17

    View Slide

  18. Journals
    - 15 journals / area (max)
    - Criteria:
    - Indexed by JCR
    - h5-index > 25
    18

    View Slide

  19. Top-Journals
    - 3 top-journals / area (max)
    - Criteria:
    - ACM Transactions or IEEE Transactions (or similar)
    19

    View Slide

  20. Goal 1: Discovery
    20

    View Slide

  21. Papers / Conference [always in the last 5 yrs]
    21

    View Slide

  22. Papers / Journal
    22

    View Slide

  23. Papers (Conferences & Journals)
    23

    View Slide

  24. Professors with Papers (in a Research Area)
    24

    View Slide

  25. Author Pages
    25

    View Slide

  26. Goal 2: Assessment
    26

    View Slide

  27. Department Rankings
    - 1.0: paper in top-conference or top-journal
    - 0.40: paper in journals
    - 0.33: paper in
    - conference
    - magazines
    - journals with short papers
    - mega-journals
    - journals with normalized-h5-index < 0.2
    27

    View Slide

  28. Dept Rankings: per Research Area
    28

    View Slide

  29. More details: FAQ
    https://csindexbr.org/faq.html
    29

    View Slide

  30. Beyond rankings: a repository for scientometrics
    studies on Brazilian scientific production in CS
    30

    View Slide

  31. Source code and data is public on GitHub
    https://github.com/aserg-ufmg/CSIndex
    31

    View Slide

  32. Documentation (in progress)
    https://github.com/aserg-ufmg/CSIndex 32

    View Slide

  33. First CSIndexbr-based study:
    Brazilian Workshop on Software Visualization, Evolution and Maintenance, 2018
    33

    View Slide

  34. Research Topics
    34

    View Slide

  35. This paper (workshop, portuguese) is a good
    opportunity for remembering that
    not at CSIndexbr ≠ "irrelevant"
    35

    View Slide

  36. Another Example: most common words in
    paper's titles (5-min analysis)
    36

    View Slide

  37. Other features:
    arXiv links & citations
    37

    View Slide

  38. Links to arXiv preprints (if available)
    38

    View Slide

  39. Only 5% of papers have preprints on arXiv
    39

    View Slide

  40. arXiv popularity (worldwide): 23%
    40
    Popularity of arXiv.org within Computer Science. Charles Sutton and Linan Gong,
    https://arxiv.org/pdf/1710.05225.pdf

    View Slide

  41. Another feature: citations
    41

    View Slide

  42. CrossRef Citations
    - Crossref is an official DOI registration agency
    - They maintain a database of citations
    - used by ACM DL, IEEE DL, Dimensions etc
    - Has a public API (unlike Google Scholar)
    42

    View Slide

  43. Future Work
    43

    View Slide

  44. Future Work
    1. Internal improvements, scripts, refactorings etc
    2. Update conferences and journals statistics (2018)
    3. Extend data collection to more than 5 years
    4. Integration with CNPq: link to Lattes
    5. (?) "Global" depts ranking (all areas)
    6. (?) Adjust scores by number of authors
    7. Other countries
    44

    View Slide

  45. Quiz [para incentivar acesso ao sistema]
    45
    1. Em qual área o DCC tem mais pesquisadores?
    2. Em qual área o DCC tem menos pesquisadores?
    3. Qual o paper do DCC tem mais citações?
    4. Qual o journal aceita o maior número de papers BR?
    5. Em qual área o BR tem mais pesquisadores?
    6. Qual o dept BR tem mais pesquisadores?
    7. Qual o paper BR tem mais citações?
    8. Qual a participação BR nos papers da área X?

    View Slide

  46. csindexbr.org
    Thanks
    46

    View Slide