Upgrade to Pro — share decks privately, control downloads, hide ads and more …

CSIndexbr: A Brazilian Computer Science Index (EVCOMP 2019)

CSIndexbr: A Brazilian Computer Science Index (EVCOMP 2019)

ASERG, DCC, UFMG

February 20, 2019
Tweet

More Decks by ASERG, DCC, UFMG

Other Decks in Research

Transcript

  1. CSIndexbr
    A Brazilian Computer Science Index
    csindexbr.org
    Marco Tulio Valente,
    ASERG/DCC/UFMG

    View full-size slide

  2. Scholarly Communication Tools
    2
    Kramer, Bianca; Bosman, Jeroen (2015): 101 Innovations in Scholarly Communication -
    the Changing Research Workflow. https://doi.org/10.6084/m9.figshare.1286826.v1

    View full-size slide

  3. CSIndexbr is a experimental scholarly
    communication tool, with two goals:
    - Discovery
    e.g. What are the "best" Brazilian papers in my area?
    - Assessment
    e.g. What are the "best" Brazilian CS depts in my area?
    3

    View full-size slide

  4. Index of recent papers published by Brazilian CS
    professors in selected conferences and journals
    in the last five years (2014-today)
    - Transparent
    - Open
    - but "unofficial"
    4

    View full-size slide

  5. Primary data source: DBLP (dblp.org)
    - High-quality metadata about CS papers
    - Covers all relevant CS venues
    - Open-license
    - Very reliable API
    5

    View full-size slide

  6. Some numbers
    6

    View full-size slide

  7. 3.2K users, 31K pageviews (in one year)
    7

    View full-size slide

  8. Statistics Page (Fev, 2019)
    https://csindexbr.org/statistics.html
    8

    View full-size slide

  9. Key decision:
    organization by research areas
    9

    View full-size slide

  10. Research Areas (21)
    10

    View full-size slide

  11. Brazilian CS Professors
    - 1,070 professors
    - 799 with indexed papers
    11

    View full-size slide

  12. If you miss a name (or for any other question):
    https://goo.gl/forms/kz3F1fZIKtubWYiu1
    or from csindexbr.org 12

    View full-size slide

  13. Key contribution:
    Curated dataset of conferences and journals
    13

    View full-size slide

  14. Conferences
    - 15 conferences / area (max)
    - Only full, main-track papers (10 pages)
    - short, tool, workshop etc papers are not indexed
    - Criteria:
    - submitted > 100 papers
    - acceptance < 30%
    - h5-index > 20
    14

    View full-size slide

  15. Exceptions:
    - Many areas: full papers < 10 pages
    - Computer Networks: 18 confs
    - Algorithms & Complexity: accept. ~ 40%
    - etc
    15

    View full-size slide

  16. Exceptions are highlighted in yellow in Stats (C)
    16

    View full-size slide

  17. Top-Conferences (⭐)
    - 3 top-conferences / area (max)
    - submitted > 180 papers
    - h5-index > 30
    17

    View full-size slide

  18. Journals
    - 15 journals / area (max)
    - Criteria:
    - Indexed by JCR
    - h5-index > 25
    18

    View full-size slide

  19. Top-Journals
    - 3 top-journals / area (max)
    - Criteria:
    - ACM Transactions or IEEE Transactions (or similar)
    19

    View full-size slide

  20. Goal 1: Discovery
    20

    View full-size slide

  21. Papers / Conference [always in the last 5 yrs]
    21

    View full-size slide

  22. Papers / Journal
    22

    View full-size slide

  23. Papers (Conferences & Journals)
    23

    View full-size slide

  24. Professors with Papers (in a Research Area)
    24

    View full-size slide

  25. Author Pages
    25

    View full-size slide

  26. Goal 2: Assessment
    26

    View full-size slide

  27. Department Rankings
    - 1.0: paper in top-conference or top-journal
    - 0.40: paper in journals
    - 0.33: paper in
    - conference
    - magazines
    - journals with short papers
    - mega-journals
    - journals with normalized-h5-index < 0.2
    27

    View full-size slide

  28. Dept Rankings: per Research Area
    28

    View full-size slide

  29. More details: FAQ
    https://csindexbr.org/faq.html
    29

    View full-size slide

  30. Beyond rankings: a repository for scientometrics
    studies on Brazilian scientific production in CS
    30

    View full-size slide

  31. Source code and data is public on GitHub
    https://github.com/aserg-ufmg/CSIndex
    31

    View full-size slide

  32. Documentation (in progress)
    https://github.com/aserg-ufmg/CSIndex 32

    View full-size slide

  33. First CSIndexbr-based study:
    Brazilian Workshop on Software Visualization, Evolution and Maintenance, 2018
    33

    View full-size slide

  34. Research Topics
    34

    View full-size slide

  35. This paper (workshop, portuguese) is a good
    opportunity for remembering that
    not at CSIndexbr ≠ "irrelevant"
    35

    View full-size slide

  36. Another Example: most common words in
    paper's titles (5-min analysis)
    36

    View full-size slide

  37. Other features:
    arXiv links & citations
    37

    View full-size slide

  38. Links to arXiv preprints (if available)
    38

    View full-size slide

  39. Only 5% of papers have preprints on arXiv
    39

    View full-size slide

  40. arXiv popularity (worldwide): 23%
    40
    Popularity of arXiv.org within Computer Science. Charles Sutton and Linan Gong,
    https://arxiv.org/pdf/1710.05225.pdf

    View full-size slide

  41. Another feature: citations
    41

    View full-size slide

  42. CrossRef Citations
    - Crossref is an official DOI registration agency
    - They maintain a database of citations
    - used by ACM DL, IEEE DL, Dimensions etc
    - Has a public API (unlike Google Scholar)
    42

    View full-size slide

  43. Future Work
    43

    View full-size slide

  44. Future Work
    1. Internal improvements, scripts, refactorings etc
    2. Update conferences and journals statistics (2018)
    3. Extend data collection to more than 5 years
    4. Integration with CNPq: link to Lattes
    5. (?) "Global" depts ranking (all areas)
    6. (?) Adjust scores by number of authors
    7. Other countries
    44

    View full-size slide

  45. Quiz [para incentivar acesso ao sistema]
    45
    1. Em qual área o DCC tem mais pesquisadores?
    2. Em qual área o DCC tem menos pesquisadores?
    3. Qual o paper do DCC tem mais citações?
    4. Qual o journal aceita o maior número de papers BR?
    5. Em qual área o BR tem mais pesquisadores?
    6. Qual o dept BR tem mais pesquisadores?
    7. Qual o paper BR tem mais citações?
    8. Qual a participação BR nos papers da área X?

    View full-size slide

  46. csindexbr.org
    Thanks
    46

    View full-size slide