Upgrade to Pro — share decks privately, control downloads, hide ads and more …

20240828IGC2024OSMCha

 20240828IGC2024OSMCha

Spatio-temporal Editing Trends Validation of Volunteered Geographic Information with Manual and AI-assisted Mapping

Avatar for Toshikazu SETO

Toshikazu SETO

August 28, 2024

More Decks by Toshikazu SETO

Other Decks in Education

Transcript

  1. 2024/08/28 35th International Geographical Congress 2024 1 Spatio-temporal Editing Trends

    Validation of Volunteered Geographic Information with Manual and AI-assisted Mapping Toshikazu SETO* and Yuichiro Nishimura** A02 KOMAZAWA UNIVERSITY Visual Identity Guidelines Ϋʴ࿨จϩΰλΠϓ ,ϚʔΫʴ࿨จϩΰλΠϓͷ૊Έ߹Θͤ ͸ɺ ࠨͷछͰ͢ɻ ԣ جຊܗͱ͠ɺ ༏ઌతʹ࢖༻͠·͢ɻ ԣ ϫϯϙΠϯτͳͲɺ ʮԣʯ ͕഑ஔ͠ʹ͍͘ ৔߹ʹ࢖༻͠·͢ɻ ॎ ॎܕαΠϯͳͲɺ ࡉ௕͍഑ஔʹ࢖༻͠·͢ɻ ඞͣϚελʔσʔλΛ࢖༻͍ͯͩ͘͠͞ɻ ࠨهҎ֎ͷ૊Έ߹ΘͤΛ࡞੒͠ͳ͍Ͱ ͍ͩ͘͞ɻ ܗʣ ϙΠϯτ౳ʣ ॎ *: Associate Professor: Komazawa University Center for Spatial Information Science, the University of Tokyo **: Professor: Nara Women’s University [email protected]
  2. 2024/08/28 35th International Geographical Congress 2024 2 Outline • Background,

    methods • OSMCha quantitative analysis • Time transition trends • Detection reasons in OSMCha • Overview of “suspicious” changeset trends • OSMCha exploratory (qualitative) analysis • New Mappers activity, Map-with-AI (RapiD) • “harmful” editing, etc… • Findings and future works
  3. 2024/08/28 35th International Geographical Congress 2024 3 The research of

    OSM/VGI quality assessment • Quattrone et al., 2016 • Analysis of country-level update frequency and tag (uid, changeset, timestamp, version, lat, lon, taglist) • Anderson et al., 2019 • Focus on corporate editors using OSM-QA tiles • Raifer et al., 2019 • Development analytical platform and ohsome-API for OSHDB: Data Aggregation, Data Extraction, User stat • Minghini and Frassinelli, 2019 • “Is OSM up-to-date” date of creation (first edit), date of last edit, number of versions (revisions), number of different contributors who edited that node or way, frequency of update → Participatory GIS, community geography…
  4. 2024/08/28 35th International Geographical Congress 2024 4 New research topics

    in OSM in recent years https://www.openstreetmap.org/user/mvexel/diary/400035 • Quality improvement and monitoring as geospatial data • Provision of automatic mapping technology by AI and other means and improvement of accuracy • Data assessment, especially for local daily activities • Response to vandalism data destruction (Juhász et al., 2020) • Emergence of diverse actors and acceptance in the active community • Commercial: rise of corporate members and advancements in AI technology (Anderson et al., 2019) • Humanitarian: gender gap issues, local contributors and young development (Solís & Zeballos, 2023)
  5. 2024/08/28 35th International Geographical Congress 2024 5 OSM Changeset Analyzer

    (OSMCha) https://github.com/OSMCha/osmcha • Python package to automatically detect suspicious OSM changesets. • It was designed to be used with osmcha-django, but also can be used standalone or website via License: GPL-3.0.
  6. 2024/08/28 35th International Geographical Congress 2024 6 Methodology • Target

    period/data: Changeset (195,416 records) for one year from January 1 to December 31, 2023 • Extraction range: Area that includes Japan in the scope of coverage • Geometry indicates the edit range (rectangle) of the changeset. • Although there are a few that include the area around Japan and the entire world in the scope, for analysis purposes, it is limited to logs targeting Japan. • Acquire changeset logs for the relevant period from the API in GeoJSON format and map them using QGIS, etc. • https://osmcha.org/api/v1/changesets/ • In addition to basic OSM attributes (number of data edits, OSMuser, edit date/time, satellite images used, etc.), and OSMCha's own detection reasons (fragment) is included. • In this study, we mainly focus on OSMCha's “suspicious OSM editing” tag and explore its characteristics in terms of trends of OSM editing in Japan.
  7. 2024/08/28 35th International Geographical Congress 2024 8 Detection Rules of

    OSMCha (in part) • Possible Import • Tag a changeset as a possible import if the number of created elements is greater than 70% of the sum of elements created, modified and deleted and if it creates more than 1000 elements or 200 elements case it used one of the powerfull editors. • Mass Modification • Consider a changeset as a mass modification if the number of modified elements is greater than 70% of the sum of elements created, modified and deleted and if it modifies more than 200 elements. • Mass Deletion • All changesets that delete more than 1000 elements are considered a mass deletion. If the changeset deletes between 200 and 1000 elements and the number of deleted elements is greater than 70% of the sum of elements created, modified and deleted it's also tagged as a mass deletion. • New mapper • Verify if the user has less than 5 edits or less than 5 mapping days. • User has multiple blocks • Changesets created by users that has received more than one block will be flagged. https://github.com/OSMCha
  8. 2024/08/28 35th International Geographical Congress 2024 10 Kernel density estimation

    for all “suspicious” changesets Unit: 10km Tokyo Osaka Nagoya Fukushima Iwate Fukuoka
  9. 2024/08/28 35th International Geographical Congress 2024 12 Percentage of reasons

    based on OSMCha’s detection rules N=103,238* *:Multi Category Total)
  10. 2024/08/28 35th International Geographical Congress 2024 13 Time series of

    participation of New Mappers in Japan Unit: mappers Total: 2871(63.4%) / 4525
  11. 2024/08/28 35th International Geographical Congress 2024 14 “suspicious” changesets using

    “NewMapper” red=True / green=False Edited area range N=11,528* *: Excludes large range of changesets outside Japan
  12. 2024/08/28 35th International Geographical Congress 2024 16 ht m %

    14 51 70 % typ % 3% 20 D% % D% C% 2% te 01 01 % 01 01 A% Fore example, “mapwithai” case • A case where a check user pointed out that a “large number” of buildings have been added using Rapid and that the base map is out of date, resulting in misalignment. • There are also overlapping buildings and road/building intersection errors.
  13. 2024/08/28 35th International Geographical Congress 2024 18 https://www.openstreetmap.org/changeset/ 139284236 For

    example, “harmful” case It was pointed out that the road data was edited in the wide area and the tag indicating the direction of travel was incorrect.
  14. 2024/08/28 35th International Geographical Congress 2024 20 Findings and Future

    Works • Japan is characterized by a large contribution from New Mapper, it is undeniable that it may contain a certain amount of errors due to non-training. However, not all edits by New Mapper are suspect. • AI mapping by Rapid has also been actively incorporated, but even with these, errors have occurred. However, the spatial editing extent tends to be smaller than other errors. • In addition, there were some differences in the detection tendency of “suspicious editing” online mapping and field surveys. • Detection rules by OSMCha are detailed, so further case studies are important. • After the period under study, Japan experienced a major disaster and crisis mapping was conducted. This trend needs to be analyzed using similar methods. • AI implementation and automation in OSM is still in its infancy, and accumulation of research in the field of geography is expected in terms of mapping with geographical knowledge.
  15. 2024/08/28 35th International Geographical Congress 2024 21 Thank You !

    @tosseto [email protected] https://speakerdeck.com/tosseto https://tossetolab.github.io/