A quick overview of the data platform published by the Organized Crime and Corruption Reporting Project. It provides an easy-to-use tool to background people, search leaked data and share information between reporters.
investigations: companies, people, contracts, emails, documents… How does it work? • We import leaked data from confidential sources and the open web. • To add context, we also regularly scrape over 200 online sources.
politics, crime. • Mentions in historic leaks like Wikileaks Cables, ICIJ, HackingTeam, Kazaword, … • Many official documents from offshore jurisdictions, Eastern Europe, and Africa. Quick way to check:
7z, Access, SQLite, DBF, ODS, ODF, CSV, images, TIFF, video and audio metadata, XML, plain text, etc. We import and preview a wide range of document types, and do image text recognition and entity extraction for:
granted access when you work on a major OCCRP cooperation. Access Control Datasets can be shared publicly, with project teams or individual users. Reporters can also upload and share documents.
a re-usable open source package. We support technologists to set up a copy in-house, or on your own servers. Contributions, translations and ideas: https://github.com/alephdata