Slide 1

Slide 1 text

OCCRP Data Search
Friedrich Lindenberg

Slide 2

Slide 2 text

Why data.occrp.org?
• Support ongoing investigations by making it quick to survey material.
• Make our collective source material searchable.
• Become systematic about persons of interest.
• NOT a solution for all of OCCRP’s data needs. There is also the data vault and other tools.

Slide 3

Slide 3 text

What does data.occrp.org do?
• Makes Word documents, PDF files, spreadsheets, databases and email archives searchable.
• Recognises text in images (OCR).
• Controls who can see which documents.
• Keeps a list of alerts (“saved searches”).
• Checks against a list of people we’re interested in.
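
As an illustration of the last point, checking text against a list of people of interest can be sketched roughly as below. This is a toy example and not how data.occrp.org is implemented: the function names, the sliding-window approach and the 0.85 similarity threshold are all assumptions made for the sketch.

```python
from difflib import SequenceMatcher


def normalize(name: str) -> str:
    """Lowercase and collapse whitespace so comparisons are consistent."""
    return " ".join(name.lower().split())


def watchlist_hits(text: str, watchlist: list[str], threshold: float = 0.85) -> list[str]:
    """Return the watchlist names that approximately appear in the text.

    Hypothetical helper: slides a window of as many words as the name has
    over the text and compares each window to the name. The threshold is
    an arbitrary assumption, not a value taken from the real tool.
    """
    words = normalize(text).split()
    hits = []
    for person in watchlist:
        target = normalize(person)
        size = len(target.split())
        for start in range(len(words) - size + 1):
            window = " ".join(words[start:start + size])
            if SequenceMatcher(None, window, target).ratio() >= threshold:
                hits.append(person)
                break  # one hit per person is enough
    return hits


document = "Payment approved by Ilham Aliyew on behalf of the firm."
people = ["Ilham Aliyev", "Vladimir Putin"]
matches = watchlist_hits(document, people)
```

The misspelt "Aliyew" still matches here because the comparison is approximate rather than exact.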

Slide 4

Slide 4 text

Basic searches
What do you want in, what do you want out? Tell the tool.

Slide 5

Slide 5 text

Search operators
• Exact matches: "Vladimir Putin"
• Boolean operators (require with +, exclude with -): +aliyev -ilham
• Logical links: aliyev AND (ilham OR mehriban)
• Proximity: "Bank America"~2
• Spelling errors: aliyew~
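
The operators on this slide can be combined mechanically. The sketch below builds query strings with small helper functions. The helper names are hypothetical and exist only for this example; the output is plain text to paste into the search box, and no data.occrp.org API is assumed.

```python
def exact(phrase: str) -> str:
    """Exact match: wrap the phrase in straight double quotes."""
    return f'"{phrase}"'


def require(term: str) -> str:
    """The term must appear in the result."""
    return f"+{term}"


def exclude(term: str) -> str:
    """The term must not appear in the result."""
    return f"-{term}"


def any_of(*terms: str) -> str:
    """Logical OR, grouped in parentheses."""
    return "(" + " OR ".join(terms) + ")"


def all_of(*parts: str) -> str:
    """Logical AND between terms or groups."""
    return " AND ".join(parts)


def near(phrase: str, distance: int) -> str:
    """Proximity: the words may be up to `distance` words apart."""
    return f'"{phrase}"~{distance}'


def fuzzy(term: str) -> str:
    """Tolerate spelling errors in a single term."""
    return f"{term}~"


# The examples from the slide, rebuilt with the helpers.
queries = [
    exact("Vladimir Putin"),
    f'{require("aliyev")} {exclude("ilham")}',
    all_of("aliyev", any_of("ilham", "mehriban")),
    near("Bank America", 2),
    fuzzy("aliyew"),
]
for query in queries:
    print(query)
```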

Slide 6

Slide 6 text

Using facets

Slide 7

Slide 7 text

More demos:
• Using alerts (and getting rid of them)
• Cross-referencing documents and persons of interest

Slide 8

Slide 8 text

Automatic research!

Slide 9

Slide 9 text

Getting data in!
• Leaks, source docs!
• Gazettes, Company DBs, Property, Procurement, Blacklists, Sanctions, Court docs, …
• Persons of interest!
• We can keep a secret :)

Slide 10

Slide 10 text

Get in touch: tech@occrp.org