not Tika
  Notably includes earlier Office formats
129 formats spotted by Tika but not DROID-B
  But at least 20 are due to not using the full DROID
Conflicts
  Failed MIME type mapping, e.g. PDF 1.7 (since fixed)
  'Soft' signatures – e.g. PICT matching 3M JPG (gone)
  DROID strictness – 9M GIF, 4M JPG, 1.3M PDF…
Both tools bad at non-HTML/XML text formats
  CSS, scripting languages like JS, CSV, TSV, etc.
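A comparison of this kind can be sketched as a set difference over the formats each tool reports, plus a count of files where both tools give an answer but disagree. This is only an illustrative sketch, not the code behind these figures: the function name compare_identifications, the dict-based input layout, and the three sample files are all assumptions made up for the example.

```python
from collections import Counter


def compare_identifications(tika, droid):
    """Compare per-file MIME types reported by two identification tools.

    Hypothetical input layout: each argument maps a file identifier to the
    MIME type the tool reported, or None when the tool gave no answer.
    """
    # Distinct formats each tool identified at least once.
    tika_formats = {mime for mime in tika.values() if mime}
    droid_formats = {mime for mime in droid.values() if mime}

    # Files where both tools gave an answer and the answers differ.
    conflicts = Counter(
        (tika[f], droid[f])
        for f in tika.keys() & droid.keys()
        if tika[f] and droid[f] and tika[f] != droid[f]
    )

    return {
        "only_tika": tika_formats - droid_formats,
        "only_droid": droid_formats - tika_formats,
        "conflicts": conflicts,
    }


# Made-up sample data, loosely echoing the PICT/JPG and CSS cases above.
tika = {"a": "image/jpeg", "b": "text/css", "c": "application/pdf"}
droid = {"a": "image/x-pict", "b": None, "c": "application/pdf"}
result = compare_identifications(tika, droid)
```

Counting conflicts per (Tika type, DROID type) pair, rather than per file, makes large systematic disagreements such as a soft signature matching millions of files stand out as a single entry.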