industrial- strength Natural Language Processing • focused on production use Current stats 17m+ total downloads 16k+ stars on GitHub 400+ contributors 80+ extension packages
• bootstrapped through consulting for the first 6 months • funded through software sales since 2017 • remote team, centered in Berlin Current stats 8 team members 100% independent & profitable
many more • spaCy v3.0: Transformer-based pipelines, custom models using any library, new training workflow • Prodigy v1.10: Dependencies & relation annotation, audio & video annotation & lots of new features • Prodigy Teams: Manage large annotation projects in your cloud
3 4 5 Imagineer. Forecast. Outsource. Wire. Ship. Pay someone else to gather your data. Think carefully about your accuracy requirements, and then ask for 10k rows.
to our internal database, so we can connect it to our analytics. We need to extract: buyer (official company name) and stock ticker acquired company with stock ticker sale price and currency #2