Big Data
  – In-memory data model
  – Persistence
  – Data store abstraction
• Supports persisting to
  – Column stores
  – Key/value stores
  – Document stores
  – RDBMSs
• Supports use of Hadoop
www.semtech-solutions.co.nz [email protected]
• Apache 2 license
• Written in Java
• Offers a persistence framework
• Designed for big data applications
• Used by Nutch 2.x for web crawl data storage
• Used for
  – Persistence
  – Indexing
  – Analytics
– Abstracted storage
– Data store independence
– Handles object-to-persistence mappings
– Works with various NoSQL solutions
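To make the idea of data store independence concrete, here is a minimal Java sketch. It is not Gora's API: the interface `SketchStore`, the two backends, and `PageRepository` are hypothetical names invented for illustration, and plain in-memory maps stand in for real stores. It only shows the general pattern of application code depending on an abstraction so that the backend can be swapped.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical, simplified data store abstraction (NOT Gora's actual API).
interface SketchStore<K, V> {
    void put(K key, V value);
    V get(K key);
    String backendName();
}

// Backend 1: an unordered map standing in for a key/value store.
class HashBackedStore<K, V> implements SketchStore<K, V> {
    private final Map<K, V> data = new HashMap<>();
    public void put(K key, V value) { data.put(key, value); }
    public V get(K key) { return data.get(key); }
    public String backendName() { return "hash"; }
}

// Backend 2: a sorted map standing in for a store that keeps keys in order.
class SortedBackedStore<K extends Comparable<K>, V> implements SketchStore<K, V> {
    private final TreeMap<K, V> data = new TreeMap<>();
    public void put(K key, V value) { data.put(key, value); }
    public V get(K key) { return data.get(key); }
    public String backendName() { return "sorted"; }
}

// Application code sees only the abstraction, never a concrete backend.
class PageRepository {
    private final SketchStore<String, String> store;
    PageRepository(SketchStore<String, String> store) { this.store = store; }
    void savePage(String url, String content) { store.put(url, content); }
    String loadPage(String url) { return store.get(url); }
}

class StoreAbstractionDemo {
    // Runs the same application logic against whichever backend is supplied.
    static String roundTrip(SketchStore<String, String> store) {
        PageRepository repo = new PageRepository(store);
        repo.savePage("http://example.org/", "<html>hello</html>");
        return repo.loadPage("http://example.org/");
    }

    public static void main(String[] args) {
        SketchStore<String, String> hash = new HashBackedStore<String, String>();
        SketchStore<String, String> sorted = new SortedBackedStore<String, String>();
        System.out.println(hash.backendName() + ": " + roundTrip(hash));
        System.out.println(sorted.backendName() + ": " + roundTrip(sorted));
    }
}
```

Because `PageRepository` is written against the interface, switching backends is a one-line change at construction time. That is the property the slide calls data store independence.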
– Core classes
  • Query
    – Constructed via DataStore
  • PartitionQuery
    – Divides the results of a Query into partitions
    – Runs queries on data nodes
    – Generates Hadoop InputSplits
  • Result
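The relationship between a query, its partitions, and the results can be sketched in plain Java. This is an illustrative model only, not Gora's classes: `RangeQuery`, `SketchRangeStore`, and its `partition` method are hypothetical, a `TreeMap` stands in for the data store, and the splitting rule (equal-sized chunks of keys) is an assumption chosen for simplicity.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Hypothetical key-range query (NOT Gora's Query class).
class RangeQuery {
    final String startKey; // inclusive
    final String endKey;   // exclusive
    RangeQuery(String startKey, String endKey) {
        this.startKey = startKey;
        this.endKey = endKey;
    }
}

// Hypothetical store backed by a sorted in-memory map.
class SketchRangeStore {
    private final TreeMap<String, String> rows = new TreeMap<>();

    void put(String key, String value) { rows.put(key, value); }

    // Queries are constructed via the store, as on the slide.
    RangeQuery newQuery(String startKey, String endKey) {
        return new RangeQuery(startKey, endKey);
    }

    // Executes a query and returns the matching rows in key order (the "result").
    List<Map.Entry<String, String>> execute(RangeQuery q) {
        return new ArrayList<>(rows.subMap(q.startKey, true, q.endKey, false).entrySet());
    }

    // Splits a query into at most n sub-queries over disjoint, contiguous key ranges.
    // Each sub-query could then be run independently, for example one per worker.
    List<RangeQuery> partition(RangeQuery q, int n) {
        List<String> keys = new ArrayList<>(rows.subMap(q.startKey, true, q.endKey, false).keySet());
        List<RangeQuery> parts = new ArrayList<>();
        if (keys.isEmpty() || n <= 0) {
            parts.add(q); // nothing to split: keep the original query
            return parts;
        }
        int chunk = (keys.size() + n - 1) / n; // ceiling division
        for (int i = 0; i < keys.size(); i += chunk) {
            String start = (i == 0) ? q.startKey : keys.get(i);
            int next = i + chunk;
            String end = (next < keys.size()) ? keys.get(next) : q.endKey;
            parts.add(new RangeQuery(start, end));
        }
        return parts;
    }
}

class PartitionDemo {
    public static void main(String[] args) {
        SketchRangeStore store = new SketchRangeStore();
        for (String k : new String[] {"a", "b", "c", "d", "e", "f"}) {
            store.put(k, k.toUpperCase());
        }
        RangeQuery query = store.newQuery("a", "g");
        for (RangeQuery part : store.partition(query, 3)) {
            System.out.println("[" + part.startKey + ", " + part.endKey + ") -> "
                    + store.execute(part).size() + " rows");
        }
    }
}
```

The partitions cover the original key range without overlap, so running them separately and concatenating the results gives the same rows as running the original query. In a Hadoop setting, each such partition would correspond to one input split.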
www.semtech-solutions.co.nz – [email protected]
• We offer IT project consultancy
• We are happy to hear about your problems
• You pay only for the hours you need to solve your problems