CESCA BCN DIC 2009

Motivation

Digital repository:
  - Large digital content
  - Metadata (following an arbitrary relational data model)

Examples
  Medical Images
    - METADATA: Patient data, Study, Diagnosis, etc.
    - LARGE DIGITAL CONTENT: Mammograms
  Historical Archives
    - METADATA: Manuscript reference data, historical context, etc.
    - LARGE DIGITAL CONTENT: Digitized manuscript pages

The DRIVING GOAL is to reduce the cost of:
  - Hosting digital repositories (on GRID infrastructures)
  - Developing applications to exploit them (clients, jobs, …)

Introduction

DRI is a Java-based software platform that:
  - Shields repository providers from the internals of the final storage
  - Shields application developers from low-level APIs and services
  - Provides web-based applications for repository navigation and content management
  - Is inspired by JSR 170 (http://jcp.org/en/jsr/detail?id=170)

[Figure: architecture diagram. Labels: users, applications, DRI API, DRI repository description, repository provider, GRID STORAGE, RELATIONAL DB, metadata, large digital content]
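
The separation described on this slide (applications talk to one API, while metadata and large content live in different back ends) can be illustrated with a minimal Java sketch. This is not the DRI API: every name below (MetadataStore, ContentStore, RepositoryFacade and the in-memory classes) is hypothetical, and the in-memory stores are stand-ins for a relational database and GRID storage so that the example runs on its own.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Optional;

// Demo entry point (hypothetical; not part of DRI).
class RepositorySketchDemo {
    public static void main(String[] args) {
        RepositoryFacade repo =
            new RepositoryFacade(new InMemoryMetadataStore(), new InMemoryContentStore());
        Map<String, String> meta = new HashMap<>();
        meta.put("diagnosis", "pending");
        repo.store("mammogram-1", meta, new byte[] {1, 2, 3});
        System.out.println(repo.metadata("mammogram-1").get().get("diagnosis"));
        System.out.println(repo.content("mammogram-1").get().length);
    }
}

// Assumed abstraction over the metadata back end (e.g. a relational DB).
interface MetadataStore {
    void put(String id, Map<String, String> metadata);
    Optional<Map<String, String>> get(String id);
}

// Assumed abstraction over the large-content back end (e.g. GRID storage).
interface ContentStore {
    void put(String id, byte[] content);
    Optional<byte[]> get(String id);
}

// In-memory stand-in for the relational database.
class InMemoryMetadataStore implements MetadataStore {
    private final Map<String, Map<String, String>> rows = new HashMap<>();
    public void put(String id, Map<String, String> metadata) {
        rows.put(id, new HashMap<>(metadata)); // defensive copy
    }
    public Optional<Map<String, String>> get(String id) {
        return Optional.ofNullable(rows.get(id));
    }
}

// In-memory stand-in for GRID storage.
class InMemoryContentStore implements ContentStore {
    private final Map<String, byte[]> blobs = new HashMap<>();
    public void put(String id, byte[] content) {
        blobs.put(id, content.clone()); // defensive copy
    }
    public Optional<byte[]> get(String id) {
        return Optional.ofNullable(blobs.get(id));
    }
}

// Single entry point for applications; it hides which back end holds what.
class RepositoryFacade {
    private final MetadataStore metadataStore;
    private final ContentStore contentStore;

    RepositoryFacade(MetadataStore metadataStore, ContentStore contentStore) {
        this.metadataStore = metadataStore;
        this.contentStore = contentStore;
    }

    void store(String id, Map<String, String> metadata, byte[] content) {
        metadataStore.put(id, metadata);
        contentStore.put(id, content);
    }

    Optional<Map<String, String>> metadata(String id) {
        return metadataStore.get(id);
    }

    Optional<byte[]> content(String id) {
        return contentStore.get(id);
    }
}
```

Swapping a store implementation leaves application code untouched, which is the kind of decoupling the slide claims for repository providers and application developers.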

Medical Image Repository

Repository definition
  Name:    IMED
  Purpose: To host medical image sets, along with patient info, diagnosis and processing data, and training datasets
  Users:   Clinicians, CAD builders, patients
  Usage:   Storage and management of medical images and patient data (clinicians), CAD building, CAD use (clinicians), data browsing (patients)

[Figure: entity-relationship diagram. Entities: Patient, Study, Series, Image, ImageType, Annotation, GenericSeriesLayout, FeatureSetDefinition, FeatureSetValues, ClassFamily, ClassMember, Classification, Classifier, ImageProcessorSet, ImageProcessor, SeriesLayout, ImageProcessorSetValues. Legend: one to many, many to one, many to many]
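
As an illustration of how part of this data model might look in code, the Java sketch below covers only four of the entities. The slide text does not say which entities each relation connects, so the chain used here (a Patient has many Studies, a Study has many Series, a Series has many Images) is an assumption borrowed from the usual medical-imaging hierarchy, and all fields and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

// Demo entry point (hypothetical; not part of the IMED repository).
class ImedModelDemo {
    public static void main(String[] args) {
        Patient patient = new Patient("P-001");
        Study study = patient.addStudy("ST-001");
        Series series = study.addSeries("SE-001");
        series.addImage("IMG-001");
        series.addImage("IMG-002");
        System.out.println(patient.imageCount());
    }
}

// Leaf entity: one stored image (e.g. a mammogram).
class Image {
    final String id;
    Image(String id) { this.id = id; }
}

// Assumed one-to-many: a Series groups several Images.
class Series {
    final String id;
    final List<Image> images = new ArrayList<>();
    Series(String id) { this.id = id; }

    Image addImage(String imageId) {
        Image image = new Image(imageId);
        images.add(image);
        return image;
    }
}

// Assumed one-to-many: a Study groups several Series.
class Study {
    final String id;
    final List<Series> seriesList = new ArrayList<>();
    Study(String id) { this.id = id; }

    Series addSeries(String seriesId) {
        Series series = new Series(seriesId);
        seriesList.add(series);
        return series;
    }
}

// Assumed one-to-many: a Patient has several Studies.
class Patient {
    final String id;
    final List<Study> studies = new ArrayList<>();
    Patient(String id) { this.id = id; }

    Study addStudy(String studyId) {
        Study study = new Study(studyId);
        studies.add(study);
        return study;
    }

    // Walks the hierarchy and counts every image that belongs to this patient.
    int imageCount() {
        int count = 0;
        for (Study study : studies) {
            for (Series series : study.seriesList) {
                count += series.images.size();
            }
        }
        return count;
    }
}
```

The remaining entities (annotations, feature sets, classifiers, image processors) are left out because their relations cannot be recovered from the slide text.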

Conclusions

Cost reduction
  - To host a new repository: through XML declarations
  - Base set of client applications for repository interaction
  - To create applications: through a simple client API

API-based implementation
  - Well-defined roles
  - Decoupled from technologies

Repositories under development
  - Medical Images (Mammograms)
  - Historical Archive of Trujillo
  - gPhase