Un sistema avanzato per la gestione di immagini biomedicali in
Transcription
Un sistema avanzato per la gestione di immagini biomedicali in
Un sistema avanzato per la gestione di immagini biomedicali in ambiente GRID G. Aloisio SPACI Consortium & University of Salento, Italy, Lecce Outline • Scenario • Related Works • Issues • Medical Imaging Environment – System Architecture – Data Storage – Workflow Management – Metadata Management: GRelC – Case Studies: CMCC & Climate-G – Security – Web Gateway • Conclusions 2 Scenario, issues and needs • Huge amount of medical data produced by several Centres (i.e. Hospitals) • Medical Interconnection Network: data “sharing” among several centres • Data integration, digital libraries, sharing are FP7 keywords • Need to move towards “open”, secure, distributed and service-based environments Patients Challenging issues o Security o Data distribution o Data format heterogeneity o Metadata management o Metadata schema o Transparent access to the system o Scalable approach o …. Infrastructure Clinicians 3 Grid for Biomedical Imaging, 1999 Raw data HPC Pre Processing Acquisition Computational Grid Globus Server 4 Surrounding context MediGrid …and more… 5 Issues (high level) • Medical images storage and processing – TAC, MRI, fMRI, PET, SPECT – DICOM image format – Privacy issues • Very large data sets handling – storage management (PACS) – Metadata management – Security of data • Process complex algorithms with large computing power and memory requirements – parallel processing – workflow Management System (WFMS) PET TAC MRI SPECT fMRI 6 AGIR - gLite based www.aci-agir.org 7 Medicus - Globus based http://dev.globus.org/wiki/Incubator/MEDICUS 8 Medical Imaging Environment Info Data Virtualization DICOM Images Info DICOM Images Computational Providers DICOM Images Cloud Environment Info Grid Environment 9 Main Issues (architectural level) • Full security support (data security/privacy) • Integration effort is needed to achieve high level results • Transparent management of data & metadata • Preservation of data locality (distribution of data) • Distribution of metadata • Access via Web Gateway • Workflow management for medical purposes • high level tools for end users • medical imaging methods composition • Grid support (storage, computation) on a large scale 10 Medical Imaging - Data Virtualization Info Data Virtualization DICOM Images Info DICOM Images - Distributed Data & Metadata - Locality/Autonomy - Scalability - Legacy systems (PACS) - Security - Metadata - DB Encryption - Authorization/policies/roles - Data - Storage Encryption - Network - Communication protection - Interchange Protocols DICOM Images Info 11 Medical Imaging - Analysis Data Virtualization Info DICOM Images Info DICOM Images Computational Providers Cloud Environment Grid Environment Analysis - Distributed - Grid Based - Cloud Based - Complex - Medical Workflow - Secure - Data anonymisation - Data Encryption - Authorization/Authentication 12 System Architecture Web Gateway Orchestrator/Collective Services Security Metadata Service Data Service Comput. Service Grid Middleware MediGrid Fabric Layer (data and metadata) 13 Medical imaging - Storage • Management of Medical Images Secure on site management of DICOM images Encryption of data Anonymisation of data for analysis outside the centre Access control through authorization Storage interfaces between distributed environment and local storage devices Secure and efficient data transport protocols Backup of data …. 14 Biomedical Imaging Processing Issues • Huge amount of data – 1 radiology department: 10TB/year; – 1 CT dataset: 500MB-2GB. • Compute-intensive analysis – Registration noisy; – Difficulty in the segmentation; – Volume Reconstruction. • Need to explore several imaging analysis algorithms, validation methods, visualization pipelines Data Mng CE Grid Scheduler – Grid Workflow Management System ******************************************************************************** UNIVERSITY OF CHICAGO HOSPITALS RADIOLOGY CONSULTATION ******************************************************************************** 342 02/05/96 UNIVERSITY OF CHICAGO HOSPITALS BHIS #: 1234567 INPATIENT 201-23-90 RADIOLOGY CONSULTATION Hematology / Oncology CHANDLER, CAROLYN 342 02/05/96 Mitchell-6NE 49 FEMALE 201-23-90 BHIS #: 1234567 INPATIENT Hematology / Oncology Admitting Diagnosis: NEUTROPENIC FEVER; HYPERBILIRUBEMIA Mitchell-6NE Clinical data: Biliary tube check. Carl M. Gompers, MD Admitting Diagnosis: NEUTROPENIC FEVER; HYPERBILIRUBEMIA Clinical data: Biliary tube check. Change Perc Drainage CarlBiliary M. Gompers, MD Cath Proced -- Change Perc Biliary Drainage Cath Proced COMPARISON: 07/23/95 and 06/27/95 CHANDLER, CAROLYN 49 FEMALE Exam #46 on 01/08/96 -- Exam #46 on 01/08/96 FINDINGS: After the procedure was explained to the patient and informed COMPARISON: 07/23/95 and 06/27/95 & Int -- Exam #47 on 02/05/96 FINDINGS: After the procedure was explained to the patient and informed & Int -- Exam #47 on 02/05/96 FINDINGS: As above. IMPRESSION: FINDINGS: As above. Successful biliary tube change, and findings consistent with interval tumor IMPRESSION: growth. Successful biliary tube change, and findings consistent with interval tumor Simon A. Templar, MD / Richard Nixon, MD (R19) growth. Signed 02/9/96 at 8:48 AM 3 Simon A. Templar, MD / Richard Nixon, MD Signed 02/9/96 at 8:48 AM (R19) 3 Node 15 Medical Imaging - Workflow Info DICOM Images Info DICOM Images Workflow Example Cloud Environment Computational Providers Grid Environment • Denoising • Segmentation • Rendering 16 Metadata on the GRID • Metadata is data about data • Metadata enables search and discovery • Metadata on the GRID – Mainly information about files – Usually stored on DBs • Need simple SOA & Grid interfaces for Metadata access – Advantages Easier to use by clients Robust paradigm Exploiting Grid Security paradigm Addressing Interoperability 17 Why are metadata so important? • Metadata is the key to manage, route and retrieve data properly • PACS systems can be “federated” exploiting metadata • Search and discovery of “clinical case studies” are afforded by metadata system • Metadata information increases the global knowledge about available patient data • From data to metadata and from metadata to knowledge to support Physicians for diagnostic and therapeutic purposes • How to manage “efficiently”, “securely” and transparently” metadata? METADATA SERVICE 18 Metadata Service • Metadata service – Compatibility with distributed systems – Full security support for metadata – authorization (policies, roles, etc.) – authentication (based on grid certificates) – Access/Integration of metadata sources – Distributed vs centralized approach – Suitable Metadata Schema • Grid Access interface – Common interface for RDBMS and XML-DB – Database backend independence – GSI support 19 Metadata Management: Stack 20 GRelC Project (starting date 2001) Grid Relational Catalog (GRelC) is a project which aims at designing and developing a set of efficient, secure and transparent Data Grid Services XML DB DB DB Grid 21 Grid Metadata Handling System: architecture in the small GRelC DAIS 22 International Testbeds sepac00.projects.cscs.ch Linux x86 Lecce (Italy) GRelC Data Access Data Sources (DB) Bejing (China) gandalf.unile.it Linux x86 spacina.na.infn.it Linux IA64 sara.unile.it Mac OS X sigma2.unile.it Linux IA64 gridsurfer.unile.it FreeBSD galileo.hpcc.unical.it Linux IA64 23 Test Performance SQ comparison (Beijing - Lecce) - large 450 400 350 300 250 MEMORY DIME DIME+ZIP STREAM PSQL time (sec) 200 150 100 50 0 10000 50000 tuples (n) 100000 24 CMCC: a fully distributed data grid environment Grid Layer - VOMS Server - CMCC Data Grid Portal - Metadata Server (GRelC DAIS) - Grid Storage - GridFTP - GRelC Storage - CMCC CA - P2P layer Data and Metadata resources - RDBMS (Oracle,MySQL, Postgres) - XML DBs (eXist, XIndice, etc.) Grid SS Grid SS Grid SS Search & Discovery - Two step process 25 A new research effort: Climate-G The main goal of Climate-G is to create a unified environment for climate change, able to concentrate in the same context big amount of data geographically spread among several centres, rich metadata descriptions, efficient data access services, advanced data analysis and visualization tools, etc. exploiting and joining knowledge and skills in the fields of climate change and computational science 26 Climate‐G partners Università del Salento 27 Metadata Distribution and virtualization For each site: Relational DB (index) XML DB (entire schema) Virtualization/Integration layer: GRelC DAIS Virtualization allows to conceal: Data distribution Number of sites, RDBMS and XML back-ends P2P Topology Data Integration aspects technological details … 28 Security What about security? – Authen'ca'on (mutual process) – Authoriza'on – Access control lists – Role Membership Services – Data encryp'on (to ensure secure data communica'on) – Data anonymisa'on (to move data outside the Centers) – A cer'ficate for each actor (user, service, host) – Each ac'on is associated with an actor – Firewall protec'on – … 29 Medical Web Gateway • Main Functionalities o Search & Discovery o Data access & viz o Workflow management o Users and roles mng o …. • Features o Easy to use interfaces o Platform independent o Secured by design o Integrated environment o Data and … o Analysis tools o Visualization tools o Post-processing tools 30 Medical Imaging Environment Info Data Virtualization DICOM Images Info DICOM Images Computational Providers DICOM Images Cloud Environemnt Info Grid Environment 31