Ideal case: extension-inflation test
Transcription
Ideal case: extension-inflation test
Simulating Human Aorta Material Behavior Using a GPU Explicit Finite Element Solver Vukasin Strbac†, David M. Pierce‡, Jos Vander Sloten†, Nele Famaey† †Biomechanics Section, Mechanical Engineering, KULeuven, Leuven, BE ‡Mechanical Engineering, Biomedical Engineering, Mathematics, Interdisciplinary Mechanics Lab University of Connecticut, Storrs, CT, US Vukasin Strbac GTC2016 Introduction: general biomech. motivation Accelerating FE analysis provides new clinical opportunities: pre-operative (e.g. faster custom stent design) intra-operative stress monitoring post-operative damage monitoring/fatigue estimation at lower cost Ever-advancing capabilities of modern hardware, e.g. GPGPUs, offer opportunities to accelerate established algorithms heterogeneous composition, aorta Vukasin Strbac tissue behavior GTC2016 angioplasty stenting 14.04.16 2/21 Introduction: core facts Explicit FE is pleasingly parallel (for the most part) Explicit FE is sensitive to material and geometric parameters Complex material model is necessary for accurate results GPUs are sensitive to floating point precision used What can we expect? How does anisotropy affect GPU explicit FE? How do hexahedral element formulations affect GPU explicit FE? Particularly in terms of Gaussian integration schemes How does that affect our research? Vukasin Strbac GTC2016 14.04.16 3/21 Introduction: GPU-based FE solver 𝑴 {𝒖} + 𝑐𝑑 [𝑴]{𝒖} + {𝑭 𝒖 } = [𝑹] Nonlinear, explicit, large strain, central differences Assign Boundary Conditions Trilinear hexahedral elements, unstructured grid Templated single/double precision, textures, output, etc.. per element Boundary conditions: kinematic, constant force, pressure Materials – following slides (linear, nonlinear) Pre-processing Custom input file structure for geometry, material and BCs Post-processing Validated against Abaqus (Dassault Systèmes) and - FEAP(University of California, Berkeley) Assemble global internal force vector Forward timemarching step Check energy balance Binary .vtu files + Paraview Real-time rendering - per node Compute stress Integrate stress n Co nv? y End Vukasin Strbac GTC2016 14.04.16 4/21 Element technology: Biofidelic materials • H Compute stress Integrate stress Linear elastic model (Hookean) 𝜎𝑖𝑗 = 𝑓 𝜖𝑖𝑗 = λ𝛿𝑖𝑗 𝜖𝑖𝑗 + 2𝜇𝜖𝑖𝑗 = Cε • Nonlinear elastic model, isotropic (neo-Hookean) 𝜕Ψ ) 𝜕𝑭 𝜎 = 𝑓( NH • Nonlinear elastic, anisotropic (fiber-reinforced arterial tissue model [Gasser et al., 2006]) GHO Anisotropic constituent [Weisbecker et al., 2012] Vukasin Strbac GTC2016 14.04.16 5/21 Element technology: Gaussian integration ζ UI ξ µ ζ FI ξ Under-integration -Fast -Inaccurate -Hourglassing -No volumetric locking -No shear locking Compute stress Integrate stress Arithmetic expense Memory expense 1x 1x (Not appropriate for anisotropic materials with low mesh density) Full integration (FI) -Slow -Very accurate -Volumetric locking -Shear locking ~8x ~3x Selective reduced (SR) -Very slow -Very accurate -No volumetric locking -Shear locking ~9x ~4x µ ζ SR ξ µ Vukasin Strbac GTC2016 14.04.16 6/21 Ideal case: extension-inflation test Extension 5% + systolic pressure Reference solutions FEAP & ABAQUS We implement the same materials in all solvers We solve using 3 different generations: Fermi, Kepler and Maxwell (no optimization) GHO material (+neo-Hooke for ref.) Scaling Convergence criteria based on reference solutions RMS < 0.0005mm deltaRMS < 0.0001mm Vukasin Strbac GTC2016 14.04.16 7/21 Ideal case: extension-inflation test Under-integration Full integration Selective-reduced integration Vukasin Strbac GTC2016 14.04.16 8/21 Ideal case: extension-inflation test FERMI (C2075) Vukasin Strbac GTC2016 14.04.16 9/21 Ideal case: extension-inflation test KEPLER (K20c) Vukasin Strbac GTC2016 14.04.16 10/21 Ideal case: extension-inflation test MAXWELL (GTX980) Vukasin Strbac GTC2016 14.04.16 11/21 Ideal case: extension-inflation test Anisotropy cost (GHO/NH) Integration cost (SR/UI) Vukasin Strbac GTC2016 14.04.16 12/21 Ideal case: conclusions Speed-ups are considerable Difficult to say exactly why one GPU is faster in a specific scenario No architecture-specific considerations are employed, speedup is free Useful for Parameter-fitting and geometry identification Sensitivity analyses …anything made possible by large numbers of FE simulations Not a clinically accurate scenario Vukasin Strbac GTC2016 14.04.16 13/21 Near incompressibility and floating point precision MPa Single precision Double precision Vukasin Strbac GTC2016 14.04.16 14/21 FI UI SR Double Single Vukasin Strbac GTC2016 14.04.16 15/21 Clinically relevant test case: AAA inflation p1 p2 p3 p4 p5 Patient-specific FE meshes of abdominal aortic aneurysms [Tarjuelo-Gutierrez et al., 2014] Vukasin Strbac GTC2016 14.04.16 16/21 Clinically relevant test case: AAA inflation thrombus The ‘silent killer’ Peak Wall Stress (PWS) estimate needed Thrombus: Separation Different material Layer specific material properties aorta Vukasin Strbac GTC2016 14.04.16 17/21 Presenter Type of presentation 14.04.16 18/21 Clinically relevant test case: AAA inflation p1 p2 p3 p4 p5 FEAP[h] 21.12 22.79 21.01 21.52 21.86 CUDA[h] 2.93 1.22 2.75 3.03 1.31 factor x7.2 x18.7 x7.6 x7.1 x16.8 Vukasin Strbac GTC2016 14.04.16 19/21 Poisson = 0.4995 Vukasin Strbac GTC2016 14.04.16 20/21 Conclusion We maintain significant speedup even using state-of-the-art materials, highorder integration and double precision on GPUs, with no compromise whatsoever on accuracy. Even for less than ideal meshes. Single precision becomes ineffective quickly, and depends on Poisson ratio. Double precision is necessary. Practical opportunities, enabling technology: FE sensitivity analysis Inverse FE simulations Indications of clinical use Generally: Memory-bound algorithm Lots of random reads and atomic writes due to unstructured grid For details on implementation/optimization see: S4497, Strbac GTC2014 Vukasin Strbac GTC2016 14.04.16 21/21 Thank you for your attention. Questions? Vukasin Strbac GTC2016 14.04.16 22/21
Similar documents
SDC
Radiation Effects Essentials Evaluation of GPU Radiation Sensitivity - Experimental Setup - Parallel Algorithms Error Rates Hardening Solution Efficiency Codes Optimizations Effects on HPC ...
More information