{contact:faraggcvip.uofl.edu}
Transcription
{contact:faraggcvip.uofl.edu}
2007 IEEE International Symposium on Signal Processing and Information Technology EXPERIMENTS ON SENSITIVITY OF TEMPLATE MATCHING FOR LUNG NODULE DETECTION IN LOW DOSE CT SCANS Shireen Y Elhabian1, Hossam Abd EL Munim1, Salwa Elshazly', AlyA.Farag1, and Mohamed Aboelghar2 'Computer Vision and Image Processing Laboratory, Univ. of Louisville, Louisville, KY, 40292 2Urology and Nephrology Center Mansoura University, Mansoura, Egypt {contact:faraggcvip.uofl.edu} ABSTRACT Template matching is a common approach for detection of lung nodules from CT scans. Templates may take different shapes, size and intensity distribution. The process of nodule detection is essentially two steps: isolation of candidate nodules, and elimination of false positive nodules. The processes of outlining the detected nodules and their classification (i.e., assigning pathology for each nodule) complete the CAD system for early detection of lung nodules. This paper is concerned with the template design and evaluating the effectiveness of the first step in the nodule detection process. The paper will neither address the problem of reducing false positives nor would it deal with nodule segmentation and classification. Only parametric templates are considered. Modeling the gray scale distribution for the templates is based on the prior knowledge of typical nodules extracted by radiologists. The effectiveness of the template matching is investigated by cross validation with respect to the ground truth and is described by hit rate curves indicating the probability of detection as function of shape, size and orientation, if applicable, of the templates. We used synthetic and sample real CT scan images in our experiments. It is found that template matching is more sensitive to additive noise than image blurring when tests conducted on synthetic data. On the sample CT scans small size circular and hollow-circular templates provided comparable results to human experts. Index Terms- Shape Representation, tion, Level Sets, Energy Minimization. I. INTRODUCTION With improvements in CT imaging in terms of resolution, dose, and scanning approach, it has been possible to think of employing CT scanning in designing and evaluating fully automatic computer-aided diagnosis (CAD) systems especially for the thorax (e.g., Boiselle and White, 2002 [1]). At present, low-dose spiral computed tomography (LDCT) is of prime interest for screening high risk groups for early detection of lung cancer (e.g., [2]). Automatic screening involves detection of nodules then segmentation and classification; i.e., a three-stage process. Our earlier work has 978-1 -4244-1 835-0/07/$25.00 ©2007 IEEE examined the first two stages (e.g., [11 - 13]) and the literature on this subject is becoming quite rich reflecting the importance and urgency of this problem (e.g., [3-12]). Our CAD system detects the nodules in LDCT images in three main steps: 1) segmentation of the raw scanning information to isolate the lung tissues from the rest of the structures in the chest cavity; 2) extraction of the 3D anatomic structures (e.g., blood vessels, bronchioles, alveoli, etc., and possible abnormalities) from the already segmented lung tissues; and 3) identification of the nodules by isolating the true nodules from other extracted structures. The purpose of the first two steps is to considerably reduce the searching space. For the purpose of detection, we used deformable circular, semicircular and spherical templates (e.g., [11]). We have also introduced a generalized deformable template in which nodules are modeled by a 3D deformable prototype, which closely approximates an empirical marginal probability distribution of image intensities in the real nodules of different sizes, and is analytically identified from the empirical distribution (e.g., [12]). For the purpose of segmentation, we introduced an appearance modeling approach for outlining the lung nodules (e.g., [13][14]). Despite the rich literature in image analysis of thoracic imaging, there exists no approach that provides a satisfactory answer/solution to the problem of automatic nodule detection and segmentation, and the process of nodule classification (i.e., assigning pathology to a particular nodule) has a long way to go. Hence, the entire front-end of CAD systems based on LDCT imaging is wide open. We will not attempt any survey in this paper of current approaches; in fact we will go back to the original step, i.e., nodule detection will study the effectiveness of template a common approach for many of the CAD matching which isand studies in the literature. This paper is organized as follows; section II discusses design issues of parametric templates used in nodule detection using the a priori knowledge of the shape and gray level distribution of real nodules. Template matching process is presented in section III. Section VI presents our experimental results on synthetic and real data. Conclusions and future work is presented in section V. 1029 u,,,,,,~~**** - Fig. 3. Parametric templates of various sizes. First column: Circular templates, second column: Hollow circular templates and the rest columns show semi-circular templates f000g - :-- -- - -~~~~~~~~~~~~~~~00 Fig. 1. An ensemble of manually segmented lung nodules taking various sizes and shapes. for different orientations. OA24 R 1o0 50 0 15D 200 250 Fig. 4. The gray level distrib ion for a circular template where a typical nodule histogram with exponential-like form is shown (e.g., [3][11][12]). 0 Fig. 2. 15 io¶ 20 25 DJi lawnc, Ifbro t~ hrnoduid denter mea 30 35 tue com 40 in The radial gray level distribution of the ensemble of nodules in Fig 1. The bars show the variations from the mean value. II. PARAMETRIC TEMPLATE DESIGN s - u t e nodule models used in our study. In [13] we used the Gaussian symmetric model (see also Lee, 2001 [3]) to automatically generate the gray level distribution of the nodules given the radius R and the histogram of typical nodule prototypes which were obtained by expert radiologists. The equations for the template gray level distribution are given by: R(ln(qma,,,) -ln(qmjmi)<(1 qoel usedi study. q( ed thexpau p))an r <oR (2) valuesq3m)atx and quoaic determine the range of the p Nodules may take various topologies and shapes in a < scan. Fig.1 shows an ensemble of nodules that has been manually segmented. Fig. 2 shows the gray level The distribution from the centroid of the nodules calculated at nodules gray levels, and are estimated from the gray level distances. The main characteristic among various histogram of the CT scans as will be shown later (e.g., Fig equi-radial corrlespodn sphrical,cupogis, and dhaesks Thsi hc m ila rity2) rtonwti a si) measure (eg. cReltio) nodules is the woul factbae that the gray level distribution tends to 6). be concentrated around a region with an exponential decay. ndulstr Widelyused1parametric.nodul aretspecified,h fonroaiveshpef theb aradusundes calculated btndlsgaetween, ansall wsindowdfresmbln models inA2DE reScircuIII TEMUPLATE. MATCHING the noduleve given CT noue shtthe tefcgray leveldistribution.Fg3ilutae so enisotropi be concentrated around a region with an exponential decay103 moe6n)h.ein o h mg.I hcreainsoei Le J.- P hrbiy DEMity Fd,1th Gf R0, __ ,,, C 1~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A5 _ 015 0.01 Fig. 5. Left: The generated synthetic image containing ideal 1f100 shapes of nodules with different sizes. Right: the ground truth generated from the synthetic image using gray level thresholding based on the image's histogram. ___ay___ III-A. Synthetic Data A synthetic image was generated with ideal nodule shapes and different sizes (circular, hollow-circular and semi-circular with dits Is We generated semi-circular with different orientations), ground truth using gray level thresholding over its histogram (see Fig. 5). The range of gray levels of the synthetic image was learnt during a training phase using the histogram of real nodules (see Fig. 6). We have applied circular average filter with different sizes ranging from 5 to 20 to obtain blurred versions of the synthetic image as shown in Fig. 7. We have also added white Gaussian noise with different variances ranging from 0.2 to 1 (Fig. 8 shows the first four noisy images). Template matching was then applied using templates of different shapes and sizes. The templates were generated dynamically for each given image. The histogram of the given image was used to extract the range of gray levels assumed by the nodules by fitting two Gaussians using the EM algorithm. The total number of templates tested for each image was 1020 templates. Circular template is defined by its radius ranging from 1 to 40 pixels, hollow-circular template is defined by the distance between the inner and outer circular which ranges from 1 to 40 pixels, while semicircular template is defined by its radius (from 1 to 40 pixels) and its orientation (angle of inclination measured with respect to the x-axis). These values were not empirically chosen; rather they were learnt during the training phase of Our study using real nodules images. 2200 2 _ _ _ N U 025 above a certain level then a nodule candidate is declared. In this paper we used the sum of the square difference between the image and the template as the similarity measure. The location of the nodule may be given by its centroid which may be taken at the location in the image (or volume) giving maximum similarity score. This paper provides template matching results in the 2D case using synthetic data and CT slices. I&D Gayleves 2;0 3m & t i 0 1 60 30 Graylevets Fig. 6. Top: the probability density function ofthe gray levels of CT images containing lung liquid and nodules extracted from the average histogram taken from these images, using the EM algorithm, the nodules gray levels were extracted shown inrthe tto nduter gray wlevels assumed by the nodules was leart. Fig. 7. Circular average filter with different sizes (5, 10, 15,and 20 respectively) was used to blur the synthetic image. 1031 Fig. 8. White Gaussian noise is added with variance 0.2, 0.4, 0.6 and 0.8 respectively. Fig. 10. Images segmented by level-sets approach [15]. to exract th threhol use E Fig. 9. Sample images of CT scan image. These images were segmented to isolate the lung tissues. Further, an expert radiologist outlined the nodules. III-B. Real Data Same experiments were conducted using real nodule images (Fig. 9shows sample images). The images were segmented using level-sets approach [15], (see Fig. 10). The EM algorithm was used to fit a mixture of Gaussians over the image's histogram in order to extract the range of gray levels assumed by the candidate nodule regions, hence the lung liquid regions were removed from the search space. Fig. 11 shows the histogram of the sample images shown in Fig. 9. Fig. 12 shows the result of removing lung liquid regions using the EM algorithm, I MI gray,,,_,,,,,,,,, levl asum by th (0 AI w Fig. It. Images' histograms, where the red line shows the threshold used to extract the gray levels assume by the candidate nodule regions. To check the effectiveness of the template matching results on the sample CT slices, the nodules were outlined independently by an experienced radiologist (Dr. Aboelghar, co-author of this article and provider of the CT data for this project). His labeling is shown in Fig. 13. IV. EXPERIMENTAL RESULTS tV-A. Synthetic data results Experimental results were conducted to investigate the sensitivity of nodule detection using template matching to shape, size and orientation. As shown in Fig. 14, in the ideal case where there is neither bluffing nor noise, the hit rate is decreased when using larger radii for circular 1032 Hiitet h LII~~~ ~ ithtbitmg ~ ~ ~~~~~~~~~~~~~~~~~~ t Hit MO iuofghd he it r" itd ird t lad utHidb*~alrt 4iAte Hit7l atedothebulrred Fig. 12. Lung liquid regions were removed using EM algorithm to reduce the search space of nodule detection. mages Hit rteoIh, Ythetjdil-g e b~e t&e Hi aeo it it it 2t iHt Fig. 14. Top: hit rate resulted from using circular templates. Bottom: hit rate resulted from using hollow-circular tem- plates. template size. Yet noisy images record the lowest hit rate compared to the ideal and blurred case (Fig. 16). IV-B. Real data results Fig. 17 and 18 shows the hit rates for different template shapes. It can be inferred for larger circular and hollowcircular templates, the hit rate decays significantly. However this is not the case for semi-circular templates, yet the Fig. 13. Ground truth labeling of the nodules by a human expert. The regions reported as nodules were circled in red. theidalas,heitraticraseuin lage raiu for te sei-cirulartemplte, owevethibehaiorFsino and hollow decrease is templates; however mainaindcircular fral oienatins.In cse f tethisburrdiage more in noticeable case of blufred images. In case of noisy the ero rietatin mintans hehghethiraewhle images, the hit rate for circular and semi-circular templates is orenttios hve smilr ith case, espct o te low compared to the idealbhavor and blurred however, very othe the hit rate is increasing (with small slope) along circular whiletuth small slope) radius, 3. Grund lbelinof (with he noules y a hman templateFig. decreasing along the hollow-circular template size. As shown in Fig. 15, in zeoorientatinsreordevuteryor lowheorate. v ttos hl te 0-p _ tissure. thi paper presentanAND fiurstdcattemptiat quntfiatono V. CONCLUSION FUTURE RESEARCH This work aimed at examination of the accuracy of the the tepate machn processh fotraenodul difetecin. Onmlyt detection of the nodules template matching approach for spaamesItrica tempaewnerred considergedcincla whiclungthelgray as shown in low dose chest CT scans. Accuracy can be tep ats, whelast size werey determicnedyHoempircarleaisribto measured in terms of the location of the nodules and their ziall grayescale ditheribuin ttion fa ofnsembhle It can also be measured terms ofhe shapes.usiengtheio separability from the anatomical structures. This is a very complicated task since nodules mayvey takelo1033e various shapes, spatial support, oienttionsrecored and may indeed resemble anatomical structures in the lungs SemWi Ciruuar Tnpa 9esupt s ltzs Wth Diffien Radiitad Oridenttioans indegrees CtouirTempties R F uOts w0h Different Radh 095. 0. 0 -00000007~~~~~~~Orat4io Orictatono90 0100 D.N 0@f /I ~ lSi.t -O. 0.0% tion 1 ~~~~~~~~~~~~Orientation360_: _ 0. 00 TotOrientation - U O. = . 0 m a_teRe 00s 00 40 o 0 Swhiten 6 25 s 4 fpartom O.9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~00 0~~~~~9~. X 0.7 \~~~~~~~~~~~~~~~~~~~~~M 9w 0c75 i h Oft.6: -QrieniAtolb o )oreeiatiaes7 d ior isatrdwe M - . 20 10 t TerF1HaieRAious i r Ternpla Rad ges s f m th tsttes Ri e ets vtth( D iln hren Radi i and Oroil ciations tn ipteg(ret ult ithDblferedntoWiidth (to) roesirUlault D Rer ular 5 10 ?E 20 Templapte Radus 26 SO - 25 40 0.55I Xl 0 Fig. 15. Hit rate resulted from using semi-circular templates with different orientations, results from the synthetic images (top), results from the blurred images(bottom). sn So0 n i lrrde7r _ 20 5o eTemplatem tVidth s o0 (rb r2n 25 on t 40 Fig. 17. Hit rates reported using real images using circular templates (top) and hollow-circular templates (bottom). hlcwrcu-crclarmtes behaveds, milarly regardless the template Fig. 16. Hit rate resulted fromusigsemi-circular t lat e size when using the synthetic data, however there is a size l g 0 N >> k ~~~~beyond which horizontal ternplate orientation (0 and 360 withXdifferent orientations for nois images. yetthedegrees) outperform other orientations, while this behavW \4 1t 0 j S/ 1 ~~~ior is altered when using bluffed image where the zero foun thatt lage circula and <~~~~~~~~~~~~~~borientation aS human, significantly expert.a It wasoutperform for0 other orientations. Yet real nodulethathavebeenmanuallysegmented.Ground wtemplate matching was found to be sensitive to additive noise truth was generated using syntheticimagesoftempla recording a very low hit rate when compare to the blurred ofvaiousshap,siean oriaio. Te gcase. The template matching was also performed on four generaedusnganexponntial modelithpaamslices of real CT data in which the nodules were identified by a human expert. It was found that for larger circular and hollow-circular templates, the hit rate decays significantly. Fig. 16. Hit rate resulted from using semi-circular templates However this is not the case for semi-circular templates, with different orientations for noisy images. yet the zero orientation outperform other orientations, while other orientations recorded very low rate. Current efforts are directed at more experimentation on real data collected with real nodules that have been manually segmented. Ground a well specified image acquisition approach. The ground truth was generated using synthetic images, temhtsoes truth will be specified by more than one radiologist and the overall robustness of template matching will be quantified. of various shape, size and orientation. The gray scale was generated using an exponential model with parameters es- We have intentionally ignored the standard subsequent steps of reducing false positives in order to test the effectiveness of timated from the nodule ensemble. The synthetic image the template matching process itself. Of course, reduction of was altered by both blurring and Gaussian noise of varying I lo 5 2. 25 x ,g 3 S Oetoh Semi cu aTTempatt esutthDifirIRaddietAtnsin d I ges 0.9 /18 O Oi7 0. 0, 5 9 90 - .,ntatio h = SW b 3 02 _ Orientao h0 flentlo n-Ig E O2004. -0Qletaticn 270 ierfO tatiorE - _ _ _ __1 [7] H. Arimura, S. Katsuragawa, K. Suzuki, F. Li, J. Shiraishi, S. Sone, and K. Doi, "Computerized scheme for automated detection of lung nodules in low-dose computed tomography images for lung cancer screening," Academic Radiology, vol. 1, no. 6, pp. 617-629, [8] G.-Q. Wei, L. Fan, and J. Qian, "Automatic detection _ofnodules attached to vessels in lung CT by volume projection analysis," Proc. of International Conference on Medical Image Computing and Computer-Assisted Sf-i 4Intervention, MICCAI-02, Tokyo, Japan, September 25-28, 2002, pp. 746-752. [9] K. Okada, D. Comaniciu, and A. Krishnan, "Robust Anisotropic Gaussian Fitting for Volumetric Characterization of Pulmonary Nodules in Multislice CT," IEEE Fig. 18. Hit rates reported using real images for semi-circular templates with different orientations. Trans. on Medical Imaging, vol. 24, no. 3, pp.409-423, March 2005. [10] I. Sluimer, A. Schilham, M. Prokop, and B. van GinVI. ACKNOWLEDGEMENTS neken,"Computer Analysis of Computed Tomography Scans of the Lung: A Survey," IEEE Transactions on Medical Imaging, vol. 25, NO. 4, pp. 385-405, April, This work has been supported by the Kentucky Lung Cancer Program. 2006. X 5 [1] [2] [3] [4] [5] [6] to inplraVpteRadilus25 3° VII. REFERENCES P. M. Boiselle and C. S. White (Eds.), "New techniques in thoracic imaging." M. Dekker, New York, 2002. 2. H. K. Weir, M. J. Thun, B. F. Hankey, L. A. G. Ries, H. L. Howe, P. A. Wingo, A. Jemal, E. Ward, R. N. Anderson, and B. K. Edwards, "Annual report to the nation on the status of cancer, 1975-2000." Journal of the National Cancer Institute, vol. 95, no. 17, pp. 1276-1299o 2003. Y. Lee, T. Hara, H. Fujita, S. Itoh, and T. Ishigaki, "Automated detection of pulmonary nodules in helical CT images based on an improved template-matching technique," IEEE Transactions on Medical Imaging, vol. 20, pp. 595-604, 2001. K. Awai, K. Murao, A. Ozawa, M. Komi, H. Hayakawa, S. Hori, and Y. Nishimura, "Pulmonary nodules at chest CT: Effect of computer-aided diagnosis on radiologists detection performance," Radiology, vol. 230, no. 2, pp. 347-352, 2004. C. I. Fetita, F. Preteux, C. Beigelman-Aubry, and P. Grenier, "3D automated lung nodule segmentation in HRCT," Proc. of International Conference on Medical Image Computing and Computer-Assisted Intervention, MICCAI-03, Montreal, Canada, pp. 626-634, November, 2003. M. N. Gurcan, B. Sahiner, N. Petrick, H.-P. Chan, E. A. Kazerooni, P. N. Cascade, and L. Hadjiiski, "Lung nodule detection on thoracic computed tomography images: Preliminary evaluation of a computer-aided diagnosis system," Medical Physics, vol. 29, no. 11, pp. 2552-2558, 2002. [11] A. A. Farag, A. El-Baz, G.Gimelfarb, R Falk, and S.G. Hushek, "Automatic detection and recognition of lung abnormalities in helical CT images using deformable templates," Proc. ofInternational Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI'04), Saint Malo, France, September 2629, 2004, pp. 856-864). [12] A. A. Farag, A. El-Baz, and G. Gimel'farb, "Quantitative Nodule Detection in Low Dose Chest CT Scans: New Template Modeling and Evaluation for CAD System Design," Proc. of International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI), Palm Springs, California, USA, October 26-29, 2005, pp. 720-728. [13] A. A. Farag, A. El-Baz, G. L. Gimel'farb, R. Falk, M. Abou El-Ghar, T. Eldiasty, S. Elshazly, "Appearance Models for Robust Segmentation of Pulmonary Nodules in 3D LDCT Chest Images," Proc. of International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI'06), Copenhagen, Denmark, October 1-6, 2006, pp. 662-670. [14] A. Farag, A. El-Baz, and G. Gimel'farb, "Precise Segmentation of Multi-modal Images," IEEE Transactions on Image Processing, vol. 15, no. 4, pp. 952-968, April, 2006. [15] Hossam Abd EL Munim and Aly A. Farag, "Curve/Surface Representation and Evolution using Vector Level Sets with Application to the Shape-based Segmentation Problem," IEEE Transactions on Pattern Analysis and Machine Intelligence - Vol. 29, No. 6, pp. 945-958, June 2007. 1035