Recognition of Pulmonary Nodules in Thoracic CT Scans Using 3-D Deformable Object Models of Different Classes

Takizawa, Hotaka; Yamamoto, Shinji; Shiina, Tsuyoshi

doi:10.3390/a3020125

Open AccessArticle

Recognition of Pulmonary Nodules in Thoracic CT Scans Using 3-D Deformable Object Models of Different Classes

by

Hotaka Takizawa

^1,*

,

Shinji Yamamoto

² and

Tsuyoshi Shiina

³

¹

University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Japan

²

Chukyo University, 101 Kaitsu, Toyota, Japan

³

Kyoto University, 53 Sakyo, Kyoto, Japan

^*

Author to whom correspondence should be addressed.

Algorithms 2010, 3(2), 125-144; https://doi.org/10.3390/a3020125

Submission received: 1 February 2010 / Revised: 16 February 2010 / Accepted: 22 March 2010 / Published: 31 March 2010

(This article belongs to the Special Issue Machine Learning for Medical Imaging)

Download

Browse Figures

Versions Notes

Abstract

:

The present paper describes a novel recognition method of pulmonary nodules (i.e., cancer candidates) in thoracic computed tomography scans by use of three-dimensional spherical and cylindrical models that represent nodules and blood vessels, respectively. The anatomical validity of these object models and their fidelity to computed tomography scans are evaluated based on the Bayes theorem. The nodule recognition is employed by the maximum a posteriori estimation. The proposed method is applied to 26 actual computed tomography scans, and experimental results are shown.

Keywords:

recognition of pulmonary nodules; thoracic computed tomography scans; three-dimensional deformable object models; Bayes theorem; maximum a posteriori estimation

1. Introduction

Lung cancer is the most common cause of death among all cancers worldwide [1]. To cope with this serious situation, mass screening for lung cancer has been widely performed by simple X-ray films with sputum cytological tests. However, it is known that the accuracy of this method is not sufficient for the early detection of lung cancer [2,3]. Therefore, a lung cancer screening system by computed tomography (CT) for mass screening is proposed [4]. This system improves the accuracy of the cancer detection considerably [5], but has one problem that the number of the images is increased to over dozens of slice sections per patient from 1 X-ray film. It is difficult for a radiologist (i.e., a medical doctor who specializes in reading radiographs) to interpret all the images in a limited time. In order to make the system more practical, it is necessary to build a computer-aided diagnosis (CAD) system that automatically detects abnormal regions suspected to comprise pulmonary nodules that are the major radiographic indicators of lung cancers, and informs a radiologist of their positions in CT scans as a second opinion.

Extensive research has been dedicated to automated detection of pulmonary nodules in thoracic CT scans [6]. Morphological [7] image filters [8,9,10] are conventional approaches. In the work [11], a multiple-thresholding technique was used to detect nodules that had peaked intensity distribution. Hessian-based image filters [12,13] individually enhanced blob- and ridge-shaped regions that corresponded to nodules and blood vessels, respectively. These methods were often used for initial detection of nodules, and were intentionally adjusted to minimize the number of misdetection. Consequently, they yielded many false candidates called false positives (FP) that corresponded to normal pulmonary structures such as blood vessels.

In order to reduce false positives, feature-based discrimination methods between nodules and false positives have been also developed [14,15,16,17,18,19]. Kawata et al. reported a classification method [20] of nodules based on differences in shape indexes, which were computed from two principal curvatures of intensity profiles in images, between nodules and false positives. Suzuki et al. proposed a method [21] that suppressed false positives by using voxel values in regions of interest as input for a novel artificial neural network [22].

Model-based methods are the promising approaches as well. Several works with nodule models were reported. Lee et al. [23] proposed a template-matching method using nodular models with two-dimensional (2-D) Gaussian distribution as reference images to CT scans. The method had an advantage of being able to use the geometrical features of nodules as well as gray level features. Ozekes et al. [24] designed a 3-D prismatic nodule template that was composed of several layered matrices. Farag et al. [25] developed 3-D deformable nodule models such as spherical models of various radii. These methods can make use of the characteristics of the 3-D relation between a suspicious region in a slice section and the other regions in the adjacent slice sections.

Not only nodules but also normal pulmonary structures were modeled in a study reported by McCulloch et al. [26]. A ribbon model, for example, was used for representing a blood vessel region in a slice section. Based on the Bayes theorem, the probabilities of the models were calculated from image features such as step-edges extracted by the Canny edge detector. The most likely model was determined by the maximum a posteriori (MAP) estimation. In this study, by using the models, the anatomical knowledge on the normal structures of human organs can be introduced into recognition of diseases, but only 2-D models and image features were used.

In the present paper, we propose a novel recognition method of pulmonary nodules in thoracic CT scans by use of 3-D deformable spherical and cylindrical models that represent nodules and blood vessels, respectively. The anatomical validity of these object models are evaluated by the predefined probability distributions of their parameters. The fidelity of the object models to CT scans are also evaluated based on the differences in intensity distributions between the CT scans and templates produced from the object models. Through these evaluations, the posteriori probabilities of hypotheses that the object models appear in the CT scans are calculated by use of the Bayes theorem. The nodule recognition is employed by the MAP estimation.

One of the major contribution of the present paper is to enable the computers to make use of the 3-D anatomical knowledge of both lesions and normal structures in organs by use of the 3-D object models. The 3-D knowledge is more informative than the 2-D knowledge, and thereby the ability to use the 3-D knowledge is an advantage of the proposed method. The other is to formulate the 3-D model-based Bayes inference so that it can be applied to 3-D CT volume data. The Bayes inference uses both the anatomical knowledge and the CT data, and thus, even if they comprise uncertainty, it can achieve more accurate recognition by complementing them mutually.

2. Overview of the Proposed Method

Figure 1 illustrates the overview. First, initial nodule candidates are detected from individual slice sections of a thoracic CT scan by our previous detection methods [8,27,28], and then square areas of a certain size are settled on the slice sections so as to comprise the nodule candidates. White squares on a slice section in the leftmost box in the figure show examples of such square areas. Additional isometric square areas are settled at the same X–Y positions on the adjacent slice sections. These square areas are formed into volumes of interest (VOIs) in the CT scans.

Figure 1. The process of the proposed method.

Sphere and connected-cylinder models, that represent nodules and blood vessels, respectively, are generated as shown in the topmost box. These object models can be deformed by changing their parameters such as the radius of the sphere model. Template VOIs are produced by projecting the object models onto them.

The template VOIs are matched to each VOI in the CT scans, and the posteriori probabilities of hypotheses that the CT VOI arises from the object models are calculated using the Bayes theorem. The most likely object model is searched for by the MAP estimation with an optimization method. The CT VOI is determined to be abnormal if the most likely object model is the nodule model, and vice versa.

3. 3-D Deformable Object Models

Our preliminary study [29] indicated that the 64% of false positives originate from blood vessels in lungs. Hence, discrimination between nodules and blood vessels has been the most important issue. To solve the issue, we focus on analyzing differences in 3-D shape between them from the anatomical point of view. Nodules appear as sphere-like masses in lungs, especially when they are in the early stages of diseases. Therefore, they tend to be observed as isolated circle-like regions on slice sections. In contrast, blood vessels make 3-D tree structures in lungs and are observed as thin regions that are connected with each other. To analyze their shapes, we represent nodules and blood vessels as 3-D geometric figures that can be deformed by changing their parameters. The anatomical validity of the deformation is quantitatively evaluated by the probability distribution of the parameters.

3.1. Nodule

A nodule is represented as a sphere model as shown in Figure 2(a). The central position, radius and X-ray attenuation (i.e., transparency of X-ray) of the nodule model are denoted by

x^{N} = (x^{N}, y^{N}, z^{N})

,

r^{N}

and

α_{m}^{N}

, respectively. The X-ray attenuation of the area surrounding the nodule model (such an area is often called an air area) is denoted by

α_{a}^{N}

.

Figure 2. 3-D deformable object models that represent nodules, curved blood vessels (CBV) and bifurcated blood vessels (BBV), respectively.

In medical diagnosis, the size of a nodule is the most important indicator to determine the stage of disease. Therefore, the radius

r^{N}

is more essential to describe a nodule than the other parameters. In this paper, the former (i.e.,

r^{N}

) and latter are called essential and auxiliary parameters, respectively. A nodule model is denoted by

o^{N} (r^{N}; x^{N}, α_{m}^{N}, α_{a}^{N})

, where the essential parameter is placed on the left-hand side of the semicolon. The probability of appearance of the nodule model is defined by

\begin{matrix} p (o^{N} (r^{N}; x^{N}, α_{m}^{N}, α_{a}^{N})) = P^{N} \cdot g (r^{N}; μ_{r}^{N}, σ_{r}^{N}) \end{matrix}

(1)

where

P^{N}

is the probability of the set of all possible nodules, and

g (x; μ_{x}, σ_{x})

is a Gaussian distribution with a mean

μ_{x}

and standard deviation

σ_{x}

.

3.2. Curved blood vessel

A curved section in a blood vessel tree is represented as a two-connected-cylinder model as shown in Figure 2(b). The first cylinder corresponds to the parent vessel (indicated by “1” in the figure) that is nearer to a heart in respect of its blood flow, and the second its child vessel (“2”). The radius of the i-th cylinder (

i = 1, 2

) is denoted by

r_{i}^{B_{c}}

, and its zenith and azimuth angles are denoted by

θ_{i}^{B_{c}}

and

ϕ_{i}^{B_{c}}

, respectively. The connection position of the two cylinders is denoted by

x^{B_{c}} = (x^{B_{c}}, y^{B_{c}}, z^{B_{c}})

. The X-ray attenuation of the model and that of its surrounding area are denoted by

α_{m}^{B_{c}}

and

α_{a}^{B_{c}}

, respectively. From these parameters, two additional parameters are calculated: a difference in section area between the cylinders,

δ^{B_{c}}

, and an angle between the cylinders,

ψ^{B_{c}}

. Because the parameters

r_{1}^{B}

,

δ^{B_{c}}

and

ψ^{B_{c}}

are more essential to describe a curved blood vessel, the model is denoted by

o^{B_{c}} (r_{1}^{B}, δ^{B_{c}},

ψ^{B_{c}}; x^{B_{c}}, r_{2}^{B_{c}},

θ_{1}^{B_{c}}, ϕ_{1}^{B_{c}}, θ_{2}^{B_{c}},

ϕ_{2}^{B_{c}}, α_{m}^{B_{c}}, α_{a}^{B_{c}})

.

The probability of appearance of the curved blood vessel model is defined by

\begin{matrix} p (o^{B_{c}} (r_{1}^{B}, δ^{B_{c}}, ψ^{B_{c}}; \dots)) \\ = & P^{B_{c}} \cdot g (r_{1}^{B}; μ_{r 1}^{B} (x^{B_{c}}), σ_{r 1}^{B_{c}}) \cdot \\ g (δ^{B_{c}}; μ_{δ}^{B_{c}}, σ_{δ}^{B_{c}}) \cdot g (ψ^{B_{c}}; μ_{ψ}^{B_{c}}, σ_{ψ}^{B_{c}}) \end{matrix}

(2)

where

P^{B_{c}}

is the probability of the set of all possible curved sections in blood vessel trees.

In Equation (2), the mean radius of blood vessels

μ_{r 1}^{B} (x^{B_{c}})

is defined as being the function of the position

x^{B_{c}}

so as to represent a relation between the radius and position in a lung. Generally, blood vessels are thick in the central area of a lung, and they become thinner as they approach the peripheral area. The relation is formulated by

μ_{r 1}^{B} (x^{B_{c}}) = κ_{1}^{B} e x p (- κ_{2}^{B} \cdot | x^{B_{c}} - c^{L} |)

(3)

where

c^{L}

is the central position of the boundary box that circumscribes a lung,

| x - c |

is the distance between

x

and

c

. The coefficient values

κ_{1}^{B}

and

κ_{2}^{B}

are constant.

In our previous study [30], we investigated statistical relations between the radii and positions of blood vessels in lung regions from population data, and obtained the 3-D distribution of the blood vessel radii. From several functions, the exponential function represented in Equation (3) is selected as the best-fit one to the distribution, and the coefficient values

κ_{1}^{B}

and

κ_{2}^{B}

are determined from the fitting result.

3.3. Bifurcated blood vessel

A bifurcation in a blood vessel tree is represented as a three-connected-cylinder model as shown in Figure 2(c).

The probability of appearance of a bifurcated blood vessel model

o^{B_{b}} (r_{1}^{B_{b}}, δ_{23}^{B_{b}},

δ_{123}^{B_{b}}, ψ_{12}^{B_{b}}, ψ_{13}^{B_{b}},

ψ_{23}^{B_{b}}; \dots)

is defined in the same manner as a curved blood vessel as follows:

\begin{matrix} p (o^{B_{b}} (r_{1}^{B_{b}}, δ_{23}^{B_{b}}, δ_{123}^{B_{b}}, ψ_{12}^{B_{b}}, ψ_{13}^{B_{b}}, ψ_{23}^{B_{b}}; \dots)) \\ = & P^{B_{b}} \cdot g (r_{1}^{B_{b}}; μ_{r 1}^{B} (x^{B_{b}}), σ_{r 1}^{B_{b}}) \cdot \\ g (δ_{23}^{B_{b}}; μ_{δ 23}^{B_{b}}, σ_{δ 23}^{B_{b}}) \cdot g (δ_{123}^{B_{b}}; μ_{δ 123}^{B_{b}}, σ_{δ 123}^{B_{b}}) \cdot \\ g (ψ_{12}^{B_{b}}; μ_{ψ 12}^{B_{b}}, σ_{ψ 12}^{B_{b}}) \cdot g (ψ_{13}^{B_{b}}; μ_{ψ 13}^{B_{b}}, σ_{ψ 13}^{B_{b}}) \cdot \\ g (ψ_{23}^{B_{b}}; μ_{ψ 23}^{B_{b}}, σ_{ψ 23}^{B_{b}}) \end{matrix}

(4)

where

P^{B_{b}}

is the probability of the set of all possible bifurcations in blood vessel trees,

δ_{23}^{B_{b}}

is the difference in section area between the two child cylinders,

δ_{123}^{B_{b}}

is the difference in section area between the parent and child cylinders, that is

δ_{123}^{B_{b}}

=

π ({(r_{1}^{B_{b}})}^{2} - {(r_{2}^{B_{b}})}^{2} - {(r_{3}^{B_{b}})}^{2})

, and

ψ_{i j}^{B_{b}}

is the angle between the i-th and j-th cylinder (

i, j = 1, 2, 3

,

i \neq j

).

3.4. Appearance probabilities of object model sets

The probabilities of the model sets satisfy the following equation:

\sum_{τ \in {N, B_{c}, B_{b}}} P^{τ} = 1

(5)

where τ represents the class of the object model.

4. Modification of Probability Distributions of 3-D Object Models

The object models have the different essential parameters in number and type. For example, the nodule model has only one essential parameter

r^{N}

, whereas the bifurcated blood vessel model has six other essential parameters

r_{1}^{B_{b}}

,

δ_{23}^{B_{b}}

,

δ_{123}^{B_{b}}

,

ψ_{12}^{B_{b}}

,

ψ_{13}^{B_{b}}

and

ψ_{23}^{B_{b}}

. The differences in number cause a problem that generally, the probabilities of the object models that have more essential parameters are relatively underestimated.

Let us consider an example case where all the standard deviations are 1 and all the set appearance probabilities

P^{τ}

are

1 / 3

. The radius of a nodule model

r^{N}

is supposed to be

μ + 3

and all the essential parameters of a bifurcated blood vessel model are supposed to be μ’s. Their probabilities are calculated as follows:

\begin{matrix} p (o^{N_{}}) = 1 / 3 \cdot g {(μ + 3; μ, 1)}^{} \approx 0.00148 \end{matrix}

(6)

\begin{matrix} p (o^{B_{b}}) = 1 / 3 \cdot (g (μ; μ, 1))^{6} \approx 0.00134 \end{matrix}

(7)

Although the bifurcated blood vessel model has much more likely parameters, its probability is smaller than that of the nodule model.

Here, let us give a more generalized expression to the model probabilities as follows:

p (o^{τ} (ω^{τ})) = P^{τ} \cdot p (ω^{τ})

(8)

where

o^{τ} (ω^{τ})

represents an object model that has an essential parameter vector:

\begin{matrix} ω^{τ} & = & (ω_{1}^{τ} \in Ω_{1}^{τ}, ω_{2}^{τ} \in Ω_{2}^{τ}, \dots \\ ω_{d}^{τ} \in Ω_{d}^{τ}, \dots, ω_{D^{τ}}^{τ} \in Ω_{D^{τ}}^{τ}) \in Ω^{τ} \end{matrix}

(9)

The notation

Ω^{τ}

represents the parameter space and

D^{τ}

its dimension. For example,

r_{1}^{B_{c}}

,

δ^{B_{c}}

and

ψ^{B_{c}}

in Equation (2) correspond to

ω_{1}^{B_{c}}

,

ω_{2}^{B_{c}}

and

ω_{3}^{B_{c}}

, respectively, and the dimension

D^{B_{c}}

is 3. In Equation (8), the auxiliary parameters are omitted for simplicity. The differences in the dimension

D^{τ}

between the model classes τ cause underestimation in

p (ω^{τ})

.

One solution to correct the underestimation is to use the geometric average of the parameter probability:

\sqrt[D^{τ}]{p (ω^{τ})}

, that is adopted in, for example, recognition of language [31] and speech [32]. Because the dimension

D^{τ}

is normalized, the underestimation is no longer caused. However, the geometrically averaged probability causes another problem that generally, the integral of its distribution over the parameter space is not one:

\int_{ω \in Ω^{τ}} \sqrt[D^{τ}]{p (ω)} d ω \neq 1

. We desire a probability distribution that does not cause underestimation and that integrates to one. In this paper, we realize the desired probability distribution on a parameter space that is inhomogeneously divided so that the desired distribution may approximate to the geometrically averaged distribution.

First, each

Ω_{d}^{τ}

is divided into K isometric intervals

Δ Ω_{d}^{τ} (k)

(

k = 1, 2, . . ., K

) as shown in Figure 3. The interval size

| Δ Ω_{d}^{τ} (k) |

is equivalent to

\frac{| Ω_{d}^{τ} |}{K}

. By combining

Δ Ω_{d}^{τ} (k)

, the following subspaces are generated:

\begin{matrix} Δ Ω^{τ} (l) = Δ Ω_{1}^{τ} (k_{1}) \times Δ Ω_{2}^{τ} (k_{2}) \times \dots \times Δ Ω_{D^{τ}}^{τ} (k_{D^{τ}}) \end{matrix}

(10)

where

l = 1, 2, \dots, K^{D^{τ}}

. The operator

A \times B

yields a product space of A and B. If K is enough large, then the probability distribution

p (ω)

can be regarded as being constant in

Δ Ω^{τ} (l)

. Let

p^{τ} (l)

denote its constant value in

Δ Ω^{τ} (l)

. The probability of

Δ Ω^{τ} (l)

is calculated as

\begin{matrix} P (Δ Ω^{τ} (l)) = \int_{Δ Ω^{τ} (l)} p (ω) d ω \approx p^{τ} (l) \cdot | Δ Ω^{τ} (l) | \end{matrix}

(11)

Figure 3. Division of a parameter space.

Then, two types of further division are made for each subspace

Δ Ω^{τ} (l)

. One is the division into

M^{τ} (l)

isometric subsubspaces

Δ Δ Ω_{M}^{τ} (l, m)

(

m = 1, 2, \dots, M^{τ} (l)

) , and the other is the division into N isometric subsubspaces

Δ Δ Ω_{N}^{τ} (l, n)

(

n = 1, 2, \dots, N

). The subsubspace sizes

| Δ Δ Ω_{M}^{τ} (l, m) |

and

| Δ Δ Ω_{N}^{τ} (l, n) |

are equivalent to

\frac{| Δ Ω^{τ} (l) |}{M^{τ} (l)}

and

\frac{| Δ Ω^{τ} (l) |}{N}

, respectively. The division number N is constant, whereas

M^{τ} (l)

varies with τ and l. Therefore, the parameter space composed of

Δ Δ Ω_{M}^{τ} (l, m)

is inhomogeneous. On the inhomogeneously divided parameter space, the desired probability distribution is defined as follows:

\begin{matrix} P (Δ Δ Ω_{M}^{τ} (l, m)) = \frac{P (Δ Ω^{τ} (l))}{M^{τ} (l)} \end{matrix}

\begin{matrix} \approx \frac{p^{τ} (l) \cdot | Δ Ω^{τ} (l) |}{M^{τ} (l)} = \frac{p^{τ} (l) \cdot | Ω^{τ} |}{M^{τ} (l) \cdot K^{D^{τ}}} \end{matrix}

(12)

On the other hand, on the parameter space composed of

Δ Δ Ω_{N}^{τ} (l, n)

, the geometrically averaged probability distribution is defined as follows:

\begin{matrix} Q (Δ Δ Ω_{N}^{τ} (l, n)) = \frac{\sqrt[D^{τ}]{P (Δ Ω^{τ} (l))}}{N} \\ \approx \frac{1}{N} \cdot \sqrt[D^{τ}]{p^{τ} (l) \cdot | Δ Ω^{τ} (l) |} \\ = \frac{1}{N} \cdot \sqrt[D^{τ}]{p^{τ} (l) \cdot \frac{| Ω^{τ} |}{K^{D^{τ}}}} = \frac{\sqrt[D^{τ}]{p^{τ} (l) \cdot | Ω^{τ} |}}{N \cdot K} \end{matrix}

(13)

Because the dimension

D^{τ}

is normalized in

\sqrt[D^{τ}]{p^{τ} (l)}

and

\sqrt[D^{τ}]{| Ω^{τ} |}

, the underestimation is not caused in

Q (Δ Δ Ω_{N}^{τ} (l, n))

.

By setting M as follows:

M^{τ} (l) = ⌊N {(\frac{p^{τ} (l) \cdot | Ω^{τ} |}{K^{D^{τ}}})}^{1 - \frac{1}{D^{τ}}}⌋

(14)

with an enough large number N,

P (Δ Δ Ω_{M}^{τ} (l, m))

approximates to

Q (Δ Δ Ω_{N}^{τ} (l, n))

. In Equation (14),

⌊ x ⌋

is the nearest integer value of x. Consequently, we obtain the probability distribution that suppresses the underestimation and integrates to approximately one.

Let

O^{τ} (Δ Δ Ω_{M}^{τ} (l, m))

denote a set of the object models as follows:

\begin{matrix} O^{τ} (Δ Δ Ω_{M}^{τ} (l, m)) = \{o^{τ} (ω^{τ}) | ω^{τ} \in Δ Δ Ω_{M}^{τ} (l, m)\} \end{matrix}

(15)

The following probability distribution of the object model set:

P (O^{τ} (Δ Δ Ω_{M}^{τ} (l, m))) = P^{τ} \cdot P (Δ Δ Ω_{M}^{τ} (l, m))

(16)

is used instead of

p (o^{τ} (ω^{τ}))

as a priori probability distribution in the Bayes formula.

5. Observation Model

The Bayes formula also uses a likelihood function that evaluates the fidelity of an object model to a CT VOI and is defined as the conditional probability of the CT VOI given the object model. To evaluate the fidelity, a template VOI is produced from the object model by simulating the CT imaging process where nodules and blood vessels are observed as regions with particular shapes on CT slice sections. The conditional probability is formulated by use of similarity between the CT VOI and the template VOI.

5.1. Production of templates from 3-D object models

Figure 4 illustrates an object model

o^{τ} (ω^{τ})

(a nodule model as an example) in 3-D subspace that corresponds to a template VOI of the same size of a CT VOI. Let

v_{T} (x, y, z; o^{τ} (ω^{τ}))

denote a voxel value at

x, y, z

in the template VOI. The value is calculated as

\begin{matrix} v_{T} (x, y, z; o^{τ} (ω^{τ})) = α_{m}^{τ} \cdot ς_{m}^{τ} + α_{a}^{τ} \cdot (ς_{v} - ς_{m}^{τ}) \end{matrix}

(17)

where

α_{m}^{τ}

and

α_{a}^{τ}

are the X-ray attenuation of the object model and that of its surrounding area, respectively. The notations

ς_{v}

and

ς_{m}^{τ}

are the volume of the whole voxel and that of a part where the object model intersects with the voxel, respectively.

Figure 4. Production of template VOI.

5.2. Likelihood of VOIs to 3-D object models

The fidelity of the object model to a CT VOI is evaluated by the following correlation coefficient between the CT VOI and the template VOI:

\begin{matrix} γ (v_{C}, v_{T}) \\ = \frac{\sum_{x, y, z} (v_{C} (x, y, z) - {\bar{v}}_{C}) (v_{T} (x, y, z) - {\bar{v}}_{T})}{\sqrt{\sum_{x, y, z} (v_{C} (x, y, z) - {\bar{v}}_{C})^{2}} \sqrt{\sum_{x, y, z} (v_{T} (x, y, z) - {\bar{v}}_{T})^{2}}} \end{matrix}

(18)

where

v_{C} (x, y, z)

is the voxel value at

x, y, z

in the CT VOI. The notations

{\bar{v}}_{C}

and

{\bar{v}}_{T}

are the mean voxel values of the CT VOI and template VOI, respectively.

The correlation coefficient

γ (v_{C}, v_{T})

takes a value between 1 and −1. The higher the fidelity is, the larger the correlation coefficient is. The likelihood is defined as

p (v_{C} | o^{τ} (ω^{τ})) = \frac{γ (v_{C}, v_{T}) + 1}{2}

(19)

6. Recognition of Nodules Based on the MAP Estimation

In the MAP estimation, the posteriori probabilities of hypotheses established beforehand are calculated by the Bayes theorem, and the hypothesis with the maximum posteriori probability is adopted as the most likely one.

Given a CT VOI

v_{C}

, the posteriori probability of the hypothesis that an object model set

O^{τ} (Δ Δ Ω_{M}^{τ} (l, m))

appears in the VOI is defined by

\begin{matrix} P (O^{τ} (Δ Δ Ω_{M}^{τ} (l, m)) | v_{C}) \\ = \int_{Δ Δ Ω_{M}^{τ} (l, m)} p (o^{τ} (ω) | v_{C}) d ω \\ = \int_{Δ Δ Ω_{M}^{τ} (l, m)} β \cdot p (v_{C} | o^{τ} (ω)) p (o^{τ} (ω)) d ω \end{matrix}

(20)

where

β = {[p (v_{C})]}^{- 1}

. In Equation (20), the Bayes formula:

p (o^{τ} (ω) | v_{C}) = β \cdot p (v_{C} | o^{τ} (ω)) p (o^{τ} (ω))

(21)

is used. Since the size of

Δ Δ Ω_{M}^{τ} (l, m)

is small, the likelihood function

p (v_{C} | o^{τ} (ω))

can be regarded as being constant in

Δ Δ Ω_{M}^{τ} (l, m)

. Thus,

\begin{matrix} the last term in Equation (20) \\ \approx β \cdot p (v_{C} | o^{τ} ({\overset{ˇ}{ω}}^{τ})) \int_{Δ Δ Ω_{M}^{τ} (l, n)} p (o^{τ} (ω)) d ω \\ = β \cdot p (v_{C} | o^{τ} ({\overset{ˇ}{ω}}^{τ})) P (O^{τ} (Δ Δ Ω_{M}^{τ} (l, m))) \end{matrix}

(22)

where

{\overset{ˇ}{ω}}^{τ}

is a certain essential parameter vector in

Δ Δ Ω_{M}^{τ} (l, m)

. The likelihood

p (v_{C} | o^{τ} ({\overset{ˇ}{ω}}^{τ}))

and the priori probability

P (O^{τ} (Δ Δ Ω_{M}^{τ} (l, m)))

are obtained from Equation (19) and Equation (16), respectively.

For each class

τ = {N, B_{c}, B_{b}}

, the following indices that maximize the posteriori probability:

\begin{matrix} (l^{τ *}, m^{τ *}) = arg max_{l, m} P (O^{τ} (Δ Δ Ω_{M}^{τ} (l, m)) | v_{C}) \end{matrix}

(23)

are searched for by the Powell method [33]. From the following ratio between the posteriori probabilities:

\begin{matrix} ρ_{1} (v_{C}) = \frac{P (O^{N} (Δ Δ Ω_{M}^{N} (l^{N *}, m^{N *})) | v_{C})}{max_{τ \neq N} P (O^{τ} (Δ Δ Ω_{M}^{τ} (l^{τ *}, m^{τ *})) | v_{C})} \end{matrix}

(24)

the CT VOI

v_{C}

is determined to be abnormal if

ρ_{1} (v_{C}) \geq T_{ρ}

and to be normal if

ρ_{1} (v_{C}) < T_{ρ}

with a certain threshold

T_{ρ}

.

7. Experimental Results

In this experiment, 26 thoracic CT scans are used with 30 actual pulmonary nodules. They are acquired by the Asterion, TOSHIBA Medical Systems Corporation. The tube current, tube voltage, slice thickness and reconstruction interval are 30 mA, 120 kV, 8 mm and 8 mm, respectively. The CT scans contain from 31 to 44 slice cross sections, each of which has 512 × 512 pixels.

From the CT scans, lung regions are extracted by a threshold-based technique [4], and then initial nodule candidates are detected from the lung regions by our methods [8,27,28]. The number of nodule candidates per scan is 93.8. They are composed of 28 actual nodules (two false negatives occurs) and 92.8 false positives per scan.

The proposed method is applied to the nodule candidates and, as a final result, Table 1 is obtained that shows the relations between the true positive ratio (TPR) and false positive number per scan (#FP) at the representative threshold values

T_{ρ}

. The relations are depicted by the fROC curve [34] shown in Figure 5. The optimal threshold

T_{ρ}

is experimentally determined to be 0.954 so as to detect all the true nodules in the candidates and minimize the number of false positives. Table 1 indicates that, by making

T_{ρ}

less than 0.995, the TPR and #FP can be stably kept more than 82% and less than 22, respectively.

Three cases of the experimental results are examined in detail below.

Table 1. The relations between

T_{ρ}

, TPR and #FP.

**Table 1.** The relations between $T_{ρ}$ , TPR and #FP.
$T_{ρ}$	TPR %	#FP
$0.954$	$100.0$	$21.2$
$0.959$	$92.9$	$19.7$
$0.995$	$82.1$	$12.4$
$1.01$	$71.4$	$10.9$
$1.06$	$60.7$	$4.5$
$1.10$	$50.0$	$2.8$

Figure 5. The fROC curve of the proposed method.

7.1. Case 1

Figure 6 shows a slice cross section of a sample CT scan. The arrow indicates a nodule identified by a radiologist. The nodule is detected by our previous methods [8,27,28] as a nodule candidate. Figure 7 (a–c) shows a VOI of the nodule (Figure 7(b) shows the nodule). Figure 7(d–f) is template produced from the most likely nodule model that is depicted in Figure 8(a). They correspond to Figure 7(a–c), respectively. Figure 7(g–i) is produced from the most likely curved blood vessel model depicted in Figure 8(b), and Figure 7(j–l) is produced from the most likely bifurcated blood vessel model depicted in Figure 8(c). They correspond to Figure 7(a–c), respectively, in the same manner.

Figure 6. A slice cross section of a sample thoracic CT scan. The arrow indicates a nodule identified by a radiologist.

The posteriori probabilities of the most likely object models

o^{N}

,

o^{B_{c}}

and

o^{B_{b}}

are 0.124, 0.087 and 0.100, respectively (the common constant values are omitted), and the ratio of the posteriori probabilities

ρ_{1}

is 1.24. Because the ratio is larger than the threshold

T_{ρ}

, the nodule candidate is correctly determined to be a nodule.

The nodule (Figure 7(a–c)) is faithfully reconstructed by the templates (Figure 7(d–f)). Although the nodule is shifted downward against the VOI (the nodule is not observed in the upper slice section shown in Figure 7(a)), the center of the nodule model is moved adequately. The nodule size is also estimated exactly. The diameters of the nodule and the nodule model are approximately 5.0 mm and 5.3 mm, respectively.

The effectiveness of Equation (3) can be seen in the templates of the curved and bifurcated blood vessel models shown in Figure 7(g–i) and (j–l), respectively. Because the nodule candidate is in the peripheral area of the lung region (see Figure 6 again), the mean radius of a blood vessel model

μ_{r 1}^{B} (x^{B_{c}})

becomes rather small, 0.89 mm, at this position. The small mean radius produces thin regions on the templates and the thin regions make the correlation coefficients low. Therefore, the probabilities of the blood vessel models become smaller than that of the nodule model.

Figure 7. The first row represents the VOI comprising the nodule shown in Figure 6. The second, third and fourth rows represent the templates that are produced from the most likely nodule, curved blood vessel and bifurcated blood vessel models, respectively.

Figure 8. The most likely nodule, curved blood vessel and bifurcated blood vessel models for the nodule shown in Figure 6.

7.2. Case 2

Figure 9 shows another CT scan with a nodule. Figure 10 shows its VOI and templates, and Figure 11 the most likely object models. The posteriori probabilities are 0.121, 0.100 and 0.101, and the ratio is 1.20, that is larger than the threshold. Although the nodule is small (its diameter is approximately 3.8 mm), it is correctly determined to be a nodule.

Figure 9. The arrow indicates another nodule.

Figure 10. A VOI of the nodule shown in Figure 9 and its templates.

Figure 11. The most likely object models for the nodule shown in Figure 9.

7.3. Case 3

Figure 12 shows another CT scan with a false positive (a bifurcation in a blood vessel tree). Figure 13 shows its VOI and templates, and Figure 14 the most likely object models. The posteriori probabilities are 0.0960, 0.111 and 0.126, and the ratio is 0.762, that is smaller than the threshold. The candidate is correctly determined to be a blood vessel.

The candidate region observed in Figure 13(b) seems to be a nodule that has an irregular shape. It is difficult to recognize the candidate region only from the single slice section. However, by considering the relations between the candidate region and the other regions in the adjacent slice sections, it becomes to be easy to determine that the regions arise from a blood vessel bifurcation.

Figure 12. A bifurcation in a blood vessel tree.

Figure 13. A VOI of the nodule candidate shown in Figure 12 and its templates.

Figure 14. The most likely object models for the candidate shown in Figure 12.

8. Discussion

Many detection methods based on template-matching techniques use only nodule templates [23,24,25]. Our method, on the other hand, uses not only the nodule templates but also blood vessel templates, that enable the computers to make the most of anatomical knowledge, especially, for blood vessel structures in lung regions. If the blood vessel models are not used in the present method and the nodule recognition is performed by

ρ_{2} (v_{C}) = P (O^{N} (Δ Δ Ω_{N}^{N} (l, n; ω^{N *})) | v_{C})

(25)

instead of Equation (24), the false positive number increases to 60.3 per case. It implies that the blood vessel models play an important role for nodule recognition.

The geometric averages of probabilities are also important. If the following probability ratio that consists of the ordinary joint probabilities is used:

\begin{matrix} ρ_{3} (v_{C}) = \frac{p (o^{N} (ω^{N}) | v_{C})}{max_{τ \neq N} p (o^{τ} (ω^{τ}) | v_{C})} \end{matrix}

(26)

the false positive number increases to 34.8 per case. The ordinary joint probabilities suppress all the priori probabilities of bifurcated blood vessel models as mentioned in Section 4, and therefore most bifurcated blood vessel models are ignored in the maximum operation in Equation (26) and the anatomical knowledge on bifurcated blood vessels is not made use of at all. It makes the recognition accuracy degraded.

The present method needs approximately 40 seconds for recognition of one candidate with a computer that has a 3 GHz CPU. We have already proposed an efficient template-matching method [35], but it uses simple object models that possess less parameters. A more efficient technique are needed for high-dimensional object models used in the present paper.

Our previous methods [8,27,28] are applied to CT scans for initial detection of nodule candidates as mentioned in Section 1 The method [8] is a Morphological [7] image filter that is designed to respond only to blob regions that are major shapes of pulmonary nodules on CT slice sections. The method [27] discriminates pulmonary nodules from false positives based on analysis of image features such as intensities, sizes and shapes. The method [28] recognizes nodules using a principal component analysis based clustering followed by a subspace method [36]. Although these methods use only 2-D image features and therefore yield many false positives, they can complete the recognition in a shorter time. From the practical point of view, we use these initial detection methods before applying the method proposed in the present paper.

9. Conclusions

The present paper described a novel recognition method of pulmonary nodules in thoracic CT scans by use of 3-D deformable nodule and blood vessel models. The anatomical validity of these object models and their fidelity to CT scans were evaluated in the framework of the Bayes theorem. The most likely object model that maximized the posteriori probability was obtained for each VOI comprising a nodule candidate. The VOI was determined to be abnormal if the most likely object model was a nodule model, and vice versa. The proposed method was applied to 26 actual CT scans. The 77.2% of false positives were reduced successfully.

References

Weir, H.K. Annual report to the nation on the status of cancer, 1975–2000. J. Natl. Cancer Inst. 2003, 95, 1276–1299. [Google Scholar] [CrossRef] [PubMed]
Tanaka, T.; Yuta, K.; Kobayashi, Y. A study of false-negative case in mass-screening of lung cancer. Jay. J. Thor. Med. 1984, 43, 832–838. [Google Scholar]
Oda, J.; Akita, S.; Shimada, K. A study of false-negative case in comparative reading of mass-screening of lung cancer. Lung Cancer 1989, 29, 271–278. [Google Scholar]
Yamamoto, S.; Tanaka, I.; Senda, M.; Tateno, Y.; Iinuma, T.; Matsumoto, T.; Matsumoto, M. Image processing for computer-aided diagnosis of lung cancer by CT(LSCT). Syst. Comp. Japan 1994, 25, 67–80. [Google Scholar] [CrossRef]
Henschke, C.I.; McCauley, D.I.; Yankelevitz, D.F.; Naidich, D.P.; McGuinness, G.; Miettinen, O.S.; Libby, D.M.; Pasmantier, M.W.; Koizumi, J.; Altorki, N.K.; Smith, J.P. Early lung cancer action project: Overall design and findings from baseline screening. Lancet 1999, 354, 99–105. [Google Scholar] [CrossRef]
van Ginneken, B. Computer-aided diagnosis in thoracic computed tomography. Imaging Decisions MRI 2009, 12, 11–22. [Google Scholar] [CrossRef]
Haralick, R.M.; Sternberg, S.R.; Zhuang, X. Imaging analysis using mathematical morphology. IEEE Trans. Pattern Anal. Machine Intell. 1987, 9, 532–550. [Google Scholar] [CrossRef]
Yamamoto, S.; Matsumoto, M.; Tateno, Y.; Iinuma, T.; Matshmoto, T. Quoit filter: A new filter based on mathematical morphology to extract the isolated shadow, and its application to automatic detection of lung cancer in x-ray CT. In Proceedings of the 13th International Conference on Pattern Recognition II, Vienna, Austria, August 25–29, 1996; pp. 3–7.
Okumura, T.; Miwa, T.; Kako, J.; Yamamoto, S.; Matsumoto, M.; Tateno, Y.; Iinuma, T.; Matshmoto, T. Variable N-Quoit filter applied for automatic detection of lung cancer by X-ray CT. In Proceedings of Computer Assisted Radiology and Surgery(CAR’98), Tokyo, Japan, June 24–27, 1998; pp. 242–247.
Kostis, W.; Reeves, A.; Yankelevitz, D.; Henschke, C. Three-dimensional segmentation and growth-rate estimation of small pulmonary nodules in helical CT images. IEEE Trans. Med. Imaging 2003, 22, 1259–1274. [Google Scholar] [CrossRef] [PubMed]
Giger, M.L.; Bae, K.T.; MacMahon, H. Computerized detection of pulmonary nodules in CT images. Invest. Radiol. 1994, 29, 459–465. [Google Scholar] [CrossRef] [PubMed]
Sato, Y.; Nakajima, S.; Shiraga, N.; Atsumi, H.; Yoshida, S.; Koller, T.; Gerig, G.; Kiknis, R. Three-dimensional multi-scale line filter for segmentation and visualization of curvilinear structures in medical images. Med. Image Anal. 1998, 2, 143–168. [Google Scholar] [CrossRef]
Li, Q.; Doi, K. New selective nodule enhancement filter and its application for significant improvement of nodule detection on computed tomography. Proc. SPIE 2004, 5370. [Google Scholar] [CrossRef]
Kanazawa, K.; Niki, N.; Nishitani, H.; Satoh, H.; Ohmatsu, H.; Moriyama, N. Computer assisted diagnosis of lung cancer using helical X-ray CT. In Proceedings of IEEE Workshop on Biomedical Image Analysis, Seattle, Washington, DC, USA, June 24–25, 1994; pp. 261–267.
McNitt-Gray, M.F.; Wyckoff, N.; Hart, E.M.; Sayre, J.W.; Goldin, J.G.; Aberle, D.R. Computer-Aided Techniques to Characterize Solitary Pulmonary Nodules Imaged on CT. In Computer-Aided Diagnosis in Medical Imaging; Elsevier: Maryland Heights, MO, USA, 1999; pp. 101–106. [Google Scholar]
Armato, S.G., III; Giger, M.L.; Moran, C.J.; Doi, K.; MacMahon, H. Computerized detection of lung nodules on CT scans. RadioGraphics 1999, 19, 1303–1311. [Google Scholar] [CrossRef] [PubMed]
Arimura, H.; Katsuragawa, S.; Suzuki, K.; Li, F.; Shiraishi, J.; Sone, S.; Doi, K. Computerized scheme for automated detection of lung nodules in low-dose CT images for lung cancer screening. Acad. Radiol. 2004, 11, 617–629. [Google Scholar] [CrossRef] [PubMed]
Kung, J.W.; Matsumoto, S.; Hasegawa, I.; Nguyen, B.; Toto, L.C.; Kundel, H.; Hatabu, H. Mixture distribution analysis of a computer assisted diagnostic method for the evaluation of pulmonary nodules on computed tomography scan. Acad. Radiol. 2004, 11, 281–285. [Google Scholar] [CrossRef]
Matsumotoa, S.; Kundelb, H.L.; Geeb, J.C.; Gefterb, W.B.; Hatabu, H. Pulmonary nodule detection in CT images with quantized convergence index filter. Med. Image Anal. 2006, 10, 343–352. [Google Scholar] [CrossRef] [PubMed]
Kawata, Y.; Niki, N.; Ohmatsu, H.; Kakinuma, R.; Eguchi, K.; Kaneko, M.; Moriyama, N. Quantitative surface characterization of pulmonary nodules based on thin-section CT images. IEEE Trans. Nuclear Sci. 1998, 45, 2132–2138. [Google Scholar] [CrossRef]
Suzuki, K.; Armato, S.G.; Li, F.; Sone, S.; Doi, K. Massive training artificial neural network (MTANN) for reduction of false positives in computerized detection of lung nodules in low-dose computed tomography. Med. Phys. 2003, 30, 1602–1617. [Google Scholar] [CrossRef] [PubMed]
Suzuki, K.; Horiba, I.; Sugie, N. Neural Edge Enhancer for Supervised Edge Enhancement from Noisy Images. IEEE Trans. Pattern Anal. Mach. Intell. 2003, 25, 1582–1596. [Google Scholar] [CrossRef]
Lee, Y.; Hara, T.; Fujita, H.; Itoh, S.; Ishigaki, T. Automated detection of pulmonary nodules in helical CT images based on an improved template-matching technique. IEEE Trans. Med. Imaging 2001, 20, 595–604. [Google Scholar] [PubMed]
Ozekes, S.; Saman, O.; Ucan, O.N. Nodule detection in a lung region that’s segmented with using genetic cellular neural networks and 3D template matching with fuzzy rule based thresholding. Korean J. Radiol. 2008, 9, 1–9. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Farag, A.A.; El-Baz, A.; Gimel’farb, G.; Falk, R. Detection and recognition of lung abnormalities using deformable templates. In Proceedings of 17th International Conference on Pattern Recognition, Cambridge, UK, August 23-26, 2004; Volume 3, pp. 738–741.
McCulloch, C.C.; Kaucic, R.A.; Mendonca, P.; Walter, D.J.; Avila, R.S. Model-based detection of lung nodules in computed tomography exams. Acad. Radiol. 2004, 11, 258–266. [Google Scholar] [CrossRef]
Ezoe, T.; Takizawa, H.; Yamamoto, S.; Matsumoto, T.; Tateno, Y.; Iinuma, T.; Matsumoto, M. Detection method of ground glass opacities from chest X-ray CT images. In Proceedings of Society of Photo-Optical Instrumentation Engineers, Medical Imaging 2002, San Diego, CA, USA, February 24–26, 2002; pp. 1672–1680.
Fukano, G.; Nakamura, Y.; Takizawa, H.; Mizuno, S.; Yamamoto, S.; Doi, K.; Katsuragawa, S.; Matsumoto, T.; Tateno, Y.; Iinuma, T. Eigen image recognition of pulmonary nodules from thoracic CT images by use of subspace method. IEICE Trans. Inf. Syst. 2005, E88-D-II, 1273–1283. [Google Scholar] [CrossRef]
Fukano, G.; Takizawa, H.; Shigemoto, K.; Yamamoto, S.; Matsumoto, T.; Tateno, Y.; Iinuma, T.; Matsumoto, M. Recognition method of lung nodules using blood vessel extraction techniques and 3D object models. In Proceedings of Society of Photo-Optical Instrumentation Engineers, Medical Imaging 2003, Bellingham, WA, USA, July 2003; pp. 190–198.
Takizawa, H.; Yamamoto, S.; Shiina, T. Accuracy improvement of pulmonary nodule detection based on spatial statistical analysis of thoracic CT scans. IEICE Trans. Inf. Syst. 2007, E90-D, 1168–1174. [Google Scholar] [CrossRef]
Li, H. A probabilistic disambiguation method based on psycholinguistic. In Proceedings of the 4th Workshop on Very Large Corpora, Copenhagen, Denmark, August 4, 1996; pp. 141–154.
Li, G.; Huang, T. An improved training algorithm in HMM-based speech recognition. In Proceedings of the 4th International Conference on Spoken Language Processing, Philadelphia, PA, USA, October 3–6, 1996; Volume 2, pp. 1057–1060.
Press, W.H.; Flannery, B.P.; Teukolsky, S.A.; Vetterling, W.T. Numerical Recipes in C; Cambridge University Press: Cambridge, UK, 1988. [Google Scholar]
Fawcett, T. An introduction to ROC analysis. Pattern Rec. Lett. 2006, 27, 861–874. [Google Scholar] [CrossRef]
Shigemoto, K.; Takizawa, H.; Yamamoto, S.; Matsumoto, T.; Tateno, Y.; Iinuma, T.; Matsumoto, M. An efficient recognition of lung nodules from X-ray CT images using 3D object models. In Proceedings of the 16th International Congress and Exhibition, Computer Assisted Radiology and Surgery (CARS2002), Paris, France, June 26–29, 2002; pp. 717–722.
Oja, E. Subspace Methods of Pattern Recognition; Research Studies Press: Letchworth, Herfordshire, UK, 1983. [Google Scholar]

© 2010 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).

Share and Cite

MDPI and ACS Style

Takizawa, H.; Yamamoto, S.; Shiina, T. Recognition of Pulmonary Nodules in Thoracic CT Scans Using 3-D Deformable Object Models of Different Classes. Algorithms 2010, 3, 125-144. https://doi.org/10.3390/a3020125

AMA Style

Takizawa H, Yamamoto S, Shiina T. Recognition of Pulmonary Nodules in Thoracic CT Scans Using 3-D Deformable Object Models of Different Classes. Algorithms. 2010; 3(2):125-144. https://doi.org/10.3390/a3020125

Chicago/Turabian Style

Takizawa, Hotaka, Shinji Yamamoto, and Tsuyoshi Shiina. 2010. "Recognition of Pulmonary Nodules in Thoracic CT Scans Using 3-D Deformable Object Models of Different Classes" Algorithms 3, no. 2: 125-144. https://doi.org/10.3390/a3020125

Article Menu

Recognition of Pulmonary Nodules in Thoracic CT Scans Using 3-D Deformable Object Models of Different Classes

Abstract

1. Introduction

2. Overview of the Proposed Method

3. 3-D Deformable Object Models

3.1. Nodule

3.2. Curved blood vessel

3.3. Bifurcated blood vessel

3.4. Appearance probabilities of object model sets

4. Modification of Probability Distributions of 3-D Object Models

5. Observation Model

5.1. Production of templates from 3-D object models

5.2. Likelihood of VOIs to 3-D object models

6. Recognition of Nodules Based on the MAP Estimation

7. Experimental Results

7.1. Case 1

7.2. Case 2

7.3. Case 3

8. Discussion

9. Conclusions

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI