A Multi-Layer Perceptron Network for Perfusion Parameter Estimation in DCE-MRI Studies of the Healthy Kidney

Klepaczko, Artur; Strzelecki, Michał; Kociołek, Marcin; Eikefjord, Eli; Lundervold, Arvid

doi:10.3390/app10165525

by Artur Klepaczko ¹, Michał Strzelecki ¹,*, Marcin Kociołek ¹, Eli Eikefjord ² and Arvid Lundervold ²,³,⁴

¹ Institute of Electronics, Lodz University of Technology, 90-924 Lodz, Poland

² Department of Health and Functioning, Western Norway University of Applied Sciences, 5063 Bergen, Norway

³ Department of Biomedicine, University of Bergen, 5009 Bergen, Norway

⁴ Mohn Medical Imaging and Visualization Centre, Department of Radiology, Haukeland University Hospital, 5021 Bergen, Norway

* Author to whom correspondence should be addressed.

Appl. Sci. 2020, 10(16), 5525; https://doi.org/10.3390/app10165525

Submission received: 28 June 2020 / Revised: 1 August 2020 / Accepted: 5 August 2020 / Published: 10 August 2020

(This article belongs to the Special Issue Machine Learning for Biomedical Application)

Abstract:

Background: Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is an imaging technique which helps in visualizing and quantifying perfusion—one of the most important indicators of an organ’s state. This paper focuses on perfusion and filtration in the kidney, whose performance directly influences versatile functions of the body. In clinical practice, kidney function is assessed by measuring glomerular filtration rate (GFR). Estimating GFR based on DCE-MRI data requires the application of an organ-specific pharmacokinetic (PK) model. However, determination of the model parameters, and thus the characterization of GFR, is sensitive to determination of the arterial input function (AIF) and the initial choice of parameter values. Methods: This paper proposes a multi-layer perceptron network for PK model parameter determination, in order to overcome the limitations of the traditional model’s optimization techniques based on non-linear least-squares curve-fitting. As a reference method, we applied the trust-region reflective algorithm to numerically optimize the model. The effectiveness of the proposed approach was tested for 20 data sets, collected for 10 healthy volunteers whose image-derived GFR scores were compared with ground-truth blood test values. Results: The achieved mean difference between the image-derived and ground-truth GFR values was 2.35 mL/min/1.73 m², which is comparable to the result obtained for the reference estimation method (−5.80 mL/min/1.73 m²). Conclusions: Neural networks are a feasible alternative to the least-squares curve-fitting algorithm, ensuring agreement with ground-truth measurements at a comparable level. The advantages of using a neural network are twofold. Firstly, it can estimate a GFR value without the need to determine the AIF for each individual patient. Secondly, a reliable estimate can be obtained, without the need to manually set up either the initial parameter values or the constraints thereof.

Keywords:

dynamic contrast-enhanced MRI; kidney perfusion; glomerular filtration rate; pharmacokinetic modeling; multi-layer perceptron; parameter estimation

1. Introduction

Monitoring the state of kidney functioning has become an increasingly important task in modern medical diagnostics. Failures in renal operation may impair physiological homeostasis, leading to the improper management of electrolytes, acid-base balance perturbation, or the deregulation of arterial blood pressure. Kidneys are responsible for blood filtration, and the removal of water-soluble waste products of metabolism and surplus glucose and other organic substances. Kidney diseases include, among others, acute kidney injury [1], chronic kidney disease (CKD) [2] and various cancers (e.g., renal cell carcinoma). Treatment depends on the pathological condition, and may require life-long dialysis, or kidney removal or transplantation. In any case, precise knowledge about kidney performance is needed for the proper diagnosis, treatment and follow-up prognosis. Especially in the case of CKD, which may be caused by longstanding hypertension or diabetes mellitus, it is important to continuously and precisely monitor renal function, since early detection of the disease allows the prevention of its development to the end stage.

Tissue perfusion is a viable means of preserving cell nutrition, humoral communication and the elimination of waste products over a lifetime [3]. Thus, information about local blood flow and the blood filtration taking place in the glomeruli plays a fundamental role in the diagnosis and follow-up of the aforementioned diseases. Routinely, renal diagnosis is accomplished through the creatinine clearance procedure, or by using the more specific iohexol clearance test. These tests lead to determination of the glomerular filtration rate (GFR), the main indicator of kidney performance. However, neither procedure enables differentiation between left and right kidney function [4].

Dynamic contrast-enhanced (DCE) magnetic resonance imaging (MRI) is an alternative technique available for the assessment of kidney functioning [5,6,7]. The procedure requires the administration of a contrast agent (CA) intravenously, given as a bolus injection, whose passage through the abdominal arterial system and the capillary bed and tubular systems of the kidneys is measured. In contrast to conventional methods, DCE-MRI enables the estimation of GFR for a single kidney, ultimately at the voxel-level, and simultaneously it provides anatomical characteristics of the renal parenchyma.

The estimation of renal filtration based on the DCE-MR images involves the pharmacokinetic (PK) modeling of the contrast agent, arriving from the abdominal aorta into a kidney artery, the renal parenchyma and the afferent arterioles of the renal corpuscles, and then passing from the glomerular capillaries through the filtration barrier into the Bowman’s space and the tubular system, ending up in the renal pelvis, the ureter and then the bladder as a constituent of the urine. There have been numerous PK models proposed in the literature [8], which vary in (1) the number of tissue compartments, (2) the level of incorporating compartment-specific impulse residue functions, (3) the inclusion of the effect of tracer agent leakage from capillaries in cancerous cells, and (4) the way in which the delay and the broadening of peak CA concentration in the renal parenchyma, relative to the abdominal aorta, is modeled. In the case of the normal-state kidney, all recent models exploit a two-compartment approach [9]. In this setting, at any point in time, the contrast agent concentration in the tissue is the sum of concentrations in the intravascular (IV) and extracellular extravascular (EEV) compartments. In the present study, we used the two-compartment filtration model (2CFM) proposed by Tofts et al. [10].

One approach to finding the PK model’s parameters (defined below) is to fit the model-based signal intensity time course to the image data using a non-linear least-squares (NLLS) method. Depending on the model, various explicit parameters are used to characterize tracer agent kinetics. The so-called primary or independent parameters include plasma volume, plasma flow, interstitial volume and permeability product [9]. Based on these attributes, one may derive dependent parameters, i.e., the extraction fraction and the volume transfer constant. The latter, denoted as K^trans, characterizes regional renal filtration, as it measures the ratio of plasma flow that is transferred from the glomerular capillary bed into the Bowman’s space. Multiplying K^trans by the kidney volume yields an estimate of the glomerular filtration rate, which makes K^trans a central descriptor in quantifying kidney function.
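As a worked illustration of the last relation, the sketch below converts a K^trans value and a kidney volume into a single-kidney GFR. The function name, the units (K^trans in min⁻¹, volume in mL) and the numerical values are assumptions chosen for illustration only; they are not taken from the study.

```python
def single_kidney_gfr(ktrans_per_min, kidney_volume_ml):
    """Return a GFR estimate in mL/min as K^trans times the tissue volume.

    Assumes K^trans is expressed in 1/min and the volume in mL.
    """
    if ktrans_per_min < 0 or kidney_volume_ml < 0:
        raise ValueError("K^trans and volume must be non-negative")
    return ktrans_per_min * kidney_volume_ml


# Hypothetical values for the left and right kidney
left_gfr = single_kidney_gfr(0.25, 200.0)
right_gfr = single_kidney_gfr(0.30, 180.0)
total_gfr = left_gfr + right_gfr
print(f"left: {left_gfr:.1f}, right: {right_gfr:.1f}, total: {total_gfr:.1f} mL/min")
```

Summing the two single-kidney values gives the two-kidney estimate that can be compared with a clearance test.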

In this study, we propose to use a multi-layer perceptron (MLP) network for PK model parameter estimation, instead of the non-linear least-squares curve-fitting method. The rationale behind this concept arises from the fact that the results of the iterative methods may be located in a local and—in general—suboptimal solution. Thus, the outcome depends largely on the initial choice of parameter values. Moreover, one usually has to establish constraints on the expected parameter values, so that the result falls into a physiologically feasible range. Bounds that are too narrow, however, may prevent the fitting algorithm from finding an optimum. Contrary to NLLS, MLP does not require that one provides the initial parameter values; moreover, it ensures the optimal solution when properly trained.

Although MLP networks could be regarded as a legacy method, they are still in use, and prove efficient in complex non-linear regression problems [11,12]. Moreover, according to the universal approximation theorem, an artificial neural network with at least one hidden layer of nonpolynomial activation (such as sigmoid or rectified linear) can model any measurable function that maps one finite-dimensional space to another, provided the hidden layers contain enough units [13]. Therefore, MLP seems to be particularly useful in its application to the specific problem of PK model parameter estimation in renal DCE-MRI examinations.

Also, the reliability of the fitted PK model parameters is conditioned by—among other factors—the determination of the arterial input function (AIF), i.e., the tracer concentration’s time course in the feeding artery. The standard way of annotating AIF consists of estimating it for each patient. However, other approaches are accepted if patient-specific AIF measurements are not possible. Such problems can arise due to data acquisition constraints or the lack of a suitable artery within the imaging field of view. In the alternative strategy, one either utilizes a population-based mean AIF [14] (calculated for a cohort of studies, where it is feasible to do so) or defines its parameterized functional form [15]. In this study, we implicitly follow the population-based approach. The neural network is trained using a pool of AIFs automatically derived for a subset of available DCE images. Then, the GFR for an individual test case is estimated without the need to determine the case-specific AIF.

To sum up, the main contribution of this paper is the establishment of the architecture of an MLP network for estimating parameters of the 2CFM, based solely on image intensity time courses in the kidney region. Additionally, a side-contribution is the introduction of the algorithm for automatic determination of the AIF in DCE-MRI examinations. The latter achievement was needed in order to construct the data set for MLP training, but the algorithm can be equally used in standard approaches to PK modeling.

2. Methods and Materials

2.1. Subjects

A total of 20 DCE-MRI examinations were available for experiments [16]. The data sets were collected from 10 healthy volunteers. Each subject was imaged twice, seven days apart (further, these examinations will be referred to as Session 1 and 2). The acquisition sequence used the standard 3D fast low-angle shot (FLASH) spoiled gradient recalled echo technique with the following parameters: TR = 2.36 ms, TE = 0.8 ms, FA = 20°, in-plane resolution = 2.2 × 2.2 mm², slice thickness = 3 mm, acquisition matrix = 192 × 192, number of slices = 30. Prior to image acquisition, patients were administered 0.025 mmol/kg of GdDOTA at 3 mL/s flow rate. The contrast agent was injected intravenously. Then, 74 volumetric scans were gathered in 2.3 s time intervals.

Every image series was registered in the time domain using the b-splines method, as implemented in the Insight Toolkit (ITK) library [17]. Specifically, we used Slicer 3D software to accomplish the task. For every study, the same reference frame was used as a fixed volume that every other (moving) volume was matched to. B-splines registration was performed fully automatically, i.e., no fiducial points were marked over tissues of interest. Furthermore, the procedure was launched in a multi-stage configuration. In each stage, various settings of grid size and subsampling rates were used. For a detailed interpretation of these parameters, the reader is referred to the ITK documentation. In short, they allow the registration of images at different scales—starting from coarse matching and then refining the outcome.

A common practice in DCE series analysis is to interpolate the measured signal so as to achieve a higher temporal resolution and capture more precisely the critical phases of the perfusion process, such as the bolus arrival time or the peak of the signal enhancement [10]. As such, we applied cubic spline interpolation to the observed intensity time courses, and then resampled the interpolated signal curves at 0.4-millisecond intervals.

As stated in the introduction, analysis of the DCE images leads to estimation of the GFR. Thus, in order to validate the obtained estimates against ground-truth values, volunteers underwent iohexol clearance tests. The measurement was carried out by administering a dose of 5 mL of iohexol (300 mg I/mL; Omnipaque 300, GE Healthcare), and then acquiring a venous blood sample after 4 h.

No specific diet was imposed on the participating subjects, but they were asked to abstain from alcohol and high-protein meals, avoid exhausting physical exertion, be normally hydrated at least 2 days before examination, and have no caffeine on the examination day. Apart from these restrictions, participants were instructed to retain their ordinary habits concerning the type and amount of consumed food so as to assure stable examination conditions. All subjects gave their informed consent for inclusion before they participated in the study. The study was conducted in accordance with the Declaration of Helsinki, and the protocol was approved by the Regional Committees for Medical Research Ethics Western Norway (REC West 2012/1869).

2.2. Two-Compartment Filtration Model of Kidney Perfusion

In the PK model considered in this study [10], it is assumed that renal tissue is composed of two compartments—an intravascular (IV) one and an extracellular extravascular (EEV) one (Figure 1). The model can be applied either to the whole kidney parenchyma or the kidney cortex only. The assumption which could be violated here is that CA does not flow out from the tubular system within the fitting period.

The contrast agent’s concentration in the renal tissue C_tissue(t) is governed by the equation

C_{tissue}(t) = K^{trans} \int_{0}^{t} C_{p}^{kid}(τ) dτ + v_{p} C_{p}^{kid}(t),

(1)

where C_{p}^{art} denotes the arterial input function, v_p the plasma volume fraction, and C_{p}^{kid} the CA concentration in the plasma, given by the convolution

C_{p}^{kid}(t) = C_{p}^{art} \otimes g(t) = \int_{0}^{t} C_{p}^{art}(t - τ) g(τ) dτ.

(2)

The first term in (1) represents the EEV compartment, whereas Equation (2) models the delay and dispersion of the arterial input upon its arrival to the IV compartment by using the concept of the vascular impulse response function (VIRF), denoted as g(t). In our study we used the delayed exponential VIRF, defined as

g(t) = \begin{cases} 0 & t < Δ \\ \frac{1}{T_{g}} e^{-\frac{t - Δ}{T_{g}}} & t \geq Δ \end{cases},

(3)

with

\int_{0}^{\infty} g(t) dt = 1.

(4)

The variables T_g (the dispersion time constant) and Δ (the delay interval), together with the volume fraction v_p and the transfer constant K^trans, form the complete set of the Tofts model’s parameters.
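To make Equations (1)–(3) concrete, the following minimal sketch evaluates the model on a uniform time grid. It is an illustration under stated assumptions, not the authors' implementation: the function names are hypothetical, the integrals are approximated by simple rectangle-rule sums, and K^trans is assumed to be expressed per unit of the same time step as `dt`.

```python
import math


def virf(t, t_g, delta):
    """Delayed exponential vascular impulse response, Equation (3)."""
    if t < delta:
        return 0.0
    return math.exp(-(t - delta) / t_g) / t_g


def tissue_concentration(aif, dt, ktrans, v_p, t_g, delta):
    """Evaluate Equations (1) and (2) on a uniform grid with step dt.

    aif holds arterial plasma concentrations sampled every dt.
    Integrals are approximated by rectangle-rule sums (an assumption
    of this sketch).
    """
    n = len(aif)
    g = [virf(i * dt, t_g, delta) for i in range(n)]
    # Equation (2): discrete convolution of the AIF with the VIRF
    c_kid = [
        dt * sum(aif[i - j] * g[j] for j in range(i + 1))
        for i in range(n)
    ]
    # Equation (1): filtration term (running integral) plus vascular term
    c_tissue, running = [], 0.0
    for c in c_kid:
        running += c * dt
        c_tissue.append(ktrans * running + v_p * c)
    return c_tissue


# Hypothetical step-like arterial input, 60 s sampled every 0.1 s
aif = [1.0] * 600
curve = tissue_concentration(aif, dt=0.1, ktrans=0.004, v_p=0.2, t_g=2.0, delta=1.0)
```

With K^trans set to zero the curve reduces to the vascular term v_p·C_p^kid, which is a convenient sanity check of the convolution step.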

2.3. Arterial Input Function

Although in the proposed approach, the trained neural network enables the determination of GFR without annotation of a patient-specific arterial input function, information about the time course of the contrast agent’s concentration in a tissue-feeding artery is required at the training stage. As further described, we use a subset of image-derived AIFs to construct a training data set. Therefore, in this section, we present our custom method for automatic AIF determination.

Most frequently, AIF is determined based on signal averaged over a manually delineated region of interest near to the perfused organ [18,19]. Specifically, in the case of renography it is recommended to locate the AIF region-of-interest (ROI) near the place where the aorta bifurcates into common iliac arteries. This location is in relative proximity to the kidney, and is simultaneously sufficiently far from the upper aorta section, where the signal coming from the blood that enters the imaging slab at high velocity is high, because of the inflow artifact and not because of the contrast agent [19]. Moreover, Cutajar et al. in [18] confirmed the significant dependence of renal perfusion and the filtration coefficients on the size of the chosen AIF ROI. In [20], AIF was determined from voxels chosen semi-automatically on the maximum signal-enhancement maps as the brightest locations in the abdominal aorta, directly below the inlets of the renal arteries. Note that the manual or semi-automatic approaches suffer from poor reproducibility and subjectivism, and they can be time-consuming and can require the availability of well-trained staff. In addition, it may be necessary in such cases to obtain patient-specific hemodynamic parameters in order to adjust the DCE-MRI protocol to the arterial pulse wave, and thus reduce the inflow artifact. Therefore, there were several attempts made to automate determination of the AIF. A common approach employs clustering techniques [21]. On the other hand, in [22], the authors used Kendall’s coefficient of concordance. These attempts, however, were either applied in preclinical studies, or to different organs such as the head or neck, but not to the kidney.

We postulate a fully automatic procedure for determining a patient-specific AIF based on the clustering of abdominal aorta voxels’ time-series. Our approach consists of three stages, as illustrated in Figure 2. The algorithm starts with delineation of the abdominal aorta region. This task, further decomposed into four steps, is accomplished through the vessel segmentation of one time frame of the DCE-MR image series. The appropriate time frame is selected for maximum signal enhancement in the bottom-central region of the image, near the aorta’s bifurcation into common iliac arteries. The peak intensity value in this region corresponds to the time point when the CA fills the whole aorta lumen, thus ensuring the highest contrast between the background tissue and the aorta at its entire length.

Next, the image is blurred by the low-pass Gaussian filter (σ = 3 voxels) to suppress fine structures that could be misinterpreted as vessels. Such a smoothed image is then submitted to the vessel enhancement routine. In this step, we computed a multi-scale vesselness function in the form proposed by Sato [23], using 5 scales of radius standard deviation ranging from 0.5 to 2.5 mm. As a result, the filtered image contains only the elongated bright structures. Apart from the aorta, it may contain smaller vessels, like renal arteries and some other elongated structures. In order to remove them, the flood fill operation is executed, with the lower threshold set to an empirically found value equal to 12.5% of the maximum image intensity. The upper bound is set to maximum, as we found the aorta to be the brightest structure visible in the time frames selected for segmentation. Similarly, the seed point of the flood fill operation was determined as the brightest point in the central part of the image along the coronal direction.
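The thresholded flood fill can be sketched as a breadth-first region growing. The snippet below is only an illustration: the nested-list volume layout, the 6-connectivity and the function name are assumptions, and the vesselness filtering that precedes this step is not reproduced.

```python
from collections import deque


def flood_fill_3d(volume, seed, lower_fraction=0.125):
    """Grow a region from seed over voxels >= lower_fraction * max intensity.

    volume is a nested list indexed as volume[z][y][x]; seed is a (z, y, x)
    tuple. 6-connectivity is an assumption of this sketch.
    """
    nz, ny, nx = len(volume), len(volume[0]), len(volume[0][0])
    v_max = max(v for plane in volume for row in plane for v in row)
    threshold = lower_fraction * v_max
    if volume[seed[0]][seed[1]][seed[2]] < threshold:
        return set()
    region = {seed}
    queue = deque([seed])
    steps = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]
    while queue:
        z, y, x = queue.popleft()
        for dz, dy, dx in steps:
            p = (z + dz, y + dy, x + dx)
            inside = 0 <= p[0] < nz and 0 <= p[1] < ny and 0 <= p[2] < nx
            if inside and p not in region and volume[p[0]][p[1]][p[2]] >= threshold:
                region.add(p)
                queue.append(p)
    return region


# Tiny hypothetical vesselness image: one connected bright structure and
# one bright voxel that is not connected to the seed
toy = [[[0, 0, 0, 0, 0],
        [100, 90, 0, 80, 0],
        [0, 50, 0, 0, 0]]]
aorta = flood_fill_3d(toy, seed=(0, 1, 0))
```

Only voxels connected to the seed survive, which is how smaller, disconnected bright structures are discarded.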

In the second stage, abdominal aorta voxels are grouped into clusters using their signal intensity time courses as the feature vectors. For this purpose, we employed a k-means clustering algorithm with k = 5. We found this value to be a reasonable tradeoff between the need to reflect apparent diversity in the aorta signal dynamics and the need to keep the cluster size sufficiently large, so that each one represents a significant portion of aorta voxels. As an effect of clustering, points whose temporal signal characteristics are similar are joined. Eventually, we select one of the clusters as the AIF ROI. The selection is based on the minimum mean signal value calculated in the pre-bolus phase. We assume that the best AIF candidate region has the lowest signal in this initial phase of bolus passage, as it is free from the inflow-enhancement artifact.
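The clustering and selection stage can be illustrated with a plain Lloyd's k-means followed by the pre-bolus criterion. This is a hedged sketch: the function names, the random initialization and the fixed iteration count are assumptions, and the toy example uses k = 2 rather than the k = 5 used in the study.

```python
import random


def kmeans(curves, k, n_iter=50, seed=0):
    """Plain Lloyd's k-means on equal-length time courses (lists of floats)."""
    rng = random.Random(seed)
    centroids = [list(c) for c in rng.sample(curves, k)]
    labels = [0] * len(curves)
    for _ in range(n_iter):
        # Assignment step: nearest centroid in squared Euclidean distance
        labels = [
            min(range(k),
                key=lambda j: sum((a - b) ** 2 for a, b in zip(c, centroids[j])))
            for c in curves
        ]
        # Update step: mean curve of each non-empty cluster
        for j in range(k):
            members = [c for c, lab in zip(curves, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def select_aif_cluster(curves, labels, n_prebolus):
    """Return the label of the cluster with the lowest mean pre-bolus signal."""
    def prebolus_mean(j):
        members = [c for c, lab in zip(curves, labels) if lab == j]
        total = sum(sum(c[:n_prebolus]) for c in members)
        return total / (len(members) * n_prebolus)
    return min(sorted(set(labels)), key=prebolus_mean)


# Hypothetical aorta voxels: low baseline versus inflow-enhanced baseline
low = [[1.0, 1.0, 10.0, 8.0], [1.2, 0.9, 10.0, 8.0], [0.8, 1.1, 10.0, 8.0]]
high = [[5.0, 5.0, 10.0, 8.0], [5.2, 4.9, 10.0, 8.0], [4.8, 5.1, 10.0, 8.0]]
voxels = low + high
voxel_labels = kmeans(voxels, k=2)
aif_label = select_aif_cluster(voxels, voxel_labels, n_prebolus=2)
```

The AIF would then be taken as the mean time course of the voxels carrying `aif_label`.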

2.4. PK Model Fitting

There are several optimizers utilized in the NLLS curve-fitting problems. Downhill Simplex, Levenberg–Marquardt or non-linear conjugate gradients belong to the most frequently cited algorithms [24]. In the DCE-MRI context, a popular solver is based on the gradient expansion method implemented in the IDL Software [7]. Often, in order to ensure that the resulting parameter set falls into a range of reasonable physiological values, it is also necessary to restrain the optimization procedure to a specific solution subspace. In such a case, the Trust Region Reflective technique can be considered as the algorithm of choice as it enables optimization with constraints. All the above-mentioned methods are iterative, and require initiation with a starting point representing a feasible solution. As such, there is a risk of getting stuck in a local minimum of the objective function, usually defined as the mean squared error of the model residuals.

Therefore, we propose to solve the curve-fitting problem using an artificial neural network (ANN); specifically, a fully-connected multi-layer perceptron structure. ANNs have been successfully employed in a variety of optimization scenarios, including the fitting of complex, multi-parametric relationships to the observed noisy data [25]. The neural network training process is configured to explore a wide range of feasible parameter values, potentially covering the whole solution space. For every parameter candidate set, there is a corresponding CA concentration time course generated based on the PK model equation. Thus, the network incorporates a priori knowledge about the underlying mechanism of signal generation. Then, the network’s response to an observed, patient-specific CA time-course is estimated in the generalized context of tracer kinetics, making the fitting process more robust to noise, measurement errors and local minima.

The first step in the design of an MLP network is the choice of the number of hidden layers. As formulated by the universal approximation theorem recalled above, one sufficiently large hidden layer should enable the accurate approximation of the modeled function. For complex problems, however, two layers with fewer units may be easier to train, and could better capture the general pattern that links input data vectors with the estimated function parameters. We have observed such behavior in the current study. One-hidden-layer structures turned out to be more prone to overfitting, and their training process was less likely to converge. Therefore, in the rest of the paper we concentrate on the description of the three-layer architecture, i.e., one consisting of two hidden layers and one output layer. However, in the Results section we provide example outcomes of training two-layer structures to support the above observations. On the other hand, increasing the network depth to three hidden layers did not improve training efficacy, while making the task of optimizing the network architecture more complicated. Hence, the results of these computations are omitted from the description of the experiments.

The overall concept of a three-layer perceptron network operating as an estimator of the Tofts model parameters is shown in Figure 3. The network design is represented by weight vectors w = [w₁, w₂], and v for two hidden layers and the output layer neurons respectively. The symbols f(∙) and h(∙) denote the activation functions of perceptrons in a given layer. In our implementation we used the same activation for both hidden layers, which was the rectified linear unit function, i.e., h(x) = max(x, 0). For the output layer we used linear activation. The detailed structure of implemented MLP is presented in the Results section.
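A forward pass through such a network can be written in a few lines. The sketch below is illustrative only: the bias terms, the function names and the toy weights are assumptions not stated in the text, and a real implementation would rely on a deep-learning library rather than nested lists.

```python
def relu(x):
    """Rectified linear unit, h(x) = max(x, 0), applied element-wise."""
    return [max(v, 0.0) for v in x]


def dense(weights, bias, x):
    """Affine map; weights is a list of rows, one row per output unit."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


def mlp_forward(c_tissue, w1, b1, w2, b2, v, b_out):
    """Two ReLU hidden layers followed by a linear output layer."""
    hidden1 = relu(dense(w1, b1, c_tissue))
    hidden2 = relu(dense(w2, b2, hidden1))
    return dense(v, b_out, hidden2)


# Toy weights, chosen by hand so the result is easy to follow
theta_hat = mlp_forward(
    [1.0, -2.0],
    w1=[[1.0, 0.0], [0.0, 1.0]], b1=[0.0, 0.0],
    w2=[[2.0, 0.0], [0.0, 1.0]], b2=[0.0, -1.0],
    v=[[1.0, 1.0], [0.5, 0.0]], b_out=[0.1, 0.0],
)
```

In the setting described above, the input would be the sampled C_tissue vector and the output would have one unit per estimated model parameter.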

In principle, the network inputs accept a vector C_tissue(t) of CA concentrations in the subsequent time frames of a given DCE series. Transformation from the image signal intensity S(t) to concentration C_tissue(t) is given in the Appendix A. In the following, the time variable t will be skipped where possible, for the sake of clarity.

There are two modes of network operation. In the recall mode, a trained network recognizes the pattern in the observed data and predicts the possible values of the PK model’s parameters. Formally, the network realizes the transformation \hat{θ} = f(v, h(w, C_{tissue})), which approximates the unknown mapping \hat{θ} = ℱ(C_{tissue}) from the observation space to the parameter space. It is assumed that this mapping exists and is unique. In the training mode, the network is fed by a large and versatile collection of input vectors, and simultaneously it is presented with an expected parameter set θ for each individual input example. The difference between these true parameter values and the output drives the process of network weights adjustment. Specifically, we applied the mean squared error (MSE) metric as the network loss function. Apart from MSE, during the network weights adjustment, the mean absolute percentage error (MAPE) was also calculated for both the training and validation samples. The definitions of the MSE and MAPE metrics are given in Appendix A.
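For orientation, the two metrics can be sketched as follows. The functions use the usual textbook definitions, which are assumed (not verified here) to match those given in Appendix A; the sample values are hypothetical.

```python
def mse(y_true, y_pred):
    """Mean squared error between true and predicted parameter values."""
    return sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true)


def mape(y_true, y_pred):
    """Mean absolute percentage error, in percent; true values must be non-zero."""
    return 100.0 * sum(abs((t - p) / t) for t, p in zip(y_true, y_pred)) / len(y_true)


# Hypothetical true and predicted values
example_mse = mse([2.0, 4.0], [1.0, 5.0])
example_mape = mape([2.0, 4.0], [1.0, 5.0])
```

MSE penalizes large deviations more strongly, whereas MAPE expresses the error relative to the magnitude of each parameter.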

The weights, initialized to random values, are updated after each iteration using the error back-propagation algorithm [26]. The loss function (dependent on inter-layer weights) was optimized using the stochastic gradient descent algorithm [27]. The network was implemented in a Python script using the Keras library with the TensorFlow backend [28].

The crucial step in the above-described procedure is in the preparation of the training set. On the one hand it must be sufficiently large and varied for the network to gain the generalization capabilities. On the other hand, the input signal intensity vectors must be accompanied by the true parameter values, which allow the observation of a given C_tissue in response to a specific arterial input function. Moreover, since the available data set size is relatively small, we repeat this procedure using the leave-one-out approach. Having a data set of 20 examinations, we invoked calculations 20 times, each time excluding another subject AIF from the training stage. Hence, a training set in a given repetition was established via the 3-step procedure described below.

Step 1. Selecting a subset of 19 realistic arterial input functions.

Step 2. Establishing ranges of model parameter values. Table 1 presents the assumed parameter limits along with their corresponding probability distributions. These limits were fixed with respect to the published values for normal subjects [7,10,29]. Training input samples were generated using the model equation and parameters sampled from these distributions. According to the Tofts model [10], there are four such parameters: K^trans, v_p, T_g and Δ. The latter two are the parameters of the vascular impulse response function, whose sum defines the so-called mean residence time (MRT). Since [7,10] report only the value of MRT, we sampled T_g and MRT, whereas Δ was calculated as

Δ = MRT − T_g

(5)

Step 3. For a given AIF, we sample the model parameters θ and calculate C_tissue according to Equation (1). Sampling is repeated 10⁴ times for each AIF. As a result, there were 19 × 10⁴ (C_tissue, θ) pairs available in each leave-one-out repetition. The calculated C_tissue curves were probed at time steps adjusted to match the temporal resolution of the network input vector.
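The three steps above can be sketched as a short data generator. Everything numerical here is a placeholder: Table 1 is not reproduced in this section, so the parameter ranges, the uniform distributions, the time units and all function names are assumptions made purely for illustration.

```python
import math
import random

# Placeholder limits (times in seconds, K^trans in 1/s). These ranges and
# the uniform distributions are illustrative assumptions, not Table 1.
RANGES = {"ktrans": (0.001, 0.01), "v_p": (0.1, 0.5),
          "t_g": (1.0, 5.0), "mrt": (5.0, 10.0)}


def sample_theta(rng):
    """Draw K^trans, v_p, T_g and MRT, then derive delta via Equation (5)."""
    theta = {name: rng.uniform(lo, hi) for name, (lo, hi) in RANGES.items()}
    theta["delta"] = theta["mrt"] - theta["t_g"]
    return theta


def simulate(aif, dt, theta):
    """Equations (1)-(3) on a uniform grid, using rectangle-rule sums."""
    n = len(aif)
    g = [0.0 if i * dt < theta["delta"]
         else math.exp(-(i * dt - theta["delta"]) / theta["t_g"]) / theta["t_g"]
         for i in range(n)]
    c_kid = [dt * sum(aif[i - j] * g[j] for j in range(i + 1)) for i in range(n)]
    curve, running = [], 0.0
    for c in c_kid:
        running += c * dt
        curve.append(theta["ktrans"] * running + theta["v_p"] * c)
    return curve


def leave_one_out(aifs, held_out):
    """Step 1: keep every AIF except the one at index held_out."""
    return aifs[:held_out] + aifs[held_out + 1:]


def build_training_set(aifs, dt, n_per_aif, seed=0):
    """Steps 2 and 3: sample parameters and simulate one curve per draw."""
    rng = random.Random(seed)
    pairs = []
    for aif in aifs:
        for _ in range(n_per_aif):
            theta = sample_theta(rng)
            pairs.append((simulate(aif, dt, theta), theta))
    return pairs


# Four hypothetical step-like AIFs; the study used 20 image-derived ones
toy_aifs = [[0.0] * 5 + [1.0] * 15 for _ in range(4)]
pairs = build_training_set(leave_one_out(toy_aifs, 0), dt=1.0, n_per_aif=4)
```

With 19 retained AIFs and 10⁴ draws per AIF, the same loop would produce the 19 × 10⁴ pairs mentioned in Step 3.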

The simulated tracer concentration time curves were optionally corrupted by Gaussian noise to increase their variability and make them more realistic. Data corruption was performed by adding to each simulated CA concentration a value drawn from the normal distribution with zero mean and a standard deviation adjusted to the current time step, i.e., σ_{noise} = s \cdot [C_{tissue}(t) - \min(C_{tissue}(t))]. The scale factor s was experimentally set to 0.025 to achieve reasonable deflection from the modeled CA concentration time curves. Then, the data set was partitioned into training and test sets in the proportion 7:3. Moreover, 30% of the training data vectors were randomly set apart in each epoch for validation purposes.
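The noise model and the 7:3 partition can be sketched as below. The function names, the seeding and the shuffling strategy are assumptions of this example; only the noise formula and the scale factor s = 0.025 come from the text.

```python
import random


def add_noise(c_tissue, s=0.025, rng=None):
    """Add zero-mean Gaussian noise with sigma = s * (C(t) - min C) per sample."""
    rng = rng or random.Random()
    c_min = min(c_tissue)
    return [c + rng.gauss(0.0, s * (c - c_min)) for c in c_tissue]


def split(items, train_fraction=0.7, rng=None):
    """Shuffle a copy of items and cut it into training and test parts."""
    rng = rng or random.Random()
    shuffled = list(items)
    rng.shuffle(shuffled)
    cut = round(train_fraction * len(shuffled))
    return shuffled[:cut], shuffled[cut:]


# Hypothetical concentration curve and data set
clean = [0.0, 1.0, 2.0, 4.0]
noisy = add_noise(clean, rng=random.Random(1))
train, test = split(range(10), rng=random.Random(1))
```

Because the standard deviation scales with the distance from the curve minimum, the baseline sample stays untouched while the peak region receives the largest perturbation.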

The last issue to be settled is the choice of a proper network architecture. During the experiments, it turned out that a single hidden layer cannot accurately encapsulate the non-linear relationship between the PK model’s parameters and the variety of CA concentration time-curves. The structure with two hidden layers ensured a significant decrease in the loss function value, while the usage of more than two hidden layers did not offer any further improvement in this respect. Finally, the number of perceptrons in each layer had to be established. We tested 12 configurations, listed in Table 2. Apart from the various numbers of neurons in the hidden layers, we also considered the inclusion of additional dropout layers between them to overcome the problem of overfitting [30]. For every configuration and each leave-one-out fold, the training was run for a fixed number of 100 epochs.

3. Results

We compared the effectiveness of the postulated ANN-based approach to optimizing a PK model’s parameters in reference to the algebraic curve-fitting method, configured to utilize the Trust Region Reflective algorithm [31]. Apart from the direct comparison of particular model parameters, we also estimated single- and two-kidney GFRs. The image-derived estimates were then compared against reference values measured with iohexol clearance tests.

We also validated the designed algorithm for the determination of the arterial input function. In this experiment, we calculated the GFR scores using the Trust Region Reflective method for AIFs annotated manually, and using the proposed automated approach.

For every study we performed manual segmentation of the whole kidney, as well as cortex and medulla. This step was accomplished by two professionals experienced in medical image analysis, using our custom software annotation tool. The rate of agreement between the respective kidney regions delineated by both experts was estimated by the Jaccard coefficient, and was equal to 0.93 on average. For a given study, kidney segments were outlined only in a time frame corresponding to the maximum signal enhancement in the perfusion phase, which ensured the largest contrast between cortex, medulla and pelvis. Depending on the study, this maximal enhancement was observed in frames numbered from 12 to 18. Then, the segmentations were applied to all other time frames reflected in the parameter estimation procedure, so as to determine mean signal time courses in the perfused renal tissue, and the selected frames became fixed reference frames in the b-spline registration algorithm.
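The agreement measure mentioned above is straightforward to compute from two binary masks. In the sketch below, each mask is represented as a collection of voxel coordinates; this representation and the function name are assumptions made for brevity.

```python
def jaccard(mask_a, mask_b):
    """Jaccard coefficient: size of the intersection over size of the union."""
    a, b = set(mask_a), set(mask_b)
    union = a | b
    if not union:
        return 1.0  # two empty masks are treated as identical
    return len(a & b) / len(union)


# Hypothetical delineations of the same region by two experts
expert_1 = {(0, 0), (0, 1), (1, 1)}
expert_2 = {(0, 1), (1, 1), (2, 2)}
agreement = jaccard(expert_1, expert_2)
```

A value of 1 means identical delineations and 0 means no overlap, so the reported average of 0.93 indicates close agreement.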

3.1. Assessment of Automatic AIF Determination Method

Figure 4 visualizes the locations of the arterial input function ROIs, as identified by the proposed automatic method and manual selection. The presented examples correspond to the two cases, where the best and the worst model fits were obtained. As shown, in comparison to manual AIF annotations, automatically found ROIs occupy many more voxels that are free from inflow artifacts. As a result, fitted intensity time curves better encapsulate image signal changes. For example, in Figure 4c,e, the curve is visible in the region of maximum signal enhancement. In the case of automatically determined AIF, the curve approximates well the measured signal samples, whereas in the case of manual annotations, the model overestimates signal values. The reliability of the GFR measurements based on automatically determined AIFs is visualized in Figure 5b, and can be compared to the Bland–Altman plots shown in Figure 5a, obtained with manually annotated AIFs. Since the automatically determined AIFs seemed to improve the accuracy and precision of GFR assessment, this allowed us to use them in the generation of the training set for the neural network.
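The quantities displayed in a Bland–Altman plot can be computed as follows. The sketch assumes the conventional limits of agreement at ±1.96 standard deviations of the differences, which may differ from the exact settings used for Figure 5; the GFR values are hypothetical.

```python
import statistics


def bland_altman(estimates, references):
    """Return (bias, lower limit, upper limit) of agreement.

    The limits are taken as bias +/- 1.96 sample standard deviations of
    the pairwise differences (a conventional choice, assumed here).
    """
    diffs = [e - r for e, r in zip(estimates, references)]
    bias = statistics.mean(diffs)
    sd = statistics.stdev(diffs)
    return bias, bias - 1.96 * sd, bias + 1.96 * sd


# Hypothetical image-derived versus reference GFR values (mL/min/1.73 m²)
image_gfr = [102.0, 98.0, 105.0, 95.0]
reference_gfr = [100.0, 100.0, 100.0, 100.0]
bias, lower, upper = bland_altman(image_gfr, reference_gfr)
```

The bias corresponds to the mean difference reported for each estimation method, and the width of the limits reflects its precision.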

3.2. Selecting Network Configuration

In the first phase of experiments, we aimed to select an architecture for the neural network suitable for solving the problem of fitting the 2CFM to the DCE-MRI data. We began by deciding upon the number of hidden layers and their activation units. Figure 6 presents the learning curves for three configurations of two-layer architectures, whereas the plots shown in Figure 7 correspond to the three-layer network configurations listed in Table 2. Although the selected learning curves were obtained only for one of the leave-one-out repetitions, they are representative of all the subjects in the study. It can be seen that in all cases, the loss function initially decreases rapidly, and converges to a value in the range 0.01–0.13 depending on the network architecture and the existence of noise in the training data. However, for configurations without the dropout layer, the MSE value in the validation subset fluctuates heavily until the very end of the training process. It must be noted that for the single-hidden-layer configurations (Figure 6), the problem with the convergence of the loss function also persists in the largest tested network, with 50 hidden units, even after inserting a dropout layer before the output nodes. This observation, combined with potential overfitting (which manifests in larger errors in a validation sample, with respect to a training set), substantiates the use of two hidden layers. In the latter case, the loss function stabilizes after around 40 iterations for ‘dropout’ configurations with higher numbers of neurons (architectures number 6, 8, 10 and 12). This is especially pronounced when the training data were corrupted by noise.

Apparently, increased data variability due to the presence of noise, along with the inclusion of a dropout layer, ensures better loss function convergence and helps prevent the network from overfitting. As can be observed in Figure 7 (second column), the loss function scores on the validation set start to surpass the values obtained for the training set quite early in the learning process. This overfitting dynamic is further confirmed by the MAPE scores obtained after 100 learning epochs (see Figure 8). Without the dropout mechanism, the error remains higher for the validation set than for the training one, for almost every network configuration.
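The noisy training vectors discussed above can be illustrated with a short sketch that corrupts a concentration time curve with zero-mean Gaussian noise. The noise level `sigma`, the sample values and the helper name `add_gaussian_noise` are assumptions made for illustration only; the noise magnitude actually used for training is not stated in this section.

```python
import random


def add_gaussian_noise(curve, sigma, rng):
    """Return a copy of a time curve with zero-mean Gaussian noise added."""
    return [value + rng.gauss(0.0, sigma) for value in curve]


rng = random.Random(0)  # fixed seed so the sketch is reproducible
# Hypothetical noise-free concentration samples (arbitrary units).
clean_curve = [0.0, 0.1, 0.8, 1.5, 1.2, 0.9, 0.7, 0.6]
noisy_curve = add_gaussian_noise(clean_curve, sigma=0.05, rng=rng)
```

Each training vector would receive an independent noise realization, which increases the variability of the data seen by the network.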

While the above analysis settles upon the usage of the dropout layer and the noisy training vectors, the question regarding the sizes of the hidden layers must still be resolved. On the one hand, increasing the number of neurons lowers the MSE and MAPE scores. On the other hand, it escalates the risk of overfitting. Figure 9 presents the distributions of the Δ parameter estimates against the true values in the test set. Regression analysis shows good agreement between the predicted and the actual parameter estimates. The performed t-test and its resulting large p-values indicate that one cannot reject the hypothesis of equality between the group means. However, the overfitting effect manifests itself in the case of noise-free data, and it appears more severe for the more complex MLP architectures. The network tends to predict parameter values close to some discrete levels. Although the determination coefficients R² are higher in these cases, the predicted outcomes clearly do not conform to the actual data distribution. This tendency is not observed in the case of noisy data; however, changing the number of neurons in the first hidden layer from 40 to 50 did not bring significant improvement in the R² score (0.85 versus 0.86), and simultaneously, as shown in Figure 8d, it slightly increased the mean absolute percentage error for the validation set (from 13.9% to 14.3%).
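For reference, the determination coefficient used above to compare predicted and actual parameter values can be sketched as follows. This uses one common definition, R² = 1 − SS_res/SS_tot; whether the study computed it this way or as a squared correlation from the regression analysis is an assumption, and the Δ values below are hypothetical.

```python
def r_squared(actual, predicted):
    """Coefficient of determination R^2 = 1 - SS_res / SS_tot."""
    mean_actual = sum(actual) / len(actual)
    ss_res = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    ss_tot = sum((a - mean_actual) ** 2 for a in actual)
    return 1.0 - ss_res / ss_tot


# Hypothetical true and predicted delay values (in seconds) for a test set.
delta_true = [1.0, 1.5, 2.0, 2.5, 3.0, 3.5]
delta_pred = [1.1, 1.4, 2.1, 2.4, 3.1, 3.4]
r2 = r_squared(delta_true, delta_pred)
print(f"R^2 = {r2:.3f}")
```

Note that a high R² alone does not guarantee that predictions follow the true distribution, which is why the scatter plots in Figure 9 are inspected as well.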

Therefore, for the rest of the experiments we chose configuration #10, with 40 and 20 perceptrons in the first and second hidden layers, respectively, and with a dropout layer in between (dropout rate of 0.2). This setting ensures the lowest mean absolute percentage error and relatively high R² scores for all estimated parameters (Figure 10), while keeping the network model’s capacity appropriate for the data set’s complexity.
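The structure of configuration #10 can be sketched as a plain NumPy forward pass. This is only an illustrative sketch, not the trained network from the study: the input length of 60 time samples, the ReLU activations, the linear output layer and the random weight initialization are all assumptions, and the helper names `init_mlp` and `forward` are hypothetical.

```python
import numpy as np


def init_mlp(n_inputs, hidden=(40, 20), n_outputs=4, seed=0):
    """Random weights for an MLP with two hidden layers (40 and 20 units)."""
    rng = np.random.default_rng(seed)
    sizes = (n_inputs, *hidden, n_outputs)
    return [
        (rng.normal(0.0, np.sqrt(2.0 / m), size=(m, n)), np.zeros(n))
        for m, n in zip(sizes[:-1], sizes[1:])
    ]


def forward(x, params, dropout_rate=0.2, training=False, rng=None):
    """Forward pass; dropout acts between the hidden layers in training only."""
    (w1, b1), (w2, b2), (w3, b3) = params
    h1 = np.maximum(0.0, x @ w1 + b1)  # first hidden layer (ReLU assumed)
    if training and dropout_rate > 0.0:
        if rng is None:
            rng = np.random.default_rng()
        keep = rng.random(h1.shape) >= dropout_rate
        h1 = h1 * keep / (1.0 - dropout_rate)  # inverted dropout scaling
    h2 = np.maximum(0.0, h1 @ w2 + b2)  # second hidden layer (ReLU assumed)
    return h2 @ w3 + b3  # four outputs, one per PK model parameter


params = init_mlp(n_inputs=60)  # 60 time samples is a hypothetical input size
batch = np.random.default_rng(1).random((5, 60))  # five hypothetical curves
estimates = forward(batch, params)  # recall mode: dropout is inactive
```

In recall mode the dropout mask is not applied, so repeated calls with the same input give the same four parameter estimates.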

3.3. Evaluation of MLP-Based Model Fitting Reliability

The second phase of experiments consisted of using the trained MLP network to estimate the 2CFM parameters, as well as the GFR values, for the 20 DCE-MRI examinations available in this study. The statistics of the parameter estimates are summarized in Table 3, whereas the calculated GFR values are collected in Table 4. The reliability of GFR estimation was also evaluated against the ground truth using the Bland–Altman plot shown in Figure 11. When compared to Figure 5b, it emerges that the mean biases from the reference values are similar for both optimization methods, MLP and NLLS curve-fitting. However, the limits of agreement κ and the confidence intervals are narrower for the neural network. Moreover, the ANN ensures a better balance of the estimated values around the mean, whereas the NLLS method tends to overestimate the measured quantity, especially in the lower range of values. In addition, in the case of the ANN, GFR was estimated without explicit determination of the AIF for a given subject. This suggests that the designed network was capable of extracting the general characteristics of the underlying perfusion process.

From the analysis of Table 3, it emerges that the MLP determined K^trans and v_p parameter values close to those of the reference method. The box plots presented in Figure 12a,b illustrate that the ranges of the estimated K^trans and v_p values partially overlap, and that their medians are similar. The t-test indicates that the v_p results, as estimated by MLP and NLLS, are not statistically different (p-value > 0.3), which suggests that both methods can be used interchangeably for blood volume fraction assessment. In the case of K^trans, the difference in parameter estimates becomes statistically significant (p-value < 10⁻³), although the medians remain relatively close with respect to the overall data variability. Estimates of T_g and Δ vary more significantly (cf. Figure 12c,d). The evidence against the null hypothesis of equal means in the MLP- and NLLS-based calculations is even stronger than for the K^trans parameter (p-value < 10⁻⁶). Apparently, the variation in the vascular impulse response function manifests itself in the CA concentration time courses in a way that is less evident than for the other two PK model parameters. As such, the neural network could not precisely infer the relationship between the concentration time curves and the expected T_g and Δ values. This effect is consistent with the R² scores calculated on the test set after MLP training: the scores for the VIRF timing factors are remarkably lower than those for the blood volume fraction and transfer constant.

4. Discussion and Conclusions

The main goal of this study was to evaluate the possibility of estimating the parameters of the two-compartment filtration model using an artificial neural network. As shown in the Results section, especially in Table 4, overall the task was accomplished successfully. The single- and two-kidney estimates of glomerular filtration rate fall into the physiologically feasible range, and are close to the reference values calculated with the Trust Region Reflective algorithm (a non-linear least-squares curve-fitting method), as well as those measured by blood tests. Comparison with the latter method, which serves as a gold standard in clinics, is particularly encouraging. The mean difference between the corresponding measurements is μ_d = 2.35 mL/min/1.73 m², with the agreement interval κ = μ_d ± 36.16 mL/min/1.73 m². This interval is even narrower than the one obtained for the NLLS method (κ = μ_d ± 63.97 mL/min/1.73 m²), which signals the greater precision of the designed MLP structures. This effect was achieved thanks to the usage of dropout layers between the hidden ones, and the training of the network with data vectors purposely corrupted with noise.
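The mean difference and agreement interval quoted above follow the Bland–Altman approach, which can be sketched as below. The sketch assumes the conventional limits of μ_d ± 1.96 standard deviations of the differences and the sign convention "estimate minus reference"; both are assumptions, since the exact conventions are not restated in this section. The input values are the Session 1 MLP totals and ground-truth scores from Table 4, so the output is not expected to reproduce the reported statistics, which cover both sessions and unrounded data.

```python
from statistics import mean, stdev


def bland_altman(estimated, reference, z=1.96):
    """Return the mean difference and the lower/upper limits of agreement."""
    diffs = [e - r for e, r in zip(estimated, reference)]
    mu_d = mean(diffs)
    half_width = z * stdev(diffs)  # sample standard deviation of differences
    return mu_d, mu_d - half_width, mu_d + half_width


# Two-kidney GFR totals from Table 4 [mL/min/1.73 m^2], Session 1 only.
mlp_session_1 = [118, 114, 75, 92, 87, 65, 96, 89, 104, 136]
ground_truth = [107, 98, 90, 93, 94, 103, 112, 96, 119, 112]

mu_d, lower, upper = bland_altman(mlp_session_1, ground_truth)
print(f"mean difference: {mu_d:.2f}, limits of agreement: [{lower:.2f}, {upper:.2f}]")
```

A narrower interval between the two limits indicates better precision, which is the basis of the comparison between the MLP and NLLS methods.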

Although the trained MLP network performs relatively well in predicting K^trans, v_p and Δ parameters, it actually fails in estimating accurate values for the VIRF decay time constant T_g. The corresponding R² determination coefficients calculated for the test sets do not exceed the value of 0.64, which is remarkably less than for the other three parameters (0.85–0.92). T_g, however, does not affect the calculation of GFR, which depends only on K^trans. In contrast with the NLLS procedure, a neural network does not actually fit a modeled curve to the input data. A failure in predicting a given parameter value does not necessarily cause an error in another parameter’s estimation.

It is instructive to compare the results obtained in this study to similar measurements presented in [16], partly conducted on the same subjects. The scores presented therein were obtained using a conventional curve-fitting approach, and were divided with respect to examination session. The mean differences and limits of agreement reported in that study were μ_d = 1.5 mL/min/1.73 m² and κ = μ_d ± 43.2 mL/min/1.73 m² (Session 1), and μ_d = 6.1 mL/min/1.73 m² and κ = μ_d ± 31.9 mL/min/1.73 m² (Session 2). Hence, our MLP-based method estimates true GFR values at a comparable level, demonstrating either slightly better (in the case of Session 1) or poorer (Session 2) precision. Apart from different parameter estimation methods, the observed discrepancies may be caused by other factors, including various post-processing algorithms, e.g., image registration and segmentation. Above all, a different pharmacokinetic model was utilized in [16], namely Sourbron’s two-compartment separable model [7].

Regarding the PK models themselves, the most common of them are often claimed to be too simple to properly represent patient-specific biophysiological processes. Then, even the most precise parameter estimation may not lead to accurate GFR calculation. There have been several attempts to develop more advanced models, including multi-compartmental [32] or patient-specific models [33], as well as those with a modified functional form of CA retention in the renal tissue compartments [34]. As such, the reliability of our proposed MLP-based method for DCE-MRI processing should be further examined with respect to various mathematical models of kidney perfusion.

Another constraint that narrows the scope of the conclusions that can be derived from this study is that it presents methodological achievements tested solely on healthy subjects. Although we believe that the proposed methods of ANN-based PK model parameter estimation and AIF determination will prove applicable to diseased kidneys, this paper focuses exclusively on algorithms per se and numerical image processing, rather than clinical applications. Such an approach is common in numerous works published in the biomedical engineering domain, which postulate novel processing techniques, including those devoted to renal perfusion measurement [7,10,16,35]. The reasoning behind this approach is that the technical quality of the designed solution must be verified under well-defined and controlled conditions for a statistically representative data sample. Such conditions are ensured by a cohort of healthy volunteers. Besides, in the current study, the inclusion of abnormal cases would not necessarily provide additional insight into the behavior of the automatic determination of AIF. In the case of diseased kidneys, it should run in an unchanged manner, since tissue lesions are independent of the blood flow dynamics in the abdominal aorta. On the other hand, taking into account renal impairments would involve experimenting with various pharmacokinetic models of perfusion, and it would remarkably extend the scope of this study.

To conclude, the performed experiments show the potential of neural networks as an alternative computational framework to the standard curve-fitting procedure in the context of quantitative perfusion estimation. The overall agreement with the gold standard method obtained by ANN was comparable to the results obtained using the non-linear least-squares approach. Moreover, GFR can be estimated without explicit determination of AIF for a given patient, which constitutes the major advantage of the proposed approach. Instead, AIFs must be found only for a limited number of studies included in the training data set. The described algorithm determines the AIF ROI automatically, reducing the effect of the inflow artifact. It simultaneously increases the reliability and precision of the 2CFM parameter estimation. Hence, no additional process is required in assessing patient-specific hemodynamic parameters by utilizing either clinical examinations (MR angiography [36,37] or ultrasound [38]) or non-clinical methods (e.g., photoplethysmographic arterial pulse wave measurement [39]) that would otherwise be necessary to properly trigger the DCE-MRI sequence.

Author Contributions

Conceptualization, A.K.; Data curation, E.E. and A.L.; Investigation, A.K. and M.S.; Methodology, A.K.; Resources, E.E. and A.L.; Software, A.K. and M.K.; Supervision, M.S. and A.L.; Validation, E.E.; Visualization, M.K.; Writing—original draft, A.K. All authors have read and agreed to the published version of the manuscript.

Funding

The paper was partially supported by the Polish National Science Centre grant no. 2014/15/B/ST7/05227.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Appendix A.1. Transforming Signal to Concentration

Assuming the gradient echo sequence, C_tissue in a given time step t is derived from the observed signal S(t) using the transformation

C_{tissue}(t) = \frac{R_{1}(t) - 1 / T_{10}}{r_{1}}

(A1)

where T₁₀ is the longitudinal relaxation time before arrival of the tracer agent bolus, r₁ is the compartment-specific longitudinal relaxivity constant and

R_{1}(t) = - \frac{1}{TR} \ln \left[ \frac{1 - a(t) b}{1 - a(t) b \cos α} \right]

(A2)

with

a(t) = \frac{S(t)}{S_{0}}

(A3)

b = \frac{1 - e^{- R_{10} TR}}{1 - \cos α \, e^{- R_{10} TR}} .

(A4)

In the equations above, α and TR are the MRI sequence parameters of flip angle and repetition time, respectively, S₀ is the signal baseline, i.e., before arrival of the CA bolus, and R₁₀ is the relaxation constant equal to 1/T₁₀. Note that in the case of the tissue signal, the conversion function is valid only under the assumption of equal longitudinal relaxivities in the EEV and IV compartments.
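Equations (A1)–(A4) can be sketched in code as follows. The function `signal_to_concentration` follows the equations term by term; the helper `spgr_signal` implements the standard spoiled gradient echo signal equation and is included only as an assumed forward model for a round-trip check. All numerical values (T₁₀, relaxivity, TR, flip angle, concentration) are hypothetical and do not come from the study protocol.

```python
import math


def signal_to_concentration(s, s0, t10, relaxivity, tr, alpha):
    """Convert a signal sample S(t) to tracer concentration, Eqs. (A1)-(A4)."""
    r10 = 1.0 / t10
    e10 = math.exp(-r10 * tr)
    b = (1.0 - e10) / (1.0 - math.cos(alpha) * e10)  # Eq. (A4)
    a = s / s0  # Eq. (A3)
    ratio = (1.0 - a * b) / (1.0 - a * b * math.cos(alpha))
    r1_t = -math.log(ratio) / tr  # Eq. (A2)
    return (r1_t - r10) / relaxivity  # Eq. (A1)


def spgr_signal(r1_rate, tr, alpha, m0=1.0):
    """Assumed forward model: spoiled gradient echo signal for a given R1."""
    e1 = math.exp(-r1_rate * tr)
    return m0 * math.sin(alpha) * (1.0 - e1) / (1.0 - math.cos(alpha) * e1)


# Hypothetical acquisition and tissue parameters.
t10, relaxivity = 1.2, 4.5  # s, L/(mmol*s)
tr, alpha = 0.004, math.radians(12.0)  # s, rad
true_concentration = 0.5  # mmol/L

s0 = spgr_signal(1.0 / t10, tr, alpha)  # baseline signal
s = spgr_signal(1.0 / t10 + relaxivity * true_concentration, tr, alpha)
recovered = signal_to_concentration(s, s0, t10, relaxivity, tr, alpha)
print(f"recovered concentration: {recovered:.4f} mmol/L")
```

Because the conversion inverts the assumed signal model exactly, the recovered value matches the concentration used to synthesize the signal up to floating-point error.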

Appendix A.2. Assessment of MLP Training Process

The MSE metric used in our study as the loss function driving the MLP training procedure and the MAPE score used to validate training efficiency are defined as follows

MSE = \frac{1}{N M} \sum_{i = 0}^{N - 1} \sum_{j = 0}^{M - 1} {({\hat{θ}}_{i, j} - θ_{i, j})}^{2}

(A5)

MAPE = \frac{1}{N M} \sum_{i = 0}^{N - 1} \sum_{j = 0}^{M - 1} \left| \frac{{\hat{θ}}_{i, j} - θ_{i, j}}{θ_{i, j}} \right|

(A6)

where

θ_{i, j}, {\hat{θ}}_{i, j}

denote the actual and estimated values of the j-th parameter for the i-th training data vector, whereas M and N are the total numbers of parameters and data vectors, respectively. Since the model parameters have different scales, it was necessary to scale their values to the same range so as to balance their contribution to the overall loss function.
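The two metrics in Equations (A5) and (A6) can be sketched as below, together with a simple rescaling step. The min-max scaling to [0, 1] is an assumption, since the scaling method is not specified here; the bounds are the v_p and Δ limits from Table 1, and the parameter values themselves are hypothetical.

```python
def mse(actual, estimated):
    """Mean squared error over N data vectors and M parameters, Eq. (A5)."""
    n, m = len(actual), len(actual[0])
    total = sum(
        (e - a) ** 2
        for row_a, row_e in zip(actual, estimated)
        for a, e in zip(row_a, row_e)
    )
    return total / (n * m)


def mape(actual, estimated):
    """Mean absolute percentage error (as a fraction), Eq. (A6)."""
    n, m = len(actual), len(actual[0])
    total = sum(
        abs((e - a) / a)
        for row_a, row_e in zip(actual, estimated)
        for a, e in zip(row_a, row_e)
    )
    return total / (n * m)


def min_max_scale(rows, bounds):
    """Map each parameter to [0, 1] using its (min, max) bounds (assumed)."""
    return [[(v - lo) / (hi - lo) for v, (lo, hi) in zip(row, bounds)] for row in rows]


# Hypothetical (v_p, delta) pairs: N = 2 data vectors, M = 2 parameters.
theta = [[0.5, 2.0], [0.4, 3.0]]
theta_hat = [[0.55, 1.8], [0.38, 3.3]]
bounds = [(0.2, 0.8), (1.0, 3.5)]  # v_p and delta limits from Table 1

mse_raw = mse(theta, theta_hat)
mse_scaled = mse(min_max_scale(theta, bounds), min_max_scale(theta_hat, bounds))
mape_raw = mape(theta, theta_hat)
```

On the raw values the Δ errors dominate the MSE simply because Δ has a larger numerical range; after scaling, both parameters contribute on a comparable footing.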

References

1. Gameiro, J.; Fonseca, A.J.; Jorge, S.; Lopes, A.J. Acute Kidney Injury Definition and Diagnosis: A Narrative Review. J. Clin. Med. 2018, 7, 307.
2. Thomas, R.; Kanso, A.; Sedor, R.J. Chronic kidney disease and its complications. Prim. Care 2008, 35, 329–344.
3. Vander, A.J.; Sherman, J.H.; Luciano, D.S. Human Physiology. The Mechanism of Body Function, 8th ed.; McGraw-Hill Publishing: New York City, NY, USA, 2001.
4. Stevens, L.A.; Levey, A.S. Measured GFR as a Confirmatory Test for Estimated GFR. J. Am. Soc. Nephrol. 2009, 20, 2305–2313.
5. Hackstein, N.; Heckrodt, J.; Rau, W.S. Measurement of single-kidney glomerular filtration rate using a contrast-enhanced dynamic gradient-echo sequence and the Rutland-Patlak plot technique. J. Magn. Reson. Imaging 2003, 18, 714–725.
6. Annet, L.; Hermoye, L.; Peeters, F.; Jamar, F.; Dehoux, J.P.; Van Beers, B.E. Glomerular filtration rate: Assessment with dynamic contrast enhanced MRI and a cortical compartment model in the rabbit kidney. J. Magn. Reson. Imaging 2004, 20, 843–849.
7. Sourbron, S.P.; Michaely, H.J.; Reiser, M.F.; Schoenberg, S.O. MRI-measurement of perfusion and glomerular filtration in the human kidney with a separable compartment model. Investig. Radiol. 2008, 43, 40–48.
8. Cárdenas-Rodríguez, J.; Li, X.; Whisenant, J.G.; Barnes, S.; Stollberger, R.; Gore, J.C.; Yankeelov, T.E. The basic principles of dynamic contrast-enhanced magnetic resonance imaging. In MR and CT Perfusion and Pharmacokinetic Imaging. Clinical Applications and Theory; Bammer, R., Ed.; Wolters Kluwer: Philadelphia, PA, USA, 2016; Chapter 23; pp. 332–347.
9. Sourbron, S.; Lee, T.-Y. Pharmacokinetic models for dynamic contrast-enhanced computed tomography and magnetic resonance imaging. In MR and CT Perfusion and Pharmacokinetic Imaging. Clinical Applications and Theory; Bammer, R., Ed.; Wolters Kluwer: Philadelphia, PA, USA, 2016; Chapter 29; pp. 412–430.
10. Tofts, P.S.; Cutajar, M.; Mendichovszky, I.A.; Peters, A.M.; Gordon, I. Precise measurement of renal filtration and vascular parameters using a two-compartment model for dynamic contrast-enhanced MRI of the kidney gives realistic normal values. Eur. Radiol. 2012, 22, 1320–1330.
11. Ohn, I.; Kim, Y. Smooth Function Approximation by Deep Neural Networks with General Activation Functions. Entropy 2019, 21, 627.
12. Liu, L.; Ma, D.; Azar, A.T.; Zhu, Q. Neural Computing Enhanced Parameter Estimation for Multi-Input and Multi-Output Total Non-Linear Dynamic Models. Entropy 2020, 22, 510.
13. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; The MIT Press: Cambridge, MA, USA, 2016.
14. Shukla-Dave, A.; Lee, N.; Stambuk, H.; Wang, Y.; Huang, W.; Thaler, H.T.; Patel, S.G.; Shah, J.P.; Koutcher, J.A. Average arterial input function for quantitative dynamic contrast enhanced magnetic resonance imaging of neck nodal metastases. BMC Med. Phys. 2009, 9, 4.
15. Parker, G.J.M.; Roberts, C.; Macdonald, A.; Buonaccorsi, A.; Cheung, S.; Buckley, D.L.; Jackson, A.; Watson, Y.; Davies, K.; Jayson, G.C. Experimentally-Derived Functional Form for a Population-Averaged High-Temporal-Resolution Arterial Input Function for Dynamic Contrast-Enhanced MRI. Magn. Reson. Med. 2006, 56, 993–1000.
16. Eikefjord, E.; Andersen, E.; Hodneland, E.; Hanson, E.A.; Sourbron, S.; Svarstad, E.; Lundervold, A.; Rørvik, J.T. Dynamic contrast-enhanced MRI measurement of renal function in healthy participants. Acta Radiol. 2017, 58, 748–757.
17. Johnson, H.J.; McCormick, M.M.; Ibanez, L. The ITK Software Guide: Design and Functionality, 4th ed.; Kitware: Clifton Park, NY, USA, 2015.
18. Dujardin, M.; Luypaert, R.; Vandenbroucke, F.; Van der Niepen, P.; Sourbron, S.; Verbeelen, D.; Stadnik, T.; de May, J. Combined T1-based perfusion MRI and MR angiography in kidney: First experience in normals and pathology. Eur. J. Radiol. 2009, 69, 542–549.
19. Cutajar, M.; Mendichovszky, I.A.; Tofts, P.S.; Gordon, I. The importance of AIF ROI selection in DCE-MRI renography: Reproducibility and variability of renal perfusion and filtration. Eur. J. Radiol. 2010, 74, e154–e160.
20. Shi, L.; Wang, D.; Liu, W.; Fang, K.; Wang, Y.X.; Huang, W.; King, A.D.; Heng, P.A.; Ahuja, A.T. Automatic detection of arterial input function in dynamic contrast enhanced MRI based on affinity propagation clustering. J. Magn. Reson. Imaging 2014, 39, 1327–1337.
21. Yin, J.; Yang, J.; Guo, Q. Automatic determination of the arterial input function in dynamic susceptibility contrast MRI: Comparison of different reproducible clustering algorithms. Neuroradiology 2015, 57, 535–543.
22. Kim, J.; Im, G.H.; Yang, J.; Choi, D.; Lee, W.J.; Lee, J.H. Quantitative dynamic contrast-enhanced MRI for mouse models using automatic detection of the arterial input function. NMR Biomed. 2012, 25, 674–684.
23. Yoshinobu, S.; Shin, N.; Hideki, A.; Thomas, K.; Guido, G.; Shigeyuki, Y.; Ron, K. 3D Multi-Scale Line Filter for Segmentation and Visualization of Curvilinear Structures in Medical Images; Troccaz, J., Grimson, E., Mösges, R., Eds.; Proc. CVRMed-MRCAS’97, LNCS; Springer: Berlin/Heidelberg, Germany, 1997; pp. 213–222.
24. Bammer, R. MR and CT Perfusion and Pharmacokinetic Imaging. Clinical Applications and Theory; Wolters Kluwer: Philadelphia, PA, USA, 2016.
25. Materka, A.; Mizushina, S. Parametric Signal Restoration Using Artificial Neural Networks. IEEE Trans. Biomed. Eng. 1996, 43, 357–372.
26. Witten, I.H.; Frank, E. Data Mining. Practical Machine Learning Tools and Techniques, 2nd ed.; Morgan Kaufmann, Elsevier: Amsterdam, The Netherlands, 2005.
27. Bottou, L. On-line Learning and Stochastic Approximations. In On-Line Learning in Neural Networks; Saad, D., Ed.; Cambridge University Press: Cambridge, UK, 1998; Chapter 2; pp. 9–42.
28. Chollet, F.; Allison, K.; Wicke, M.; Bileschi, S.; Bailey, P.; Gibson, A.; Allaire, J.J. “Keras”. 2015. Available online: https://keras.io (accessed on 7 August 2020).
29. Tsushima, Y.; Blomley, M.J.; Kusano, S.; Endo, K. Use of contrast-enhanced computed tomography to measure clearance per unit renal volume: A novel measurement of renal function and fractional vascular volume. Am. J. Kidney Dis. 1999, 33, 754–760.
30. Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A Simple Way to Prevent Neural Networks from Overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958.
31. Yuan, Y. Recent advances in trust region algorithms. Math. Program. 2015, 151, 249–281.
32. Lee, V.S.; Rusinek, H.; Bokacheva, L.; Huang, A.J.; Oesingmann, N.; Chen, Q.; Kaur, M.; Prince, K.; Song, T.; Kramer, E.L.; et al. Renal function measurements from MR renography and a simplified multicompartmental model. Am. J. Physiol. Renal Physiol. 2007, 292, F1548–F1559.
33. Tipirneni-Sajja, A.; Loeffler, R.B.; Oesingmann, N.; Bissler, J.; Song, R.; McCarville, B.; Jones, D.P.; Hudson, M.; Spunt, S.L.; Hillenbrand, C.M. Measurement of glomerular filtration rate by dynamic contrast-enhanced magnetic resonance imaging using a subject-specific two-compartment model. Physiol. Rep. 2016, 4, e12755.
34. Chen, B.; Zhang, Y.; Song, X.; Wang, X.; Zhang, J.; Fang, J. Quantitative Estimation of Renal Function with Dynamic Contrast-Enhanced MRI Using a Modified Two-Compartment Model. PLoS ONE 2014, 9, e105087.
35. Zoellner, F.G.; Sance, R.; Rogelj, P.; Ledesma-Carbayo, M.J.; Rørvik, J.; Santos, A.; Lundervold, A. Assessment of 3D DCE-MRI of the kidneys using non-rigid image registration and segmentation of voxel time courses. Comput. Med. Imaging Graph. 2009, 33, 171–181.
36. Klepaczko, A.; Szczypiński, P.; Strzelecki, M.; Stefańczyk, L. Simulation of phase contrast angiography for renal arterial models. Biomed. Eng. OnLine 2018, 17, 41.
37. Klepaczko, A.; Szczypiński, P.; Dwojakowski, G.; Strzelecki, M.; Materka, A. Computer simulation of magnetic resonance angiography imaging: Model description and validation. PLoS ONE 2014, 9, e93689.
38. Ciccone, M.M.; Iacoviello, M.; Gesualdo, L.; Puzzovivo, A.; Antoncecchi, V.; Doronzo, A.; Monitillo, F.; Citarelli, G.; Paradies, V.; Favale, S. The renal arterial resistance index: A marker of renal function with an independent and incremental role in predicting heart failure progression. Eur. J. Heart Fail. 2014, 16, 210–216.
39. Gircys, R.; Kazanavicius, E.; Maskeliunas, R.; Damasevicius, R.; Wozniak, M. Wearable system for real-time monitoring of hemodynamic parameters: Implementation and evaluation. Biomed. Signal. Proc. Control 2020, 59, 101873.

Figure 1. Schematic illustration of the two-compartment filtration model.

Figure 2. Steps of automatic arterial input function (AIF) determination algorithm.

Figure 3. Overview of pharmacokinetic (PK) model parameter fitting using an artificial neural network operating in the training (a) and recall (b) modes. The three-layer perceptron structure is shown only conceptually; the number of hidden units was adjusted during the network design process. The number of output units was always set to 4, i.e., the number of PK model parameters. All layers are fully connected, as in the conventional multi-layer perceptron (MLP) structures, except when the dropout mechanism was switched on, in which case random connections between hidden layers were removed during training to prevent network overfitting.

Figure 4. Examples of automatically determined arterial input function regions of interest (ROIs) (a,b). For comparison, manually delineated ROIs placed directly above the bifurcation of the aorta into the iliac arteries are shown as red rectangles. The corresponding fitted model curves are shown in (c,d) for manual ROIs and in (e,f) for the automatic ones.

Figure 5. Bland–Altman plots of agreement for manually (a) and automatically (b) determined AIF ROIs. Measurements were tested for normality using the Shapiro–Wilk test. The obtained p-values were equal to 0.52 (iohexol-based glomerular filtration rates (GFRs)), 0.29 (image-derived GFRs with manual ROIs) and 0.12 (image-derived GFRs with automatically determined ROIs); thus, the null hypothesis that the data are normally distributed could not be rejected.

Figure 6. Learning curves for various two-layer MLP configurations (training set scores in blue, validation set scores in orange). The plots in the first two and the last two columns correspond to architectures without and with the dropout layer, respectively. The curves in columns 1 and 3 were obtained for the noise-free training data, whereas those in columns 2 and 4 were obtained for the data corrupted with Gaussian noise.

Figure 7. Learning curves for various three-layer MLP configurations (training set scores in blue, validation set scores in orange). The plots in the first two and the last two columns correspond to architectures without and with the dropout layer, respectively. The curves in columns 1 and 3 were obtained for the noise-free training data, whereas those in columns 2 and 4 were obtained for the data corrupted with Gaussian noise.

Figure 8. (a–d): Mean absolute percentage error obtained for various network architectures and with respect to existence of noise in the training and validation data sets.

Figure 9. Predicted against actual values of the Δ parameter in the test set for three configurations of the MLP structure with the dropout layer. The number of neurons in the hidden layers: 30/15 (a,b), 40/20 (c,d), 50/25 (e,f). Plots in the left and right columns represent, correspondingly, noise-free and noisy data.

Figure 10. Determination coefficients for the two-compartment filtration model (2CFM) parameters obtained for various network configurations (test set).

Figure 11. Bland–Altman plot of agreement for MLP-based GFR estimation method. Normal distribution of the measurements was confirmed using Shapiro–Wilk test. The obtained p-values were equal to 0.52 (iohexol-based GFRs) and 0.97 (image-derived MLP-based GFRs).

Figure 12. (a–d): Box plots showing distribution of 2CFM parameter values estimated by MLP and NLLS methods. Provided p-values were obtained for the t-test under the null hypothesis that mean estimated values are the same for both methods.

Table 1. Parameter limits and types of probability distributions used for sampling training data set for ANN (artificial neural network).

Parameter	Min	Max	Mean	SD	Probability Distribution
K^trans [min⁻¹]	0	–	0.25	0.1	Normal
v_p [mm³]	0.2	0.8	–	–	Uniform
Δ [s]	1	3.5	–	–	Uniform
MRT [s]	–	–	5.5	0.7	Normal
T_g [s]	0.02	–	–	–	Normal

Table 2. Number of neurons in the tested three-layer perceptron structures (two hidden layers and an output layer).

Configuration No.	First Hidden Layer	Dropout Rate *	Second Hidden Layer	Output Layer
#1	20	0.0	10	4
#2	20	0.2	10	4
#3	26	0.0	13	4
#4	26	0.2	13	4
#5	30	0.0	15	4
#6	30	0.2	15	4
#7	34	0.0	17	4
#8	34	0.2	17	4
#9	40	0.0	20	4
#10	40	0.2	20	4
#11	50	0.0	25	4
#12	50	0.2	25	4

* Dropout rate = 0.0 means that effectively there was no dropout mechanism between hidden layers.

Table 3. Statistics of the estimated values of the two-compartment filtration model parameters.

Parameter	MLP Min	MLP Max	MLP Mean	MLP Std. Dev.	NLLS Min	NLLS Max	NLLS Mean	NLLS Std. Dev.
K^trans [min⁻¹] (×10⁻²)	0.40	0.50	0.45	0.02	0.39	0.60	0.51	0.03
v_p [mm³]	0.33	0.91	0.62	0.08	0.35	0.86	0.67	0.09
Δ [s]	1.10	1.69	1.34	0.15	1.00	4.97	2.46	0.92
T_g [s]	4.68	5.35	4.95	0.18	1.00	4.70	1.84	0.82

Table 4. Juxtaposition of GFR values estimated by MLP and NLLS methods based on DCE-MRI data acquired in two examination sessions in comparison with ground-truth scores measured with iohexol clearance test [mL/min/1.73 m²].

Subject Number	MLP Session 1 Left Kidney	MLP Session 1 Right Kidney	MLP Session 1 Total	MLP Session 2 Left Kidney	MLP Session 2 Right Kidney	MLP Session 2 Total	NLLS Session 1 Left Kidney	NLLS Session 1 Right Kidney	NLLS Session 1 Total	NLLS Session 2 Left Kidney	NLLS Session 2 Right Kidney	NLLS Session 2 Total	Ground-Truth (Total)
1	48	70	118	56	59	115	48	62	110	69	75	144	107
2	52	62	114	56	71	127	52	67	119	34	48	82	98
3	40	35	75	44	42	86	41	25	66	67	69	136	90
4	43	49	92	46	58	104	49	55	103	72	68	140	93
5	42	45	87	41	44	85	19	34	53	49	41	90	94
6	33	32	65	51	54	105	32	33	65	47	75	122	103
7	50	46	96	55	47	101	72	65	137	71	83	154	112
8	41	48	89	41	47	88	38	34	72	44	42	86	96
9	49	55	104	54	65	119	55	69	124	70	79	149	119
10	64	72	136	52	53	105	58	54	112	63	89	152	112

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Klepaczko, A.; Strzelecki, M.; Kociołek, M.; Eikefjord, E.; Lundervold, A. A Multi-Layer Perceptron Network for Perfusion Parameter Estimation in DCE-MRI Studies of the Healthy Kidney. Appl. Sci. 2020, 10, 5525. https://doi.org/10.3390/app10165525

