- freely available
- re-usable

*Sensors*
**2012**,
*12*(6),
6893-6919;
doi:10.3390/s120606893

^{1}

^{2}

^{*}

## Abstract

**:**This paper deals with the generation of accurate, dense and coloured 3D models of outdoor scenarios from scanners. This is a challenging research field in which several problems still remain unsolved. In particular, the process of 3D model creation in outdoor scenes may be inefficient if the scene is digitalized under unsuitable technical (specific scanner on-board camera) and environmental (rain, dampness, changing illumination) conditions. We address our research towards the integration of images and range data to produce photorealistic models. Our proposal is based on decoupling the colour integration and geometry reconstruction stages, making them independent and controlled processes. This issue is approached from two different viewpoints. On the one hand, given a complete model (geometry plus texture), we propose a method to modify the original texture provided by the scanner on-board camera with the colour information extracted from external images taken at given moments and under specific environmental conditions. On the other hand, we propose an algorithm to directly assign external images onto the complete geometric model, thus avoiding tedious on-line calibration processes. We present the work conducted on two large Roman archaeological sites dating from the first century A.D., namely, the Theatre of Segobriga and the Fori Porticus of Emerita Augusta, both in Spain. The results obtained demonstrate that our approach could be useful in the digitalization and 3D modelling fields.

## 1. Introduction

The construction of geometrical outdoor models, which consist of millions of points and patches, can be carried out with relatively efficiency and accuracy by using current 3D sensors, particularly with phase-shift and time of flight scanners. Colour information can also be acquired with high resolution by using appropriate digital cameras and lenses, which lead to a wide variety of parameters that can be set by hand.

However, the combination of a set of 3D clouds of points and colour images to obtain a photorealistic model is still an area of research which has a number of unsolved problems. The first of these relates to the experimental setup itself. Some scanners have an on-board camera with determined technical characteristics, and others permit the attachment of an external camera that has to be fitted in a specific position. Thus, in the field of digitalization, geometrical information and colour information are usually captured at the same time, under the illumination conditions that exist at the very moment of the 3D data acquisition. This makes the fusion of both types of information highly prone to errors. Under these circumstances, the process of geometrical and colour data acquisition is subdued in terms of both space and time: in terms of space because the sensor and the camera take up established positions during the data acquisition stage, and in terms of time because the geometrical information and the colour information are sensed simultaneously.

Consequently, mismatches commonly occur when combining texture images with 3D geometrical data either due to parallax effects that depend on the baseline between the laser scanner and the optical centre of the camera, or to differences between several colour images taken at different times and, therefore, under different illumination conditions.

The second problem arises when attempting to register and merge the set of data acquired from different positions in a unique representation model. Geometrical registration and 3D data integration problems, of course, were solved a number of years ago. There are numerous approaches in the literature related to registration, many of them based on the original well-known algorithm by Besl and McKay known as ICP (Iterative Closest Point Algorithm) [1]. Zhang presents [2] an important variant of the original algorithm. Xiao et al. [3] modify the ICP and utilize both regional surface properties and shape rigidity constraint to align a partial object surface and its corresponding complete surface. In addition to ICP, some other methods have been proposed for 3D registration. For example, in [4] genetic algorithms are used for registration of 3D data with low overlap and which may include substantial noise. Extensive comparison of registration methods can be seen in [5,6].

For the resolution of the integration problem, several methods have also been proposed. The most important classic ones are those based on “zippering” [7] or volumetric [8] algorithms. In [9] a faster variant of zippering algorithm is presented. Santos et al. propose in [10] a high fidelity merging algorithm that it is used for digital art preservation. Nevertheless, the integration of colour information taken from different viewpoints is an issue that still requires further research.

The problem in texture fusion is mainly caused by the colour disparity between two images. This is brought about by several reasons: First, changes in the angle between the observer direction and the normal of the reflecting surface and, second, variations in illumination conditions in the environments where they were captured. The third reason concerns the camera parameters, which may change from one scan to the next.

Many researchers have proposed solutions based on pairwise colour correction, colour mixing algorithms and image illumination change detection. When the scene is an artificially illuminated interior the difficulty for texture fusion is lessened, and acceptable results are obtained in most of the cases. In some cases, the light of the scene is controlled, so it is assumed that the scene is scanned under constant illumination. This assumption greatly simplifies the colour integration method which consists merely of searching for optimal blending functions [11–14]. For non-controlled scenarios, researchers have come up with only partial solutions. Most of these solutions are based either on global approaches, in which the complete colour image from one position of the scanner is processed (and therefore, modified), or on local strategies, in which only the colours at certain points of the image are corrected. In [15] the colours are inserted by detecting featured points in the image and defining the overlapped regions. Park et al. [16] developed a method that simultaneously fills in holes and associates (not acquired) colour. The study in [17] deals with several images of the scene, which are taken under different light conditions, and corrects discontinuities in the overlapped zones. Troccoli et al. [18] compute a relighting operator by analysing the overlapped area of a pair of images: the source image to be relighted and a target image. They then apply this operator over the non-overlapping region of the source, making it consistent with the target image. In [19], Dellepiane et al. present a colour correction space which is capable of providing a space-dependent colour correction for each pixel of the image. It estimates the flash light position and permits identification and removal of annoying artifacts, such as highlights and shadows. Since many of the developed methodologies for the mapping of colour information on 3D models also include the possibility of discarding parts of the input images or of selectively assigning a weight to contributing pixels, these authors state that their proposal is an interesting addition to such methods

Drastic approaches can be also found. For instance Sun et al. [20] remove, in overlapped regions, the less confident triangles and consider only the texture of the selected patches, avoiding any colour fusion procedure. Bernardini et al. [21] adjust the colours by global colour registration, imposing a restriction that distributes error in the results equally on the model. Finally, Xu et al. [22] integrate texture maps of large structures by decomposing the colour into two frequency bands and combining them.

The other line of research focuses on the obtention of a surface reflectance map of the scene. In this, the 3D model is illumination-free and can be rendered under any illumination conditions. Some interesting methods of obtaining a reflectance function are presented in [23] and [24]. These proposals were developed under laboratory conditions in which light positions are assumed to be known. Later, Debevec et al. [25] estimated the spatially-varying surface reflectance of complex scenes under natural illumination conditions. This approach uses a novel lighting measurement apparatus to record the full dynamic range of both sunlit and cloudy natural illumination conditions. Finally, in [26] the authors present a model of natural illumination as a combination of point source plus ambient component. They are able to compute the actual reflectance and to remove any effects of illumination in the images.

In spite of researchers' efforts over the last decade, the generated models are frequently incomplete or have neither sufficient accuracy nor quality. In particular, when dealing with outdoor scenarios the process becomes much harder. The colour images are severely affected by weather variability (rain and humidity) as well as the time of day at which they are taken. Furthermore, differing orientation and placement of the camera with respect to the surface must once again be taken into account.

This paper addresses the problem of how to build complete photorealistic models by uncoupling the time and particular circumstances in which 3D data and colour of the scene are captured. This issue is tackled from two different perspectives. Firstly, using a complete model (geometry plus texture) obtained after several data acquisition sessions, our aim is to modify the original texture provided on site by the scanner on-board camera with the colour information extracted from external images taken at particular times of day, times of the year (season) or under more suitable environmental conditions. We discuss this in Section 2. Secondly, we aim to keep the processes of geometrical data acquisition and colour acquisition apart, so that they become independent. Thus, the acquisition of colour information with external high-resolution cameras could be executed at chosen times, separately from the data acquisition for the geometrical modelling process, which is not sensitive to time or weather variations. This task therefore involves assigning colour to the geometrical model once it is finished, rather than modifying an existing colour model on it. Section 3 of this paper explains the procedure carried out to do this.

A further aspect to bear in mind with regard to our study relates to the fact that our algorithms increase the automation level in the overall geometry and colour merging process. In particular, we aim to automate two sub-processes: the colour fusion algorithm for several views by using a previous analysis of the global colour of the scene, and the search for the best view in the stage of making the match between the model and the external images. The automation of these phases is intended to go beyond manual and user dependent procedures. Despite this, the current state of the method presented in this paper requires user intervention in some steps. For instance, the user selects the set of external images to be used and can also vary certain specific parameters at the colour integration stage, such as window sizes, masks, thresholds, etc. These aspects are dealt with in Section 4.

Finally, the experimental section shows the results obtained in large outdoor scenarios. Specifically, we present the work carried out in two Roman archaeological sites that date back to the first century A.D., the theatre of Segobriga and the Fori Porticus of Emerita Augusta, both in Spain.

## 2. Colour Modification on Synthetic Models

Let us assume a 3D model **M** of an object obtained from the information acquired with a scanner equipped with an internal low-resolution camera. The rough information provided by the sensor is composed of the 3D coordinates of the points of the object and their corresponding colors in RGB format. A meshing algorithm then yields a patch representation, including vertices and normal vectors. We specifically used the volumetric approach published in [8] to generate the partial meshes. The view registration and geometric integration problem is solved using the k–d tree algorithm published by Zhang in [2]. Given the model **M** (**V, F, N, C**) where:

**V =**{**V**,_{1}, V_{2}**…, V**} are the 3D points of the model, that is, the vertices of the final mesh._{n}**F =**{**F**,_{1}, F_{2}**…, F**} are the patches of the mesh model._{n}**N =**{**N**,_{1}, N_{2}**…, N**} are the normal vectors for each vertex of the mesh model._{n}**C =**{**C**,_{1}, C_{2}**…, C**} are the colour components for each vertex of the mesh model._{n}

Our goal is to modify **C** by using a new set of images taken with external high-quality cameras and to obtain a photo-realistic model **M′** (**V, F, N, C**).We achieve this after the sequence of steps drawn in the sketch shown in Figure 1.

In the first step, we choose two images, **I**_{1} and **I**_{2}. **I**_{1} can be either the image provided by the camera of the scanner itself or an ortho-photo of **M** from a viewpoint close to the viewpoint of the external high-quality camera when taking **I**_{2}. The following step consists of selecting control points in both images. In the software we have created to execute colour modification, these can be marked automatically or by hand. If the automatic option is selected, the algorithm picks several pixels equally separated in the image **I**_{2} and looks for their corresponding ones in **I**_{1}. Then, for every pair of marked pixels, an iterative algorithm is run so that the position of each selected pixel in **I**_{1} is recalculated, and the correspondence error between pixels is minimized. The procedure for the iterative algorithm can be summarized as follows: for a given pair of pixels, a window W_{2} centred in the corresponding pixel from **I**_{2} is set. At the same time, a smaller window W_{1} is selected for the pixel in **I**_{1}. By means of a cross-correlation algorithm all the possible superimpositions of W_{1} onto W_{2} are obtained. The cross-correlation factor R is calculated by following Equation (1).

_{1},

_{xy}is the pixel (x,y) in the image

**I**

_{1}, P

_{2},

_{xy}is the pixel (x,y) in

**I**

_{2}and

**Ī**and

_{1}**Ī**are the arithmetic means of the components of

_{2}**I**

_{1}and

**I**

_{2}, respectively. As is known, the superimposition that gives the cross-correlation factor closest to 1 corrects the correspondence of pixels between

**I**

_{1}and

**I**

_{2}. Figure 2 shows an example detailing several stages of the pixels correspondence process.

The next step consists of correcting the original image **I**_{1}. The correspondence between n pixels from **I**_{1} and **I**_{2} can be expressed by means of a homogeneous transformation **A** in Equation (2):

**P**

_{e}and

**P**

_{m}are the coordinates of the pixels in

**I**

_{1}and

**I**

_{2}, respectively.

Note that the alignment between images **I**_{1} and **I**_{2} is not an affine homography because the scene is not exactly the same for both cases. **I**_{1} and **I**_{2} are taken for different scenes and seen under different optical properties. Thus, **I**_{1} corresponds to the image of a 3D model and is generated under graphical characteristics. In this case, the scene is a 3D model. Nevertheless, **I**_{2} is the real image captured by the camera, which has its own optical properties, and the scene is now the real environment.

For a general case:

If we chose a_{33} = 1, then: | λ_{1}=a_{31}x_{m1}+a_{32}y_{m1}+1 |

λ_{2}=a_{31}x_{m2}+a_{32}y_{m2}+1 | |

⋮ | |

λ_{n}=a_{31}x_{mn}+a_{32}y_{mn}+1 |

Taking these n related points as a starting point, Equation (2) leads to the expression:

**X**

_{p}and

**H**are obtained from

**P**

_{e}and

**P**

_{m}in the following way:

Then we solve the Equation (4) and obtain the transformation matrix between the two images **I**_{1} and **I**_{2}:

Once we have **A**, the final corrected image,
${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$ will be the one obtained after modifying the RGB components with the RGB values of the corresponding pixels in the image **I**_{2}. Let us denote **C** as the vector which contains the three colour components, **C** = [R G B]^{t}. For each pixel of **I**_{1} the colour is modify as follows:

P_{k} and
${\mathit{P}}_{\mathit{k}}^{\mathbf{\prime}}$ being the coordinates of the k-th pixel in **I**_{2} and
${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$, respectively, and C(P_{k}) being the colour information stored in the k-th pixel of **I**_{2} and
$\mathit{C}({\mathit{P}}_{\mathit{k}}^{\mathbf{\prime}})$ the colour information assigned to the k-th pixel in
${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$.

In those cases when the image deformations are not considered, that is, when dealing with a geometric transformation, the following equalities are verified:

At the end of this process, a global correction algorithm is applied to the initial image **I**_{1}. Thus, the transformation that relates the colour components of image **I**_{1} to the colour components of image
${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$ can be modelled by a linear transformation as follows:

**Z**can be computed using Equation (10):

Note that this section is developed taking one view of the original coloured model and therefore **Z** represents a colour transformation for this particular view. Nevertheless, in the case of uniform colour models, transformation **Z** could be used to modify the colour of any partial view of the model. This circumstance frequently occurs in heritage pieces, like marble statues or stone monuments. Formally:

Figure 3 shows an example in which the colour of the whole model of the piece has been modified.

## 3. Applying External Images onto the Geometrical Models

Let us now suppose that we have a geometrical model **M** (**V, F, N**) of a large scenario with no integrated texture and a set of high resolution images taken without any restrictions with an external camera. Restrictions concern any aspect related to the image (size, accuracy, resolution…) and the time at which the images were taken. With the collection of images and the geometry of the scene, a procedure to obtain a final complete model that comprises colour and geometry has been developed.

It is well known that colour assignation can be carried out on-line using a camera and always under an accuracy camera-scanner calibration. As mentioned in Section 1, this involves certain inconveniences in colour integration post-processing. By decoupling geometry and colour with respect to the moment at which the data are taken, our aim is to integrate the texture in the geometry in a controlled off-line process.

Our procedure consists of three main stages:

Images matching. We look for a viewpoint in the geometrical model to generate an ortho-image as similar as possible to an external image.

Generation of the coloured ortho-image.

Colour assignation into the geometrical model.

## 3.1. Images Matching

At this first stage, two images are chosen: an ortho-photo of the 3D geometrical model, which will be denoted as **I**_{1}, and an external image acquired without any restrictions, **I**_{2}. The viewpoint that provides the image **I**_{1} must be defined in such a way that the resulting projected view can be compared to the external image **I**_{2}, as shown in Figure 4. Several transformation matrixes must be stored when projecting the model to assure the reverse transformation from the coloured ortho-image to the 3D model, i.e., the matrix that controls the position of the viewpoint which the user views the model from, the projection transformation matrix, which defines the kind of perspective applied to the 3D model (either orthogonal or orthographical), and the model rotation/translation matrix that stores the rotation and/or translation undergone by the 3D model to be placed in a view close to the external image view. It is also necessary to record the size (in pixels) of the window in which the 3D model is viewed.

#### 3.2. Generation of the Coloured Ortho-Image

To begin this stage, we establish a correspondence between images **I _{1}** and

**I**by marking corresponding pixels manually. These sets of points define two matrixes of 2D coordinates, denoted as P

_{2}_{m}and P

_{e}, can be written as:

The transformation between the two images is modelled as:

**A**being a transformation matrix 3x3 that includes the rotation, translation and deformation of image

**I**

_{1}. We find the values of

**A**by carrying through a sequence of operations as described in Section 2 (Equations (3–6)). We then build a new image with the RGB components of external image

**I**

_{2}and matrix

**A**, that is, we build a new image ${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$, in which the pixels have the colour of their corresponding pixels in

**I**

_{2}, as expressed in Equation (7).

Figure 5 displays some stages of this procedure, such as the selection of the corresponding points in both the projected image and the external one, or the generated coloured ortho-image achieved in the end.

#### 3.3. Colour Assignation in the Geometric Model

Once we have the coloured image generated in the preceding step, we look for the correspondence between each pixel in image
${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$ and the points of the 3D geometrical model. To do so, we start by restoring the point of view of **M**. The basic idea is to recover the size and placement in which the ortho-image **I**_{1} was taken. Therefore, we need the information in the different matrixes stored previously.

As is known, the description of a 3D object can be given by a list of 3D coordinates (points), and the colour associated to these points. An optional point indexation to form faces can also exist. In order to map the image
${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$ onto the model **M**, we use a function which maps the 2D coordinates of a graphical window to the corresponding 3D coordinates of the model. It generates a matrix **I _{3}**

_{D}in which every pixel in the image has an associated point with 3D coordinates. Figure 6(a) illustrates this step. It is understood that the value of the 3D coordinates is calculated by interpolation on a mesh model, but the assignation must be necessarily made to a vertex of the geometrical mesh, as will be discussed below. The matrix

**I**

_{3}_{D}is subjected to a cleaning process to eliminate those elements which have no associated 3D components, in order to purge the assignation of pixels that corresponds to the background of the image ${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$ (Figure 6(b)). For every valid element in matrix

**I**

_{3}_{D}, the closest vertex is sought in the mesh model, generating a new matrix ${\mathit{I}}_{\mathbf{3}\mathit{D}}^{\mathbf{\prime}}$ that consists of the vertex indexes associated to the model (Figure 6(c)). The matrixes ${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$ (

**l**x

**d**x

**3**) and ${\mathit{I}}_{\mathbf{3}\mathit{D}}^{\mathbf{\prime}}$ (

**l**x

**d**) are then combined in a unique matrix

**T**

_{i}(

**k**x

**4**) which contains the vertex indexes of the 3D model and their associated colour in the image ${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$ (Figure 6(d)). Since it may be the case that the same node has more than one colour assigned, the matrix

**T**

_{i}is resized so that each vertex in the model has a unique associated colour, this colour being the mean of the possible colour values assigned in matrix

**T**

_{i}(Figure 6(e)).

## 4. Multi-View Colour-Fusion Algorithm

When the colour assignation for a specific viewpoint is finished, we have a coloured model **M** (**V, F, N, C′**) in which only a part of its vertices has colour information. In order to have the whole model coloured, we need to assign the texture provided by the set of external images to the rest of the vertices. As was mentioned before, the images are taken without restrictions about the point of view and they may or may not overlap.

We carry out an iterative procedure that assigns colour to some of the nodes as new images are matched onto the geometric model. Let us assume a particular step **t** in the iterative process. At this point, we denote **T̃**_{t} as the matrix generated throughout the earlier **t-1** iterations which stores the colour information extracted from **t-1** external images. A new external image can provide texture for some of the vertices that have not yet been assigned, leading to a matrix **T**_{t}** _{+1}** that must be combined with

**T̃**

_{t}to produce a new matrix

**T̃**

_{t}

**(Figure 7). This process is generically denoted by the symbol ⊗ in Equation (13)**

_{+1}In a sense, matrix **T̃**_{t} is a matrix accumulated after **t-1** iterations (see Figure 7). Let us suppose that we have the mesh model without colour in iteration number 0. The colour fusion process is defined depending on the overlapping between the new view and the current status of the model. Thus if **T̃**_{t} y **T**_{t}** _{+1}** have no nodes in common, Equation (13) then becomes

**T̃**

_{t}

_{+1}= T̃_{t}

**∪**.

**T**

_{t}

**Nevertheless, if**

_{+1}**T̃**

_{t}y

**T**

_{t}

**have common nodes the colour merging then becomes a more complex process. We can distinguish between global and local approaches.**

_{+1}Depending on the kind of scene we are working on, we apply either one of the two processes described below or both of them. We will denote the two processes using symbols ➀ and ➁ in Equations (14) and (15):

The chart in Figure 8 illustrates both processes. The process ➀ is illustrated in Figure 8(a). In order to make the notation more readable from here on we will assume that the superscript “**c**” in the following equations will signify the overlapped part of matrices **T̃**_{t} and **T**_{t}** _{+1}** and superscript “

**c̄**” will be the non-overlapped part. Thus ${\tilde{\mathit{T}}}_{\mathit{t}}^{\mathit{c}}$ and ${\tilde{\mathit{T}}}_{\mathit{t}\mathbf{+}\mathbf{1}}^{\mathit{c}}$ are the sub matrixes for the common nodes in matrices

**T̃**

_{t}and

**T**

_{t}

**.**

_{+1}Process ➀ consists of three steps. First, a transformation is applied over
${\mathit{T}}_{\mathit{t}\mathbf{+}\mathbf{1}}^{\mathit{c}}$, and **T**_{t}** _{+1}** turns on

**T′**

_{t}

**. This transformation is obtained by using the colour relationship between ${\tilde{\mathit{T}}}_{\mathit{t}}^{\mathit{c}}$ and ${\mathit{T}}_{\mathit{t}\mathbf{+}\mathbf{1}}^{\mathit{c}}$, which can be modelled by means of transformation Z in Equation (16):**

_{+1}The solution for Z is given by the equation:

Thus, matrix **T**_{t}** _{+1}** is transformed after:

After this, we calculate the average values between
${\tilde{\mathit{T}}}_{\mathit{t}}^{\mathit{c}}$ and
${\mathit{T}}_{\mathit{t}\mathbf{+}\mathbf{1}}^{\mathit{c}\mathbf{\prime}}$ and add the non-common part
${\mathit{T}}_{\mathit{t}\mathbf{+}\mathbf{1}}^{\overline{\mathit{c}}\mathbf{\prime}}$, generating the partial matrix
${\mathit{T}}_{\mathit{t}\mathbf{+}\mathbf{1}}^{\mathit{a}}$ (Equation (19)). **T′**_{t}** _{+1}** then turns on

**T**″

_{t}

**.**

_{+1}Finally,
${\mathit{T}}_{\mathit{t}\mathbf{+}\mathbf{1}}^{\mathit{a}}$ is inserted in **T̃**_{t} and **T̃**_{t}** _{+1}** is generated. We denote the insertion operator by means of the symbol ⊕ in Equation (20):

Process ➁ is illustrated in Figure 8(b,c). It consists of making a colour substitution of ${\tilde{\mathit{T}}}_{\mathit{t}}^{\mathit{c}}$ by using a weighted expression of the colour. The weight factors are based on the observation angles for the nodes of which the colour values are about to be modified.

We start by defining a weighted observation angle for each node of the model which is seen in previous scans. This parameter will be used if we want to take into account local properties in the model. Formally, we define the weighted observation angle as follows.

Let us suppose **N** be a node of **T̃**_{t} and **T**_{t}** _{+1}** with assigned colours

**C̃**

_{t}(

**N**) and

**C**

_{t}

**N**

_{+1}(**), r**espectively. Let

**θ**

_{t}

**(**

_{+1}**N**) be the observation angles of

**N**in the step

**t**, where ${\mathit{\theta}}_{\mathit{t}\mathbf{+}\mathbf{1}}(\mathit{N})\mathbf{=}(\widehat{\overrightarrow{\mathit{n}}\mathbf{,}\overrightarrow{\mathit{l}}})\mathbf{,}\overrightarrow{\mathit{l}}\mathbf{=}{\overrightarrow{\mathit{N}\mathit{R}}}_{\mathit{t}\mathbf{+}\mathbf{1}}\mathbf{,}\overrightarrow{\mathit{n}}$ being the normal at

**N**and

**R**

_{t}

**being the camera position in iteration**

_{+1}**t**. Finally, let

**θ̃**

_{t}(

**N**) be a weighted observation angle which is computed after

**t-1**iterations. The weighted observation angle at iteration

**t**is calculated as follows:

**w**

_{t}

**are based on angles**

_{+1}**θ**

_{t}

**:**

_{+1}**w̃**

_{t}is based on previous weights in the following manner:

In Equation (22)**a** and **b** are two thresholds imposed to the observation angles.

With this algorithm, matrix **T̃**_{t} is modified taking into account the colour and the observation angle for each common node in **T̃**_{t} and **T**_{t}** _{+1}**. The colour updating for a node N in

**T̃**

_{t}is given by the expression:

Where **C̃**_{t}(**N**) is the colour of **N** in **T̃**_{t} and **C**_{t}** _{+1}** is the colour of

**N**in

**T**

_{t}

**, the weights**

_{+1}**w**

_{t}

**and**

_{+1}**w̃**

_{t}are calculated with Equations (22) and (23) from the observation angle s

**θ**

_{t}

**(**

_{+1}**N**) and

**θ̃**

_{t}(

**N**), respectively.

In conclusion, the process of colour merging, depending on the type of scene we are dealing with, can consist of:

A global correction plus average following process ➀; that is, applying Equations (16) to (20).

A local correction depending on the observation angle following process ➁; that is, applying Equations (21) to (24).

A global correction followed by a local correction, which results in:

$${\tilde{\mathit{T}}}_{\mathit{t}\mathbf{+}\mathbf{1}}\mathbf{=}({\tilde{\mathit{T}}}_{\mathit{t}}\mathbf{\u2780}{\mathit{T}}_{\mathit{t}\mathbf{+}\mathbf{1}})\mathbf{\u2781}{\mathit{T}}_{\mathit{t}\mathbf{+}\mathbf{1}}$$

## 5. Experimental Results

In order to illustrate the worthiness of our proposal, we present in this section the results for the reconstruction of two Roman archaeological sites from to the first century A.D., the Roman Theatre of Segobriga and the Fori Porticus of Emerita Augusta (Figure 9). They are both good prototypes of exterior scenarios, huge outdoor spaces with a high degree of difficulty due to the multitude of occlusions that arise when acquiring the surfaces.

For the acquisition of the geometry, we used both a Faro LS 880 laser scanner and a Faro Photon 80 sensor to provide a panoramic view of the scene. Azimuth and elevation angles range over [0, 360°] and [−65°, 90°], respectively, and the maximum data density of the scanner is 40.110 × 17.223 points. Obviously when the data is used in outdoor scenes most of the potential data are missed because they correspond to the sky and only a part of the data are useful. The colour information has been obtained with two Nikon D200 cameras with Nikon AF DX Fisheye lens, one for each scanner. This system composes a single panoramic image from 10 photos, 10.2 Mpixels each. The system takes 1 hour and 49 minutes to scan a panoramic sample plus 8 minutes to take the sequence of photos.

#### 5.1. The Roman Theatre of Segobriga

As usual, the first task we accomplished in situ was the planning of the accessible scanner positions. The volume that could be covered in a single scan is limited by several factors: the field of view, the accessibility of the scanner, the occlusion conditions and the overlap with other scans. The theatre is split into four parts: Cavea, Poedria, Orchesta and Proscaenium. Most of the scans were taken in the Cavea zone because of the multiple occlusions that the stairs generate everywhere in this area. Figure 10 shows the 56 scanner positions we planned initially on a map of the theatre. But as we took new scans we sometimes detected some redundant views that were discarded. Thus, we ended up taking 45 scans of the theatre.

We worked with resolution of up to 4,010 × 1,723 points per scan. The average amount of data and memory requirement per range image at this resolution were 3,084,840 points. It must be remembered that many data are missed in outdoor scenes. The geometric model building stage was run over an Intel^{®} Core™ 2 6400 2.13 GHz with 2 GB RAM computer whereas in the colour model phase we used an Intel^{®} Pentium™ 4 3.2 GHz, 3 GB RAM.

Apart from noise and outliers filtering tasks in the pre-processing stage, we had to correct errors in colour assignment. The system is capable of associating colour information to each point of the scene but, due to small camera alignment errors, the colour-geometry registration must be refined. To solve this problem we set a pair of corresponding points in reflectance and colour images and aligned accurately.

To begin generating the model, the acquired data have to be registered so that they are all in the same coordinate system. Data registration is a well-studied problem, and methods to automatically register laser scans are commercially available. In order to make the registration process more precise, we carried out an initial alignment of the two scans by manually marking related points in the two datasets to be registered. After this, the ICP algorithm was run. Finally, with the aim of having as regular and homogenous a model as possible, we performed normalization and resampling work. Table 1 shows computational times for different steps in the creation of the geometrical model and for several resolutions of the scanner. Resolution R1 corresponds to 4,010 × 1,723 = 6,909,230 points per scan, and R2 to R7 are resolutions that go from 90%R1 to 40%R1. The filtering processes eliminate erroneous points provided by the scanner: points generated in the infinite (spread points), noise (owing to reflections and shiny surfaces) and outliers (points outside the scene). Registration signifies the process of putting in correspondence two overlapped surfaces. After this process a coarse transformation is achieved. The registration refinement stage calculates an accurate alignment between both surfaces. Finally, the merging stage samples the points on the surface and rewraps it in order to make smooth transitions and generate only one cloud of points.

As we have said, one of the most common failures of current conventional 3D sensor devices is that they are unable to make a realistic colour fusion between overlapped samples. To overcome this drawback, we used our colour fusion algorithm in the geometric model generated in the previous phase. Figure 11(a) displays an example of a deficient colour assignment carried out with on-line images which were processed using commercial software. Commercial modelling tools are currently inefficient and do not provide an acceptable texture from different light conditions and scanner positions. This partially can occur owing to the lack of information that the software manages when the colour is assigned to the model. For example, the position of the scanner with respect to the surface of the object might be unknown if the 3D data are loaded from a global reference system. In the majority of the cases, the colour of overlapped nodes from different viewpoints is assigned drastically (RapidForm, Faro Scene). Thus, sometimes the colour of overlapped zone in the last scan replaces the existing colour in the model or, at the much, a simple average is calculated. Apparently, there are no filtering and colour inconsistency detection processes so that the final colour is erroneous. Figure 11(b) shows the result using external off-line images and the colour fusion process explained here. Our solution, as explained throughout the paper, follows a sequential merging strategy that entails three stages: global processing, local processing and filtering. Figure 12 illustrates how the geometric plus colour model is sequentially updated as the colour information of a new scan is integrated.

Colour discontinuities corresponding to typical seam or edge points arise after local correction. The solution to such types of colour discontinuities is addressed here by means of a linear smoothing filter in a 3D environment. We applied a 3D discrete Gaussian filter over the three colour components. After merging all scans several colour-holes may appear. This is due to the fact that the colour merging algorithm imposes several restrictions that a few points do not satisfy. We solved the colour-hole problems following a simple nearest neighbour interpolation algorithm. Figure 13 shows the colour-holes detected in the model in red and the result after the filling process. Finally, we generated the colour mesh model in which the colour of the patch is assigned by means of a bicubic interpolation over the vertices of the patch. Table 2 presents computational cost, memory requirement and number of filled holes concerning the colour assignation stage. The table includes several mesh model resolutions. The primary complete mesh-model was a 746 MB file with 9,587,505 triangular patches and average edge of 1.6 cm. N_{p} symbolizes the number of nodes of the mesh model (in million units). T_{1}, T_{2} and T_{3} concern loading, colour merging and colour-hole filling computation times, respectively. The next column (RAM) is the amount of memory required and N_{h} corresponds to the number of filled patches.

#### 5.2. The Fori Porticus of Emerita Augusta

Following the same procedure as for the theatre model, we also generated a complete geometrical model for the Fori Porticus of Emerita Augusta in Merida that can be seen in Figure 14. We worked with resolution up to 5,014 × 2,153 points per scan. This site needed 25 scans to be completely reconstructed, each having an average of six million points. These data were taken during four sessions, eight hours per session. The final point cloud of the definitive model was reduced to 48,560,845 points, with a memory size of 1.148 GB. To integrate the texture in the model we took, on a different day and over one hour, 126 shots of the site and applied our method. We chose a cloudy but light day so that illumination conditions were the best. The model was run over on two 3.2 GHz, 4 GB RAM personal computers and a graphic server with two processors, six cores each, two threads per core and 32 GB RAM.

Tables 3 and 4 present approximate computational times for different stages and scanner resolutions concerning the geometrical model generation and colour assignation processes respectively. Resolution R1 corresponds to 5,014 × 2,153 points per scan, and R2 to R7 are resolutions that go from 90%R1 to 40%R1. Table 3 shows, besides the times concerning the registration between two range images, the time required to obtain the whole mesh model (“Meshing”) and the time required to normalize (edge regularization) and refinement (mesh smoothing). Table 4 provides details of specific steps referred to in Sections 2, 3 and 4: extraction of the ortho-image from the 3D model, correspondence between the model and the external image, colour filling process and colour merging in the whole coloured model.

Figures 15 to 18 highlight some moments of the sequence of steps carried out to assign colour from external images to the geometrical model built.

Finally, we present a set of pictures to show the results obtained after applying our method to the Fori Porticus of Emerita Augusta in comparison to the previous coloured model generated by using the colour information provided by the scanner on-board camera (Figure 18). The improvement in the colour model generated from external images can clearly be seen.

## 6. Conclusions

Building accurate and complete (geometry+texture) 3D models of large heritage indoor/outdoor scenes from scanners is still a challenging and unresolved issue on which many researchers are currently working [27,28]. In the last decade technology relating to 3D sensors has significantly improved in many respects and the devices are now more flexible, more precise and less costly than in the past. Consequently, we believe that better reverse engineering results will be obtained in this area. Processing of information, nevertheless, may be inefficient if the scene is digitalized under unsuitable conditions. This frequently occurs in outdoor scenarios in which the illumination and the environment conditions of the scene cannot be controlled. This paper proposes to improve the existing modelling techniques by decoupling the colour integration and geometry reconstruction stages. Colour assignation in particular is carried out as an independent and controlled process. The method has basically two parts: colour assignation for one shot of the scanner and colour fusion of multiple views.

Colour assignation focuses firstly on modifying low-quality coloured 3D models by using sets of images of the scene that have been taken at different times and circumstances than those of the geometrical information capture phase. In this approach a colour transformation is obtained after matching the model's orthoimages with external images. The second line is addressed to directly assigning external images to the complete geometric model. Both approaches are quite simple and can easily be implemented in practice.

The merging colour algorithm is proposed in the last part of the paper. This stage is carried out under an iterative process in which the next external image is automatically integrated into the ‘as-built’ model. The result is a complete coloured model that can be refined in the last step.

The proposed method has been widely tested in large outdoor scenes with very good results; the present paper shows the results obtained at two popular heritage sites in Spain. We have achieved high-quality photorealistic models in all tests we have performed. Comparison with former models is also illustrated at the end of the paper.

## Acknowledgments

We would like to express our appreciation to the staff of both the Segobriga Archaeological Park and the Consortium of the Monumental City of Mérida, for their assistance in our work. We would also like to thank the Spanish Ministry of Science and Innovation, the Castilla La Mancha Government and the Extremadura Regional Government for financing it through projects DPI2009-14024-C02-01, PCI08-0052-1401 and PRI09C088.

## References

- Besl, P.J.; McKay, H.D. A method for registration of 3-D shapes. IEEE Trans. Pattern Anal. Mach. Intell.
**1992**, 14, 239–256. [Google Scholar] - Zhang, Z. Iterative point matching for registration of free-form curves and surfaces. Int. J. Comput. Vis.
**1994**, 13, 119–152. [Google Scholar] - Xiao, G.; Ong, S.H.; Foong, K.W.C. Efficient partial-surface registration for 3D objects. Comput. Vis. Image Underst.
**2005**, 98, 271–293. [Google Scholar] - Silva, L.; Bellon, O.R.P.; Boyer, K.L. Precision range image registration using a robust surface interpenetration measure and enhanced genetic algorithms. IEEE Trans. Pattern Anal. Mach. Intell.
**2005**, 27, 762–776. [Google Scholar] - Salvi, J.; Matabosch, C.; Fofi, D.; Forest, J. A review of recent range image registration methods with accuracy evaluation. Image Vis. Comput.
**2007**, 25, 578–596. [Google Scholar] - Santamaría, J.; Cordón, O.; Damas, S. A comparative study of state-of-the-art evolutionary image registration methods for 3D modelling. Comput. Vis. Image Underst.
**2011**, 115, 1340–1354. [Google Scholar] - Turk, G.; Levoy, M. Zippered Polygon Meshes from Range Images. Proceedings of the 21st Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH '94), Orlando, FL, USA, 24–29 July 1994; pp. 311–318.
- Curless, B.; Levoy, M. A Volumetric Method for Building Complex Models from Range Images. Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques, New Orleans, LA, USA, 4–9 August 1996; pp. 303–312.
- Di Angelo, L.; di Stefano, P.; Giaccari, L. A new mesh-growing algorithm for fast surface reconstruction. Comput. Aided Des.
**2011**, 43, 639–650. [Google Scholar] - Santos, J., Jr.; Bellon, O.; Silva, L.; Vrubel, A. Improving 3D reconstruction for digital art preservation. Lect. Notes Comput. Sci.
**2011**, 6978, 374–383. [Google Scholar] - Wang, L.; Kang, S.B.; Szeliski, R.; Shum, H.-Y. Optimal Texture Map Reconstruction from Multiple Views. Proceedings of the 2001 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 8–14 December 2001; pp. 347–354.
- Callieri, M.; Cignoni, P.; Corsini, M.; Scopigno, R. Masked photo blending: Mapping dense photographic data set on high- resolution sampled 3D models. Comput. Graph.
**2008**, 32, 464–473. [Google Scholar] - Goldluecke, B.; Cremers, D. Superresolution texture maps for multiview reconstruction. IEEE Int. Conf. Comput. Vis.
**2009**, 2009, 1677–1684. [Google Scholar] - Liu, X.M.; Peng, X.; Yin, Y.K.; Li, A.M.; Liu, X.L.; Wu, W. Generation of photorealistic 3D image using optical digitizer. Appl. Opt.
**2012**, 51, 1304–1311. [Google Scholar] - Wyngaerd, J.V.; van Gool, L. Combining Texture and Shape for Automatic Crude Patch Registration. Proceedings of the Fourth International Conference on 3D Digital Imaging and Modeling, Banff, Canada, 6–10 October 2003; pp. 179–186.
- Park, S.; Guo, X.; Shin, H.; Qin, H. Shape and Appearance Repair for Incomplete Point Surfaces. Proceedings of the 10th IEEE International Conference on Computer Vision, Beijing, China, 17–20 October 2005; pp. 1260–1267.
- Bannai, N.; Fisher, R.B.; Agathos, A. Multiple color texture map fusion for 3D models. Pattern Recognit. Lett.
**2007**, 28, 748–758. [Google Scholar] - Troccoli, A.; Allen, P. Building illumination coherent 3D models of large-scale outdoor scenes. Int. J. Comput. Vis.
**2008**, 78, 261–280. [Google Scholar] - Dellepiane, M.; Callieri, M.; Corsini, M.; Cignoni, P.; Scopigno, R. Improved color acquisition and mapping on 3D models via flash-based photography. J. Comput. Cult. Herit
**2010**, (2). [Google Scholar] [CrossRef] - Sun, Y.; Dumont, C.; Abidi, M. Mesh-based Integration of Range and Color Images. Proceedings of the SPIE Conference on Sensor Fusion: Architectures, Algorithms, and Applications IV, Orlando, FL, USA, 24–28 April 2000; Volume 4051. pp. 110–117.
- Bernardini, F.; Martin, L.M.; Rushmeier, H. High-quality texture reconstruction from multiple scans. IEEE Trans. Vis. Comput. Graph.
**2001**, 7, 318–332. [Google Scholar] - Xu, C.; Georghiades, A.; Rushmeier, H.; Dorsey, J. A System for Reconstructing Integrated texture Maps for Large Structures. Proceeding of 3rd International Symposium on 3D Data Processing Visualization and Transmission, Chapel Hill, NC, USA, 14–16 June 2006; pp. 822–829.
- Debevec, P.; Hawkins, T.; Tchou, C.; Duiker, H.-P.; Sarokin, W.; Sagar, M. Acquiring the Reflectance Field of a Human Face. Proceedings of the 27th Annual Conference on Computer Graphics and Interactive Techniques, New Orleans, LA, USA, 23–28 July 2000; pp. 145–156.
- Lensch, H.P.A.; Kautz, J.; Goesele, M.; Heidrich, W.; Seidel, H.-P. Image-based reconstruction of spatial appearance and geometric detail. ACM Trans. Graph.
**2003**, 22, 234–257. [Google Scholar] - Debevec, P.; Tchou, C.; Gardner, A.; Hawkins, T.; Poullis, C.; Stumpfel, J.; Jones, A.; Yun, N.; Einarsson, P.; Lundgren, T.; Fajardo, M.; Martinez, P. Estimating Surface Reflectance Properties of a Complex Scene under Natural Captured Illumination; USC ICT Technical Report ICT-TR-06.2004; University of Southern California: Los Angeles, CA, USA, 2004. [Google Scholar]
- Troccoli, A.; Allen, P. Recovering Illumination and Texture Using Ratio Images. Proceeding of the 3rd International Symposium 3D Data Processing, Visualization and Transmission, Chapel Hill, NC, USA, 14–16 June 2006; pp. 655–662.
- Remondino, F. Heritage recording and 3D modeling with photogrammetry and 3D scaning. Remote Sens.
**2011**, 3, 1104–1138. [Google Scholar] - Gonzalez, D.; Muñoz, A.; Gómez, J.; Herrero, J.; Gutierrez, G. 3D modelling surveying and modelling of cave geometry: Application to palaeolithic rock art. Sensors
**2009**, 9, 1108–1127. [Google Scholar]

**Figure 2.**(

**a**) Selection of relevant pixels in images

**I**

_{1}(taken from the coloured model) and

**I**

_{2}(external image) and one pair of pixels chosen to run the algorithm of error minimization; (

**b**) Corrected position of the chosen pixel in

**I**

_{1}depicted versus the position of its related pixel in

**I**

_{2}.

**Figure 3.**Correction of texture: (

**left**) Corrected model assuming a pure geometric transformation between I

_{1}and I

_{2}; (

**right**) Result by considering image deformations.

**Figure 4.**(

**left**) External image

**I**

_{2}; (

**right**) Uncoloured 3D model and chosen view to obtain image

**I**

_{1}.

**Figure 5.**Generation of the coloured ortho-image: (

**a**) Selection of the corresponding pairs of points onto the 3D geometrical model ortho-image and the external image; (

**b**) Result of the colour transformation method on

**I**

_{1}(left) to obtain ${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$ (right); (

**c**) Comparison between the original external image

**I**

_{2}(left) and the generated coloured ortho-image ${\mathit{I}}_{\mathbf{1}}^{\mathbf{\prime}}$ (right).

**Figure 6.**(

**a**) Mapping of the 2D information in the 3D coordinates; (

**b**) Elimination of the point coordinates that do not belong to the geometrical model; (

**c**) Definition of the vertex indexes associated to the model; (

**d**) Storage of the vertex indexes and their associated colour; (

**e**) Assignation of a unique colour to each vertex in the geometrical model.

**Figure 7.**(

**a**) Assigning images to the mesh model and evolution of matrix

**T̃**

_{i}; (

**b**) Integration of a new image.

**Figure 9.**Pictures of the Roman Theatre of Segobriga (

**left**) and the Fori Porticus of Emerita Augusta (

**right**).

**Figure 11.**Erroneous colour fusion made with commercial software (

**a**) and the result after applying our algorithm (

**b**).

**Figure 12.**(

**1**–

**2)**. Two coloured scans, (

**3)**Matching of coloured scan 1 onto the geometrical model, (

**4)**Result after merging scan 2 with the current coloured model.

**Figure 13.**Colour holes depicted in red (left); Result for the nearest neighbour filling algorithm run on them (

**right**).

**Figure 15.**(

**a**) Selection of the corresponding pairs on the 3D geometrical model ortho-image and the external image; (

**b**) Result of the colour transformation method on the ortho-image (left) to obtain the coloured ortho-image (right); (

**c**) Colour information mapped onto the 3D model.

**Figure 17.**External images taken under appropriate environmental conditions (

**top**); Evolution of the coloured model as the colour information is added (

**bottom**).

**Figure 18.**Pictures of a previous coloured model generated from the images provided for the scanner on-board camera (

**left**) in comparison with pictures of the photorealistic model created from the geometrical model and a set of external images taken without any restrictions (

**right**).

**Table 1.**Average computational times in different processing phases of theTheatre of Segobriga geometrical model and for several scanner resolutions.

Process | R1 | R2 | R3 | R4 | R5 | R6 | R7 |
---|---|---|---|---|---|---|---|

Spread points filtering (one scan) | 21s | 20s | 20s | 16s | 16s | 13s | 12s |

Outliers and noise filtering (one scan) | 6s | 6s | 5s | 5s | 5s | 3s | 3s |

Normalization and resampling (one scan) | 49s | 40s | 34s | 22s | 12s | 9s | 8s |

Registration (two scans) | 4m35s | 3m13s | 1m40s | 1m14s | 1m14s | 1m14s | 1m14s |

Registration refinement (two scans) | 4m59s | 7m11s | 6m33s | 5m49s | 5m04s | 4m35s | 3m31s |

Merging (two scans) | 21s | 21s | 20s | 17s | 16s | 15s | 15s |

N_{p} | T_{1} | T_{2} | T_{3} | RAM | N_{h} |
---|---|---|---|---|---|

2,93 | 30s | 1h | 1h 53m | 115 MB | 32524 |

6,02 | 53s | 1h 21m | 3h 55m | 195MB | 61096 |

7,88 | 1m 10s | 1h 47m | 5h 20m | 143MB | 82959 |

9,58 | 1m 28s | 2h 10m | 8h 5m | 298MB | 102314 |

**Table 3.**Computational times for different stages and scanner resolutions of the Fori Porticus geometrical model.

Process | R1 | R2 | R3 | R4 | R5 | R6 | R7 |
---|---|---|---|---|---|---|---|

Registration (two scans) | 2m | 1m42s | 1m33s | 1m24s | 1m22s | 1m19s | 1m20s |

Registration refinement (two scans) | 1m | 56s | 50s | 44s | 42s | 37s | 33s |

Meshing (complete model) | 30m | 27m | 24m | 21m | 18m | 15m | 12m |

Normalization and refinement of the final mesh (complete model) | 8h | 7h | 5h | 3h | 1h | 30m | 10m |

**Table 4.**Computational times for several steps and scanner resolutions in the Fori Porticus coloured model building.

Stages | R1 | R2 | R3 | R4 | R5 | R6 | R7 |
---|---|---|---|---|---|---|---|

Pre-processing of each view (normal calculation, format conversion, etc.) | 5 m | 4m30s | 4m | 3m30s | 3m | 2m30s | 2m |

Definition of the ortho-image from the 3D model | 2m | 1m47s | 1m35s | 1m23s | 1m13s | 1m2s | 45s |

Definition of corresponding pairs between the model and the external image | 3m | 3m10s | 3m20s | 3m30s | 3m45s | 4m5s | 5m |

New colour modification/assignation | 1m | 56s | 50s | 43s | 35s | 31s | 23s |

Filling of colour-holes | 10m | 9m6s | 8m10s | 7m15s | 6m25s | 5m30s | 4m |

Merging of colour with the whole coloured model | 3m | 2m40s | 2m20s | 2m | 1m49s | 1m36s | 1m10s |

© 2012 by the authors; licensee MDPI, Basel, Switzerland This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).