1. Introduction
Recent advancements in histopathology have been driven by innovations in digital imaging and computational analysis [
1]. A critical area of focus is evaluating single-cell data within specific regions of interest (ROIs) from high-resolution histological images, particularly for cancer research, immunohistochemistry (IHC), and tissue microenvironment studies. The ability to accurately identify and quantify individual cells within a defined ROI even in a simple brightfield image has significant implications for diagnosis, prognosis, and therapeutic decisions [
2]. In fact, it is worth noting that brightfield images are currently used for several pre-clinical and clinical analyses. For instance, multiplexed bright-field methodologies and platforms are, today, the reference for the analysis of predictive markers for immunotherapy [
3].
The rise of two-dimensional (2D) whole slide imaging (WSI) and the need for efficient, scalable tools to manage, analyse, and interpret these large datasets has led to the development of various software platforms [
4]. These platforms range from commercial to open-source solutions [
5], from generalist [
6] to specific tools [
7], and offer distinct features to accommodate the increasing complexity of modern histological analysis [
8]. Despite the availability of such tools, there is a critical need for a comprehensive comparison to guide researchers in selecting the most appropriate software for specific tasks, particularly those involving single-cell analysis within freehand-selected histological ROIs on brightfield images. For example, the authors in [
5] recently discussed the growing role of artificial intelligence (AI) in the field of digital pathology, introducing some open-source segmentation tools and analysing their usage and customisation in histological imaging. Similarly, the authors in [
9] examined the latest AI and deep learning (DL) techniques for cell analysis and data mining in microscopy images. It is important to note that whilst most available tools permit rectangular ROI selection, very few allow for true irregular freehand selection, and many are optimised for fluorescent images rather than classical widefield brightfield microscopy [
10].
This study provides a detailed evaluation of 14 popular tools, encompassing commercial, freely available, and open-source solutions. Specifically, six open-source tools (i.e.,
CellProfiler [
11],
Cytomine [
12],
Digital Slide Archive [
13],
Icy [
14],
ImageJ/Fiji [
15,
16],
QuPath [
17]), four freely available tools (i.e.,
Aperio ImageScope,
NIS Elements Viewer,
Sedeen,
SlideViewer), and four commercial tools (i.e.,
Amira,
Arivis,
HALO,
Imaris) have been considered. The evaluation focuses on aspects important to researchers, including the ability to handle large file formats (e.g., scanner vendor-specific format—SVS, digital imaging and communications in medicine—DICOM, tagged image file format—TIFF), flexibility in defining and extracting ROIs, and the functionality for classifying single cells on brightfield images using automated, semi-automated, or manual techniques.
In this work, a benchmark analysis was conducted using a representative irregular freehand-defined ROI from a brightfield WSI red, green, and blue (RGB) case, providing quantitative insights into the performance of currently available software tools. Notably, most of these tools are originally designed for multi-channel images, the most common image type in the fluorescence domain. Although brightfield multi-channel immunohistochemistry images can be generated by sequentially staining the sample [
18], or through multispectral imaging in the range of the visible light [
19], we focused the present study on the most prevalent case in brightfield microscopy, the RGB image type [
20]. The results indicate that several tools now offer highly accurate solutions. For example, the open-source software
QuPath stands out for its comprehensive feature set, comparable to those provided by commercial solutions with annual costly license fees ranging from a few thousand to EUR 25,000, depending on the desired configuration and service. However, it is important to note that evaluating the commercial tools was challenging due to the restricted functionality of the testing license. As such, the reported performance values can be considered a lower bound, with potential for improvement under unrestricted conditions. In conclusion, this analysis serves as a valuable guide to the current options available for single-cell histological analysis, helping researchers better understand what to expect for their specific analytical needs.
The remainder of this article is structured as follows:
Section 2 provides a detailed overview of the single-cell analysis tools available today.
Section 3 presents a qualitative comparison of the different tools, focusing on performance in handling large datasets and the capabilities of defining ROIs and conducting automated analysis.
Section 4 introduces the case study used for the quantitative comparison of the tools, with the results and discussion following in
Section 5. Finally,
Section 6 concludes the paper by summarising the key findings and offering recommendations for researchers seeking to select the most suitable tool for analysing brightfield images.
4. Quantitative Comparison
In order to evaluate the performance of the different tools, (
a) the total number of cells and (
b) the number of cells positive to some specific markers within a specific irregular freehand-defined ROI of a representative brightfield image (
Figure 4a) were counted using the different tools.
The freehand-defined ROI (
Figure 4b) was defined on an extracted 2346 × 1352 full-resolution rectangular image within a large high-resolution WSI of an anonymised IHC-stained tissue section of a head and neck squamous cell carcinoma (HNSCC) tumour biopsy, precisely, a tongue cancer, used as a sample dataset (
Figure 4c). The image was acquired using a brightfield
Aperio CS2 microscope (
Leica Biosystems), equipped with a 40× objective. The section was stained using three different markers: nuclei are stained with haematoxylin (visible in blue), CD8-positive lymphocytes with DAB (
3,
3′-Diaminobenzidine, visible in brown), and CD163-positive macrophages with AP-red (
Alkaline Phosphatase Red, visible in red). Haematoxylin (
Figure 4d), DAB (
Figure 4e), and AP-red (
Figure 4f) channels were extracted and saved as different image files using the
Colour Deconvolution tool, an
ImageJ/Fiji plugin for stain unmixing in RGB histological images [
28].
The representative rectangular image extracted from the WSI file, the binary mask representing the irregular freehand-defined ROI, the haematoxylin image, the DAB image, and the AP-red image are freely available as
Supplementary File S1.
The ground truth was established by an expert life scientist who manually counted the total number of cells (i.e., 823), as well as the number of cells positive for DAB (i.e., 135) and AP-red (i.e., 98) within the irregular freehand-defined ROI. Each tool was then tested by the same operator, an expert computer scientist, using the same previously freehand-defined ROI saved and loaded into the different tools. It is worth noting that the computer scientist testing the tools was trained by the life scientist responsible for creating the ground truth, learning to recognise cellular structures and identify various staining signals on ROIs similar to, but different from, the one used in the experiments.
All the tools unable to load the previously defined irregular freehand-defined ROI or unable to detect cells stained with DAB and/or AP-red were discarded, remaining just: (1) Amira, (2) Arivis, (3) HALO, (4) ImageJ/Fiji, (5) NIS-Elements (in this case we did not use NIS-Elements Viewer but its commercial version NIS-Elements with several segmentation and classification functionalities), (6) QuPath. The tested tools were installed on a workstation with the following configuration: an Intel Core i7 processor, 32 gigabyte (GB) random access memory (RAM), and NVIDIA GTX 1080 graphics card. Software packages were obtained from official sources, with open-source and freely-available tools downloaded from their respective websites on May 2024, while commercial tools were tested using trial time-limited licenses provided by vendors. Setups were completed by following the user manuals or installation guides, ensuring that all necessary dependencies and configurations were met for each software.
5. Results and Discussion
Among the 14 tools initially reviewed, only six met the requirements for the experiments: They supported loading the predefined irregular freehand-selected ROI and analysing both nuclear (i.e., haematoxylin) and cellular (i.e., DAB and AP-red) signals. Precisely, the tools considered in the experiments were (hereafter reported in alphabetical order) (1) Amira (commercial), (2) Arivis (commercial), (3) HALO (commercial), (4) ImageJ/Fiji (open source), (5) NIS-Elements (commercial version of the freely available NIS-Elements Viewer, including options for importing image ROIs and performing automatic single-cell segmentation/classification), and (6) QuPath (open source). What follows summarises the parameters applied and strategy used for testing the following single-cell analysis software tools (hereafter reported in alphabetical order):
(1) Amira: Amira is a commercial tool, and the testing was conducted using a license provided by the selling company for a limited evaluation period. Accordingly, the reported values can be considered a lower bound, with potential for improvements. For the segmentation, we analysed the haematoxylin, DAB, and AP-red images separately. A basic global thresholding step was followed by the application of the standard Watershed algorithm to segment and split objects, as well as to remove small debris. The key internal parameters applied during the process were as follows: threshold range, 0–120 grey levels; Watershed option, DARK; global segmentation range, 1.00–6.75; segmentation level, 0.6; and minimum object area, 30 pixels.
(2) Arivis: Arivis is a commercial tool, and the testing was conducted in collaboration with specialists from ZEISS, the selling company, through an online meeting aimed at ensuring proper usage of the software. This collaborative effort allowed us to apply the tool to brightfield images as effectively as possible while acknowledging that the tool was primarily developed for fluorescent images. However, the limited time of interaction with the specialists made the reported values just a lower bound with a high potential for improvements. For instance, pre-training specific networks could obviously provide better accuracy. For the segmentation, we separately considered the haematoxylin, DAB, and AP-red images. Built-in machine learning algorithms originally validated for fluorescent images and not optimised for brightfield applications were used. Following segmentation, the internal Blob Finder algorithm was used to count both nuclei and cells with positive signals. The preliminary results were further refined by applying area and circularity filters to reduce false positives and debris.
(3) HALO: HALO is a commercial tool, and the testing was conducted in collaboration with specialists from Indica Labs, the selling company, through an online meeting to ensure proper usage and application of the software. This collaborative effort enabled us to apply the tool to brightfield images as effectively as possible while acknowledging that the tool was primarily developed for fluorescent image analysis. However, the limited time of interaction with the specialists made the reported values just a lower bound with a high potential for improvements. For segmentation, we analysed the haematoxylin, DAB, and AP-red images separately. An available pre-trained neural network, optimised for images similar to the haematoxylin stain, was used for nuclei segmentation, yielding extremely highly accurate results. However, no available neural networks had been trained on images resembling the DAB and AP-red stains, resulting in very poor accuracy when identifying cells positive for these markers. Obviously, specific pre-trained networks could provide better accuracy.
(4) ImageJ/Fiji: ImageJ/Fiji is an open-source tool providing several image processing options and plugins. For the segmentation, we separately considered the haematoxylin, DAB, and AP-red images. Each image has been converted into a binary mask applying a basic global thresholding step, followed by the application of the standard Watershed algorithm to segment and split objects, as well as to remove small debris. Finally, the Particle Analysis plugin was used for counting the objects. The key internal parameters applied during the process were as follows: threshold range, 0–175 grey levels; circularity, 0.00–1.00; minimum object area, 60 pixels; and removal of the objects touching borders, Yes.
(5) NIS-Elements: NIS-Elements is a commercial tool, and the testing was conducted using a license acquired by the selling company. NIS-Elements provides several image processing options. For the segmentation, we separately considered the haematoxylin, DAB, and AP-red images. Each image has been converted into a binary mask by applying a basic global thresholding step, followed by the application of internal built-in filters for smoothing the images, separating objects, and removing small debris. The key internal parameters applied during the process were as follows: threshold range, 0–225 grey levels; smooth, 2x; separate, 2x; clean, 2x; all filling, No.
(6) QuPath: QuPath is an open-source tool providing several image processing options and macros. It also provides automatic neural networks and options for different microscopy image classes, including specific opportunities for histological brightfield images. For the segmentation, we just considered the original rectangular RGB image. In particular, it was processed using an automatic segmentation option available for brightfield images, providing as output both the single-nucleus and the single-cell segmentations. Those segmentations have been then analysed using a supervised classification neural network available for counting the number of nuclei and the number of DAB- and AP-red-positive cells within the irregular freehand-defined ROI. The key internal parameters applied during the process were as follows: pixel size, 0.5 µm; nuclear background radius, 8 µm; median filter radius, 0 µm; sigma, 1.5 µm; minimum nuclear area, 8 µm2; maximum nuclear area, 400 µm2; threshold value, 0.1; maximum background intensity, 2.0; and cellular expansion, 5 µm.
The other eight tools were excluded for specific reasons. Aperio ImageScope (freely available), Sedeen (freely available), and SlideViewer (freely available) lacked semi- and fully-automatic classification functionalities. CellProfiler (open source), Cytomine (open source), Digital Slide Archive (open source), and Icy (open source) failed to import the irregular freehand-defined ROI or required advanced preprocessing of the input image, unsuitable for a fair comparison of the different tools. Imaris (commercial), full of opportunities for fluorescent imaging, lacks segmentation methods for brightfield images.
Figure 5 shows the segmentations obtained using the different tools in a real case study.
Table 4,
Table 5,
Table 6 and
Table 7 report the counts, the absolute normalized difference percentage (
PAND), and the rank position obtained by the different tools when analysing the signals in the preselected irregular freehand-defined ROI. The
PAND between value (
Vi) recorded by the tool (
i) and ground truth (
G), was computed to measure the closeness of agreement of the different counts according to Equation (1) [
29]:
Table 4 reports the number of nuclei counted considering the haematoxylin signal,
Table 5 reports the number of DAB positive cells,
Table 6 reports the number of AP-red positive cells, and
Table 7 reports the mean
PAND and the final rank obtained by averaging together the single
PANDs computed for the three different signals (i.e., haematoxylin, DAB, and AP-red).
The results reported in
Table 4 demonstrate that for nuclei counting using the haematoxylin signal, most tools performed exceptionally well, with a
PAND below 10%, except for
Arivis, which showed a higher error rate. Notably,
QuPath and
HALO achieved the highest accuracy, with
PAND values close to 1%. This strong performance is due to segmentation algorithms specifically optimised for nuclei detection in haematoxylin-stained images, a common benchmark for many tools. In general, the relatively low variability across tools reflects the maturity of algorithms for this task, as haematoxylin staining provides strong contrast and clear boundaries for nuclei segmentation. In contrast, the results in
Table 5 and
Table 6 highlight the greater challenge of accurately detecting specific cellular markers, such as DAB and AP-red. For these markers,
PAND values were generally higher, often exceeding 10%, indicating a notable decrease in accuracy compared to nuclei counting. Among the evaluated tools,
QuPath consistently stood out, achieving robust detection capabilities with
PAND values around 5% for both markers. This suggests that
QuPath’s segmentation and classification algorithms are better adapted to handling the variability and complexity of brightfield image data, which is less uniform and more prone to staining inconsistencies than fluorescence-based imaging. Lastly, the summary in
Table 7 underscores the limitations of certain tools, particularly
Arivis and
HALO, which exhibited average
PAND values exceeding 20%. This underperformance is attributed to the segmentation and classification models implemented in these tools, which are pre-trained and optimised for fluorescence imaging applications. These models struggle to generalise effectively to brightfield images, which involve different optical characteristics, such as variations in illumination and colourimetric dependencies. This highlights a broader challenge in adapting tools initially designed for one imaging modality to another, emphasising the importance of algorithm flexibility and data-driven optimization. In addition, it is important to remember that evaluating the commercial tools was challenging due to the restricted functionality of the testing license. For instance, it was not possible to pre-train specific neural networks for the analysed signals as it was possible to do in
QuPath. As such, the reported performance values represent a baseline, with potential for improvements.
6. Conclusions and Perspectives
In this work, we reviewed and compared the software currently available for performing single-cell analysis of histological brightfield images. Specifically, we focused on tools capable of analysing stained cells within irregular freehand-defined ROIs. In total, we considered 14 tools (six open-source, four freely available, and four commercial ones) and we practically compared six of them by performing a case study using a representative brightfield dataset.
The results of this comparative analysis revealed that even the open-source tools are today characterised by comprehensive feature sets, which rival those of commercial software solutions. In particular, QuPath demonstrated flexibility in handling large file formats, along with robust options for ROI definition and single-cell classification. In contrast, the commercial solutions offered more advanced automation features but lacked the versatility and cost-effectiveness of some open-source tools. This comparison highlights the trade-offs between accessibility and functionality, with QuPath emerging as a balanced option for single-cell analysis within histological brightfield images.
To promote future benchmarking, the image dataset and the freehand-defined ROI used in the comparison have been shared as
Supplementary Materials to be used by the research community to conduct future comparative analyses of new tools for single-cell analysis in histological brightfield images. It is worth remarking that fluorescent images were out of our interest and the analysis has been limited to cell counting within a single brightfield ROI. The accuracy of the segmentation masks for detected cells was not evaluated, nor was the reproducibility of multiple determinations assessed. Consequently, this evaluation does not provide insights into the precision or reliability of the cell segmentation process. Future work will involve a more detailed analysis of the segmentation masks, including an assessment of false positives and false negatives. Additionally, several tools were excluded (e.g.,
Imaris) or delivered suboptimal results (e.g.,
Arivis) due to their lack of segmentation methods trained for brightfield images, despite their strong performance with fluorescent ones. In future, the experiments may be extended to WSI acquired with motorised fluorescent confocal microscopes to provide a more comprehensive assessment of the tools also in different specific cases. Eventually, the tests could also be extended to monitor how different tools handle various image types. For instance, we currently consider the most popular format, i.e., RGB, but multi-channel images with more than three channels are becoming increasingly common and could also be included [
30]. Finally, the experiments in this study were conducted by an expert in computer science. However, future work could benefit from incorporating usability feedback from less experienced users, such as an early-career life scientist. Nevertheless, this work offers a deeper understanding of the strengths and limitations of various software platforms, facilitating more informed decision making for researchers seeking tools to analyse histological brightfield images and paving the way for further insights.