Next Article in Journal
Prognostic Accuracy of CTP Summary Maps in Patients with Large Vessel Occlusive Stroke and Poor Revascularization after Mechanical Thrombectomy—Comparison of Three Automated Perfusion Software Applications
Previous Article in Journal
The Importance of Correlation between CBCT Analysis of Bone Density and Primary Stability When Choosing the Design of Dental Implants—Ex Vivo Study
 
 
Review
Peer-Review Record

Automated Coronary Optical Coherence Tomography Feature Extraction with Application to Three-Dimensional Reconstruction

Tomography 2022, 8(3), 1307-1349; https://doi.org/10.3390/tomography8030108
by Harry J. Carpenter 1,*, Mergen H. Ghayesh 1,*, Anthony C. Zander 1, Jiawen Li 2,3,4, Giuseppe Di Giovanni 5 and Peter J. Psaltis 5,6,7
Reviewer 1: Anonymous
Reviewer 2:
Reviewer 3: Anonymous
Reviewer 4: Anonymous
Tomography 2022, 8(3), 1307-1349; https://doi.org/10.3390/tomography8030108
Submission received: 15 March 2022 / Revised: 3 May 2022 / Accepted: 10 May 2022 / Published: 17 May 2022
(This article belongs to the Section Cardiovascular Imaging)

Round 1

Reviewer 1 Report

This review shows an interesting classification of research works conducted in the area of automated techniques, applied to coronary OCT imaging and their subsequent application to 3D reconstruction and biomechanical simulation. The authors provide very good, systematic overview of 78 articles from the last 5 years (2016-2021) and systematized them according to the research object.

Although this review summarised recent advances in automated techniques of object reconstruction and quantification, in my opinion, there is a lack of general short characteristics of OCT technology, especially with regard to intravascular OCT.

Author Response

Thank you to the reviewer for their time to review our manuscript and provide helpful comments. We hope that with these changes the reviewer finds our manuscript suitable for publication.

 

Reviewer 1 Comments
This review shows an interesting classification of research works conducted in the area of automated techniques, applied to coronary OCT imaging and their subsequent application to 3D reconstruction and biomechanical simulation. The authors provide very good, systematic overview of 78 articles from the last 5 years (2016-2021) and systematized them according to the research object.
Although this review summarised recent advances in automated techniques of object reconstruction and quantification, in my opinion, there is a lack of general short characteristics of OCT technology, especially with regard to intravascular OCT.
Response:
We thank the reviewer for their supportive comments and favourable assessment of our manuscript. We have expanded on the characteristics of intravascular OCT (specifically focusing on intracoronary OCT) in the introductory section to include a more detailed discussion of its capability.
Please see Introduction, lines 68-76 and 83-89:
‘Among current commercially available intracoronary imaging modalities applied in real-world clinical scenarios, OCT is uniquely placed to deliver sufficient accuracy, given that it has axial and lateral resolutions of 5-20 μm and 10-90 μm, respectively, depending on laser source and lens properties, approximately ten-fold higher axial and lateral resolutions than IVUS [26, 27]. OCT achieves this accuracy through light-based, near-infrared spectrum wavelengths of 1,250 to 1,350 nm emitted from a single invasive fiberoptic wire, which rotates as it is pulled backwards through the target vessel [28]. The backscattering of light measured by the time for light to travel from tissue to the catheter lens over each revolution of the fibreoptic wire forms each cross-sectional image of the vessel wall.’

‘The primary limitation of commercially available intracoronary OCT is its penetration depth of 0.1 to 2 mm in plaques, compared to up to 10 mm for IVUS, which prevents visualisation of the deep content of plaques, the external elastic membrane and adventitial layer in diseased regions [28, 35]. This penetration depth decreases significantly in the presence of lipid rich plaques due to the high attenuation and low backscattering properties of lipid. However, OCT does overcome IVUS’s limited penetration depth in calcified lesions which ultrasound cannot penetrate.’

Reviewer 2 Report

This systematic review gives the advances in automated segmentation techniques from the past five years (2016-2021) with a focus on their application to the three-dimensional reconstruction of vessels and their subsequent simulation.

1). The novelty of the paper seems a little bit limited. More discussions about the systematic review are needed in Section I to highlight the contributions of this paper.

2). Symbols used in this paper are not clearly explained. Please explain the symbols in mathematics.

3). Abbreviations (shortened forms of full terms, such as U-Net) may not be familiar to all readers. For clarity, write out the full term the first time you mention it and put the abbreviation in parentheses after the name—both in the abstract and the main parts.

4). More SOTA methods in the last three years could be reviewed and compared. The analysis and the experimental sections can be further extended to evaluate the SOTA methods.

5). Please analyze and evaluate more state-of-the-art algorithms and references (2016-2022).

6). All the images are not clear enough.

Author Response

Thank you to the reviewer for their time to review our manuscript and provide helpful comments. We hope that with these changes the reviewer finds our manuscript suitable for publication.

Reviewer 2 Comments
This systematic review gives the advances in automated segmentation techniques from the past five years (2016-2021) with a focus on their application to the three-dimensional reconstruction of vessels and their subsequent simulation.

1. The novelty of the paper seems a little bit limited. More discussions about the systematic review are needed in Section I to highlight the contributions of this paper.

Response:
The novelty of this paper lies in both its specific focus on coronary intravascular optical coherence tomography and on the output and use of automated techniques for 3D artery simulations. We have chosen to focus specifically on intravascular OCT as no recent review has captured this specific field, nor has any review focused on how automated processing techniques can be used to build towards patient-specific 3D simulations. We aim for this paper to be a benchmark for techniques looking to use optical coherence tomography for patient simulation and have better described this novelty and need for the paper in the introductory section.
Please see introduction lines 116-122:
‘In this systematic review, we evaluate recent methods to automatically segment and classify pathological and non-pathological features in coronary OCT imaging. This automated segmentation is critical to rapidly and quantitatively assessing atherosclerotic lesions in clinical scenarios. Uniquely, we focus this review on the application of automated techniques to 3D computational reconstruction and subsequent patient-specific simulation which requires specific characteristics to be accurately delineated, such as the outer elastic membrane and deep plaque components.’

2. Symbols used in this paper are not clearly explained. Please explain the symbols in mathematics.


Response:
The metrics used to evaluate results from each reviewed article have now been described mathematically in a glossary which immediately precedes the Appendices. We have also expanded the discussion to include how the large variability and non-standardisation of these evaluation metrics could impact the reported outcomes and their comparisons. Please see the glossary on page 21, line 833 onwards. We highlight the glossary in the introduction.
Please see line 130:
‘A glossary of evaluation metrics used to assess algorithm performance is also provided.’

3. Abbreviations (shortened forms of full terms, such as U-Net) may not be familiar to all readers. For clarity, write out the full term the first time you mention it and put the abbreviation in parentheses after the name—both in the abstract and the main parts.

Response:
We have revised the use of abbreviations in the text to ensure that all shortened terms are described in full at their first appearance in the main text. U-Net is a unique case as it is a name for an architecture rather than an abbreviation. We have also clarified this in the manuscript.
Please see line 111 in the introduction:
‘Such as the U-Net, which is named after its characteristic U-shaped structure’

4. More SOTA methods in the last three years could be reviewed and compared. The analysis and the experimental sections can be further extended to evaluate the SOTA methods.

Response:
To the best of the authors knowledge, we have evaluated all state-of-the-art algorithms reported in full-length journal articles in the target period (2016-2021) which were applied to intravascular coronary optical coherence tomography imaging. If there are more state of the art algorithms for use in intracoronary optical coherence tomography imaging that we may have missed between the target period (2016-2021), could the reviewer please direct us to the appropriate articles so we can include them?
We have added to the discussion of emerging unsupervised learning techniques and how improvements from these techniques that have been applied to other imaging modalities may be applied to intravascular OCT segmentation.
Please see lines 707-719:
‘Reviewed studies primarily used supervised learning techniques, such as neural networks, RF and SVM, where the model has access to both the original image, as well as manually annotated versions during training to effectively learn the correct parameters [85, 101, 156]. This requires large, high-quality, manually annotated datasets for training and validation to produce accurate and robust results, a significant cost. A focus on addressing this challenge by handling imperfect datasets with sparse or no manual annotations is emerging [55]. State-of-the-art unsupervised learning techniques, such as generative adversarial networks (GAN) and autoencoders, are also gaining in popularity and could reduce this burden by learning patterns from unlabelled data or generating further image labels to optimize segmentation [203, 204]. While Abdolmanafi et al. applied a sparse autoencoder in their work segmenting atherosclerotic tissue types [134], recent advancements in autoencoders applied to CT imaging are also leading to stronger feature learning and dimensionality reductions that could translate for use in intravascular OCT [205].’

5. Please analyze and evaluate more state-of-the-art algorithms and references (2016-2022).

Response:
To the best of the authors knowledge, we have evaluated all state-of-the-art algorithms reported in full-length journal articles in the target period (2016-2021) which were applied to intravascular coronary optical coherence tomography imaging. We recognise that this research field is growing rapidly, and we may have missed some papers; If there are further full-length journal papers could the reviewer, please direct us to the appropriate articles so we can include them?
As we are focusing on intracoronary optical coherence tomography imaging, the scope of the review may be narrower than if we focused on multiple coronary imaging modalities. We have chosen to focus specifically on intravascular OCT as no recent review has captured this specific
field, nor has any review focused on how automated processing techniques can be used to build towards patient-specific 3D simulations. We recognise that there are numerous other state-of-the-art algorithms that have been applied to many medical imaging modalities. Hence, we have expanded the discussion section to highlight what recent advancements from the automated processing of other modalities could be utilised in intravascular coronary OCT processing in the future, as per the previous comment, and directed readers to further reviews looking at other modalities instead.
Please see introduction, line 114-115:
‘Various architectures exist depending on the task to be completed and interested readers are directed to references [54-58] for more detail.’

6. All the images are not clear enough.

Response:
Images have been updated with clearer text/labels including larger fonts where possible. Please see Figures 1-3. The remaining images are reproduced from reviewed articles, with the highest possible image quality already obtained from the publishers. These have been reproduced in TIFF format to improve clarity as much as possible.

Reviewer 3 Report

The review article is summarized well.

Shortcomings of each approaches can be summarized in a general view in discussions.

The discussion briefly discuss about data biases. It could be elaborated in relation to different approaches discussed and also suggest which approaches are most impacted.

A Glossary on interpretation metrics would be beneficial.

Author Response

Thank you to the reviewer for their time to review our manuscript and provide helpful comments. We hope that with these changes the reviewer finds our manuscript suitable for publication.

 

Reviewer 3 Comments
The review article is summarized well.
We thank the reviewer for their encouraging assessment of our manuscript.
1. Shortcomings of each approaches can be summarized in a general view in discussions.
We have improved the discussion section to cover a general overview of the most promising methods and their shortcomings. This is now the introductory paragraph of the discussion.
Please see line 661-685:
‘Methods to automate the classification and segmentation of pathological and non-pathological formations in intravascular OCT images are emerging as clinically feasible. To automatically segment the lumen, the deep capsules approach presented by Balaji et al. showed impressive accuracy, speed and efficient computational use which make it an ideal candidate to make it to clinical use [93]. This approach built upon the useful characteristics of the U-Net to maintain high-level feature accuracy and shows strong promise to be expanded to plaque component analysis. However, this approach should also be expanded to be able to segment bifurcation regions and requires further work to better handle fringe cases (i.e. increasing the number of cases with artefacts and difficult geometries). Addressing the artery layers and outer wall, the mechanical approach presented by Olender et al. demonstrated impressive speed when fitting and smoothing a 3D surface from all images in a pullback [113]. This overcomes OCT’s most significant limitation, penetration depth in deep atherosclerotic components. However, its lumen and outer elastic membrane identification speed still lacks and could benefit from the U-Net based network proposed by Haft-Javaherian et al [110]. This approach could also show promise for automating the segmentation of tissue in future hybrid imaging modalities, such as a combined IVUS-OCT probe [193], as its multi-variate loss function could manage the added information that IVUS presents. Various techniques provided strong segmentation capability for plaque compositions and coronary stents, with CRF de-noising and strut detection constraints with prior knowledge of stent design more critical factors than the underlying network to providing strong results. However, further research is required to target quantifying fibrous cap thickness accurately in image datasets that well represent real-world scenarios, with current studies significantly limited to small datasets (179-348 images in each study to date [123-125]).
Until studies have access to datasets that are representative of real-world scenarios, clinical application will remain limited.’

2. The discussion briefly discuss about data biases. It could be elaborated in relation to different approaches discussed and also suggest which approaches are most impacted.

We have expanded the discussion to include data bias as well as methodology and evaluation metric bias and how these could impact results and the comparability of the investigations reviewed.

Please see lines 686-706:
‘Furthermore, while these methods show strong promise, assessing their effectiveness is not a straight-forward task, as heterogeneity in evaluation metrics can lead to an incomplete assessment of a methodology. A wide range of evaluation metrics have been used to assess the performance of automated techniques, with significant research applied to developing distance, similarity and boundary overlap metrics [194, 195]. Choosing the most effective measure for the task at hand is difficult and can lead to bias in results, particularly when dealing with class imbalance [196]. Making use of frequency weighted evaluation metrics, such as the frequency weighted intersection over union rather than the commonly used Jaccard similarity index could assist in dealing with this challenge. Development of consensus documents for OCT based deep learning may also assist researchers reduce other biases in their work, including data distribution, dataset leakage and methodological bias, factors already shown to significantly skew results in cancer diagnoses [197-200]. Improving access to large scale, longitudinal and multicentre datasets that are representative of real-world scenarios coupled with consistent use of techniques including cross-validation, model regularization (to prevent overfitting or underfitting) and de-biasing through oversampling and adversarial de-biasing will help in addressing these challenges. Competitions, such as [201], could further assist by standardising the development and evaluation of methods on pre-defined datasets, improving transparency, while open-source projects, such as the medical open network for artificial intelligence (MONAI), first publicly released in 2020, provide best practice deep learning frameworks [202].’

3. A Glossary on interpretation metrics would be beneficial.

Response:
We have added a Glossary section immediately preceding the Appendices describing in detail the metrics used in each article to interpret and evaluate their automated processing technique(s). Please see page 21, line 833 onwards.

Reviewer 4 Report

Comments to the Author

Title: Automated Coronary Optical Coherence Tomography Feature Extraction with Application to Three-Dimensional Reconstruction

Authors: Harry J. Carpenter, Mergen H. Ghayesh, Anthony C. Zander, Jiawen Li, Giuseppe Di Giovanni, and Peter J. Psaltis

à The paper provides an extensive review of the advances in automated segmentation techniques from the past five years (2016-2021). The review focusses on the application of automated segmentation to the 3D reconstruction of vessels and their subsequent simulation. In addition, the authors discuss four categories based on the feature being processed, and the study of future areas (potential and translation).

à I believe this is an interesting work suitable for its publication in Tomography. I would endorse the publication of the present manuscript after carefully addressing the comments hereafter.

General: The manuscript is well written; all the sections are well linked and the contents are easy to follow. The use of English is correct as well as the length for each section.

In the following: MA=Major comment, MI = Minor comment, OP = Optional Comment

(OP) P1, Abstract, L24 and L29: Consider introducing 3D in L24 and the using this abbreviation instead of three-dimensional.

(MI) P1, Introduction, L43: Which imaging modalities are you referring to? Molecular imaging, functional imaging?

(OP) P2, Introduction, L57-59: Has this been observed for a specific group? Or, is this statement general for all population types?

(MI) P2, Introduction, L74: There is a space missing between arc [29, 30].

(MI) P2, Introduction, L76: Please, provide values for the penetration depth of OCT.

(MI) P3, Introduction, L97: Please, modify the sentence as: ‘However, this usually leads to a reduction in image resolution’. The reduction in image reconstruction is not always happening thus, I suggest modifying the sentence as exemplified for a more general statement.

(OP) P4, Introduction, Figure 3: Include how many articles (out of those 78 reviewed articles) were included for each category.

(MI) P5, Coronary lumen, L122-124: Please, clarify if these methods are commonly used world-wide or to which world-region are you referring when discussing them. Same comment for the entire manuscript.

(MI) P5, Coronary lumen, L149-151: How much improvement are these values in comparison with the other discussed methods?

(OP) P6, Coronary lumen, L204: Do you have any insights regarding clinician acceptance of machine learning algorithms?

(OP) P7, Coronary lumen, Figure 5: I think the figure is too big. I would suggest to make it coherent with the size of previous figures.

(MI) P8, Coronary lumen, L242: This is the first time you are referring to Figure 5 so, move the Figure after its call.

(OP) P8, Artery layers, L266-267: At a first glance the errors associated to their reported values seem to be really big. Could you comment why?

(MI) P10, Artery layers, L316: Among all described methods, which is the most likely to be adopted by clinicians and why?

What do the authors think is key to normalize the use of AI algorithm in daily-practice?

(MI) P10, Plaque characteristics and subtypes, L316: Among all described methods, which is the most likely to be adopted by clinicians and why?

(MI) P10, Plaque characteristics and subtypes, L338-340: Do the authors know if any of these developments are already being studied? If so, please, comment on that.

(MI) P11, Plaque characteristics and subtypes, L356-359: Are any of these networks being used in real-practice? Or are they still under validation/acceptance?

(MI) P12, Plaque characteristics and subtypes, L401-401: What about the time required by each method? Are any remarkable differences?

(MI) P13, Plaque characteristics and subtypes, L442-444: Has this been reported?

(MI) P14, Plaque characteristics and subtypes, L484-486: Do the authors know if any of these developments are already being studied? If so, please, comment on that.

(OP) P14, Stents, L507-508: Reword this sentence, is a little bit confusing.

(OP) P15, Stents, L549-550: It would be interesting to include the year in which the Toolkit was released.

(MI) P17, Discussion, L614-616: Add the date at which deep learning was implemented for the first time.

(MI) P17, Discussion, L621: Add references.

(MI) P18, Discussion, L648: What about MR imaging? Could this be an alternative to CT?

(MI) P18, Discussion, L652: Add penetration depth value.

(MI) P18, Discussion, L664-666: Have nuclear imaging techniques (such as PET, SPECT, gamma cameras or nuclear imaging probes) any relevant application in this regard?

(OP) P19, Discussion, L681: Or would be interesting if the authors provide a paragraph summarizing which one of the methods reviewed they considered as the most suitable to make it to the daily-practice.

(MI) P19, Discussion, L681: The authors should provide a closing paragraph describing the future perspective of these techniques.

(MI) P19, Conclusion, L681: The authors should include few lines describing the acceptance of the clinician for these techniques.

(MA) Tables: The tables provide very useful and complete information. I have read them and couldn’t find any typo but, double check (just in case)

(MA) References: Please, check that all references follow the same style as outlined by the Journal author instructions

Author Response

Thank you to the reviewer for their time to review our manuscript and provide helpful comments. We hope that with these changes the reviewer finds our manuscript suitable for publication.

 

Reviewer 4 Comments
The paper provides an extensive review of the advances in automated segmentation techniques from the past five years (2016-2021). The review focusses on the application of automated segmentation to the 3D reconstruction of vessels and their subsequent simulation. In addition, the authors discuss four categories based on the feature being processed, and the study of future areas (potential and translation).

I believe this is an interesting work suitable for its publication in Tomography. I would endorse the publication of the present manuscript after carefully addressing the comments hereafter.

General: The manuscript is well written; all the sections are well linked and the contents are easy to follow. The use of English is correct as well as the length for each section.
In the following: MA=Major comment, MI = Minor comment, OP = Optional Comment
We thank the reviewer for their supportive comments and favourable assessment of our manuscript.

1. (OP) P1, Abstract, L24 and L29: Consider introducing 3D in L24 and the using this abbreviation instead of three-dimensional.
Response:
3D has been introduced in line 24 and used in line 29 in place of three-dimensional.
Please see abstract line 24 and 29:
‘construct accurate three-dimensional (3D) simulations’
‘focus on their application to the 3D reconstruction’

2. (MI) P1, Introduction, L43: Which imaging modalities are you referring to? Molecular imaging, functional imaging?


Here we are referring primarily to structural imaging which is applied in clinical scenarios. However, we have updated the manuscript to highlight that these events continue to occur at high rates despite improvements in structural, molecular, and functional imaging modalities. While coronary OCT is a structural imaging modality, we discuss further the potential for molecular and functional imaging to supplement or combine with OCT in the future and what impact this may have on automated processing techniques in the discussion section.
Please see introduction, line 43:

‘This is despite advances in structural, molecular and functional imaging technology’


3. (OP) P2, Introduction, L57-59: Has this been observed for a specific group? Or, is this statement general for all population types?


The statement on structural stress and plaque instability and rupture is a general statement, applicable to all population types. The field of biomechanical/patient-specific simulation is still emerging and as yet has not focused on specific population groups. We have made this clearer in the manuscript.
Please see line 58:
‘Conversely, in the general population heightened structural stress has been associated with plaque instability and rupture’

4. (MI) P2, Introduction, L74: There is a space missing between arc [29, 30].
Response:
A space has been added between arc and the reference.
Please see line 80:
‘lipid arc [31,32]’

5. (MI) P2, Introduction, L76: Please, provide values for the penetration depth of OCT.

OCT penetration depth ranges from 0.1-2 mm, as compared to up to 10 mm in IVUS. However, this is dependent on the presence of lesions and their composition. We have included this earlier in the text (line 70) and discuss what factors play a role in this penetration depth.
Please see lines 83-89:
‘The primary limitation of commercially available intracoronary OCT is its penetration depth of 0.1 to 2 mm in plaques, compared to up to 10 mm for IVUS, which prevents visualisation of the deep content of plaques, the external elastic membrane and adventitial layer in diseased regions [28, 35]. This penetration depth decreases significantly in the presence of lipid rich plaques due to the high attenuation and low backscattering properties of lipid. However, OCT does overcome IVUS’s limited penetration depth in calcified lesions which ultrasound cannot penetrate.’

6. (MI) P3, Introduction, L97: Please, modify the sentence as: ‘However, this usually leads to a reduction in image resolution’. The reduction in image reconstruction is not always happening thus, I suggest modifying the sentence as exemplified for a more general statement.

Response:
The sentence has been modified as suggested.
Please see line 108:
‘However, this usually leads to a reduction in image resolution’

7. (OP) P4, Introduction, Figure 3: Include how many articles (out of those 78 reviewed articles) were included for each category.

Response:
The number of articles in each section has been included in Figure 3 as suggested. Of the 78 reviewed, 21 focused on the coronary lumen, 8 on artery layers, 35 on plaque characteristics and subtypes and 14 on stents.

8. (MI) P5, Coronary lumen, L122-124: Please, clarify if these methods are commonly used world-wide or to which world-region are you referring when discussing them. Same comment for the entire manuscript.

Response:
The methods discussed here are independent of any region, they are used world-wide in automated image processing techniques in a variety of fields. We have clarified this at the beginning of the coronary lumen section.
Please see line 137:
‘Here, globally used binarisation methods, such as Otsu filtering’

9. (MI) P5, Coronary lumen, L149-151: How much improvement are these values in comparison with the other discussed methods?

Response:
We have added the improvements noted compared to earlier, comparable investigations. We have also improved this discussion be adding the improvements noted in mean average difference in area and the Hausdorff distance which highlight the impact that heterogeneity in evaluation metrics can show.
Please see lines 165-174:
‘This approach achieved a sensitivity, specificity and Jaccard similarity index of 95.55 ± 3.19%, 99.84 ± 0.29%, and 0.95 ± 0.03, respectively, improving upon earlier first-order gaussian derivative approaches that achieved 89.76 ± 5.99%, 99.80 ± 0.56%, and 0.89 ± 0.06 in the same metrics [73]. Compared to using image intensity values alone, classification accuracy increased 6.80% in a dataset of 1,846 images from 13 pullbacks (457 training, 1,389 testing), whilst the mean average difference in area and the Hausdorff distance were reduced 55% and 70% respectively. This highlights both that evaluation metric heterogeneity can significantly bias how improvement is measured, and that spatio-temporal approaches that consider all images in a pullback can achieve smooth contour segmentation in complex lumen geometries.’

10. (OP) P6, Coronary lumen, L204: Do you have any insights regarding clinician acceptance of machine learning algorithms?

Response:
Clinician acceptance of machine learning algorithms, especially in the case of intravascular OCT, is still tied to the imaging modality’s clinical utility. While OCT and IVUS are still not a part of routine coronary angiography procedures, automated segmentation approaches that can run in near real time in the catheterization laboratory could provide a significant advance in making quantitative data (e.g. fibrous cap thickness measurement) readily available to the interventional Cardiologist and assist with the interpretation of OCT images. In turn, this could inform clinical decision making and lead to better patient outcomes. This is added to the manuscript discussion.
However, to date, machine learning has not typically been used in clinical settings as there are still significant challenges that need to be addressed, including: 1) Improving access to large scale, expertly annotated datasets to train and test techniques on data that is representative of real world scenarios; 2) Evidence that techniques are robust and reliable enough to enable clinical use and provide sufficient incremental value to justify the associated costs (i.e. health economic analysis); 3) Regulations surrounding the updates of medical technology could inhibit the rapid adoption required for AI in clinical scenarios; 4) Data ownership could impact how techniques develop, particularly if research techniques develop with large scale datasets to the point of commercial potential. These challenges have been added to the discussion section.
Please see lines 258-262:
‘This rapid clinical application of automated lumen segmentation could produce a significant leap in quantitative data available to clinicians, improving patient outcomes and the utility and acceptance of intravascular imaging modalities, machine learning approaches and the translation of 3D simulation capability, such as WSS computation.’

11. (OP) P7, Coronary lumen, Figure 5: I think the figure is too big. I would suggest to make it coherent with the size of previous figures.

Response:
The figure has been resized to better match other figures. Please see Figure 5 on page 8.

12. (MI) P8, Coronary lumen, L242: This is the first time you are referring to Figure 5 so, move the Figure after its call.
Response:
The figure has been moved to after its call. Similarly, we have checked other figures and also ensured they are shown after their call.

13. (OP) P8, Artery layers, L266-267: At a first glance the errors associated to their reported values seem to be really big. Could you comment why?
Response:
The errors reported are similar in magnitude to other investigations in the micron range. The inter-observer variability in manual annotations between two expert annotators was 6.76 ± 10.61 μm, which agrees well with the automated approaches. However, the automated approaches did present larger standard deviations, most likely due to surface smoothness constraints put on the algorithm, which prevent frame-to-frame variances beyond a certain limit to assist in overcoming image artefacts. We have included this in the discussion of these results.
Please see lines 294-297:
‘These errors were less than the inter-observer variability reported of 6.76 ± 10.61 μm, although their standard deviations were significantly larger, possibly due to the surface smoothness constraint put on the algorithm.’

14. (MI) P10, Artery layers, L316: Among all described methods, which is the most likely to be adopted by clinicians and why? What do the authors think is key to normalize the use of AI algorithm in daily-practice?
Response:
Normalising AI in clinical practice requires overcoming a number of significant challenges already discussed, including: 1) Improving access to large scale, expertly annotated datasets to train and test techniques on data that is representative of real world scenarios; 2) Evidence that techniques are robust and reliable enough to enable clinical use and provide sufficient incremental value to justify the associated costs (i.e. health economic analysis); 3) Regulations surrounding the updates of medical technology could inhibit the rapid adoption required for AI in clinical scenarios; 4) Data ownership could impact how techniques develop, particularly if research techniques develop with large scale datasets to the point of commercial potential. These challenges have been added to the discussion section.
In the case of artery layers and the outer wall, clinical acceptance of the best technique could also be impacted by advances in intravascular imaging technology. For current OCT imaging, the approach developed by Olender et al. [113] shows significant promise, with smoothing and surface fitting taking only 2.74 ms and 40.2 ms per frame, respectively. As hybrid intravascular imaging probes develop, it seems most likely that the information available to clinicians and automated processing techniques will improve, overcoming OCT’s current limitations, removing the need for some estimations, such as outer elastic membrane location. For example, a hybrid combination of IVUS and OCT will enable high resolution fibrous cap identification and determination of the outer elastic membrane in the presence of deep atherosclerotic features. In this case, the U-Net approach demonstrated by Haft-Javaherian et al. [110] on polarisation sensitive OCT that makes use of a multivariate loss function seems most likely to progress. This is due to the multivariate loss function being able to handle the added information provided by IVUS while maintaining the high-resolution OCT image characteristics during segmentation. However, the computational cost of implementing such a network was not explicitly discussed and would likely impact clinical application. This has been added to the artery layers section whilst the key to normalising AI in daily practice has been elaborated in the discussion section.
Please see lines 321-324:
‘This approach could also be useful in segmenting the outer elastic membrane in hybrid IVUS-OCT systems [112], where the multivariate loss function could manage the added in-formation provided by IVUS while maintaining the high-resolution OCT image characteristics during segmentation.’
Please see lines 340-343:
‘While surface smoothing and fitting times were 2.74 ± 0.28 ms and 40.20 ± 7.50 ms per frame, respectively, this approach would benefit from improvements to the lumen and edge detection speeds which required a much greater 4.20 ± 1.50 s and 5.35 ± 0.85 s per frame, respectively, to make it clinically applicable.’

15. (MI) P10, Plaque characteristics and subtypes, L316: Among all described methods, which is the most likely to be adopted by clinicians and why?
Response:
Based on the performance and completeness of the technique, the most likely to be adopted is the approach that combines the outer wall surface fitting of Olender et al. and the segmentation approach of Athanasiou et al. Rather than adding to the plaque section, this has been discussed further in the discussion section where we also elaborate further on the most promising
approaches from each section. The first paragraph of the discussion has also been updated based on previous comments to cover the best techniques and their associated shortcomings.
Please see the discussion, line 728-733:
‘This framework, shown in Figure 14, built on their previous studies to classify pixels into six tissue components within a constrained wall area region, makes use of 3D mode filtering to improve spatial consistency and continuity of contours. This approach shows significant potential to translate to clinical use, as it brings together the relevant processing steps into a single framework.’

16. (MI) P10, Plaque characteristics and subtypes, L338-340: Do the authors know if any of these developments are already being studied? If so, please, comment on that.
Response:
Fibrous cap thickness quantification is being studied in the literature but is still very limited due to the challenge of identifying the cap boundary in a region with high attenuation and diffuse contours. Many investigations remain focused on automatically segmenting plaque components, often by only identifying the presence and type of plaque. We have included this in the manuscript.
Please see lines 377-385:
‘A critical measure of plaque stability, respective fibrous cap thickness errors of 13.06%, 22.20% and 17.46% were shown [120-122]. These errors are due to the high signal attenuation and diffuse contours representative of a fibrous cap overlying a lipid pool coupled with inter-observer variability and expert interpretation in the manually segmented ground truth. As accurate thickness measurement is a critical parameter for quantification of plaque vulnerability and biomechanical stress, further research to address these challenges and reduce errors is required [123]. Techniques such as dynamic programming have also demonstrated the capability to overcome these challenges and could be further explored [124, 125].’

17. (MI) P11, Plaque characteristics and subtypes, L356-359: Are any of these networks being used in real-practice? Or are they still under validation/acceptance?
Response:
These networks are still research focused and under validation. We have made this amendment in the manuscript.
Please see lines 393-394:
‘The method significantly outperformed five other algorithms under ongoing research’

18. (MI) P12, Plaque characteristics and subtypes, L401-401: What about the time required by each method? Are any remarkable differences?
Response:
The time required for the hybrid learning approach is not explicitly stated. However, time that the random forest classifier takes to train, and run was reported as one quarter and one third the time of the support vector machine, totalling 1 second per image (0.05-sec for pre-processing, 0.9-sec for feature extraction, 0.03-sec for classification, and 0.02-sec for post-processing) and was the reason behind using the random forest in the hybrid method. We have included this in the manuscript.
Please see lines 438-446:
‘This architecture was then applied in a hybrid learning approach on 6,556 images from 49 patients with a RF classifier [156] implemented due to the faster computation time, needing only 25% of the training time and 33% run time of a SVM to achieve comparable accuracy. When a CRF was applied for noise postprocessing, the hybrid model approach outperformed a purely
CNN for fibro-calcific (sensitivity: 97.20% vs 80.20%; specificity: 91.90% vs 92.90%) and fibro-lipid (sensitivity: 77.30% vs 46.80%; specificity: 91.90% vs 92.90%) classification, needing approximately one second per image (the majority, 0.9 s, required for feature extraction).’

19. (MI) P13, Plaque characteristics and subtypes, L442-444: Has this been reported?
Response:
This was reported as the reason for combining the two loss functions in the article being cited. We have re-worded this section to make this clearer.
Please see lines 481-489:
‘A multi-term loss function was proposed to overcome imbalances in foreground/background pixels, which can lead to incomplete vulnerable region detection. By combining the weighted cross-entropy loss function, to enhance boundary pixels and improve boundary segmentation, and dice coefficient, to increase pixel classification accuracy, an overall pixel accuracy and precision of 93.31% and 94.33%, respectively, were reached [135], improvements of 49% and 14%, respectively, over the initial prototype U-Net. More impressively, the mean intersection over union and frequency weighted intersection over the union, improved measures of the overlap in two regions, improved 103% and 71%, respectively.’

20. (MI) P14, Plaque characteristics and subtypes, L484-486: Do the authors know if any of these developments are already being studied? If so, please, comment on that.
Response:
Yes, these approaches to reduce the number of manually annotated images are already being studied (though have not always been applied to OCT specifically). Dealing with limited datasets, with either scarce or weak annotations, is a significant challenge in the medical field and an ongoing research focus. We have commented on this and expanded this in the discussion section.
Please see lines 526-527:
‘Dealing with limited datasets, with either scarce or weak annotations, is a significant challenge in the medical field and an ongoing research focus [55]’
Please see lines 535-537:
‘Further development and use of approaches such as data augmentation, transfer, and active learning, CRF post-processing and class activation mapping to reduce the number of annotated images needed for accurate training and classification would benefit the field.’
Please see discussion, lines 681-685:
‘However, further research is required to target quantifying fibrous cap thickness accurately in image datasets that well represent real-world scenarios, with current studies significantly limited to small datasets (179-348 images in each study to date [123-125]). Until studies have access to datasets that are representative of real-world scenarios, clinical application will remain limited.’

21. (OP) P14, Stents, L507-508: Reword this sentence, is a little bit confusing.
Response:
We have reworded the sentence to clarify that ‘suboptimal images’ predominantly focuses on residual blood artefacts in this article.
Please see lines 559-560:
‘To improve stent strut segmentation in suboptimal images, such as those with residual blood artefacts, Cao et al. investigated an AdaBoost trained, cascade classifier [176].’

22. (OP) P15, Stents, L549-550: It would be interesting to include the year in which the Toolkit was released.
Response:
We have added the year that the toolkit was published.
Please see line 598:
‘This approach was further developed into a toolkit (OCTivat-Stent), published in 2020, capable of reducing total segmentation time to just 30 min per pullback’

23. (MI) P17, Discussion, L614-616: Add the date at which deep learning was implemented for the first time.
Response:
We have added the year that the MONAI deep learning program was first released.
Please see line 705:
‘while open source projects, such as the medical open network for artificial intelligence (MONAI), first publicly released in 2020, provide best practice deep learning frameworks’

24. (MI) P17, Discussion, L621: Add references.
Response:
References have been added and the sentence expanded to show which supervised learning techniques were most often used.
Please see lines 707-710:
‘Reviewed studies have primarily used supervised learning techniques such as neural networks, RF and SVM, where the model has access to both the original image, as well as manually annotated versions during training to effectively learn the correct parameters [85, 101, 156].’

25. (MI) P18, Discussion, L648: What about MR imaging? Could this be an alternative to CT?
Response:
Magnetic resonance imaging could be an alternative to CT and has some clinical advantages over CT, such as reduced radiation exposure. However, the lower spatial resolution, motion related image degradation and longer scan times make it difficult to assess coronary artery disease. The lower resolution also impacts 3D reconstructions which in turn can impact any patient-simulations. We have included this in the discussion.
Please see lines 738-742 and 749-750:
Of the available modalities that could be used, invasive coronary angiography is the primary candidate due to its widespread clinical use and requirement during intracoronary OCT procedures. However, computed tomography coronary angiography is a rising noninvasive contender and coronary magnetic resonance imaging could also be a useful addition to reduce patient radiation and contrast exposure, although lower image resolution and susceptibility to motion related image degradation could impact reconstruction accuracy in these cases [212, 213].

26. (MI) P18, Discussion, L652: Add penetration depth value.
Response:
We have added OCT’s penetration depth.
Please see line 753:
‘The integration of OCT and IVUS, for example, could overcome the limited 0.1 to 2 mm penetration depth associated with OCT in plaques’

27. (MI) P18, Discussion, L664-666: Have nuclear imaging techniques (such as PET, SPECT, gamma cameras or nuclear imaging probes) any relevant application in this regard?
Response:
Nuclear imaging techniques do have application in this context. They will likely supplement automated structural feature extraction with functional or molecular capabilities which could be included in patient-specific simulations. We have included this in the discussion.
Please see lines 764-769:
‘Further development of near-infrared spectroscopy/Raman, fluorescence lifetime (FLIM) and near-infrared autofluorescence (NIRAF) modalities in combination with OCT also shows promise to extract biochemical and molecular tissue information on elastin and macrophages whilst nuclear imaging techniques such as positron emission tomography (PET) could supplement this with information on local inflammatory responses [112, 218-220].’

28. (OP) P19, Discussion, L681: Or would be interesting if the authors provide a paragraph summarizing which one of the methods reviewed they considered as the most suitable to make it to the daily-practice.
Response:
We have adjusted the discussion section to now include a paragraph summarising which methods show the most promise moving forward. This is now the introductory paragraph of the discussion and also highlights shortcomings of the selected approaches.
Please see lines 661-685:
‘Methods to automate the classification and segmentation of pathological and non-pathological formations in intravascular OCT images are emerging as clinically feasible. To automatically segment the lumen, the deep capsules approach presented by Balaji et al. showed impressive accuracy, speed and efficient computational use which make it an ideal candidate to make it to clinical use [93]. This approach built upon the useful characteristics of the U-Net to maintain high-level feature accuracy and shows strong promise to be expanded to plaque component analysis. However, this approach should also be expanded to be able to segment bifurcation regions and requires further work to better handle fringe cases (i.e. increasing the number of cases with artefacts and difficult geometries). Addressing the artery layers and outer wall, the mechanical approach presented by Olender et al. demonstrated impressive speed when fitting and smoothing a 3D surface from all images in a pullback [113]. This overcomes OCT’s most significant limitation, penetration depth in deep atherosclerotic components. However, its lumen and outer elastic membrane identification speed still lacks and could benefit from the U-Net based network proposed by Haft-Javaherian et al [110]. This approach could also show promise for automating the segmentation of tissue in future hybrid imaging modalities, such as a combined IVUS-OCT probe [193], as its multi-variate loss function could manage the added information that IVUS presents. Various techniques provided strong segmentation capability for plaque compositions and coronary stents, with CRF de-noising and strut detection constraints with prior knowledge of stent design more critical factors than the underlying network to providing strong results. However, further research is required to target quantifying fibrous cap thickness accurately in image datasets that well represent real-world scenarios, with current studies significantly limited to small datasets (179-348 images in each study to date [123-125]). Until studies have access to datasets that are representative of real-world scenarios, clinical application will remain limited.’

29. (MI) P19, Discussion, L681: The authors should provide a closing paragraph describing the future perspective of these techniques.
Response:
We have updated the first paragraph in the discussion to include future perspectives of the methods reviewed which now flows well into the subsequent paragraphs discussing evaluation metrics, data bias, emerging techniques from other imaging modalities and integrating these findings into a complete framework. The closing paragraph of the discussion now integrates the role of clinical acceptance into the future perspectives of these techniques, as suggested.
Please see lines 783-800:
‘Clinician acceptance of machine learning algorithms, especially in the case of intravascular OCT, is still tied to the imaging modality’s clinical utility. While OCT and IVUS are still not a part of routine coronary angiography procedures, automated segmentation approaches that can run in near real time in the catheterization laboratory could provide a significant advance in making quantitative data (e.g. fibrous cap thickness measurement) readily available to the interventional Cardiologist and assist with the interpretation of OCT images. In turn, this could inform clinical decision making and lead to better patient outcomes. The future potential for automated approaches to make it into clinical use also require addressing a number of systemic challenges, including: 1) Improving access to large scale, expertly annotated datasets to train and test techniques on data that is representative of real world scenarios; 2) Evidence that techniques are robust and reliable enough to enable clinical use and provide sufficient incremental value to justify the associated costs (i.e. health economic analysis); 3) Regulations surrounding the updates of medical technology could inhibit the rapid adoption required for AI in clinical scenarios; 4) Data ownership could impact how techniques develop, particularly if research techniques develop with large scale datasets to the point of commercial potential [235]. These are both multi-disciplinary challenges and opportunities for the engineering, computer science and medical research fields.’

30. (MI) P19, Conclusion, L681: The authors should include few lines describing the acceptance of the clinician for these techniques.
Response:
We have added concluding remarks on the clinical acceptance of these techniques.
Please see conclusion, lines 815-817:
‘However, challenges surrounding access to large scale, expertly annotated image datasets that represent real-world scenarios and robustness of automated techniques to clinical use still need to be addressed before clinical acceptance.’

31. (MA) Tables: The tables provide very useful and complete information. I have read them and couldn’t find any typo but, double check (just in case)
Response:
We have double checked the tables for typos and grammatical errors.

32. (MA) References: Please, check that all references follow the same style as outlined by the Journal author instructions
Response:
We have re-checked the reference list and updated to match Tomography’s format.

Round 2

Reviewer 2 Report

The authors have addressed all the comments.

Back to TopTop