Next Article in Journal
A Novel Circularly Polarized Folded Transmitarray Antenna with Integrated Radiation and Scattering Performance
Next Article in Special Issue
Speaker Adaptation on Articulation and Acoustics for Articulation-to-Speech Synthesis
Previous Article in Journal
Estimation of Greenhouse Lettuce Growth Indices Based on a Two-Stage CNN Using RGB-D Images
Previous Article in Special Issue
FlexLip: A Controllable Text-to-Lip System
 
 
Article
Peer-Review Record

Reliability-Based Large-Vocabulary Audio-Visual Speech Recognition

Sensors 2022, 22(15), 5501; https://doi.org/10.3390/s22155501
by Wentao Yu *, Steffen Zeiler and Dorothea Kolossa
Reviewer 1:
Reviewer 2: Anonymous
Reviewer 3:
Sensors 2022, 22(15), 5501; https://doi.org/10.3390/s22155501
Submission received: 6 May 2022 / Revised: 15 July 2022 / Accepted: 17 July 2022 / Published: 23 July 2022
(This article belongs to the Special Issue Future Speech Interfaces with Sensors and Machine Intelligence)

Round 1

Reviewer 1 Report

Finding a good paper at the first submission is really unusual. Said so, the paper is good and fits into a modern and relevant topic. It may appear a bit out of context respect to the journal’s aim, and my main suggestion to the authors is then to try add some more specs regarding the sensors’ aims.

Beside this, while processing the re-editing of the text, I only suggest the authors to avoid the use of “we” in English scientific writing, and fix so minor English errors occurring here and there,

dB are linear or A-weighted? If the second, please use dB(A) instead.  

Author Response

Please see the attachment.

Author Response File: Author Response.pdf

Reviewer 2 Report

The paper proposes an interesting methodology.

Please make a revision of some paragraphs, the english could be improved.

Author Response

Please see the attachment.

Author Response File: Author Response.pdf

Reviewer 3 Report

One major weakness of this paper is its novelty which is unclear. The paper tries to propose a new unified view of decision and representation fusion strategies. But the actual method is just to fuse the model's output at different layers. It's nice to use the idea of posterior probability to unify different strategies. But how could those strategies correspond to the idea should be discussed or even proved. Only corresponding these two aspects with text description and with a virtual concept is not enough in my opinion. 

Author Response

Please see the attachment.

Author Response File: Author Response.pdf

Round 2

Reviewer 3 Report

The authors' answer has solved part of my questions. I suggest the authors claim more clear in the paper to include the points in the response. 

Author Response

Please see the attachment.

Author Response File: Author Response.docx

Back to TopTop