Next Article in Journal
Economic Development, Informal Land-Use Practices and Institutional Change in Dongguan, China
Next Article in Special Issue
Prevention of Mountain Disasters and Maintenance of Residential Area through Real-Time Terrain Rendering
Previous Article in Journal
New Media Development, Sleep and Lifestyle in Children and Adolescents
Previous Article in Special Issue
A Structure Landmark-Based Radio Signal Mapping Approach for Sustainable Indoor Localization
 
 
Article
Peer-Review Record

A Video Captioning Method Based on Multi-Representation Switching for Sustainable Computing

Sustainability 2021, 13(4), 2250; https://doi.org/10.3390/su13042250
by Heechan Kim 1 and Soowon Lee 2,*
Reviewer 1: Anonymous
Reviewer 2: Anonymous
Sustainability 2021, 13(4), 2250; https://doi.org/10.3390/su13042250
Submission received: 8 January 2021 / Revised: 25 January 2021 / Accepted: 27 January 2021 / Published: 19 February 2021
(This article belongs to the Collection Advanced IT based Future Sustainable Computing)

Round 1

Reviewer 1 Report

The article proposes a multi-representation switching method to extract the important information for a given video and description pair as efficiently as possible.

First, a good introduction to the research topic and the related works are given. In the further part, the proposed method is presented in detail and supported by examples and calculations. The fourth part presents the results of the application of the model using the Microsoft Research Video Description (MSVD) dataset.

The structure and explanations are clearly comprehensible. The proportions of the sections are balanced. The information is coherent and superfluous. The figure and tables are in the right places.

The authors’ contribution to the topic is important.

I have the following observations.

The research questions and hypotheses are missing.

The references must be formatted according to the journal requirements.

Line 350 is Qualiatative results and has to be “Qualitative….

 

Author Response

Thank you for your dedicate comments.

 

Following the comments, we revised the manuscript.

 

We added the paragraph for the research questions and hyhothesis in line 56-74.

We revised the format of references.

We revised the misspellings.

All changes are tracked by "Track changes" function of Word.

 

Sincerly.

Reviewer 2 Report

Authors propose the video captioning method based on multi-representation switching to model characteristics in the video and description pair. The proposed method consists of three parts, i.e., entity extraction, motion extraction, and textual feature extraction.

The article is expected to be relevant. However, authors mentioned about the sustainable computing in the last part and they have no discussion on it.

The article is assumed to be original, but authors are requested to expand the reference list and discuss the achievement of other researchers.

In the introduction authors could add the research method explanations. They presented an experiment. Although, the main posed question has been achieved, authors could explain more further development and usability of proposed solution.

Author Response

Thank you for your dedicate comments.

 

Following the comments, we revised the manuscript.

 

We added the paragraphs for the sustainable computing in line 56-74.

We expanded the references in the related works (from line 110), and we also added the disscussions the achievements in line 397-425.

We also added the paragraphs explaining further development and usability of the proposed method in line 462-469.

We revised the misspellings.

All changes are tracked by "Track changes" function of Word.

 

Sincerly.

Back to TopTop