Multi-Modal Deep Learning and Its Applications
A special issue of Applied Sciences (ISSN 2076-3417). This special issue belongs to the section "Computing and Artificial Intelligence".
Deadline for manuscript submissions: closed (20 November 2023) | Viewed by 34183
Special Issue Editors
Interests: natural language processing; image captioning; text–image retrieval; visual storytelling
Special Issues, Collections and Topics in MDPI journals
Interests: facial analysis; medical imaging; metric learning; representation learning; self-supervised learning; reinforcement learning; deep learning
Interests: data mining; pattern recognition; image processing
Special Issue Information
Dear Colleagues,
The Sixth Asian Conference on Artificial Intelligence Technology will be held in Changzhou, China. The ACAIT-2022 conference invites the submission of substantial, original, and unpublished research papers regarding Artificial Intelligence (AI) applications in image analysis, video analysis, medical image processing, intelligent vehicles, natural language processing, and other AI-enabled applications.
Multi-modal learning, which is an important sub-area of AI, has recently attracted noticeable attention due to its broad applications in the multi-media community. Early studies relied heavily on feature engineering, which is time-consuming and labor intensive. With the advancements made in deep learning, great efforts have been made to improve the performances of multi-modal applications with multi-modal deep learning. However, this progress still does not bridge the heterogeneity gaps between different modalities (i.e., computer vision, natural language process, speech, and heterogeneous signals) with deep learning techniques. The goal of this Special Issue is to collect contributions regarding multi-modal deep learning and its applications.
Papers for this Special Issue, entitled “Multi-modal Deep Learning and its Applications”, will be focused on (but not limited to):
- Deep learning for cross-modality data (e.g., video captioning, cross-modal retrieval, and video generation);
- Deep learning for video processing;
- Multi-modal representation learning;
- Unified multi-modal pre-training;
- Multi-modal metric learning;
- Multi-modal medical imaging;
- Unsupervised/self-supervised approaches in modality alignment;
- Model-agnostic approaches in modality fusion;
- Co-training, transfer learning, and zero-shot learning;
- Industrial visual inspection.
Dr. Min Yang
Dr. Hao Liu
Prof. Dr. Shanxiong Chen
Dr. Yinong Chen
Guest Editors
Manuscript Submission Information
Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.
Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Applied Sciences is an international peer-reviewed open access semimonthly journal published by MDPI.
Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.
Keywords
- deep multi-modal learning
- deep video processing
- multi-modal representation learning
- unified multi-modal pre-training
- multi-modal metric learning
- multi-modal medical imaging
- unsupervised modality alignment
- self-supervised modality alignment
- model-agnostic modality fusion
- co-training
- transfer learning
- zero-shot learning
- industrial visual inspection
Benefits of Publishing in a Special Issue
- Ease of navigation: Grouping papers by topic helps scholars navigate broad scope journals more efficiently.
- Greater discoverability: Special Issues support the reach and impact of scientific research. Articles in Special Issues are more discoverable and cited more frequently.
- Expansion of research network: Special Issues facilitate connections among authors, fostering scientific collaborations.
- External promotion: Articles in Special Issues are often promoted through the journal's social media, increasing their visibility.
- e-Book format: Special Issues with more than 10 articles can be published as dedicated e-books, ensuring wide and rapid dissemination.
Further information on MDPI's Special Issue polices can be found here.