Image and Video Coding Technology

A special issue of Electronics (ISSN 2079-9292). This special issue belongs to the section "Computer Science & Engineering".

Deadline for manuscript submissions: 15 August 2024 | Viewed by 717

Special Issue Editors

Department of Computer Science and Technology, Harbin Institute of Technology, Shenzhen 518055, China
Interests: coding and information theory; trustworthy AI; digital watermarking
Special Issues, Collections and Topics in MDPI journals

E-Mail Website
Guest Editor
Department of Computer Science and Technology, Harbin Institute of Technology, Shenzhen 518055, China
Interests: multimedia information retrieval

E-Mail Website
Guest Editor
Peng Cheng Laboratory, University of Science and Technology of China, Shenzhen 518000, China
Interests: intelligent transportation

Special Issue Information

Dear Colleagues,

In recent years, deep image/video coding technologies represented by deep learning have significantly outperformed traditional image/video coding formats such as JPEG and HEVC in terms of human perception and machine vision tasks. The coding quality of images/videos depends on various quality evaluation metrics, and novel perceptual metrics have led to higher requirements for image and video compression. At the same time, the development of large-scale foundation models has provided good data priors for image and video compression, creating possibilities for further enhancing the machine vision accuracy of image/video coding. However, for practical application deployment, designing low-latency and low-complexity image/video coding algorithms is still a key problem to be solved.

In this context, for this Special Issue entitled “Image and Video Coding Technology”, we invite original research and comprehensive reviews on, but not limited to, the following:

  • Advances in perceptual image/video coding;
  • Advances in lossy and lossless image/video coding;
  • Advances in point cloud coding;
  • Advances in machine vision-oriented image/video coding;
  • Advances in image/video compressive sensing;
  • Reinforcement learning and game theory for image/video coding;
  • Real-time video encoding;
  • Scalable Video Coding (SVC);
  • Enabling technologies for high-performance video coding;
  • Video coding for mobile communication;
  • Rate distortion theory for lossy image/video coding.

Dr. Bin Chen
Dr. Shuhan Qi
Dr. Yaowei Wang
Guest Editors

Manuscript Submission Information

Manuscripts should be submitted online at www.mdpi.com by registering and logging in to this website. Once you are registered, click here to go to the submission form. Manuscripts can be submitted until the deadline. All submissions that pass pre-check are peer-reviewed. Accepted papers will be published continuously in the journal (as soon as accepted) and will be listed together on the special issue website. Research articles, review articles as well as short communications are invited. For planned papers, a title and short abstract (about 100 words) can be sent to the Editorial Office for announcement on this website.

Submitted manuscripts should not have been published previously, nor be under consideration for publication elsewhere (except conference proceedings papers). All manuscripts are thoroughly refereed through a single-blind peer-review process. A guide for authors and other relevant information for submission of manuscripts is available on the Instructions for Authors page. Electronics is an international peer-reviewed open access semimonthly journal published by MDPI.

Please visit the Instructions for Authors page before submitting a manuscript. The Article Processing Charge (APC) for publication in this open access journal is 2400 CHF (Swiss Francs). Submitted papers should be well formatted and use good English. Authors may use MDPI's English editing service prior to publication or during author revisions.

Keywords

  • advances in perceptual image/video coding
  • advances in lossy and lossless image/video coding
  • advances in point cloud coding
  • advances in machine vision-oriented image/video coding
  • advances in image/video compressive sensing
  • reinforcement learning and game theory for image/video coding
  • real-time video encoding
  • scalable video coding (SVC)
  • enabling technologies for high-performance video coding
  • video coding for mobile communication
  • rate distortion theory for lossy image/video coding

Published Papers (1 paper)

Order results
Result details
Select all
Export citation of selected articles as:

Research

16 pages, 1324 KiB  
Article
Fast Hybrid Search for Automatic Model Compression
by Guilin Li , Lang Tang and Xiawu Zheng
Electronics 2024, 13(4), 688; https://doi.org/10.3390/electronics13040688 - 7 Feb 2024
Viewed by 537
Abstract
Neural network pruning has been widely studied for model compression and acceleration, to facilitate model deployment in resource-limited scenarios. Conventional methods either require domain knowledge to manually design the pruned model architecture and pruning algorithm, or AutoML-based methods to search the pruned model [...] Read more.
Neural network pruning has been widely studied for model compression and acceleration, to facilitate model deployment in resource-limited scenarios. Conventional methods either require domain knowledge to manually design the pruned model architecture and pruning algorithm, or AutoML-based methods to search the pruned model architecture but still prune all layers with a single pruning algorithm. However, many pruning algorithms have been proposed and they all differ regarding the importance they attribute to the criterion of filters. Therefore, we propose a hybrid search method, searching for the pruned model architecture and the pruning algorithm at the same time, which automatically finds the pruning ratio and pruning algorithm for each convolution layer. Moreover, to be more efficient, we divide the search process into two phases. Firstly, we search in a huge space with adaptive batch normalization, which is a fast but relatively inaccurate model evaluation method; secondly, we search based on the previous results and evaluate models by fine-tuning, which is more accurate. Therefore, our proposed hybrid search method is efficient, and achieves a clear improvement in performance compared to current state-of-the-art methods, including AMC, MetaPruning, and ABCPruner. For example, when pruning MobileNet, we achieve a 59.8% test accuracy on ImageNet with only 49 M FLOPs, which is 2.6% higher than MetaPruning. Full article
(This article belongs to the Special Issue Image and Video Coding Technology)
Show Figures

Figure 1

Back to TopTop