An Infrared and Visible Image Alignment Method Based on Gradient Distribution Properties and Scale-Invariant Features in Electric Power Scenes
Abstract
:1. Introduction
- 1.
- A normalised directional histogram of orientation gradients combined with scale-invariant feature transform (HOFT) is proposed as an alignment algorithm for infrared and visible images;
- 2.
- We propose a feature descriptor that can overcome the effects of scale, rotation, and viewpoint differences to accurately describe key information in visible and infrared images;
- 3.
- We optimise and design an efficient matching strategy to obtain denser matching pair information and more accurate image alignment results.
2. Related Work
2.1. Region-Based Image Alignment Method
2.2. Feature-Based Image Alignment Method
3. Methods
3.1. Edge Detection and Feature Extraction
3.2. Generate Feature Descriptors
- 1.
- An 8 × 8 window neighbourhood is selected centred on the key point, each window represents a pixel in the scale space where the neighbourhood of the key point is located, these are weighted by a Gaussian window, represented by an overlaying circle, the direction of the arrow represents the gradient direction of that pixel, and the length of the arrow represents the gradient magnitude, and the gradient magnitude and direction are calculated for each pixel in the image.
- 2.
- Divide the image into Cells and compute the gradient information for each Cell.
- 3.
- Equalise the entire rectangular neighbourhood into 4 × 4 sub-neighbourhoods, combine the Cell within each sub-area into a Block, and compute the gradient features within each Block.
- 4.
- The neighbourhood window centred on the key point is inwardly sampled, the Block in each sub-neighbourhood is combined to form the HOG features, the feature direction creates the histogram of gradient direction at intervals.
- 5.
- The HOG feature descriptor of the image is obtained by normalisation. The generated feature vector is first subjected to an initial normalisation process, which usually involves dividing each element in the feature vector by the number of vanes of the feature vector so that its length becomes 1.A is the original feature vector and is the normalised feature vector. To further improve the discriminative nature of the feature descriptors, the normalised feature vector is also truncated. Element values greater than 0.2 in the feature vector are truncated to 0.2 to reduce the impact of too-large feature values on the matching results. After truncation, the length of the eigenvector may change, so it needs to be normalised again by dividing each element in the eigenvector by the number of vanes of the eigenvector so that its length becomes 1.
3.3. Feature Matching
4. Experiments and Discussion
4.1. Datasets and Evaluation Indicators
4.2. Comparison of Results with Other Methods
4.3. Evaluation of Improved Feature Matching
5. Conclusions
- 1.
- Deep optimisation of feature extraction and description: More fine-grained and robust extraction and description of features for infrared and visible images avoids the tedious process of manually selecting and designing features, and at the same time is able to better handle complex image transformations.
- 2.
- Deep learning-driven similarity measurement and matching strategy: Deep learning has obvious advantages such as being completely data-driven and being able to extract deep semantic features of images. Deep learning-based matching strategies are able to find the best correspondence between two images.
- 3.
- End-to-end alignment network design: The end-to-end alignment network can simplify the alignment process, improve the alignment efficiency, and potentially achieve higher alignment accuracy.
- 4.
- Hybrid model and multimodal fusion strategy: Combine traditional feature-based methods with deep learning algorithms to form a hybrid model. At the same time, multimodal fusion strategies for infrared and visible images are explored to make full use of information from both images.
- 5.
- Small-sample learning and migration learning: Explore deep learning training strategies based on small samples, such as migration learning and semi-supervised learning, in the case of a lack of datasets.
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
Abbreviations
CSS | Curvature Scale Space |
HOG | Histogram of Orientation Gradients |
SIFT | Scale-Invariant Feature Transform |
FLANN | Fast Library for Approximate Nearest Neighbours |
RANSAC | Random Sample Consensus |
HOFT | Histogram of Orientation Gradients Combined with Feature Transform |
MI | Mutual Information |
NCC | Normalised Cross-Correlation |
SURF | Speeded-Up Robust Features |
PIIFD | Partial-Intensity Invariant Feature Descriptor |
EAT | Enhanced Affine Transform |
References
- Luo, Y.; Yu, X.; Yang, D.; Zhou, B. A survey of intelligent transmission line inspection based on unmanned aerial vehicle. Artif. Intell. Rev. 2023, 56, 173–201. [Google Scholar] [CrossRef]
- Ma, J.; Jiang, X.; Fan, A.; Jiang, J.; Yan, J. Image matching from handcrafted to deep features: A survey. Int. J. Comput. Vis. 2021, 129, 23–79. [Google Scholar] [CrossRef]
- Jhan, J.P.; Rau, J.Y. A generalized tool for accurate and efficient image registration of UAV multi-lens multispectral cameras by N-SURF matching. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 6353–6362. [Google Scholar] [CrossRef]
- Ma, W.; Wang, K.; Li, J.; Yang, S.X.; Li, J.; Song, L.; Li, Q. Infrared and visible image fusion technology and application: A review. Sensors 2023, 23, 599. [Google Scholar] [CrossRef] [PubMed]
- Zhou, H.; Yu, G. Research on pedestrian detection technology based on the SVM classifier trained by HOG and LTP features. Future Gener. Comput. Syst. 2021, 125, 604–615. [Google Scholar] [CrossRef]
- Li, L.; Wang, S.; Zeng, W.; Qin, F. Research on Zero Watermarking Technique Based on Improved Sobel Operator. In Proceedings of the 2021 IEEE International Conference on Artificial Intelligence and Industrial Design (AIID), Guangzhou, China, 28–30 May 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 594–597. [Google Scholar]
- Awrangjeb, M.; Lu, G. An improved curvature scale-space corner detector and a robust corner matching approach for transformed image identification. IEEE Trans. Image Process. 2008, 17, 2425–2441. [Google Scholar] [CrossRef]
- Zhang, W.; Zhao, Y. An improved SIFT algorithm for registration between SAR and optical images. Sci. Rep. 2023, 13, 6346. [Google Scholar] [CrossRef]
- Andronache, A.; von Siebenthal, M.; Székely, G.; Cattin, P. Non-rigid registration of multi-modal images using both mutual information and cross-correlation. Med. Image Anal. 2008, 12, 3–15. [Google Scholar] [CrossRef]
- Pluim, J.P.; Maintz, J.A.; Viergever, M.A. Image Registration by Maximization of Combined Mutual Information and Gradient Information. In Medical Image Computing and Computer-Assisted Intervention–MICCAI 2000, Proceedings of the Third International Conference, Pittsburgh, PA, USA, 11–14 October 2000; Proceedings 3; Springer: Berlin/Heidelberg, Germany, 2000; pp. 452–461. [Google Scholar]
- Lowe, D.G. Distinctive image features from scale-invariant keypoints. Int. J. Comput. Vis. 2004, 60, 91–110. [Google Scholar] [CrossRef]
- Jiang, Q.; Liu, Y.; Yan, Y.; Deng, J.; Fang, J.; Li, Z.; Jiang, X. A contour angle orientation for power equipment infrared and visible image registration. IEEE Trans. Power Deliv. 2020, 36, 2559–2569. [Google Scholar] [CrossRef]
- Bay, H.; Ess, A.; Tuytelaars, T.; Van Gool, L. Speeded-up robust features (SURF). Comput. Vis. Image Underst. 2008, 110, 346–359. [Google Scholar] [CrossRef]
- Tang, C.; Tian, G.Y.; Chen, X.; Wu, J.; Li, K.; Meng, H. Infrared and visible images registration with adaptable local-global feature integration for rail inspection. Infrared Phys. Technol. 2017, 87, 31–39. [Google Scholar] [CrossRef]
- Chen, J.; Tian, J.; Lee, N.; Zheng, J.; Smith, R.T.; Laine, A.F. A partial intensity invariant feature descriptor for multimodal retinal image registration. IEEE Trans. Biomed. Eng. 2010, 57, 1707–1718. [Google Scholar] [CrossRef] [PubMed]
- Du, Q.; Fan, A.; Ma, Y.; Fan, F.; Huang, J.; Mei, X. Infrared and visible image registration based on scale-invariant piifd feature and locality preserving matching. IEEE Access 2018, 6, 64107–64121. [Google Scholar] [CrossRef]
- Lu, C.; Qi, X.; Ding, K.; Yu, B. An Improved FAST Algorithm Based on Image Edges for Complex Environment. Sensors 2022, 22, 7127. [Google Scholar] [CrossRef]
- Andriyanov, N.A.; Magdeev, R.G.; Dementyiev, V.E. Building Structures Image Matching Using Convolutional Neural Network. In Proceedings of the 2024 Systems of Signals Generating and Processing in the Field of on Board Communications, Moscow, Russia, 12–14 March 2024; pp. 1–4. [Google Scholar] [CrossRef]
- Harris, C.; Stephens, M. A Combined Corner and Edge Detector. In Proceedings of the Alvey Vision Conference, Manchester, UK, 13–September 1988; Volume 15, pp. 147–151. [Google Scholar]
- Zhang, W. Robust registration of SAR and optical images based on deep learning and improved Harris algorithm. Sci. Rep. 2022, 12, 5901. [Google Scholar] [CrossRef]
- Si, H.; Wang, Y.; Liu, Q.; Li, W.; Wan, L.; Song, J.; Zhao, W.; Sun, C. A registration algorithm for the infrared and visible images of apple based on active contour model. Vis. Comput. 2024, 40, 2833–2855. [Google Scholar] [CrossRef]
- Min, C.; Gu, Y.; Li, Y.; Yang, F. Non-rigid infrared and visible image registration by enhanced affine transformation. Pattern Recognit. 2020, 106, 107377. [Google Scholar] [CrossRef]
- Wang, J.; Yu, H. A new chaos cat swarm optimization algorithm based on saliency gradient for power equipment infrared and visible images registration. Evol. Intell. 2023, 16, 1951–1965. [Google Scholar] [CrossRef]
- Han, L.; Tian, Y.; Qi, Q. Research on Edge Detection Algorithm Based on Improved Sobel Operator. In Proceedings of the MATEC Web of Conferences, Sanya, China, 22–23 December 2019; EDP Sciences: Les Ulis, France, 2020; Volume 309, p. 03031. [Google Scholar]
- Zhang, C.C.; Fang, J.D. Edge Detection Based on Improved Sobel Operator. In Proceedings of the 2016 International Conference on Computer Engineering and Information Systems, Guilin, China, 24–25 September 2016; Atlantis Press: Amsterdam, The Netherlands, 2016; pp. 129–132. [Google Scholar]
- Tian, R.; Sun, G.; Liu, X.; Zheng, B. Sobel edge detection based on weighted nuclear norm minimization image denoising. Electronics 2021, 10, 655. [Google Scholar] [CrossRef]
- Sun, X.; Zhong, B.; Yang, J.; Ma, K.K. Corner detection via scale-space behavior-guided trajectory tracing. IEEE Signal Process. Lett. 2023, 30, 50–54. [Google Scholar] [CrossRef]
- Wu, Q.; Zhu, S. Multispectral image matching method based on histogram of maximum gradient and edge orientation. IEEE Geosci. Remote Sens. Lett. 2021, 19, 1–5. [Google Scholar] [CrossRef]
- Vashaee, A.; Jafari, R.; Ziou, D.; Rashidi, M.M. Rotation invariant HOG for object localization in web images. Signal Process. 2016, 125, 304–314. [Google Scholar] [CrossRef]
- Kihl, O.; Picard, D.; Gosselin, P.H. A unified framework for local visual descriptors evaluation. Pattern Recognit. 2015, 48, 1174–1184. [Google Scholar] [CrossRef]
- Dong, L.; Jin, J.; Jiang, Y.; Zhang, M.; Xu, W. Selection-based subpixel-shifted images super-resolution. IEEE Access 2019, 7, 110951–110963. [Google Scholar] [CrossRef]
- Ma, W.; Wen, Z.; Wu, Y.; Jiao, L.; Gong, M.; Zheng, Y.; Liu, L. Remote sensing image registration with modified SIFT and enhanced feature matching. IEEE Geosci. Remote Sens. Lett. 2016, 14, 3–7. [Google Scholar] [CrossRef]
- Hossein-Nejad, Z.; Nasri, M. An adaptive image registration method based on SIFT features and RANSAC transform. Comput. Electr. Eng. 2017, 62, 524–537. [Google Scholar] [CrossRef]
- Han, L.; Wang, J.; Zhang, Y.; Sun, X.; Wu, X. Research on adaptive ORB-SURF image matching algorithm based on fusion of edge features. IEEE Access 2022, 10, 109488–109497. [Google Scholar] [CrossRef]
- Wang, Z.; Feng, X.; Xu, G.; Wu, Y. A robust visible and infrared image matching algorithm for power equipment based on phase congruency and scale-invariant feature. Opt. Lasers Eng. 2023, 164, 107517. [Google Scholar] [CrossRef]
Characteristic | Group | Image Pair Details and Differences |
---|---|---|
No significant difference in scale, rotation, and viewpoint | 1 | Image Information of Power Lines |
2 | ||
3 | Image Information of Power Equipment | |
4 | ||
5 | ||
With significant differences in scale, rotation, and viewpoint | 6 | Differences in Scale are Evident |
7 | ||
8 | Differences in Rotation are Evident | |
9 | ||
10 | Differences in Scale, Rotation, and viewpoint | |
11 | ||
12 |
Group | 1 | 2 | 3 | 4 | 5 | 6 |
Running Time | 8.7681 s | 7.8447 s | 7.9695 s | 8.5594 s | 7.5707 s | 7.9513 s |
Group | 7 | 8 | 9 | 10 | 11 | 12 |
Running Time | 8.4584 s | 8.1070 s | 9.2244 s | 14.9479 s | 14.1430 s | 13.7984 s |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Zhu, L.; Mao, Y.; Chen, C.; Ning, L. An Infrared and Visible Image Alignment Method Based on Gradient Distribution Properties and Scale-Invariant Features in Electric Power Scenes. J. Imaging 2025, 11, 23. https://doi.org/10.3390/jimaging11010023
Zhu L, Mao Y, Chen C, Ning L. An Infrared and Visible Image Alignment Method Based on Gradient Distribution Properties and Scale-Invariant Features in Electric Power Scenes. Journal of Imaging. 2025; 11(1):23. https://doi.org/10.3390/jimaging11010023
Chicago/Turabian StyleZhu, Lin, Yuxing Mao, Chunxu Chen, and Lanjia Ning. 2025. "An Infrared and Visible Image Alignment Method Based on Gradient Distribution Properties and Scale-Invariant Features in Electric Power Scenes" Journal of Imaging 11, no. 1: 23. https://doi.org/10.3390/jimaging11010023
APA StyleZhu, L., Mao, Y., Chen, C., & Ning, L. (2025). An Infrared and Visible Image Alignment Method Based on Gradient Distribution Properties and Scale-Invariant Features in Electric Power Scenes. Journal of Imaging, 11(1), 23. https://doi.org/10.3390/jimaging11010023