Research article

Large-scale content-based image retrieval system with metric learning and discrete binary encoding

  • Published: 09 July 2025
  • Hashing methods have been widely used in large-scale content-based image retrieval systems, which have extensive and practical applications in search engines and social networks. However, existing deep supervised hashing methods focus on enforcing the supervision information to learn the discrete hash codes, ignoring the latent clustering in the Hamming space. In addition, the discrete constraints in the above optimization problem are usually hard to solve. To address these issues, we propose a new deep supervised hashing method for image retrieval tasks. Specifically, we utilized a metric learning strategy to minimize the intra-class Hamming distance and maximize the inter-class Hamming distance as much as possible. Then, a novel label fitting approach was developed to generate discrete proxies and hash codes simultaneously. Finally, an effective alternating optimization method was designed to solve the discrete optimization problem. Experimental results on various datasets demonstrate that the proposed method can achieve promising retrieval performance.

    Citation: Yahui Liu, Jianbo Dai, Liang Hu, Guangxu Zhao. Large-scale content-based image retrieval system with metric learning and discrete binary encoding[J]. Electronic Research Archive, 2025, 33(7): 4151-4164. doi: 10.3934/era.2025186

    Related Papers:

  • Hashing methods have been widely used in large-scale content-based image retrieval systems, which have extensive and practical applications in search engines and social networks. However, existing deep supervised hashing methods focus on enforcing the supervision information to learn the discrete hash codes, ignoring the latent clustering in the Hamming space. In addition, the discrete constraints in the above optimization problem are usually hard to solve. To address these issues, we propose a new deep supervised hashing method for image retrieval tasks. Specifically, we utilized a metric learning strategy to minimize the intra-class Hamming distance and maximize the inter-class Hamming distance as much as possible. Then, a novel label fitting approach was developed to generate discrete proxies and hash codes simultaneously. Finally, an effective alternating optimization method was designed to solve the discrete optimization problem. Experimental results on various datasets demonstrate that the proposed method can achieve promising retrieval performance.



    加载中


    [1] P. Indyk, R. Motwani, Approximate nearest neighbors: Towards removing the curse of dimensionality, ACM Symp. Theory Comput., (1998), 604–613. https://doi.org/10.1145/276698.276876
    [2] C. S. Anan, R. I. Hartley, Optimised kd-trees for fast image descriptor matching, in IEEE Conference on Computer Vision and Pattern Recognition, (2008), 1–12. https://doi.org/10.1109/CVPR.2008.4587638
    [3] A. Gionis, P. Indyk, R. Motwani, Similarity search in high dimensions via hashing, in International Conference on Very Large Data Bases, (1999), 518–529. https://dl.acm.org/doi/10.1145/3701716.3717581
    [4] Y. C. Gong, S. Lazebnik, Iterative quantization: A procrustean approach to learning binary codes, in IEEE Conference on Computer Vision and Pattern Recognition, (2011), 817–8294. https://doi.org/10.1109/CVPR.2011.5995432
    [5] F. M. Shen, C. H. Shen, W. Liu, H. T. Shen, Supervised discrete hashing, in IEEE Conference on Computer Vision and Pattern Recognition, (2015), 37–45. https://doi.org/10.1109/CVPR.2015.7298598
    [6] J. Wang, W. Liu, S. Kumar, S. F. Chang, Learning to hash for indexing big data–-A survey, Proc. IEEE, 104 (2016), 34–57. https://doi.org/10.1109/JPROC.2015.2487976 doi: 10.1109/JPROC.2015.2487976
    [7] R. K. Xia, Y. Pan, H. J. Lai, C. Liu, S. C. Yan, Supervised hashing for image retrieval via image representation learning, in AAAI Conference on Artificial Intelligence, (2014), 2156–2162. https://doi.org/10.1609/AAAI.V28I1.8952
    [8] H. J. Lai, Y. Pan, Y. Liu, S. C. Yan, Simultaneous feature learning and hash coding with deep neural networks, in IEEE Conference on Computer Vision and Pattern Recognition, (2015), 3270–3278. https://doi.org/10.1109/CVPR.2015.7298947
    [9] F. Zhao, Y. Z. Huang, L. Wang, T. N. Tan, Deep semantic ranking based hashing for multi-label image retrieval, in IEEE Conference on Computer Vision and Pattern Recognition, (2015), 1556–1564. https://doi.org/10.1109/CVPR.2015.7298763
    [10] R. M. Zhang, L. Lin, R. Zhang, W. M. Zuo, L. Zhang, Bit-scalable deep hashing with regularized similarity learning for image retrieval and person re-identification, IEEE Trans. Image Process., 24(2015), 4766–4779. https://doi.org/10.1109/TIP.2015.2467315
    [11] W. J. Li, S. Wang, W. C. Kang, Feature learning based deep supervised hashing with pairwise labels, in International Joint Conference on Artificial Intelligence, (2016), 1711–1717. https://dl.acm.org/doi/10.5555/3060832.3060860
    [12] H. M. Liu, R. P. Wang, S. G. Shan, X. L. Chen, Deep supervised hashing for fast image retrieval, in IEEE Conference on Computer Vision and Pattern Recognition, (2016), 2064–2072. https://doi.org/10.1109/CVPR.2016.227
    [13] H. Zhu, M. S. Long, J. M. Wang, Y. Cao, Deep hashing network for efficient similarity retrieval, in Proceedings of the AAAI Conference on Artificial Intelligence, (2016), 2415–2421. https://doi.org/10.1609/AAAI.V30I1.10235
    [14] Y. Cao, M. S. Long, J. M. Wang, H. Zhu, Q. F. Wen, Deep quantization network for efficient image retrieval, in Proceedings of the AAAI Conference on Artificial Intelligence, (2016), 3457–3463. https://doi.org/10.1609/AAAI.V30I1.10455
    [15] Z. J. Cao, M. S. Long, J. M. Wang, P. S. Yu, Hashnet: Deep learning to hash by continuation, in Proceedings of the IEEE International Conference on Computer Vision, (2017), 5609–5618. https://doi.org/10.1109/ICCV.2017.598
    [16] Z. D. Chen, L. J. Zhao, Z. C. Zhang, X. Luo, X. S. Xu, Characteristics matching based hash codes generation for efficient fine-grained image retrieval, in IEEE Conference on Computer Vision and Pattern Recognition, (2024), 17273–17281. https://doi.org/10.1109/CVPR52733.2024.01635
    [17] L. D. Wang, Y. Pan, C. Liu, H.J. Lai, J. Yin, Y. Liu, Deep hashing with minimal-distance-separated hash centers, in IEEE Conference on Computer Vision and Pattern Recognition, (2023), 23455–23464. https://doi.org/10.1109/CVPR52729.2023.02246
    [18] L. Yuan, T. Wang, X. P. Zhang, F. E. H. Tay, Z. Q. Jie, Y. H. Tian, et. al., Learnable central similarity quantization for efficient image and video retrieval, IEEE Trans. Neural Networks Learn. Syst., 35 (2024), 18717–18730. https://doi.org/10.1109/TNNLS.2023.3321148 doi: 10.1109/TNNLS.2023.3321148
    [19] W. C. Kang, W. J. Li, Z. H. Zhou, Column sampling based discrete supervised hashing, in Proceedings of the AAAI Conference on Artificial Intelligence, (2016), 1230–1236. https://doi.org/10.1609/aaai.v30i1.10176
    [20] K. Chatfield, K. Simonyan, A. Vedaldi, A. Zisserman, Return of the devil in the details: Delving deep into convolutional nets, preprint, arXiv: 1405.3531.
    [21] A. Krizhevsky, I. Sutskever, G. E. Hinton, Imagenet classification with deep convolutional neural networks, Adv. Neural Inf. Process. Syst., (2012), 1106–1114. https://doi.org/10.1145/3065386
    [22] V. E. Liong, J. W. Lu, Deep coupled metric learning for cross-modal matching, IEEE Trans. Multimedia, 19 (2017), 1234– 1244. https://doi.org/10.1109/TMM.2016.264618 doi: 10.1109/TMM.2016.264618
    [23] Y. Netzer, T. Wang, A. Coates, A. Bissacco, B. Wu, A. Y. Ng, Reading digits in natural images with unsupervised feature learning, in NIPS Workshop on Deep Learning and Unsupervised Feature Learning, (2011), 12–17. https://doi.org/10.1007/978-3-319-10593-2_33
    [24] T. S. Chua, J. H. Tang, R. C. Hong, H. J. Li, Z. P. Luo, Y. T. Zheng, NUS-WIDE: a real-world web image database from national university of singapore, in ACM International Conference on Image and Video Retrieval, (2009), 1–9. https://doi.org/10.1145/1646396.1646452
    [25] J. Wang, S. Kumar, S. F. Chang, Semi-supervised hashing for large-scale search, IEEE Trans. Pattern Anal. Mach. Intell., 34 (2012), 2393–2406. https://doi.org/10.1109/TPAMI.2012.48 doi: 10.1109/TPAMI.2012.48
    [26] P. C. Zhang, W. Zhang, W. J. Li, M. Y. Guo, Supervised hashing with latent factor models, in ACM SIGIR Conference on Research and Development in Information Retrieval, (2014), 173–182. https://doi.org/10.1145/2600428.2609600
    [27] G. S. Lin, C. H. Shen, Q. F. Shi, A. V. D. Hengel, D. Suter, Fast supervised hashing with decision trees for high-dimensional data, in IEEE Conference on Computer Vision and Pattern Recognition, (2014), 1971–1978. https://doi.org/10.1109/CVPR.2014.253
  • Reader Comments
  • © 2025 the Author(s), licensee AIMS Press. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0)
通讯作者: 陈斌, bchen63@163.com
  • 1. 

    沈阳化工大学材料科学与工程学院 沈阳 110142

  1. 本站搜索
  2. 百度学术搜索
  3. 万方数据库搜索
  4. CNKI搜索

Metrics

Article views(730) PDF downloads(34) Cited by(0)

Article outline

Figures and Tables

Figures(3)  /  Tables(4)

Other Articles By Authors

/

DownLoad:  Full-Size Img  PowerPoint
Return
Return

Catalog