• 首页
  • 期刊简介
  • 编委会
  • 投稿指南
  • 收录情况
  • 杂志订阅
  • 联系我们
引用本文:【点击复制】
【打印本页】   【下载PDF全文】   【查看/发表评论】  【下载PDF阅读器】  
←前一篇|后一篇→ 过刊浏览
分享到: 微信 更多
各向异性双曲神经网络在细粒度和分层图像分类中的应用
陈羿  马令坤  章为川
陕西科技大学电子信息与人工智能学院
摘 要: 图像分类任务常涉及层次化类别结构、非均匀语义差异及细粒度区分,这些特征在欧几里得空间中难以有效建模。现有方法多假设各向同性曲率,并依赖基于距离的softmax分类器,难以捕捉特征方向变化和形成紧凑类簇。为解决上述问题,提出各向异性双曲神经网络(AHNN),以Vision Transformer为特征骨干,在指数映射前引入可学习各向异性变换,实现方向特定扩展,更好对齐视觉数据内在结构。同时设计双曲质心损失(Hyperbolic Centroid Loss),结合基于双曲距离的交叉熵、原型正则化和辅助欧几里得损失,提升类内聚类与训练稳定性。在六个有代表性基准数据集上的实验表明,所提方法显著优于欧几里得基线和现有双曲模型,尤其在细粒度和层次化数据集上优势明显。
关键词: 双曲几何  各向异性嵌入  Vision Transformer  度量学习  基于原型的分类  层次表示
中图分类号: TP391.4    文献标识码: 
基金项目: 国家自然科学基金项目(面上项目,重点项目,重大项目)
Anisotropic Hyperbolic Neural Networks for Fine-grained and Hierarchical Image Classification
CHEN Yi1,2,3,4, MA Lingkun1,2,3,4, ZHANG Weichuan1,2,3,4
1.College of Electronic Information and Artificial Intelligence, Shaanxi University of Science and Technology, Xi'2.'3.an 710021;4.China
Abstract: Image classification tasks often involve hierarchical category structures, non-uniform semantic differences, and fine-grained distinctions, which are difficult to effectively model in Euclidean space. Existing methods often assume isotropic curvature and rely on distance-based softmax classifiers, making it difficult to capture changes in feature direction and form compact class clusters. To address these issues, we propose the Anisotropic Hyperbolic Neural Network (AHNN), which utilizes the Vision Transformer as the feature backbone and introduces a learnable anisotropic transformation before exponential mapping to achieve direction-specific expansion and better align with the inherent structure of visual data. Additionally, we design the Hyperbolic Centroid Loss, which combines cross-entropy based on hyperbolic distance, prototype regularization, and auxiliary Euclidean loss to enhance intra-class clustering and training stability. Experiments on six representative benchmark datasets demonstrate that the proposed method significantly outperforms Euclidean baselines and existing hyperbolic models, especially on fine-grained and hierarchical datasets.
Keywords: hyperbolic geometry  anisotropic embedding  Vision Transformer  metric learning  prototype-based classification  hierarchical representation


版权所有:软件工程杂志社
地址:辽宁省沈阳市浑南区创新路195号 邮政编码:110169
电话:0411-84767887 传真:0411-84835089 Email:semagazine@neusoft.edu.cn
备案号:辽ICP备17007376号-1
技术支持:北京勤云科技发展有限公司

用微信扫一扫

用微信扫一扫