MAF-Net:面向图像伪造检测的多粒度注意力融合网络

文章摘要

MAF-Net:面向图像伪造检测的多粒度注意力融合网络

MAF-Net: A Multi-granularity Attention Fusion Network for Image Forgery Detection

投稿时间：2026-01-05 修订日期：2026-01-18

DOI：

中文关键词: 全局-局部双分支选择性核卷积高效通道注意力注意力机制门控机制图像伪造检测

英文关键词: Global-Local Dual-Branch Selective Kernel Convolution Efficient Channel Attention Attention Mechanism Gating Mechanism Image Forgery Detection

基金项目:贵州省教育厅自然科学研究项目(黔教技[2023] 012)；贵州民族大学校级科研项目(GZMUZK[2021] YB23,GZMUZK[2023] QN10)；

作者	单位	邮编
朱莲	贵州民族大学	550025
张乾^*	贵州警察学院计算机科学系	550005
金海艳	贵州民族大学

摘要点击次数: 40

全文下载次数: 0

中文摘要:

随着人工智能的发展，人眼已很难判别图像是自然图像还是通过生成对抗网络或扩散模型等生成式模型生成的。但现有的生成图像检测方法普遍存在泛化能力不足的问题。为此，本文提出一种多粒度注意力融合网络(MAF-Net)，其核心在于系统性地协同三个层面的信息粒度：1)架构粒度上，设计全局-局部双分支，分别建模图像的整体语义上下文与局部细微伪造痕迹；2)特征表示粒度上，在骨干网络中，进一步集成选择性核卷积(SKConv)灵活捕捉不同尺度的伪造特征；同时引入轻量级高效通道注意力(ECA)强化对关键伪造痕迹的信道特征响应；3)注意力粒度上，构建由空间硬注意力、自注意力与全局语义引导的高效交互注意力组成的三重协同机制，并引入门控机制实现从区域聚焦到跨粒度特征的自适应融合。MAF-Net在ForenSynths数据集上的AP、ACC与AUC均值分别为99.3%、95.1%、99.1%；在GenImage数据集上的AP、ACC与AUC均值分别为99.5%、91.3%、99.3%，验证了该模型卓越的泛化能力与鉴定效能。

英文摘要:

With the advancement of artificial intelligence, it has become increasingly difficult for the human eye to distinguish whether an image is natural or generated by generative models such as generative adversarial networks or diffusion models. However, existing generated image detection methods generally suffer from insufficient generalization capability. To address this issue, this paper proposes a multi-granularity attention fusion network (MAF-Net), whose core lies in the systematic integration of information from three granularity levels: 1) At the architectural granularity, a global-local dual-branch structure is designed to model the overall semantic context and local subtle forgery traces of an image, respectively. 2) At the feature representation granularity, the backbone network is enhanced by integrating selective kernel convolution (SKConv) to flexibly capture multi-scale forgery features, along with a lightweight efficient channel attention (ECA) module to strengthen the channel-wise response to critical forgery artifacts. 3) At the attentional granularity, a triple collaborative attention mechanism comprising spatial hard attention, self-attention, and global semantic-guided efficient interactive attention are constructed. A gating mechanism is further introduced to achieve adaptive fusion from regional focusing to cross-granularity feature integration. MAF-Net achieves average AP, ACC, and AUC scores of 99.3%, 95.1%, and 99.1%?on the ForenSynths dataset, and 99.5%, 91.3%, and 99.3%?on the GenImage dataset, demonstrating its excellent generalization capability and detection performance.

View Fulltext 查看/发表评论下载PDF阅读器

关闭