
GitHub - jam-cc/MMAD: The Codes and Data of A Comprehensive …
Oct 16, 2024 · To bridge this gap, we present MMAD, the first-ever full-spectrum MLLMs benchmark in industrial Anomaly Detection. We defined seven key subtasks of MLLMs in industrial inspection and designed a novel pipeline to generate the MMAD dataset with 39,672 questions for 8,366 industrial images.
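ICLR 2025 | Can Multimodal Large Models Handle Industrial Anomaly Detection? The MMAD Benchmark Reveals …
The final MMAD dataset contains 8,366 industrial images covering 38 product categories and 244 defect types, yielding 39,672 multiple-choice questions and forming the most comprehensive benchmark of MLLM capabilities in the industrial domain. (Left) Data information of the MMAD dataset, covering 7 key subtasks and 38 representative IAD categories.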
MMAD: A Comprehensive Benchmark for Multimodal Large …
Oct 12, 2024 · To bridge this gap, we present MMAD, the first-ever full-spectrum MLLMs benchmark in industrial Anomaly Detection. We defined seven key subtasks of MLLMs in industrial inspection and designed a novel pipeline to generate the MMAD dataset with 39,672 questions for 8,366 industrial images.
MMAD: A Comprehensive Benchmark for Multimodal Large …
Jan 22, 2025 · With MMAD, we have conducted a comprehensive, quantitative evaluation of various state-of-the-art MLLMs. The commercial models performed the best, with the average accuracy of GPT-4o models reaching 74.9%. However, this …
MMAD: A Comprehensive Benchmark for Multimodal Large …
Feb 21, 2025 · With MMAD, we have conducted a comprehensive, quantitative evaluation of various state-of-the-art (SOTA) MLLMs, including the GPT-4 series and Gemini 1.5 series (Reid et al., 2024), as well as open-source image models like InternVL2 (Chen et al., 2023) and LLaVA-NeXT (Liu et al., 2024a), and industry anomaly detection models like AnomalyGPT (Gu ...
MMAD: The First-Ever Comprehensive Benchmark for Multimodal …
With MMAD, we have conducted a comprehensive, quantitative evaluation of various state-of-the-art MLLMs. The commercial models performed the best, with the average accuracy of GPT-4o models reaching 74.9%.
Paper page - MMAD: The First-Ever Comprehensive Benchmark …
Oct 12, 2024 · To bridge this gap, we present MMAD, the first-ever full-spectrum MLLMs benchmark in industrial Anomaly Detection. We defined seven key subtasks of MLLMs in industrial inspection and designed a novel pipeline to generate the MMAD dataset with 39,672 questions for 8,366 industrial images.
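MMAD | Industrial Anomaly Detection Dataset | Multimodal Large Language Model Dataset
Oct 12, 2024 · The MMAD dataset is the first comprehensive benchmark for multimodal large language models in industrial anomaly detection, jointly created by Southern University of Science and Technology, Tencent YouTu Lab, and other institutions. It contains 39,672 multiple-choice questions based on 8,366 industrial images, covering 38 industrial product categories and 244 defect types.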
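ICLR 2025 | Can Multimodal Large Models Handle Industrial Anomaly Detection? The MMAD Benchmark Reveals …
Feb 17, 2025 · The final MMAD dataset contains 8,366 industrial images covering 38 product categories and 244 defect types, yielding 39,672 multiple-choice questions and forming the most comprehensive benchmark of MLLM capabilities in the industrial domain. (Left) Data information of the MMAD dataset, covering 7 key subtasks and 38 representative IAD categories.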