arXiv 论文速递

2025-08-22 12:33
Snapshot: 20250822_1233
Unveiling Trust in Multimodal Large Language Models: Evaluation, Analysis, and Mitigation
Authors: Yichi Zhang, Yao Huang, Yifan Wang, Yitong Sun, Chang Liu, Zhe Zhao, Zhengwei Fang, Huanran Chen, Xiao Yang, Xingxing Wei, Hang Su, Yinpeng Dong, Jun Zhu · First: 2025-08-21T09:00:01+00:00 · Latest: 2025-08-21T09:00:01+00:00
Comments: For Appendix, please refer to arXiv:2406.07057
Abstract
The trustworthiness of Multimodal Large Language Models (MLLMs) remains an intense concern despite the significant progress in their capabilities. Existing evaluation and mitigation approaches often focus on narrow aspects and overlook risks introduced by the multimodality. To tackle these challenges, we propose MultiTrust-X, a comprehensive benchmark for evaluating, analyzing, and mitigating the trustworthiness issues of MLLMs. We define a three-dimensional framework, encompassing five trustworthiness aspects which include truthfulness, robustness, safety, fairness, and privacy; two novel risk types covering multimodal risks and cross-modal impacts; and various mitigation strategies from the perspectives of data, model architecture, training, and inference algorithms. Based on the taxonomy, MultiTrust-X includes 32 tasks and 28 curated datasets, enabling holistic evaluations over 30 open-source and proprietary MLLMs and in-depth analysis with 8 representative mitigation methods. Our extensive experiments reveal significant vulnerabilities in current models, including a gap between trustworthiness and general capabilities, as well as the amplification of potential risks in base LLMs by both multimodal training and inference. Moreover, our controlled analysis uncovers key limitations in existing mitigation strategies that, while some methods yield improvements in specific aspects, few effectively address overall trustworthiness, and many introduce unexpected trade-offs that compromise model utility. These findings also provide practical insights for future improvements, such as the benefits of reasoning to better balance safety and performance. Based on these insights, we introduce a Reasoning-Enhanced Safety Alignment (RESA) approach that equips the model with chain-of-thought reasoning ability to discover the underlying risks, achieving state-of-the-art results.
中文标题/摘要
标题:揭示多模态大语言模型中的信任:评估、分析与缓解
尽管多模态大语言模型(MLLMs)能力显著提升,其可信度仍是重点关注的问题。现有评估与缓解方法常局限于单一维度,忽视了多模态带来的风险。为此,我们提出MultiTrust-X——一个用于评估、分析和缓解MLLMs可信度问题的综合基准。我们构建了三维框架:包含真实性、鲁棒性、安全性、公平性和隐私性五个可信维度;涵盖多模态风险与跨模态影响的两种新型风险类型;以及从数据、模型架构、训练和推理算法角度出发的多种缓解策略。基于该体系,MultiTrust-X包含32项任务和28个精选数据集,实现对30余个开源与专有MLLMs的整体评估,并采用8种代表性缓解方法进行深度分析。大量实验揭示了当前模型的显著脆弱性,包括可信度与通用能力间的差距,以及多模态训练和推理对基础LLMs潜在风险的放大效应。控制实验进一步发现现有缓解策略的关键局限:虽部分方法在特定方面有所改进,但鲜有能有效提升整体可信度,且多数会引发损害模型效用的意外权衡。这些发现为未来改进提供了实践洞见,例如通过推理能力更好平衡安全与性能。基于此,我们提出思维链推理增强的安全对齐方法(RESA),使模型具备发现潜在风险的推理能力,取得了最先进的效果。
Largeness and generalized t-henselianity
Authors: Will Johnson · First: 2025-08-21T08:44:22+00:00 · Latest: 2025-08-21T08:44:22+00:00
Comments: 17 pages. Addendum to arXiv:2508.10886 [math.LO]
Abstract
Let $K$ be a countable field. Then $K$ is large in the sense of Pop if and only if it admits a field topology which is "generalized t-henselian" (gt-henselian) in the sense of Dittmann, Walsberg, and Ye, meaning that the implicit function theorem holds for polynomials. Moreover, the \'etale open topology can be characterized in terms of the gt-henselian topologies on $K$: a subset $U \subseteq K^n$ is open in the \'etale open topology if and only if it is open with respect to every gt-henselian topology on $K$.
中文标题/摘要
标题:大性与广义t-亨泽性
设$K$为可数域。则$K$在Pop意义下是大的,当且仅当它承认一种在Dittmann、Walsberg和Ye意义下的“广义t-亨泽”(gt-亨泽)域拓扑,即多项式满足隐函数定理。此外,étale开拓扑可以通过$K$上的gt-亨泽拓扑来刻画:子集$U \subseteq K^n$在étale开拓扑中开,当且仅当它对于$K$上的每一个gt-亨泽拓扑都是开的。
On the extremal functions of second order uncertainty principles: symmetry and symmetry breaking
Authors: Xiao-Ping Chen, Chun-Lei Tang · First: 2025-08-21T04:19:45+00:00 · Latest: 2025-08-21T04:19:45+00:00
Abstract
This paper focus on the symmetry and symmetry breaking about the second order Hydrogen Uncertainty Principle. \emph{Firstly}, by choosing a suitable test function, we give a negative answer to the conjecture presented by Cazacu, Flynn and Lam in [\emph{J. Funct. Anal.} \textbf{283} (2022), Paper No. 109659, 37 pp] for $N\in\{2,3\}$, and emphasizing the symmetry breaking phenomenon. \emph{Secondly}, we obtain a family of sharp weighted second order Hydrogen Uncertainty Principle, and prove the extremal functions are radial, which extends the work of Duong and Nguyen [The sharp second order Caffareli-Kohn-Nirenberg inequality and stability estimates for the sharp second order uncertainty principle, arXiv:2102.01425].
中文标题/摘要
标题:关于二阶不确定性原理极值函数的对称性与对称破缺
本文聚焦于二阶氢不确定性原理的对称性与对称破缺现象。首先,通过选取合适的测试函数,对Cazacu、Flynn和Lam在[J. Funct. Anal. 283 (2022)]中针对N∈{2,3}情形提出的猜想给出否定回答,并强调了对称破缺现象。其次,我们获得了一系列尖锐的加权二阶氢不确定性原理,证明极值函数具有径向性质,扩展了Duong和Nguyen在[arXiv:2102.01425]中的工作。
aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists
Authors: Pengsong Zhang, Xiang Hu, Guowei Huang, Yang Qi, Heng Zhang, Xiuxu Li, Jiaxing Song, Jiabin Luo, Yijiang Li, Shuo Yin, Chengxiao Dai, Eric Hanchen Jiang, Xiaoyan Zhou, Zhenfei Yin, Boqin Yuan, Jing Dong, Guinan Su, Guanren Qiao, Haiming Tang, Anghong Du, Lili Pan, Zhenzhong Lan, Xinyu Liu · First: 2025-08-20T23:16:41+00:00 · Latest: 2025-08-20T23:16:41+00:00
Comments: Preprint under review. Code is available at https://github.com/aixiv-org. Website is available at https://forms.gle/DxQgCtXFsJ4paMtn8
Abstract
Recent advances in large language models (LLMs) have enabled AI agents to autonomously generate scientific proposals, conduct experiments, author papers, and perform peer reviews. Yet this flood of AI-generated research content collides with a fragmented and largely closed publication ecosystem. Traditional journals and conferences rely on human peer review, making them difficult to scale and often reluctant to accept AI-generated research content; existing preprint servers (e.g. arXiv) lack rigorous quality-control mechanisms. Consequently, a significant amount of high-quality AI-generated research lacks appropriate venues for dissemination, hindering its potential to advance scientific progress. To address these challenges, we introduce aiXiv, a next-generation open-access platform for human and AI scientists. Its multi-agent architecture allows research proposals and papers to be submitted, reviewed, and iteratively refined by both human and AI scientists. It also provides API and MCP interfaces that enable seamless integration of heterogeneous human and AI scientists, creating a scalable and extensible ecosystem for autonomous scientific discovery. Through extensive experiments, we demonstrate that aiXiv is a reliable and robust platform that significantly enhances the quality of AI-generated research proposals and papers after iterative revising and reviewing on aiXiv. Our work lays the groundwork for a next-generation open-access ecosystem for AI scientists, accelerating the publication and dissemination of high-quality AI-generated research content. Code is available at https://github.com/aixiv-org. Website is available at https://forms.gle/DxQgCtXFsJ4paMtn8.
中文标题/摘要
标题:aiXiv:面向AI科学家的新一代开放获取科学发现生态系统
大型语言模型(LLM)的最新进展使得AI代理能自主生成科学提案、开展实验、撰写论文并进行同行评审。然而,AI生成研究内容的激增与碎片化且基本封闭的出版生态系统产生冲突。传统期刊和会议依赖人工同行评审,难以规模化且常拒收AI生成内容;现有预印本服务器(如arXiv)缺乏严格质控机制。这导致大量高质量AI研究成果缺乏传播渠道,阻碍其推动科学进步。为此我们推出aiXiv——面向人类与AI科学家的新一代开放获取平台。其多智能体架构支持研究提案与论文由人类和AI科学家共同提交、评审及迭代优化,并通过API和MCP接口实现异构科研主体的无缝集成,构建可扩展的自主科学发现生态系统。实验表明,经过aiXiv的迭代修订与评审,AI生成研究提案和论文质量显著提升。本研究为AI科学家构建了新一代开放获取生态基础,加速高质量AI研究成果的出版传播。代码详见https://github.com/aixiv-org,网站地址https://forms.gle/DxQgCtXFsJ4paMtn8。
Fast reliable pricing and calibration of the rough Heston model
Authors: Svetlana Boyarchenko, Marco de Innocentis, Sergei Levendorskiĭ · First: 2025-08-20T21:36:22+00:00 · Latest: 2025-08-20T21:36:22+00:00
Comments: arXiv admin note: text overlap with arXiv:2412.16067
Abstract
The paper is an extended and modified version of the preprint S.Boyarchenko and S.Levendorski\u{i} ``Correct implied volatility shapes and reliable pricing in the rough Heston model". We combine a modification of the Adams method with the SINH-acceleration method S.Boyarchenko and S.Levendorskii (IJTAF 2019, v.22) of Fourier inversion (iFT) to price vanilla options under the rough Heston model. For moderate or long maturities and strikes near spot, thousands of prices are computed in several milliseconds (ms) in Matlab on a Mac with moderate specs, with relative errors $\lesssim 10^{-4}$. Even for options close to expiry and far-OTM, the pricing takes a few tens or hundreds of ms. We show that, for the calibrated parameters in El Euch and Rosenbaum (Math.Finance 2019, v.29), the model implied vol surface is much flatter and fits the market data poorly; thus the calibration in op.cit. is a case of ``ghost calibration'' (M.Boyarchenko and S.Levendorski\u{i}, Quant. Finance 2015, v.15): numerical error and model specification error offset each other, creating an apparently good fit that vanishes when a more accurate pricer is used. We explain how such errors arise in popular iFT implementations that use fixed numerical parameters, yielding spurious smiles/skews, and provide numerical evidence that SINH acceleration is faster and more accurate than competing methods. Robust error control is ensured by a general Conformal Bootstrap principle that we formulate; the principle is applicable to many Fourier-pricing methods. We outline how this principle and our method enable accurate calibration procedures that are hundreds of times faster than approaches commonly used in the industry. Disclaimer: The views expressed herein are those of the authors only. No other representation should be attributed.
中文标题/摘要
标题:粗糙Heston模型的快速可靠定价与校准
本文是预印本S.Boyarchenko和S.Levendorskiĭ《粗糙Heston模型中正确的隐含波动率形态与可靠定价》的扩展修订版。我们结合改进的Adams方法与SINH加速法(S.Boyarchenko与S.Levendorskii,IJTAF 2019, v.22)进行傅里叶反演(iFT),实现粗糙Heston模型下普通期权的定价。在中等配置的Mac电脑上使用Matlab,对于中等或长期限且行权价接近现价的期权,可在数毫秒内计算数千个价格,相对误差≤10⁻⁴。即使临近到期或深度虚值期权,定价也仅需数十至数百毫秒。我们发现,采用El Euch和Rosenbaum(Math.Finance 2019, v.29)的校准参数时,模型隐含波动率曲面过于平坦,与市场数据拟合不佳,这属于‘幽灵校准’现象(M.Boyarchenko与S.Levendorskiĭ,Quant. Finance 2015, v.15):数值误差与模型设定误差相互抵消,形成虚假的良好拟合。我们揭示了固定参数iFT实现中产生伪波动率微笑/偏斜的机制,并通过数值证明SINH加速法更具速度与精度优势。基于提出的通用共形自举原则确保稳健误差控制,该原则可应用于多种傅里叶定价方法。本文方法使校准速度较行业常用方法提升数百倍,且保持高精度。免责声明:本文观点仅代表作者立场。
History