Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

HPC Digital Twins for Evaluating Scheduling Policies, Incentive Structures and their Impact on Power and Cooling

NLKI: A lightweight Natural Language Knowledge Integration Framework for Improving Small VLMs in Commonsense VQA Tasks

Interact-Custom: Customized Human Object Interaction Image Generation

A Self-Supervised Mixture-of-Experts Framework for Multi-behavior Recommendation

MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation

From Tabula Rasa to Emergent Abilities: Discovering Robot Skills via Real-World Unsupervised Quality-Diversity

Dynamic Triangulation-Based Graph Rewiring for Graph Neural Networks

STDiff: A State Transition Diffusion Framework for Time Series Imputation in Industrial Systems

LLMs Can't Handle Peer Pressure: Crumbling under Multi-Agent Social Interactions

Graph-R1: Incentivizing the Zero-Shot Graph Learning Capability in LLMs via Explicit Reasoning

Modality-Specific Speech Enhancement and Noise-Adaptive Fusion for Acoustic and Body-Conduction Microphone Framework

Humans Perceive Wrong Narratives from AI Reasoning Texts

SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning

Pareto Actor-Critic for Communication and Computation Co-Optimization in Non-Cooperative Federated Learning Services

Learning to Drive Ethically: Embedding Moral Reasoning into Autonomous Driving

Generative AI Against Poaching: Latent Composite Flow Matching for Wildlife Conservation

Privacy-Aware Detection of Fake Identity Documents: Methodology, Benchmark, and Improved Algorithms (FakeIDet2)

Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics

Steering Towards Fairness: Mitigating Political Bias in LLMs

Dynamic Context Compression for Efficient RAG

Irredundant $k$-Fold Cross-Validation

Prompt Engineering and the Effectiveness of Large Language Models in Enhancing Human Productivity

A Highly Clean Recipe Dataset with Ingredient States Annotation for State Probing Task

Entropy-Memorization Law: Evaluating Memorization Difficulty of Data in LLMs

The Joys of Categorical Conformal Prediction

Adversarial Manipulation of Reasoning Models using Internal Representations

Agent-to-Agent Theory of Mind: Testing Interlocutor Awareness among Large Language Models

A Hybrid Artificial Intelligence Method for Estimating Flicker in Power Systems (Changes are marked)

GLProtein: Global-and-Local Structure Aware Protein Representation Learning

Program Semantic Inequivalence Game with Large Language Models

DSO: Aligning 3D Generators with Simulation Feedback for Physical Soundness

Improving Quantization with Post-Training Model Expansion

Safe and Efficient Social Navigation through Explainable Safety Regions Based on Topological Features

A Simple Approach to Constraint-Aware Imitation Learning with Application to Autonomous Racing

Federated nnU-Net for Privacy-Preserving Medical Image Segmentation

ExPath: Targeted Pathway Inference for Biological Knowledge Bases via Graph Learning and Explanation

Enhancing Automated Loop Invariant Generation for Complex Programs with Large Language Models

RevPRAG: Revealing Poisoning Attacks in Retrieval-Augmented Generation through LLM Activation Analysis

Categorical Data Clustering via Value Order Estimated Distance Metric Learning

Application of AI to formal methods - an analysis of current trends

Reconsidering the Performance of GAE in Link Prediction

See then Tell: Enhancing Key Information Extraction with Vision Grounding

Enhancing Natural Language Inference Performance with Knowledge Graph for COVID-19 Automated Fact-Checking in Indonesian Language

Puppet-Master: Scaling Interactive Video Generation as a Motion Prior for Part-Level Dynamics

FFHFlow: Diverse and Uncertainty-Aware Dexterous Grasp Generation via Flow Variational Inference

SoAy: A Solution-based LLM API-using Methodology for Academic Information Seeking

Investigating the Robustness of Counterfactual Learning to Rank Models: A Reproducibility Study

Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off

Network Formation and Dynamics Among Multi-LLMs

NetGPT: Generative Pretrained Transformer for Network Traffic

OLKAVS: An Open Large-Scale Korean Audio-Visual Speech Dataset

Explainability of Text Processing and Retrieval Methods: A Survey

The Ramon Llull's Thinking Machine for Automated Ideation

RLMR: Reinforcement Learning with Mixed Rewards for Creative Writing

LLM-Based Agents for Competitive Landscape Mapping in Drug Asset Due Diligence

MSARL: Decoupling Reasoning and Tool Use with Multi-Small-Agent Reinforcement Learning

Automated Algorithmic Discovery for Gravitational-Wave Detection Guided by LLM-Informed Evolutionary Monte Carlo Tree Search

Can Large Language Models Develop Strategic Reasoning? Post-training Insights from Learning Chess

Technology as uncharted territory: Contextual integrity and the notion of AI as new ethical ground

Possible Principles for Aligned Structure Learning Agents

OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale

Prompt-to-Product: Generative Assembly via Bimanual Manipulation

OnGoal: Tracking and Visualizing Conversational Goals in Multi-Turn Dialogue with Large Language Models

Mixture of Contexts for Long Video Generation

FakeParts: a New Family of AI-Generated DeepFakes

Enabling Equitable Access to Trustworthy Financial Reasoning

Veritas: Generalizable Deepfake Detection via Pattern-Aware Reasoning

Understanding, Protecting, and Augmenting Human Cognition with Generative AI: A Synthesis of the CHI 2025 Tools for Thought Workshop

Inference-Time Alignment Control for Diffusion Models with Reinforcement Learning Guidance

ChainReaction! Structured Approach with Causal Chains as Intermediate Representations for Improved and Explainable Causal Video Question Answering

Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees

ExpertSim: Fast Particle Detector Simulation Using Mixture-of-Generative-Experts

WoW-Bench: Evaluating Fine-Grained Acoustic Perception in Audio-Language Models via Marine Mammal Vocalizations

ProactiveEval: A Unified Evaluation Framework for Proactive Dialogue Agents

Research Challenges in Relational Database Management Systems for LLM Queries

Quantum Verifiable Rewards for Post-Training Qiskit Code Assistant

AI Agentic Vulnerability Injection And Transformation with Optimized Reasoning

JADES: A Universal Framework for Jailbreak Assessment via Decompositional Scoring

Learning Primitive Embodied World Models: Towards Scalable Robotic Learning

Multi-Agent Penetration Testing AI for the Web

Uncertainty Aware-Predictive Control Barrier Functions: Safer Human Robot Interaction through Probabilistic Motion Forecasting

Exploring Machine Learning and Language Models for Multimodal Depression Detection

Speech Emotion Recognition via Entropy-Aware Score Selection

Surfel-based 3D Registration with Equivariant SE(3) Features

Evaluating Compositional Generalisation in VLMs and Diffusion Models

Safer Skin Lesion Classification with Global Class Activation Probability Map Evaluation and SafeML

Unleashing Uncertainty: Efficient Machine Unlearning for Generative AI

Signs of Struggle: Spotting Cognitive Distortions across Language and Register

Turning the Spell Around: Lightweight Alignment Amplification via Rank-One Safety Injection

Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding

SKGE-SWIN: End-To-End Autonomous Vehicle Waypoint Prediction and Navigation Using Skip Stage Swin Transformer

Occlusion Robustness of CLIP for Military Vehicle Classification

SeqVLM: Proposal-Guided Multi-View Sequences Reasoning via VLM for Zero-Shot 3D Visual Grounding

Provable Benefits of In-Tool Learning for Large Language Models

${C}^{3}$-GS: Learning Context-aware, Cross-dimension, Cross-scale Feature for Generalizable Gaussian Splatting

Rethinking Testing for LLM Applications: Characteristics, Challenges, and a Lightweight Interaction Protocol

EEGDM: Learning EEG Representation with Latent Diffusion Model

Generative Annotation for ASR Named Entity Correction

MobileCLIP2: Improving Multi-Modal Reinforced Training

Task Allocation for Autonomous Machines using Computational Intelligence and Deep Reinforcement Learning

A Few Words Can Distort Graphs: Knowledge Poisoning Attacks on Graph-based Retrieval-Augmented Generation of Large Language Models

Created by

Haebom

저자

Jiayi Wen, Tianxin Chen, Zhirun Zheng, Cheng Huang

개요

본 논문은 Graph-based Retrieval-Augmented Generation (GraphRAG) 모델의 취약점을 공격하는 두 가지 지식 중독 공격(Knowledge Poisoning Attacks, KPAs)을 제안합니다. GraphRAG는 원시 텍스트를 구조화된 지식 그래프로 변환하여 LLM의 정확성과 설명력을 향상시키는 모델이지만, 그래프 생성 과정에서 LLM이 원시 텍스트에서 지식을 추출하는 과정이 악의적으로 조작될 수 있습니다. 논문에서는 표적 지식 중독 공격(Targeted KPA, TKPA)과 범용 지식 중독 공격(Universal KPA, UKPA) 두 가지 공격 방법을 제시합니다. TKPA는 그래프 이론적 분석을 이용하여 생성된 그래프에서 취약한 노드를 찾아 해당 서술을 LLM으로 다시 작성하여 특정 질문응답(QA) 결과를 정확하게 제어합니다(성공률 93.1%). UKPA는 대명사나 의존 관계와 같은 언어적 단서를 이용하여 전반적으로 영향력 있는 단어를 변경하여 생성된 그래프의 구조적 무결성을 깨뜨립니다. 전체 텍스트의 0.05% 미만을 수정하여 QA 정확도를 95%에서 50%로 떨어뜨립니다. 실험 결과, 최첨단 방어 기법조차 이러한 공격을 탐지하지 못함을 보여주며, GraphRAG 파이프라인을 지식 중독으로부터 보호하는 것은 아직 미개척 분야임을 강조합니다.

시사점, 한계점

•

시사점: GraphRAG 모델의 취약점을 보여주는 새로운 지식 중독 공격 기법 (TKPA, UKPA)을 제시함으로써, LLM 기반 시스템의 보안 강화에 대한 중요성을 부각합니다. 최소한의 텍스트 수정으로도 큰 영향을 미칠 수 있음을 보여줍니다. 기존 방어 기법의 한계를 드러냄으로써 향후 연구 방향을 제시합니다.

•

한계점: 현재 제시된 공격은 특정한 GraphRAG 구현에 국한될 수 있습니다. 다양한 GraphRAG 모델과 데이터셋에 대한 추가적인 연구가 필요합니다. 제안된 공격에 대한 효과적인 방어 기법 개발이 향후 과제로 남습니다. 실제 환경에서의 공격 성공률에 대한 추가적인 검증이 필요합니다.

Made with Slashpage