Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

IG Parser: A Software Package for the Encoding of Institutional Statements using the Institutional Grammar

One-Step Offline Distillation of Diffusion-based Models via Koopman Modeling

J4R: Learning to Judge with Equivalent Initial State Group Relative Policy Optimization

Evaluating the efficacy of LLM Safety Solutions : The Palit Benchmark Dataset

From Assistants to Adversaries: Exploring the Security Risks of Mobile LLM Agents

Leveraging LLM Inconsistency to Boost Pass@k Performance

Sinusoidal Initialization, Time for a New Start

Enhancing Channel-Independent Time Series Forecasting via Cross-Variate Patch Embedding

Any-to-Any Learning in Computational Pathology via Triplet Multimodal Pretraining

Predicting Turn-Taking and Backchannel in Human-Machine Conversations Using Linguistic, Acoustic, and Visual Signals

ChromFound: Towards A Universal Foundation Model for Single-Cell Chromatin Accessibility Data

IP Leakage Attacks Targeting LLM-Based Multi-Agent Systems

RoboFAC: A Comprehensive Framework for Robotic Failure Analysis and Correction

LLM-DSE: Searching Accelerator Parameters with LLM Agents

Online Iterative Self-Alignment for Radiology Report Generation

Reachability Barrier Networks: Learning Hamilton-Jacobi Solutions for Smooth and Flexible Control Barrier Functions

Improving Medium Range Severe Weather Prediction through Transformer Post-processing of AI Weather Forecasts

BioCube: A Multimodal Dataset for Biodiversity Research

One Shot Dominance: Knowledge Poisoning Attack on Retrieval-Augmented Generation Systems

TCC-Bench: Benchmarking the Traditional Chinese Culture Understanding Capabilities of MLLMs

FALCON: False-Negative Aware Learning of Contrastive Negatives in Vision-Language Pretraining

GRoQ-Loco: Generalist and Robot-agnostic Quadruped Locomotion Control using Offline Datasets

Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation

Artificial Intelligence Bias on English Language Learners in Automatic Scoring

Learning Long-Context Diffusion Policies via Past-Token Prediction

Fast Text-to-Audio Generation with Adversarial Post-Training

Unified Continuous Generative Models

Can LLM-based Financial Investing Strategies Outperform the Market in Long Run?

Technical Report: Quantifying and Analyzing the Generalization Power of a DNN

Prompting Large Language Models for Training-Free Non-Intrusive Load Monitoring

On-Device LLM for Context-Aware Wi-Fi Roaming

Efficient Fine-Tuning of Quantized Models via Adaptive Rank and Bitwidth

Understanding University Students' Use of Generative AI: The Roles of Demographics and Personality Traits

Adaptive Thinking via Mode Policy Optimization for Social Language Agents

LLM-hRIC: LLM-empowered Hierarchical RAN Intelligent Control for O-RAN

On the Boolean Network Theory of Datalog$^\neg$

Learning to Reason under Off-Policy Guidance

How Effective Can Dropout Be in Multiple Instance Learning ?

Learning Joint ID-Textual Representation for ID-Preserving Image Synthesis

Walk the Talk? Measuring the Faithfulness of Large Language Model Explanations

Cross-Document Cross-Lingual NLI via RST-Enhanced Graph Fusion and Interpretability Prediction

S1-Bench: A Simple Benchmark for Evaluating System 1 Thinking Capability of Large Reasoning Models

Beyond Self-Reports: Multi-Observer Agents for Personality Assessment in Large Language Models

Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation

Scaling Test-Time Inference with Policy-Optimized, Dynamic Retrieval-Augmented Generation via KV Caching and Decoding

Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization

LogicQA: Logical Anomaly Detection with Vision Language Model Generated Questions

Conjuring Positive Pairs for Efficient Unification of Representation Learning and Image Synthesis

CRCE: Coreference-Retention Concept Erasure in Text-to-Image Diffusion Models

LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation

MirrorShield: Towards Universal Defense Against Jailbreaks via Entropy-Guided Mirror Crafting

HICD: Hallucination-Inducing via Attention Dispersion for Contrastive Decoding to Mitigate Hallucinations in Large Language Models

RouterEval: A Comprehensive Benchmark for Routing LLMs to Explore Model-level Scaling Up in LLMs

Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More

Cost-Optimal Grouped-Query Attention for Long-Context Modeling

Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation

On the Vulnerability of Concept Erasure in Diffusion Models

Char-mander Use mBackdoor! A Study of Cross-lingual Backdoor Attacks in Multilingual LLMs

SQLong: Enhanced NL2SQL for Longer Contexts with LLMs

DiffSampling: Enhancing Diversity and Accuracy in Neural Text Generation

TreeCut: A Synthetic Unanswerable Math Word Problem Dataset for LLM Hallucination Evaluation

Attention Mechanism for LLM-based Agents Dynamic Diffusion under Information Asymmetry

Robust Adaptation of Large Multimodal Models for Retrieval Augmented Hateful Meme Detection

R2-KG: General-Purpose Dual-Agent Framework for Reliable Reasoning on Knowledge Graphs

DeepResonance: Enhancing Multimodal Music Understanding via Music-centric Multi-way Instruction Tuning

MomentSeeker: A Task-Oriented Benchmark For Long-Video Moment Retrieval

EquiBench: Benchmarking Large Language Models' Understanding of Program Semantics via Equivalence Checking

EssayJudge: A Multi-Granular Benchmark for Assessing Automated Essay Scoring Capabilities of Multimodal Large Language Models

Uncovering Untapped Potential in Sample-Efficient World Model Agents

MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models

CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation

AI-driven Personalized Privacy Assistants: a Systematic Literature Review

Early Risk Prediction of Pediatric Cardiac Arrest from Electronic Health Records via Multimodal Fused Transformer

Online Scheduling for LLM Inference with KV Cache Constraints

Uni-Retrieval: A Multi-Style Retrieval Framework for STEM's Education

Speculative Prefill: Turbocharging TTFT with Lightweight and Training-Free Token Importance Estimation

Latent Action Learning Requires Supervision in the Presence of Distractors

Redefining Machine Unlearning: A Conformal Prediction-Motivated Approach

On the Role of Transformer Feed-Forward Layers in Nonlinear In-Context Learning

People who frequently use ChatGPT for writing tasks are accurate and robust detectors of AI-generated text

NBDI: A Simple and Effective Termination Condition for Skill Extraction from Task-Agnostic Demonstrations

MMDocIR: Benchmarking Multi-Modal Retrieval for Long Documents

Building Symbiotic AI: Reviewing the AI Act for a Human-Centred, Principle-Based Framework

TiEBe: Tracking Language Model Recall of Notable Worldwide Events Through Time

xLSTM-SENet: xLSTM for Single-Channel Speech Enhancement

A Separable Self-attention Inspired by the State Space Model for Computer Vision

Cross-model Transferability among Large Language Models on the Platonic Representations of Concepts

KunServe: Efficient Parameter-centric Memory Management for LLM Serving

Hotspot-Driven Peptide Design via Multi-Fragment Autoregressive Extension

Can LLMs be Good Graph Judge for Knowledge Graph Construction?

RoCoDA: Counterfactual Data Augmentation for Data-Efficient Robot Learning from Demonstrations

Rate, Explain and Cite (REC): Enhanced Explanation and Attribution in Automatic Evaluation by Large Language Models

Knowledge-Guided Prompt Learning for Request Quality Assurance in Public Code Review

Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study

M-RewardBench: Evaluating Reward Models in Multilingual Settings

Unlearning Backdoor Attacks for LLMs with Weak-to-Strong Knowledge Distillation

RATE: Causal Explainability of Reward Models with Imperfect Counterfactuals

Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning

Large Continual Instruction Assistant

TopoTune : A Framework for Generalized Combinatorial Complex Neural Networks

Evaluating the efficacy of LLM Safety Solutions : The Palit Benchmark Dataset

Created by

Haebom

저자

Sayon Palit, Daniel Woods

개요

본 논문은 의료 및 금융과 같은 중요 산업 시스템에 점점 더 많이 통합되고 있는 대규모 언어 모델(LLM)의 보안 위협을 다룹니다. 사용자가 민감한 데이터를 저장하는 내부 데이터베이스에서 정보를 검색하여 응답을 풍부하게 하는 LLM 기반 챗봇에 악성 질의를 제출하여 내부 데이터 유출이나 제3자 피해로 인한 법적 책임 발생 등의 피해를 야기하는 다양한 공격이 가능합니다. 본 연구는 이러한 위협에 대응하기 위해 개발되고 있는 보안 도구들의 효과와 사용성에 대한 공식적인 평가가 부족한 점을 해결하고자 13개의 LLM 보안 도구(9개 독점 소스, 4개 오픈 소스)를 비교 분석했습니다. 7개 도구만 평가되었으며, 악성 프롬프트의 벤치마크 데이터 세트를 구축하여 기준 LLM 모델(ChatGPT-3.5-Turbo)과 비교 평가했습니다. 결과적으로 기준 모델은 허위 긍정이 너무 많아 이 작업에 사용하기에는 적합하지 않은 것으로 나타났으며, Lakera Guard와 ProtectAI LLM Guard가 사용성과 성능 간의 균형을 보여주는 최고의 도구로 나타났습니다. 마지막으로, 독점 소스 제공업체의 투명성 증대, 상황 인식 탐지 개선, 오픈 소스 참여 증진, 사용자 인식 제고 및 더욱 대표적인 성능 지표 채택을 권장했습니다.

시사점, 한계점

•

시사점:

◦

LLM 기반 시스템의 보안 위협에 대한 체계적인 평가 및 분석 제공

◦

LLM 보안 도구의 성능 및 사용성 비교 분석을 통해 효과적인 도구 식별 (Lakera Guard, ProtectAI LLM Guard)

◦

LLM 보안 강화를 위한 구체적인 권고안 제시 (투명성 증대, 상황 인식 탐지 개선 등)

•

한계점:

◦

독점 소스 모델 소유자의 참여 부족으로 인한 제한된 도구 평가 (13개 중 7개만 평가)

◦

기준 LLM 모델의 높은 허위 긍정률로 인한 평가의 정확성 저하 가능성

◦

평가에 사용된 악성 프롬프트 데이터셋의 일반화 가능성에 대한 추가 검토 필요

Made with Slashpage