Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

Something's Fishy In The Data Lake: A Critical Re-evaluation of Table Union Search Benchmarks

Breaking the Ceiling: Exploring the Potential of Jailbreak Attacks through Expanding Strategy Space

SageAttention2++: A More Efficient Implementation of SageAttention2

FCKT: Fine-Grained Cross-Task Knowledge Transfer with Semantic Contrastive Learning for Targeted Sentiment Analysis

Towards Conversational Development Environments: Using Theory-of-Mind and Multi-Agent Architectures for Requirements Refinement

Cooperation of Experts: Fusing Heterogeneous Information with Large Margin

RSCF: Relation-Semantics Consistent Filter for Entity Embedding of Knowledge Graph

CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models

VLM Can Be a Good Assistant: Enhancing Embodied Visual Tracking with Self-Improving Vision-Language Models

In-context Language Learning for Endangered Languages in Speech Recognition

OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation

Inference-time Alignment in Continuous Space

Incentivizing Strong Reasoning from Weak Supervision

JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning

A Comprehensive Real-World Assessment of Audio Watermarking Algorithms: Will They Survive Neural Codecs?

AgentRecBench: Benchmarking LLM Agent-based Personalized Recommender Systems

Towards Large Reasoning Models for Agriculture

VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use

OptiMindTune: A Multi-Agent Framework for Intelligent Hyperparameter Optimization

FastMamba: A High-Speed and Efficient Mamba Accelerator on FPGA with Accurate Quantization

Moderating Harm: Benchmarking Large Language Models for Cyberbullying Detection in YouTube Comments

ALPS: Attention Localization and Pruning Strategy for Efficient Alignment of Large Language Models

Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding

Towards Practical Defect-Focused Automated Code Review

EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion

PAEFF: Precise Alignment and Enhanced Gated Feature Fusion for Face-Voice Association

Text Generation Beyond Discrete Token Sampling

AKRMap: Adaptive Kernel Regression for Trustworthy Visualization of Cross-Modal Embeddings

Information Science Principles of Machine Learning: A Causal Chain Meta-Framework Based on Formalized Information Mapping

Advancing Sequential Numerical Prediction in Autoregressive Models

Towards Visuospatial Cognition via Hierarchical Fusion of Visual Experts

Visuospatial Cognitive Assistant

Exploring Criteria of Loss Reweighting to Enhance LLM Unlearning

Search and Refine During Think: Autonomous Retrieval-Augmented Reasoning of LLMs

Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL

LibIQ: Toward Real-Time Spectrum Classification in O-RAN dApps

Continuous Thought Machines

Robust Localization, Mapping, and Navigation for Quadruped Robots

Addressing Concept Mislabeling in Concept Bottleneck Models Through Preference Optimization

Automating tumor-infiltrating lymphocyte assessment in breast cancer histopathology images using QuPath: a transparent and accessible machine learning pipeline

Progressive Language-guided Visual Learning for Multi-Task Visual Grounding

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Generative Framework for Personalized Persuasion: Inferring Causal, Counterfactual, and Latent Knowledge

Adjoint Sampling: Highly Scalable Diffusion Samplers via Adjoint Matching

Layers at Similar Depths Generate Similar Activations Across LLM Architectures

A Knowledge-guided Adversarial Defense for Resisting Malicious Visual Manipulation

SynWorld: Virtual Scenario Synthesis for Agentic Action Knowledge Refinement

Token embeddings violate the manifold hypothesis

HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment

Improving User Behavior Prediction: Leveraging Annotator Metadata in Supervised Machine Learning Models

Experience Retrieval-Augmentation with Electronic Health Records Enables Accurate Discharge QA

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Language-Enhanced Representation Learning for Single-Cell Transcriptomics

TS-RAG: Retrieval-Augmented Generation based Time Series Foundation Models are Stronger Zero-Shot Forecaster

When Trust Collides: Decoding Human-LLM Cooperation Dynamics through the Prisoner's Dilemma

WISE: A World Knowledge-Informed Semantic Evaluation for Text-to-Image Generation

Optimal Output Feedback Learning Control for Discrete-Time Linear Quadratic Regulation

Wanda++: Pruning Large Language Models via Regional Gradients

LINGOLY-TOO: Disentangling Reasoning from Knowledge with Templatised Orthographic Obfuscation

Deficient Excitation in Parameter Learning

Interpreting CLIP with Hierarchical Sparse Autoencoders

BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life Prediction

Self-Taught Agentic Long Context Understanding

ReQFlow: Rectified Quaternion Flow for Efficient and High-Quality Protein Backbone Generation

How Do LLMs Perform Two-Hop Reasoning in Context?

ThinkGuard: Deliberative Slow Thinking Leads to Cautious Guardrails

MUDDFormer: Breaking Residual Bottlenecks in Transformers via Multiway Dynamic Dense Connections

ReLearn: Unlearning via Learning for Large Language Models

Revisiting Weak-to-Strong Generalization in Theory and Practice: Reverse KL vs. Forward KL

A Physics-Informed Machine Learning Framework for Safe and Optimal Control of Autonomous Systems

Non-Markovian Discrete Diffusion with Causal Language Models

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

TransMLA: Migrating GQA Models to MLA with Full DeepSeek Compatibility and Speedup

UniDB: A Unified Diffusion Bridge Framework via Stochastic Optimal Control

Beyond External Monitors: Enhancing Transparency of Large Language Models for Easier Monitoring

ExpProof : Operationalizing Explanations for Confidential Models with ZKPs

Advancing Reasoning in Large Language Models: Promising Methods and Approaches

Path Planning for Masked Diffusion Model Sampling

LV-XAttn: Distributed Cross-Attention for Long Visual Inputs in Multimodal Large Language Models

Robust LLM Alignment via Distributionally Robust Direct Preference Optimization

Improving Rule-based Reasoning in LLMs using Neurosymbolic Representations

Message-Passing GNNs Fail to Approximate Sparse Triangular Factorizations

LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently

Distribution-aware Fairness Learning in Medical Image Segmentation From A Control-Theoretic Perspective

A Checks-and-Balances Framework for Context-Aware Ethical AI Alignment

Risk-Informed Diffusion Transformer for Long-Tail Trajectory Prediction in the Crash Scenario

Can Large Language Models Be Trusted as Evolutionary Optimizers for Network-Structured Combinatorial Problems?

Redundancy Principles for MLLMs Benchmarks

K-COMP: Retrieval-Augmented Medical Domain Question Answering With Knowledge-Injected Compressor

An Innovative Data-Driven and Adaptive Reinforcement Learning Approach for Context-Aware Prescriptive Process Monitoring

Diffusion Adversarial Post-Training for One-Step Video Generation

Gender-Neutral Large Language Models for Medical Applications: Reducing Bias in PubMed Abstracts

PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models

Constraint-Adaptive Policy Switching for Offline Safe Reinforcement Learning

Revisiting In-Context Learning with Long Context Language Models

Energy and polarization based on-line interference mitigation in radio interferometry

How to Synthesize Text Data without Model Collapse?

Preference Adaptive and Sequential Text-to-Image Generation

Sample Efficient Robot Learning in Supervised Effect Prediction Tasks

Federated Continual Graph Learning

How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation

Created by

Haebom

저자

Ruohao Guo, Wei Xu, Alan Ritter

개요

본 논문은 대규모 언어 모델(LLM)이 다양한 상황에서 널리 배포됨에 따라 암묵적으로 잘못된 정보를 확산시킬 수 있는 정도가 중요한 안전 문제로 부상하고 있음을 다룹니다. 기존 연구는 명시적인 허위 진술에 대한 LLM의 평가에 주로 초점을 맞추었지만, 실제 상호 작용에서 잘못된 정보가 어떻게 암묵적인 전제로 미묘하게 나타나는지에 대해서는 간과했습니다. 본 연구는 암묵적인 잘못된 정보에 대한 첫 번째 종합적인 벤치마크인 EchoMist를 제시합니다. EchoMist는 실제 사람-AI 대화 및 소셜 미디어 상호 작용을 포함한 다양한 출처에서 유통되고, 해롭고, 끊임없이 진화하는 암묵적인 잘못된 정보를 대상으로 합니다. 15개의 최첨단 LLM에 대한 광범위한 실증 연구를 통해 현재 모델이 이 작업에서 놀라울 정도로 성능이 저조하며, 종종 잘못된 전제를 감지하지 못하고 반사실적인 설명을 생성하는 것을 발견했습니다. 또한 LLM의 암묵적인 잘못된 정보에 대응하는 능력을 향상시키기 위한 두 가지 완화 방법(Self-Alert 및 RAG)을 조사했습니다. 연구 결과는 EchoMist가 지속적인 과제임을 나타내며 암묵적인 잘못된 정보의 위험으로부터 보호해야 할 중요한 필요성을 강조합니다.

시사점, 한계점

•

시사점:

◦

암묵적 허위 정보에 대한 LLM의 취약성을 체계적으로 평가하기 위한 새로운 벤치마크인 EchoMist를 제시합니다.

◦

최첨단 LLM들이 암묵적 허위 정보를 효과적으로 감지하고 처리하는 데 어려움을 겪고 있음을 보여줍니다.

◦

암묵적 허위 정보 문제 해결을 위한 Self-Alert 및 RAG와 같은 완화 전략에 대한 추가 연구의 필요성을 강조합니다.

◦

LLM의 안전성 및 신뢰성을 향상시키기 위해 암묵적 허위 정보에 대한 추가 연구가 시급함을 보여줍니다.

•

한계점:

◦

EchoMist 벤치마크의 범위와 일반화 가능성에 대한 추가 연구가 필요합니다.

◦

제시된 완화 전략의 효과는 특정 상황과 모델에 따라 달라질 수 있습니다.

◦

암묵적 허위 정보의 다양한 형태와 복잡성을 완전히 포착하는 데 어려움이 있을 수 있습니다.

◦

더욱 광범위하고 다양한 LLM에 대한 평가가 필요합니다.

Made with Slashpage