Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

AI/ML in 3GPP 5G Advanced -- Services and Architecture

CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation

Dynamic Correction of Erroneous State Estimates via Diffusion Bayesian Exploration

Concept-Guided Backdoor Attack on Vision Language Models

On the Holographic Geometry of Deterministic Computation

Decoding inner speech with an end-to-end brain-to-text neural interface

Physics-informed Neural Operator Learning for Nonlinear Grad-Shafranov Equation

MindEval: Benchmarking Language Models on Multi-turn Mental Health Support

LAET: A Layer-wise Adaptive Ensemble Tuning Framework for Pretrained Language Models

FAST-CAD: A Fairness-Aware Framework for Non-Contact Stroke Diagnosis

Designing LLM-based Multi-Agent Systems for Software Engineering Tasks: Quality Attributes, Design Patterns and Rationale

SONIC: Supersizing Motion Tracking for Natural Humanoid Whole-Body Control

MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling

Data-Augmented Deep Learning for Downhole Depth Sensing and Field Validation

Chinese Discharge Drug Recommendation in Metabolic Diseases with Large Language Models

Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability

Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting

Reinforce-Ada: An Adaptive Sampling Framework under Non-linear RL Objectives

TempoControl: Temporal Attention Guidance for Text-to-Video Models

AortaDiff: A Unified Multitask Diffusion Framework For Contrast-Free AAA Imaging

SoREX: Towards Self-Explainable Social Recommendation with Relevant Ego-Path Extraction

The AI Productivity Index (APEX)

Uncovering Grounding IDs: How External Cues Shape Multimodal Binding

Pushing Toward the Simplex Vertices: A Simple Remedy for Code Collapse in Smoothed Vector Quantization

V-CECE: Visual Counterfactual Explanations via Conceptual Edits

Momentum-constrained Hybrid Heuristic Trajectory Optimization Framework with Residual-enhanced DRL for Visually Impaired Scenarios

HARP: Hallucination Detection via Reasoning Subspace Projection

Rethinking Sparse Autoencoders: Select-and-Project for Fairness and Control from Encoder Features Alone

IPA: An Information-Reconstructive Input Projection Framework for Efficient Foundation Model Adaptation

Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders

ORFuzz: Fuzzing the "Other Side" of LLM Safety -- Testing Over-Refusal

Retro-Expert: Collaborative Reasoning for Interpretable Retrosynthesis

A Survey on Diffusion Language Models

CodeNER: Code Prompting for Named Entity Recognition

ReSem3D: Refinable 3D Spatial Constraints via Fine-Grained Semantic Grounding for Generalizable Robotic Manipulation

VERIRAG: A Post-Retrieval Auditing of Scientific Study Summaries

SustainDiffusion: Optimising the Social and Environmental Sustainability of Stable Diffusion Models

LittleBit: Ultra Low-Bit Quantization via Latent Factorization

Real-Time Execution of Action Chunking Flow Policies

AURA: A Diagnostic Framework for Tracking User Satisfaction of Interactive Planning Agents

Enhancing SPARQL Query Rewriting for Complex Ontology Alignments

Exploring Ordinal Bias in Action Recognition for Instructional Videos

Robust Weight Imprinting: Insights from Neural Collapse and Proxy-Based Aggregation

Experiments with Large Language Models on Retrieval-Augmented Generation for Closed-Source Simulation Software

SAT: Dynamic Spatial Aptitude Training for Multimodal Language Models

Large Language Models: An Applied Econometric Framework

Edge-Only Universal Adversarial Attacks in Distributed Learning

Image-Guided Semantic Pseudo-LiDAR Point Generation for 3D Object Detection

Detecting the Future: All-at-Once Event Sequence Forecasting with Horizon Matching

Variational Learning of Gaussian Process Latent Variable Models through Stochastic Gradient Annealed Importance Sampling

SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition

A Scene-aware Models Adaptation Scheme for Cross-scene Online Inference on Mobile Devices

Towards Data-efficient Customer Intent Recognition with Prompt-based Learning Paradigm

GTM: Simulating the World of Tools for AI Agents

Self-Transparency Failures in Expert-Persona LLMs: How Instruction-Following Overrides Honesty

Learning the Value of Value Learning

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

ToolMind Technical Report: A Large-Scale, Reasoning-Enhanced Tool-Use Dataset

Debate over Mixed-knowledge: A Robust Multi-Agent Reasoning Framework for Incomplete Knowledge Graph Question Answering

Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives

Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models

Generalized Parallel Scaling with Interdependent Generations

SOCK: A Benchmark for Measuring Self-Replication in Large Language Models

KNARsack: Teaching Neural Algorithmic Reasoners to Solve Pseudo-Polynomial Problems

IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks

FedIFL: A federated cross-domain diagnostic framework for motor-driven systems with inconsistent fault modes

Enhancing Large Language Models through Neuro-Symbolic Integration and Ontological Reasoning

Rolling in the deep of cognitive and AI biases

Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms

Training-Time Action Conditioning for Efficient Real-Time Chunking

Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity

AQUA-Net: Adaptive Frequency Fusion and Illumination Aware Network for Underwater Image Enhancement

M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG

MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution

Trusted AI Agents in the Cloud

Impugan: Learning Conditional Generative Models for Robust Data Imputation

Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding

Measuring the Effect of Background on Classification and Feature Importance in Deep Learning for AV Perception

World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty

Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures

Neural Coherence : Find higher performance to out-of-distribution tasks from few samples

Sparse Attention Post-Training for Mechanistic Interpretability

Optimizing Medical Question-Answering Systems: A Comparative Study of Fine-Tuned and Zero-Shot Large Language Models with RAG Framework

NEAT: Neighborhood-Guided, Efficient, Autoregressive Set Transformer for 3D Molecular Generation

Phase-OTDR Event Detection Using Image-Based Data Transformation and Deep Learning

Approximation of Box Decomposition Algorithm for Fast Hypervolume-Based Multi-Objective Optimization

Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling

3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering

Mechanistic Interpretability of Antibody Language Models Using SAEs

Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding

Efficient Text Classification with Conformal In-Context Learning

Big Tech-Funded AI Papers Have Higher Citation Impact, Greater Insularity, and Larger Recency Bias

Bayesian Active Inference for Intelligent UAV Anti-Jamming and Adaptive Trajectory Planning

Faithfulness metric fusion: Improving the evaluation of LLM trustworthiness across domains

Retrieving Semantically Similar Decisions under Noisy Institutional Labels: Robust Comparison of Embedding Methods

InverseCrafter: Efficient Video ReCapture as a Latent Domain Inverse Problem

Feasibility of AI-Assisted Programming for End-User Development

Grounded Multilingual Medical Reasoning for Question Answering with Large Language Models

Modular Jets for Supervised Pipelines: Diagnosing Mirage vs Identifiability

A Comprehensive Framework for Automated Quality Control in the Automotive Industry

Trustworthy Reasoning: Evaluating and Enhancing Factual Accuracy in LLM Intermediate Thought Processes

Created by

Haebom

Category

Empty

저자

Rui Jiao, Yue Zhang, Jinku Li

개요

본 논문은 대규모 언어 모델(LLM)의 중간 추론 단계에서 사실적 부정확성이 존재하는 심각한 취약점을 해결하는 새로운 프레임워크를 제시합니다. 올바른 최종 답변에도 불구하고 중간 추론 단계에서의 사실적 오류는 의료, 법률 분석, 과학 연구 등 고위험 분야에서 사용자를 잘못된 결정으로 이끌 수 있는 상당한 위험을 초래합니다. 이 프레임워크는 세 가지 핵심 구성 요소로 통합됩니다. 첫째, 반사실적 증강 데이터로 훈련된 특수 사실 확인 분류기는 추론 체인 내의 미묘한 사실적 불일치를 감지합니다. 둘째, 향상된 GRPO(Group Relative Policy Optimization) 강화 학습 접근 방식은 다차원 보상을 통해 사실성, 일관성 및 구조적 정확성을 균형 있게 조정합니다. 셋째, 추론 과정 중 모델 활성화에서 사실성 개선이 어떻게 나타나는지 조사하는 기계적 해석 가능성 방법을 사용합니다. 다양한 최첨단 모델에 대한 광범위한 평가 결과, Claude-3.7 및 GPT-o1과 같은 주요 모델에서도 추론 사실 정확도가 각각 81.93% 및 82.57%에 불과한 우려스러운 패턴이 드러났습니다. 제시된 접근 방식은 Math-500, AIME-2024, GPQA 등의 어려운 벤치마크에서 성능을 유지하거나 향상시키면서 사실적 견고성을 최대 49.90%까지 향상시킵니다. 또한, 신경 활성화 수준 분석을 통해 사실적 개선이 모델 아키텍처 내에서 추론 경로를 어떻게 재구성하는지에 대한 실행 가능한 통찰력을 제공하여 활성화 유도 최적화를 통해 사실적 견고성을 명시적으로 목표로 하는 미래의 훈련 방법론에 대한 기반을 마련합니다.

시사점, 한계점

•

시사점:

◦

LLM의 사실적 오류 문제에 대한 새로운 해결책 제시

◦

사실 확인 분류기, GRPO 강화 학습, 기계적 해석 가능성 방법의 통합적 접근

◦

최첨단 LLM에서도 상당한 사실적 오류 존재 확인

◦

사실적 견고성을 크게 향상시키면서 성능 유지 또는 개선

◦

모델 활성화 분석을 통한 향후 훈련 방법론 개선 가능성 제시

•

한계점:

◦

제시된 프레임워크의 일반화 성능 및 다양한 LLM에 대한 적용성 추가 연구 필요

◦

반사실적 증강 데이터 생성 및 품질 관리에 대한 자세한 설명 부족

◦

GRPO 강화 학습의 구체적인 매개변수 및 최적화 전략에 대한 상세한 정보 부족

◦

기계적 해석 가능성 분석 결과의 해석 및 한계에 대한 심층적인 논의 부족

Made with Slashpage