Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

Maximizing Confidence Alone Improves Reasoning

Pre-training for Recommendation Unlearning

FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control

On the performance of machine-learning-assisted Monte Carlo in sampling from simple statistical physics models

Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems

Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems

Train with Perturbation, Infer after Merging: A Two-Stage Framework for Continual Learning

SplitLoRA: Balancing Stability and Plasticity in Continual Learning Through Gradient Space Splitting

Skywork Open Reasoner 1 Technical Report

Speculative Decoding Meets Quantization: Compatibility Evaluation and Hierarchical Framework Design

DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation

Cross-modal RAG: Sub-dimensional Retrieval-Augmented Text-to-Image Generation

CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation

Scaling Up Liquid-Resistance Liquid-Capacitance Networks for Efficient Sequence Modeling

Is Attention Required for Transformer Inference? Explore Function-preserving Attention Replacement

VietASR: Achieving Industry-level Vietnamese ASR with 50-hour labeled data and Large-Scale Speech Pretraining

Hume: Introducing System-2 Thinking in Visual-Language-Action Model

DeSocial: Blockchain-based Decentralized Social Networks

Subgroups Matter for Robust Bias Mitigation

Hybrid Disagreement-Diversity Active Learning for Bioacoustic Sound Event Detection

Automatic Transmission for LLM Tiers: Optimizing Cost and Accuracy in Large Language Models

The challenge of hidden gifts in multi-agent reinforcement learning

Retrieval Visual Contrastive Decoding to Mitigate Object Hallucinations in Large Vision-Language Models

Risk-aware Direct Preference Optimization under Nested Risk Measure

Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network

A Novel Zero-Trust Identity Framework for Agentic AI: Decentralized Authentication and Fine-Grained Access Control

BroadGen: A Framework for Generating Effective and Efficient Advertiser Broad Match Keyphrase Recommendations

Reality Check: A New Evaluation Ecosystem Is Necessary to Understand AI's Real World Effects

How We Won the ISLES'24 Challenge by Preprocessing

SP2RINT: Spatially-Decoupled Physics-Inspired Progressive Inverse Optimization for Scalable, PDE-Constrained Meta-Optical Neural Network Training

Seek-CAD: A Self-refined Generative Modeling for 3D Parametric CAD Using Local Inference via DeepSeek

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

OrionBench: A Benchmark for Chart and Human-Recognizable Object Detection in Infographics

EarthSE: A Benchmark Evaluating Earth Scientific Exploration Capability for Large Language Models

CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark

Beyond Face Swapping: A Diffusion-Based Digital Human Benchmark for Multimodal Deepfake Detection

Edge-First Language Model Inference: Models, Metrics, and Tradeoffs

Smaller, Smarter, Closer: The Edge of Collaborative Generative AI

YESciEval: Robust LLM-as-a-Judge for Scientific Question Answering

DiagnosisArena: Benchmarking Diagnostic Reasoning for Large Language Models

Articulatory Feature Prediction from Surface EMG during Speech Production

Exploring Spatiotemporal Emotional Synchrony in Dyadic Interactions: The Role of Speech Conditions in Facial and Vocal Affective Alignment

LEXam: Benchmarking Legal Reasoning on 340 Law Exams

Neural Networks as Universal Finite-State Machines: A Constructive Deterministic Finite Automaton Theory

The Geometry of ReLU Networks through the ReLU Transition Graph

RepCali: High Efficient Fine-tuning Via Representation Calibration in Latent Space for Pre-trained Language Models

Fusing Bidirectional Chains of Thought and Reward Mechanisms A Method for Enhancing Question-Answering Capabilities of Large Language Models for Chinese Intangible Cultural Heritage

Multimodal Survival Modeling in the Age of Foundation Models

Burger: Robust Graph Denoising-augmentation Fusion and Multi-semantic Modeling in Social Recommendation

The Aloe Family Recipe for Open and Specialized Healthcare LLMs

To Judge or not to Judge: Using LLM Judgements for Advertiser Keyphrase Relevance at eBay

LENSLLM: Unveiling Fine-Tuning Dynamics for LLM Selection

CoordField: Coordination Field for Agentic UAV Task Allocation In Low-altitude Urban Scenarios

Learning to Reason under Off-Policy Guidance

A Combinatorial Theory of Dropout: Subnetworks, Graph Geometry, and Generalization

Error Broadcast and Decorrelation as a Potential Artificial and Natural Learning Mechanism

Carbon-Efficient 3D DNN Acceleration: Optimizing Performance and Sustainability

Agentic Knowledgeable Self-awareness

Are Domain Generalization Benchmarks with Accuracy on the Line Misspecified?

MiZero: The Shadowy Defender Against Text Style Infringements

Firm or Fickle? Evaluating Large Language Models Consistency in Sequential Interactions

LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty

Temporal Relation Extraction in Clinical Texts: A Span-based Graph Transformer Approach

LEAVS: An LLM-based Labeler for Abdominal CT Supervision

From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Calibration

Enhancing Retrieval for ESGLLM via ESG-CID -- A Disclosure Content Index Finetuning Dataset for Mapping GRI and ESRS

DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation

Aligning Text to Image in Diffusion Models is Easier Than You Think

Can LLMs Reason About Program Semantics? A Comprehensive Evaluation of LLMs on Formal Specification Inference

BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life Prediction

Bridging Critical Gaps in Convergent Learning: How Representational Alignment Evolves Across Layers, Training, and Distribution Shifts

ExpandR: Teaching Dense Retrievers Beyond Queries with LLM Guidance

Audio Visual Segmentation Through Text Embeddings

Privacy-Aware Joint DNN Model Deployment and Partitioning Optimization for Collaborative Edge Inference Services

Learning to Reason from Feedback at Test-Time

ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation

STeCa: Step-level Trajectory Calibration for LLM Agent Learning

GSQ-Tuning: Group-Shared Exponents Integer in Fully Quantized Training for LLMs On-Device Fine-tuning

Enhancing Semi-supervised Learning with Zero-shot Pseudolabels

DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing

Multilingual Encoder Knows more than You Realize: Shared Weights Pretraining for Extremely Low-Resource Languages

Jailbreaking to Jailbreak

Behavior-Regularized Diffusion Policy Optimization for Offline Reinforcement Learning

Position: Scaling LLM Agents Requires Asymptotic Analysis with LLM Primitives

CodeSteer: Symbolic-Augmented Language Models via Code/Text Guidance

Toward universal steering and monitoring of AI models

SPRI: Aligning Large Language Models with Context-Situated Principles

MedRAX: Medical Reasoning Agent for Chest X-ray

Adaptive Exploration for Multi-Reward Multi-Policy Evaluation

Wake-Informed 3D Path Planning for Autonomous Underwater Vehicles Using A* and Neural Network Approximations

Fast Large Language Model Collaborative Decoding via Speculation

A Statistical Learning Perspective on Semi-dual Adversarial Neural Optimal Transport Solvers

Joint Localization and Activation Editing for Low-Resource Fine-Tuning

KBQA-o1: Agentic Knowledge Base Question Answering with Monte Carlo Tree Search

Towards Unified Attribution in Explainable AI, Data-Centric AI, and Mechanistic Interpretability

Chain of Grounded Objectives: Bridging Process and Goal-oriented Prompting for Code Generation

Re-ranking Using Large Language Models for Mitigating Exposure to Harmful Content on Social Media Platforms

Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment

Tensor Product Attention Is All You Need

VideoRAG: Retrieval-Augmented Generation over Video Corpus

Agent-UniRAG: A Trainable Open-Source LLM Agent Framework for Unified Retrieval-Augmented Generation Systems

Created by

Haebom

저자

Hoang Pham, Thuy-Duong Nguyen, Khac-Hoai Nam Bui

개요

본 논문은 최근 등장한 대규모 언어 모델(LLM) 에이전트 개념을 활용하여 통합된 검색 증강 생성(RAG) 시스템에 대한 새로운 접근 방식을 제시합니다. 특히, LLM을 기본 제어기로 사용하는 Agent LLM은 특히 복잡한 추론 질의응답 시스템(예: 다단계 쿼리)의 경우 RAG 작업의 해석성을 가능하게 하는 유망한 접근 방식이 되었습니다. 그러나 이전 연구는 주로 단일 홉 또는 다단계 접근 방식을 별도로 사용하여 RAG 시스템을 해결하는 데 중점을 두었으며, 이는 해당 접근 방식의 실제 응용 프로그램에 대한 적용을 제한합니다. 본 연구에서는 통합된 검색 증강 LLM 시스템을 위한 학습 가능한 에이전트 프레임워크인 Agent-UniRAG를 제안하여 RAG 시스템의 효율성과 해석성을 향상시킵니다. 주요 아이디어는 입력의 복잡성에 따라 단계별로 RAG 작업을 해결하는 LLM 에이전트 프레임워크를 설계하여 단일 홉 및 다단계 쿼리를 동시에 종단 간 방식으로 포함하는 것입니다. 또한, 제안된 에이전트 프레임워크를 소규모 오픈소스 LLM(예: Llama-3-8B)에 적용할 수 있도록 합성 데이터 세트인 SynAgent-RAG를 도입합니다. 결과는 다양한 RAG 벤치마크에서 폐쇄형 소스 및 대규모 오픈소스 LLM과 비교할 만한 성능을 보여줍니다. 저희의 소스 코드와 데이터 세트는 추가 활용을 위해 공개적으로 제공됩니다.

시사점, 한계점

•

시사점:

◦

LLM 에이전트를 활용한 통합 RAG 시스템 구축으로 단일 홉과 다단계 쿼리 모두 처리 가능

◦

Agent-UniRAG 프레임워크를 통해 RAG 시스템의 효율성 및 해석성 향상

◦

소규모 오픈소스 LLM에도 적용 가능한 SynAgent-RAG 데이터셋 제공

◦

소스 코드와 데이터셋 공개를 통한 추가 연구 활성화

•

한계점:

◦

SynAgent-RAG 데이터셋의 일반화 성능에 대한 추가 검증 필요

◦

다양한 유형의 복잡한 쿼리에 대한 로버스트성 평가 필요

◦

Agent-UniRAG의 계산 비용 및 확장성에 대한 추가 연구 필요

Made with Slashpage