Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

Individual utilities of life satisfaction reveal inequality aversion unrelated to political alignment

DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge

Moment- and Power-Spectrum-Based Gaussianity Regularization for Text-to-Image Models

Computational Concept of the Psyche (in Russian)

MachineLearningLM: Scaling Many-shot In-context Learning via Continued Pretraining

The Efficiency Frontier: Classical Shadows versus Quantum Footage

BranchGRPO: Stable and Efficient GRPO with Structured Branching in Diffusion Models

Uncertainty Quantification in Probabilistic Machine Learning Models: Theory, Methods, and Insights

CURE: Controlled Unlearning for Robust Embeddings - Mitigating Conceptual Shortcuts in Pre-Trained Language Models

Revealing Hidden Precursors to Earthquakes via a Stress-Sensitive Transformation of Seismic Noise

A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code

Subjective Behaviors and Preferences in LLM: Language of Browsing

Beyond Ten Turns: Unlocking Long-Horizon Agentic Search with Large-Scale Asynchronous RL

SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion

Self-Questioning Language Models

MetaExplainer: A Framework to Generate Multi-Type User-Centered Explanations for AI Systems

How Should We Meta-Learn Reinforcement Learning Algorithms?

Comprehensive Evaluation of Prototype Neural Networks

HIRAG: Hierarchical-Thought Instruction-Tuning Retrieval-Augmented Generation

CyberRAG: An Agentic RAG cyber attack classification and reporting tool

Multi-Timescale Hierarchical Reinforcement Learning for Unified Behavior and Control of Autonomous Driving

A Nonlinear Low-rank Representation Model with Convolutional Neural Network for Imputing Water Quality Data

VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents

Discrete Diffusion in Large Language and Multimodal Models: A Survey

From Static to Adaptive Defense: Federated Multi-Agent Deep Reinforcement Learning-Driven Moving Target Defense Against DoS Attacks in UAV Swarm Networks

How Far Are We from Optimal Reasoning Efficiency?

Whose Name Comes Up? Auditing LLM-Based Scholar Recommendations

Stopping Criteria for Value Iteration on Concurrent Stochastic Reachability and Safety Games

Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors

Prior Prompt Engineering for Reinforcement Fine-Tuning

Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features

CoT-RAG: Integrating Chain of Thought and Retrieval-Augmented Generation to Enhance Reasoning in Large Language Models

TransitReID: Transit OD Data Collection with Occlusion-Resistant Dynamic Passenger Re-Identification

TerraMind: Large-Scale Generative Multimodality for Earth Observation

Recursive Training Loops in LLMs: How training data properties modulate distribution shift in generated data?

Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation

A decision-theoretic approach to dealing with uncertainty in quantum mechanics

VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making

LED: LLM Enhanced Open-Vocabulary Object Detection without Human Curated Data Generation

Reangle-A-Video: 4D Video Generation as Video-to-Video Translation

To See a World in a Spark of Neuron: Disentangling Multi-task Interference for Training-free Model Merging

UAR-NVC: A Unified AutoRegressive Framework for Memory-Efficient Neural Video Compression

MPO: Boosting LLM Agents with Meta Plan Optimization

Pay Attention to Real World Perturbations! Natural Robustness Evaluation in Machine Reading Comprehension

A general language model for peptide identification

Beyond Seen Data: Improving KBQA Generalization Through Schema-Guided Logical Form Generation

CoAT: Chain-of-Associated-Thoughts Framework for Enhancing Large Language Models Reasoning

Mind the Value-Action Gap: Do LLMs Act in Alignment with Their Values?

Traffic-Rule-Compliant Trajectory Repair via Satisfiability Modulo Theories and Reachability Analysis

QR-VC: Leveraging Quantization Residuals for Linear Disentanglement in Zero-Shot Voice Conversion

Generative AI for Data Augmentation in Wireless Networks: Analysis, Applications, and Case Study

Neural-Enhanced Dynamic Range Compression Inversion: A Hybrid Approach for Restoring Audio Dynamics

The Quest for the Right Mediator: Surveying Mechanistic Interpretability Through the Lens of Causal Mediation Analysis

PriorCLIP: Visual Prior Guided Vision-Language Model for Remote Sensing Image-Text Retrieval

A Transformer approach for Electricity Price Forecasting

FedComLoc: Communication-Efficient Distributed Training of Sparse and Quantized Models

PQMass: Probabilistic Assessment of the Quality of Generative Models using Probability Mass Estimation

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Towards explainable decision support using hybrid neural models for logistic terminal automation

BlendedNet: A Blended Wing Body Aircraft Dataset and Surrogate Model for Aerodynamic Predictions

That's So FETCH: Fashioning Ensemble Techniques for LLM Classification in Civil Legal Intake and Referral

Murphys Laws of AI Alignment: Why the Gap Always Wins

Adaptive Monitoring and Real-World Evaluation of Agentic AI Systems

Bridging the Gap in Ophthalmic AI: MM-Retinal-Reason Dataset and OphthaReason Model toward Dynamic Multimodal Reasoning

Understanding visual attention beehind bee-inspired UAV navigation

Working with AI: Measuring the Applicability of Generative AI to Occupations

Scaling LLM Planning: NL2FLOW for Parametric Problem Generation and Rigorous Evaluation

Context-Driven Knowledge Graph Completion with Semantic-Aware Relational Message Passing

Meta-Semantics Augmented Few-Shot Relational Learning

Perovskite-LLM: Knowledge-Enhanced Large Language Models for Perovskite Solar Cell Research

Associative Knowledge Graphs for Efficient Sequence Storage and Retrieval

Depth-Bounded Epistemic Planning

A Survey of Reinforcement Learning for Large Reasoning Models

Large Language Model Hacking: Quantifying the Hidden Risks of Using LLMs for Text Annotation

QCardEst/QCardCorr: Quantum Cardinality Estimation and Correction

Merge-of-Thought Distillation

MoVoC: Morphology-Aware Subword Construction for Geez Script Languages

Scaling Truth: The Confidence Paradox in AI Fact-Checking

PianoVAM: A Multimodal Piano Performance Dataset

An End-to-End Deep Learning Framework for Arsenicosis Diagnosis Using Mobile-Captured Skin Images

Using AI to Optimize Patient Transfer and Resource Utilization During Mass-Casualty Incidents: A Simulation Platform

AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning

Learning Turbulent Flows with Generative Models: Super-resolution, Forecasting, and Sparse Flow Reconstruction

FinZero: Launching Multi-modal Financial Time Series Forecast with Large Reasoning Model

DEQuify your force field: More efficient simulations using deep equilibrium models

X-Teaming Evolutionary M2S: Automated Discovery of Multi-turn to Single-turn Jailbreak Templates

Explainability of CNN Based Classification Models for Acoustic Signal

TANGO: Traversability-Aware Navigation with Local Metric Control for Topological Goals

A layered architecture for log analysis in complex IT systems

Reshaping the Forward-Forward Algorithm with a Similarity-Based Objective

Skeleton-based sign language recognition using a dual-stream spatio-temporal dynamic graph convolutional network

Robust Belief-State Policy Learning for Quantum Network Routing Under Decoherence and Time-Varying Conditions

Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations

RoentMod: A Synthetic Chest X-Ray Modification Model to Identify and Correct Image Interpretation Model Shortcuts

UOPSL: Unpaired OCT Predilection Sites Learning for Fundus Image Diagnosis Augmentation

OTESGN:Optimal Transport Enhanced Syntactic-Semantic Graph Networks for Aspect-Based Sentiment Analysis

Classification of 24-hour movement behaviors from wrist-worn accelerometer data: from handcrafted features to deep learning techniques

Memorization in Large Language Models in Medicine: Prevalence, Characteristics, and Implications

Interpretability as Alignment: Making Internal Understanding a Design Principle

MESH -- Understanding Videos Like Human: Measuring Hallucinations in Large Video Models

Recursive Training Loops in LLMs: How training data properties modulate distribution shift in generated data?

Created by

Haebom

저자

Grgur Kova\v{c}, Jeremy Perez, Remy Portelas, Peter Ford Dominey, Pierre-Yves Oudeyer

개요

본 논문은 대규모 언어 모델(LLM)이 생성한 합성 데이터로 LLM을 반복적으로 학습시키는 과정에서 발생하는 분포 이동(model collapse) 현상에 대해 연구합니다. 특히, 인간 데이터의 특성이 이러한 분포 이동에 미치는 영향을 실증적으로 분석합니다. 다양한 인간 데이터셋을 사용하여 반복 학습을 진행하고, 데이터셋 특성 조작과 회귀 분석을 통해 분포 이동의 크기를 예측하는 데이터 특성들을 밝힙니다. 결과적으로 어휘 다양성은 분포 이동을 증폭시키고, 의미 다양성과 데이터 품질은 분포 이동을 완화시킨다는 것을 발견했습니다. 또한, 이러한 영향은 모듈화되어 있어 특정 인터넷 도메인에서 수집된 데이터는 다른 도메인의 콘텐츠 생성에는 거의 영향을 미치지 않는다는 것을 밝혔습니다. 마지막으로, 정치적 편향에 대한 실험을 통해 인간 데이터 특성이 초기 편향을 증폭시키거나 감소시키는지에 영향을 미친다는 것을 보여줍니다. 결론적으로, 인터넷의 서로 다른 부분이 서로 다른 유형의 분포 이동을 겪을 수 있다는 새로운 관점을 제시합니다.

시사점, 한계점

•

시사점:

◦

LLM의 반복 학습 과정에서 발생하는 분포 이동의 크기를 예측하는 데이터 특성(어휘 다양성, 의미 다양성, 데이터 품질)을 규명함.

◦

인터넷 데이터의 도메인 특성이 LLM의 콘텐츠 생성에 미치는 영향의 모듈성을 제시함.

◦

인간 데이터의 특성이 LLM의 정치적 편향에 미치는 영향을 분석함.

◦

인터넷의 다양한 영역에서 발생하는 분포 이동의 다양성을 보여줌.

•

한계점:

◦

분석에 사용된 데이터셋과 특성의 종류 및 범위에 대한 제한.

◦

분포 이동의 정량적 측정 및 예측 모델의 일반화 가능성에 대한 추가 연구 필요.

◦

다양한 LLM 아키텍처 및 학습 방법론에 대한 일반화 가능성 검증 필요.

◦

특정 도메인의 영향이 다른 도메인에 미치지 않는다는 모듈성의 범위와 한계에 대한 추가 연구 필요.

Made with Slashpage