Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

CEHR-XGPT: A Scalable Multi-Task Foundation Model for Electronic Health Records

Unveiling the Response of Large Vision-Language Models to Visually Absent Tokens

Adaptive Learning Strategies for Mitotic Figure Classification in MIDOG2025 Challenge

MitoDetect++: A Domain-Robust Pipeline for Mitosis Detection and Atypical Subtyping

Align-Then-stEer: Adapting the Vision-Language Action Models through Unified Latent Guidance

Fantastic Pretraining Optimizers and Where to Find Them

Towards Interpretable Geo-localization: a Concept-Aware Global Image-GPS Alignment Framework

TECP: Token-Entropy Conformal Prediction for LLMs

The Complexity Trap: Simple Observation Masking Is as Efficient as LLM Summarization for Agent Context Management

Train-Once Plan-Anywhere Kinodynamic Motion Planning via Diffusion Trees

Skill-Aligned Fairness in Multi-Agent Learning for Collaboration in Healthcare

Mitigating Hallucinations in LM-Based TTS Models via Distribution Alignment Using GFlowNets

AgentArmor: Enforcing Program Analysis on Agent Runtime Trace to Defend Against Prompt Injection

HuggingGraph: Understanding the Supply Chain of LLM Ecosystem

Food safety trends across Europe: insights from the 392-million-entry CompreHensive European Food Safety (CHEFS) database

Simple Yet Effective: An Information-Theoretic Approach to Multi-LLM Uncertainty Quantification

BayesSDF: Surface-Based Laplacian Uncertainty Estimation for 3D Geometry with Neural Signed Distance Fields

Empowering Bridge Digital Twins by Bridging the Data Gap with a Unified Synthesis Framework

The Features at Convergence Theorem: a first-principles alternative to the Neural Feature Ansatz for how networks learn representations

AI-Assisted Rapid Crystal Structure Generation Towards a Target Local Environment

First Steps Towards Overhearing LLM Agents: A Case Study With Dungeons & Dragons Gameplay

TokUR: Token-Level Uncertainty Estimation for Large Language Model Reasoning

Cutting Through Privacy: A Hyperplane-Based Data Reconstruction Attack in Federated Learning

AutoPDL: Automatic Prompt Optimization for LLM Agents

RailGoerl24: G\"orlitz Rail Test Center CV Dataset 2024

Revealing higher-order neural representations of uncertainty with the Noise Estimation through Reinforcement-based Diffusion (NERD) model

PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models

Spoof Trace Discovery for Deep Learning Based Explainable Face Anti-Spoofing

The Information Security Awareness of Large Language Models

Automatically Detecting Online Deceptive Patterns

HyperAgent: Generalist Software Engineering Agents to Solve Coding Tasks at Scale

Automated detection of underdiagnosed medical conditions via opportunistic imaging

Selective Preference Optimization via Token-Level Reward Function Estimation

ATHAR: A High-Quality and Diverse Dataset for Classical Arabic to English Translation

PersonaGym: Evaluating Persona Agents and LLMs

CFaults: Model-Based Diagnosis for Fault Localization in C Programs with Multiple Test Cases

From Frege to chatGPT: Compositionality in language, cognition, and deep neural networks

AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling

Demystifying Chains, Trees, and Graphs of Thoughts

Survival Analysis with Adversarial Regularization

Net2Brain: A Toolbox to compare artificial vision models with human brain responses

The Personality Illusion: Revealing Dissociation Between Self-Reports & Behavior in LLMs

PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Dynamic Speculative Agent Planning

AI-SearchPlanner: Modular Agentic Search via Pareto-Optimal Multi-Objective Reinforcement Learning

Graph RAG as Human Choice Model: Building a Data-Driven Mobility Agent with Preference Chain

MHSNet:An MoE-based Hierarchical Semantic Representation Network for Accurate Duplicate Resume Detection with Large Language Model

FutureX: An Advanced Live Benchmark for LLM Agents in Future Prediction

MeLA: A Metacognitive LLM-Driven Architecture for Automatic Heuristic Design

Conversational Education at Scale: A Multi-LLM Agent Workflow for Procedural Learning and Pedagogic Quality Assessment

DiMo-GUI: Advancing Test-time Scaling in GUI Grounding via Modality-Aware Visual Reasoning

Don't Make It Up: Preserving Ignorance Awareness in LLM Fine-Tuning

Translating Federated Learning Algorithms in Python into CSP Processes Using ChatGPT

ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding

Epistemic Skills: Reasoning about Knowledge and Oblivion

Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment

GUI Agents: A Survey

Neural Network Verification with PyRAT

Antidote: Post-fine-tuning Safety Alignment for Large Language Models against Harmful Fine-tuning

Low-Dimensional Federated Knowledge Graph Embedding via Knowledge Distillation

MMoE: Robust Spoiler Detection with Multi-modal Information and Domain-aware Mixture-of-Experts

WinT3R: Window-Based Streaming Reconstruction with Camera Token Pool

Crosscoding Through Time: Tracking Emergence & Consolidation Of Linguistic Representations Throughout LLM Pretraining

SpikingBrain Technical Report: Spiking Brain-inspired Large Models

Scaling Performance of Large Language Model Pretraining

Recomposer: Event-roll-guided generative audio editing

COGITAO: A Visual Reasoning Framework To Study Compositionality & Generalization

Uncertain but Useful: Leveraging CNN Variability into Data Augmentation

CURE: Controlled Unlearning for Robust Embeddings -- Mitigating Conceptual Shortcuts in Pre-Trained Language Models

HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models

RapidGNN: Energy and Communication-Efficient Distributed Training on Large-Scale Graph Neural Networks

Enhancing 3D Point Cloud Classification with ModelNet-R and Point-SkipNet

AI Agents for Web Testing: A Case Study in the Wild

Accuracy-Constrained CNN Pruning for Efficient and Reliable EEG-Based Seizure Detection

Exploring Situated Stabilities of a Rhythm Generation System through Variational Cross-Examination

GenAI-based test case generation and execution in SDV platform

ICR: Iterative Clarification and Rewriting for Conversational Search

ToM-SSI: Evaluating Theory of Mind in Situated Social Interactions

Towards Efficient Pixel Labeling for Industrial Anomaly Detection and Localization

Pointing-Guided Target Estimation via Transformer-Based Attention

Adversarial Augmentation and Active Sampling for Robust Cyber Anomaly Detection

LLM Enabled Multi-Agent System for 6G Networks: Framework and Method of Dual-Loop Edge-Terminal Collaboration

High-Resolution Global Land Surface Temperature Retrieval via a Coupled Mechanism-Machine Learning Framework

Exploring an implementation of quantum learning pipeline for support vector machines

DeGuV: Depth-Guided Visual Reinforcement Learning for Generalization and Interpretability in Manipulation

Artificial intelligence for representing and characterizing quantum systems

PLaMo 2 Technical Report

SpiderNets: Estimating Fear Ratings of Spider-Related Images with Vision Models

The Paradox of Doom: Acknowledging Extinction Risk Reduces the Incentive to Prevent It

A Knowledge-Driven Diffusion Policy for End-to-End Autonomous Driving Based on Expert Routing

REMOTE: A Unified Multimodal Relation Extraction Framework with Multilevel Optimal Transport and Mixture-of-Experts

PropVG: End-to-End Proposal-Driven Visual Grounding with Multi-Granularity Discrimination

Exploring Non-Local Spatial-Angular Correlations with a Hybrid Mamba-Transformer Framework for Light Field Super-Resolution

AI-Driven Fronthaul Link Compression in Wireless Communication Systems: Review and Method Design

Toward Accessible Dermatology: Skin Lesion Classification Using Deep Learning Models on Mobile-Acquired Images

Graph Unlearning: Efficient Node Removal in Graph Neural Networks

Enhancing Diversity in Large Language Models via Determinantal Point Processes

VARMA-Enhanced Transformer for Time Series Forecasting

The LLM Has Left The Chat: Evidence of Bail Preferences in Large Language Models

Cold-RL: Learning Cache Eviction with Offline Reinforcement Learning for NGINX

Created by

Haebom

저자

Aayush Gupta, Arpit Bhayani

개요

본 논문은 NGINX와 같은 웹 프록시에서 사용되는 기존의 LRU(Least-Recently-Used) 캐시 교체 정책의 한계를 지적하고, 강화 학습 기반의 새로운 교체 정책인 Cold-RL을 제안한다. Cold-RL은 ONNX 사이드카를 이용하여 듀얼링 DQN(Deep Q-Network)을 구현하여 500 마이크로초 이내의 엄격한 시간 제약 조건 하에서 캐시 교체 결정을 내린다. 캐시 객체의 나이, 크기, 히트 횟수, 도착 간격 시간, 남은 TTL, 마지막 원본 RTT 등 6가지의 경량 특징을 추출하여 교체 대상 객체를 선택하며, 훈련은 NGINX 접근 로그를 재생하여 시뮬레이션 환경에서 수행된다. 실험 결과, Cold-RL은 다양한 캐시 크기에서 기존의 LRU, LFU, 크기 기반, 적응형 LRU 및 하이브리드 기법보다 높은 적중률을 보였다. 특히 작은 캐시 크기(25MB)에서는 146%의 향상을 보였으며, 큰 캐시 크기(400MB)에서는 기존 기법과 유사한 성능을 나타냈다. 추론 과정에서 CPU 오버헤드는 2% 미만이며, 95% 백분위수 지연 시간은 500 마이크로초 이내를 유지한다.

시사점, 한계점

•

시사점:

◦

강화학습을 활용하여 웹 프록시의 캐시 교체 정책을 개선할 수 있음을 보여줌.

◦

엄격한 성능 제약 조건(500 마이크로초) 하에서도 효과적인 강화학습 기반 캐시 교체 정책을 구현 가능함을 입증.

◦

작은 캐시 크기에서 기존 기법 대비 괄목할 만한 성능 향상을 달성.

◦

실제 NGINX에 통합 가능한 강화학습 기반 캐시 교체 정책을 최초로 제시.

•

한계점:

◦

오프라인 학습에 의존하며, 실시간 환경 변화에 대한 적응력은 추가 연구가 필요할 수 있음.

◦

특정 워크로드에 대한 성능 평가 결과이므로, 일반화 가능성에 대한 추가 검증이 필요함.

◦

6가지 특징만 사용하므로, 더욱 풍부한 특징을 사용하면 성능이 더 향상될 가능성이 있음.

◦

다양한 실제 환경에서의 테스트가 추가적으로 필요함.

Made with Slashpage