Daily Arxiv

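This page curates artificial intelligence papers published around the world.
Summaries on this page are generated with Google Gemini, and the page is run on a non-profit basis.
Copyright for each paper belongs to its authors and their institutions; when sharing, simply cite the source.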

Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models

ManiAgent: An Agentic Framework for General Robotic Manipulation

AndesVL Technical Report: An Efficient Mobile-side Multimodal Large Language Model

ParsVoice: A Large-Scale Multi-Speaker Persian Speech Corpus for Text-to-Speech Synthesis

Optimally Deep Networks -- Adapting Model Depth to Datasets for Superior Efficiency

BrowserAgent: Building Web Agents with Human-Inspired Web Browsing Actions

AGENTIQL: An Agent-Inspired Multi-Expert Framework for Text-to-SQL Generation

Towards Safe Maneuvering of Double-Ackermann-Steering Robots with a Soft Actor-Critic Framework

The Algorithmic Regulator

HccePose(BF): Predicting Front & Back Surfaces to Construct Ultra-Dense 2D-3D Correspondences for Pose Estimation

ICL-Router: In-Context Learned Model Representations for LLM Routing

CrisiText: A dataset of warning messages for LLM training in emergency communication

GTCN-G: A Residual Graph-Temporal Fusion Network for Imbalanced Intrusion Detection (Preprint)

Scalable Policy-Based RL Algorithms for POMDPs

OptiFLIDS: Optimized Federated Learning for Energy-Efficient Intrusion Detection in IoT

Malice in Agentland: Down the Rabbit Hole of Backdoors in the AI Supply Chain

AgentBuilder: Exploring Scaffolds for Prototyping User Experiences of Interface Agents

Neon: Negative Extrapolation From Self-Training Improves Image Generation

Triplet-Structured Knowledge Integration for Multi-Turn Medical Reasoning

General Exploratory Bonus for Optimistic Exploration in RLHF

PolySim: Bridging the Sim-to-Real Gap for Humanoid Control via Multi-Simulator Dynamics Randomization

Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning

Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

Understanding Language Prior of LVLMs by Contrasting Chain-of-Embedding

Large language models management of medications: three performance analyses

Responsible AI Technical Report

A Longitudinal Randomized Control Study of Companion Chatbot Use: Anthropomorphism and Its Mediating Role on Social Impacts

SPiDR: A Simple Approach for Zero-Shot Safety in Sim-to-Real Transfer

TISDiSS: A Training-Time and Inference-Time Scalable Framework for Discriminative Source Separation

Efficient and Versatile Model for Multilingual Information Retrieval of Islamic Text: Development and Deployment in Real-World Scenarios

StegOT: Trade-offs in Steganography via Optimal Transport

General Demographic Foundation Models for Enhancing Predictive Performance Across Diseases and Populations

Attention as an Adaptive Filter

Diffusion Language Models Know the Answer Before Decoding

NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows

Hard Examples Are All You Need: Maximizing GRPO Post-Training Under Annotation Budgets

AdaptJobRec: Enhancing Conversational Career Recommendation through an LLM-Powered Agentic System

An Introduction to Sliced Optimal Transport

EMSEdit: Efficient Multi-Step Meta-Learning-based Model Editing

DMSC: Dynamic Multi-Scale Coordination Framework for Time Series Forecasting

mmWave Radar-Based Non-Line-of-Sight Pedestrian Localization at T-Junctions Utilizing Road Layout Extraction via Camera

Capturing More: Learning Multi-Domain Representations for Robust Online Handwriting Verification

A Cooperative Approach for Knowledge-based Business Process Design in a Public Authority

Finding Dori: Memorization in Text-to-Image Diffusion Models Is Not Local

Feature Distillation is the Better Choice for Model-Heterogeneous Federated Learning

Knowledge Fusion via Bidirectional Information Aggregation

LearnLens: LLM-Enabled Personalised, Curriculum-Grounded Feedback with Educators in the Loop

Dual Perspectives on Non-Contrastive Self-Supervised Learning

SAFER: Probing Safety in Reward Models with Sparse Autoencoder

Inverse Design in Nanophotonics via Representation Learning

SPADE: Spatial Transcriptomics and Pathology Alignment Using a Mixture of Data Experts for an Expressive Latent Space

VIDEE: Visual and Interactive Decomposition, Execution, and Evaluation of Text Analytics with Intelligent Agents

Time-IMM: A Dataset and Benchmark for Irregular Multimodal Multivariate Time Series

BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models

Uncertainty Estimation on Graphs with Structure Informed Stochastic Partial Differential Equations

EvolveNav: Empowering LLM-Based Vision-Language Navigation via Self-Improving Embodied Reasoning

Can LLMs Reason Structurally? An Evaluation via the Lens of Data Structures

Finite Sample Analysis of Linear Temporal Difference Learning with Arbitrary Features

Leveraging Importance Sampling to Detach Alignment Modules from Large Language Models

Protein Design with Dynamic Protein Vocabulary

Your Pre-trained LLM is Secretly an Unsupervised Confidence Calibrator

Steering Large Language Models for Machine Translation Personalization

AsynFusion: Towards Asynchronous Latent Consistency Models for Decoupled Whole-Body Audio-Driven Avatars

Joint Embedding vs Reconstruction: Provable Benefits of Latent Space Prediction for Self Supervised Learning

Fixed Point Explainability

Time Travel is Cheating: Going Live with DeepFund for Real-Time Fund Investment Benchmarking

MobileCity: An Efficient Framework for Large-Scale Urban Behavior Simulation

A Customized SAT-based Solver for Graph Coloring

Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances

FOCUS on Contamination: A Geospatial Deep Learning Framework with a Noise-Aware Loss for Surface Water PFAS Prediction

ParetoQ: Improving Scaling Laws in Extremely Low-bit LLM Quantization

Query Brand Entity Linking in E-Commerce Search

AgentBreeder: Mitigating the AI Safety Risks of Multi-Agent Scaffolds via Self-Improvement

GraphRAG under Fire

Polynomial-Time Algorithms for Fair Orientations of Chores

CiteBART: Learning to Generate Citations for Local Citation Recommendation

Reframing Image Difference Captioning with BLIP2IDC and Synthetic Augmentation

Multi-View Majority Vote Learning Algorithms: Direct Minimization of PAC-Bayesian Bounds

COMAL: A Convergent Meta-Algorithm for Aligning LLMs with General Preferences

Generative AI for Requirements Engineering: A Systematic Literature Review

Assessing Latency in ASR Systems: A Methodological Perspective for Real-Time Use

ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training

Exploring the Frontier of Vision-Language Models: A Survey of Current Methodologies and Future Directions

Offline Fictitious Self-Play for Competitive Games

When "Competency" in Reasoning Opens the Door to Vulnerability: Jailbreaking LLMs via Novel Complex Ciphers

Can ChatGPT support software verification?

Optimized Layerwise Approximation for Efficient Private Inference on Fully Homomorphic Encryption

DRIFT: Decompose, Retrieve, Illustrate, then Formalize Theorems

Adaptive Dual Reasoner: Large Reasoning Models Can Think Efficiently by Hybrid Reasoning

Concise Reasoning in the Lens of Lagrangian Optimization

Humanoid Artificial Consciousness Designed with Large Language Model Based on Psychoanalysis and Personality Theory

TripScore: Benchmarking and rewarding real-world travel planning with fine-grained evaluation

Agent Learning via Early Experience

L2M-AID: Autonomous Cyber-Physical Defense by Fusing Semantic Reasoning of Large Language Models with Multi-Agent Reinforcement Learning (Preprint)

Clean First, Align Later: Benchmarking Preference Data Cleaning for Reliable LLM Alignment

Similarity Field Theory: A Mathematical Framework for Intelligence

Large Language Models in Operations Research: Methods, Applications, and Challenges

MoEs Are Stronger than You Think: Hyper-Parallel Inference Scaling with RoE

Strategic Tradeoffs Between Humans and AI in Multi-Agent Bargaining

MapAgent: A Hierarchical Agent for Geospatial Reasoning with Dynamic Map Tool Integration

Responsible AI Technical Report

Created by

Haebom

Authors

KT: Yunjin Park, Jungwon Yoon, Junhyung Moon, Myunggyo Oh, Wonhyuk Lee, Sujin Kim, Youngchol Kim, Eunmi Kim, Hyoungjun Park, Eunyoung Shin, Wonyoung Lee, Somin Lee, Minwook Ju, Minsung Noh, Dongyoung Jeong, Jeongyeop Kim, Wanjin Park, Soonmin Bae

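Overview

KT has developed a Responsible AI (RAI) assessment methodology and risk mitigation technologies to ensure the safety and reliability of its AI services. By analyzing the implementation of the Basic Act on AI and global AI governance trends, it establishes its own approach to regulatory compliance and systematically identifies and manages potential risk factors from AI development through operation. Based on KT's AI risk taxonomy, which is tailored to the Korean domestic environment, it presents a reliable assessment methodology that systematically verifies model safety and robustness, and it provides practical tools for managing and mitigating the identified AI risks. In addition, it releases SafetyGuard, a proprietary technology that blocks harmful AI model responses in real time, to help improve safety across the domestic AI development ecosystem. These findings offer valuable insights to organizations pursuing responsible AI development.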

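Implications and Limitations

• Implications:
    ◦ Presents an AI risk assessment methodology tailored to the Korean domestic AI environment.
    ◦ Strengthens AI safety by developing and releasing SafetyGuard, a technology that blocks harmful responses in real time.
    ◦ Provides practical tools for responsible AI development.
    ◦ Presents its own approach to regulatory compliance.

• Limitations:
    ◦ The paper lacks detail on the specific assessment methodology and technologies.
    ◦ No concrete data is given on the technologies' actual performance and effectiveness.
    ◦ The generalizability of the findings needs further validation.
    ◦ Little information is provided on SafetyGuard's technical limitations and scope of application.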
