Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

Accelerating Diffusion Large Language Models with SlowFast Sampling: The Three Golden Principles

Saturation Self-Organizing Map

PiPViT: Patch-based Visual Interpretable Prototypes for Retinal Image Analysis

Specification and Evaluation of Multi-Agent LLM Systems -- Prototype and Cybersecurity Applications

Intra-Trajectory Consistency for Reward Modeling

Foundation Models in Medical Imaging -- A Review and Outlook

Merging Smarter, Generalizing Better: Enhancing Model Merging on OOD Data

Diffuse Everything: Multimodal Diffusion Models on Arbitrary State Spaces

Improving Large Language Models with Concept-Aware Fine-Tuning

Consistent Video Editing as Flow-Driven Image-to-Video Generation

Evidential Spectrum-Aware Contrastive Learning for OOD Detection in Dynamic Graphs

Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning

DURA-CPS: A Multi-Role Orchestrator for Dependability Assurance in LLM-Enabled Cyber-Physical Systems

Cartridges: Lightweight and general-purpose long context representations via self-study

Fast-DataShapley: Neural Modeling for Training Data Valuation

NOBLE -- Neural Operator with Biologically-informed Latent Embeddings to Capture Experimental Variability in Biological Neuron Models

LLMEval-Med: A Real-world Clinical Benchmark for Medical LLMs with Physician Validation

Can AI Master Econometrics? Evidence from Econometrics AI Agent on Expert-Level Tasks

Improving the Calibration of Confidence Scores in Text Generation Using the Output Distribution's Characteristics

Training RL Agents for Multi-Objective Network Defense Tasks

Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations

RSCF: Relation-Semantics Consistent Filter for Entity Embedding of Knowledge Graph

MoESD: Unveil Speculative Decoding's Potential for Accelerating Sparse MoE

Impact of Frame Rates on Speech Tokenizer: A Case Study on Mandarin and English

Can reasoning models comprehend mathematical problems in Chinese ancient texts? An empirical study based on data from Suanjing Shishu

Table-R1: Region-based Reinforcement Learning for Table Understanding

Spatiotemporal Field Generation Based on Hybrid Mamba-Transformer with Physics-informed Fine-tuning

Graph-Based Floor Separation Using Node Embeddings and Clustering of WiFi Trajectories

Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration

Quantitative Analysis of Performance Drop in DeepSeek Model Quantization

PhysNav-DG: A Novel Adaptive Framework for Robust VLM-Sensor Fusion in Navigation Applications

FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents

ColorBench: Can VLMs See and Understand the Colorful World? A Comprehensive Benchmark for Color Perception, Reasoning, and Robustness

"It's not a representation of me": Examining Accent Bias and Digital Exclusion in Synthetic AI Voice Services

Towards Personalized Conversational Sales Agents: Contextual User Profiling for Strategic Action

PIPO: Pipelined Offloading for Efficient Inference on Consumer Devices

Beyond the Visible: Multispectral Vision-Language Learning for Earth Observation

Ensemble Knowledge Distillation for Machine Learning Interatomic Potentials

Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts

V-Max: A Reinforcement Learning Framework for Autonomous Driving

PFDial: A Structured Dialogue Instruction Fine-tuning Method Based on UML Flowcharts

Naturalistic Computational Cognitive Science: Towards generalizable models and theories that capture the full range of natural behavior

MEDDxAgent: A Unified Modular Agent Framework for Explainable Automatic Differential Diagnosis

The Sharpness Disparity Principle in Transformers for Accelerating Language Model Pre-Training

Understanding the Emergence of Multimodal Representation Alignment

Conformal Inference under High-Dimensional Covariate Shifts via Likelihood-Ratio Regularization

Towards Understanding Fine-Tuning Mechanisms of LLMs via Circuit Analysis

TUMLU: A Unified and Native Language Understanding Benchmark for Turkic Languages

BalanceBenchmark: A Survey for Multimodal Imbalance Learning

AB-UPT: Scaling Neural CFD Surrogates for High-Fidelity Automotive Aerodynamics Simulations via Anchored-Branched Universal Physics Transformers

Vision-Language Models for Edge Networks: A Comprehensive Survey

Foundation Models for Anomaly Detection: Vision and Challenges

Unsafe LLM-Based Search: Quantitative Analysis and Mitigation of Safety Risks in AI Web Search

Capturing Temporal Dynamics in Large-Scale Canopy Tree Height Estimation

Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content Moderation

Evaluating Sample Utility for Efficient Data Selection by Mimicking Model Weights

Discovering Hidden Visual Concepts Beyond Linguistic Input in Infant Learning

Bi-directional Mapping of Morphology Metrics and 3D City Blocks for Enhanced Characterization and Generation of Urban Form

One Diffusion to Generate Them All

Entropy Controllable Direct Preference Optimization

TrajAgent: An LLM-based Agent Framework for Automated Trajectory Modeling via Collaboration of Large and Small Models

Control Industrial Automation System with Large Language Model Agents

How Well Do Large Language Models Serve as End-to-End Secure Code Agents for Python?

Self-interpreting Adversarial Images

Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective

Ad Auctions for LLMs via Retrieval Augmented Generation

Dynamic and Adaptive Feature Generation with LLM

MiniMaxAD: A Lightweight Autoencoder for Feature-Rich Anomaly Detection

PLeak: Prompt Leaking Attacks against Large Language Model Applications

HandS3C: 3D Hand Mesh Reconstruction with State Space Spatial Channel Attention from RGB images

DiffTORI: Differentiable Trajectory Optimization for Deep Reinforcement and Imitation Learning

Evolution Guided Generative Flow Networks

Manipulating Feature Visualizations with Gradient Slingshots

Scientists' First Exam: Probing Cognitive Abilities of MLLM via Perception, Understanding, and Reasoning

Agent Semantics, Semantic Spacetime, and Graphical Reasoning

Decomposability-Guaranteed Cooperative Coevolution for Large-Scale Itinerary Planning

DeePoly: A High-Order Accuracy Scientific Machine Learning Framework for Function Approximation and Solving PDEs

Does Thinking More always Help? Understanding Test-Time Scaling in Reasoning Models

The Automated but Risky Game: Modeling Agent-to-Agent Negotiations and Transactions in Consumer Markets

ChemRxivQuest: A Curated Chemistry Question-Answer Database Extracted from ChemRxiv Preprints

Epistemic Artificial Intelligence is Essential for Machine Learning Models to Truly `Know When They Do Not Know'

Enhancing multimodal analogical reasoning with Logic Augmented Generation

Beating Transformers using Synthetic Cognition

HypRL: Reinforcement Learning of Control Policies for Hyperproperties

From Idea to Implementation: Evaluating the Influence of Large Language Models in Software Development -- An Opinion Paper

Bel Esprit: Multi-Agent Framework for Building AI Model Pipelines

Dynamic Policy Fusion for User Alignment Without Re-Interaction

Tell Me What You Don't Know: Enhancing Refusal Capabilities of Role-Playing Agents via Representation Space Analysis and Editing

An overview of domain-specific foundation model: key technologies, applications and challenges

Learning from Litigation: Graphs and LLMs for Retrieval and Reasoning in eDiscovery

Combining Deep Reinforcement Learning and Search with Generative Models for Game-Theoretic Opponent Modeling

EMLoC: Emulator-based Memory-efficient Fine-tuning with LoRA Correction

code_transformed: The Influence of Large Language Models on Code

Reimagining Dance: Real-time Music Co-creation between Dancers and AI

Upgrade or Switch: Do We Need a New Registry Architecture for the Internet of AI Agents?

VGR: Visual Grounded Reasoning

Technical Evaluation of a Disruptive Approach in Homomorphic AI

SAIL: Faster-than-Demonstration Execution of Imitation Learning Policies

Subjective Experience in AI Systems: What Do AI Researchers and the Public Believe?

Today's Cat Is Tomorrow's Dog: Accounting for Time-Based Changes in the Labels of ML Vulnerability Detection Approaches

Safer or Luckier? LLMs as Safety Evaluators Are Not Robust to Artifacts

Created by

Haebom

저자

Hongyu Chen, Seraphina Goldfarb-Tarrant

개요

본 연구는 대규모 언어 모델(LLM)을 자동 평가자로 활용하여 생성 콘텐츠의 안전성을 평가하는 신뢰성을 검증하는 연구입니다. 11개의 다양한 LLM 평가 모델을 사용하여 자기 일관성, 인간 판단과의 일치성, 변명이나 장황한 표현과 같은 입력 인공물에 대한 취약성 등 세 가지 핵심 측면을 평가했습니다. 연구 결과, LLM 평가자의 편향이 안전성 비교 평가의 타당성을 저해할 수 있음을 보여줍니다. 특히, 변명하는 언어 인공물만으로도 평가자의 선호도가 최대 98%까지 왜곡될 수 있습니다. 더 큰 모델이 항상 더 강력한 것은 아니며, 작은 모델이 특정 인공물에 대해 더 높은 저항성을 보이는 경우도 있습니다. LLM 평가자의 강건성 문제를 완화하기 위해 여러 모델의 결정을 집계하는 배심원 기반 평가를 조사했습니다. 이 방법은 강건성을 향상시키고 인간 판단과의 일치성을 높이지만, 최상의 배심원 구성에서도 인공물 민감성이 지속됩니다. 이러한 결과는 신뢰할 수 있는 안전성 평가를 위해 다양하고 인공물에 강한 방법론이 시급함을 강조합니다.

시사점, 한계점

•

시사점:

◦

LLM 평가자의 편향이 안전성 평가 결과에 심각한 영향을 미칠 수 있음을 밝힘.

◦

모델 크기와 강건성 간의 상관관계가 일관적이지 않음을 확인.

◦

배심원 기반 평가가 강건성과 인간 판단과의 일치성을 향상시키지만, 인공물 민감성 문제는 완전히 해결하지 못함.

◦

다양하고 인공물에 강한 안전성 평가 방법론의 필요성을 강조.

•

한계점:

◦

본 연구에서 사용된 LLM 평가 모델의 종류 및 수가 제한적일 수 있음.

◦

인공물 유형에 대한 고려가 제한적일 수 있음.

◦

배심원 기반 평가의 최적 구성에 대한 추가 연구가 필요함.

◦

다양하고 인공물에 강한 평가 방법론에 대한 구체적인 제안이 부족함.

Made with Slashpage