Daily Arxiv

世界中で発行される人工知能関連の論文をまとめるページです。
このページはGoogle Geminiを活用して要約し、非営利で運営しています。
論文の著作権は著者および関連機関にあり、共有する際は出典を明記してください。

Watermarking and Anomaly Detection in Machine Learning Models for LORA RF Fingerprinting

Vulnerable Agent Identification in Large-Scale Multi-Agent Reinforcement Learning

Sea-ing Through Scattered Rays: Revisiting the Image Formation Model for Realistic Underwater Image Generation

DPANet: Dual Pyramid Attention Network for Multivariate Time Series Forecasting

MeanFlowSE: one-step generative speech enhancement via conditional mean flow

Empathy-R1: A Chain-of-Empathy and Reinforcement Learning Framework for Long-Form Mental Health Support

Threat Modeling for Enhancing Security of IoT Audio Classification Devices under a Secure Protocols Framework

AToken: A Unified Tokenizer for Vision

TGPO: Tree-Guided Preference Optimization for Robust Web Agent Reinforcement Learning

Comprehensive Evaluation of CNN-Based Audio Tagging Models on Resource-Constrained Devices

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

Improving Anomalous Sound Detection with Attribute-aware Representation from Domain-adaptive Pre-training

Hardness, Structural Knowledge, and Opportunity: An Analytical Framework for Modular Performance Modeling

Benchmark of stylistic variation in LLM-generated texts

SWE-Effi: Re-Evaluating Software AI Agent System Effectiveness Under Resource Constraints

Structure Matters: Brain Graph Augmentation via Learnable Edge Masking for Data-efficient Psychiatric Diagnosis

DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge

Riemannian Batch Normalization: A Gyro Approach

On the Security of Tool-Invocation Prompts for LLM-Based Agentic Systems: An Empirical Risk Assessment

MIDOG 2025: Mitotic Figure Detection with Attention-Guided False Positive Correction

Do Retrieval Augmented Language Models Know When They Don't Know?

LongCat-Flash Technical Report

MedCOD: Enhancing English-to-Spanish Medical Translation of Large Language Models Using Enriched Chain-of-Dictionary Framework

Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

CORE-RAG: Lossless Compression for Retrieval-Augmented LLMs via Reinforcement Learning

OpenWHO: A Document-Level Parallel Corpus for Health Translation in Low-Resource Languages

Subjective Behaviors and Preferences in LLM: Language of Browsing

Using Natural Language for Human-Robot Collaboration in the Real World

RegionMed-CLIP: A Region-Aware Multimodal Contrastive Learning Pre-trained Model for Medical Image Understanding

Causal2Vec: Improving Decoder-only LLMs as Versatile Embedding Models

VLA-Mark: A cross modal watermark for large vision-language alignment model

Deformable Dynamic Convolution for Accurate yet Efficient Spatio-Temporal Traffic Prediction

Deep Reinforcement Learning with Gradient Eligibility Traces

Generating Moving 3D Soundscapes with Latent Diffusion Models

Capturing Polysemanticity with PRISM: A Multi-Concept Feature Description Framework

Discrete Diffusion in Large Language and Multimodal Models: A Survey

DualEdit: Dual Editing for Knowledge Updating in Vision-Language Models

Algorithmic Fairness: Not a Purely Technical but Socio-Technical Property

OptiScene: LLM-driven Indoor Scene Layout Generation via Scaled Human-aligned Data Synthesis and Multi-Stage Preference Optimization

Perception-R1: Advancing Multimodal Reasoning Capabilities of MLLMs via Visual Perception Reward

AS-ASR: A Lightweight Framework for Aphasia-Specific Automatic Speech Recognition

LLMs Can Compensate for Deficiencies in Visual Representations

Spatial Understanding from Videos: Structured Prompts Meet Simulation Data

Emergent Abilities of Large Language Models under Continued Pretraining for Language Adaptation

Cross-Attention Speculative Decoding

Beyond Linear Steering: Unified Multi-Attribute Control for Language Models

Noise-Robustness Through Noise: Asymmetric LoRA Adaption with Poisoning Expert

SEMMA: A Semantic Aware Knowledge Graph Foundation Model

AmpleHate: Amplifying the Attention for Versatile Implicit Hate Detection

Fairness-in-the-Workflow: How Machine Learning Practitioners at Big Tech Companies Approach Fairness in Recommender Systems

GRE Suite: Geo-localization Inference via Fine-Tuned Vision-Language Models and Enhanced Reasoning Chains

A Survey of Large Language Models for Data Challenges in Graphs

CLEAR: A Clinically-Grounded Tabular Framework for Radiology Report Evaluation

Creative Preference Optimization

MUG-Eval: A Proxy Evaluation Framework for Multilingual Generation Capabilities in Any Language

Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning

Space Group Equivariant Crystal Diffusion

Schreier-Coset Graph Propagation

Examining Deployment and Refinement of the VIOLA-AI Intracranial Hemorrhage Model Using an Interactive NeoMedSys Platform

StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

ConCISE: Confidence-guided Compression in Step-by-step Efficient Reasoning

AttentionDrop: A Novel Regularization Method for Transformer Models

MigGPT: Harnessing Large Language Models for Automated Migration of Out-of-Tree Linux Kernel Patches Across Versions

Hybrid Temporal Differential Consistency Autoencoder for Efficient and Sustainable Anomaly Detection in Cyber-Physical Systems

Who is Responsible When AI Fails? Mapping Causes, Entities, and Consequences of AI Privacy and Ethical Incidents

No Black Box Anymore: Demystifying Clinical Predictive Modeling with Temporal-Feature Cross Attention Mechanism

Negotiative Alignment: Embracing Disagreement to Achieve Fairer Outcomes - Insights from Urban Studies

MT-RewardTree: A Comprehensive Framework for Advancing LLM-Based Machine Translation via Reward Modeling

Pruning the Paradox: How CLIP's Most Informative Heads Enhance Performance While Amplifying Bias

KatFishNet: Detecting LLM-Generated Korean Text through Linguistic Feature Analysis

SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning

Sparsity May Be All You Need: Sparse Random Parameter Adaptation

Neural Networks for Learnable and Scalable Influence Estimation of Instruction Fine-Tuning Data

"It Felt Like I Was Left in the Dark": Exploring Information Needs and Design Opportunities for Family Caregivers of Older Adult Patients in Critical Care Settings

Gradient Alignment in Physics-informed Neural Networks: A Second-Order Optimization Perspective

Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models

A Layered Multi-Expert Framework for Long-Context Mental Health Assessments

Efficient Real-time Refinement of Language Model Text Generation

FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait

Dynamic Neural Curiosity Enhances Learning Flexibility for Autonomous Goal Discovery

Bayesian Concept Bottleneck Models with LLM Priors

G2D2: Gradient-Guided Discrete Diffusion for Inverse Problem Solving

SeCodePLT: A Unified Platform for Evaluating the Security of Code GenAI

DiRW: Path-Aware Digraph Learning for Heterophily

Towards Interactive and Learnable Cooperative Driving Automation: a Large Language Model-Driven Decision-Making Framework

DynamicNER: A Dynamic, Multilingual, and Fine-Grained Dataset for LLM-based Named Entity Recognition

CrackSCF: Lightweight Cascaded Fusion Network for Robust and Efficient Structural Crack Segmentation

ConfReady: A RAG based Assistant and Dataset for Conference Checklist Responses

FOVAL: Calibration-Free and Subject-Invariant Fixation Depth Estimation Across Diverse Eye-Tracking Datasets

Assessing invariance to affine transformations in image quality metrics

The Great AI Witch Hunt: Reviewers Perception and (Mis)Conception of Generative AI in Research Writing

Database-Augmented Query Representation for Information Retrieval

Two Is Better Than One: Aligned Representation Pairs for Anomaly Detection

BBScoreV2: Learning Time-Evolution and Latent Alignment from Stochastic Representation

Beyond Pixels: Enhancing LIME with Hierarchical Features and Segmentation Foundation Models

Spatio-Temporal Anomaly Detection with Graph Networks for Data Quality Monitoring of the Hadron Calorimeter

Understanding AI Evaluation Patterns: How Different GPT Models Assess Vision-Language Descriptions

Online Robust Planning under Model Uncertainty: A Sample-Based Approach

HiPhO: How Far Are (M)LLMs from Humans in the Latest High School Physics Olympiad Benchmark?

Algorithmic Fairness: Not a Purely Technical but Socio-Technical Property

Created by

Haebom

作者

Yijun Bian, Lei You, Yusa Sasaki, Haruka Maeda, Akira Igarashi

概要

本論文は、社会的に重要な分野における人工知能（AI）および機械学習（ML）システムの急速な展開傾向に基づいて、差別的な行動を含む信頼性に関する懸念が高まっていることを扱います。アルゴリズム公平性研究は数多くの数学的定義と測定基準を提示していますが、公平性の理解の欠如、主にバイナリグループの設定に合わせた既存の測定基準、クロスコンテキストに対する表面的な処理などの誤解と限界のために効果は制限されています。本論文はこの誤解について批判的に論じており、公平性をモデルに対する純粋に技術的な制約に縮小することはできないと主張する。概念分析と実証的実証を通して、既存の公平性指標の制限を検討し、複雑な現実シナリオでの適用性が制限されていることを示しています。さらに、正確性と公平性、および公平性指標間の相互排他性についての既存の見解に異議を唱え、公正性指標の設計に考慮する3つの原則を提示します。これらの研究結果は、技術的公式化と社会的現実との間のギャップを解消し、実際のAI / ML展開の課題を解決するのに役立つと期待しています。

Takeaways、Limitations

•

Takeaways：

◦

公平性を単に技術的問題としてのみ接近してはならず、社会的文脈を考慮しなければならないことを強調。

◦

既存の公平性指標の限界を指摘し、現実的な状況に適用可能な新しい指標の開発の必要性を提示する。

◦

正確性と公平性と公平性の指標間の相互排他性に関する既存の見解の在庫ニーズの提示

◦

公平性メトリック設計のための3つの原則を提示します。

•

Limitations：

◦

提示された3つの原則の具体的な適用方法と実際の効果に関するさらなる研究の必要性

◦

さまざまな社会的文脈とさまざまな種類の差別の包括的な考慮が必要です。

◦

提示された原則は、すべての状況に適用可能な普遍的な解決策ではないかもしれません。

Made with Slashpage