Daily Arxiv

전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.

A Systematic Review of Human-AI Co-Creativity

DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing

Exploring the Capabilities of the Frontier Large Language Models for Nuclear Energy Research

MUPA: Towards Multi-Path Agentic Reasoning for Grounded Video Question Answering

Sparse-Reg: Improving Sample Complexity in Offline Reinforcement Learning using Sparsity

FEAST: A Flexible Mealtime-Assistance System Towards In-the-Wild Personalization

How do Probabilistic Graphical Models and Graph Neural Networks Look at Network Data?

Vision Transformers Don't Need Trained Registers

Eye of Judgement: Dissecting the Evaluation of Russian-speaking LLMs with POLLUX

Maximizing Confidence Alone Improves Reasoning

EasyDistill: A Comprehensive Toolkit for Effective Knowledge Distillation of Large Language Models

Improving LLM Outputs Against Jailbreak Attacks with Expert Model Integration

Personalized Robotic Object Rearrangement from Scene Context

Cannot See the Forest for the Trees: Invoking Heuristics and Biases to Elicit Irrational Choices of LLMs

OpenTCM: A GraphRAG-Empowered LLM-based System for Traditional Chinese Medicine Knowledge Retrieval and Diagnosis

Explicit neural network classifiers for non-separable data

USM-VC: Mitigating Timbre Leakage with Universal Semantic Mapping Residual Block for Voice Conversion

Towards Adaptive Memory-Based Optimization for Enhanced Retrieval-Augmented Generation

LoopGen: Training-Free Loopable Music Generation

Automated detection of atomicity violations in large-scale systems

Grammar and Gameplay-aligned RL for Game Description Generation with LLMs

Generative AI for Software Architecture. Applications, Challenges, and Future Directions

English K_Quantization of LLMs Does Not Disproportionately Diminish Multilingual Performance

Heuristics for AI-driven Graphical Asset Generation Tools in Game Design and Development Pipelines: A User-Centred Approach

Collective Reasoning Among LLMs: A Framework for Answer Validation Without Ground Truth

Multi-Turn Code Generation Through Single-Step Rewards

Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference

KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

AB-UPT: Scaling Neural CFD Surrogates for High-Fidelity Automotive Aerodynamics Simulations via Anchored-Branched Universal Physics Transformers

MedRAG: Enhancing Retrieval-augmented Generation with Knowledge Graph-Elicited Reasoning for Healthcare Copilot

Generative Data Mining with Longtail-Guided Diffusion

Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation

No More Sliding Window: Efficient 3D Medical Image Segmentation with Differentiable Top-k Patch Sampling

Jailbreaking Multimodal Large Language Models via Shuffle Inconsistency

End-to-End Long Document Summarization using Gradient Caching

Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models

KNN-MMD: Cross Domain Wireless Sensing via Local Distribution Alignment

EUR/USD Exchange Rate Forecasting incorporating Text Mining Based on Pre-trained Language Models and Deep Learning Methods

Large-Scale Multirobot Coverage Path Planning on Grids With Path Deconfliction

Dynamic Adaptive Rank Space Exploration for Efficient Sentiment Analysis with Large Language Models

Federated Data-Efficient Instruction Tuning for Large Language Models

QT-DoG: Quantization-aware Training for Domain Generalization

Testing Causal Models with Hidden Variables in Polynomial Delay via Conditional Independencies

Stability of Primal-Dual Gradient Flow Dynamics for Multi-Block Convex Optimization Problems

LRP4RAG: Detecting Hallucinations in Retrieval-Augmented Generation via Layer-wise Relevance Propagation

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods

Dynamic Adaptive Optimization for Effective Sentiment Analysis Fine-Tuning on Large Language Models

Mitigating Metropolitan Carbon Emissions with Dynamic Eco-driving at Scale

CAPM: Fast and Robust Verification on Maxpool-based CNN via Dual Network

MimicMotion: High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance

From Data Quality for AI to AI for Data Quality: A Systematic Review of Tools for AI-Augmented Data Quality Management in Data Warehouses

FuzzAug: Data Augmentation by Coverage-guided Fuzzing for Neural Test Generation

RLSF: Fine-tuning LLMs via Symbolic Feedback

A Survey on Patent Analysis: From NLP to Multimodal AI

Enhancing Object Detection Robustness: Detecting and Restoring Confidence in the Presence of Adversarial Patch Attacks

Bias-Augmented Consistency Training Reduces Biased Reasoning in Chain-of-Thought

Programming Distributed Collective Processes in the eXchange Calculus

Communication-Efficient Heterogeneous Federated Learning with Generalized Heavy-Ball Momentum

Fairness and Bias in Algorithmic Hiring: a Multidisciplinary Survey

On CNF formulas irredundant with respect to unit clause propagation

SONG: Self-Organizing Neural Graphs

Mobile-R1: Towards Interactive Reinforcement Learning for VLM-Based Mobile Agent via Task-Level Rewards

KunLunBaizeRAG: Reinforcement Learning Driven Inference Performance Leap for Large Language Models

FEAT: A Preference Feedback Dataset through a Cost-Effective Auto-Generation and Labeling Framework for English AI Tutoring

Dynamic Knowledge Exchange and Dual-diversity Review: Concisely Unleashing the Potential of a Multi-Agent Research Team

PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models

From Human to Machine Psychology: A Conceptual Framework for Understanding Well-Being in Large Language Models

The SWE-Bench Illusion: When State-of-the-Art LLMs Remember Instead of Reason

VLM@school -- Evaluation of AI image understanding on German middle school knowledge

The AI Imperative: Scaling High-Quality Peer Review in Machine Learning

Toward Data Systems That Are Business Semantic Centric and AI Agents Assisted

ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows

$C^3$-Bench: The Things Real Disturbing LLM based Agent in Multi-Tasking

StarFT: Robust Fine-tuning of Zero-shot Models via Spuriosity Alignment

REMOR: Automated Peer Review Generation with LLM Reasoning and Multi-Objective Reinforcement Learning

Epistemic Artificial Intelligence is Essential for Machine Learning Models to Truly 'Know When They Do Not Know'

Local Markov Equivalence and Local Causal Discovery for Identifying Controlled Direct Effects

Adapting Probabilistic Risk Assessment for AI

From Superficial to Deep: Integrating External Knowledge for Follow-up Question Generation Using Knowledge Graph and LLM

Wider or Deeper? Scaling LLM Inference-Time Compute with Adaptive Branching Tree Search

SENSEI: Semantic Exploration Guided by Foundation Models to Learn Versatile World Models

OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis

Problem Solving Through Human-AI Preference-Based Cooperation

CRAB: Cross-environment Agent Benchmark for Multimodal Language Model Agents

CLoVE: Personalized Federated Learning through Clustering of Loss Vector Embeddings

HyperCLOVA X THINK Technical Report

Dehazing Light Microscopy Images with Guided Conditional Flow Matching: finding a sweet spot between fidelity and realism

QuickSilver -- Speeding up LLM Inference through Dynamic Token Halting, KV Skipping, Contextual Token Fusion, and Adaptive Matryoshka Quantization

Multi-View Contrastive Learning for Robust Domain Adaptation in Medical Time Series Analysis

Towards Distributed Neural Architectures

Can Video Large Multimodal Models Think Like Doubters-or Double-Down: A Study on Defeasible Video Entailment

Probabilistic Optimality for Inference-time Scaling

Sheaf-Based Decentralized Multimodal Learning for Next-Generation Wireless Communication Systems

From Ground to Air: Noise Robustness in Vision Transformers and CNNs for Event-Based Vehicle Classification with Potential UAV Applications

Concept-Level AI for Telecom: Moving Beyond Large Language Models

A Framework for Multi-source Privacy Preserving Epidemic Analysis

A Deep Learning framework for building damage assessment using VHR SAR and geospatial data: demonstration on the 2023 Turkiye Earthquake

Less Greedy Equivalence Search

A Practical Approach to Power Saving in Hearables Using Sub-Nyquist Sampling with Bandwidth Extension

Refining Salience-Aware Sparse Fine-Tuning Strategies for Language Models

Created by

Haebom

저자

Xinxin Liu, Aaron Thomas, Cheng Zhang, Jianyi Cheng, Yiren Zhao, Xitong Gao

개요

본 논문은 매개변수 효율적인 미세 조정(PEFT)의 희소성 기반 방법(SPEFT)에 초점을 맞추고 있습니다. 기존의 저차원 적응 방법(예: LoRA)과 달리, SPEFT는 모델의 가중치 행렬에 학습 가능한 희소 적응을 도입하여 미세 조정 매개변수 선택에 더 큰 유연성을 제공합니다. 본 논문에서는 제로-코스트 NAS 프록시에서 영감을 받아 SPEFT에 대한 중요도 지표의 최초 체계적인 평가를 수행하고, 간단한 기울기 기반 지표가 신뢰할 수 있으며 최고의 대안과 동등한 성능을 제공함을 확인했습니다. 또한, 정적 및 동적 마스킹 전략을 비교하여, 정적 마스킹이 성능 저하 없이 효율성을 제공하는 반면 동적 마스킹은 실질적인 이점이 없음을 발견했습니다. NLP 작업 전반에서 간단한 기울기 기반 정적 SPEFT는 다른 LLM 미세 조정 방법을 일관되게 능가하며, SPEFT에 대한 간단하면서도 효과적인 기준을 제시합니다. 본 연구는 효과적인 PEFT에 복잡성이 필요하다는 생각에 이의를 제기하며, 오픈소스 프레임워크([https://github.com/0-ml/speft])를 통해 향후 연구를 위한 재현 가능한 벤치마크를 제공합니다.

시사점, 한계점

•

시사점:

◦

간단한 기울기 기반의 정적 SPEFT가 다른 LLM 미세 조정 방법보다 우수한 성능을 보임을 실험적으로 증명.

◦

정적 마스킹 전략이 동적 마스킹보다 효율적이고 성능 저하 없이 효과적임을 밝힘.

◦

복잡성이 높은 PEFT 방법이 항상 최상의 성능을 보장하는 것은 아님을 시사.

◦

오픈소스 프레임워크를 제공하여 향후 연구의 재현성을 높임.

•

한계점:

◦

현재까지 NLP 작업에 대한 평가만 수행되었으며, 다른 도메인이나 작업에 대한 일반화 가능성은 추가 연구가 필요.

◦

제안된 방법의 성능 향상은 특정 데이터셋과 모델에 따라 다를 수 있음.

◦

기울기 기반 중요도 지표의 신뢰성은 다양한 모델과 데이터셋에서 추가적인 검증이 필요.

Made with Slashpage