haebom
Daily Arxiv
전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.
AI/ML in 3GPP 5G Advanced -- Services and Architecture
CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation
Dynamic Correction of Erroneous State Estimates via Diffusion Bayesian Exploration
Concept-Guided Backdoor Attack on Vision Language Models
On the Holographic Geometry of Deterministic Computation
Decoding inner speech with an end-to-end brain-to-text neural interface
Physics-informed Neural Operator Learning for Nonlinear Grad-Shafranov Equation
MindEval: Benchmarking Language Models on Multi-turn Mental Health Support
LAET: A Layer-wise Adaptive Ensemble Tuning Framework for Pretrained Language Models
FAST-CAD: A Fairness-Aware Framework for Non-Contact Stroke Diagnosis
Designing LLM-based Multi-Agent Systems for Software Engineering Tasks: Quality Attributes, Design Patterns and Rationale
SONIC: Supersizing Motion Tracking for Natural Humanoid Whole-Body Control
MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling
Data-Augmented Deep Learning for Downhole Depth Sensing and Field Validation
Chinese Discharge Drug Recommendation in Metabolic Diseases with Large Language Models
Analysing Moral Bias in Finetuned LLMs through Mechanistic Interpretability
Orders in Chaos: Enhancing Large-Scale MoE LLM Serving with Data Movement Forecasting
Reinforce-Ada: An Adaptive Sampling Framework under Non-linear RL Objectives
TempoControl: Temporal Attention Guidance for Text-to-Video Models
AortaDiff: A Unified Multitask Diffusion Framework For Contrast-Free AAA Imaging
SoREX: Towards Self-Explainable Social Recommendation with Relevant Ego-Path Extraction
The AI Productivity Index (APEX)
Uncovering Grounding IDs: How External Cues Shape Multimodal Binding
Pushing Toward the Simplex Vertices: A Simple Remedy for Code Collapse in Smoothed Vector Quantization
V-CECE: Visual Counterfactual Explanations via Conceptual Edits
Momentum-constrained Hybrid Heuristic Trajectory Optimization Framework with Residual-enhanced DRL for Visually Impaired Scenarios
HARP: Hallucination Detection via Reasoning Subspace Projection
Rethinking Sparse Autoencoders: Select-and-Project for Fairness and Control from Encoder Features Alone
IPA: An Information-Reconstructive Input Projection Framework for Efficient Foundation Model Adaptation
Sparse but Wrong: Incorrect L0 Leads to Incorrect Features in Sparse Autoencoders
ORFuzz: Fuzzing the "Other Side" of LLM Safety -- Testing Over-Refusal
Retro-Expert: Collaborative Reasoning for Interpretable Retrosynthesis
A Survey on Diffusion Language Models
CodeNER: Code Prompting for Named Entity Recognition
ReSem3D: Refinable 3D Spatial Constraints via Fine-Grained Semantic Grounding for Generalizable Robotic Manipulation
VERIRAG: A Post-Retrieval Auditing of Scientific Study Summaries
SustainDiffusion: Optimising the Social and Environmental Sustainability of Stable Diffusion Models
LittleBit: Ultra Low-Bit Quantization via Latent Factorization
Real-Time Execution of Action Chunking Flow Policies
AURA: A Diagnostic Framework for Tracking User Satisfaction of Interactive Planning Agents
Enhancing SPARQL Query Rewriting for Complex Ontology Alignments
Exploring Ordinal Bias in Action Recognition for Instructional Videos
Robust Weight Imprinting: Insights from Neural Collapse and Proxy-Based Aggregation
Experiments with Large Language Models on Retrieval-Augmented Generation for Closed-Source Simulation Software
SAT: Dynamic Spatial Aptitude Training for Multimodal Language Models
Large Language Models: An Applied Econometric Framework
Edge-Only Universal Adversarial Attacks in Distributed Learning
Image-Guided Semantic Pseudo-LiDAR Point Generation for 3D Object Detection
Detecting the Future: All-at-Once Event Sequence Forecasting with Horizon Matching
Variational Learning of Gaussian Process Latent Variable Models through Stochastic Gradient Annealed Importance Sampling
SOAP: Enhancing Spatio-Temporal Relation and Motion Information Capturing for Few-Shot Action Recognition
A Scene-aware Models Adaptation Scheme for Cross-scene Online Inference on Mobile Devices
Towards Data-efficient Customer Intent Recognition with Prompt-based Learning Paradigm
GTM: Simulating the World of Tools for AI Agents
Self-Transparency Failures in Expert-Persona LLMs: How Instruction-Following Overrides Honesty
Learning the Value of Value Learning
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe
ToolMind Technical Report: A Large-Scale, Reasoning-Enhanced Tool-Use Dataset
Debate over Mixed-knowledge: A Robust Multi-Agent Reasoning Framework for Incomplete Knowledge Graph Question Answering
Spilling the Beans: Teaching LLMs to Self-Report Their Hidden Objectives
Counterfactual Reasoning for Steerable Pluralistic Value Alignment of Large Language Models
Generalized Parallel Scaling with Interdependent Generations
SOCK: A Benchmark for Measuring Self-Replication in Large Language Models
KNARsack: Teaching Neural Algorithmic Reasoners to Solve Pseudo-Polynomial Problems
IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks
FedIFL: A federated cross-domain diagnostic framework for motor-driven systems with inconsistent fault modes
Enhancing Large Language Models through Neuro-Symbolic Integration and Ontological Reasoning
Rolling in the deep of cognitive and AI biases
Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms
Training-Time Action Conditioning for Efficient Real-Time Chunking
Whatever Remains Must Be True: Filtering Drives Reasoning in LLMs, Shaping Diversity
AQUA-Net: Adaptive Frequency Fusion and Illumination Aware Network for Underwater Image Enhancement
M4-RAG: A Massive-Scale Multilingual Multi-Cultural Multimodal RAG
MaxShapley: Towards Incentive-compatible Generative Search with Fair Context Attribution
Trusted AI Agents in the Cloud
Impugan: Learning Conditional Generative Models for Robust Data Imputation
Zoom in, Click out: Unlocking and Evaluating the Potential of Zooming for GUI Grounding
Measuring the Effect of Background on Classification and Feature Importance in Deep Learning for AV Perception
World Models That Know When They Don't Know: Controllable Video Generation with Calibrated Uncertainty
Natural Language Summarization Enables Multi-Repository Bug Localization by LLMs in Microservice Architectures
Neural Coherence : Find higher performance to out-of-distribution tasks from few samples
Sparse Attention Post-Training for Mechanistic Interpretability
Optimizing Medical Question-Answering Systems: A Comparative Study of Fine-Tuned and Zero-Shot Large Language Models with RAG Framework
NEAT: Neighborhood-Guided, Efficient, Autoregressive Set Transformer for 3D Molecular Generation
Phase-OTDR Event Detection Using Image-Based Data Transformation and Deep Learning
Approximation of Box Decomposition Algorithm for Fast Hypervolume-Based Multi-Objective Optimization
Probing the effectiveness of World Models for Spatial Reasoning through Test-time Scaling
3D Path Planning for Robot-assisted Vertebroplasty from Arbitrary Bi-plane X-ray via Differentiable Rendering
Mechanistic Interpretability of Antibody Language Models Using SAEs
Active Video Perception: Iterative Evidence Seeking for Agentic Long Video Understanding
Efficient Text Classification with Conformal In-Context Learning
Big Tech-Funded AI Papers Have Higher Citation Impact, Greater Insularity, and Larger Recency Bias
Bayesian Active Inference for Intelligent UAV Anti-Jamming and Adaptive Trajectory Planning
Faithfulness metric fusion: Improving the evaluation of LLM trustworthiness across domains
Retrieving Semantically Similar Decisions under Noisy Institutional Labels: Robust Comparison of Embedding Methods
InverseCrafter: Efficient Video ReCapture as a Latent Domain Inverse Problem
Feasibility of AI-Assisted Programming for End-User Development
Grounded Multilingual Medical Reasoning for Question Answering with Large Language Models
Modular Jets for Supervised Pipelines: Diagnosing Mirage vs Identifiability
A Comprehensive Framework for Automated Quality Control in the Automotive Industry
Load more
Parameter Importance-Driven Continual Learning for Foundation Models
Created by
Haebom
Category
Empty
저자
Lingxiang Wang, Hainan Zhang, Zhiming Zheng
PIECE: Parameter Importance Estimation-based Continual Enhancement for Domain Adaptation
•
도메인 특정 후속 학습으로 인해 범용 추론 능력이 손실되는 문제 해결
•
사전 학습 데이터 없이, 모델 파라미터 증가 없이, 효율적인 도메인 지식 학습
•
Fisher Information 기반 PIECE-F와 기울기 및 곡률 정보를 결합한 2차 정규화 기반 PIECE-S를 통해 핵심 파라미터의 0.1%만 선택적으로 업데이트
•
3개의 언어 모델과 2개의 멀티모달 모델에서 일반적인 능력 유지와 최첨단 지속 학습 성능 달성
•
확장 가능하고 도메인 적응형 기반 모델을 위한 실용적인 방법 제시
시사점, 한계점
•
시사점:
◦
일반적인 능력 유지를 통해 파운데이션 모델의 지속적인 학습을 가능하게 함
◦
사전 학습 데이터나 추가 파라미터 없이 효율적인 학습을 제공
◦
다양한 모델과 작업에서 뛰어난 성능을 보임
•
한계점:
◦
0.1%의 핵심 파라미터 선택 기준에 대한 추가적인 연구 필요
◦
다양한 도메인과 작업에 대한 일반화 성능 추가 검증 필요
◦
Fisher Information과 2차 정규화의 복잡성으로 인한 계산 비용 고려 필요
PDF 보기
Made with Slashpage