haebom
Daily Arxiv
전 세계에서 발간되는 인공지능 관련 논문을 정리하는 페이지 입니다.
본 페이지는 Google Gemini를 활용해 요약 정리하며, 비영리로 운영 됩니다.
논문에 대한 저작권은 저자 및 해당 기관에 있으며, 공유 시 출처만 명기하면 됩니다.
OPTIC-ER: A Reinforcement Learning Framework for Real-Time Emergency Response and Equitable Resource Allocation in Underserved African Communities
BioAnalyst: A Foundation Model for Biodiversity
Scaling Towards the Information Boundary of Instruction Sets: The Infinity Instruct Subject Technical Report
Dual-Objective Reinforcement Learning with Novel Hamilton-Jacobi-Bellman Formulations
NeuroPhysNet: A FitzHugh-Nagumo-Based Physics-Informed Neural Network Framework for Electroencephalograph (EEG) Analysis and Motor Imagery Classification
PRO-V-R1: Reasoning Enhanced Programming Agent for RTL Verification
Large language models can learn and generalize steganographic chain-of-thought under process supervision
Turing Test 2.0: The General Intelligence Threshold
Consistency-based Abductive Reasoning over Perceptual Errors of Multiple Pre-trained Models in Novel Environments
Empowering Clients - Transformation of Design Processes Due to Generative AI
The Delusional Hedge Algorithm as a Model of Human Learning from Diverse Opinions
The Universal Weight Subspace Hypothesis
DraCo: Draft as CoT for Text-to-Image Preview and Rare Concept Generation
ShadowDraw: From Any Object to Shadow-Drawing Compositional Art
Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning
TV2TV: A Unified Framework for Interleaved Language and Video Generation
Structured Document Translation via Format Reinforcement Learning
SA-IQA: Redefining Image Quality Assessment for Spatial Aesthetics with Multi-Dimensional Rewards
David vs. Goliath: Can Small Models Win Big with Agentic AI in Hardware Design?
Multi-LLM Collaboration for Medication Recommendation
Meta-Learning for Quantum Optimization via Quantum Sequence Model
QKAN-LSTM: Quantum-inspired Kolmogorov-Arnold Long Short-term Memory
Arbitrage: Efficient Reasoning via Advantage-Aware Speculation
Model-Free Assessment of Simulator Fidelity via Quantile Curves
Reflection Removal through Efficient Adaptation of Diffusion Transformers
Evolutionary Architecture Search through Grammar-Based Sequence Alignment
Strategic Self-Improvement for Competitive Agents in AI Labour Markets
Balanced Few-Shot Episodic Learning for Accurate Retinal Disease Diagnosis
GeoPE:A Unified Geometric Positional Embedding for Structured Tensors
Realizable Abstractions: Near-Optimal Hierarchical Reinforcement Learning
LLMs Know More Than Words: A Genre Study with Syntax, Metaphor & Phonetics
CARL: Critical Action Focused Reinforcement Learning for Multi-Step Agent
Declarative Synthesis and Multi-Objective Optimization of Stripboard Circuit Layouts Using Answer Set Programming
ReflexFlow: Rethinking Learning Objective for Exposure Bias Alleviation in Flow Matching
Developing a General Personal Tutor for Education
SEAL: Self-Evolving Agentic Learning for Conversational Question Answering over Knowledge Graphs
Language Models as Semantic Teachers: Post-Training Alignment for Medical Audio Understanding
Mitigating Catastrophic Forgetting in Target Language Adaptation of LLMs via Source-Shielded Updates
From Symptoms to Systems: An Expert-Guided Approach to Understanding Risks of Generative AI for Eating Disorders
SoK: a Comprehensive Causality Analysis Framework for Large Language Model Security
Setting up for failure: automatic discovery of the neural mechanisms of cognitive errors
287,872 Supermassive Black Holes Masses: Deep Learning Approaching Reverberation Mapping Accuracy
YingMusic-SVC: Real-World Robust Zero-Shot Singing Voice Conversion with Flow-GRPO and Singing-Specific Inductive Biases
YingMusic-Singer: Zero-shot Singing Voice Synthesis and Editing with Annotation-free Melody Guidance
Using Machine Learning to Take Stay-or-Go Decisions in Data-driven Drone Missions
Embodied Co-Design for Rapidly Evolving Agents: Taxonomy, Frontiers, and Challenges
UnwrapDiff: Conditional Diffusion for Robust InSAR Phase Unwrapping
SignRoundV2: Closing the Performance Gap in Extremely Low-Bit Post-Training Quantization for LLMs
Neural Policy Composition from Free Energy Minimization
OsmT: Bridging OpenStreetMap Queries and Natural Language with Open-source Tag-aware Language Models
E3AD: An Emotion-Aware Vision-Language-Action Model for Human-Centric End-to-End Autonomous Driving
Measuring the Unspoken: A Disentanglement Model and Benchmark for Psychological Analysis in the Wild
Towards an AI Fluid Scientist: LLM-Powered Scientific Discovery in Experimental Fluid Mechanics
Large Speech Model Enabled Semantic Communication
TimesNet-Gen: Deep Learning-based Site Specific Strong Motion Generation
Generative AI for Self-Adaptive Systems: State of the Art and Research Roadmap
Topology Matters: Measuring Memory Leakage in Multi-Agent LLMs
Semi Centralized Training Decentralized Execution Architecture for Multi Agent Deep Reinforcement Learning in Traffic Signal Control
SEASON: Mitigating Temporal Hallucination in Video Large Language Models via Self-Diagnostic Contrastive Decoding
When GenAI Meets Fake News: Understanding Image Cascade Dynamics on Reddit
When Robots Should Say "I Don't Know": Benchmarking Abstention in Embodied Question Answering
A Light-Weight Large Language Model File Format for Highly-Secure Model Distribution
Diffusion Fine-Tuning via Reparameterized Policy Gradient of the Soft Q-Function
RRPO: Robust Reward Policy Optimization for LLM-based Emotional TTS
Multi-Loss Learning for Speech Emotion Recognition with Energy-Adaptive Mixup and Frame-Level Attention
AdmTree: Compressing Lengthy Context with Adaptive Semantic Trees
Detection of Intoxicated Individuals from Facial Video Sequences via a Recurrent Fusion Model
PhyVLLM: Physics-Guided Video Language Model with Motion-Appearance Disentanglement
Prototype-Based Semantic Consistency Alignment for Domain Adaptive Retrieval
UW-BioNLP at ChemoTimelines 2025: Thinking, Fine-Tuning, and Dictionary-Enhanced LLM Systems for Chemotherapy Timeline Extraction
GraphBench: Next-generation graph learning benchmarking
GuidNoise: Single-Pair Guided Diffusion for Generalized Noise Synthesis
Open-Ended Goal Inference through Actions and Language for Human-Robot Collaboration
NORi: An ML-Augmented Ocean Boundary Layer Parameterization
Automating Complex Document Workflows via Stepwise and Rollback-Enabled Operation Orchestration
Explainable Parkinsons Disease Gait Recognition Using Multimodal RGB-D Fusion and Large Language Models
Dual-Stream Spectral Decoupling Distillation for Remote Sensing Object Detection
Towards 6G Native-AI Edge Networks: A Semantic-Aware and Agentic Intelligence Paradigm
Adversarial Limits of Quantum Certification: When Eve Defeats Detection
FMA-Net++: Motion- and Exposure-Aware Real-World Joint Video Super-Resolution and Deblurring
MASE: Interpretable NLP Models via Model-Agnostic Saliency Estimation
STeP-Diff: Spatio-Temporal Physics-Informed Diffusion Models for Mobile Fine-Grained Pollution Forecasting
AutoGuard: A Self-Healing Proactive Security Layer for DevSecOps Pipelines Using Reinforcement Learning
Mitigating Object and Action Hallucinations in Multimodal LLMs via Self-Augmented Contrastive Alignment
Counting Without Running: Evaluating LLMs' Reasoning About Code Complexity
RGE-GCN: Recursive Gene Elimination with Graph Convolutional Networks for RNA-seq based Early Cancer Detection
DAComp: Benchmarking Data Agents across the Full Data Intelligence Lifecycle
Bayes-DIC Net: Estimating Digital Image Correlation Uncertainty with Bayesian Neural Networks
MANTRA: a Framework for Multi-stage Adaptive Noise TReAtment During Training
Evaluating Long-Context Reasoning in LLM-Based WebAgents
Gamma-from-Mono: Road-Relative, Metric, Self-Supervised Monocular Geometry for Vehicular Applications
Learning Single-Image Super-Resolution in the JPEG Compressed Domain
Bootstrapped Mixed Rewards for RL Post-Training: Injecting Canonical Action Order
Quantitative Analysis of Technical Debt and Pattern Violation in Large Language Model Architectures
The Initialization Determines Whether In-Context Learning Is Gradient Descent
Catching UX Flaws in Code: Leveraging LLMs to Identify Usability Flaws at the Development Stage
Hey GPT-OSS, Looks Like You Got It - Now Walk Me Through It! An Assessment of the Reasoning Language Models Chain of Thought Mechanism for Digital Forensics
Fine-Tuning ChemBERTa for Predicting Inhibitory Activity Against TDP1 Using Deep Learning
MVRoom: Controllable 3D Indoor Scene Generation with Multi-View Diffusion Models
CRAFT-E: A Neuro-Symbolic Framework for Embodied Affordance Grounding
Load more
Dual-Process Scaffold Reasoning for Enhancing LLM Code Debugging
Created by
Haebom
Category
Empty
저자
Po-Chung Hsieh, Chin-Po Chen, Jeng-Lin Li, Ming-Ching Chang
개요
본 논문은 복잡성과 계산 효율성의 균형을 맞춘 추론 단계를 식별하는 문제를 해결하기 위해, 심리학 이론을 기반으로 한 Scaffold Reasoning 프레임워크를 제안합니다. 이 프레임워크는 Scaffold Stream, Analytic Stream, Integration Stream으로 구성되며, DebugBench에서 88.91%의 통과율과 문제당 평균 5.36초의 추론 시간을 달성하여 다른 추론 방식보다 우수한 성능을 보였습니다.
시사점, 한계점
•
심리학적 이론을 활용하여 LLM의 추론 과정을 최적화하는 새로운 접근 방식을 제시함
•
코드 디버깅 분야에서 높은 성능과 효율성을 달성함
•
다양한 문제 난이도와 버그 유형에 따른 인지 경로의 장단점을 분석함
•
제안된 프레임워크가 인간의 인지 과정과 유사함을 확인함
•
System 2 추론에 대한 더 깊이 있는 탐구가 부족함
•
특정 벤치마크 (DebugBench)에 대한 결과만 제시되어 일반화의 한계가 존재함
PDF 보기
Made with Slashpage