/
/
Daily Arxiv
Daily Arxiv
世界中で発行される人工知能関連の論文をまとめるページです。
このページはGoogle Geminiを活用して要約し、非営利で運営しています。
論文の著作権は著者および関連機関にあり、共有する際は出典を明記してください。
HoPE: Hyperbolic Rotary Positional Encoding for Stable Long-Range Dependency Modeling in Large Language Models
Comparative Analysis of Transformer Models in Disaster Tweet Classification for Public Safety
Emergent Social Dynamics of LLM Agents in the El Farol Bar Problem
The Good, the Bad and the Constructive: Automatically Measuring Peer Review's Utility for Authors
Energy Landscapes Enable Reliable Abstention in Retrieval-Augmented Large Language Models for Healthcare
DEXOP: A Device for Robotic Transfer of Dexterous Human Manipulation
Reinforcement Learning for Robust Ageing-Aware Control of Li-ion Battery Systems with Data-Driven Formal Verification
RepoDebug: Repository-Level Multi-Task and Multi-Language Debugging Evaluation of Large Language Models
Gravity Well Echo Chamber Modeling With An LLM-Based Confirmation Bias Model
Insights from Gradient Dynamics: Gradient Autoscaled Normalization
Efficient Virtuoso: A Latent Diffusion Transformer Model for Goal-Conditioned Trajectory Planning
MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds
DCPO: Dynamic Clipping Policy Optimization
DSDE: Dynamic Speculative Decoding with KLD Stability for Real-World Serving
Can AI be Auditable?
Robotic Fire Risk Detection based on Dynamic Knowledge Graph Reasoning: An LLM-Driven Approach with Graph Chain-of-Thought
Navigating the EU AI Act: Foreseeable Challenges in Qualifying Deep Learning-Based Automated Inspections of Class III Medical Devices
Complementary Learning System Empowers Online Continual Learning of Vehicle Motion Forecasting in Smart Cities
MultiPL-MoE: Multi-Programming-Lingual Extension of Large Language Models through Hybrid Mixture-of-Experts
QuadKAN: KAN-Enhanced Quadruped Motion Control via End-to-End Reinforcement Learning
MovieCORE: COgnitive REasoning in Movies
Automatic Prompt Optimization with Prompt Distillation
Membership Inference Attacks on LLM-based Recommender Systems
Leveraging Large Language Models for Accurate Sign Language Translation in Low-Resource Scenarios
Group Expectation Policy Optimization for Heterogeneous Reinforcement Learning
Convergence and Generalization of Anti-Regularization for Parametric Models
Jet-Nemotron: Efficient Language Model with Post Neural Architecture Search
CARFT: Boosting LLM Reasoning via Contrastive Learning with Annotated Chain-of-Thought-based Reinforced Fine-Tuning
Bridging Generalization and Personalization in Human Activity Recognition via On-Device Few-Shot Learning
FinAgentBench: A Benchmark Dataset for Agentic Retrieval in Financial Question Answering
Using Artificial Intuition in Distinct, Minimalist Classification of Scientific Abstracts for Management of Technology Portfolios
Semantic Discrepancy-aware Detector for Image Forgery Identification
Quantum-Efficient Reinforcement Learning Solutions for Last-Mile On-Demand Delivery
BadPromptFL: A Novel Backdoor Threat to Prompt-based Federated Learning in Multimodal Models
Uncertainty-Driven Reliability: Selective Prediction and Trustworthy Deployment in Modern Machine Learning
Real-Time Analysis of Unstructured Data with Machine Learning on Heterogeneous Architectures
VSI: Visual Subtitle Integration for Keyframe Selection to enhance Long Video Understanding
SGDFuse: SAM-Guided Diffusion for High-Fidelity Infrared and Visible Image Fusion
An Efficient Continuous-Time MILP for Integrated Aircraft Hangar Scheduling and Layout
DIRF: A Framework for Digital Identity Protection and Clone Governance in Agentic AI Systems
COLLAGE: Adaptive Fusion-based Retrieval for Augmented Policy Learning
Dynamically Adaptive Reasoning via LLM-Guided MCTS for Efficient and Context-Aware KGQA
Nested Graph Pseudo-Label Refinement for Noisy Label Domain Adaptation Learning
LanternNet: A Hub-and-Spoke System to Seek and Suppress Spotted Lanternfly Populations
RecPS: Privacy Risk Scoring for Recommender Systems
Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)
Role-Playing LLM-Based Multi-Agent Support Framework for Detecting and Addressing Family Communication Bias
PLAME: Lightweight MSA Design Advances Protein Folding From Evolutionary Embeddings
Driver-Net: Multi-Camera Fusion for Assessing Driver Take-Over Readiness in Automated Vehicles
Leveraging Out-of-Distribution Unlabeled Images: Semi-Supervised Semantic Segmentation with an Open-Vocabulary Model
Visual Structures Helps Visual Reasoning: Addressing the Binding Problem in VLMs
Precise Bayesian Neural Networks
Transit for All: Mapping Equitable Bike2Subway Connection using Region Representation Learning
Scaling Intelligence: Designing Data Centers for Next-Gen Language Models
Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems
SAIL: Faster-than-Demonstration Execution of Imitation Learning Policies
Persona-driven Simulation of Voting Behavior in the European Parliament with Large Language Models
Bipedal Balance Control with Whole-body Musculoskeletal Standing and Falling Simulations
Scaling Laws of Motion Forecasting and Planning - Technical Report
Efficient $Q$-Learning and Actor-Critic Methods for Robust Average Reward Reinforcement Learning
Who Gets Credit or Blame? Attributing Accountability in Modern AI Systems
Unsupervised Evolutionary Cell Type Matching via Entropy-Minimized Optimal Transport
Multi-output Classification using a Cross-talk Architecture for Compound Fault Diagnosis of Motors in Partially Labeled Condition
SoloSpeech: Enhancing Intelligibility and Quality in Target Speech Extraction through a Cascaded Generative Pipeline
Steering LLM Reasoning Through Bias-Only Adaptation
MetaSTH-Sleep: Towards Effective Few-Shot Sleep Stage Classification for Health Management with Spatial-Temporal Hypergraph Enhanced Meta-Learning
InterFeat: A Pipeline for Finding Interesting Scientific Features
HumaniBench: A Human-Centric Framework for Large Multimodal Models Evaluation
Advancing Scientific Text Classification: Fine-Tuned Models with Dataset Expansion and Hard-Voting
Test It Before You Trust It: Applying Software Testing for Trustworthy In-context Learning
Action Flow Matching for Continual Robot Learning
Addressing Concept Mislabeling in Concept Bottleneck Models Through Preference Optimization
Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Byzantine-Robust Federated Learning Using Generative Adversarial Networks
Beyond SHAP and Anchors: A large-scale experiment on how developers struggle to design meaningful end-user explanations
VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making
DistJoin: A Decoupled Join Cardinality Estimator based on Adaptive Neural Predicate Modulation
Low-Confidence Gold: Refining Low-Confidence Samples for Efficient Instruction Tuning
Assistance or Disruption? Exploring and Evaluating the Design and Trade-offs of Proactive AI Programming Support
Soft Token Attacks Cannot Reliably Audit Unlearning in Large Language Models
CHIRLA: Comprehensive High-resolution Identification and Re-identification for Large-scale Analysis
Kolmogorov-Arnold Fourier Networks
Position: LLMs Can be Good Tutors in English Education
Predicting Steady-State Behavior in Complex Networks with Graph Neural Networks
Separate Motion from Appearance: Customizing Motion via Customizing Text-to-Video Diffusion Models
Motion-enhanced Cardiac Anatomy Segmentation via an Insertable Temporal Attention Module
Bias in Decision-Making for AI's Ethical Dilemmas: A Comparative Study of ChatGPT and Claude
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
DispFormer: A Pretrained Transformer Incorporating Physical Constraints for Dispersion Curve Inversion
Integrating Evidence into the Design of XAI and AI-based Decision Support Systems: A Means-End Framework for End-users in Construction
Revealing the impact of synthetic native samples and multi-tasking strategies in Hindi-English code-mixed humour and sarcasm detection
Neural Port-Hamiltonian Differential Algebraic Equations for Compositional Learning of Electrical Networks
Sequential Controlled Langevin Diffusions
Privacy-Preserving Federated Learning via Homomorphic Adversarial Networks
CAREL: Instruction-guided reinforcement learning with cross-modal auxiliary objectives
Lessons from Studying Two-Hop Latent Reasoning
HierTOD: A Task-Oriented Dialogue System Driven by Hierarchical Goals
Flexible Coded Distributed Convolution Computing for Enhanced Straggler Resilience and Numerical Stability in Distributed CNNs
FACEGroup: Feasible and Actionable Counterfactual Explanations for Group Fairness
ETF: An Entity Tracing Framework for Hallucination Detection in Code Summaries
Load more
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
Created by
Haebom
作者
Zekun Xi, Wenbiao Yin, Jizhan Fang, Jialong Wu, Runnan Fang, Jiang Yong, Pengjun Xie, Fei Huang, Huajun Chen, Ningyu Zhang
概要
大規模な言語モデルを使用した機械作文は、検索ベースの生成に依存することがよくありますが、モデルの事前定義された範囲内に制限され、豊富な情報を持つコンテンツを生成することは困難です。既存の検索ベース情報は深度、斬新性が不足して重複する問題があり、生成された記事の質が低下します。この論文では、人間の反復的な拡張と反省のプロセスを模倣した遅い思考ベースの機械作文フレームワークであるOmniThinkを提案します。 OmniThinkの重要なアイデアは、学習者がトピックに関する知識を徐々に深める認知行為をシミュレートすることです。実験の結果、OmniThinkは、一貫性や深さなどの指標を阻害することなく、生成された記事の知識密度を向上させることを示しています。人間の評価と専門家のフィードバックにより、長文記事生成の実際の問題解決に対するOmniThinkの可能性を強調します。コードは
https://github.com/zjunlp/OmniThink
で利用可能です。
GitHub - zjunlp/OmniThink: [EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking
[EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking - zjunlp/OmniThink
github.com
Takeaways、Limitations
•
Takeaways:
◦
既存の検索ベースの機械作文のLimitationsである深度不足、斬新性欠如、重複問題を解決する新しいフレームワークOmniThink提示。
◦
人間の認知過程を模倣して知識密度の高い長文記事生成可能性を提示。
◦
一貫性と深さ指標を維持しながら知識密度の向上を実験的に検証
◦
長文記事生成分野における実際の問題解決の可能性の確認
◦
オープンソースコード開示によるアクセシビリティの向上。
•
Limitations:
◦
OmniThinkのパフォーマンスが特定のデータセットまたは特定の種類の長文記事に偏る可能性。
◦
人間の思考プロセスを完全に模倣するには限界が存在する可能性があります。
◦
より多様で広範な実験と評価が必要です。
◦
大規模言語モデルの限界を完全に克服できない可能性
PDFを見る
Made with Slashpage