Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

MoSEs: Uncertainty-Aware AI-Generated Text Detection via Mixture of Stylistics Experts with Conditional Thresholds

Created by
  • Haebom

Author

Junxi Wu, Jinpeng Wang, Zheng Liu, Bin Chen, Dongjian Hu, Hao Wu, Shu-Tao Xia

Outline

This paper highlights the importance of building a reliable AI-generated text detection system, given growing concerns about the potential misuse of large-scale language models. We highlight the limitations of existing methods, which overlook style modeling and rely on static thresholds, significantly limiting detection performance. We propose the Mixture of Stylistic Experts (MoSEs) framework, which enables conditional threshold estimation for quantifying style-aware uncertainty. MoSEs consists of three core components: a Style Reference Repository (SRR), a Style-Aware Router (SAR), and a Conditional Threshold Estimator (CTE). For input text, the SRR activates appropriate reference data and provides it to the CTE, which then dynamically determines the optimal threshold by jointly modeling linguistic statistical and semantic features. MoSEs generates predicted labels with discriminant scores and corresponding confidence levels, achieving an average 11.34% improvement in detection performance compared to existing methods, and a 39.15% improvement in low-resource environments. The source code is available on GitHub.

Takeaways, Limitations

Takeaways:
A novel framework (MoSEs) is presented to improve AI-generated text detection performance through style-aware uncertainty quantification.
Achieved an average performance improvement of 11.34% over existing methods and 39.15% in low-resource environments.
More accurate detection through dynamic threshold settings
Ensuring reproducibility and scalability through source code disclosure
Limitations:
Potential performance degradation depending on the size and configuration of the SRR
Need to verify generalization performance across different styles and languages
Adaptability assessment needed for new styles of AI-generated text
Potential increase in computational cost due to the complexity of the CTE
👍