With the rapid development of large-scale language models (LLMs), AI-based writing is becoming widespread in educational and professional fields. This paper analyzes and benchmarks the characteristics and quality of essays generated by popular LLMs using large-scale data, and discusses the implications for automated grading and academic integrity.