This paper investigates the applicability of state-of-the-art open-source visual question answering (VQA) models, such as LLaMA2, LLaMA3, QWEN3, and NVILA, to classroom behavior analysis using the BAV-Classroom-VQA dataset, which is derived from real-world classroom video recordings from the Vietnam Banking Academy. This study presents data collection and annotation methodology and benchmarks the performance of selected VQA models, demonstrating promising performance on behavioral visual questions, thereby demonstrating their potential as future classroom analysis and intervention systems.