Daily Arxiv

This is a page that curates AI-related papers published worldwide.
All content here is summarized using Google Gemini and operated on a non-profit basis.
Copyright for each paper belongs to the authors and their institutions; please make sure to credit the source when sharing.

Adapting Vision-Language Models for Neutrino Event Classification in High-Energy Physics

Created by
  • Haebom

Author

Dikshant Sagar, Kaiwen Yu, Alejandro Yankelevich, Jianming Bian, Pierre Baldi

Outline

Building on advances in the Large Language Model (LLM), this paper explores the application of the Visual Language Model (VLM), specifically a fine-tuned LLaMa 3.2 variant, to identify neutrino interactions in pixelated detector data from high-energy physics (HEP) experiments. We compare this model with state-of-the-art convolutional neural network (CNN) architectures similar to those used in the NOvA and DUNE experiments, which achieved high efficiency and purity in classifying electron and muon neutrino events. Considering both classification performance and interpretability of model predictions, we find that the VLM outperforms the CNN, offers greater flexibility in incorporating auxiliary text or semantic information, and provides more interpretable inference-based predictions. This study highlights the potential of the VLM as a universal backbone for physical event classification due to its high performance, interpretability, and generalizability, opening new avenues for integrating multimodal inference in experimental neutrino physics.

Takeaways, Limitations

Takeaways:
VLM outperforms CNN for identifying neutrino interactions in high-energy physics experiments.
VLM provides greater flexibility in integrating auxiliary text or semantic information.
VLM provides more interpretable and inference-based predictions.
VLM presents potential as a universal backbone for physical event classification.
Presenting new possibilities for integrating multimodal inference into experimental neutrino physics.
Limitations:
Further studies are needed to determine whether the performance improvements of the VLM presented in this paper can be generalized to other types of neutrino interactions or experimental data.
Lack of quantitative assessment of the interpretability of VLM.
Dependence on specific VLM architecture and tuning strategy.
VLM performance evaluation in actual experimental environments is needed.
👍