This paper presents the results of a systematic investigation into digital twin-based AI simulations to address the limited data quantity and quality that are key challenges in the adoption of modern, low-level symbolic AI. By analyzing 22 key studies, we identify technological trends and derive a reference framework for deploying digital twins and AI components. We map this to the ISO 23247 digital twin reference architecture to provide architectural guidance and suggest future research challenges and opportunities. We focus on AI simulations that utilize simulated synthetic data in virtual training environments to safely and efficiently develop AI agents, and on the new possibilities offered by digital twins, high-fidelity virtual replicas of physical systems.