MIRROR is a modular architecture that preserves the user's safety-relevant context across personalized multi-turn conversations, suppresses sycophantic tendencies, and avoids harmful recommendations by prioritizing user safety. Inspired by dual-process theory, it pairs immediate response generation (the Talker) with asynchronous deliberative processing (the Thinker). On the CuRaTe safety benchmark, MIRROR achieved a 21% relative improvement across the models evaluated, with the open-source model outperforming the commercial model.
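
To make the Talker/Thinker split described above concrete, the following is a minimal sketch of the general pattern, not MIRROR's actual implementation. Every name in it (`SharedState`, `thinker`, `talker`, `converse`, `SAFETY_KEYWORDS`) is hypothetical, a keyword match stands in for a real deliberative model call, and the sketch assumes that each background reflection finishes before the user's next turn arrives.

```python
import threading
from dataclasses import dataclass, field


@dataclass
class SharedState:
    """Hypothetical store for the safety-relevant context the Thinker maintains."""
    notes: list = field(default_factory=list)
    lock: threading.Lock = field(default_factory=threading.Lock)

    def snapshot(self):
        # Return a copy so the Talker never reads a list that is being mutated.
        with self.lock:
            return list(self.notes)

    def add(self, note):
        with self.lock:
            if note not in self.notes:
                self.notes.append(note)


# Assumption: a keyword list stands in for a real deliberative model call.
SAFETY_KEYWORDS = ("allergy", "allergic", "phobia", "injury")


def thinker(state, user_turn):
    """Deliberative pass: record user statements that look safety-relevant."""
    if any(keyword in user_turn.lower() for keyword in SAFETY_KEYWORDS):
        state.add(user_turn)


def talker(state, user_turn):
    """Immediate pass: reply using whatever context the Thinker has stored so far."""
    context = state.snapshot()
    if context:
        remembered = "; ".join(context)
        return "(keeping in mind: " + remembered + ") reply to: " + user_turn
    return "reply to: " + user_turn


def converse(turns):
    """Run a conversation: reply immediately, then reflect in the background."""
    state = SharedState()
    replies = []
    pending = None
    for turn in turns:
        if pending is not None:
            # Simplifying assumption: the previous reflection has finished
            # by the time the next user turn arrives.
            pending.join()
        replies.append(talker(state, turn))
        pending = threading.Thread(target=thinker, args=(state, turn))
        pending.start()
    if pending is not None:
        pending.join()
    return replies
```

Calling `converse(["I have a severe peanut allergy.", "Suggest a snack."])` produces a plain first reply, while the second reply carries the stored allergy statement, because the background pass recorded it between turns. Separating the two passes keeps the response path fast while the slower reflection updates shared state off the critical path.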