This paper builds on growing evidence that the brain utilizes attention schemas, simplified models of attention, to help control it. Using a neural network with a transformer attention mechanism, we investigated the impact of adding attention schemas to artificial agents on their judgment and collaboration abilities. Experimental results show that agents with attention schemas are better able to classify the attentional states of other agents (improving accuracy), develop patterns that allow other agents to more easily classify their own attentional patterns, and exhibit improved performance in collaborative tasks. These performance gains are not simply due to increased network complexity, but rather are a result of specialized tasks that involve judging, classifying, or predicting the attention of other agents. In conclusion, our results support the hypothesis that attention schemas possess computational properties beneficial for inter-interpretability and interactive behavior.