Takeaways: We identify "Murphy's Gap," a structural limitation of RLHF, and suggest the importance of a correction oracle to address it. We provide information-theoretic evidence proving the performance limitations of RLHF in poorly specified environments, suggesting future directions for RLHF research. We also provide a new explanation for the observed alignment failures.