This paper criticizes existing code generation benchmarks for overlooking contract compliance (preconditions and validity constraints) and for failing to consider important aspects of real-world software. To address this, we present PACT, a program evaluation and contract compliance evaluation framework. PACT is the first framework to systematically assess and improve contract compliance alongside functional correctness. By providing a corpus of test suites focused on contract violations, analyzing code generation under various prompting conditions, and introducing novel metrics to quantify contract compliance in test and code generation, PACT exposes errors overlooked by existing benchmarks and evaluates the robustness of LLM-generated code.