This paper proposes a likelihood-free method for comparing two distributions, given samples drawn from both distributions, with the goal of assessing the quality of generative models. The proposed method, PQMass, provides a statistically rigorous method for evaluating the performance of a single generative model or comparing multiple competing models. PQMass divides the sample space into non-overlapping regions and applies a chi-square test to the number of data samples in each region. This produces a p-value, which measures the probability that the coefficients of the binomial distribution derived from two sets of samples are drawn from the same multinomial distribution. PQMass does not rely on assumptions about the density of the true distribution or on the training or fitting of auxiliary models. We evaluate PQMass on data of various modes and dimensions, demonstrating its effectiveness in assessing the quality, novelty, and diversity of generated samples. Furthermore, we demonstrate that PQMass scales well to moderately high-dimensional data, suggesting that feature extraction is unnecessary in practical applications.