To address the problem of facial distortion caused by wide-angle cameras, this paper proposes ImagePC, a structural-detail portrait correction model that integrates long-range recognition from Transformers and multi-stage denoising from diffusion models. Considering the difficulty of obtaining video labels, we propose VideoPC, a repurposed version of ImagePC for unlabeled wide-angle videos, utilizing spatiotemporal diffusion adaptation with spatial consistency and temporal smoothness constraints. VideoPC sequentially mitigates temporal blur in blind scenarios while maintaining high-quality spatial facial correction. We evaluate the performance and train the model on a video portrait dataset containing a diverse set of people, lighting conditions, and backgrounds, and demonstrate through experiments that it outperforms existing methods both qualitatively and quantitatively. The code and dataset will be made public in the future.