In this paper, we present a deep learning-based climate variable downscaling method that simultaneously handles multiple variables to address the challenges of regional studies due to the low spatial resolution of global climate models (GCMs). To overcome the limitations of conventional single-variable models, we propose a multi-task, multivariable Vision Transformer (ViT) architecture (1EMD) with a shared encoder and variable-specific decoder. The architecture mimics RCM-level downscaling for the European region by predicting three key climate variables: surface temperature (tas), wind speed (sfcWind), and 500 hPa geostationary altitude (zg500) from GCM-resolution inputs. Experimental results show that the proposed multivariable approach outperforms single-variable baseline models and also improves computational efficiency.