This paper compares and analyzes the performance of various spectral and rhythmic features (mel-scaled spectrograms, MFCCs, cyclic tempograms, STFT chromagrams, CQT chromagrams, and CENS chromagrams) in audio data classification using deep convolutional neural networks (CNNs). Using the ESC-50 dataset (2,000 environmental audio recordings), we measured the accuracy, precision, recall, and F1 scores of each feature for audio category and class-level classification. Experiments were conducted using an end-to-end deep learning pipeline.