Audio Vision Transformer 1 minute read Vision Transformer on spectrograms for audio classification, with data augmentation.