Which model to use for multiclass audio classification?

Question

Pranav Natekar

May 12, 2022 18:05

I am working on a project in which I want to classify Tabla taalas (patterns), and I couldn't find any dataset for them. I am recording the samples myself and have roughly 500 recorded so far. Which model should I use to classify the patterns, given that I have fewer than 500 samples and 6 classes?

Topic multiclass-classification deep-learning machine-learning

Category Data Science

Jon Nordby · Accepted Answer · July 26, 2019 08:32

500 samples for 6 classes is not very many. You should set aside about 100 samples for validation and 100 for testing, leaving 300 samples for training. I'm assuming that these drum loops are on the order of 1 second long (0.5-5 seconds). In that case I would recommend trying a pretrained audio model; these typically have analysis windows of about 1 second. One example is OpenL3, a powerful and easy-to-use audio embedding. It even has pretrained versions trained on music. On top of the audio embeddings, try a simple classifier such as LogisticRegression (a linear model) or a RandomForest.
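
A minimal sketch of that recipe, under some assumptions that are not part of the answer itself: each clip is reduced to one vector by averaging OpenL3's frame embeddings, the 300/100/100 split is stratified by class, and the features are standardized before the classifier. The function names (`embed_clip`, `split_and_fit`, `make_fake_embeddings`), the 512-dimensional embedding size, and the use of `soundfile` for loading are illustrative choices. The random data exists only so the classifier part can be run without any recordings.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler


def embed_clip(path):
    """Return one fixed-size vector per clip by averaging OpenL3 frame embeddings."""
    # Imported lazily so the rest of the sketch runs without these packages.
    import openl3
    import soundfile as sf

    audio, sr = sf.read(path)
    # content_type="music" selects the OpenL3 model trained on music content.
    emb, _timestamps = openl3.get_audio_embedding(
        audio, sr, content_type="music", embedding_size=512
    )
    # Averaging over time is an assumption; other poolings are possible.
    return emb.mean(axis=0)


def split_and_fit(X, y, seed=0):
    """Stratified 60/20/20 split, then a linear classifier on the embeddings."""
    X_train, X_rest, y_train, y_rest = train_test_split(
        X, y, test_size=0.4, stratify=y, random_state=seed
    )
    X_val, X_test, y_val, y_test = train_test_split(
        X_rest, y_rest, test_size=0.5, stratify=y_rest, random_state=seed
    )
    clf = make_pipeline(StandardScaler(), LogisticRegression(max_iter=1000))
    clf.fit(X_train, y_train)
    return clf, (X_val, y_val), (X_test, y_test)


def make_fake_embeddings(n_samples=500, n_classes=6, dim=32, seed=0):
    """Placeholder data standing in for real embeddings (hypothetical, for a dry run)."""
    rng = np.random.default_rng(seed)
    y = np.arange(n_samples) % n_classes
    centers = rng.normal(size=(n_classes, dim)) * 3.0
    X = centers[y] + rng.normal(size=(n_samples, dim))
    return X, y
```

With real recordings, `X` would be built as `np.stack([embed_clip(p) for p in paths])`. The validation set is for comparing choices such as LogisticRegression against a RandomForest; the test set should only be scored once at the end.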
