Unable to understand which features to choose
I am a newbie here, but I am trying to work with a dataset which gives the attempt at the goal by a footballer,which will predict one of 2 possible outputs - whether or not they could score the goal or not.
I have done some basic cleaning but I am still getting only 60% accuracy on whatever classifier I use from sklearn.
I have removed a few features which I thought would not contribute to the 'y' value, and encoded a few values, but still it doesn't bump up the accuracy.
I am not sure how to attach a csv, but this is what the data looks like -- data
For the rows that 'is_goal' doesn't have a value, the aim is to find out if a goal was scored or not.
Should I attach the notebook as well to show what work I've done till now?
Topic dataset feature-selection
Category Data Science