Thought of recreating taste profile
super personalized data
first, used python to verify assumptions and algorithms
used k means to cluster a user's songs, and performed PCA to plot the result
The clusters weren't too defined, so used elbow method, and silhoutte coefficient to verify
Overfitting, used correlation matrix to remove redundant features.