I was a student in Georgia Tech’s CS7641 graduate machine learning course in this past Fall. The course is organized around four large projects, each focused on one of the core ML method classes. The semester begins with selecting a dataset suitable for 3 of these projects (Supervised, Unsupervised, and Randomized Optimization).
I was very keen to find some data in line with my interests. Unfortunately, none of the classic datasets (UCI repository, etc.