Every once in a while I receive a request, or see one posted on some bulletin board, for data mining data sets. I have to say, I have little patience for many of these requests because a simple Google (or Clusty) search will solve the problem. Nevertheless, here are four sites I've used in the past to grab data for testing algorithms or software packages:
UC Irvine Machine Learning Repository: http://archive.ics.uci.edu/ml/
Carnegie Mellon Statlib Archive: http://lib.stat.cmu.edu/datasets/
DELVE Datasets: http://www.cs.utoronto.ca/~delve/data/datasets.html
MIT Broad Institute Cancer Datasets: http://www.broad.mit.edu/cgi-bin/cancer/datasets.cgi
Wednesday, April 16, 2008
4 comments:
Hi, I am from Chinakdd (www.chinakdd.com). Would you mind if I reposted this blog post?
Thanks.
Hi, I'm looking for a credit card fraud dataset, but I can't find one. Please, could you help me?
Hi, I am a Master's student (Master in Computer and Information Systems, MCIS). I want to do my thesis on student performance prediction and analysis using data mining, so I need a large student dataset to complete it.
Please help me by forwarding a dataset related to student performance to my email address: mukeshjswl7@gmail.com