Public Archive Data
-
Ovarian Cancer (NCI PBSII Data) - submitted by kidzik 4712 views, 13393 downloads, 0 comments
last edited by kidzik - Sep 14, 2011, 14:35 CET Rating




- Summary:
Ovarian cancer due to family or personal history of cancer
- Data Shape: 15155 attributes, 253 instances (Integer,Floating Point,String)
- License: unknown (from UCI repository)
- Tags: cancer genetic history ovarian
- Tasks / Methods / Challenges: 1 tasks, 0 methods, 0 challenges
- Download: HDF5 (30.1 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
Ovarian cancer due to family or personal history of cancer
-
covtype.binary - submitted by mldata 227 views, 12987 downloads, 0 comments
last edited by mldata - Nov 1, 2010, 11:37 CET Rating




- Summary:
(No information yet)
- Data Shape: 55 attributes, 1162024 instances ()
- License: unknown (from LibSVMTools repository)
- Tags: libsvm LibSVMTools slurped
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (85.6 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
(No information yet)
-
Yahoo! Web Directory Topics - submitted by jeanbaptiste 2231 views, 12981 downloads, 0 comments
last edited by jeanbaptiste - Mar 13, 2012, 15:16 CET Rating




- Summary:
Contains parsed webpages along with their topics extracted from Yahoo! web directory
- Data Shape: 10630 attributes, 2212 instances ()
- License: unknown
- Tags: bag-of-words Classification multi-class text web-pages Yahoo!
- Tasks / Methods / Challenges: 1 tasks, 0 methods, 1 challenges
- Download: HDF5 (3.6 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
Contains parsed webpages along with their topics extracted from Yahoo! web directory
-
regression-datasets housing - submitted by mldata 17439 views, 12844 downloads, 0 comments
last edited by cong - Sep 14, 2011, 15:17 CET Rating




- Summary:
UCI boston housing data
- Data Shape: 14 attributes, 506 instances (Integer,Floating Point)
- License: unknown (from Weka repository)
- Tags: arff slurped Weka
- Tasks / Methods / Challenges: 1 tasks, 1 methods, 0 challenges
- Download: HDF5 (59.7 KB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
UCI boston housing data
-
cod-rna - submitted by mldata 390 views, 12802 downloads, 0 comments
last edited by mldata - Nov 1, 2010, 11:36 CET Rating




- Summary:
(No information yet)
- Data Shape: 9 attributes, 488565 instances ()
- License: unknown (from LibSVMTools repository)
- Tags: libsvm LibSVMTools slurped
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (33.6 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
(No information yet)
-
abalone - submitted by mldata 247 views, 12740 downloads, 0 comments
last edited by mldata - Nov 1, 2010, 11:49 CET Rating




- Summary:
(No information yet)
- Data Shape: 9 attributes, 4177 instances ()
- License: unknown (from LibSVMTools repository)
- Tags: libsvm LibSVMTools slurped
- Tasks / Methods / Challenges: 1 tasks, 0 methods, 0 challenges
- Download: HDF5 (304.2 KB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
(No information yet)
-
DLBCL Outcome from Harvard - submitted by kidzik 3731 views, 12315 downloads, 0 comments
last edited by kidzik - Sep 15, 2011, 12:49 CET Rating




- Summary:
There are two kinds of classifications about diffuse large b-cell lymphoma (DLBCL) addressed in the publication.
- Data Shape: 7130 attributes, 58 instances (Integer,String)
- License: unknown (from UCI repository)
- Tags: dlbcl Harvard lymphoma outcome
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (2.2 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
There are two kinds of classifications about diffuse large b-cell lymphoma (DLBCL) addressed in the publication.
-
Jester 1 - submitted by kidzik 9 views, 12125 downloads, 0 comments
last edited by kidzik - Sep 13, 2011, 13:24 CET Rating




- Summary:
Over 4.1 million continuous ratings (-10.00 to +10.00) of 100 jokes from 73,421 users: collected between April 1999 - May 2003
- Data Shape: 101 attributes, 73421 instances ()
- License: unknown (from UCI repository)
- Tags: collaborative-filtering jester
- Tasks / Methods / Challenges: 1 tasks, 0 methods, 0 challenges
- Download: HDF5 (56.3 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
Over 4.1 million continuous ratings (-10.00 to +10.00) of 100 jokes from 73,421 users: collected between April 1999 - May 2003
-
diabetes_scale - submitted by mldata 7416 views, 11793 downloads, 0 comments
last edited by cong - Sep 14, 2011, 16:22 CET Rating




- Summary:
PIMA indian diabetes data (scaled to [-1,1])
- Data Shape: 9 attributes, 768 instances ()
- License: unknown (from LibSVMTools repository)
- Tags: libsvm LibSVMTools slurped
- Tasks / Methods / Challenges: 2 tasks, 3 methods, 1 challenges
- Download: HDF5 (64.5 KB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
PIMA indian diabetes data (scaled to [-1,1])
-
Artificial 2state Sequence Data - submitted by nico 2062 views, 11684 downloads, 0 comments
last edited by sonne - Nov 24, 2010, 17:09 CET Rating




- Summary:
A set of input-sequences and a corresponding label-sequence artificially generated.
- Data Shape: 14 attributes, 250000 instances ()
- License: ODbL
- Tags: Label-Sequence-Learning Structured-Output-Prediction
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (26.7 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
A set of input-sequences and a corresponding label-sequence artificially generated.
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.
