Public Archive Data
-
datasets-UCI glass - submitted by mldata 6242 views, 3891 downloads, 0 comments
last edited by mldata - Nov 6, 2010, 09:57 CET Rating




- Summary:
(No information yet)
- Data Shape: 10 attributes, 214 instances (Integer,Floating Point,String)
- License: unknown (from Weka repository)
- Tags: arff slurped Weka
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (37.8 KB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
(No information yet)
-
MNIST (original) - submitted by sonne 6220 views, 386436 downloads, 0 comments
last edited by demo - Sep 14, 2011, 15:17 CET Rating



- Summary:
The MNIST database of handwritten digits
- Data Shape: 785 attributes, 70000 instances ()
- License: CC0
- Tags: handwritten_digits
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (52.9 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
The MNIST database of handwritten digits
-
DMOZ Web Directory Topics - submitted by jeanbaptiste 6190 views, 11205 downloads, 0 comments
last edited by jeanbaptiste - Mar 29, 2012, 16:47 CET Rating




- Summary:
Contains parsed webpages along with their topics extracted from DMOZ web directory
- Data Shape: 10630 attributes, 2658 instances ()
- License: unknown
- Tags: bag-of-words Classification DMOZ libsvm multi-class text web-pages
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (4.1 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
Contains parsed webpages along with their topics extracted from DMOZ web directory
-
agridatasets grub-damage - submitted by mldata 5976 views, 7500 downloads, 0 comments
last edited by mldata - Nov 6, 2010, 09:57 CET Rating




- Summary:
(No information yet)
- Data Shape: 9 attributes, 155 instances (Integer,String)
- License: unknown (from Weka repository)
- Tags: arff slurped Weka
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (37.0 KB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
(No information yet)
-
datasets-numeric bodyfat - submitted by mldata 5873 views, 5340 downloads, 0 comments
last edited by mldata - Nov 6, 2010, 09:57 CET Rating




- Summary:
(No information yet)
- Data Shape: 15 attributes, 252 instances (Integer,Floating Point)
- License: unknown (from Weka repository)
- Tags: arff slurped Weka
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (43.7 KB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
(No information yet)
-
datasets-UCI sonar - submitted by mldata 5866 views, 4548 downloads, 0 comments
last edited by mldata - Nov 6, 2010, 09:57 CET Rating




- Summary:
(No information yet)
- Data Shape: 61 attributes, 208 instances (Integer,Floating Point,String)
- License: unknown (from Weka repository)
- Tags: arff slurped Weka
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (126.4 KB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
(No information yet)
-
Friedman-datasets fri_c2_500_25 - submitted by mldata 5817 views, 7723 downloads, 0 comments
last edited by jaakkopeltonen - Nov 22, 2010, 18:12 CET Rating




- Summary:
Artificial data generated from the Friedman function, part of a collection of 80 data sets. This particular set has 25 features, 1 output (26th feature), colinearity degree 2, and 500 instances.
- Data Shape: 26 attributes, 500 instances (Floating Point)
- License: unknown (from Weka repository)
- Tags: arff colinearity Friedman-function slurped Weka
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (112.1 KB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
Artificial data generated from the Friedman function, part of a collection of 80 data sets. This particular set has 25 features, 1 output (26th feature), colinearity degree 2, and 500 instances.
-
Translation Initiation Site Pred - submitted by kidzik 5792 views, 17586 downloads, 0 comments
last edited by kidzik - Sep 15, 2011, 18:46 CET Rating




- Summary:
Used to find the Translation Initiation Site (TIS), at which the translation from mRNA to proteins initiates
- Data Shape: 928 attributes, 3312 instances (Integer,String)
- License: unknown (from UCI repository)
- Tags: biomedical Initiation Prediction Regression Site Translation
- Tasks / Methods / Challenges: 0 tasks, 0 methods, 0 challenges
- Download: HDF5 (49.3 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
Used to find the Translation Initiation Site (TIS), at which the translation from mRNA to proteins initiates
-
Banana IDA - submitted by sonne 5687 views, 13768 downloads, 0 comments
last edited by sonne - Sep 14, 2011, 15:17 CET Rating




- Summary:
Banana data set from the IDA Benchmark repository
- Data Shape: 3 attributes, 5300 instances ()
- License: PDDL
- Tags: IDA_Benchmark_Repository
- Tasks / Methods / Challenges: 2 tasks, 2 methods, 0 challenges
- Download: HDF5 (134.8 KB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
Banana data set from the IDA Benchmark repository
-
Lung cancer (Ontario) - submitted by kidzik 5610 views, 9366 downloads, 0 comments
last edited by kidzik - Sep 14, 2011, 14:35 CET Rating




- Summary:
Gene expression data on tumor specimens from a total of 39 NSCLC samples.
- Data Shape: 2881 attributes, 39 instances (Integer,Floating Point,String)
- License: unknown (from UCI repository)
- Tags: cancer lung ontario
- Tasks / Methods / Challenges: 1 tasks, 0 methods, 0 challenges
- Download: HDF5 (1.1 MB) XML CSV ARFF LibSVM Matlab Octave
- Files are converted on demand and the process can take up to a minute. Please wait until download begins.
- Summary:
Gene expression data on tumor specimens from a total of 39 NSCLC samples.
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.
