View uci-20070111 credit-a (public)























- Summary
(No information yet)
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer,Floating Point,String
- Download
-
# Instances: 690 / # Attributes: 16
HDF5 (308.3 KB) XML CSV ARFF LibSVM Matlab OctaveFiles are converted on demand and the process can take up to a minute. Please wait until download begins.
You can edit this item to add more meta information and make use of the site's premium features.
- Original Data Format
- arff
- Name
- credit-rating
- Version mldata
- 0
- Comment
Title: Credit Approval
Sources: (confidential) Submitted by quinlan@cs.su.oz.au
Past Usage:
See Quinlan, "Simplifying decision trees", Int J Man-Machine Studies 27, Dec 1987, pp. 221-234. "C4.5: Programs for Machine Learning", Morgan Kaufmann, Oct 1992
Relevant Information:
This file concerns credit card applications. All attribute names and values have been changed to meaningless symbols to protect confidentiality of the data.
This dataset is interesting because there is a good mix of attributes -- continuous, nominal with small numbers of values, and nominal with larger numbers of values. There are also a few missing values.
Number of Instances: 690
Number of Attributes: 15 + class attribute
Attribute Information:
A1: b, a. A2: continuous. A3: continuous. A4: u, y, l, t. A5: g, p, gg. A6: c, d, cc, i, j, k, m, r, q, w, x, e, aa, ff. A7: v, h, bb, j, n, z, dd, ff, o. A8: continuous. A9: t, f. A10: t, f. A11: continuous. A12: t, f. A13: g, p, s. A14: continuous. A15: continuous. A16: +,- (class attribute)
Missing Attribute Values: 37 cases (5%) have one or more missing values. The missing values from particular attributes are:
A1: 12 A2: 12 A4: 6 A5: 6 A6: 9 A7: 9 A14: 13
Class Distribution
+: 307 (44.5%) -: 383 (55.5%)
- Names
- A1,A2,A3,A4,A5,A6,A7,A8,A9,A10,
- Types
- nominal:b,a
- numeric
- numeric
- nominal:u,y,l,t
- nominal:g,p,gg
- nominal:c,d,cc,i,j,k,m,r,q,w,x,e,aa,ff
- nominal:v,h,bb,j,n,z,dd,ff,o
- numeric
- nominal:t,f
- nominal:t,f
- Data (first 10 data points)
A1 A2 A3 A4 A5 A6 A7 A8 A9 A10 ... b 30.83 0.0 u g w v 1.25 t t ... a 58.67 4.46 u g q h 3.04 t t ... a 24.5 0.5 u g q h 1.5 t f ... b 27.83 1.54 u g w v 3.75 t t ... b 20.17 5.625 u g w v 1.71 t f ... b 32.08 4.0 u g m v 2.5 t f ... b 33.17 1.04 u g r h 6.5 t f ... a 22.92 11.585 u g cc v 0.04 t f ... b 54.42 0.5 y p k h 3.96 t f ... b 42.5 4.915 y p w v 3.165 t f ... ... ... ... ... ... ... ... ... ... ... ...
- Description
A gzip'ed tar containing UCI and UCI KDD datasets (uci-20070111.tar.gz, 17,952,832 Bytes)
- URLs
- (No information yet)
- Publications
- Data Source
- http://www.ics.uci.edu/~mlearn/MLRepository.html http://kdd.ics.uci.edu/
- Measurement Details
- Usage Scenario
- revision 1
- by mldata on 2010-11-06 09:58
No one has posted any comments yet. Perhaps you would like to be the first?
Leave a comment
This item was downloaded 2552 times and viewed 1734 times.
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.
