View regression-datasets housing (public)























- Summary
UCI boston housing data
- License
- unknown (from Weka repository)
- Dependencies
- Tags
- arff slurped Weka
- Attribute Types
- Integer,Floating Point
- Download
-
# Instances: 506 / # Attributes: 14
HDF5 (59.7 KB) XML CSV ARFF LibSVM Matlab OctaveFiles are converted on demand and the process can take up to a minute. Please wait until download begins.
- Original Data Format
- arff
- Name
- 'housing'
- Version mldata
- 0
- Comment
Title: Boston Housing Data
Sources: (a) Origin: This dataset was taken from the StatLib library which is maintained at Carnegie Mellon University. (b) Creator: Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978. (c) Date: July 7, 1993
Past Usage:
Used in Belsley, Kuh & Welsch, 'Regression diagnostics ...', Wiley,
- N.B. Various transformations are used in the table on pages 244-261.
- Quinlan,R. (1993). Combining Instance-Based and Model-Based Learning. In Proceedings on the Tenth International Conference of Machine Learning, 236-243, University of Massachusetts, Amherst. Morgan Kaufmann.
Relevant Information:
Concerns housing values in suburbs of Boston.
Number of Instances: 506
Number of Attributes: 13 continuous attributes (including "class" attribute "MEDV"), 1 binary-valued attribute.
Attribute Information:
- CRIM per capita crime rate by town
- ZN proportion of residential land zoned for lots over 25,000 sq.ft.
- INDUS proportion of non-retail business acres per town
- CHAS Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)
- NOX nitric oxides concentration (parts per 10 million)
- RM average number of rooms per dwelling
- AGE proportion of owner-occupied units built prior to 1940
- DIS weighted distances to five Boston employment centres
- RAD index of accessibility to radial highways
- TAX full-value property-tax rate per $10,000
- PTRATIO pupil-teacher ratio by town
- B 1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town
- LSTAT % lower status of the population
- MEDV Median value of owner-occupied homes in $1000's
Missing Attribute Values: None.
- Names
- CRIM,ZN,INDUS,CHAS,NOX,RM,AGE,DIS,RAD,TAX,
- Types
- numeric
- numeric
- numeric
- nominal:0,1
- numeric
- numeric
- numeric
- numeric
- numeric
- numeric
- Data (first 10 data points)
CRIM ZN INDUS CHAS NOX RM AGE DIS RAD TAX ... 0.00... 18 2.31 0 0.538 6.575 65.2 4.09 1 296 ... 0.02... 0 7.07 0 0.469 6.421 78.9 4.9671 2 242 ... 0.02... 0 7.07 0 0.469 7.185 61.1 4.9671 2 242 ... 0.03... 0 2.18 0 0.458 6.998 45.8 6.0622 3 222 ... 0.06... 0 2.18 0 0.458 7.147 54.2 6.0622 3 222 ... 0.02... 0 2.18 0 0.458 6.43 58.7 6.0622 3 222 ... 0.08... 12 7.87 0 0.524 6.012 66.6 5.5605 5 311 ... 0.14... 12 7.87 0 0.524 6.172 96.1 5.9505 5 311 ... 0.21... 12 7.87 0 0.524 5.631 100.0 6.0821 5 311 ... 0.17... 12 7.87 0 0.524 6.004 85.9 6.5921 5 311 ... ... ... ... ... ... ... ... ... ... ... ...
- Description
A jarfile containing 30 regression datasets collected by Luis Torgo (regression-datasets.jar, 10,090,266 Bytes).
- URLs
- http://archive.ics.uci.edu/ml/datasets/Housing
- Publications
- Data Source
- Originally from the UCI machine learning repository.
- Measurement Details
From the UCI repository:
Title: Boston Housing Data
Sources: (a) Origin: This dataset was taken from the StatLib library which is maintained at Carnegie Mellon University. (b) Creator: Harrison, D. and Rubinfeld, D.L. 'Hedonic prices and the demand for clean air', J. Environ. Economics & Management, vol.5, 81-102, 1978. (c) Date: July 7, 1993
Past Usage:
Used in Belsley, Kuh & Welsch, 'Regression diagnostics ...', Wiley,
- N.B. Various transformations are used in the table on pages 244-261.
- Quinlan,R. (1993). Combining Instance-Based and Model-Based Learning. In Proceedings on the Tenth International Conference of Machine Learning, 236-243, University of Massachusetts, Amherst. Morgan Kaufmann.
Relevant Information:
Concerns housing values in suburbs of Boston.
Number of Instances: 506
Number of Attributes: 13 continuous attributes (including "class" attribute "MEDV"), 1 binary-valued attribute.
Attribute Information:
- CRIM per capita crime rate by town
- ZN proportion of residential land zoned for lots over 25,000 sq.ft.
- INDUS proportion of non-retail business acres per town
- CHAS Charles River dummy variable (= 1 if tract bounds river; 0 otherwise)
- NOX nitric oxides concentration (parts per 10 million)
- RM average number of rooms per dwelling
- AGE proportion of owner-occupied units built prior to 1940
- DIS weighted distances to five Boston employment centres
- RAD index of accessibility to radial highways
- TAX full-value property-tax rate per $10,000
- PTRATIO pupil-teacher ratio by town
- B 1000(Bk - 0.63)^2 where Bk is the proportion of blacks by town
- LSTAT % lower status of the population
- MEDV Median value of owner-occupied homes in $1000's
Missing Attribute Values: None.
- Usage Scenario
Predict the median value of home from all other variables.
- revision 1
- by mldata on 2010-11-06 09:58
- revision 2
- by cong on 2011-09-14 15:17
No one has posted any comments yet. Perhaps you would like to be the first?
Leave a comment
This item was downloaded 12839 times and viewed 17439 times.
Disclaimer
We are acting in good faith to make datasets submitted for the use of the scientific community available to everybody, but if you are a copyright holder and would like us to remove a dataset please inform us and we will do it as soon as possible.
Acknowledgements
This project is supported by PASCAL (Pattern Analysis, Statistical Modelling and Computational Learning)
http://www.pascal-network.org/.
