Linear Regression Data mining by WEKA

Download Report

Transcript Linear Regression Data mining by WEKA

Data mining by WEKA
Karunamuni P Silva
CIS 675
Data Description
Cost Data for U.S. Airlines, 90 Oservations On 6 Firms For
15 Years, 1970-1984
Source: These data are a subset of a larger data set
provided to the author by Professor Moshe Kim.
They were originally constructed by Christensen
Associates of Madison, Wisconsin.
I consider following data set
Cost vs
Fuel price and
Load factor, the average capacity utilization of the fleet.
Output
Histogram of Output
Histogram of Fuel Price
Histogram of Load Factor
Histogram of Cost
Linear Regression
66% of training set & 34% test set
Visualization
76% of training set & 24% test set
Visualization
86% of training set & 14% test set
Visualization
Alpha=0.05 Iterations=200
Alpha=0.05 Iterations=200
References
http://www.indiana.edu/~statmath/stat/all/pa
nel/
https://blog.itu.dk/SPVCE2010/files/2010/11/wekatutorial.pdf