"time series prediction"
Hi everyone
I am new user of RapidMiner and this is my first post, I have 11 months electric feeder load time series data so I want to forecast one day ahead feeder load with the help of this data.so can anyone guide me how can I do this with the help of RapidMiner ?
Data Format:-
date hour1 hour2 hour3 hour4 hour5 hour6 ......... hour24
10/01/2010 .2934 .1983 .1328 .2032 .1002 .1834 ......... .2903
10/02/2010 .2367 . 1298 .1289 .1901 .1192 .1920 ........ .1902
................. ................................................................................
280 days 24 hour
Thanks
Vikas Gupta
I am new user of RapidMiner and this is my first post, I have 11 months electric feeder load time series data so I want to forecast one day ahead feeder load with the help of this data.so can anyone guide me how can I do this with the help of RapidMiner ?
Data Format:-
date hour1 hour2 hour3 hour4 hour5 hour6 ......... hour24
10/01/2010 .2934 .1983 .1328 .2032 .1002 .1834 ......... .2903
10/02/2010 .2367 . 1298 .1289 .1901 .1192 .1920 ........ .1902
................. ................................................................................
280 days 24 hour
Thanks
Vikas Gupta
Tagged:
0
Answers
day_time, load
1, .2934
2, .1983
....
n .1902
Then use the windowing operator with the appropriate embedding dimension.
Then use k-nn or linear regression as a learner.
If you upload like 50 rows of data I'll make you an example process.
please post your questions only once in the most appropriate board and not in every board here. Thanks.
Cheers,
Ingo
Like:
loaddata #<-- this is a comment
0.5144 #<-- first data point
0.5144
0.5144
0.6001
0.6001
0.6859
0.6859
0.7716
0.7716
1.286
1.286
1.286
1.2003
1.2003
1.2003
1.286
1.286
1.5432
1.8004
1.6289
1.5432
1.3717
1.1145
0.8573
0.9431 #<-- day 2
...
1.286 #<-- last data point, etc
edit:
okay here is the data I will be using:
Please help me about windowing operator(horizon,window size) to forecast the feeder load one day ahead.
Thanks
Vikas
So your better of with classical statistics. No need for windowing, embedding, and machine learning here.
http://devio.us/~wessel/load/load.jpeg
http://devio.us/~wessel/load/load2.jpeg
here is the process:
correlation: 0.795 +/- 0.136 (mikro: 0.786)
A correlation of 0.8 is already really good.
Depends also on how much noise your sensor has.
I got the process but please give me some help abut it's Output
1:- Prediction trend accuracy and correlation both are same thing?
2:-Can you give me some explanation about its output of process?
Thanks
Vikas
Look at the scatter plot of predicted load vs actual load.
When a data point is predicted correctly it lies exactly on the diagonal.
You see that all data points that are not 0 are predicted with only a small error.
The error is bigger in data points that are 0, which is expected because they are anomalous values.
I could have used "mean absolute error" instead of "correlation".
But the nice thing about "correlation" is that its invariant to the dataset.
If I would multiply all data points by a factor 100, "mean absolute error" would go up by a factor 100.
Correlation stays the same, since its normalized between -1 and 1.
Can you help me about this linear regression generated by process
0.299 * load-23 - 0.041 * load-19 + 0.006 * load-15 - 0.007 * load-8 - 0.014 * load-5 + 0.217 * load-1 + 0.407 * load-0 + 0.182
for forecasting of one day ahead load.
http://en.wikipedia.org/wiki/Autoregressive_integrated_moving_average
Can you give me some idea about one hour ahead load prediction using same data set ?
Thanks
isn't that ideas enough?