Forex
Google
New signals service!

Go Back   Forex Trading > Discussion Areas > Metatrader 4


Register in Forex TSD!
Trading Systems Leaders in this forum (automated trading systems) are winning more than 3000 pips in a month (30000$ investing one lot every time).
Click here to register and get more information

Reply
 
LinkBack Thread Tools Display Modes
  #1 (permalink)  
Old 12-10-2006, 03:49 AM
Senior Member
 
Join Date: Feb 2006
Location: New Zealand
Posts: 249
Craig is on a distinguished road
90%??

I have been using the Alpari M1 data to backtest with 90% quality. I decided that I would like to get better data, so I purchased some tick data and wrote a script to convert it to fxt based on the standard script on the MQL website. Now the generated fxt file from the Alpari data for 2 years on the EURUSD was 17,000 KB, the tick file generated from the script is 190,000 KB. This makes me wonder where the 90% figure comes from...the alpari file is about 8% of the size of the tick generated file. I had been finding that all my experts work great in the intervals they were optimized for, but bombed outside those intervals. Basically I was getting overfitting, even on 1 or 2 year intervals at 90% quality, which makes me wonder how effective '90%' quality data is. Anyway, I could have made a mistake, early days yet, forward testing will reveal all...
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #2 (permalink)  
Old 12-15-2006, 01:07 PM
Member
 
Join Date: Sep 2006
Posts: 68
ra300z is on a distinguished road
Quote:
Originally Posted by Craig
I have been using the Alpari M1 data to backtest with 90% quality. I decided that I would like to get better data, so I purchased some tick data and wrote a script to convert it to fxt based on the standard script on the MQL website. Now the generated fxt file from the Alpari data for 2 years on the EURUSD was 17,000 KB, the tick file generated from the script is 190,000 KB. This makes me wonder where the 90% figure comes from...the alpari file is about 8% of the size of the tick generated file. I had been finding that all my experts work great in the intervals they were optimized for, but bombed outside those intervals. Basically I was getting overfitting, even on 1 or 2 year intervals at 90% quality, which makes me wonder how effective '90%' quality data is. Anyway, I could have made a mistake, early days yet, forward testing will reveal all...

Why not just use the build 200 data? It's superior to the Alpari data. If you do a compare of both, you will get more ticks modelled from the 200 data.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #3 (permalink)  
Old 12-15-2006, 04:28 PM
Senior Member
 
Join Date: Feb 2006
Location: New Zealand
Posts: 249
Craig is on a distinguished road
You mean the automatic download thing in the history center? If you do you need to check your data more closely, last time I checked (about 2 weeks ago) it was missing a whole month for this year. This was in fact the straw that broke the camels back with me and drove me to seek an alternate source of data.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #4 (permalink)  
Old 12-15-2006, 06:58 PM
Member
 
Join Date: Sep 2006
Posts: 68
ra300z is on a distinguished road
Quote:
Originally Posted by Craig
You mean the automatic download thing in the history center? If you do you need to check your data more closely, last time I checked (about 2 weeks ago) it was missing a whole month for this year. This was in fact the straw that broke the camels back with me and drove me to seek an alternate source of data.
Was it October or November? The history centre data from build 200 is only January 3, 1999 thru September 29, 2006. As far as I know, all data for that range of dates is there.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #5 (permalink)  
Old 12-15-2006, 09:15 PM
Senior Member
 
Join Date: Feb 2006
Location: New Zealand
Posts: 249
Craig is on a distinguished road
Just tried downloading again, the gap is from 2006.12.14 to 2006.9.30, backtesting on this is hardly going to be accurate. Even if it did not have the hole, your lucky if you are getting more than a tick every couple of minutes, I would not use this data on anything under the H4 timeframe.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #6 (permalink)  
Old 12-15-2006, 10:16 PM
Member
 
Join Date: Sep 2006
Posts: 68
ra300z is on a distinguished road
Quote:
Originally Posted by Craig
Just tried downloading again, the gap is from 2006.12.14 to 2006.9.30, backtesting on this is hardly going to be accurate. Even if it did not have the hole, your lucky if you are getting more than a tick every couple of minutes, I would not use this data on anything under the H4 timeframe.
Of course, that's exactly where the gap is supposed to be! They don't supply that data after September 30, 2006. The build 200 data is January 3, 1999 thru September 29, 2006 inclusive. To fill that hole ou either have to go without or get it from the broker you connect to.

What I did was I cleared all my history. Then I opened up an M1 chart and pressed the left button until all the data to September was loaded into Metatrader. Then I went into history and downloaded the build 200 data. Then converted to other timeframes.

Note that I can only get 90% model quality from April 1999 (or about, haven't tried other dates) to September 29, 2006.

Also note I don't back test with dates after Sept 29.

BTW, my EAs perform (much) better with Alpari data than with build 200 data. Using June 16, 2004 thru September 29, 2006 dates. That's why I stick to build 200 data.

Last edited by ra300z; 12-15-2006 at 10:39 PM.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #7 (permalink)  
Old 12-15-2006, 11:54 PM
Senior Member
 
Join Date: Feb 2006
Location: New Zealand
Posts: 249
Craig is on a distinguished road
I'm not sure I understand why the the data is only in a certain range, the subject of this gap has come up on other forums and nobody had an answer, but anyway...Have you tried your EA's on out of sample data, I found I was getting terrible overfitting on the Alpari data due it's rather 'sparce' nature.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #8 (permalink)  
Old 12-15-2006, 11:57 PM
Diam0nd's Avatar
Senior Member
 
Join Date: Oct 2006
Posts: 235
Diam0nd is on a distinguished road
Quote:
Originally Posted by Craig
I'm not sure I understand why the the data is only in a certain range, the subject of this gap has come up on other forums and nobody had an answer, but anyway...Have you tried your EA's on out of sample data, I found I was getting terrible overfitting on the Alpari data due it's rather 'sparce' nature.
MetaQuotes is working on this gap issue.



Cheers,
Diam0nd




I LOVE
__________________
- MetaTrader v4.00 Build 217 (02 Jul 2008)
- MetaTrader MultiTerminal v4.00 Build 213 (20 Mar 2008)
- Always use the latest (^ABOVE^) MetaQuotes products, saves a lot of nerves. What's new.
"METAQUOTES FOR LIFE!"
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #9 (permalink)  
Old 12-16-2006, 03:02 AM
Member
 
Join Date: Sep 2006
Posts: 68
ra300z is on a distinguished road
Quote:
Originally Posted by Craig
I'm not sure I understand why the the data is only in a certain range, the subject of this gap has come up on other forums and nobody had an answer, but anyway...Have you tried your EA's on out of sample data, I found I was getting terrible overfitting on the Alpari data due it's rather 'sparce' nature.
Which one do you think has sparce data? I say it's Alpari. If you compare the number of ticks modelled in the two, then build 200 comes out ahead by quite a big margin
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
  #10 (permalink)  
Old 12-16-2006, 04:24 AM
Senior Member
 
Join Date: Feb 2006
Location: New Zealand
Posts: 249
Craig is on a distinguished road
I think your missing my point, I'm not trying to deter you from using the download data and backtesting, I'm sure this is a useful exercise. What I am saying is when you compare 1min data to tick data in terms of volume of data you release there is a hell of a lot of information missing & that in my opinion (not fact, yet) this is leading to overfitting of EA params leading to poor out of sample performance.

Even eyeing up the 200 data, you get one bar per minute, if you have watched a chart for any length of time you will know a lot can happen in one minute! even more unknowns are introduced when 'fractal tick modeling' is in play. In response to your very first question, why don't I use the 200 data? I think you should re-read my first post.
Digg this Post!Add Post to del.icio.usBookmark Post in TechnoratiFurl this Post!
Reply With Quote
Reply

Bookmarks

Thread Tools
Display Modes

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off
Trackbacks are On
Pingbacks are On
Refbacks are On
Forum Jump


All times are GMT. The time now is 02:49 AM.



Search Engine Friendly URLs by vBSEO 3.2.0 ©2008, Crawlability, Inc.