3 MySQL databases: 100 GB for order / market data (anything that comes from the API), a 200 GB backtesting database that contains processed streaming data, and another which is more of a ‘data lake’ containing an aggregation of everything (including logs), which I think is around 200 GB.
I then use S3 for all logs (last 120 days) and raw streaming data:
- UK/IE racing since streaming began
- AUS/US racing past 12 months
- Greyhound racing past 12 months
- Tennis, major tournaments
- Other random sports markets
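For anyone wanting a similar "last 120 days" retention on logs, here is a minimal sketch of an S3 lifecycle rule built as a plain Python dict. The `logs/` prefix, the rule ID and the bucket name are assumptions for illustration, not the actual setup described above; the dict follows the shape that boto3's `put_bucket_lifecycle_configuration` expects.

```python
import json


def log_expiry_rule(prefix="logs/", days=120):
    """Build one S3 lifecycle rule that expires objects under `prefix` after `days` days.

    The prefix and the ID naming scheme are hypothetical; adjust to your own key layout.
    """
    return {
        "ID": f"expire-{prefix.strip('/')}-after-{days}d",
        "Filter": {"Prefix": prefix},
        "Status": "Enabled",
        "Expiration": {"Days": days},
    }


# Only logs get an expiry rule; raw streaming data is kept indefinitely.
lifecycle = {"Rules": [log_expiry_rule()]}
print(json.dumps(lifecycle, indent=2))

# With boto3 installed and credentials configured, this could be applied as:
# import boto3
# boto3.client("s3").put_bucket_lifecycle_configuration(
#     Bucket="my-betfair-data",  # hypothetical bucket name
#     LifecycleConfiguration=lifecycle,
# )
```

Letting S3 expire the objects itself means no cron job is needed to prune old logs, and the raw streaming data stays untouched because the rule only matches the log prefix.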
Do you Collect Data on Betfair Markets?
- ruthlessimon
- Posts: 2094
- Joined: Wed Mar 23, 2016 3:54 pm
firlandsfarm wrote: ↑Tue Jun 25, 2019 4:56 am
The skill is not so much the extracting and manipulating of the data, I see it that the skill is spotting the difference between a statistical freak and a trend and not over fitting.
I like that
Does it look "beautiful"? Is it "elegant"? Does it have smooth, defined, consistent curves - LOL
I'm pretty certain someone like Peter will get joy at the "aesthetics" of his strategies
Actually, here's a checklist:
- A proof that uses a minimum of additional assumptions or previous results.
- A proof that is unusually succinct.
- A proof that derives a result in a surprising way.
- A proof that is based on new and original insights.
- A method of proof that can be easily generalized to solve a family of similar problems.