Out of sample data in structural breaks competition

civil-francis · August 22, 2025, 11:40am

Is there an ‘out of sample’ phase for the structural breaks competition, or is the leaderboard data always out of sample? Does the data used for the leaderboard ever change?

enzo · August 22, 2025, 12:03pm

Yes, there will be an out-of-sample phase, run on another 10’000 datasets that your code has never seen.

civil-francis · August 22, 2025, 12:38pm

Thanks, I’m still a bit confused.

My understanding is that right now, when you submit to the leaderboard, your model is scored on data you’ve never seen. Does this public leaderboard data change? Or is it static throughout the competition?

enzo · August 22, 2025, 12:48pm

You are indeed running on 10’000 unseen datasets for the public leaderboard, but since that score could be overfitted through repeated attempts, there will also be an out-of-sample leaderboard when the competition ends.

We will just re-run your model with this new data.
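To illustrate why repeated attempts can inflate a public score, here is a minimal sketch. It is a toy simulation, not the competition's actual scoring: the binary labels, the accuracy metric, and the names `accuracy` and `simulate` are all assumptions made for illustration. Each "attempt" is a model with no skill at all, and we keep whichever attempt looks best on the public set.

```python
import random


def accuracy(preds, labels):
    """Fraction of predictions that match the labels."""
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


def simulate(n_datasets=1000, n_attempts=200, seed=0):
    """Return (public_score, fresh_score) of the attempt that looks best publicly."""
    rng = random.Random(seed)
    # Hypothetical binary labels (break / no break) for two disjoint sets.
    public_labels = [rng.randint(0, 1) for _ in range(n_datasets)]
    fresh_labels = [rng.randint(0, 1) for _ in range(n_datasets)]

    best_public, best_fresh = -1.0, 0.0
    for _ in range(n_attempts):
        # A "model" with no real skill: coin-flip predictions on both sets.
        public_preds = [rng.randint(0, 1) for _ in range(n_datasets)]
        fresh_preds = [rng.randint(0, 1) for _ in range(n_datasets)]
        public_score = accuracy(public_preds, public_labels)
        fresh_score = accuracy(fresh_preds, fresh_labels)
        # Keep whichever attempt scores highest on the public set only.
        if public_score > best_public:
            best_public, best_fresh = public_score, fresh_score
    return best_public, best_fresh


if __name__ == "__main__":
    public_score, fresh_score = simulate()
    print(f"best public score: {public_score:.3f}")
    print(f"same attempt on fresh data: {fresh_score:.3f}")
```

Because the best attempt is selected on the public set, its public score tends to sit above chance even though the model has no skill, while its score on fresh data has no such selection bias. That gap is what a re-run on new data is meant to expose.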

rahul-m · August 24, 2025, 9:05pm

Is the local test data of 100 samples part of the 10,000 samples used in the cloud run? Also, should we expect the further 10,000 samples used for private leaderboard evaluation to differ much in distribution?

enzo · August 24, 2025, 9:20pm

Yes, that is right.

The first 100 datasets given for local testing are indeed part of the 10’000 datasets used in the cloud.
And there will be another 10’000 datasets for the out of sample.
