Upload datasets

Hello everyone,

I’m a new subscriber to this competition and I’m facing issues regarding datasets—specifically, finding and uploading them. I’d like to use data directly in Google Colab without downloading it. If anyone has code for this purpose, please send it my way.

I would greatly appreciate any guidance or resources to help me get started. I’ve heard a lot about Google Colab and how it allows you to run Python code in the browser, but I’m unsure how to access external datasets directly.

If you have examples of loading datasets from sources like Kaggle, GitHub, or Google Drive, that would be very helpful. Code snippets or step-by-step instructions would be ideal.

Additionally, if you know of any specific libraries for data manipulation and analysis once I have the dataset, please share those as well. I’m eager to learn and join you all in this competition! Thanks in advance for your help!

Hello, if you want a step-by-step guide on how to get started, please have a look at the documentation:

However, as you are also asking about libraries used for analysis, I highly recommend that you start with the quickstarter:

Quickstarters are available for all competitions in the Submit section.

Thank you for your response. I have another problem, this time with downloading the data, as follows:

I can see (at the very top of the screenshot) that you are trying to enter one of the broad challenges.

The broad competitions do not support .load_data() as the data is too custom to have an x/y train/test combo.

You have to load the data yourself using spatialdata.read_zarr(os.path.join(data_directory_path, "DC5.zarr")).
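For reference, here is a minimal sketch of what that manual load might look like. It assumes the spatialdata package is installed and that the reader is called read_zarr, which is the name used in the spatialdata documentation. The helper names zarr_path and load_sample, and the "data" directory in the usage comment, are made up for illustration and are not part of the competition tooling.

```python
import os


def zarr_path(data_directory_path, name="DC5.zarr"):
    """Build the path to a Zarr store inside the competition data directory."""
    return os.path.join(data_directory_path, name)


def load_sample(data_directory_path, name="DC5.zarr"):
    """Open one Zarr store as a SpatialData object (hypothetical helper)."""
    path = zarr_path(data_directory_path, name)
    # The .zarr store is expected to be a directory; fail early if it is missing.
    if not os.path.isdir(path):
        raise FileNotFoundError(f"No Zarr store found at {path}")
    # Imported lazily so the path helper still works without spatialdata installed.
    import spatialdata
    return spatialdata.read_zarr(path)


# Example usage (assumes the data was downloaded into ./data):
# sdata = load_sample("data")
# print(sdata)  # shows the elements stored in the object
```

Checking for the directory first gives a clearer error than a failed read when the data has not been downloaded yet.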

I recommend copying the code from the random-submission notebook if you want a working starting point: https://colab.research.google.com/github/crunchdao/quickstarters/blob/master/competitions/broad-1/quickstarters/random-submission/random-submission.ipynb