Based on Spearman per cell, amaris is better, and if we take the average of Spearman per cell and Spearman per gene, amaris is still better. So how is the ranking being determined?
The rank is the average rank of both metrics: see competitions/broad-2/scoring/leaderboard.py at commit ca86b49dc0481cc795fe16ee7c4b5b5614986394 in the crunchdao/competitions repo on GitHub.
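For anyone wondering what "average rank of both metrics" amounts to in practice, here is a minimal sketch (not the official leaderboard.py) of how per-cell and per-gene Spearman could be ranked separately and then averaged. The toy data, matrix layout (cells x genes), cruncher names, and column names are assumptions for illustration only.

```python
import numpy as np
import pandas as pd
from scipy.stats import spearmanr

# Toy data: ground truth and per-cruncher predictions as (cells x genes) matrices.
rng = np.random.default_rng(0)
y_true = rng.random((100, 20))
predictions = {name: rng.random((100, 20)) for name in ["A", "B", "C"]}

def mean_spearman_per_cell(y_true, y_pred):
    # Correlate predicted vs. true gene values within each cell, then average over cells.
    return np.mean([spearmanr(y_true[i], y_pred[i])[0] for i in range(y_true.shape[0])])

def mean_spearman_per_gene(y_true, y_pred):
    # Correlate predicted vs. true expression across cells for each gene, then average over genes.
    return np.mean([spearmanr(y_true[:, j], y_pred[:, j])[0] for j in range(y_true.shape[1])])

scores = pd.DataFrame(
    {
        name: {
            "spearman_per_cell": mean_spearman_per_cell(y_true, y_pred),
            "spearman_per_gene": mean_spearman_per_gene(y_true, y_pred),
        }
        for name, y_pred in predictions.items()
    }
).T

# Rank each metric independently (rank 1 = best score), then average the two ranks.
# The lowest average rank wins, which is why being best on per-cell Spearman alone
# does not guarantee the top spot if the per-gene rank is only average.
ranks = scores.rank(ascending=False, method="min")
scores["final_rank_score"] = ranks.mean(axis=1)
print(scores.sort_values("final_rank_score"))
```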
@enzo Previously the ranking was stated to be per-cell Spearman, and now we also have to perform well on per-gene Spearman computed over a small set of 20 genes. If someone over-optimizes for these 20 genes and generalizes poorly across the full 2000 genes (in per-gene Spearman), how will that be accounted for, given that we don't have spatial transcriptomics measurements for the entire set? Is the current metric, a combination of per-cell and per-gene Spearman, fair?
We don't know which 20 genes they are, so it is pretty hard to overfit on them, considering there are 2000 genes in total (1%).
Got your point, but the ranking metric is still very weird: you can be the best on per-cell Spearman, yet if you are only average on per-gene Spearman you get ranked down, because the ranking is based on your position relative to the other crunchers.
The metric should be absolute in terms of quantitative performance on the criteria, not based on your rank with respect to others, a point that was raised earlier as well.
By the way, @tarandros, thanks a lot for your base code; it was a great starting point.
For almost every data science problem there is no single optimal metric, which makes choosing one nearly a nightmare. So any metric is only fair up to a point…
On my side, I'm trying to focus mostly on model building and to minimize the time spent optimizing for the metric, under the hypothesis that if a model is genuinely better, it should perform better on all reasonable metrics.