Crunch 3 eval random forest input: raw or log counts? normalized or unnormalized counts? what sequencing depth?

quartz · April 18, 2025, 11:59am

My understanding is that Crunch 3 will be scored primarily based on the performance of random forest classification trained on 50 genes per submission. Will the random forest be trained on raw counts, or on log-transformed counts? Will the counts be normalized per cell, and/or standardized per gene? At approximately what depth will the libraries be sequenced (approximately how many reads per cell)?

Thanks in advance!

Topic		Replies	Views
Crunch 3 submission format (ended) Broad Institute Crunch #3	3	104	April 30, 2025
Crunch 3 evaluation (ended) Broad Institute Crunch #3	11	220	April 28, 2025
Doubts about Crunch 2 (ended) Broad Institute Crunch #2	3	108	November 24, 2024
Crunch 1 deliverables - CSV or Notebook with training function (ended) Broad Institute Crunch #1	3	160	December 2, 2024
Meaning of ranking (ended) Broad Institute Crunch #2	9	185	December 16, 2024

Crunch 3 eval random forest input: raw or log counts? normalized or unnormalized counts? what sequencing depth?

Related topics