The Childhood Cancer Data Lab

Accelerating the Pace of Childhood Cancer Research with Big Data

The Childhood Cancer Data Lab was established by Alex’s Lemonade Stand Foundation (ALSF) in 2017. ALSF recognized that pediatric cancer researchers face hurdles that impede the pace of research.

ALSF introduced the Data Lab to empower researchers and scientists across the globe by removing roadblocks, supporting opportunities for collaboration and sharing, and developing resources to accelerate new treatment and cure discovery.

The Data Lab's mission is to empower pediatric cancer experts poised for the next big discovery with the knowledge, data, and tools to reach it. We construct tools that make vast amounts of data widely available, easily mineable, and broadly reusable. We train researchers and scientists to better understand their own data and to advance their work more quickly.

To date, the Data Lab has trained over 400 childhood cancer researchers and has harmonized over 1.3 million data samples and made them easily available. Learn more about the Data Lab’s impact here.

Learn more about us

Projects

The Data Lab develops tools designed to make data and analysis widely available and broadly reusable.

OpenScPCA is an open, collaborative project to analyze data from the ScPCA Portal, which currently holds 500 samples from over 50 pediatric cancer types.

Learn More

The Single-cell Pediatric Cancer Atlas (ScPCA) is accelerating the discovery of better treatments for pediatric solid tumors and leukemias.

Learn More

refine.bio is a repository of uniformly processed and normalized, ready-to-use transcriptome data from publicly available sources.

Learn More

Data Science Workshops

The Data Lab offers workshops to teach researchers the data science skills they need to examine their own data. Our courses focus on the most cutting edge tools and analysis techniques. We ensure that participants walk away with an understanding of:

The R programming language, R Notebooks, and some reproducible research practices.
Processing bulk and single-cell RNA-seq data from raw all the way to downstream analyses.
Downstream analyses methods like differential expression analyses, hierarchical clustering, and preparing publication-ready plots.

“I think anyone who is working on or near single-cell data should take this course. I am so much more confident in what I understand about single-cell analyses compared to where I was at the beginning. 10/10 recommend.”

- Jessica Elswood, Postdoctoral Associate, Baylor College of Medicine

Learn More

Donate

Make a donation to support the Data Lab’s mission of putting knowledge and resources in the hands of pediatric cancer experts poised for the next big discovery.

With your help, we can

Fund innovative models to scale training workshops.

Offer our expertise and provide consultation on projects that will change the future for children fighting cancer.

Train at least 200 childhood cancer researchers over the next four years.

Donate Now

Blog

Announcements

March 2, 2026

Announcements

2026-03-02

Data Lab Advanced Single-cell RNA-Seq Workshop, Virtual, June 8-12, 2026

Applications are open for the Data Lab's upcoming workshop, which will cover advanced topics in the analysis of single-cell RNA-seq data for researchers studying pediatric cancer. The course will be held virtually from June 8-12, 2026 from 12-5pm Eastern time.

JEN O'MALLEY

Announcements

March 2, 2026

Announcements

2026-03-02

Data Lab Introduction to Single-Cell RNA-Seq Workshop, Virtual, May 11-15, 2026

The Data Lab will be holding a virtual workshop, Introduction to Single-cell RNA-Sequencing, from May 11-15, 2026! In this workshop, Data Lab staff will introduce researchers studying pediatric cancer to the R programming language, the Tidyverse R packages for data science, single-cell RNA-seq data analysis, and annotating cell types.

JEN O'MALLEY

News

January 20, 2026

News

2026-01-20

New Feature Release: Cell Type Annotations, CNV Inference, Custom Downloads, and More on the ScPCA Portal

Exciting news from the Single-cell Pediatric Cancer Atlas (ScPCA) Portal! All datasets on the portal have been updated to include several new features that enhance data quality and usability. Here’s a close look at what we’ve added and why.

JEN O'MALLEY

See More Posts

Projects

Data Science Workshops

Subscribe to our Newsletter

Donate

With your help, we can

Blog