Introducing the Single-cell Pediatric Cancer Atlas (ScPCA) Portal

March 28, 2022

The Single-cell Pediatric Cancer Atlas (ScPCA) project began in 2019 when Alex’s Lemonade Stand Foundation (ALSF) funded 10 awards for single-cell profiling of pediatric cancer samples. The goal was to produce an atlas of gene expression profiles for a variety of childhood cancer types from different organ sites. Over the course of the project, ScPCA investigators generated massive volumes of raw sequencing data and shared it with the Data Lab to be uniformly processed. We then built a web interface to release the data to the public as an open resource for discovery, all in one convenient location. Introducing the ScPCA Portal!

Figure that shows how the ScPCA data is funded, processed, and publicly released.

Currently, data from 7 ScPCA projects are available on the portal, for a total of 189 patient samples representing 29 diagnoses. Data will continue to be added as we receive and process it. Users from anywhere can now explore this growing database of single-cell and single-nuclei RNA-sequencing data and begin using it for their own analyses!

The ScPCA Portal homepage

What the portal does

  • It provides easily available samples from a broad selection of cancer types. Downloads include both filtered and unfiltered count matrices for each sample along with sample and project metadata. Learn more about getting started with an ScPCA dataset.
  • It offers free access to our data processing pipeline. We aimed to process all of the data using a well-documented, open-source pipeline that is comparable to widely used software but faster and more memory efficient. Learn more about how the data was processed.
  • It frees up your time! We have already processed the raw sequencing data before making it available on the portal. Users can skip that time-consuming step and immediately begin working with the data. Learn more about the data formats.

View the full documentation to learn more about what you can do with ScPCA data!

What the portal does not do yet

  • It does not provide normalized data or integrated data. Each sample is treated independently.
  • It does not serve as a platform for data visualization.
  • It does not currently include data that isn’t funded by ALSF through ScPCA grants.

We look forward to the community’s feedback as we continue to develop the portal!

What’s next?

The ScPCA Portal has the potential to serve the childhood cancer research community in more ways. As more users explore this resource, the community will help guide what comes next! We will soon be conducting usability testing of our data processing pipeline. Are you interested in helping us to enhance our product?

How you can help:

Sign up to virtually join members of our team for a multi-part usability evaluation! Our goal is to ensure that others can successfully use the pipeline to run their own data and to identify opportunities for improvement.

  • Session one: You will test how well our instructions and examples help you prepare to run your own sample. This requires setting up configurations that work for your computing environment and gathering the information about your data that is needed for processing.
  • Session two: You will run a single sample using our pipeline!

We are seeking researchers who can participate in at least two sessions and have access to a high-performance computing system. Experience with Nextflow and UNIX is especially helpful! You can sign up via the form below, and a member of our team will reach out to schedule your sessions soon. As a token of our appreciation for your valuable feedback, the first three users to complete an evaluation will receive a $50 Visa gift card!

Please reach out to us at scpca@ccdatalab.org with any questions!
