How to Calculate Cancer Cell Fraction (CCF)
Estimate clonality from variant allele frequency, tumor purity, and copy-number context. This calculator uses a standard purity-adjusted CCF model for somatic variants.
Results
Enter your values and click Calculate CCF.
Expert Guide: How to Calculate Cancer Cell Fraction Correctly
Cancer cell fraction (CCF) is one of the most useful quantitative values in somatic genomics because it links an observed sequencing signal to tumor evolution. In simple terms, CCF estimates what fraction of malignant cells in a sample carry a specific mutation. A mutation with CCF near 1.0 is often interpreted as clonal or near-clonal, while lower values suggest a subclonal event present in only a subset of cancer cells. This distinction matters in translational oncology, molecular tumor boards, and research pipelines that infer branching evolution, resistance mechanisms, or treatment response dynamics.
Many people first encounter variant allele frequency (VAF) and assume it directly equals CCF. That is almost never true in real specimens. VAF is heavily shaped by sample purity, copy-number state, and mutation multiplicity. For example, a clonal heterozygous mutation in a perfectly pure diploid sample can produce a VAF around 50%. The same clonal mutation in a 40% purity sample can drop to much lower values. If copy number is amplified, VAF can move up or down depending on whether the mutant allele was also amplified. Because of this, CCF estimation is fundamentally a model-based correction of observed VAF.
Core Formula Used in This Calculator
This page uses a widely adopted purity-adjusted model:
CCF = VAF × (p × Ct + (1 – p) × Cn) / (p × m)
- VAF: observed variant allele frequency in decimal form (25% = 0.25)
- p: tumor purity in decimal form (60% = 0.60)
- Ct: total tumor copy number at that locus
- Cn: normal copy number (usually 2 for autosomes)
- m: multiplicity, the number of tumor copies carrying the mutation
The calculator also reports the expected VAF if the mutation were fully clonal under the same purity and copy-number assumptions:
Expected clonal VAF = (p × m) / (p × Ct + (1 – p) × Cn)
This gives you a fast reality check: if your observed VAF is far below the clonal expectation, the mutation is likely subclonal (or assumptions are off). If your computed CCF is well above 1.0, your inputs may conflict, often because multiplicity or local copy number has been underestimated, purity is misestimated, or technical artifacts inflated the measured VAF.
Step-by-Step Practical Workflow
- Start with the best available VAF. If you have alt reads and total depth, compute VAF from counts to avoid rounding loss.
- Use a defensible tumor purity estimate from pathology, computational deconvolution, or integrated consensus methods.
- Assign local copy number at the mutation locus from your copy-number pipeline, not genome-wide average ploidy.
- Set multiplicity carefully. In many quick analyses, m = 1 is a default, but this can be wrong in copy-number gains or post-mutation amplifications.
- Calculate CCF and compare observed VAF vs expected clonal VAF.
- Interpret in context of uncertainty: low depth, FFPE damage, tumor heterogeneity, and ctDNA admixture can all shift estimates.
How to Interpret CCF Values
- CCF less than 0.85: often interpreted as subclonal, though noise and bias can mimic subclonality.
- CCF roughly 0.85 to 1.15: broadly consistent with clonal presence after accounting for technical variation.
- CCF above 1.15: usually indicates model mismatch, not biologically impossible super-clonality.
These thresholds are heuristic and should be interpreted with confidence intervals and replicate evidence where possible. In high-stakes clinical interpretation, CCF should be one component of integrated evidence, not a standalone decision variable.
Comparison Table: Typical Sequencing Depth and Practical VAF Detection Range
| Assay Context | Typical Depth | Common Practical VAF Sensitivity | Impact on CCF Confidence |
|---|---|---|---|
| Whole-exome sequencing (tumor-normal) | 80x to 150x | About 5% to 10% VAF for robust calls | Moderate uncertainty for low-VAF subclones |
| Standard targeted DNA panel | 300x to 1000x | About 1% to 5% VAF depending on assay design | Improved quantitation for minor clones |
| UMI-enabled ultra-deep targeted sequencing | 2000x to 20000x raw depth | About 0.1% to 1% VAF in optimized workflows | High sensitivity, but pre-analytic bias still matters |
| Liquid biopsy ctDNA panels | 5000x and above in many assays | Often 0.1% to 0.5% with strict error suppression | Useful for minimal residual disease and tracking dynamics |
Ranges are representative values commonly reported in clinical and translational sequencing practice; exact limits vary by platform, bioinformatics pipeline, and validation protocol.
Comparison Table: Why Purity Changes Expected VAF for the Same Clonal Mutation
| Scenario | Purity (p) | Total Tumor CN (Ct) | Multiplicity (m) | Expected Clonal VAF |
|---|---|---|---|---|
| Diploid tumor, ideal purity | 1.00 | 2 | 1 | 50.0% |
| Diploid tumor, moderate purity | 0.60 | 2 | 1 | 30.0% |
| Diploid tumor, low purity | 0.30 | 2 | 1 | 15.0% |
| Copy-number gain with one mutant copy | 0.60 | 3 | 1 | 23.1% |
| Copy-number gain with two mutant copies | 0.60 | 3 | 2 | 46.2% |
Common Pitfalls That Distort CCF
- Using global ploidy instead of local copy number: CCF is locus-specific, so local CN drives the math.
- Assuming multiplicity is always 1: after amplification events, mutant copies can exceed one.
- Ignoring purity uncertainty: a small purity shift can substantially move CCF estimates.
- Low read depth: stochastic sampling error inflates uncertainty at low VAF.
- Panel-specific error modes: strand bias, deamination, and mapping artifacts can falsely elevate VAF.
- ctDNA admixture complexity: plasma-derived signals may represent multiple lesions with different CN states.
Clinical and Research Context
In oncology research, CCF helps separate early trunk mutations from branch events, supporting phylogenetic models of progression and resistance. In longitudinal monitoring, changes in CCF over time can indicate selective sweeps under treatment pressure. In precision oncology reports, CCF can help prioritize variants by likely biological prevalence, though final interpretation must include gene function, actionability tier, and assay confidence.
In liquid biopsy, CCF interpretation is more complex because cfDNA and ctDNA are mixed biological pools with variable shedding by lesion site and burden. A low VAF in plasma does not automatically imply a low CCF in tissue. Instead, it may reflect low tumor fraction in circulation. This is one reason why integrating tissue and plasma data is increasingly recommended for comprehensive molecular profiling.
Authoritative References for Deeper Reading
- National Human Genome Research Institute (genome.gov): Variant Allele Frequency glossary
- National Cancer Institute (cancer.gov): Liquid biopsy and cancer diagnostics
- NIH/NLM PubMed Central: Methodological frameworks for tumor clonality and subclonality analysis
Best Practices Summary
If you need reliable CCF values, treat the calculation as part of an inference pipeline, not a standalone arithmetic step. Use high-quality variant calls, robust purity estimates, locus-specific copy-number calls, and explicit multiplicity assumptions. Report uncertainty, especially for low-VAF mutations and low-depth loci. When possible, cross-validate major clonality conclusions with orthogonal evidence such as serial samples, independent assays, or consistent phylogenetic structure across multiple variants.
Used this way, CCF becomes a powerful quantitative tool that turns raw sequencing counts into biologically meaningful insight. The calculator above is designed for fast, transparent estimation, and the displayed chart helps you visually compare observed VAF, expected clonal VAF, and inferred CCF in one place.