How to Calculate Cancer Cell Fraction (CCF)

Estimate clonality from variant allele frequency, tumor purity, and copy-number context. This calculator uses a standard purity-adjusted CCF model for somatic variants.

Observed VAF (%) If you provide reads below, VAF can be derived automatically.

Tumor Purity (%) Fraction of cancer cells in the analyzed sample.

Tumor Total Copy Number at Locus Example: 2 for diploid, 3 for one-copy gain.

Mutant Allele Multiplicity (m) How many tumor copies carry the mutation.

Normal Copy Number Adjust if your locus is not diploid in normal cells.

Sample Type Used for interpretation notes only.

Alternate Reads (optional)

Total Depth (optional)

CCF Display Mode

Results

Enter your values and click Calculate CCF.

Expert Guide: How to Calculate Cancer Cell Fraction Correctly

Cancer cell fraction (CCF) is one of the most useful quantitative values in somatic genomics because it links an observed sequencing signal to tumor evolution. In simple terms, CCF estimates what fraction of malignant cells in a sample carry a specific mutation. A mutation with CCF near 1.0 is often interpreted as clonal or near-clonal, while lower values suggest a subclonal event present in only a subset of cancer cells. This distinction matters in translational oncology, molecular tumor boards, and research pipelines that infer branching evolution, resistance mechanisms, or treatment response dynamics.

Many people first encounter variant allele frequency (VAF) and assume it directly equals CCF. That is almost never true in real specimens. VAF is heavily shaped by sample purity, copy-number state, and mutation multiplicity. For example, a clonal heterozygous mutation in a perfectly pure diploid sample can produce a VAF around 50%. The same clonal mutation in a 40% purity sample can drop to much lower values. If copy number is amplified, VAF can move up or down depending on whether the mutant allele was also amplified. Because of this, CCF estimation is fundamentally a model-based correction of observed VAF.

Core Formula Used in This Calculator

This page uses a widely adopted purity-adjusted model:

CCF = VAF × (p × C_t + (1 – p) × C_n) / (p × m)

VAF: observed variant allele frequency in decimal form (25% = 0.25)
p: tumor purity in decimal form (60% = 0.60)
C_t: total tumor copy number at that locus
C_n: normal copy number (usually 2 for autosomes)
m: multiplicity, the number of tumor copies carrying the mutation

The calculator also reports the expected VAF if the mutation were fully clonal under the same purity and copy-number assumptions:

Expected clonal VAF = (p × m) / (p × C_t + (1 – p) × C_n)

This gives you a fast reality check: if your observed VAF is far below the clonal expectation, the mutation is likely subclonal (or assumptions are off). If your computed CCF is well above 1.0, your inputs may conflict, often because multiplicity or local copy number has been underestimated, purity is misestimated, or technical artifacts inflated the measured VAF.

Step-by-Step Practical Workflow

Start with the best available VAF. If you have alt reads and total depth, compute VAF from counts to avoid rounding loss.
Use a defensible tumor purity estimate from pathology, computational deconvolution, or integrated consensus methods.
Assign local copy number at the mutation locus from your copy-number pipeline, not genome-wide average ploidy.
Set multiplicity carefully. In many quick analyses, m = 1 is a default, but this can be wrong in copy-number gains or post-mutation amplifications.
Calculate CCF and compare observed VAF vs expected clonal VAF.
Interpret in context of uncertainty: low depth, FFPE damage, tumor heterogeneity, and ctDNA admixture can all shift estimates.

How to Interpret CCF Values

CCF less than 0.85: often interpreted as subclonal, though noise and bias can mimic subclonality.
CCF roughly 0.85 to 1.15: broadly consistent with clonal presence after accounting for technical variation.
CCF above 1.15: usually indicates model mismatch, not biologically impossible super-clonality.

These thresholds are heuristic and should be interpreted with confidence intervals and replicate evidence where possible. In high-stakes clinical interpretation, CCF should be one component of integrated evidence, not a standalone decision variable.

Comparison Table: Typical Sequencing Depth and Practical VAF Detection Range

Assay Context	Typical Depth	Common Practical VAF Sensitivity	Impact on CCF Confidence
Whole-exome sequencing (tumor-normal)	80x to 150x	About 5% to 10% VAF for robust calls	Moderate uncertainty for low-VAF subclones
Standard targeted DNA panel	300x to 1000x	About 1% to 5% VAF depending on assay design	Improved quantitation for minor clones
UMI-enabled ultra-deep targeted sequencing	2000x to 20000x raw depth	About 0.1% to 1% VAF in optimized workflows	High sensitivity, but pre-analytic bias still matters
Liquid biopsy ctDNA panels	5000x and above in many assays	Often 0.1% to 0.5% with strict error suppression	Useful for minimal residual disease and tracking dynamics

Ranges are representative values commonly reported in clinical and translational sequencing practice; exact limits vary by platform, bioinformatics pipeline, and validation protocol.

Comparison Table: Why Purity Changes Expected VAF for the Same Clonal Mutation

Scenario	Purity (p)	Total Tumor CN (C_t)	Multiplicity (m)	Expected Clonal VAF
Diploid tumor, ideal purity	1.00	2	1	50.0%
Diploid tumor, moderate purity	0.60	2	1	30.0%
Diploid tumor, low purity	0.30	2	1	15.0%
Copy-number gain with one mutant copy	0.60	3	1	23.1%
Copy-number gain with two mutant copies	0.60	3	2	46.2%

Common Pitfalls That Distort CCF

Using global ploidy instead of local copy number: CCF is locus-specific, so local CN drives the math.
Assuming multiplicity is always 1: after amplification events, mutant copies can exceed one.
Ignoring purity uncertainty: a small purity shift can substantially move CCF estimates.
Low read depth: stochastic sampling error inflates uncertainty at low VAF.
Panel-specific error modes: strand bias, deamination, and mapping artifacts can falsely elevate VAF.
ctDNA admixture complexity: plasma-derived signals may represent multiple lesions with different CN states.

Clinical and Research Context

In oncology research, CCF helps separate early trunk mutations from branch events, supporting phylogenetic models of progression and resistance. In longitudinal monitoring, changes in CCF over time can indicate selective sweeps under treatment pressure. In precision oncology reports, CCF can help prioritize variants by likely biological prevalence, though final interpretation must include gene function, actionability tier, and assay confidence.

In liquid biopsy, CCF interpretation is more complex because cfDNA and ctDNA are mixed biological pools with variable shedding by lesion site and burden. A low VAF in plasma does not automatically imply a low CCF in tissue. Instead, it may reflect low tumor fraction in circulation. This is one reason why integrating tissue and plasma data is increasingly recommended for comprehensive molecular profiling.

Authoritative References for Deeper Reading

Best Practices Summary

If you need reliable CCF values, treat the calculation as part of an inference pipeline, not a standalone arithmetic step. Use high-quality variant calls, robust purity estimates, locus-specific copy-number calls, and explicit multiplicity assumptions. Report uncertainty, especially for low-VAF mutations and low-depth loci. When possible, cross-validate major clonality conclusions with orthogonal evidence such as serial samples, independent assays, or consistent phylogenetic structure across multiple variants.

Used this way, CCF becomes a powerful quantitative tool that turns raw sequencing counts into biologically meaningful insight. The calculator above is designed for fast, transparent estimation, and the displayed chart helps you visually compare observed VAF, expected clonal VAF, and inferred CCF in one place.

How To Calculate Cancer Cell Fraction