Skip to main content

A Random Forest Model for Detecting Somatic Mutations from Unpaired Cancer Samples

Friday, September 15, 2017 — Poster Session IV

1:00 p.m. – 2:30 p.m.
FAES Terrace


  • S Tu
  • B Zhou
  • DW Huang
  • R Schmitz
  • GW Wright
  • LM Staudt
  • CA Johnson


In whole exome sequencing studies on human cancer samples without matched normal samples, most SNV callers contaminate the pool of somatic mutation calls with germline calls due to several complications in the cancer genome. We have trained a random forest model on MuTect2 scores from 37 matched-normal diffuse large B-cell lymphoma samples from TCGA. The model’s ROC vastly outperforms that of the toxicity callers that comprise its feature space.

Category: Computational Biology