Illumina Connected Analytics (ICA)
Last updated
Last updated
Use the DRAGEN_Somatic_Enrichment_4-3-17 Pipeline as part of the DRAGEN 4.3 Bundle in ICA to analyze sequencing data of llumina FFPE DNA Prep with Exome 2.5 Enrichment libraries. The settings defined here in this user guide are specific for this library preparation kit.
The analysis workflow uses resource files released with DRAGEN 4.4, so link both the DRAGEN 4.3 and DRAGEN 4.4 Bundles to your project. Additionally, Microsatellite Resource Files need to be manually uploaded and linked to your project if calculating MSI. See the Resource Files page for details on where to find those files. Note that the input for the Microsatellite Normal References can be a directory of individual distance files or combined distance file.
Start the DRAGEN Somatic Enrichment analysis
Under Pipelines, select DRAGEN_Somatic_Enrichment_4-3-17. Click Start Analysis on the top right of the page.
Input the fastq (or ORA) files for the tumor and normal sample in a sample pair.
Reference
The reference genome to use for alignment. Reference genome files are located with the Illumina DRAGEN v10 Reference directory. The provided Resource Files support the use of hg19-alt_masked.cnv.hla.methylation_combined.rna_v4.tar.gz and hg38-alt_masked.cnv.hla.methylation_combined.rna_v4.tar.gz.
Target BED file
BED file that contains targeted regions. If no spike-in probes were included in enrichment, select the Twist_ILMN_Exome_2.5_Panel bed file that matches the reference genome selected above, stored in the Illumina Enrichment BEDs directory.
If other spike-in probes were included in enrichment, generate a BED file of the combined coverage areas (ie., Exome 2.5 Plus Panel with the custom panel), upload to ICA, and select as the target BED file. See the Resources Files for information of format requirements.
Systematic Noise BED File
Select the systematic noise file that matches the panel used for enrichment and the reference genome used for alignment. The Illumina Systematic Noise directory within DRAGEN 4.4 Bundle contains WES_FFPE_NovaSeq_TwistV2.5_hg*_v1.0_systematic_noise.bed.gz. See the Resource Files for information on pre-built and generating your own file.
CNV Panel of Normals/CNV Combined Counts
Select the files to comprise the target.counts baseline files. Files must be of the same type (eg, all .target.counts or all .target.counts.gc-corrected.). Select those files that match the panel used for enrichment, match the reference genome used for alignment, and has the desired handling of GC correction. The Illumina DRAGEN CNV Baseline Files/DRAGEN 4.4/ directory contains combined counts files for both hg19 and hg38. See the Resource Files for information on pre-built target.count files.
MSI - Microsatellites File and Microsatellites Normal References Directory
As noted above, resource files for calculating MSI need to be uploaded to ICA and made available to the Project for inclusion in analysis.
Microsatellites File
Select the microsatellite list that maches the reference genome used for alignment: WES_v1.1.0_hg*_microsatellites.list
Microsatellites Normal References Directory/Combined Microsatellites Normal References File
Input the directory containing the individual distance files of the normal reference samples (hg*-WES-FFPE-*.microsat_normal.dist), or input a single merged file of the distances for each normal sample.
General Options
Output File Prefix—If something other than tumor is desired for the output file prefix, enter that here.
Sample Sex—Choose none from the drop-down list
Map Align Options
Enable Map/Align—true
Enable Map/Align Output—true
Enable Duplicate Marking—false
Map/Align Output—bam
Variant Calling Options
Enable Small Variant Caller—true
Leave Emit Ref Confidence and VCF File Output blank
Enable Germline Tagging—false
Enable CNV calling—true
CNV Use Somatic VC BAF—true
Enable SV calling—false
Targeted Callers
MSI command—tumor-only
MSI Coverage Threshold—60
If desired, set Enable Tumor Mutational Burden—true. Note that this requires variant annotation to be enabled.
Enable HLA—false
Enable HRD—false
Enable Tumor Mutational Burden—true
UMI Options
Enable UMI—true
UMI Library Type—nonrandom-duplex
UMI Aware Variant Calling—Low depth
Minimum Supporting UMI Reads—1
Variant Annotation Options
Enable Variant Annotation—true
Variant Annotation Assembly—reference corresponding to the genome used for variant calling
Additional Options
Additional DRAGEN Args—Paste the following commands into the box, making sure to update the name of the umi-metrics-interval-file
--umi-start-mask-length 1 --umi-end-mask-length 3 --qc-coverage-ignore-overlaps true --vc-sq-call-threshold 3 --vc-sq-filter-threshold 15
These settings can be rerun on additional tumor-normal sample pairs by selecting the particular analysis and clicking Rerun on the top right corner. Update the User reference and swap out the Tumor and Normal input files for the new pair.