Background#
BJ-CNV is a modified version of Ginkgo for copy number call. The pipeline takes deduplicated bam file, calculates coverage, performs normalization, and make cnv calls. Pipeline generates various plots and creates a table with copy number information with genomic coordinates. The CNV Tertiary pipeline is dependent on the secondary BJ-DNA-QC pipeline. Once the BJ-DNA-QC pipeline completes successfully, the prerequisites are met to launch the BJ-CNV pipeline.
Pipeline Overview#
flowchart LR
%% Colors %%
classDef panel fill:transparent,stroke:#323232,stroke-dasharray:8
classDef black fill:#12294C,stroke:#12294C,stroke-width:2px,color:#fff
classDef blue fill:#20A4F3,stroke:#20A4F3,stroke-width:2px,color:#fff
classDef green fill:#3BCEAC,stroke:#3BCEAC,stroke-width:2px,color:#fff
classDef yellow fill:#ffd166,stroke:#ffd166,stroke-width:2px,color:#fff
classDef pink fill:#ef476f,stroke:#ef476f,stroke-width:2px,color:#fff
classDef orange fill:#f3722c,stroke:#f3722c,stroke-width:2px,color:#fff
classDef red fill:#BB4430,stroke:#BB4430,stroke-width:2px,color:#fff
classDef ming fill:#387780,stroke:#387780,stroke-width:2px,color:#fff
Start((Start)):::black --bam--> Coverage[Coverage]:::green
subgraph Ginkgo
Coverage --> QC:::green
QC --> Remove-outliers:::green
Remove-outliers --> Segment-bins:::green
Segment-bins --> CNV-calls:::green
end
subgraph Visualization-App
CNV-calls --> report[CNV App]:::orange
end
CNV-calls --> End((End)):::black
Ginkgo:::panel
Visualization-App:::panel
Following are the steps and tools that pipeline users to perform the analyses:
- Convert BAM input files into BED files using
bamtobed
command frombedtools
- Map the BED coverage files into prior computed bins using
Ginkgo
module - Perform segmentation, ploidy estimation, and create CNV plots using the
Ginkgo
module - Perform final CNV call using the
Ginkgo
module
Parameters#
Module | Parameter Name | Options | Description |
---|---|---|---|
Ginkgo | (default) | CNV module evaluates the copy number using Ginkgo tool. | |
CNV | (default) | CNV module evaluates the copy number. | |
Bin Size (bp) | 1000000 (default) |
Genomic bin size in bp to use for CNV. | |
Minimum Ploidy | 1 1.5 (default) |
Minimum Ploidy to set for calling CNV. For haploid genome set to 1 |
|
Maximum Ploidy | 6 (default) |
Maximum Ploidy to set for calling CNV. |
Output files#
Output Directory/File |
Notes |
---|---|
tertiary_analyses / cnv_ginkgo / <BIOSAMPLE_NAME> / *{.tzv,jpeg} ginkgo_res.binsize_1000000.RDS sampleploidy.tsv BIOSAMPLE_NAME_*CNV*.tsv |
This section includes output files for the Copy Number Variant (CNV) data and information about the gains and losses for each biosample. CNV profile example |