Compute coverage QC
Compute coverage QC metrics over regions in the panel
computeCoverageQC
Compute coverageQC metrics across panel regions.
Usage:
Options:
Name | Type | Description | Default |
---|---|---|---|
--panelPosFile |
Path | File with panel positions as generated by buildIndex command.File should have six columns: Chr.Pos, Chr, Start, End, Gene, RegionLength | _required |
--summaryPosFile |
Path | File with unique positions in the panel to compute sample level summary statistics. File should have one column: Chr.Pos, as generated by buildIndex command. | _required |
--countFile |
Path | Count file(s) generated by SequencErr program. File(s) should've seven columns: Chr, Pos, A_Q_30, C_Q_30, G_Q_30, T_Q_30, N_Q_30 | _required |
--outdir |
Path | Output directory path. | None |
--outSummary / --no-outSummary |
boolean | Output counts at summary positions | False |
--help |
boolean | Show this message and exit. | False |
How to generate count files?
The count files should be generated using SequencErr1
Please refer to Generating count files
Output files
computeCoverageQC
generates two output files.- Panel_regionQC file
- This file consists of the region level QC metrics for all the regions in the panel
- Sample_summaryQC file
-
- This file consists sample level QC metrics across all the positions in the panel
-
Output format
Panel_regionQC file
Chr | Start | End | Gene | RegionLength | paddedLength | Mean | SD | basecount_2SD | basecount_1.5SD | basecount_1SD |
---|---|---|---|---|---|---|---|---|---|---|
chr1 | 1718760 | 1718886 | GNB1 | 127 | 127 | 32883.11 | 3783.5 | 127 | 127 | 127 |
chr1 | 1720482 | 1720718 | GNB1 | 237 | 237 | 30500.3 | 4670.12 | 237 | 237 | 237 |
. | . | . | . | . | . | . | . | . | . | . |
. | . | . | . | . | . | . | . | . | . | . |
chrX | 153629033 | 153629205 | RPL10 | 173 | 173 | 17666.45 | 2345.65 | 173 | 173 | 166 |
chrX | 153711341 | 153711341 | chrX.153711341.C.G | 1 | 1 | 25764.0 | 1 | 1 | 1 |
Sample_summaryQC file
Sample | sampleMean | sampleMedian | sampleSD | CV | cut_2SD | cut_1pt5SD | cut_1SD | quantile_20 | fold_80 | uniformityOfCoverage | Pcntbase_2SDPcntbase_1.5SD | Pcntbase_1SD |
---|---|---|---|---|---|---|---|---|---|---|---|---|
EW-8_S5 | 21974.68 | 20209.0 | 14067.16 | 0.64 | -6159.64 | 873.94 | 7907.52 | 12573.0 | 1.75 | 94.92 | 100.0 | 98.37 |
-
Davis, E.M., Sun, Y., Liu, Y., Kolekar, P. et al. SequencErr: measuring and suppressing sequencer errors in next-generation sequencing data. Genome Biol 22, 37 (2021). https://doi.org/10.1186/s13059-020-02254-2 ↩