Usage
Tools
BamToCov is inspired by the UNIX Phylosophy and the tools are designed for efficient computation of a very specific task. Integration of multiple samples and specific tasks can be achieved with scripts and we provide a set to demonstrate the process.
bamtocov will produce a coverage BED from a single BAM file, or a count matrix from a set of alignments and a target (in BED, GTF or GFF format). Used without a target, it is a drop-in replacement for covtobed, but discarding invalid alignments by default. When providing the target, it can produce coverage statistics for each region in the target, also with multiple BAM files.
bamtocounts will count the number of reads covering each target region, rather than the nucleotidic coverage
bamcountrefs is a shortcut to count the number of reads per chromosome, with filters on the read flags, length and quality
covtotarget (legacy) is an utility to create a count table from the output of the original covtobed program.
Quick start
bamtocov alignment.bam > coverage.bed
will produce a coverage BED file from the alignment file.
File formats
BED files
A BED file (.bed) is a tab-delimited text file that defines a feature track. In this context the magnitude refers to the nucleotide coverage of the interval.
The columns are chromosome name, start position (inclusive, zero-based), end position (non-inclusive, zero-based) and coverage. An example is:
seq1    0       9       0
seq1    9       109     5
seq1    109     189     0
seq1    189     200     2
Target statistics
 this format is not final.
 this format is not final.
For each sample, 5 columns are printed:
- bam_bases
- bam_mean
- bam_min
- bam_max
- bam_length
| interval | bam_bases | bam_mean | bam_min | bam_max | bam_length | 
|---|---|---|---|---|---|
| target1_8X | 699 | 3.495 | 1 | 6 | 200 | 
| target2_0X | 0 | 0.0 | 0 | 0 | 50 | 
| target3_1X | . | . | . | . | . | 
| for_rev_10Xa | 100 | 10.0 | 10 | 10 | 10 | 
| for_rev_10Xb | 100 | 10.0 | 10 | 10 | 10 | 
| for_rev_10Xc | . | . | . | . | . |