| 1 | +sambamba-markdup(1) -- finding duplicate reads in a BAM file |
| 2 | +============================================================= |
| 3 | + |
| 4 | +## SYNOPSIS |
| 5 | + |
| 6 | +`sambamba markdup` [OPTIONS] <input.bam> <output.bam> |
| 7 | + |
| 8 | +## DESCRIPTION |
| 9 | + |
| 10 | +Marks (by default) or removes duplicate reads. Whether a read |
| 11 | +is a duplicate is determined by the same criteria that Picard |
| 12 | +uses. |
| 13 | + |
| 14 | +## OPTIONS |
| 15 | + |
| 16 | + * `-r`, `--remove-duplicates`: |
| 17 | + remove duplicates instead of just marking them |
| 18 | + |
| 19 | + * `-t`, `--nthreads`=<NTHREADS>: |
| 20 | + number of threads to use |
| 21 | + |
| 22 | + * `-l`, `--compression-level`=<N>: |
| 23 | + specify compression level of the resulting file (from 0 to 9) |
| 24 | + |
| 25 | + * `-p`, `--show-progress`: |
| 26 | + show progress bar on STDERR |
| 27 | + |
| 28 | + * `--tmpdir`=<TMPDIR>: |
| 29 | + specify directory for temporary files; default is `/tmp` |
| 30 | + |
| 31 | + * `--hash-table-size`=<HASHTABLESIZE>: |
| 32 | + size of hash table for finding read pairs (default is 262144 reads); |
| 33 | + will be rounded down to the nearest power of two; |
| 34 | + should be `> (average coverage) * (insert size)` for good performance |
| 35 | + |
| 36 | + * `--overflow-list-size`=<OVERFLOWLISTSIZE>: |
| 37 | + size of the overflow list, where reads evicted from the hash table |
| 38 | + get a second chance to meet their pairs (default is 200000 reads); |
| 39 | + increasing the size reduces the number of temporary files created |
| 40 | + |
| 41 | + * `--io-buffer-size`=<BUFFERSIZE>: |
| 42 | + size, in *megabytes*, of each of the two buffers used |
| 43 | + for reading and writing BAM during the second pass (default is 128) |
| 44 | + |
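The sizing rule for `--hash-table-size` can be worked through with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical and not part of sambamba, and it assumes nothing about the tool's internals beyond what the option text states (the value is rounded down to a power of two and should exceed average coverage times insert size).

```python
def round_down_to_power_of_two(n: int) -> int:
    """Largest power of two that is <= n, mirroring the documented rounding."""
    if n < 1:
        raise ValueError("size must be positive")
    return 1 << (n.bit_length() - 1)


def suggested_hash_table_size(avg_coverage: float, insert_size: int) -> int:
    """Smallest power of two strictly greater than coverage * insert size.

    Hypothetical helper: it applies the rule of thumb from the option text
    and picks a power of two so that rounding down leaves it unchanged.
    """
    target = int(avg_coverage * insert_size)
    return 1 << target.bit_length()


if __name__ == "__main__":
    # A requested size of 300000 would be rounded down to the default, 262144.
    print(round_down_to_power_of_two(300_000))
    # 30x coverage with a 400 bp insert size needs more than 12000 entries.
    print(suggested_hash_table_size(30, 400))
```

Passing a value that is already a power of two, for example `--hash-table-size=524288`, avoids any surprise from the rounding step.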
| 45 | +## BUGS |
| 46 | + |
| 47 | + External sort is not implemented, |
| 48 | + so memory consumption grows by 2 GB for every 100M reads. |
| 49 | + Check that you have enough RAM before running the tool. |
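As a rough planning aid, the figure above can be turned into a linear estimate. This is a minimal sketch, assuming memory scales exactly at the stated 2 GB per 100M reads; the function name is hypothetical and the result ignores the I/O buffers, the hash table, and any other overhead.

```python
def estimated_memory_gb(n_reads: int) -> float:
    """Linear estimate from the documented rate of 2 GB per 100M reads."""
    if n_reads < 0:
        raise ValueError("read count cannot be negative")
    return 2.0 * n_reads / 100_000_000


if __name__ == "__main__":
    # Estimate for a BAM file containing 250 million reads.
    print(f"{estimated_memory_gb(250_000_000):.1f} GB")
```

The read count of an existing file has to be obtained separately before the estimate can be applied.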