본문 바로가기
Tools

GATK

by wycho 2020. 8. 7.

Variant calling을 위해서 GATK를 사용해 보았다.

 

공식 : https://gatk.broadinstitute.org/hc/en-us

다운로드  : https://github.com/broadinstitute/gatk/releases

$ wget https://github.com/broadinstitute/gatk/releases/download/4.1.8.1/gatk-4.1.8.1.zip
$ unzip gatk-4.1.8.1.zip
$ cd gatk-4.1.8.1
$ ./gatk

압축을 풀고 나서 설치과정 없이 바로 사용 가능하다.

 

준비과정으로 reference fasta file의 dictionary만드는 과정이 필요하다.

$ samtools faidx hg37.fa # generate index file
$ gatk CreateSequenceDictionary -R hg37.fa

 

Variant calling

$ gatk HaplotypeCaller -R hg37.fa -I input.bam -O output.vcf.gz --read-filter OverclippedReadFilter --native-pair-hmm-threads 8
or 
$ gatk HaplotypeCaller \
   -R hg37.fa \
   -I input.bam \
   -O output.vcf.gz \
   --read-filter OverclippedReadFilter \
   --native-pair-hmm-threads 8

다양한 옵션들은 아래 링크를 참고하면 된다.

https://gatk.broadinstitute.org/hc/en-us/articles/360036897532-HaplotypeCaller

https://gatk.broadinstitute.org/hc/en-us/articles/360035531892-GATK4-command-line-syntax

 

더보기

RAM VIRT~36.7GB RES~ 1990MB

$ find *.bam | awk -F. '{print $1}' | while read line;do echo "gatk HaplotypeCaller -R hg37.fa -I ${line}.bam -O ${line}.vcf.gz --read-filter OverclippedReadFilter";done | parallel --bar -j5 {}

 

Coverage - Read depth metrics

https://gatk.broadinstitute.org/hc/en-us/articles/360041851491-DepthOfCoverage-BETA-

$ gatk \
   DepthOfCoverage \
   -R reference.fasta \
   -O file_name_base \
   -I input_bams.list \
   [-geneList refSeq.sorted.refseq] \
   [-pt readgroup] \
   [-ct 4 -ct 6 -ct 10] \
   [-L my_capture_genes.interval_list]

* The file extension of 'input_bams.list' must be '.list'.

 

Funcotator for annotation

https://gatk.broadinstitute.org/hc/en-us/articles/360035889931-Funcotator-Information-and-Tutorial

# A VCF instantiation of the Funcotator tool looks like this:
$ ./gatk Funcotator \
     --variant variants.vcf \
     --reference Homo_sapiens_assembly19.fasta \
     --ref-version hg19 \
     --data-sources-path funcotator_dataSources.v1.2.20180329 \
     --output variants.funcotated.vcf \
     --output-file-format VCF

http://ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/funcotator/

 

 

INFO

QUAL/AD = 1063.77/31 = 34.32 = QD

 

 

 

'Tools' 카테고리의 다른 글

checkVCF  (0) 2020.09.15
Windows Terminal  (0) 2020.08.17
Slurm - Workload manager  (0) 2020.08.15
LDpop  (0) 2020.07.12
Shapeit4  (0) 2020.07.09

댓글