https://hgdownload.cse.ucsc.edu/goldenpath/hg38/bigZips/
Chromosomes:
- made from scaffolds placed onto chromosome locations, 95% of the genome file
- format: chr{chromosome number or name}
- e.g. chr1 or chrX, chrM for the mitochondrial genome.
Unlocalized scaffolds:
- a sequence found in an assembly that is associated with a specific
chromosome but cannot be ordered or oriented on that chromosome.
- format: chr{chromosome number or name}_{sequence_accession}v{sequence_version}_random
- e.g. chr17_GL000205v2_random
Unplaced scaffolds:
- a sequence found in an assembly that is not associated with any chromosome.
- format: chrUn_{sequence_accession}v{sequence_version}
- e.g. chrUn_GL000220v1
Alternate loci scaffolds:
- a scaffold that provides an alternate representation of a locus found
in the primary assembly. These sequences do not represent a complete
chromosome sequence although there is no hard limit on the size of the
alternate locus; currently these are less than 1 Mb. These could either
be NOVEL patch sequences, added through patch releases, or present in the
initial assembly release.
- format: chr{chromosome number or name}_{sequence_accession}v{sequence_version}_alt
- e.g. chr6_GL000250v2_alt
Fix loci scaffolds:
- a patch that corrects sequence or reduces an assembly gap in a given
major release. FIX patch sequences are meant to be incorporated into
the primary or existing alt-loci assembly units at the next major
release.
- these sequences are not part of the files in the initial/ directory
- format: chr{chromosome number or name}_{sequence_accession}v{sequence_version}_fix
- e.g. chr2_KN538362v1_fix
'etc' 카테고리의 다른 글
[github] Polypharmacy (0) | 2021.06.24 |
---|---|
Announcement (0) | 2021.06.09 |
[book] 확장된 표현형 (0) | 2021.05.30 |
GDC DNA-sequence analysis (0) | 2021.04.08 |
다윈 (0) | 2021.02.15 |
댓글