Assembly Track Settings
 
Assembly from Fragments   (All Mapping and Sequencing tracks)

Display mode:      Duplicate track
Data schema/format description and download
Assembly: Human Dec. 2013 (GRCh38/hg38)
Data last updated at UCSC: 2022-10-18

Description

This track shows the contigs used to construct the GRCh38 (hg38) genome assembly, as defined in the AGP file delivered with the sequence. For information on the AGP file format, see the NCBI AGP Specification. The NCBI website also provides an overview of genome assembly procedures, as well as specific information about the hg38 assembly.

In dense mode, this track depicts the contigs that make up the currently viewed scaffold. Contig boundaries are distinguished by the use of alternating gold and brown coloration. Where gaps exist between contigs, spaces are shown between the gold and brown blocks. The relative order and orientation of the contigs within a scaffold is always known; therefore, a line is drawn in the graphical display to bridge the blocks.

Component types found in this track (with counts of that type in parenthesis):

  • F - finished sequence (35,798)
  • O - other sequence (8,536)
  • W - whole genome shotgun (764)
  • P - pre draft (16)
  • D - draft sequence (8)
  • A - active finishing (8)

In addition to the standard nucleotide codes, the raw sequence files from NCBI also include IUPAC ambiguity codes for bases that could not be positively identified as A, C, G or T (see Wikipedia's IUPAC notation article for more information). As part of the UCSC assembly creation process, all IUPAC ambiguity characters are converted to Ns. The FASTA files available for download from UCSC reflect this. The raw data files containing the original IUPAC characters can be downloaded from the NCBI FTP site.

The following table lists the counts by chromosome of the various IUPAC ambiguity characters in the original NCBI data files:

chromosome
1 2 3 6 7 9 10 12 13 16 17 21 22 X Y Total
code
B 1 1 2
K 1 4 1 2 8
M 1 1 3 1 2 8
R 1 1 1 1 1 13 1 3 1 2 1 1 27
S 1 1 1 1 1 5
W 2 2 6 1 1 1 1 14
Y 4 3 1 2 2 8 2 2 5 2 2 2 35
Total 2 9 7 1 4 3 36 3 3 1 12 3 5 5 5 99