UniProt Variants Track Settings
 
UniProt/SwissProt Amino Acid Substitutions   (All Variation and Repeats tracks)

Display mode:      Duplicate track


Display data as a density graph:
Data schema/format description and download
Assembly: D. melanogaster Aug. 2014 (BDGP Release 6 + ISO1 MT/dm6)
Data last updated at UCSC: 2024-11-26 08:23:02

Description

NOTE:
This track is intended for use primarily by physicians and other professionals concerned with genetic disorders, by genetics researchers, and by advanced students in science and medicine. While the genome browser database is open to the public, users seeking information about a personal medical or genetic condition are urged to consult with a qualified physician for diagnosis and for answers to personal questions.

This track shows the genomic positions of natural and artifical amino acid variants in the UniProt/SwissProt database. The data has been curated from scientific publications by the UniProt staff.

Display Conventions and Configuration

Genomic locations of UniProt/SwissProt variants are labeled with the amino acid change at a given position and, if known, the abbreviated disease name. A "?" is used if there is no disease annotated at this location, but the protein is described as being linked to only a single disease in UniProt.

Mouse over a mutation to see the UniProt comments.

Artificially-introduced mutations are colored green and naturally-occurring variants are colored red. For full information about a particular variant, click the "UniProt variant" linkout. The "UniProt record" linkout lists all variants of a particular protein sequence. The "Source articles" linkout lists the articles in PubMed that originally described the variant(s) and were used as evidence by the UniProt curators.

Methods

UniProt sequences were aligned to RefSeq sequences first with BLAT, then lifted to genome positions with pslMap. UniProt variants were parsed from the UniProt XML file. The variants were then mapped to the genome through the alignment using the pslMap program. This mapping approach draws heavily on the LS-SNP pipeline by Mark Diekhans. The complete script is part of the kent source tree and is located in src/hg/utils/uniprotMutations.

Data Access

The raw data can be explored interactively with the Table Browser, or the Data Integrator. For automated analysis, the genome annotation is stored in a bigBed file that can be downloaded from the download server. The underlying data file for this track is called spMut.bb. Individual regions or the whole genome annotation can be obtained using our tool bigBedToBed which can be compiled from the source code or downloaded as a precompiled binary for your system. Instructions for downloading source code and binaries can be found here. The tool can also be used to obtain only features within a given range, for example:
bigBedToBed http://hgdownload.soe.ucsc.edu/gbdb/dm6/bbi/uniprot/spMut.bb -chrom=chr6 -start=0 -end=1000000 stdout
Please refer to our mailing list archives for questions, or our Data Access FAQ for more information.

Credits

This track was created by Maximilian Haeussler, with advice from Mark Diekhans and Brian Raney.

References

UniProt Consortium. Reorganizing the protein space at the Universal Protein Resource (UniProt). Nucleic Acids Res. 2012 Jan;40(Database issue):D71-5. PMID: 22102590; PMC: PMC3245120

Yip YL, Scheib H, Diemand AV, Gattiker A, Famiglietti LM, Gasteiger E, Bairoch A. The Swiss-Prot variant page and the ModSNP database: a resource for sequence and structure information on human protein variants. Hum Mutat. 2004 May;23(5):464-70. PMID: 15108278