Recommendations for the description of sequence variants


Last modified March 22, 2016

NOTE: this website is frozen since May 1, 2016. It has been replaced by a new version at http://www.HGVS.org/varnomen. These pages serve as archival copy only.


Contents


Introduction


Discussions regarding the uniform and unequivocal description of sequence variants in DNA and protein sequences (mutations, polymorphisms) were initiated by two papers published in 1993; Beaudet AL & Tsui LC (DOI paper / abstract) and Beutler E (paper / abstract). The original suggestions presented were widely discussed, modified, extended and ultimately resulted in nomenclature recommendations that have been largely accepted and are applied world-wide (see History).

Current rules (den Dunnen, JT and Antonarakis, SE (2000), paper / abstract) however do not extensively cover all types of variants and the more complex changes. These pages will list, based on the last publication, the existing nomenclature recommendations as well as the most recent suggestions (in italics and marked ). More details regarding the  latest additions can be found at the Discussion page. These pages can be used as a guide to describe any sequence variant identified and should help to get a uniformly accepted standard.

Discussions regarding the advantages and disadvantages of the recommendations made are necessary in order to continuously improve the system. What is listed on these pages represents the current consensus of the discussions. We invite investigators to communicate to us regarding the recommendations as well as to send us complicated cases not yet covered, with a suggestion of how to describe these (E-mail to: VarNomen @ HGVS.org). 

Mutation and polymorphism

In some disciplines the term "mutation" is used to indicate "a change" while in other disciplines it is used to indicate "a disease-causing change". Similarly, the term "polymorphism" is used both to indicate "a non disease-causing change" or "a change found at a frequency of 1% or higher in the population". To prevent this confusion we do not use the terms mutation and polymorphism (including SNP or Single Nucleotide Polymorphism) but use neutral terms like "sequence variant", "alteration" and "allelic variant". The Vol.19(1) issue of Human Mutation (2002) contains several contributions discussing these issues as well as the fact that the term "mutation" has developed a negative connotation (see Cotton RGH - p.2, Condit CM et al. - p.69 and Marshall JH - p.76). Therefore, current guidelines of authorative organisations now also recommend to use the neutral term "variant" only (e.g. Richards 2015, Genet.Med. 17:405-424).

Pathogenic

Another confusing term used frequently is "a pathogenic variant". While a non-expert concludes the variant described "causes disease", the expert probably means "causes disease when in a specific context"

To prevent confusion it therefore seems best not to use the term "pathogenic". A good alternative seems a neutral term like "affects function". In fact this properly describes what one actually means, the variant affects the normal function of the gene/protein (in whatever way). This also solves the issue of what term to use for non-disease phenotypes like skin/hair/eye colour or blood group. In such cases it is problematic to choose the phenotype to call "normal" or "pathogenic". Using "affects function" is clear and effective. To classify variants people use most frequently 5 categories. Based on affects function these could be; affects funncion,  probably affects function,  unknown,  probably does not affect function (or probably no functional effect),  does not affect function (no functional effect). Variants for which a functional effect is unknown can together be called "variants of unknown significance" (VUS).


General recommendations

(suggestions extending the published recommendations in italics)


The most important rule is that all variants should be described at the most basic level, i.e. the DNA level. Descriptions should always be in relation to a reference sequence, either a genomic or a coding DNA reference sequence. Discussions on which type of reference sequence to prefer, genomic or coding DNA, have been lively. Although theoretically a genomic reference sequence seems best, in practice a coding DNA reference sequence is preferred (see Reference Sequence discussions).

Detailed recommendations


| Top of page | Homepage | Check-list | Symbols, codons, etc. |
| Recommendations:  generalDNARNAprotein, uncertain |
| Discussions | FAQ's | History |
| Example descriptions:  QuickRefDNARNAprotein |

Copyright © HGVS 2010 All Rights Reserved
Website Created by Rania Horaitis, Nomenclature by J.T. Den Dunnen - Disclaimer