How the IPD-IMGT/HLA Database assigns different suffixes to HLA alleles
Suffixes are applied to certain alleles to indicate various levels of expression. There is no single definitive rule for assigning each of these suffixes, their use is guided by established principles based on expression studies and our current understanding of allele expression. As new data become available, suffix assignments may be revised. A list of alleles that have undergone suffix changes and the reason for the change is available here: https://hla.alleles.org/pages/genes/suffix_changes/.
The `Q` suffix is used with alleles that are considered to have questionable expression. We believe that their sequence may or may not be expressed, sufficient work has not been carried out to determine this yet. This can be caused by a number of different types of mutations, a few examples are below:
In this example, the next ATG is located in the leader sequence, we therefore assume this methionine will be the next start codon. As this is only 3 amino acids down from the canonical start, it may be expressed. If expression analysis is not carried out to determine if the allele is expressed, we will assign this allele a `Q` suffix.
A list of all of currently named `Q` alleles is available here.
Alleles with splice site mutations typically have a mutation in the intron side of an exon/intron boundary. This affects how the sequenced is spliced, which may result in exonic sequence being removed or intronic sequence being added. As we are unsure of the effect of these mutations, we will typically assign a `Q` suffix to alleles with a mutation in the first two or last two bases of an intron, as this is considered to be the region which affects splicing in HLA alleles.
Like with other alleles, submitters to the database will carry out the necessary work and find out the translated sequence, or carry out expression analysis to determine if the allele is expressed or not, which allows us to mark this up more accurately in the database.
HLA-DQB1 is a unique case amongst HLA genes. As exon 5 is variably expressed, this is the result of a mutation in the splice site preceeding exon 5, but we know this does not affect expression, and is found in many HLA-DQB1 alleles. As a result, we do not assign a `Q` suffix to these alleles.
The `N` suffix is used for alleles which have been shown or are inferred to not be expressed. This is either due to work carried out by the submitter to show that the allele is not expressed, or if the sequence in the allele is believed to result in a null allele as a result of its inferred protein structure. This can be caused by a number of different types of mutations, a few examples are below:
Some null alleles appear to share identical amino acid sequence up to the stop codon, but have different assigned protein names. This is due to the inferred sequence after the stop codon being different. An example of which is below, where the amino acid sequence is identical, however, the sequence after the stop codon differs drastically. Which is why they have received different protein names.
A list of all of currently named `N` alleles is available here.
These alleles have mutations which result in either Low expression, they are secreted etc etc, work has been carried out to show the aberrant expression compared to it being a questionable, null or normally expressed allele.
The suffix `L` is used to indicate an allele which has been shown to have `Low` cell surface expression when compared to normal levels. The `S` suffix is used to denote an allele specifying a protein which is expressed as a soluble, `Secreted` molecule but is not present on the cell surface. The `C` suffix is assigned to alleles that produce proteins that are present in the `Cytoplasm` and not on the cell surface. An `A` suffix indicates an `Aberrant` expression where there is some doubt as to whether a protein is actually expressed.
As of December 2025 no alleles have been named with the `C` or `A` suffixes.
Last updated: 08-Jan-2026