WMDA Directory

HLA Nomenclature in WMDA file format

At the request of the IT Working Group of the World Marrow Donor Association (WMDA), we are making a number of computer readable files available. These files will document the official WHO HLA Nomenclature, the relationships between serologically defined antigens, and the relationships between HLA allele sequences and their serologically defined antigens. Updated versions of these files will be released every three months, at the same time as new versions of the IMGT/HLA Database become available.

All five files have a short header with four lines of information, indicating:

hla_nom.txt (Download the hla_nom.txt file )

This file contains details of all current and deleted HLA antigens and alleles, and is sorted by locus and antigen/allele number:

Please note that for HLA Antigen names assigned before November 1987, the dates given are only approximate.

This file includes six fields of information, each separated by a semi-colon (;).

HLA Locus HLA Antigen or Allele name Date Assigned (YYYYMMDD) Date Deleted (YYYYMMDD) Deleted Antigen/Allele Identical to Reason for Deletion
A 1 19680101


A* 0105N 19990216 20010717 01:04N Sequence Identical

 

hla_nom_g.txt (Download the hla_nom_g.txt file )

HLA alleles that have identical nucleotide sequences across the exons encoding the peptide binding domains (exon 2 and 3 for HLA class I and exon 2 only for HLA class II alleles) will be designated by an upper case ‘G’ which follows the first 3 fields of the allele designation of the lowest numbered allele in the group. The full list of these groups is available at the G Groups page. A computer readable version is also available, this file includes three fields of information, each separated by a semi-colon (;).

HLA Locus Alleles within designated group G group name (if available)
A* 01:01:01:01/01:01:01:02N/01:04N/01:22N/01:32/01:34N/01:37/01:45 01:01:01G
A* 01:01:02  

Please note that several DRB alleles are not sequenced for the dimorphic nucleotide at position 357. It would therefore be inaccurate to assign a base to these sequences at this position. For this reason the sequences have been truncated to include only nucleotides 101 to 356, in the DRB analysis.

hla_nom_p.txt (Download the hla_nom_p.txt file )

This file contains details of all HLA Sequences having the same antigen binding domains. This analysis is performed on the polypeptide sequence, and for HLA Class I alleles, identity in the 'antigen binding domains' is based on identical protein sequences as encoded by exons 2 and 3. For HLA Class II alleles this is based on identical protein sequences as encoded by exon 2. HLA alleles having nucleotide sequences that encode the same protein sequence for the peptide binding domains (exon 2 and 3 for HLA class I and exon 2 only for HLA class II alleles) will be designated by an upper case ‘P’ which follows the 2 field allele designation of the lowest numbered allele in the group. The full list of these groups is available at the P Groups page. A computer readable version is also available, this file replaces the file previously called abdm.txt and includes three fields of information, each separated by a semi-colon (;).

HLA Locus Alleles within designated group P group name (if available)
A* 01:01:01:01/01:01:02/01:01:03/01:01:04/01:01:05/01:01:06/01:01:07/01:01:08/01:01:09/
01:01:10/01:01:11/01:01:12/01:01:13/01:32/01:37/01:45
01:01P
A* 01:02  

Several DRB alleles are not sequenced for the dimorphic nucleotide at position 357. It would therefore be inaccurate to assign a base to these sequences at this position. For this reason the sequences have been truncated to include only the translated sequence of nucleotides 101 to 356, in the DRB analysis.

rel_ser_ser.txt ( Download the rel_ser_ser.txt file )

This file lists the relationships between all current serologically defined HLA antigens: broad antigens, split antigens and associated antigens.

This file includes four fields of information, each separated by a semi-colon (;). The file lists only those antigens for which split or associated antigens exist. Multiple values are separated by a forward slash (/).

HLA Locus HLA Antigen name (digits only) Split HLA Antigens Associated HLA Antigens
A 2
203/210
A 9 23/24
A 10 25/26/34/66

 

rel_dna_ser.txt ( Download the rel_dna_ser.txt file )

This file contains details of all current HLA alleles and where known their unambiguous, possible or assumed serologically equivalent antigens.

Details of the unambiguous serology is defined from submissions to the WHO Nomenclature Committee for Factors of the HLA System (1) at the time an allele is submitted for naming, or from the WMDA HLA Dictionary 2008 (2). For Null alleles a value of zero (0) is given and for alleles with no corresponding antigen a question mark (?) is given.

In cases where an allele has been shown to be associated with more than one serologically defined antigen, these are indicated in the 'Possible Serology' field. Multiple values are separated by a forward slash (/). In cases where there is currently no information about the serological equivalent of an allele, the 'Assumed Serology' field contains the antigen equivalent as expected by the first two digits of the allele name.

This file contains details of all current HLA antigens and alleles, and is sorted by locus and allele number:

This file includes six fields of information, each separated by a semi-colon (;).

HLA Locus HLA Antigen/ Allele name Unambiguous Serology Possible Serology Assumed Serology Expert Assigned Exceptions
A* 01:01:01 1


A* 01:04N 0


A* 01:10

1
A* 02:01:01:02L
0/2

A* 02:03:01 203


B* 13:04
15/21
13
B* 83:01 ?


md5checksum.txt ( Download the md5checksum.txt file )

This file contains an md5 fingerprint for each file allowing verification of the downloads.

Release Archive

The release archive is now maintained as a git repository and available at https://github.com/ANHIG/IMGTHLA. This repository contains a branch for each database release and a Latest branch which contains the most recent files as well as all compressed archives.

 

References:

  1. SGE Marsh, ED Albert, WF Bodmer, et al. Nomenclature for Factors of the HLA System, 2010.
    Tissue Antigens (2010) 75 291-455
    International Journal of Immunogenetics. 2005 32:107-60
    Human Immunology. 2005 66:571-636
    Click here to download a copy of this report as published in Tissue Antigens is available courtesy of Wiley Blackwell.
  2. Holdsworth R, Hurley CK, Marsh SGE et al. The HLA Dictionary 2008: a summary of HLA-A, -B, -C, -DRB1/3/4/5, -DQB1 alleles and their association with serologically defined HLA-A, -B, -C, -DR and -DQ antigens.
    Tissue Antigens (2009) 73 95-170
    Click here to download a copy of this report as published in Tissue Antigens is available courtesy of Wiley Blackwell.