HLADB-2.28.0-January 2010 HLA-H Genomic Nucleotide Sequence Alignments Sequences Aligned: 15 January 2010 Steven GE Marsh, Anthony Nolan Research Institute. 5' UTR gDNA -300 | H*01010101 GCACAGGAGG AGCGGGGTCA GGGCGAAGTC CCAGGGCCCC A.GGCGTGGC TCTCAGGGTC TCAGGCCCCG AAGGCGGTGT ATGGATTGGG GAGGCCCCGC H*01010102 ---------- ---------- ---------- ---------- -C-------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- --------.. -.-------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- --------.. -.-------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- ---------- --T------- gDNA -201 | H*01010101 CTTGGGGATT CGCCACCTCC GCAGTTTCTC TTCTTCTCAC AACCTGCGAC GGGTCCTTTT TCCTGGATAC TCAGGAAGCG GGCACAGTTC TCATTCCCAC H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---C------ ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---C------ ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---C------ ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---C------ ---------- ---------- H*0204 ---A------ ---------- ---------- ---------- ---------- --------C- ----T----- ---C------ -A-------- ---------- gDNA -101 | H*01010101 TAGGTGTCGG GTTTCTAGAG AAGCCAATCG GTGCCGCCGC GGTCCCGGTT CTAAAGTCCC CACGCACCCA CCGGGACTCA GATTCTCCCC AGACGCCGAG H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- --------.- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- | Exon 1 Exon 1 | Intron 1 gDNA -1 | 1 | | | H*01010101 G | ATGGTGCTC ATGGCGCCCC GAACCCTCCT CCTGCTGCTC TCAGGGGCCC TGGCCCTG.. ....ACCCAG ACCTGGGCGC | GTGAGTGCAG GGTCTGCAGG H*01010102 - | --------- ---------- ---------- ---------- ---------- --------.. ....------ ---------- | ---------- ---------- H*01010103 - | --------- ---------- ---------- ---------- ---------- --------.. ....------ ---------- | ---------- ---------- H*0102 - | --------- ---------- ---------- ---------- ---------- --------.. ....------ ---------- | ---------- ---------- H*02010101 - | --------- ---------- ---------- ---------- ---------- --------.. ....------ ---------- | ---------- ---------- H*02010102 - | --------- ---------- ---------- ---------- ---------- --------.. ....------ ---------- | ---------- ---------- H*0202 - | --------- ---------- ---------- ---------- ---------- --........ ....------ ---------- | ---------- ---------- H*0203 - | --------- ---------- ---------- ---------- ---------- --........ ....------ ---------- | ---------- ---------- H*0204 - | --------- ---------- ---------- ---------- ---------- --------AC CCTG------ ---------- | ---------- ---------- gDNA 94 | H*01010101 GAAATGGTCG GGAGGAGCGA GGGGCCCGCC CGGCGGGGG. CGCAGGACCC AGGGAGCCGC GCAGGGAGGA GGGTCGGGCG GGTCTCAGCT CCTCCTCGCT H*01010102 ---------- ---------- ---------- ------C-.. ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------. ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------. ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------. ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------. ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------. ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------. ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ---------. ---------- G----T---- ---------- ---------C ---------- ---------- Intron 1 | Exon 2 gDNA 193 | | | H*01010101 CCCAG | GCTCC CACTCCATGA GGTATTTCTA CACCACCATG TCCCGGCCCG GCCGCGGGGA GCCCCGCTTC ATCTCCGTCG GCTACGTGGA CGATACGCAG H*01010102 ----- | ----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ----- | ----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ----- | ----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ----- | ----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ----- | ----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ----- | ----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -T-------- H*0203 ----- | ----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ----- | ----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- gDNA 293 | H*01010101 TTCGTGCGGT TCGACAGCGA CGCCGCGAGC CAGAGGATGG AGCCGCGGGC GCCGTGGATG GAGCGGGAGG GGCCGGAGTA TTGGGACCGG AACACACAGA H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- --A------T -C---AGA-- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- --A------T -C---AGA-- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- --A------T -C---AGA-- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- --A------T -C---AGA-- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- --A------T -C---AGA-- ---------- ---------- ---------- ----A----- ---------- ---------- Exon 2 | Intron 2 gDNA 393 | | | H*01010101 TCTGCAAGGC CCAGGCACAG ACTGAACGAG AGAACCTGCG GATCGCGCTC CGCTACTACA ACCAGAGCGA GGGCG | GTGAG T.GACCCCGG CCCGGGACGC H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ----- | ----- -.-------- -----.---- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ----- | ----- -.-------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- -T-------- ---------- ----- | ----- -.-------- ---------- H*02010101 ---------- ---A----G- ---------- ---------- ---------- ---------- ---------- ----- | ----- -.-------- ------G--- H*02010102 ---------- ---A----G- ---------- ---------- ---------- ---------- ---------- ----- | .---- -.-------- ------G--- H*0202 ---------- ---A----G- ---------- ---------- ---------- ---------- ---------- ----- | ----- -.-------- ---------- H*0203 ---------- ---A----G- ---------- ---------- ---------- ---------- ---------- ----- | ----- -.-------- ---------- H*0204 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ----- | ----- -.-------- ------G--- gDNA 492 | H*01010101 AGGTCACGAC CCCTCCCCAT CCCCCACGGA GGGCCGGGTC GCCTCGAGTC TCTGGGTCCG AGATCCTCCC CGAAACCGCG GGACCCCGAG ACCCTTGACC H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ---------- ---------- ---------- ------A--- ---------- ---------- ---------- gDNA 592 | H*01010101 TGGGAGAGGC CCAGGCGCCT TTACCCGGTT TCATTTTCAG TTTAGGCCAA AATCCCCGCG GGTTGGTCGG GGCAGGGCGG GGCTCGGGGG ACCGGGCTGA H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- -------.-- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- --------C- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- -.------C- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ---------- ---------- --C------- ---------- ---------- ---------- ---------- Intron 2 | Exon 3 gDNA 692 | | | H*01010101 CCGCGGGGGC GGGGCCAG | GT TCTCACACCA TGCAGGTGAT GTATGGCTGC GACGTGGGGC CCGACGGGCG CTTCCTCCGC GGGTATGAAC AGCACGCCTA H*01010102 ---------- -------- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---.---- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- -------- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- -------- | -- ---------- ---------- ---------- ---------- ---------- -------T-- ---------- ---------- H*02010102 ---------- ---.---- | -- ---------- ---------- ---------- ---------- ---------- -------T-- ---------- ---------- H*0202 ---------- -------- | -- --C------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- -------- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- -------- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- gDNA 792 | H*01010101 CGACAGCAAG GATTACATCG CTCTGAACGA GGACCTGCGC TCCTGGACCG CGGCGGACAT GGCAGCTCAG ATCACCAAGC GCAAGTGGGA GGCGGCCCGT H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ----G----- ---------- -C-------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- Exon 3 | Intron 3 gDNA 892 | | | H*01010101 CAGGCGGAGC AGCTGAGAGC CTACCTGGAG GGCGAGTTCG TGGAGTGGCT CCGCAGATAC CTGGAGAACG GGAAGGAGAC GCTGCAGCGC GCGG | GTACCA H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---- | ------ H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---- | ------ H*0102 -G-------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---- | ------ H*02010101 -G-------- ---G-----T ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---- | ------ H*02010102 -G-------- ---G-----T ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---- | ------ H*0202 GT-------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---- | ------ H*0203 -G-------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---- | ------ H*0204 -G-------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---- | ------ gDNA 992 | H*01010101 GGGGCCACAG GGCGCCTCCC GGATGGCCTG TAGATCTCCG GGGCTGGCCT CCCACAAGAA AGGGAGACAA ATGGGACCAA CACTATAATA TCGCCCTCCC H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ----C----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ----C----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ----C----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ----C----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- gDNA 1092 | H*01010101 TCTGGTCCTG AGGGAGAAGA ATCCTCCTGG GTTTCCAGAG AGTGACTCTG AGGGTCCGCC GTGCTCTTTG ACACAATTAA GGGATGAAAT CTCTGAGGAA H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- -------C-- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- -------C-- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- -------C-- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- -------C-- ---------- ---------- ---------- H*0204 -------T-- ---------- ---------- ---------- ---------- ---------- C------C-- ---------- ---------- --G------- gDNA 1192 | H*01010101 ATGAAGGGAA GACAATCCCT GGAATACTGA TGAGTGGTTC CCTTTGACAC TGGCAGCAGC CTTGGGCCCC GTGACTTTTC CTCTCAGGCC TTGTTCTCTG H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- .--------. ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- --------G- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- --------G- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- T--------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- T--------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- gDNA 1292 | H*01010101 CTTCACACTC AATGTGCCTG GGGGTCTGAG TCCAGCTCTT CTGAGTCCCT CAGCCTCCAC TCAGGTCAGG ACCAGAAGTC GCTGTTCCCT CCTCAGGGAC H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ------TG-- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ------TG-- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- -------G-- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -T-------- gDNA 1392 | H*01010101 TAGAATTTTC CACGGAATAG GAGATTATCC CAGGTGCCTG TGTCCAGGCT GTTGTCTGGG TTCTGTGCTC CCTTCCCCAC CCCAGGCGTC CTGTCCATTC H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---T------ -----T-TG- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -------A-- ---------- Intron 3 | Exon 4 gDNA 1492 | | | H*01010101 TCAAGATGGC CACATGCGTG CTGGTGGAGT GTCCCATGAC AGATGCAAAA TGCCTGAATT TTCTGACTCT T.CCTGTCAG | ACCCCCCC.A AGACACATAT H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- -.-------- | --------.- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- -.-------- | --------.- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- -.-------- | --------.- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- -.--C----- | --------.- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- -.--C----- | --------.- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- -.--C----- | --------.- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- ---------- -.--C----- | --------.- ---------- H*0204 ---------- ---------- ----A----- ---------- ---------- ---------- ---------- -.--C----- | --------C- ---------- gDNA 1590 | H*01010101 GACCCACCAC CCCATCTCTG ACCATGAGGC CACCCTGAGG TGCTGGGCCC TGGGCTTCTA CCCTGCGGAG ATCACACTGA CCTGGCAGCG GGATGGGGAG H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 -------T-- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 -------T-- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- gDNA 1690 | H*01010101 GACCAGACCC AC.ACACGGA GCTCATGGAG ACCAGGCCTG CAGGGGATGG AACCTTCCAG AAGTGGGCGG CTGTGGTGGT GCCTTCTGGA GAGGAGCAGA H*01010102 ---------- --.------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- --.------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- --.------- ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- --.------- ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- --.------- ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- --.------- ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- --.------- ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- -GG------- ----G----- ---------- ---------- ---------- ---------- ---------- ---------- ---------- Exon 4 | Intron 4 gDNA 1789 | | | H*01010101 GATACACCTG CCATGTGCAG CATGAGGGTC TGCCAGAGCC CCTCACCCTG AGATGGG | GTA AGGAGGGAGA TGGGGGTGTC ATGTCCCTTA GGGAAAGC.. H*01010102 ---------- ---------- ---------- ---------- ---------- ------- | --- ---------- ---------- ---------- --------TC H*01010103 ---------- ---------- ---------- ---------- ---------- ------- | --- ---------- ---------- ---------- --A-----.. H*0102 ---------- ---------- ---------- ---------- ---------- ------- | --- ---------- ---------- ---------- --------.. H*02010101 ---------- ---------- ---------- ---------- ---------- ------- | --- ---------- ---------- ---------- --------.. H*02010102 ---------- ---------- ---------- ---------- ---------- ------- | --- ---------- ---------- ---------- --------.. H*0202 ---------- ---------- ---------- ----C----- ---------- ------- | --- ---------- ---------- ---------- --------.. H*0203 ---------- ---------- ---------- ----C----- ---------- ------- | --- ---------- ---------- ---------- --------.. H*0204 ---------- ---------- ---------- ----C----- ---------- ------- | --- ---------- ---------- -----T---- --------.. Intron 4 | Exon 5 gDNA 1887 | | | H*01010101 ...CGGAG.C CTCTCTGGAG AGCTTTAGCA GGGTCAGGGT CCCTCACCTT CCCCCCTTTT CCCAG | AGCCA TCTTCCCAGC CCACCATCCC CATCGTGGGC H*01010102 GAC-----.- ---------- ---------- ---------- ---------- ---------- ----- | ----- ---------- ---------- ---------- H*01010103 ...-----.- ---------- ---------- ---------- ---------- ---------- ----- | ----- ---------- ---------- ---------- H*0102 ...-----.- ---------- ---------- ---------- ---------- ----T----- ----- | ----- ---------- ---------- ---------- H*02010101 ...-A---.- ---------- ---------- ---------- ---------- ---------- ----- | ----- ---------- ---------- ---------- H*02010102 ...-A---.- ---------- ---------- ---------- ---------- ---------- ----- | ----- ---------- ---------- ---------- H*0202 ...-----.- ---------- ---------- ---------- ---------- ---------- ----- | ----- ---------- ---------- ---------- H*0203 ...-----.- ---------- ---------- ---------- ---------- ---------- ----- | ----- ---------- ---------- ---------- H*0204 ...-----.- ---------- ---------- ---------- ---------- ---------- ----- | ----- ---------- ---A-G---- ---------- Exon 5 | Intron 5 gDNA 1983 | | | H*01010101 ATCGTTGCTG GCCTGGTTCT ACTTGTAGCT GTGGTCACTG GAGCTGTGGT CGCTGCTGTA ATGTGGAGGA AGAAGAGCTC AG | GTAAGGAA GGGGTGAGGA H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -- | -------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -- | -------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -- | -------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -- | -------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -- | -------- ---------- H*0202 ---A------ ---------- ---------- ---------- ---------- -A-------- ---------- ---------- -- | -------- ---------- H*0203 ---A------ ---------- ---------- ---------- ---------- -A-------- ---------- ---------- -- | -------- ---------- H*0204 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -- | -------- ---------- gDNA 2083 | H*01010101 GTGTGGTCTG AGA.TTTCTT GTCTCACTGA GAGTTCCAAG CCCCAGGTAG AAGTGCCCTG CCTGGTTACT GGGAAGCACC ATCCACACTC ATGGGCCTAC H*01010102 ---------- ---.------ ---------- ---------- ---------C ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---.------ ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---.------ ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---.------ ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---.------ ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---.------ ---------- ---------- ---------- ---G------ ---------- ---------- ---------- ---------- H*0203 ---------- ---.------ ---------- ---------- ---------- ---G------ ---------- ---------- ---------- ---------- H*0204 ---------- ---.------ ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- gDNA 2182 | H*01010101 CCAGCCTGGG CCCTGTGTGC CAGCACTTAC TCTTTTGTAA AGCACCTGTT ACAATGAGGG ACAGATTTAT CACCTTGATG ACTGTGGTGA TGGGACCTGA H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ----C----- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ----C----- ---------- ---------- ---------- ---------- H*0204 ---------- --G------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- gDNA 2282 | H*01010101 TCCCAGCAGT CACAAGTCAC AGGGGAAGGT CCCCGAGGAC AGACCTCAGA AGGGCGGTTG GTCCAGGACC CACATCTGCT TTCCTCATGT TTCCTGATCC H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ----A----- ---------- ---------- ---------- --T------- ---T------ ---------- gDNA 2382 | H*01010101 CGCCCTGGGT CTGCAGTTGC ACATTTCTGG AAACTTCTCT GGGGTCCAAG ACTTGGAGGT T.CCTCTAGG ACCTTATGGC CCTGGCTTCT TTCTGGCATC H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- -T-------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- -.-------- ---------- ---------- ---------- H*0204 -A-------- ---------- ---------- ---------- -------G-- ---------- -.-------- ---------- ---------- ---------- Intron 5 | Exon 6 Exon 6 | Intron 6 gDNA 2481 | | | | | H*01010101 TCACAGGACA TTTTCTTCCC ACAG | ATAGAA AAGGAGGGAG CTACTCTCAG GCTGCAA | GTA AGTATGAAGG AGGCTGATCC CTGAAATCCT TTGGATATTG H*01010102 ---------- ---------- ---- | ------ ---------- ---------- ------- | --- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---- | ------ ---------- ---------- ------- | --- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---- | ------ ---------- ---------- ------- | --- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---- | ------ ---------- ---------- ------- | --- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---- | ------ ---------- ---------- ------- | --- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---- | ------ ---------- ---------- ------- | --- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---- | --T--- ---------- ---------- ------- | --- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---- | ------ ---------- ---------- ------- | --- ---------- ---------- ---------- ---------- Intron 6 | Exon 7 gDNA 2581 | | | H*01010101 TGTTTGGGAG CCCATGGGGG AGCTCACCCA CCCCACAATT CTTCCTCTAG CCACATCTAC TGTGGGATCT GACCAGGTCC TGTTTTTATT CTACTCCAG | G H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- --------- | - H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- --------- | - H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- --------- | - H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- --------- | - H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- --------- | - H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- --------- | - H*0203 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- --------- | - H*0204 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ------G-- | - Exon 7 | Intron 7 gDNA 2681 | | | H*01010101 CGGCAACAGT GCCCAGGGCT CTGATGTGTC TCTCACGGCG TGAAAG | GTGA GACCTTGGGG GGCCTGATGT GTGGGGGGTG TTGGGGGGGA ACAGTGGACA H*01010102 ---------- ---------- ---------- ---------- ------ | ---- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ------ | ---- ---------- ---------- ---------- ---------- ------.--- H*0102 ---------- ---------- ---------- ---------- ------ | ---- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ------A--- ------ | ---- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ------A--- ------ | ---- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ------ | ---- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ------ | ---- ---------- ---------- ---------- ---------- ---------- H*0204 -A-------- ---------- ---------- ---------T ------ | ---- ---------- ---------- -------A-- ---------- ---------- gDNA 2781 | H*01010101 CAGCTGTGCT ATGGGGTTCT TTGAATTTGA TGTTTTGAGC ATGCGATGGG CTGCCAAAGT GTCATCCATT ACTGGGACAG ATATGAATTT GTTCATGAAT H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0204 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- Intron 7 | Exon 8| 3' UTR gDNA 2881 | | | | | H*01010101 ATTTTTTCTA TAG | TGTGA | GA CAGCTGCCTT GTGTGGGACT GAGAGGCAAG ATTTGTTCAC ACCTTCCCTT TGTGACTTGA AGAACCCTGA CTTTCT.... H*01010102 ---------- --- | ----- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ------.... H*01010103 ---------- --- | ----- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ------.... H*0102 ---------- --- | ----- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ------.... H*02010101 ---------- --- | ----- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ------.... H*02010102 ---------- --- | ----- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ------.... H*0202 ---------- --- | ----- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ------.... H*0203 ---------- --- | ----- | -- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ------.... H*0204 ---------- --- | ----- | -- ---------- ---------- ---------- -G------CT G--------- ---------- ---------- ------TTCT gDNA 2977 | H*01010101 GCAAAGGCAC CTGAATGTGT CTGTGTTCCT GTAGGCATAA TGTGTGGAGG AGGGGAGACC AACCCACCCT CATGTCCACC ATGACCCTCT TCCCCACGCT H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- -----T---- H*01010103 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---******* H*0204 A--------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- gDNA 3077 | H*01010101 GATCTGTGTT CCCTCCCCAA TCATCTTTCC TGTTCCAGAG AGGCGGGGCT GAGATGTCTC CATCTTTTTC TCAACTTTAT GTGCACTGAG CTGTAACTTC H*01010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*01010103 ---------- ---------- ---------- ---------- ..-------- ---------- ---------- ---------- ---------- ---------- H*0102 ********** ********** ********** ********** ********** ********** ********** ********** ********** ********** H*02010101 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*02010102 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---.------ ---------- --------.- H*0202 ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- ---------- H*0203 ********** ********** ********** ********** ********** ********** ********** ********** ********** ********** H*0204 ---------- ---------- ---------- ---------- ---A------ ---------- ---------- ---------- ---------- ---------- gDNA 3177 | H*01010101 TTACTTCCCT CTTAAAATTA GA H*01010102 ---------- ---------- -- H*01010103 ---------- ---------- -- H*0102 ********** ********** ** H*02010101 ---------- ---------- -- H*02010102 -----.---- ---------- -- H*0202 ---------- ---------- -- H*0203 ********** ********** ** H*0204 ---------- ---------- --