Low Diversity of Major Histocompatibility Complex (MHC) Genes in Endangered Malayan Tapir (Tapirus indicus)
Nurul Adilah Ismail
Department of Biology, Faculty of Science, Universiti Putra Malaysia, 43400 UPM Serdang, Selangor Darul Ehsan, Malaysia.
Christina Seok Yien Yong
Department of Biology, Faculty of Science, Universiti Putra Malaysia, 43400 UPM Serdang, Selangor Darul Ehsan, Malaysia.
Simon Yung Wa Sin
School of Biological Sciences, The University of Hong Kong, Pok Fu Lam Road, Hong Kong SAR.
Geetha Annavi
Department of Biology, Faculty of Science, Universiti Putra Malaysia, 43400 UPM Serdang, Selangor Darul Ehsan, Malaysia.
Communicated by Jen-Pan Huang
The Malayan tapir (Tapirus indicus) is listed as Endangered on the IUCN Red List because multiple threats, such as habitat loss and human disturbance, have led to its population decline. This decline increases the risk of inbreeding, which could reduce genome-wide genetic variation and negatively affect the genes responsible for the immune response, i.e., the major histocompatibility complex (MHC) genes. Class I and II MHC genes encode MHC molecules that recognise pathogenic peptides and present them on the cell surface to T cells for the adaptive immune response. However, no study has yet examined the MHC genes of the Malayan tapir. This study characterises the MHC class I and II genes from seven individuals and investigates evidence of balancing selection and the relationships of these genes with homologous genes of other species. We identified at least one class I gene and four class II genes. Five sequences of the alpha1 (α1) and four of the alpha2 (α2) domains of class I alleles, and two DRA, two DQA, three DRB and three DQB class II alleles, were isolated. The α1 and α2 domains of class I and the DRB domain of class II displayed evidence of selection, with a higher rate of non-synonymous than synonymous substitutions. Within the DRB gene, 24 codons were found to be under selection, 10 of which are among the codons forming the antigen-binding site (ABS). Gene sequences formed species-specific monophyletic groups, except for the class I and DRB genes, whose sequences were interspersed with those of other species in the phylogenetic trees, which may indicate trans-species polymorphism of allelic lineages. More studies using RNA samples are needed to determine the expression levels of these genes.
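To illustrate the distinction between synonymous and non-synonymous substitutions mentioned in the abstract, the following is a minimal sketch and not the method used in the study. It only counts differing codons between two aligned, in-frame sequences and labels each as synonymous (same amino acid) or non-synonymous (different amino acid). It does not normalise by the number of synonymous and non-synonymous sites, so it does not yield a dN/dS rate ratio. The function name and the toy sequences are hypothetical.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def classify_codon_differences(seq_a, seq_b):
    """Return (synonymous, non_synonymous) counts of differing codons.

    Both sequences must be aligned, in frame, and of equal length.
    Codons containing gaps or ambiguous bases are skipped.
    """
    if len(seq_a) != len(seq_b) or len(seq_a) % 3 != 0:
        raise ValueError("sequences must be aligned and a multiple of 3 long")
    syn = nonsyn = 0
    for i in range(0, len(seq_a), 3):
        codon_a = seq_a[i:i + 3].upper()
        codon_b = seq_b[i:i + 3].upper()
        if codon_a == codon_b:
            continue  # identical codons carry no substitution
        if codon_a not in CODON_TABLE or codon_b not in CODON_TABLE:
            continue  # gap or ambiguous base: not classified
        if CODON_TABLE[codon_a] == CODON_TABLE[codon_b]:
            syn += 1
        else:
            nonsyn += 1
    return syn, nonsyn


# Toy example (hypothetical sequences, not data from the study):
# GCT -> GCC keeps alanine; AAA -> AGA changes lysine to arginine.
counts = classify_codon_differences("GCTAAAGGG", "GCCAGAGGG")
```

An excess of non-synonymous changes at antigen-binding codons is the kind of pattern usually read as a sign of selection, but a formal test requires the per-site rates that this sketch omits.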
Supplementary materials
Amino acid sequence identity for the Malayan tapir (Tapirus indicus) class I exon 2 clones, rhinoceros (Diceros bicornis), equids (Equus caballus), human (Homo sapiens) and cattle (Bos taurus). The GenBank accession numbers for α1 (exon 2) sequences from other mammals are AF055346 (Diceros_bicornis_DibiUA01), DQ083407 (Equus_caballus_Eqca100101), DQ145597 (Equus_caballus_Eqca100201), GU812295 (Homo sapiens_HLA_A01010101) and L02834 (Bos_taurus_classIBoLA). Numbers above the sequence indicate the codon position in the α1 domain. Single letters and dots represent amino acids that are distinct from or identical to the Tapirus indicus sequence for this domain, respectively. Dashes indicate missing sequences. Putative ABSs were defined according to Reche and Reinherz (2003) and are marked with an asterisk above the sequence.
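The dot notation used in these alignment figures can be sketched as follows. This is an illustrative example under stated assumptions, not the tool used to produce the supplementary figures: the function names and the short peptide strings are hypothetical, and the sequences are assumed to be already aligned to equal length.

```python
def dot_identity(reference, other):
    """Render `other` against `reference` in dot notation.

    '.' marks a residue identical to the reference, a letter marks a
    residue that differs, and '-' marks a missing position in `other`.
    """
    if len(reference) != len(other):
        raise ValueError("sequences must be aligned to the same length")
    return "".join(
        "-" if o == "-" else "." if o == r else o
        for r, o in zip(reference, other)
    )


def percent_identity(reference, other):
    """Percentage of identical residues over positions present in both."""
    compared = [(r, o) for r, o in zip(reference, other)
                if r != "-" and o != "-"]
    if not compared:
        return 0.0
    return 100.0 * sum(r == o for r, o in compared) / len(compared)


# Toy example (hypothetical peptides, not data from the study).
rendered = dot_identity("GSHSMRY", "GSHSLRY")
identity = percent_identity("GSHS", "GSHT")
```

Positions that are missing in either sequence are excluded from the identity percentage here; other conventions count them as mismatches, so the choice should be stated when reporting values.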
Amino acid sequence identity for the Malayan tapir (Tapirus indicus) class I exon 3 clones, rhinoceros (Rhinoceros unicornis, Diceros bicornis, and Ceratotherium simum), equids (Equus caballus), and cattle (Bos taurus). The GenBank accession numbers for α2 (exon 3) sequences from other mammals are AJ133670 (R_unicornis_classI), AJ055348 (D_bicornis_DibiUB02), XM_014795072 (Ceratotherium_simum_classi), DQ083407 (E_caballus_Eqca100101), DQ083408 (E_caballus_Eqca200101), and L02834 (Bos_taurus_classI_BoLA). Numbers above the sequence indicate the codon position in the α2 domain. Single letters and dots represent amino acids that are distinct from or identical to the Tapirus indicus sequence for this domain, respectively. Dashes indicate missing sequences. Putative ABSs were defined according to Reche and Reinherz (2003) and are marked with an asterisk above the sequence.
Amino acid sequence identity for the Malayan tapir (Tapirus indicus) class II exon 2 DRA clones, available Malayan tapir GenBank sequences (Tapirus indicus), Baird's tapir (Tapirus bairdii), rhinoceros (Diceros bicornis, Rhinoceros unicornis and Ceratotherium simum), and equids (Equus caballus). The GenBank accession numbers for the comparison DRA (exon 2) sequences are KM347953 (T_indicus_Tain-DRA-0104), KM347956 (T_indicus-Tain-DRA-0106), AF113547 (T_bairdii_Taba-DRA-0101), AF113549 (D_bicornis_DRA-0101), AF113554 (R_unicornis_DRA-0501), AF113553 (Cera_simum_DRA-0401), and JQ254081 (E_caballus_Eqca-DRA-00102). Numbers above the sequence indicate the codon position in the DRα domain. Single letters and dots represent amino acids that are distinct from or identical to the Tapirus indicus sequence for this domain, respectively. Dashes indicate missing sequences. Putative ABSs were defined according to Reche and Reinherz (2003) and are marked with an asterisk above the sequence.
Amino acid sequence identity for the Malayan tapir (Tapirus indicus) class II exon 2 DQA clones, human (Homo sapiens), equids (Equus caballus), bovine (Bubalus bubalis), boar (Sus scrofa), and coyote (Canis latrans). The GenBank accession numbers for DQA (exon 2) sequences from other mammals are L3402 (HLA_DQA101011), JQ254060 (E_caballus_EqcaDQA100101), JQ254067 (E_caballus_EqcaDQA200202), KT428703 (B_bubalis_BubuDQA2103), AY285931 (Sus_scrofa_DQA1y) and AY126647 (Canis_latrans_DQA01701). Numbers above the sequence indicate the codon position in the DQα domain. Single letters and dots represent amino acids that are distinct from or identical to the Tapirus indicus sequence for this domain, respectively. Dashes indicate missing sequences. Putative ABSs were defined according to Reche and Reinherz (2003) and are marked with an asterisk above the sequence.
Amino acid sequence identity for the Malayan tapir (Tapirus indicus) class II exon 2 DRB clones, equids (Equus caballus) and human (Homo sapiens). The GenBank accession numbers for DRB (exon 2) sequences from other mammals are JQ254085 (E_caballus_DRB100101), JQ254084 (E_caballus_DRB100201), JQ254086 (E_caballus_DRB100301), and AF029288 (Homo_sapiens_DRB10101). Numbers above the sequence indicate the codon position in the DRβ domain. Single letters and dots represent amino acids that are distinct from or identical to the Tapirus indicus sequence for this domain, respectively. Dashes indicate missing sequences. Putative ABSs were defined according to Reche and Reinherz (2003) and are marked with an asterisk above the sequence.
Amino acid sequence identity for the Malayan tapir (Tapirus indicus) class II exon 2 DQB clones, human (Homo sapiens), equids (Equus caballus), bovine (Bos taurus), boar (Sus scrofa) and dog (Canis familiaris). The GenBank accession numbers for DQB (exon 2) sequences from other mammals are L34101 (HLA_DQB050101), JQ254070 (E_caballus_EqcaDQB100101), JQ254069 (E_caballus_EqcaDQB100201), DQ093609 (Bos_taurus_BoLa_DQB), AY459300 (Sus_Scrofa_SLADQB1ax), and AF016905 (Canis_familiaris_DQB10010). Numbers above the sequence indicate the codon position in the DQβ domain. Single letters and dots represent amino acids that are distinct from or identical to the Tapirus indicus sequence for this domain, respectively. Dashes indicate missing sequences. Putative ABSs were defined according to Reche and Reinherz (2003) and are marked with an asterisk above the sequence.
Details of individuals included in this study.
Akaike Information Criterion (AIC) values for alignments of Malayan tapir MHC genes with those of other species, used for model selection in the phylogenetic analysis. The model with the lowest AIC value was selected for phylogenetic tree construction. lnL = log-likelihood value.
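The selection rule described in the AIC table caption can be sketched as follows, using the standard definition AIC = 2k - 2 lnL, where k is the number of free parameters. This is a minimal illustration: the model names, log-likelihoods and parameter counts below are hypothetical and are not values from the study, and the software the authors used may apply a corrected variant of the criterion.

```python
def aic(log_likelihood, n_params):
    """Akaike Information Criterion: 2k - 2 lnL."""
    return 2 * n_params - 2 * log_likelihood


def select_model(candidates):
    """Pick the candidate with the lowest AIC.

    `candidates` maps a model name to a (lnL, k) tuple.
    Returns the best model name and the AIC score of every candidate.
    """
    scores = {name: aic(lnl, k) for name, (lnl, k) in candidates.items()}
    best = min(scores, key=scores.get)
    return best, scores


# Hypothetical substitution models with made-up lnL and parameter counts.
candidates = {
    "JC": (-1250.0, 10),
    "K2": (-1240.0, 11),
    "GTR+G": (-1238.5, 16),
}
best_model, aic_scores = select_model(candidates)
```

In this toy case the richest model has the highest likelihood but is not selected, because the penalty of 2 per extra parameter outweighs its small gain in lnL.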