Disease background

Three human diseases, dyskeratosis congenita (DKC), aplastic anemia (AA), and idiopathic pulmonary fibrosis (IPF), have been linked to mutations within the genes that encode for the two telomerase essential core components, telomerase RNA (TR) and telomerase reverse transcriptase (TERT); telomerase-associated proteins, DKC1 encodes for dyskerin and Nola3 encodes for Nop10; and recently to one of the six proteins that for the shelterin complex, TINF2 encodes for TIN2. Heterogeneous mutations show that even half the dose of telomerase is insufficient to maintain telomere length, resulting in erosion and loss of function, senescence, and apoptosis. The maintenance of telomere length in highly prolific cells, germline and stem cells, is crucial for the preservation of high populations and human health. In general point mutations, which leads to a single amino acid substitution, are better tolerated than frame shift and splicing junction mutations, limiting but not abolishing telomerase activity. The toleration of reduction and loss of telomerase function decreases with each subsequent generation. This anticipation, or progression of symptoms within the next generation, is characteristic of telomerase-deficiency diseases. The telomeres of the parental generation erode and when passed to their offspring begin this generation with shorter telomeres. The increase in severity of symptoms is linked with the progressive decrease of telomere length.

Dyskeratosis congenita (DKC) is an inherited disorder with clinical manifestations of skin hyperpigmentation (dark patches of skin), oral leukoplakia (white spots inside the mouth), and nail dystrophy (lack of nails). The majority of deaths occur from bone marrow failure, immunodeficiency, pulmonary complications, and malignancies. The X-linked recessive form has severe clinical presentations and caused by mutations found within the DKC1 gene that encodes for the dyskerin protein. The autosomal dominant form of the disease has been shown to be caused by mutations within the genes that encode TR as well as TERT. The autosomal recessive form of the disease has been shown to be caused by a mutation within Nola3, the gene that codes for Nop10. These mutations cause a reduction of telomerase activity leading to a limitation in stem cell capacity for proliferation. This reduction in proliferative capacity for high turnover cells leads in low counts for blood and immune cells, resulting in aplastic anemia. Bone marrow failure, brought on by the aplastic anemia, is the most common cause of death for patients with DKC. Families with this disease show anticipation, or the worsening of symptoms in subsequent generations because each generation begins with shorter telomeres than the previous Vulliamy et al, 2006.

Aplastic anemia (AA) is characteristically an acquired disease, however, there are rare constitutional forms of the disease that are found in patients with a strong familial history of various blood diseases. The disease has is connected with mutations with the genes that encode TR and TERT. This constitutional form of bone marrow failure is defined by low peripheral blood cell counts, hypocellular bone marrow, does not respond to immunosuppressive therapy, and included typical physical anomalies. The constitutional has associated with patients with DKC, however, there are cases lacking symptoms of DKC. The most common cause of death is due to bone marrow failure Fogarty et al, 2003.

Idiopathic pulmonary fibrosis (IPF) is a specific form of plumonary fibrosis with unknown cause, Plumonary fibrosis involves fibrotic lesion and scarring of the lung.  The build up of excess scar tissue in the lungs results in reduced lung volume.  The symptoms that typify the disease are chronic cough and shortness of breath. Some familiar types of IPF are caused by mutations in the genes that encode TR and TERT Armanios et al, 2007.

A standard nomenclature is used throughout the online database for consistency and clarity.


TR mutations (autosomal dominant DKC, AA, IPF)
Numerous mutations causing nucleotide substitution, additions, and deletions have been documented within TR, the RNA component of the telomerase ribonucleoprotein (RNP), that have been connected with human diseases. TR contains the template that encodes for telomeric repeats and binds to telomerase reverse transcriptase (TERT) for DNA synthesis. Three domains characterize TR, pseudoknot that includes the template, conserved regions 4 and 5 (CR4-CR5), and ScaRNA domain for nuclear recruitment.

download formats: .pdf and .jpg


The above secondary structure for the 451 nt RNA component of the telomerase ribonucleoprotein (RNP) has indicated the location of mutations known to cause human diseases. The primary nucleotide sequence is in black, point mutations are colored red, while deletion mutations are shaded blue, and both are labeled. The Genbank accession number used is NR_001566 for the RNA sequence. Below is a description of the locations within the nucleotide sequence, the characteristic clinical presentations, and original literature citations that are linked to the published online journal for known mutations within the RNA component of telomerase. The mutations are organized by domain from the 5’-terminus to the 3’-terminus.
Domains Mutation Region Presentation References
5' UTR -99C>G 5' UTR paroxysmal nocturnal hemoglobinuria, menorrhagia, anemia, and thrombocytopenia
Pseudoknot 2G>C 5' end aplastic anemia Marrone et al, 2007.1
28-34 Δ 7nt P1a aplastic anemia Xin et al, 2007
37A>G P1b autosomal dominant dyskeratosis congenita, idiopathic pulmonary fibrosis, and dyspnea
48A>G Template sporatic dyskeratosis congenita Vulliamy et al, 2006
52-55 Δ CTAA Template sporatic myelodysplasia and sporatic dyskeratosis congenita Vulliamy et al, 2006
52-86 Δ 35nt Template pulmonary fibrosis Marrone et al, 2007.1
58G>A Template aplastic anemia, myelodysplasia, and autosomal dominant dyskeratosis congenita
72C>G P2a.1 aplastic anemia Vulliamy et al, 2002
79 Δ C P2a aplastic anemia Vulliamy et al, 2006
96,97 Δ CT P2b autosomal dominant dyskeratosis congenita Vulliamy et al, 2004
98G>A P2b idiopathic pulmonary fibrosis Armanios et al, 2007
107, 108GC>AG P3 autosomal dominant dyskeratosis congenita Vulliamy et al, 2001
110-113 Δ GACT P3 aplastic anemia, myelodysplasia and leukemia
116C>T P2b aplastic anemia and thrombocytopenia
117A>C P2b aplastic anemia Ly et al, 2005.1
143G>A P2a.1 autosomal dominant dyskeratosis congenita Vulliamy et al, 2004
178G>A P3 aplastic anemia Marrone et al, 2007.1
180C>T P3 aplastic anemia Marrone et al, 2007.1
204C>G P1a aplastic anemia Fogarty et al, 2003
Hypervariable 216-229 Δ 14nt P4.1 autosomal dominant dyskeratosis congenita Ly et al, 2005.2
228G>A P4.1 autosomal dominant dyskeratosis congenita and aplastic anemia
CR4-CR5 305G>A P6.1 aplastic anemia Yamaguchi et al, 2003
316-451 Δ 136nt J6/5 to Box ACA autosomal dominant dyskeratosis congenita Vulliamy et al, 2004
322G>A P5 myelodysplasia and refractory anemia Yamaguchi et al, 2003
ScaRNA 378-451 Δ 74nt Box H to Box ACA autosomal dominant dyskeratosis congenita Vulliamy et al, 2001
389, 390 Δ CC P7b essential thrombocythemia Ly et al, 2005.1
408C>G P8b autosomal dominant dyskeratosis congenita Vulliamy et al, 2001
450G>A Box ACA aplastic anemia Yamaguchi et al, 2003
3' UTR 467T>C 3' UTR aplastic anemia Yamaguchi et al, 2003

TERT mutations (autosomal dominant DKC, AA, IPF)
Numerous mutations causing amino acid substitution, additions, deletions, and frame shifts within TERT, the essential protein component of the telomerase ribonucleoprotein (RNP), have been connected with human diseases. TERT contains the catalytic site for the synthesis of telomeric repeats from the RNA template. TERT is composed of three domains, N-terminal extension (NTE) that contains RNA-interaction domains 1 and 2 (RID1 and RID2), reverse transcription domain (RT) where nucleotide transfer occurs, and a C-terminal extension (CTE) for processivity and localization.

download formats: .pdf and .jpg


The above structural organizational scheme for the various motifs with the three domains of the TERT protein has indicated the locations of mutations known to cause human diseases. The black line represents the mRNA sequence of 4015 nt with the untranslated regions (UTR) labeled and the grey box corresponds to the protein sequence. The individual motifs are labeled and NTE is denoted by green, CTE by orange, and the central RT domain by blue boxes. The Genbank accession numbers used are NT_006576 for the 41881 bp gene and NM_198253 for the cDNA and amino acid sequences. Below is the 1132 amino acid sequence for TERT protein with the motifs labeled and colored by domain. The mutated residues are colored red and the change in amino acids is labeled.

download formats: .pdf and .jpg


Below is a description of the locations within the nucleotide and protein sequence, the amino acid substitutions, the characteristic clinical presentations, and original literature citations that are linked to the published online journal for known mutations within the telomerase protein TERT. The cDNA sequence is used for the nucleotide sequence. The mutations are organized by domain from the N-terminus to the C-terminus and grouped by domain.
Domains Mutation AA substitution Motif Presentation References
N-Terminal 97C>T P33S n/a idiopathic pulmonary fibrosis and fibrosis Tsakiri et al, 2007
164T>A L55Q n/a idiopathic pulmonary fibrosis Armanios et al, 2007
IVS1 1G>A n/a IVS1 idiopathic pulmonary fibrosis Armanios et al, 2007
336 Δ C fs P112 GQ idiopathic pulmonary fibrosis Armanios et al, 2007
430G>A V144M GQ idiopathic pulmonary fibrosis Tsakiri et al, 2007
604G>A A202T n/a aplastic anemia
1849C>T A279T n/a sporatic dyskeratosis congenita Vulliamy et al, 2006
1234C>T H412Y CP aplastic anemia Yamaguchi et al, 2005
1456C>T R486C QFP idiopathic pulmonary fibrosis Tsakiri et al, 2007
1710G>(T or C) K570N T aplastic anemia Xin et al, 2007
Reverse transcriptase 2045G>A G682D n/a aplastic anemia Liang et al, 2006
2080G>A V694M n/a aplastic anemia Yamaguchi 2005
2110C>T P704S A autosomal dominant dyskeratosis congenita Du et al, 2007
2162C>G P721R A autosomal recessive dyskeratosis congenita Vulliamy et al, 2006
2177C>T T726M A aplastic anemia Liang et al, 2006
2240 Δ T fs V747 n/a idiopathic pulmonary fibrosis Tsakiri et al, 2007
2315A>G Y772C n/a aplastic anemia Yamaguchi 2005
2431C>T R811C n/a autosomal recessive dyskeratosis congenita Marrone et al, 2007.2
IVS9 -2A>C n/a IVS9 idiopathic pulmonary fibrosis Armanios et al, 2007
2537A>G Y846C B aplastic anemia Du et al, 2007
2593C>T R865C C idiopathic pulmonary fibrosis Tsakiri et al, 2007
2594G>A R865H C idiopathic pulmonary fibrosis, fibrosis, and aplastic anemia Tsakiri et al, 2007
2628C>G H876Q C aplastic anemia Du et al, 2007
2701C>T R901W D autosomal recessive dyskeratosis congenita and Hoyeraal Hreidarsson syndrome Marrone et al, 2007.2
2706G>C K902N D autosomal dominant dyskeratosis congenita and aplastic anemia Armanios et al, 2005
C-terminal 2935C>T R979W E-I aplastic anemia and dyskeratosis congenita
3268G>A V1090M E-III aplastic anemia Yamaguchi 2005
3329C>T T1110M n/a idiopathic pulmonary fibrosis Armanios et al, 2007
3346-3522 Δ 117nt fs E1116 n/a idiopathic pulmonary fibrosis, fibrosis, and aplastic anemia Tsakiri et al, 2007
3379T>C F1127L n/a resembles Hoyeraal Hreidarsson syndrome and autosomal recessive dyskeratosis congenita

DKC1 (dyskerin) mutations (X-linked recessive DKC)
Numerous mutations causing amino acid substitutions, deletions, and the loss of the entire exon 15 within dyskerin, a telomerase-associated protein, have been connected with human diseases. Dyskerin associates with the ScaRNA domain of the RNA component of telomerase along with Nop10, Gar1, and NHP2 proteins. In common with RNA pseudouridine synthases, dyskerin contains a TruB domain that participates in eukaryotic ribosomal RNA processing. The TruB domain is composed of two motifs, TruB I and TruB II. In addition to this are two nuclear localization (NL) signals, N-terminal and C-terminal, and the PUA, Pseudouridine synthase and Archaeosine transglycosylase, domain involved in RNA modification.

download formats: .pdf and .jpg


The above structural organizational scheme for the two domains, TruB and PUA, of the dyskerin protein has indicated the locations of mutations known to cause human diseases. The black line represents the mRNA sequence of 2454 nt with the untranslated regions (UTR) labeled and the grey box corresponds to the protein sequence. The N-terminal and C-terminal nuclear localization (NL) signals are denoted by green, the TruB motifs within the TruB domain by blue, and the PUA domain by orange boxes. The Genbank accession numbers used are NT_011726 for the 14811 bp gene and NM_001363 for the cDNA and amino acid sequences. Below is the 514 amino aid sequence for dyskerin protein with the NL signals, TruB motifs, and PUA domain colored and labeled. The mutated residues are colored red and the change in amino acids is labeled.
download formats: .pdf and .jpg


Below is a description of the locations within the nucleotide and protein sequence, the amino acid substitutions, the characteristic clinical presentations, and original literature citations that are linked to the published online journal for known mutations within the telomerase-associated protein dyskerin. The cDNA sequence is used for the nucleotide sequence. The mutations are organized by domain from the N-terminus to the C-terminus.
Domains Mutation AA substitution Exon Presentation References
URR -141C>G n/a 5' URR X-linked recessive dyskeratosis congenita Knight et al, 2001
-142C>G n/a 5' URR X-linked recessive dyskeratosis congenita Dokal et al, 2000
  5C>T A2V 1 X-linked recessive dyskeratosis congenita Knight et al,1999.1
  IVS1 592C>G n/a IVS1 X-linked recessive dyskeratosis congenita Knight et al, 2001
  29C>T P10L 2 sporatic dyskeratosis congenita and Hoyeraal Hreidarsson syndrome Vulliamy et al, 2006
  IVS2 473C>G N/A IVS2 X-linked recessive dyskeratosis congenita Knight et al,1999.1
  91C>G Q31E 3 X-linked recessive dyskeratosis congenita Wong et al, 2004
  106T>G F36V 3 X-linked recessive dyskeratosis congenita Heiss et al, 1998
  109-111 Δ CTT L37Δ 3 X-linked recessive dyskeratosis congenita Heiss et al, 1998
  113T>C I38T 3 X-linked recessive dyskeratosis congenita and Hoyeraal Hreidarsson syndrome Cossu et al, 2002
  115A>G K39E 3 X-linked recessive dyskeratosis congenita Knight et al,1999.1
  119C>G P40R 3 X-linked recessive dyskeratosis congenita Heiss et al, 1998
  121G>A E41K 3 X-linked recessive dyskeratosis congenita Knight et al,1999.1
  127A>G K43E 3 X-linked recessive dyskeratosis congenita Heiss et al, 2001
  146C>T T49M 3 X-linked recessive dyskeratosis congenita and Hoyeraal Hreidarsson syndrome
  194G>C R65T 4 X-linked recessive dyskeratosis congenita Knight et al,1999.1
  196A>G T66A 4 X-linked recessive dyskeratosis congenita
  200C>T T67I 4 X-linked recessive dyskeratosis congenita and Hoyeraal Hreidarsson syndrome Vulliamy et al, 2006
  204C>A H68Q 4 sporatic dyskeratosis congenita and Hoyeraal Hreidarsson syndrome Vulliamy et al, 2006
  214,215CT>TA L72Y 4 X-linked recessive dyskeratosis congenita Heiss et al, 1998
TruB 361A>G S121G 5 X-linked recessive dyskeratosis congenita and Hoyeraal Hreidarsson syndrome
472C>T R158W 6 X-linked recessive dyskeratosis congenita and Hoyeraal Hreidarsson syndrome Knight et al, 2001
  838A>C S280R 9 X-linked recessive dyskeratosis congenita Knight et al, 2001
PUA 941A>G K314R 10 X-linked recessive dyskeratosis congenita and Hoyeraal Hreidarsson syndrome, sporatic dyskeratosis congenita Vulliamy et al, 2006
949C>T L317F 10 X-linked recessive dyskeratosis congenita
961C>G L321V 10 X-linked recessive dyskeratosis congenita Knight et al,1999.1
965G>A R322Q 10 X-linked recessive dyskeratosis congenita
1049T>C M350T 11 X-linked recessive dyskeratosis congenita Knight et al,1999.1
1050G>A M350I 11 X-linked recessive dyskeratosis congenita Knight et al,1999.1
1058C>T A353V 11 X-linked recessive dyskeratosis congenita and Hoyeraal Hreidarsson syndrome Knight et al,1999.1
1075G>A D359N 11 sporatic dyskeratosis congenita Vulliamy et al, 2006
  1150C>T P384S 11 X-linked recessive dyskeratosis congenita
  1151C>T P384L 11 X-linked recessive dyskeratosis congenita Knight et al, 2001
  1156G>A A386T 12 sporatic dyskeratosis congenita and Hoyeraal Hreidarsson syndrome Vulliamy et al, 2006
  1193T>C L398P 12 X-linked recessive dyskeratosis congenita Hiramatsu et al, 2002
  1204G>A G402R 12 X-linked recessive dyskeratosis congenita Knight et al,1999.1
  1205G>A G402E 12 X-linked recessive dyskeratosis congenita Heiss et al, 1998
  1223C>T T408I 12 X-linked recessive dyskeratosis congenita Vulliamy et al, 2006
  1226C>T P409L 12 X-linked recessive dyskeratosis congenita Ding et al, 2004
  1258,1259AG>TA S420Y 12 sporatic dyskeratosis congenita Vulliamy et al, 2006
  IVS14 473A>G n/a IVS14 sporatic dyskeratosis congenita Vulliamy et al, 2006
  IVS14 +48-50 Δ TGG Δ Exon 15 15 X-linked recessive dyskeratosis congenita Vulliamy et al, 1999

Nola2 (NHP2) mutations (autosomal recessive DKC)
Three point mutations have been discovered within Nola2, the gene that encodes for the protein NHP2, a member of the H/ACA snoRNPs that are involved in various aspects of rRNA processing and modification. NHP2 associates with the ScaRNA domain of the RNA component of telomerase along with dyskerin, Gar1, and Nop10 proteins. These are the first mutations found within NHP2 that is known to cause human disease. The mutation c. 460T>A causes X154R +51aa, the replacement of the stop codon with arginine and the predicted addition of 51 amino acids.

download formats: .pdf and .jpg


The above structural organizational scheme for the four exons that form the 4498 bp Nola2 gene has indicated the location of mutations known to cause human disease. The black line represents the genomic DNA sequence with the untranslated regions (UTR) labeled. Colored boxes indicate exons 1 though 4. The Genbank accession numbers used are NT_023133 for the genomic DNA sequence and NM_017838 for the cDNA and amino acid sequences. Below is the 153 amino acid sequence for Nhp2 protein with the four exons colored and labeled. The mutated residues are colored red and the change in amino acid is labeled.

download formats: .pdf and .jpg


Below is a description of the locations within the nucleotide and protein sequence, the amino acid substitutions, the characteristic clinical presentations, and original literature citations that are linked to the published online journal for known mutation within the telomerase-associated protein NHP2. The cDNA sequence is used for the nucleotide sequence.
Domains Mutation AA substitution Exon Presentation References
  376G>A V126M 4 autosomal recessive dyskeratosis congenita Vulliamy et al, 2008
  415T>C Y139H 4 autosomal recessive dyskeratosis congenita Vulliamy et al, 2008
3' UTR 460T>A X154R +51aa 3' UTR autosomal recessive dyskeratosis congenita Vulliamy et al, 2008

Nola3 (Nop10) mutations (autosomal recessive DKC)
At present only a single amino acid substitution mutation has been discovered within Nola3, the gene that encodes for the protein Nop10, a member of the H/ACA snoRNPs that are involved in various aspects of rRNA processing and modification. Nop10 associates with the ScaRNA domain of the RNA component of telomerase along with dyskerin, Gar1, and NHP2 proteins. This is the first and only mutation found within Nop10 that is known to cause human disease.

download formats: .pdf and .jpg


The above structural organizational scheme for the two exons that form the 1446 bp Nola3 gene has indicated the location of the mutation known to cause human disease. The black line represents the genomic DNA sequence with the untranslated regions (UTR) labeled. Colored boxes indicate exons 1 and 2. The Genbank accession numbers used are NT_010194 for the genomic DNA sequence and NM_018648 for the cDNA and amino acid sequences. Below is the 64 amino acid sequence for Nop10 protein with the two exons colored and labeled. The mutated residue is colored red and the change in amino acid is labeled.

download formats: .pdf and .jpg


Below is a description of the locations within the nucleotide and protein sequence, the amino acid substitutions, the characteristic clinical presentations, and original literature citations that are linked to the published online journal for known mutation within the telomerase-associated protein Nop10. The cDNA sequence is used for the nucleotide sequence.
Domains Mutation AA substitution Exon Presentation References
  100C>T R34W 2 autosomal recessive dyskeratosis congenita Walne et al, 2007

TINF2 (TIN2) mutations (autosomal dominant DKC)
Currently, three point mutations causing two amino acid substitutions have been documented within TRF1-interacting nuclear factor 2 (TINF2), the gene that encodes for the protein TIN2, a component of the shelterin complex. The shelterin complex is composed of six proteins, three of which that directly associate with the telomeric DNA, TRF1, TRF2, and POT1. Three additional proteins associate with the telomeric DNA binding proteins, TIN2, TPP1, and Rap1. The mutations within TIN2 are the first mutations found within the shelterin complex known to cause human diseases.

download formats: .pdf and .jpg


The above structural organizational scheme for the nine exons that form the 3032 bp TINF2 gene has indicated the location of mutations known to cause human diseases. The black line represents the genomic DNA sequence with the untranslated regions (UTR) labeled. Colored boxes indicate exons 1-5, 6a, 6c, 6d, and 6e. The Genbank accession numbers used are NT_026437 for the genomic DNA sequence and NM_001099274 for the cDNA and amino acid sequences. Below is the 451 amino acid sequence for TIN2 protein with the nine exons colored and labeled. The mutated residues are colored red and the change in amino acids is labeled above.

download formats: .pdf and .jpg


Below is a description of the locations within the nucleotide and protein sequence, the amino acid substitutions, the characteristic clinical presentations, and original literature citations that are linked to the published online journal for known mutations within the shelterin protein TIN2. The cDNA sequence is used for the nucleotide sequence. The mutations are organized by domain from the N-terminus to the C-terminus.
Domains
Mutation
AA substitution
Exon
Presentation
References
  838A>G K280E 6a autosomal-dominant dyskeratosis congenita Savage et al, 2008
  844C>A R282H 6a autosomal-dominant dyskeratosis congenita, Revesz syndrome Savage et al, 2008
  845G>A R282S 6a autosomal-dominant dyskeratosis congenita Savage et al, 2008