CFAP97D2

From Wikipedia, the free encyclopedia

Conceptual Translation of Homo sapiens CFAP97D2

CFAP97D2 (Cilia and Flagella Associated Protein 97 Domain Containing 2) is a protein that in humans is encoded by the CFAP97D2 gene.

Gene

The Homo sapiens CFAP97D2 gene (XM_017020910.2) is a 68,395 base pair gene that encodes mRNA transcripts ranging from 952 to 1,841 nucleotides.[1] It is located on the plus strand of chromosome 13 at locus 13q34. Its longest protein product, Cilia and Flagella Associated Protein 97 Domain Containing 2 (CFAP97D2), is 166 amino acids long. The CFAP97D2 gene is also known as the C17orf105 homolog gene.[2]

CFAP97D2 Gene Neighborhood

mRNA

Homo sapiens CFAP97D2 Isoforms

Isoforms

The CFAP97D2 gene produces 5 mRNA transcripts, which encode 5 different CFAP97D2 isoforms: X1, X2, X3, 1, and 2. Isoform X1, at 166 amino acids (AA), is the longest isoform of Homo sapiens CFAP97D2. Isoform X2 has 2 deleterious mRNA mutations at the end of exon 5 that account for a single amino acid deletion; the final protein is 165 AA long. Isoform X3 has numerous mRNA insertions in exons 3, 4, and 5, and its final protein length is 101 AA. Isoforms 1 and 2 have complete deleterious mRNA mutations of exon 4 and encode final protein lengths of 99 and 98 AA, respectively.[1][3]
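As a quick arithmetic check on the isoform lengths above, the sketch below computes the minimum coding sequence length implied by each protein length (three nucleotides per codon, plus one stop codon). Only the amino acid counts come from the text; the dictionary and function names are illustrative and not taken from any database.

```python
# Protein lengths (in amino acids) as stated in the text above.
ISOFORM_LENGTHS_AA = {"X1": 166, "X2": 165, "X3": 101, "1": 99, "2": 98}


def cds_length_nt(protein_length_aa: int) -> int:
    """Minimum coding sequence length: one codon per residue plus a stop codon."""
    return 3 * (protein_length_aa + 1)


def summarize(lengths: dict[str, int], reference: str = "X1") -> dict[str, tuple[int, int]]:
    """Map each isoform to (CDS length in nt, residues shorter than the reference)."""
    ref_len = lengths[reference]
    return {name: (cds_length_nt(aa), ref_len - aa) for name, aa in lengths.items()}


if __name__ == "__main__":
    for name, (cds_nt, shorter_by) in summarize(ISOFORM_LENGTHS_AA).items():
        print(f"Isoform {name}: CDS >= {cds_nt} nt, {shorter_by} AA shorter than X1")
```

Under these assumptions, even the longest isoform needs only 501 coding nucleotides, so the 952 to 1,841 nucleotide transcripts reported for the gene leave room for untranslated regions.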

NCBI CFAP97D2 RNA Tissue Expression

Expression

RNA sequencing revealed CFAP97D2 expression in the brain, lungs, pancreas, testis, fallopian tubes, and cervix.[4][5][6]

Protein

Homo sapiens CFAP97D2 Isoform X1 Tertiary Structure
CFAP97D2 Strict Orthologs

Primary Structure

Homo sapiens CFAP97D2 Isoform X1 is a basic protein with a predicted isoelectric point (pI) of 10.4 and a molecular weight (Mw) of 19.3 kDa.[7] It contains a nuclear leucine zipper motif (AA 37-52) and a nuclear localization signal (AA 102-116).[8][9]
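The molecular weight quoted above was predicted with an online tool. As a rough illustration of how such an estimate can be made, the sketch below sums average residue masses and adds one water molecule for the free termini. It is not the cited tool's implementation, it does not estimate pI, and the function name is hypothetical. The CFAP97D2 sequence itself is not reproduced here, so any input sequence is the reader's own.

```python
# Average residue masses in daltons (free amino acid mass minus one water).
RESIDUE_MASS = {
    "G": 57.0519, "A": 71.0788, "S": 87.0782, "P": 97.1167, "V": 99.1326,
    "T": 101.1051, "C": 103.1388, "L": 113.1594, "I": 113.1594, "N": 114.1038,
    "D": 115.0886, "Q": 128.1307, "K": 128.1741, "E": 129.1155, "M": 131.1926,
    "H": 137.1411, "F": 147.1766, "R": 156.1875, "Y": 163.1760, "W": 186.2132,
}
WATER_MASS = 18.01528  # added once for the N-terminal H and C-terminal OH


def molecular_weight_da(sequence: str) -> float:
    """Estimate the average molecular weight of an unmodified peptide chain."""
    seq = sequence.upper()
    unknown = set(seq) - set(RESIDUE_MASS)
    if unknown:
        raise ValueError(f"unsupported residue codes: {sorted(unknown)}")
    if not seq:
        return 0.0
    return sum(RESIDUE_MASS[residue] for residue in seq) + WATER_MASS


if __name__ == "__main__":
    # Toy dipeptide (Gly-Ala), used only to show the call.
    print(f"{molecular_weight_da('GA'):.2f} Da")
```

For scale, 19.3 kDa spread over 166 residues is roughly 116 Da per residue, which falls within the range of residue masses in the table.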

Secondary Structure

By definition, the leucine zipper region of Homo sapiens CFAP97D2 (AA #37-52) is an alpha helix.[10][11]
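Leucine zippers are commonly recognized by leucines recurring at every seventh position along a helix. The sketch below scans a sequence for that heptad spacing with a regular expression. It is a simplified stand-in and not the method used by the motif databases cited above; the function name, the `min_leucines` threshold, and the toy sequence are all assumptions made for illustration.

```python
import re


def find_heptad_leucine_repeats(sequence: str, min_leucines: int = 3) -> list[tuple[int, int]]:
    """Return 1-based inclusive (start, end) spans where leucine recurs every 7th residue."""
    if min_leucines < 2:
        raise ValueError("min_leucines must be at least 2")
    # One leading L, then (six arbitrary residues + L) repeated at least min_leucines - 1 times.
    pattern = re.compile(r"L(?:.{6}L){%d,}" % (min_leucines - 1))
    return [(m.start() + 1, m.end()) for m in pattern.finditer(sequence.upper())]


if __name__ == "__main__":
    # Made-up sequence with leucines at positions 4, 11, and 18.
    toy = "AAAL" + "A" * 6 + "L" + "A" * 6 + "L" + "GG"
    print(find_heptad_leucine_repeats(toy))
```

A heptad pattern alone produces false positives, so a hit from a scan like this is only a candidate region and still needs structural or experimental support.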

Tertiary Structure

CFAP97D2 is characterized by coiled-coil regions and 2 alpha helices.[12]

Homology

CFAP97 Gene Family

The CFAP97 gene family contains 3 genes (CFAP97, CFAP97D1, and CFAP97D2) and is characterized by the KIAA1430 domain. CFAP97D1 has been conserved over a longer evolutionary period than CFAP97D2, with an estimated divergence date of 431 million years ago (MYA).[13]

Strict Orthologs

CFAP97D2 is a highly conserved protein found in primates, rodents, bats, even-toed ungulates, Otariidae (eared seals), birds, reptiles, and bony fish.[13] Primate and rodent CFAP97D2 proteins are the most closely related to Homo sapiens CFAP97D2 (% identity range: 74-100%). Bats, even-toed ungulates, Otariidae, and birds are moderately related (% identity range: 55.9-73%), and reptile and bony fish species are the most distantly related (% identity range: 25.6-42%). The Asiatic toad, an amphibian grouped here with the reptiles, is an outlier with 60.4% identity to Homo sapiens CFAP97D2.
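The identity percentages above come from sequence alignment searches. As a minimal sketch of what such a figure measures, the function below counts identical columns in a pairwise alignment that has already been computed and divides by the alignment length, gaps included. Other tools use different denominators (for example, the shorter sequence length), so results may not match the reported values exactly. The function name and example strings are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent of alignment columns holding the same residue in both sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    # Ignore columns where both sequences have a gap; they carry no information.
    columns = [
        (a, b)
        for a, b in zip(aligned_a.upper(), aligned_b.upper())
        if not (a == gap and b == gap)
    ]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    # Made-up aligned fragments: three of four columns are identical.
    print(percent_identity("AC-E", "ACDE"))
```

The input must be a finished alignment of equal-length strings; producing that alignment is a separate step handled by an alignment program.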

Paralogs and Distant Homologs

Homo sapiens CFAP97D2 Paralog and Distant Homolog Table

CFAP97D1 is conserved in both invertebrate and vertebrate species.[14] The CFAP97D1 genes found in vertebrate species are paralogs of CFAP97D2. Invertebrate CFAP97D1 genes, which existed prior to 431 MYA, are distant homologs of CFAP97D2 because the two paralogs share an evolutionary history.

Evolution

CFAP97D1 has been conserved over a longer evolutionary period than CFAP97D2.[15] CFAP97D1 has changed slowly and steadily, diverging little from its ancestral orthologs over 600 million years, while CFAP97D2 has changed rapidly over 431 million years. CFAP97D2 is present only in vertebrate species, and its rapid evolution suggests that it is under different selective pressures than CFAP97D1 and may therefore serve a different function.

CFAP97D2 Phylogenetic Tree

References

  1. "CFAP97D2 CFAP97 domain containing 2 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-16.
  2. "CFAP97D2 Gene - GeneCards | C97D2 Protein | C97D2 Antibody". www.genecards.org. Retrieved 2022-12-16.
  3. "Human BLAT Search". genome.ucsc.edu. Retrieved 2022-12-16.
  4. "CFAP97D2 CFAP97 domain containing 2 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-14.
  5. "CFAP97D2 protein expression summary - The Human Protein Atlas". www.proteinatlas.org. Retrieved 2022-12-14.
  6. "Gene Expression in 54 tissues from GTEx RNA-seq of 17382 samples, 948 donors (V8, Aug 2019) (RP11-569D9.5)". genome.ucsc.edu. Retrieved 2022-12-14.
  7. "Expasy - Compute pI/Mw tool". web.expasy.org. Retrieved 2022-12-14.
  8. "ELM - Search the ELM resource". elm.eu.org. Retrieved 2022-12-16.
  9. "Motif Scan". myhits.sib.swiss. Retrieved 2022-12-16.
  10. Seldeen, Kenneth L.; McDonald, Caleb B.; Deegan, Brian J.; Bhat, Vikas; Farooq, Amjad (2010-04-16). "Dissecting the Role of Leucine Zippers in the Binding of bZIP Domains of Jun Transcription Factor to DNA". Biochemical and Biophysical Research Communications. 394 (4): 1030–1035. doi:10.1016/j.bbrc.2010.03.116. ISSN 0006-291X. PMC 2860604. PMID 20331972.
  11. Pollard, Thomas D.; Earnshaw, William C.; Lippincott-Schwartz, Jennifer; Johnson, Graham T., eds. (2017-01-01), "Chapter 10 - Gene Expression*", Cell Biology (Third Edition), Elsevier, pp. 165–187, doi:10.1016/b978-0-323-34126-4.00015-3, ISBN 978-0-323-34126-4, retrieved 2022-12-14
  12. Zheng, Wei; Zhang, Chengxin; Yang, Li; Pearce, Robin; Bell, Eric W.; Zhang, Yang (2021). "Folding non-homology proteins by coupling deep-learning contact maps with I-TASSER assembly simulations". Cell Reports Methods. 1 (100014): 100014. doi:10.1016/j.crmeth.2021.100014. PMC 8336924. PMID 34355210 – via I-Tasser.
  13. "BLAST: Basic Local Alignment Search Tool". blast.ncbi.nlm.nih.gov. pp. [XP_016876399.1]. Retrieved 2022-12-14.
  14. "CFAP97D1 CFAP97 domain containing 1 [Homo sapiens (human)] - Gene - NCBI". www.ncbi.nlm.nih.gov. Retrieved 2022-12-16.
  15. "TimeTree :: The Timescale of Life". timetree.org. Retrieved 2022-12-16.