Abstract
BRCA1 and BRCA2 are both tumor suppressors whose mutations are the cause of most hereditary breast cancers. Both genes are highly involved in ensuring genome stability. BRCA1 homologs are found in the plant and animal kingdoms while BRCA2 homologs are additionally found in the fungi kingdom. The initial origin of both genes remains unknown, however it is expected that the common ancestors originated around 1.6 billion years ago prior to the kingdoms diverging. There has been a great amount of divergence between homologs that is not observed in other tumor suppressors with only functionally important domains conserved. This divergence continues today with evidence of primate BRCA1/2 evolution. Cancer-associated mutations have been found to occur at conserved sites, indicating that conserved sites are important for function. In this study, we present a review on the phylogenesis of BRCA1 and BRCA2.
Breast cancer susceptibility genes 1 and 2 (BRCA1 and BRCA2) are found in a wide variety of organisms and help stabilize the genome. BRCA2 homologs and orthologs are found in organisms across three kingdoms: animal, plant, and fungi whereas BRCA1 homologs and orthologs are only found in animal and plant kingdoms. While not every organism possesses either or both of the genes, a vast majority do, indicating that the presence of BRCA1/2 genes dates back to 1.6 billion years ago when the three kingdoms were first diverging.
Both genes are tumor suppressors and are highly involved in ensuring genome stability. BRCA1 is involved in both the checkpoint activation and in DNA repair, while BRCA2 regulates homologous recombination. BRCA1 and BRCA2 mutations are the cause of most hereditary breast cancer, and are responsible for up to 10% of all breast cancer. The risk for developing breast cancer is 80% with BRCA1 mutations and 60% with BRCA2 (1). While BRCA1 and BRCA2 are notoriously linked to an elevated risk of breast cancer, it has been found that they are also associated with an elevated risk for ovarian, pancreatic, endometrial, prostate, stomach, laryngeal, and fallopian tube cancers (2, 3). Breast cancer has been reported in other mammals including rodents, carnivores, and primates (4). This would suggest that BRCA1/2 mutations would affect functions that are unique to mammary tissue and mammals in general, however, that is not the case. The function of both of these genes results in genome stability. BRCA1/2 are both required for homologous recombination. BRCA1 is also involved in cell cycle checkpoints.
Through examining BRCA1/2 in other organisms, the functional domains can be identified that assist in determining if mutations are cancer-causing. Additionally, this allows us to learn more about the function of these genes and possible cures and treatments. Years of research and investigation of various organisms for BRCA1/2 homologs has contributed to our knowledge of BRCA1/2's role in cancer and the evolution of these genes (Table I).
BRCA1 and BRCA2 Function
The initial search for a gene responsible for hereditary breast cancer resulted in finding BRCA1 that is responsible for DNA repair and checkpoints in the cell cycle (5). Further inquires into genes susceptible to breast cancer found BRCA2 which has a role in homologous recombination (6). It has been found that homozygous mutations in either of these genes is lethal and development stops embryonically. Both BRCA1 and BRCA2 lead high-fidelity repair through homologous recombination (7).
The BRCA1 protein exists as a heterodimer with BARD1 and in humans, forms three different complexes. Complex A is involved in DNA repair via homologous recombination, complex B plays a role in the G1/S cell-cycle checkpoint, and complex C occurs in the G2/M cell-cycle checkpoint (8).
The BRCA2 protein primarily functions in homologous recombination. It is responsible for recruiting RAD51 that is essential for repair. BRCA2 is thought to have a special affinity for ssDNA-dsDNA junctions which are important for recombinational repair in normal replication (9). Additionally, it binds to DSS1 which assists in loading RAD51. BRCA2 also has domains that bind to both ssDNA and dsDNA (10). The main role of BRCA2 in homologous recombination is to load RAD51 onto ssDNA.
Domain Structures of BRCA1 and BRCA2
The human BRCA1 protein has four major domains: RING, P300/CBP, coiled-coil, and BRCT. The RING domain interacts with BARD1 and also shows E3 ligase activity. This activity is amplified when BARD1 and BRCA1 form a heterodimer as it can ubiquitinate itself and enhance its activity (3). The P300/CBP domain has transcriptional regulatory activity. It interacts with two transcriptional cofactors: P300 (histone acetyl transferase p300) and CBP (CREB binding protein) (8). The coiled-coil domain functions in homologous recombination. The domain binds to the coiled-coil domain of PALB2 which then binds the N-terminus of BRCA2 (10). This entire complex then assists with the binding of RAD51 and continuation of homologous recombination. The BRCT domains (BRCA1 carboxy-terminal repeats) are conserved in multiple proteins that are involved in DNA repair. The BRCT domains recognize a phosphorylated serine in the pSXXF motif and bind to three different proteins: Abraxas, BRIP1, and CtIP (8). These different proteins correspond to the complexes A, B, and C respectively. Additionally, the BRCT domains can only bind one protein at a time, indicating that the complexes are mutually exclusive.
The human BRCA2 protein also has four domains: N-terminus, BRC, DBD, and C-terminus. As previously mentioned, the N-terminus binds to PALB2 which assists in the localization of RAD51 to the damaged DNA. The BRC repeats bind to RAD51 and assist in regulating the RAD51 filament formation (3). The DBD (DNA binding domain) contains a helix-rich region, three oligonucleotide/oligosaccharide binding (OB) fold modules, and a tower domain (TD). Each of these regions have individual functions: OB1, OB2, and OB3 all bind ssDNA, the TD binds dsDNA, and the helix-rich region, OB1, and OB2 bind DSS1 (10). DSS1 is linked to stabilizing BRCA2, promoting homologous recombination, and has many other functions in the cell including DNA repair, development, and protein degradation (8).
Phylogenesis of BRCA1 and BRCA2
The initial origin of both BRCA1 and BRCA2 is still unknown. Both are found only in Eukaryotes; BRCA1/2 are present in plants and animals, and BRCA2 is also found in fungi. Since BRCA2 is found in all three kingdoms, it is likely that the common ancestor of the gene originated prior to the kingdoms splitting around 1.6 billion years ago (11). The common ancestor of BRCA1 also arose around 1.6 billion years ago when plants and animals where diverging. In order to narrow down when either gene arose, more genomes need to be annotated and examined for the functional domains in BRCA1/2.
While BRCA1 homologs have been found in many eukaryotic cells, there is no recognizable one in S. cerevisiae despite the presence of some BRCA1 interacting proteins such as DSS1 (12). BRCA1 has not been identified in any other kingdom thus far, indicating that the common ancestor must be the last common ancestor of plants and animals. All plant BRCA1 homologs except green algae C. reinhardtii and V. carteri have a conserved domain, PHD, that is not present within any animal BRCA1. This suggests that there was a common BRCA1-like gene prior to plant-animal divergence that gained the PHD domain after the split. This hypothesis is confirmed by a paralog of plant BRCA1, BARD1, which also had the PHD domain. In both plants and animals, BARD1 interacts with BRCA1 to form a heterodimer which is necessary for proper BRCA1 function. The plant homolog gained the PHD domain 1.15 billion years ago and the duplication to form BARD1 occurred between 495.9-301 million years ago (8).
BRCA1 homologs have been found in all Viridiplanta genomes except Physcomitrella patens and in most angiosperms (8). Arabidopsis has a BRCA1 homolog (AtBRCA1) that has three of the human domains: RING, BRCT, and P300/CBP (9). Throughout the entire amino acid sequence there is 20% identity between the human and Arabidopsis BRCA1, however, the identity is higher in the domains (8). As previously described, AtBRCA1 also contains the PHD domain between the P300/CBP and the BRCT domains. The function of PHD is still unknown but it is thought to be involved in the regulation of transcription (8).
A BRCA1 ortholog in C. elegans, brc-1, also functions in DNA repair and is found to be elevated during the DNA damage response (13). Overall, the function of BRCA1 and its homologs have not diverged and still maintain a role in genome stability and DNA repair.
BRCA1 homologs are not highly conserved, even amongst closely related species. Entire sequences have low conservation, however, some domains have higher levels of conservation. In BRCA1, a single exon, 11, encodes around 60% of the protein. This exon sequence is more viable than the entire gene for most organisms. In exon 11 within mammals, only 7.95% of the amino acid sequence is fixed and 22.4% is conserved (4). Within exon 11, there are 13 highly conserved regions between eutherian mammals, chicken, frog, and marsupials; three of these regions are unique to mammals (4). Since the divergence of primates from non-primates 74.1 million years ago, 132 amino acid sites in the human BRCA1 have evolved (14, 15).
BRCA1 homologs do not share the high degree of similarity that is implied by the significant function of the gene. There is only 56% identity between the human and mouse BRCA1. Compared to other genes, this level of identity is low and displays the poor sequence conservation. MSH2 and XPA, both tumor suppressor genes, have a 92% and 86% identity, respectively, between human and mouse sequences. RAD51, which interacts with BRCA2 and is in the BRCA1 complex A, has a 99% identity between human and mouse (15). Despite the low levels of conservation, it appears that function of BRCA1 has not evolved.
Similarly to BRCA1, there is no BRCA2 homolog in S. cerevisiae, however, there are homologs in other fungi. In U. maydis, the BRCA2 homolog is Brh2 and interacts with RAD51. Brh2 requires BRC elements in order to interact with RAD51. There is only one BRC domain in Brh2 whereas there are 8 BRCs in the human BRCA2. Brh2 has no overall sequence alignment with any known gene, however, within the sequence are regions that align with BRCA2 conserved domains. The ability of BRCA2 homologs to associate with RAD51 is conserved in U. maydis, Arabidopsis, C. elegans, and all the way up to mammals (15). In U. maydis, Brh2 functions as a master regulator of recombinational machinery. Brh2 is required for proficiency in repair and recombination and serves in maintaining genome stability in mitosis and meiosis (18). This function is more extensive than the human BRCA2 function suggesting that the function has narrowed and become more specified through evolution. This sequence has conservation with homologs in Arabidopsis and Oryza, with 38% identity in the C terminus and the nuclear localization signal (NLS). The residues that directly interact with DSS1 are the most highly conserved indicating that the interaction between them has a biologically important function (19).
Arabidopsis the is only known organism to have two BRCA2 homologs. They are 96.8% identical to each other which suggests that one arose from the other through a duplication. There is 21% identity to human BRCA2 (9). The function of these homologs has been found to be essential in meiosis. Plant homologs for BRCA2 have been found in Viridiplantae and in green algae as well. Unlike animals, plants that either lack BRCA2 or are homozygous mutants are able to survive past embryonic development (8).
The chicken BRCA2 homolog has the same order and organization as the mammalian version, however, it is poorly conserved with only 37% amino acid identity. Exons 2, 4, 7, and 16-20 are highly conserved between the two homologs with an average identity of 70% (20). Any high conservation within a poorly conserved gene indicates important functionality. Exons 2 and 4 are in the N-terminal domain and exons 16-20 are in the C-terminal domain which supports BRCA2 conserving function. The largest exon, 11, harbors the BRC repeats which are responsible for binding RAD51. In chicken BRCA2, exon 11 is poorly conserved with 21% identity to human BRCA2. However, three of the BRC repeats have a 70% identity between the homologs (20).
The BRC repeats are highly variable among all BRCA2 homologs. The repeats range from 1 repeat to 15 repeats in some species. The duplication of the BRC repeats happened around 230-300 million years ago (20).
Just as the BRCA1 homologs are poorly conserved, the BRCA2 homologs follow this same pattern. The highest conserved regions across all homologs are the BRC repeats and the C-terminal region (18). The divergence amongst the homologs could reflect the diversification and specialization in the regulation of RAD51-mediated DNA repair throughout the evolutionary tree.
Human BRCA1/2 Genes
The human BRCA1 gene is located at chromosome 17q21.3. The entire gene is 193,689 nucleotides; the coding sequence begins in the second exon and ends in the 24th exon. Exon 11 encodes more than 60% of the entire protein (21). The protein is 1,863 amino acids long and 220 kDa. The BRCA2 gene is located at chromosome 12q13.1. The entire gene is 91,193 nucleotides and the coding region begins in the first exon and ends in the 27th exon (22). Exon 11 encodes more than 50% of the entire protein. The BRCA2 protein is 3,418 amino acids long and 390 kDa.
Limitations of Evolutionary Studies
When looking the evolutionary relationship of BRCA1/2, we rely on the published genomes. However, the accuracy of these genomes cannot be confirmed. Many genomes are likely to have poor assembly of the genome, or missing a percentage of the genes. Additionally, since the history of both BRCA1 and BRCA2 extends over one billion years, only domains or residues that are continuously selected for are truly conserved. This can lead to difficulty establishing the origin of the gene because so much divergence has occurred and can also lead to genes being included that are not true homologs, but instead just have similar domains and functions. Therefore, conclusions about the evolutionary development of either BRCA1 or BRCA2 must be taken as good estimates of past events, not as definitive statements.
Relationship Between BRCA1 and BRCA2
There is no phylogenetic relationship between BRCA1 and BRCA2. Although BRCA1 and BRCA2 have similar disease phenotypes and both play a role in homologous recombination, there is no homology between each other. Neither gene shows homology to other genes in the human genome (3). Both genes are thought to have the same disease phenotype because of their joint role in homologous recombination.
Continued Evolution of BRCA1 and BRCA2
There is evidence that both BRCA1 and BRCA2 are still evolving, particularly in primates. It is unknown as to why both of these genes are continuing to evolve as rapidly as they are.
There are indications that there is a selection occurring at specific loci in the modern day (1). There have been 22 non-synonymous mutations in the human BRCA1 gene since the divergence from chimpanzees and bonobos six million years ago. BRCA1 has far more non-synonymous mutations than other genes, indicating that it is evolving and is occurring much faster than expected. Two SNPs in the chimpanzee BRCA1 were found to not be in Hardy-Weinberg equilibrium. This suggests that more than just the human BRCA1 gene is evolving. In simian primates, there has been a selection that favors non-synonymous mutations with BRCA1 that has lasted through the speciation of the primates. The most intense selection found has occurred in the human/chimpanzee/bonobo clade (1). Most of the selection observed is positive selection. Evidence for this rapid evolution within the human BRCA1 gene along with other primate homologs continues to be found across the gene (1, 4, 15). The evolution of BRCA1 in humans in occurring rapidly despite the fact that many of the changes to the gene result in a significantly elevated risk for breast cancer. Despite the rapid evolution in primate BRCA1, there has been no identification of rapid evolution in marsupial BRCA1 homologs. This indicates that the selection force is acting only upon primates.
BRCA2 is also evolving under positive selection only in primates. There have been three sites in exon 11 of the human BRCA2 that are under positive selection in the RAD51 interacting domain (15). The selection in BRCA2 is not concentrated in the human/chimpanzee/bonobo clade as was seen in BRCA1. This selection is not as strong as the selection observed in BRCA1, but it is still occurring more rapidly than typically predicted. It remains unknown as to why these genes are evolving at this elevated rate.
The strongest hypothesis for this phenomenon is viral infections are driving this rapid evolution. Many viruses interact with DNA repair proteins in order to infect the host and remain undetected. The introduction of viral DNA can trigger the DNA damage response, yet in order to effectively infect, the virus must impede this response (23). Some viruses also use the DNA repair proteins in order to replicate themselves. These viruses could be interacting with either BRCA1 or BRCA2 and driving the changes within the genes to become less susceptible to these viruses. This selection leads to decrease vulnerability to infection, however, it also leads to consequences for the host. This scenario could lead to antagonistic pleiotropy where BRCA1/2 are less susceptible to viral infection but increase the risk of developing breast cancer (1).
Role of BRCA1 and BRCA2 in Cancer
Many of the inherited BRCA1 mutations that are cancer-associated have been found within the BRCT and RING domains which indicates that those domains are essential to the function of BRCA1 (3). There have been 38 missense changes in exon 11 of BRCA1 identified that are likely to have an effect on protein function (15). Of these 38, 3 effect fixed locations and 3 affect conservative sites as determined by the analysis of mammalian BRCA1 homologs. Since the mutations affect fixed or conservative sites, they are more likely to be cancer-associated mutations as they disrupt loci that have an essential role in the protein function. The knowledge of conservative locations can be used to determine if variants are simply polymorphisms or disease associated variants. If the variant occurs at a fully or partially conserved residue, then the variant is probably disease-associated. Variants that have no disease link or polymorphisms typically occur in non-conserved regions.
Although mutations in both BRCA1 and BRCA2 increase the risk for breast cancer, they cause different cancer types. BRCA1 mutations typically result in triple-negative breast cancer (TNBC) where there is no overexpression of estrogen receptor (ER), progesterone receptor, or human epidermal growth factor receptor-2 (2). This cancer type is characterized by aggressive growth and very few targeted treatment options. The desired treatment for TNBC would only target cells with the BRCA1 mutation and not healthy cells, however, since there is no overexpression, there is no target for treatment. The use of poly ADP-ribose polymerase-1 (PARP1) inhibitor and gemcitabine has shown to be a promising treatment and prompts BRCA1 mutated cells to go through apoptosis (24). Conversely, BRCA2 mutations result in low-grade ER+ luminal tumors that grow more slowly and have the treatment option of inhibiting signaling through ER (2).
While BRCA1/2 mutations also elevate the risk of many different types of cancer, breast cancer remains the most prominent. It is still not fully understood as to why these mutations impact mammary tissue in particular, especially considering the broad role of genome stabilization that genes carry. BRCA1 has been found to be a transcriptional regulator of ER; BRCA1 acts to repress the estrogen receptor which prevents further signaling that promotes cell proliferation (25). If BRCA1 is mutated and cannot act as a regulator, then the ER can over promote cell proliferation. Another hypothesis is that the hormonally driven growth during the menstrual cycle produces reactive oxygen species which causes DNA damage (3). This DNA damage paired with a nonfunctional BRCA1 or BRCA2 leads to increased genome instability and more DNA mutations. In a study by Malicka et al. (26), it was found that physical therapy in rats lead to significantly higher progesterone levels that was found to inhibit mammary gland cancer and increase tumor cell apoptosis. This suggests that hormone treatment or physical therapy could have a suppressive effect on breast cancer development (26).
The common link between BRCA1 and BRCA2 is homologous recombination. Since BRCA1 and BRCA2 mutations have the same disease phenotype, it can be concluded that homologous recombination is the critical system for protecting the genome. It is this pathway that is affected in both mutations, confirming that the role of BRCA1/2 in homologous recombination is essential for tumor suppression.
Conclusion
Both the tumor suppressor genes BRCA1 and BRCA2 have been around for approximately 1.6 billion years in animals and plants, with BRCA2 in fungi as well. Much of our knowledge about both of these genes have come from studying homologs in various organisms. Additionally, the realization that both BRCA1 and BRCA2 are still rapidly evolving in primates explains the perseverance of disease causing mutations in the genes.
Acknowledgements
The Authors gratefully acknowledge the support from the Department of Biology, Division of Natural and Social Sciences, Carthage College, Kenosha, WI 53140 for supporting Claire Pfeffer and Benjamin Ho as Fellows to conducting research under the mentorship of Dr. Amareshwar T.K. Singh.
Footnotes
This article is freely accessible online.
Conflicts of Interest
The Authors declare no competing financial interests.
- Received July 5, 2017.
- Revision received July 26, 2017.
- Accepted August 2, 2017.
- Copyright© 2017, International Institute of Anticancer Research (Dr. George J. Delinasios), All rights reserved