Recent Advances in Genomic Approaches for the Detection of Homologous Recombination Deficiency

Article information

Cancer Res Treat. 2024;.crt.2024.154
Publication date (electronic) : 2024 July 17
doi : https://doi.org/10.4143/crt.2024.154
1Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
2Department of Obstetrics and Gynecology, Yonsei University College of Medicine, Seoul, Korea
3Massachusetts General Hospital Cancer Center, Harvard Medical School, Boston, MA, USA
4Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA, USA
5Broad Institute of Harvard and MIT, Cambridge, MA, USA
Correspondence: Peter J. Park, Department of Biomedical Informatics, Harvard Medical School, Boston, MA 02115, USA Tel: 1-617-432-7373 Fax: 1-617-432-7373 E-mail: peter_park@hms.harvard.edu
*Yoo-Na Kim and Doga C. Gulhan contributed equally to this work.
Received 2024 February 15; Accepted 2024 July 16.

Abstract

Accurate detection of homologous recombination deficiency (HRD) in cancer patients is paramount in clinical applications, as HRD confers sensitivity to poly(ADP-ribose) polymerase (PARP) inhibitors. With the advances in genome sequencing technology, mutational profiling on a genome-wide scale has become readily accessible, and our knowledge of the genomic consequences of HRD has been greatly expanded and refined. Here, we review the recent advances in HRD detection methods. We examine the copy number and structural alterations that often accompany the genome instability that results from HRD, describe the advantages of mutational signature-based methods that do not rely on specific gene mutations, and review some of the existing algorithms used for HRD detection. We also discuss the choice of sequencing platforms (panel, exome, or whole-genome) and catalog the HRD detection assays used in key PARP inhibitor trials.

Introduction

Many endogenous mutagenic processes or exogenous mutagens can lead to double-strand breaks (DSBs). If left unrepaired, DSBs can lead to a wide range of genomic alterations often observed in cancer. Cells employ several pathways to repair DSBs. Most prominently, homologous recombination repair (HRR) fixes DSB with high fidelity by copying the undamaged sister chromatid. In some cancers, however, HRR is dysfunctional, resulting in homologous recombination deficiency (HRD). In these cases, DSBs are instead repaired by more error-prone pathways, including nonhomologous end joining (NHEJ) and microhomology-mediated end joining (MMEJ). Among the various causes of HRD, BRCA1/2 mutations are the most frequent and well-studied. However, genomic studies in the past decade have revealed that other alterations (e.g., mutations in other genes, epigenetic silencing of target genes, structural rearrangements) could also result in the same type of genome instability as BRCA1/2 mutations, suggesting a more expansive view of HRD cases.

Research in HRD was spurred in part by the advances in therapies that exploit the underlying defect in HRR, namely poly(ADP-ribose) polymerase (PARP) inhibitors. In HRD tumors, inhibition of the PARP1 protein leads to a multitude of unrepaired single-strand breaks (SSBs), which upon replication become DSBs. PARP inhibitors also promote an increase in DSBs by trapping PARP on the DNA, forming a PARP-DNA complex which obstructs DNA replication. When cells are overwhelmed by DSBs, cell death ensues. Since the first trial demonstrated the safety and potential efficacy of PARP inhibitors in BRCA carriers in 2009, numerous studies have examined the potential to extend the benefits of PARP inhibitors beyond the BRCA-mutant tumors to other, non-BRCA-associated cancer types. Besides PARP inhibitors, platinum-based chemotherapy has also proven effective in some HRD tumors.

Given the advances in the treatment of HRD patients, accurate and cost-effective identification of patients with HRD is of paramount importance. In particular, the development of sequencing-based assays has enabled a more precise, genome-wide characterization of genomic instability that accompanies HRD. With the increasing accessibility of genome sequencing methods in clinical settings, the design and use of optimal assays is likely to result in more effective trials and an expanded patient population that will benefit from PARP inhibitors.

In this review, we focus on genome-based methods for identifying patients with a defective HRR pathway, resulting in HRD. After briefly reviewing key components of the HRR pathway, we describe the genomic consequences of HRD, especially copy number variations (CNVs) and structural variations (SVs) observed in whole-genome sequencing (WGS) data. Then, we describe current sequencing platforms and recent advances that have increased the accuracy of HRD detection, which often involve combining multiple genomic features of HRD into a statistical framework. We pay particular attention to the use of ‘mutational signatures,’ which aim to capture a specific set of genomic features associated with distinct mutagenic sources. The last section highlights the clinical application of HRD detection, including a review of platforms used in key PARP inhibitor trials.

Homologus Recombination Deficiency

1. HRR pathway

The HRR pathway starts with the recognition of DSBs by the MRN complex (MRE11, RAD50, and NSB1) and subsequent activation of ATM, which leads to further activations of BRCA1, BRCA2, and PALB2 [1]. The exonuclease component of the MRN complex (in the presence of functional BRCA1) leads to DNA end resection from the 5′ to the 3′ end, generating a 3′ overhang coated by replication protein A (RPA). Then, RAD51 together with BRCA2 replaces RPA, assisted by BRCA1 and PALB2 [2]. This RAD51-DNA nucleoprotein complex initiates D-loop formation and strand invasion. Through such intricate interplay of different steps of repair mechanisms, the cell ensures accurate replication and maintains genome stability.

During the cell cycle, HRR preferentially occurs during the S/G2 phase when end resection can be readily performed, as end-protecting factors (53BP1 and REV7) are inactivated by CDKs [3,4]. In contrast, during other phases of the cell cycle or when HRR is defective, there is a shift towards non-HRR mediated, error-prone DNA repair, namely NHEJ and MMEJ. The NHEJ process, mediated by the Ku70-80 heterodimer complex and DNA-PKc, directly ligates the two broken ends, often creating characteristic mutations, deletions, or inter-chromosomal translocations [5]. MMEJ is mediated by PARP1, which regulates the balance between NHEJ and MMEJ through its competition with the Ku70-80 complex. Following the DNA resection step, the MMEJ process utilizes polymerase theta and microhomology features (homologous DNA sequences of 5 to 25 bp) at DNA breakpoints to join the broken ends [6]. More details on these mechanisms are reviewed elsewhere [7].

2. Characteristic footprints of genomic instability in HRD

An intact HRR pathway is crucial for genome integrity and tumor suppression, and its loss is a prevalent cause of cancer in tumor types such as ovarian cancer and triple-negative breast cancer. The leading cause of HRD is the bi-allelic loss of BRCA1 or BRCA2, most commonly due to mutations on one allele and loss of heterozygosity (LOH) of the other allele, and promoter hypermethylation of BRCA1 [8]. Beyond BRCA1/2, loss of other central HRR genes can also lead to HRD [9]. In addition to their DSB repair function, BRCA1/2 are involved in other key cellular processes such as maintenance of replication fork stability, R-loop resolution, and protection of telomere integrity, all of which contribute to maintaining genome integrity [10,11].

With impaired HRR, the increased contribution of NHEJ/MMEJ leads to characteristic patterns of genomic instability (some refer to these as “genomic scars”). Comparison of HRD and non-HRD (i.e., homologous recombination proficient, HRP) cell lines and tumors have revealed several features of HRD-specific genomic instability involving LOH, allelic imbalance, and chromosomal rearrangements [8,9,12,13]. These features all reflect improper repair of DSBs and are linked to each other—allelic imbalance, for instance, can be caused by copy number—neutral LOH, deletion, or monoallelic copy number gain. In one study, the authors examined different summary statistics of chromosomal alterations and found the number of chromosomal regions with telomeric allelic imbalance (TAI)—allelic imbalance that extends to the telomeric end of a chromosome—to be the most accurate predictor of sensitivity to cisplatin in breast cancer cell lines and associated with impaired HRR [12]. In another study, the authors found frequent large-scale chromosomal rearrangements to be characteristic of BRCA1-associated genomic instability in breast cancers and proposed the number of large-scale state transitions (LST)—chromosomal breaks between adjacent regions of at least 10 Mb—as a robust predictor of BRCA1 status [13]. A commercial HRD test used in many key clinical trials defined the genome instability score (GIS) as a simple sum of LOH events (defined as the number of sub-chromosomal LOH regions > 15 Mb), TAI, and LST scores (see later sections for more details) [14]. These measures were established more than a decade ago based on single nucleotide polymorphism (SNP) arrays, prior to the widespread use of WGS. Therefore, although they attempted to capture genome-wide characteristics of genome instability, the detected features were of low resolution, not allowing inference of detailed rearrangement types.

3. Clinical significance of HRD

Accurate detection of HRD in cancer patients is important because of its therapeutic implications. An efficacious mode of intervention is the PARP inhibitor, which targets PARP1 and PARP2 enzymes. PARP inhibitors restrict the base excision repair pathway for repairing DNA SSBs, leading to an accumulation of unrepaired SSBs, which is then converted into DSBs during replication [15]. Since cells with BRCA1/2 mutations or HRD phenotypes cannot efficiently repair DSBs, PARP inhibitors lead to synthetic lethality. Moreover, PARP inhibitors cause PARP1/2 trapping, which could be associated with drug potency among different types of PARP inhibitors [16]. The first human trial using PARP inhibitors was conducted in 2009, in which 60 patients with multiple cancer types received olaparib in recurrent settings [17]. Responses were observed in breast, ovarian, prostate, and pancreatic cancers, which are BRCA-associated cancers. Since then, PARP inhibitors have become standard care in ovarian cancer. Most notably, in the SOLO-1 trial, olaparib for front-line maintenance in ovarian cancer patients with BRCA mutations was associated with a progression-free survival (PFS) benefit of 36 months compared to placebo, with a hazard ratio of 0.30 (95% confidence interval, 0.23 to 0.41) [18]. A recently published 7-year follow-up suggests that patients continued to derive benefits even after 2 years on olaparib [19]. Although not as dramatic, efficacy of PARP inhibitor was also shown in metastatic or recurrent settings and in BRCA-associated cancer types other than ovarian, as several trials have highlighted [20-22]. Importantly, studies have suggested that patients with HRD disproportionately benefit from PARP inhibitor even when they are wild-type for BRCA (see later sections for more details). Therefore, an accurate HRD detection strategy can be pivotal for refining patient selection criteria and the expansion of cancer types for which PARP inhibitor is not currently considered.

Mutational Signature Analysis

To understand the advances in HRD prediction algorithms, it is necessary to understand the fundamentals of ‘mutational signatures,’ an analytical framework first described a decade ago [23] and now routine in cancer genome analysis. Here, we describe the key concepts and basic properties of mutational signature analysis in non-mathematical language, with a focus on HRD-associated signatures. A schematic summary of signature analysis in the context of HRD is shown in Fig. 1. More detailed reviews of mutational signature analysis are available elsewhere [24-26].

Fig. 1.

Schematic summary of signature analysis in the context of homologous recombination deficient (HRD). (A) The process of double-strand break and repair. In HRD cells, homologous recombination—a high-fidelity repair mechanism—is dysfunctional, and thus errorprone mechanisms dominate. (B) Different types of genomic alterations, SBS, insertions/deletions (ID), CNV, and SV can be used to detect HRD. Spectra indicates typical mutational patterns associated with each alteration type, and among these spectra, HRD-associated signatures and their underlying etiology has been suggested. (C) Schematic description of mutational signature approach, which involves non-negative matrix factorization (NMF) on the mutation type-by-sample matrix, to identify the building blocks (i.e., signatures) that best summarize the data. (D) HRD detection based on CNV and SV-based raw traits that do not involve signature analysis. BIR, break-induced repair; LOH, loss of heterozygosity; LST, large-scale state transitions; MMEJ, microhomology-mediated end joining; NHEJ, nonhomologous end joining; rDel, reciprocal deletion; rDup, reciprocal duplication; SSA, single-strand annealing; tAI, telomeric allelic imbalance.

1. Basic concepts and single base substitutions

Signature analysis aims to decipher the etiology of specific mutational processes based on their characteristic patterns in the genome. In the simplest case of point mutations, different types of single base substitutions (SBSs) are enriched in cancers of different tissues. For example, lung and liver tissues are specifically subject to DNA damage induced by tobacco smoking, resulting in an enrichment of C>A substitutions in the associated tumors. Furthermore, the DNA sequence context surrounding the substituted bases often provides additional information on the underlying mutagenic process. For instance, spontaneous deamination of 5-methylcytosine—an endogenous mutational process operative in nearly all tissues—creates C>T mutations almost exclusively in the CpG context. Standard signature analysis for point mutations thus centers on mutation frequency spectra of trinucleotides, consisting of the substituted base together with one adjacent base on each side, yielding 96 combinations given the four bases. It is possible to extend to larger contexts, e.g., two bases on each side for a pentamer, but such analyses appear to yield limited additional insights, at least with currently available data.

As multiple mutagenic processes are typically active in a cancer genome, its mutational spectrum reflects the cumulative effect of such processes over time. To identify what mutagenic processes might be present (de novo signature discovery), a standard approach is to perform non-negative matrix factorization (NMF) on the mutational type-by-sample matrix. Similar to principal component analysis (PCA), NMF aims to identify the building blocks (i.e., signatures) that best summarize the data; the difference compared to PCA is that all elements of the signature and the coefficients used in decomposition must be positive.

By applying NMF to large numbers of cancer genomes, catalogs of reference mutational signatures have been derived. One popular catalog is the COSMIC Mutational Signatures, which has been an indispensable resource in the field. The latest version (v3.4, 2023) was largely derived from 2,780 cancer genomes. An additional > 1,800 genomes and > 19,000 exomes were used to assess stability and reproducibility [23,27]. Several curated signatures derived from other sources were also included in the catalog. For SBSs, the COSMIC catalog contains 86 signatures, 67 of which are thought to be of biological origin, whereas the rest are suspected to be possible sequencing artifacts. Such biological annotations are possible via population association studies or experimental studies, e.g., by the sequencing of cell lines after exposure to mutagens or the knock-down of genes with known function in DNA repair. Examples of signatures with well-established etiologies include SBS1 (spontaneous deamination of 5-methylcytosine), SBS2/SBS13 (activity of the APOBEC cytidine deaminases), SBS4 (tobacco smoking), and SBS7 (UV light exposure).

Catalogs of reference mutational signatures are evolving. As more genomes of cancer and other diseases are being sequenced, new signatures will be discovered [28]. Known signatures may also be refined or revised. For example, with more samples, there may be sufficient statistical power to further separate an existing signature into multiple ones [29]. It is important to note that deriving reference catalogs is a challenging process that can be error-prone and often requires manual intervention. For example, one needs to be cautious when analyzing cohorts with hypermutated samples, as a few hypermutated samples may bias the signature discovery process by contributing a disproportionately large number of mutations. Particularly challenging are ‘flat’ signatures—where the spectra are more or less even across the triplets—as the underlying signatures become difficult to separate mathematically. Therefore, it is important to verify the robustness of discovered signatures in orthogonal datasets or through experiments.

Given that a fairly stable catalog has been generated, a common procedure in signature analysis is to ‘refit’ a given spectrum with a combination of catalog signatures, i.e., estimating the weighting factor for each signature so that the signatures sum to the original spectrum with minimum error. As is the case for deriving reference catalogs, refitting is not a clearly-defined process. Depending on the choice of the algorithm and the thresholds therein, a sample may be found to contain a range of different signatures. To decrease the likelihood of a false positive signature, one should consider limiting the analysis only to a subset of signatures that are likely to be present in a given tissue as informed by previous studies of large cohorts, although this will prevent the discovery of a new biological process operative in the sample.

2. Alterations beyond SBSs

Recent signature analysis has incorporated mutation types beyond SBSs, including double-base substitutions (e.g., CC>TT), indels, CNVs, and SVs. These alterations can be identified from sequencing data using a wide range of computational tools, as summarized in our recent review [30]. Double-base substitutions are relatively rare, with a burden of about 1% of that of SBSs, and their signatures are defined without the surrounding sequence context to avoid the extremely large number of possible categories. For other mutation types, ad hoc definitions are used to strike a balance between capturing sufficient details of the mutation context and avoiding an excess of categories, which would lead to overly sparse spectra. Specifically, for indels, it is not feasible to enumerate all possible inserted or deleted sequences. The standard indel signature definition thus classifies indels primarily based on their sizes and whether they occur at repeat regions (except for 1-bp indels, which are further separated based on the affected nucleotide). Deletions with microhomologies are further put into separate classes, as they are often informative of specific mutational processes [27,31]. Signature definitions of CNVs and SVs are even more challenging and less established [32-36]. The current COSMIC CNV signatures are mainly defined by the total copy number (1, 2, 3-4, 5-8, 9+) and the size (1-100 kb, 100 kb-1 Mb, 1-10 Mb, 10-40 Mb, > 40 Mb) of each segment, while incorporating the heterozygosity status.

3. Mutational signatures associated with HRD

Previous studies have linked HRD with specific mutational signatures, most notably SBS signature 3 (SBS3), characterized by a roughly uniform distribution across all 96 substitution types. First identified as correlating with BRCA1/2 mutations in breast cancers [36,37], SBS3 was later found to be associated with other HRR gene alterations as well, such as epigenetic inactivation of BRCA1 and RAD51C or PALB2 mutations [38]. The association between SBS3 and HRD was validated by knockout experiments of BRCA1/2 genes initially [39] and other HRR-genes later [40]. The mechanistic origin of SBS3 in HRD tumors is still an open question. Proposed mechanisms involve polymerase theta in MMEJ [41] or REV1/REV3L in translesion synthesis [42], although in another study the double knockout of BRCA1 and REV1 still exhibited SBS3 [43].

Other non-SBS signatures associated with HRD are indel signatures ID6 and ID8, as well as rearrangement signatures RS3 and RS5 [27,36,38-40]. ID6 mainly consists of ≥ 5-bp deletions with overlapping microhomologies at deletion boundaries, suggesting an MMEJ origin. ID8 also contains ≥ 5-bp deletions, but does not show an enrichment of long microhomologies, suggesting NHEJ as its underlying cause. In addition, ID8 was found to be enriched in tumors with topoisomerase mutations [27,44]. Rearrangement signatures are defined based on the type (deletions, tandem duplications, inversions, and translocations) and size of the rearrangements, as well as whether they occur in clusters [36]. Clustered rearrangements are likely to have originated from the same complex event, thus pointing towards distinct mutational processes. The two HRD-associated rearrangement signatures correspond to different underlying causes of HRD. Specifically, RS3, characterized by small tandem duplications of < 10 kb is associated with BRCA1 mutations, whereas RS5, characterized by deletions of 10-100 kb is associated with BRCA2 mutations [45] and enriched in cases with PALB2 or RAD51C losses [40,46]. Of note, RS5 is also present at lower amplitudes in HRP tumors, such as a large fraction of lung, liver, and cervical cancers and melanomas [47]. Furthermore, enrichment in deletions with homeology (approximately homologous sequences of > 50 bp) in BRCA2 mutated samples suggests a potential link between single-strand annealing and RS5 [48,49]. Finally, a recent study using “linked-read” sequencing (Illumina technology that can track fragments from the same > 50 kb DNA molecule but is now discontinued due to patent issues) found complex SVs in BRCA1/2-deficient tumors that were missed by short-read data. Rather than balanced translocation or inversion, near-reciprocal rearrangement with copy loss or gain of the intervening regions were found to be enriched in these tumors [49].

Although a COSMIC mutational signature catalog has also been generated for CNVs, it has not been as informative for HRD prediction. First, other features, especially microhomology-mediated deletions and SBS3, are more effective than a CNV-based measure. Second, GIS score or its variants (e.g., HRD index [50]) already capture the key features in large CNVs, even though those scores are ad hoc measures of genome instability. In two recent CNV signature analysis approaches [32,34], the copy number patterns are defined using copy number states, heterozygosity, segment length distributions, breakpoint density, and other related measures. Not surprisingly, the signatures linked with HRD in both papers, CN17 [32] and CX3 [34], have segment length distribution similar to that used to define components of GIS (each paper used its own CNV catalog with different signature names). Third, CNV signatures are more complicated to construct due to the difficulty in identifying whole-genome duplication regions accurately. For example, the most common copy number states in CN17 are 3 and 4, suggesting that this signature may be capturing events only in samples with whole-genome duplication rather than a broad signature that applies to all HRD samples.

4. Future improvements in mutational signature analysis

Signature analysis has become a standard tool in interpretation of somatic mutations, but there are several shortcomings with current approaches. For example, the observed mutations and their signatures reflect the residual imprints of errors after the repair processes have taken place, rather than the initial DNA damage. This becomes apparent when DNA damage introduced in cells with a deficient repair pathway leads to specific mutational patterns [51,52]. To address this problem, explicit modeling of the damage-repair interaction has been proposed [53]. Another challenge is that mutations do not accumulate uniformly along the genome, as both DNA damage and repair vary depending on locusspecific features such as transcription activity, replication timing, presence of DNA-binding proteins, and epigenetic modifications [52,54]. New signature analysis methods are being developed to incorporate these additional attributes [55]. Even within the standard signature analysis framework, important methodological issues remain. In particular, the NMF algorithm at the center of de novo signature discovery methods has an intrinsic problem of producing non-unique solutions; the procedure of deriving a reference signature catalog needs a method less dependent on manual interventions; and signature refitting is lacking a statistically sound algorithm that minimizes incorrect signature assignments. We have recently developed Mutational Signature Calculator (MuSiCal), a rigorous analytical framework for signature analysis that incorporates several algorithms to tackle some of these issues [56].

Current Assays for HRD Detection

Existing HRD detection approaches can be classified based on what genomic information they rely on and which sequencing platform they use. A summary of genomic information relevant to HRD detection for each sequencing platform is shown in Table 1 [50,57-59].

The pros and cons of different sequencing methods that allow clinical HRD detection

1. Panel-based HRD detection method

An early approach for HRD detection, prior to the development of mutational signatures, is based on mutations of genes associated with HRR. These typically include PALB2, BARD1, BRIP1, RAD51B, RAD51C, RAD51D, ATM, FAAP20, CHEK2, FAN1, FANCE, FANCM, and POLQ among others [60]. However, applying this gene-based approach to clinical HRD detection has been challenging for several reasons. First, clinical studies have utilized slightly different sets of HRR genes, and the low prevalence of mutations in individual HRR genes makes it difficult to assess the clinical significance of each gene. Second, mutations of HRR genes, when followed by an LOH event (thus creating a bi-allelic loss), are more likely to be pathogenic. But determining the monoallelic vs. bi-allelic status of a mutation is difficult, especially when the tumor purity is low. Third, evaluating the pathogenicity of missense mutations in HRR genes is challenging, as variants of unknown significance are common except for the well-studied BRCA1/2.

Clinical trials involving PARP inhibitors have frequently utilized panel sequencing to identify patients with BRCA mutations or other HRR gene mutations in prospective randomized trials. Their inclusion criteria were often shaped by the initial trials that led to the approval of PARP inhibitors in each cancer type. Regardless of cancer type, BRCA mutations were associated with a favorable response to PARP inhibitor, whereas a clinical benefit was not consistently observed for non-BRCA, HRR mutations. In prostate cancer, BRCA1, BRCA2, ATM, and other 12 prespecified HRR genes were used as inclusion criteria for the first trial that led to PARP inhibitor approval in prostate cancer [61]; thus, non-BRCA, HRR genes are still frequently used as the inclusion criteria. In contrast, in ovarian cancer, post-hoc analyses of major PARP inhibitor trials have shown mixed findings—either a trend of improved response or no improvement in HRR-mutated subgroup [62,63]. There is ongoing research to understand the clinical significance of non-BRCA HRR gene mutations, such as identifying the tumor-specific context in which these mutations arise and determining which specific mutations drive therapeutic response.

Although panel sequencing is routinely performed in many cancer hospitals to detect cancer-associated hotspot mutations, the number of identified mutations from panels is generally > 1,000-fold smaller than from WGS, rendering standard mutational signature analysis approaches ineffective. The HRD-associated SBS3 is particularly difficult to detect because its “flat” profile requires more mutations to ascertain its presence with confidence. To ameliorate this situation, we have previously developed a computational method called Signature Multivariate Analysis (SigMA) to identify SBS3 from targeted gene panels [57]. The key innovations were leveraging existing WGS data to learn expected signature combinations for each tumor type and developing a robust classification framework with machine learning. We validated our method by comparing SigMA-derived SBS3 calls with BRCA mutations and comparing results obtained from paired panel and exome data from the same patients [64]. We also showed that the SBS3 status inferred from panel data using SigMA predicted the response of breast cancers to PARP inhibitors and outperformed existing clinical HRD classification strategies [64,65]. Despite the reduced sensitivity and specificity compared to WGS, SigMA enables mutational signature analysis to be performed using panel sequencing data.

2. Detecting HRD-associated genome instability based on copy number profiling

Another popular approach for HRD detection is to measure HRD-associated large-scale copy number alterations, ideally in conjunction with point mutation detection in HRR genes. The commercial test Myriad myChoice provides a ‘Genome Instability Score’ (GIS) based on the three copy numberbased statistics described earlier (TAI, telomeric allelic imbalance; LOH, loss of heterozygosity; and LST, large-scale state transitions), along with the BRCA1/2 mutation status. A GIS cutoff of ≥ 42 was defined based on 95% sensitivity to detect breast cancers with BRCA1/2 mutations and BRCA1 promoter methylations [14]. GIS was initially computed from SNP arrays, but now high-throughput sequencing is used. The Foundation Medicine assay FoundationOne uses the BRCA1/2 mutation status and the genomic LOH percentage based on targeted panel sequencing to detect HRD. In addition to HRD status, it reports pathogenic mutation status of 324 cancer-related genes, including HRR-associated genes.

3. Enhanced HRD characterization with WGS

WGS enables the most unbiased and comprehensive quantification of mutational signature. In addition to a larger number of single nucleotide mutations, WGS allows detection of various structural changes, such as large insertions, deletions, and rearrangements [30], thus enabling analysis of copy number and rearrangement signatures. Although it is possible to detect large-scale CNVs from exome data, WGS enables more accurate CNV analysis, with the resolution of CNV dependent on sequencing coverage.

The first method for HRD detection based on WGS is HRDetect (2017), which uses mutational signatures SBS3, SBS8, RS3, and RS5, deletions with microhomology, and the HRD index (a variant of GIS) as covariates for lasso logistic regression modeling. HRDetect was trained on breast cancer samples with BRCA1/2 deficiency, and this classifier was later also applied to other tumor types [47,50]. The most impactful predictors were the proportion of deletions with microhomology and SBS3, followed by RS3, RS5, and HRD index. Another WGS-based algorithm is CHORD (Classifier of HOmologous Recombination Deficiency) [58], which utilizes a random-forest classifier with 29 features, many of which correspond to components of the mutational signatures used in HRDetect. The two features that contribute the most in CHORD were ≥ 2 bp deletions with flanking microhomology and duplications of size 1-100 kb.

These two methods when applied to multiple tumor types provided largely concordant results [58], e.g., having the deletions with ≥ 2 bp flanking microhomology as one of the most predictive features of HRD. Pan-cancer HRD classification is a challenging task. For HRDetect, training based on breast cancer data may not provide the optimal classification performance for other tumor types. For CHORD, although training was performed in a pan-cancer setting, the positive training set was largely dominated by tumor types typically associated with HRD, such as breast and ovarian cancers. To resolve this problem, more WGS samples must be available publicly for tumor types with low HRD prevalence, so that tumor type-specific training can be performed.

A major question in the design of a WGS assay for HRD is the sequencing coverage. If the aim is simply to measure genome instability alone, it is possible to obtain reasonably accurate calls from shallow coverage. For example, ShallowHRD used ~1× data to measure large-scale CNVs by a statistic similar to LST for SNP arrays, and found a high-concordance with the SNP array-based calls [59]. The same group subsequently showed that shallowHRDv2 had 94% agreement with Myriad myChoice across 449 high-grade ovarian cancers and that patients with HRD according to shallowHRDv2 had longer PFS [66]. While we suspect that not having other features (e.g., microhomology-mediated indels and SBS3) make this less sensitive and less precise, shallow coverage sequencing may be a simple, low-cost alternative to currently available commercial tests.

4. HRD testing in PARP inhibitor trials

In this section, we summarize the key clinical trials that led to the Food and Drug Administration (FDA) approval of PARP inhibitors, focusing on HRD testing platforms and the efficacy of PARP inhibitors with respect to HRD stratification (Table 2) [18,20,67-78]. Across various trials and cancer types, PARP inhibitor efficacy varies widely, likely due to differences in cancer specificity and the treatment setting, since PARP inhibitors had to be incorporated into existing treatment flow. However, consistently across trials, patients with pathogenic BRCA mutations disproportionately benefit from PARP inhibitors compared to those with BRCA wild-type disease. Consequently, PARP inhibitors have been approved for patients with BRCA mutations in ovarian, breast, prostate, and pancreatic cancers. Beyond BRCA mutations, patients with HRD tumors also benefit from PARP inhibitors compared to those with HRP tumors. However, this summary also highlights the considerable differences in how HRD was tested and defined across these trials (e.g., HRR mutation or GIS, SNP array or sequencing, commercial test or in-house assay); the approved indications are driven by the diagnostic platforms available at the time of the study. Efforts to further refine HRD classification and expand the indication for PARP inhibitors to other cancer types would require an understanding of the biological differences in cancer types that underlie the HRD phenotype, as well as careful design of prospective trials with respect to inclusion criteria, testing platforms, prespecified analysis plans, and sample size.

Summary of FDA approvals in PARP inhibitors in respective cancer types, and corresponding clinical trials and HRD testing platforms used

Perspectives for the Future

1. Incorporating WGS-based HRD detection in the clinic

In addition to more comprehensive HRD detection, WGS provides a wealth of information about the genome-wide mutational profile of the tumor, thereby enabling other types of analysis that should be of interest to physician-scientists. Despite these advantages, WGS is not typically incorporated in large prospective trials or in clinics yet. To implement WGS-based assays, several issues must be resolved.

With the introduction of the latest Illumina sequencers (e.g., NovaSeq X) in 2023, standard 30× short-read WGS for germline analysis is now < $400, including the cost of library preparation. If HRD detection is the sole purpose, low coverage WGS can be performed at a much lower cost. For research projects, tumors are generally sequenced at a higher depth (e.g., 60-90×) for somatic variant analysis, which amounts to ~$1,000. With this reduction in sequencing cost, even the cost of high-coverage WGS is relatively small compared to the current cost of FDA-approved HRD tests ($3,500 for Foundation CDx or Foundation liquid CDx; $4,000-6,000 for Myriad myChoice CDx). Thus, sequencing cost is no longer the main bottleneck. Furthermore, several new short-read platforms (Ultima Genomics, Complete Genomics, MGI, Element Biosciences, Singular Genomics) have been introduced just in the past couple of years, with many of them advertising $200 genomes. The cost reduction in long-read sequencing on the PacBio and Nanopore platforms is also changing the landscape of WGS studies.

A much greater challenge for WGS than the sequencing cost is the expertise and infrastructure for downstream analysis. At > 100 GB per 30× genome, any sequencing effort will require processing and storing of terabyte-scale data, requiring a substantial investment in a team of bioinformatics experts and infrastructure [79]. Comprehensive variant identification and interpretation for WGS is an art rather than a science, especially for non-coding variants, structural variants, and coding variants of unknown pathogenicity. Therefore, scientific leadership as well as recruitment and retention of experienced bioinformaticians will be needed. For infrastructure, an existing institutional computing environment may be sufficient for small projects, but additional servers are likely to be needed for larger ones. It is possible to avoid building a local computing environment by utilizing a cloud-based platform such as Amazon Web Services (AWS) or Google Cloud, but this necessitates additional level of informatics expertise or subscription to an informatics platform by a commercial vendor. Computing and storing data on the cloud can also be costly and may require regulatory approval in some institutions and/or countries.

Despite these limitations, WGS studies are continuing to grow in scale and are discovering new insights on driver mutations [80]. For instance, a recent study [81] from the UK 100,000 Genomes Project analyzed > 13,000 solid tumors with treatment outcome data, finding that some tumor types showed actionable structural variants in more than 10% of the cases. Importantly, that study also found that HRD was identified in 40% of high-grade serous ovarian cases with 30% linked to pathogenic germline variants.

Regardless of the progress on WGS data generation and analysis, panel sequencing is likely to continue in the clinic due to the extremely high-coverage needed in that setting. Gene panels in a real patient setting are often sequenced at > 1,000× to detect clinically-relevant variants with very low variant allele fraction (VAF). A study of > 5,000 clinical samples in Samsung Medical Center showed that 10%-25% of hotspot mutations have VAF under 5% and that a substantial fraction of these might be missed if the coverage is lower. For example, 24% of EGFR T790M had < 5% VAF, and 25% of all T790M mutations would be missed if the sequencing coverage is 100× rather than 1,500× [82]. These VAFs are low partly because tumor purity of clinical samples is often low, whereas VAFs in research sequencing projects tend to be higher because a tumor purity threshold is often imposed, e.g., the distribution of VAFs in The Cancer Genome Atlas data are substantially higher than that of real clinical samples [82]. Performing WGS at 1,000× is impractical not only because of the cost, but because of the difficulty in handling and analyzing such data, as one sample would generate more than > 10 TB of data and many WGS algorithms would fail to run. One likely scenario in a research hospital setting would be to perform both a high-coverage panel for hotspot mutations and ~60-90× WGS for a comprehensive characterization of mutations.

Another challenge for WGS adoption is the question of whether high-quality data can be obtained from formalin-fixed paraffin-embedded (FFPE) samples. Formalin-induced DNA damage in FFPE samples results in artifact mutations, which can confound the identification of true variants. In some cases, the problem of artifactual point mutations, mostly C to T transitions due to deamination of cytosine residues, can be ameliorated with a series of filters. For mutational signature analysis, FFPE mutations correspond to SBS30 and can thus be isolated. FFPE samples also contain a large number of artifactual chimeric reads [83] that can be mistaken for structural alterations. One simple way to lessen the impact of FFPE-specific artifacts is to increase coverage, as artifacts are less likely to occur at the same position repeatedly and higher coverage makes it easier to distinguish them from true mutations.

2. Analyzing PARP inhibitor resistance in longitudinally collected samples

A topic of growing interest is the mechanisms of acquired PARP inhibitor resistance such as BRCA reversion mutations. These mutations refer to the secondary mutations that can restore the native reading frame of the mutated gene, thereby reducing sensitivity to platinum or PARP inhibitors. One meta-analysis examined BRCA reversion in four BRCA-associated cancer types (breast, ovarian, prostate, and pancreatic, with the majority being ovarian) after platinum or PARP inhibitor treatments and found reversion mutations in 22% of BRCA1 and 30.7% of BRCA2 cases [84]. BRCA reversion rates can vary substantially depending on prior therapy and response to prior therapy, e.g., one study (n=69) reporting BRCA reversion rates of 2% in platinum-sensitive, 13% in platinum-resistant, and 18% in platinum-refractory cancers [85]. In terms of the mechanisms responsible for BRCA reversion, the majority of the reversion mutations were deletions mediated by NHEJ, often involving sequence microhomologies indicative of MMEJ, especially for BRCA2 [84,86]. However, other reversions do not seem to utilize microhomologies, suggesting that further research is needed to understand the DNA repair or mutagenic processes that generate the reversion mutations. There are also non-BRCA associated PARP inhibitor resistance mechanisms that are either homologous recombination-dependent (i.e., the change from HRD to HRP phenotype) or independent, as detailed in separate reviews [87-89].

For a refined view of how PARP inhibitor resistance evolves under therapeutic pressure, an ideal strategy would be to non-invasively and serially collect circulating tumor DNA (ctDNA) with liquid biopsy. Recent studies showed that resistance mechanisms to PARP inhibitor can be detected from plasma [86,90], with one study finding BRCA reversion mutations in ctDNA in 60% of patients who develop resistance and that detection of a reversion mutation was associated with shorter time to progression [86]. To bring these scientific findings into clinic, a prospective clinical trial is needed address how these specific resistance mechanisms may inform clinical decision making. Such a trial would require meticulous genomic characterization of the tumor at initial diagnosis and at the time of PARP inhibitor resistance acquisition, accompanied by genomic sequencing of serially obtained blood samples.

In addition to liquid biopsy, another emerging technology relevant for HRD studies is single cell RNA sequencing (scRNA-seq). Transcriptomic analysis using scRNA-seq will help characterize the heterogeneity of the cell populations that display HRD signatures, including the interaction between DNA repair-deficient cells and their tumor microenvironment [91]. Recent advances in computational methods allow identification of point mutations and copy number profiles from scRNA-seq data [92,93], which can potentially be used to identify single cells directly from their mutational profile rather than relying on their gene expression patterns. Applications of recent methods in single cell multi-omics [94], e.g., sequencing DNA and RNA simultaneously from the same single cells, may also prove informative in characterizing genetic and phenotypic heterogeneity of HRD samples.

Conclusion

The methods for detecting HRD continue to evolve as we obtain a more accurate view of the mutations associated with HRD. In particular, WGS is allowing us to characterize HRD-associated genomic instability more accurately and thus refine the GIS currently being used in HRD assays. Although incorporation of WGS in clinical trials requires additional costs and bioinformatics expertise, such data will provide more accurate prediction of HRD status as well as better assessment of the correlation between HRD status and response to therapy. Both for panel and WGS data, mutational signature analysis has provided a more robust approach to identify HRD, and this approach should also be applicable to liquid biopsies, which will be particularly useful for investigating acquired resistance mechanisms.

Despite the tremendous progress in expanding the potential pool of patients for PARP inhibitor treatment, the eligibility criteria and treatment settings (early vs. late stage, front-line vs. recurrent) are highly variable across tumor types. A more systematic understanding of de novo and acquired resistance, tumor type-specific effects, impact of front-line therapy, and the correlation between genotype and genomic instability will be needed to design more effective therapeutic strategies for patients with HRD.

Notes

Author Contributions

Conceived and designed the analysis: Kim YN, Gulhan DC, Park PJ.

Collected the data: Kim YN, Gulhan DC, Jin H, Glodzik D, Park PJ.

Contributed data or analysis tools: Jin H, Glodzik D.

Performed the analysis: Kim YN, Gulhan DC, Park PJ.

Wrote the paper: Kim YN, Gulhan DC, Jin H, Glodzik D, Park PJ.

Conflicts of Interest

Gulhan DC and Park PJ hold the patent for the SigMA algorithm.

Acknowledgements

This work was supported by the grant from the US National Institutes of Health to PJP (R01CA269805).

References

1. Bian L, Meng Y, Zhang M, Li D. MRE11-RAD50-NBS1 complex alterations and DNA damage response: implications for cancer treatment. Mol Cancer 2019;18:169.
2. Whelan DR, Lee WT, Yin Y, Ofri DM, Bermudez-Hernandez K, Keegan S, et al. Spatiotemporal dynamics of homologous recombination repair at single collapsed replication forks. Nat Commun 2018;9:3882.
3. Ghezraoui H, Oliveira C, Becker JR, Bilham K, Moralli D, Anzilotti C, et al. 53BP1 cooperation with the REV7-shieldin complex underpins DNA structure-specific NHEJ. Nature 2018;560:122–7.
4. Aylon Y, Liefshitz B, Kupiec M. The CDK regulates repair of double-strand breaks by homologous recombination during the cell cycle. EMBO J 2004;23:4868–75.
5. Zhao B, Rothenberg E, Ramsden DA, Lieber MR. The molecular basis and disease relevance of non-homologous DNA end joining. Nat Rev Mol Cell Biol 2020;21:765–81.
6. Truong LN, Li Y, Shi LZ, Hwang PY, He J, Wang H, et al. Microhomology-mediated end joining and homologous recombination share the initial end resection step to repair DNA double-strand breaks in mammalian cells. Proc Natl Acad Sci U S A 2013;110:7720–5.
7. Scully R, Panday A, Elango R, Willis NA. DNA double-strand break repair-pathway choice in somatic mammalian cells. Nat Rev Mol Cell Biol 2019;20:698–714.
8. Scully R, Livingston DM. In search of the tumour-suppressor functions of BRCA1 and BRCA2. Nature 2000;408:429–32.
9. Lim BW, Li N, Mahale S, McInerny S, Zethoven M, Rowley SM, et al. Somatic inactivation of breast cancer predisposition genes in tumors associated with pathogenic germline variants. J Natl Cancer Inst 2023;115:181–9.
10. Ray Chaudhuri A, Callen E, Ding X, Gogola E, Duarte AA, Lee JE, et al. Replication fork stability confers chemoresistance in BRCA-deficient cells. Nature 2016;535:382–7.
11. San Martin Alonso M, Noordermeer SM. Untangling the crosstalk between BRCA1 and R-loops during DNA repair. Nucleic Acids Res 2021;49:4848–63.
12. Birkbak NJ, Wang ZC, Kim JY, Eklund AC, Li Q, Tian R, et al. Telomeric allelic imbalance indicates defective DNA repair and sensitivity to DNA-damaging agents. Cancer Discov 2012;2:366–75.
13. Popova T, Manie E, Rieunier G, Caux-Moncoutier V, Tirapo C, Dubois T, et al. Ploidy and large-scale genomic instability consistently identify basal-like breast carcinomas with BRCA1/2 inactivation. Cancer Res 2012;72:5454–62.
14. Telli ML, Timms KM, Reid J, Hennessy B, Mills GB, Jensen KC, et al. Homologous recombination deficiency (HRD) score predicts response to platinum-containing neoadjuvant chemotherapy in patients with triple-negative breast cancer. Clin Cancer Res 2016;22:3764–73.
15. Ali AA, Timinszky G, Arribas-Bosacoma R, Kozlowski M, Hassa PO, Hassler M, et al. The zinc-finger domains of PARP1 cooperate to recognize DNA strand breaks. Nat Struct Mol Biol 2012;19:685–92.
16. Murai J, Huang SY, Das BB, Renaud A, Zhang Y, Doroshow JH, et al. Trapping of PARP1 and PARP2 by Clinical PARP Inhibitors. Cancer Res 2012;72:5588–99.
17. Fong PC, Boss DS, Yap TA, Tutt A, Wu P, Mergui-Roelvink M, et al. Inhibition of poly(ADP-ribose) polymerase in tumors from BRCA mutation carriers. N Engl J Med 2009;361:123–34.
18. Moore K, Colombo N, Scambia G, Kim BG, Oaknin A, Friedlander M, et al. Maintenance olaparib in patients with newly diagnosed advanced ovarian cancer. N Engl J Med 2018;379:2495–505.
19. DiSilvestro P, Banerjee S, Colombo N, Scambia G, Kim BG, Oaknin A, et al. Overall survival with maintenance olaparib at a 7-year follow-up in patients with newly diagnosed advanced ovarian cancer and a BRCA mutation: the SOLO1/GOG 3004 trial. J Clin Oncol 2023;41:609–17.
20. Ledermann J, Harter P, Gourley C, Friedlander M, Vergote I, Rustin G, et al. Olaparib maintenance therapy in platinum-sensitive relapsed ovarian cancer. N Engl J Med 2012;366:1382–92.
21. Tutt A, Robson M, Garber JE, Domchek SM, Audeh MW, Weitzel JN, et al. Oral poly(ADP-ribose) polymerase inhibitor olaparib in patients with BRCA1 or BRCA2 mutations and advanced breast cancer: a proof-of-concept trial. Lancet 2010;376:235–44.
22. Mateo J, Carreira S, Sandhu S, Miranda S, Mossop H, Perez-Lopez R, et al. DNA-repair defects and olaparib in metastatic prostate cancer. N Engl J Med 2015;373:1697–708.
23. Alexandrov LB, Nik-Zainal S, Wedge DC, Aparicio SA, Behjati S, Biankin AV, et al. Signatures of mutational processes in human cancer. Nature 2013;500:415–21.
24. Koh G, Degasperi A, Zou X, Momen S, Nik-Zainal S. Mutational signatures: emerging concepts, caveats and clinical applications. Nat Rev Cancer 2021;21:619–37.
25. Brady SW, Gout AM, Zhang J. Therapeutic and prognostic insights from the analysis of cancer mutational signatures. Trends Genet 2022;38:194–208.
26. Steele CD, Pillay N, Alexandrov LB. An overview of mutational and copy number signatures in human cancer. J Pathol 2022;257:454–65.
27. Alexandrov LB, Kim J, Haradhvala NJ, Huang MN, Tian Ng AW, Wu Y, et al. The repertoire of mutational signatures in human cancer. Nature 2020;578:94–101.
28. Degasperi A, Zou X, Amarante TD, Martinez-Martinez A, Koh GCC, Dias JM, et al. Substitution mutational signatures in whole-genome-sequenced cancers in the UK population. Science 2022;376:science.abl9283.
29. Senkin S, Moody S, Diaz-Gay M, Abedi-Ardekani B, Cattiaux T, Ferreiro-Iglesias A, et al. Geographic variation of mutagenic exposures in kidney cancer genomes. Preprint at medRxiv: https://doi.org/10.1101/2023.06.20.23291538. 2023;
30. Cortes-Ciriano I, Gulhan DC, Lee JJ, Melloni GE, Park PJ. Computational analysis of cancer genome sequencing data. Nat Rev Genet 2022;23:298–314.
31. Kucab JE, Zou X, Morganella S, Joel M, Nanda AS, Nagy E, et al. A compendium of mutational signatures of environmental agents. Cell 2019;177:821–36.
32. Steele CD, Abbasi A, Islam SM, Bowes AL, Khandekar A, Haase K, et al. Signatures of copy number alterations in human cancer. Nature 2022;606:984–91.
33. Macintyre G, Goranova TE, De Silva D, Ennis D, Piskorz AM, Eldridge M, et al. Copy number signatures and mutational processes in ovarian carcinoma. Nat Genet 2018;50:1262–70.
34. Drews RM, Hernando B, Tarabichi M, Haase K, Lesluyes T, Smith PS, et al. A pan-cancer compendium of chromosomal instability. Nature 2022;606:976–83.
35. Li Y, Roberts ND, Wala JA, Shapira O, Schumacher SE, Kumar K, et al. Patterns of somatic structural variation in human cancer genomes. Nature 2020;578:112–21.
36. Nik-Zainal S, Davies H, Staaf J, Ramakrishna M, Glodzik D, Zou X, et al. Landscape of somatic mutations in 560 breast cancer whole-genome sequences. Nature 2016;534:47–54.
37. Alexandrov LB, Nik-Zainal S, Wedge DC, Campbell PJ, Stratton MR. Deciphering signatures of mutational processes operative in human cancer. Cell Rep 2013;3:246–59.
38. Polak P, Kim J, Braunstein LZ, Karlic R, Haradhavala NJ, Tiao G, et al. A mutational signature reveals alterations underlying deficient homologous recombination repair in breast cancer. Nat Genet 2017;49:1476–86.
39. Zamborszky J, Szikriszt B, Gervai JZ, Pipek O, Poti A, Krzystanek M, et al. Loss of BRCA1 or BRCA2 markedly increases the rate of base substitution mutagenesis and has distinct effects on genomic deletions. Oncogene 2017;36:746–55.
40. Poti A, Gyergyak H, Nemeth E, Rusz O, Toth S, Kovacshazi C, et al. Correlation of homologous recombination deficiency induced mutational signatures with sensitivity to PARP inhibitors and cytotoxic agents. Genome Biol 2019;20:240.
41. Patterson-Fortin J, D’Andrea AD. Exploiting the microhomology-mediated end-joining pathway in cancer therapy. Cancer Res 2020;80:4593–600.
42. Gyure Z, Poti A, Nemeth E, Szikriszt B, Lozsa R, Krawczyk M, et al. Spontaneous mutagenesis in human cells is controlled by REV1-polymerase zeta and PRIMPOL. Cell Rep 2023;42:112887.
43. Chen D, Gervai JZ, Poti A, Nemeth E, Szeltner Z, Szikriszt B, et al. BRCA1 deficiency specific base substitution mutagenesis is dependent on translesion synthesis and regulated by 53BP1. Nat Commun 2022;13:226.
44. Boot A, Liu M, Stantial N, Shah V, Yu W, Nitiss KC, et al. Recurrent mutations in topoisomerase IIalpha cause a previously undescribed mutator phenotype in human cancers. Proc Natl Acad Sci U S A 2022;119e2114024119.
45. Willis NA, Frock RL, Menghi F, Duffey EE, Panday A, Camacho V, et al. Mechanism of tandem duplication formation in BRCA1-mutant cells. Nature 2017;551:590–5.
46. Menghi F, Inaki K, Woo X, Kumar PA, Grzeda KR, Malhotra A, et al. The tandem duplicator phenotype as a distinct genomic configuration in cancer. Proc Natl Acad Sci U S A 2016;113:E2373–82.
47. Degasperi A, Amarante TD, Czarnecki J, Shooter S, Zou X, Glodzik D, et al. A practical framework and online tool for mutational signature analyses show inter-tissue variation and driver dependencies. Nat Cancer 2020;1:249–63.
48. Stark JM, Pierce AJ, Oh J, Pastink A, Jasin M. Genetic steps of mammalian homologous repair with distinct mutagenic consequences. Mol Cell Biol 2004;24:9305–16.
49. Setton J, Hadi K, Choo ZN, Kuchin KS, Tian H, Da Cruz Paula A, et al. Long-molecule scars of backup DNA repair in BRCA1- and BRCA2-deficient cancers. Nature 2023;621:129–37.
50. Davies H, Glodzik D, Morganella S, Yates LR, Staaf J, Zou X, et al. HRDetect is a predictor of BRCA1 and BRCA2 deficiency based on mutational signatures. Nat Med 2017;23:517–25.
51. Volkova NV, Meier B, Gonzalez-Huici V, Bertolini S, Gonzalez S, Vohringer H, et al. Mutational signatures are jointly shaped by DNA damage and repair. Nat Commun 2020;11:2169.
52. Haradhvala NJ, Kim J, Maruvka YE, Polak P, Rosebrock D, Livitz D, et al. Distinct mutational signatures characterize concurrent loss of polymerase proofreading and mismatch repair. Nat Commun 2018;9:1746.
53. Wojtowicz D, Hoinka J, Amgalan B, Kim YA, Przytycka TM. RepairSig: deconvolution of DNA damage and repair contributions to the mutational landscape of cancer. Cell Syst 2021;12:994–1003.
54. Tomkova M, Tomek J, Kriaucionis S, Schuster-Bockler B. Mutational signature distribution varies with DNA replication timing and strand asymmetry. Genome Biol 2018;19:129.
55. Vohringer H, Hoeck AV, Cuppen E, Gerstung M. Learning mutational signatures and their multidimensional genomic properties with TensorSignatures. Nat Commun 2021;12:3628.
56. Jin H, Gulhan DC, Ben-Isvy D, Geng D, Ljungstrom V, Park PJ. Accurate and sensitive mutational signature ana-lysis with MuSiCal. Preprint at bioRxiv: https://doi.org/10.1101/2022.04.21.489082 (2022).
57. Gulhan DC, Lee JJ, Melloni GE, Cortes-Ciriano I, Park PJ. Detecting the mutational signature of homologous recombination deficiency in clinical samples. Nat Genet 2019;51:912–9.
58. Nguyen L, Martens JW, Van Hoeck A, Cuppen E. Pan-cancer landscape of homologous recombination deficiency. Nat Commun 2020;11:5584.
59. Eeckhoutte A, Houy A, Manie E, Reverdy M, Bieche I, Marangoni E, et al. ShallowHRD: detection of homologous recombination deficiency from shallow whole genome sequencing. Bioinformatics 2020;36:3888–9.
60. Hoppe MM, Sundar R, Tan DS, Jeyasekharan AD. Biomarkers for homologous recombination deficiency in cancer. J Natl Cancer Inst 2018;110:704–13.
61. de Bono J, Mateo J, Fizazi K, Saad F, Shore N, Sandhu S, et al. Olaparib for metastatic castration-resistant prostate cancer. N Engl J Med 2020;382:2091–102.
62. Hodgson DR, Dougherty BA, Lai Z, Fielding A, Grinsted L, Spencer S, et al. Candidate biomarkers of PARP inhibitor sensitivity in ovarian cancer beyond the BRCA genes. Br J Cancer 2018;119:1401–9.
63. Pujade-Lauraine E, Brown J, Barnicle A, Wessen J, Lao-Sirieix P, Criscione SW, et al. Homologous recombination repair gene mutations to predict olaparib plus bevacizumab efficacy in the first-line ovarian cancer PAOLA-1/ENGOT-ov25 trial. JCO Precis Oncol 2023;7e2200258.
64. Batalini F, Gulhan DC, Mao V, Tran A, Polak M, Xiong N, et al. Mutational signature 3 detected from clinical panel sequencing is associated with responses to olaparib in breast and ovarian cancers. Clin Cancer Res 2022;28:4714–23.
65. Farkkila A, Gulhan DC, Casado J, Jacobson CA, Nguyen H, Kochupurakkal B, et al. Immunogenomic profiling determines responses to combined PARP and PD-1 inhibition in ovarian cancer. Nat Commun 2020;11:1459.
66. Callens C, Rodrigues M, Briaux A, Frouin E, Eeckhoutte A, Pujade-Lauraine E, et al. Shallow whole genome sequencing approach to detect homologous recombination deficiency in the PAOLA-1/ENGOT-OV25 phase-III trial. Oncogene 2023;42:3556–63.
67. Ray-Coquard I, Pautier P, Pignata S, Perol D, Gonzalez-Martin A, Berger R, et al. Olaparib plus bevacizumab as first-line maintenance in ovarian cancer. N Engl J Med 2019;381:2416–28.
68. Gonzalez-Martin A, Pothuri B, Vergote I, DePont Christensen R, Graybill W, Mirza MR, et al. Niraparib in patients with newly diagnosed advanced ovarian cancer. N Engl J Med 2019;381:2391–402.
69. Pujade-Lauraine E, Ledermann JA, Selle F, Gebski V, Penson RT, Oza AM, et al. Olaparib tablets as maintenance therapy in patients with platinum-sensitive, relapsed ovarian cancer and a BRCA1/2 mutation (SOLO2/ENGOT-Ov21): a double-blind, randomised, placebo-controlled, phase 3 trial. Lancet Oncol 2017;18:1274–84.
70. Mirza MR, Monk BJ, Herrstedt J, Oza AM, Mahner S, Redondo A, et al. Niraparib maintenance therapy in platinum-sensitive, recurrent ovarian cancer. N Engl J Med 2016;375:2154–64.
71. Coleman RL, Oza AM, Lorusso D, Aghajanian C, Oaknin A, Dean A, et al. Rucaparib maintenance treatment for recurrent ovarian carcinoma after response to platinum therapy (ARIEL3): a randomised, double-blind, placebo-controlled, phase 3 trial. Lancet 2017;390:1949–61.
72. Tutt AN, Garber JE, Kaufman B, Viale G, Fumagalli D, Rastogi P, et al. Adjuvant olaparib for patients with BRCA1- or BRCA2-mutated breast cancer. N Engl J Med 2021;384:2394–405.
73. Robson M, Im SA, Senkus E, Xu B, Domchek SM, Masuda N, et al. Olaparib for metastatic breast cancer in patients with a germline BRCA mutation. N Engl J Med 2017;377:523–33.
74. Litton JK, Rugo HS, Ettl J, Hurvitz SA, Goncalves A, Lee KH, et al. Talazoparib in patients with advanced breast cancer and a germline BRCA mutation. N Engl J Med 2018;379:753–63.
75. Abida W, Patnaik A, Campbell D, Shapiro J, Bryce AH, McDermott R, et al. Rucaparib in men with metastatic castrationresistant prostate cancer harboring a BRCA1 or BRCA2 gene alteration. J Clin Oncol 2020;38:3763–72.
76. Agarwal N, Azad AA, Carles J, Fay AP, Matsubara N, Heinrich D, et al. Talazoparib plus enzalutamide in men with first-line metastatic castration-resistant prostate cancer (TALAPRO-2): a randomised, placebo-controlled, phase 3 trial. Lancet 2023;402:291–303.
77. Clarke NW, Armstrong AJ, Thiery-Vuillemin A, Oya M, Shore N, Loredo E, et al. Abiraterone and olaparib for metastatic castration-resistant prostate cancer. NEJM Evid 2022;1:EVIDoa2200043.
78. Golan T, Hammel P, Reni M, Van Cutsem E, Macarulla T, Hall MJ, et al. Maintenance olaparib for germline BRCA-mutated metastatic pancreatic cancer. N Engl J Med 2019;381:317–27.
79. Sboner A, Mu XJ, Greenbaum D, Auerbach RK, Gerstein MB. The real cost of sequencing: higher than you think! Genome Biol 2011;12:125.
80. Kinnersley B, Sud A, Everall A, Cornish AJ, Chubb D, Culliford R, et al. Analysis of 10,478 cancer genomes identifies candidate driver genes and opportunities for precision oncology. Nat Genet 2024;Jun. 18. [Epub]. https://doi.org/10.1038/s41588-024-01785-9.
81. Sosinsky A, Ambrose J, Cross W, Turnbull C, Henderson S, Jones L, et al. Insights for precision oncology from the integration of genomic and clinical data of 13,880 tumors from the 100,000 Genomes Cancer Programme. Nat Med 2024;30:279–89.
82. Shin HT, Choi YL, Yun JW, Kim NK, Kim SY, Jeon HJ, et al. Prevalence and detection of low-allele-fraction variants in clinical cancer samples. Nat Commun 2017;8:1377.
83. Haile S, Corbett RD, Bilobram S, Bye MH, Kirk H, Pandoh P, et al. Sources of erroneous sequences and artifact chimeric reads in next generation sequencing of genomic DNA from formalin-fixed paraffin-embedded samples. Nucleic Acids Res 2019;47e12.
84. Tobalina L, Armenia J, Irving E, O’Connor MJ, Forment JV. A meta-analysis of reversion mutations in BRCA genes identifies signatures of DNA end-joining repair mechanisms driving therapy resistance. Ann Oncol 2021;32:103–12.
85. Lin KK, Harrell MI, Oza AM, Oaknin A, Ray-Coquard I, Tinker AV, et al. BRCA reversion mutations in circulating tumor DNA predict primary and acquired resistance to the PARP inhibitor rucaparib in high-grade ovarian carcinoma. Cancer Discov 2019;9:210–9.
86. Harvey-Jones E, Raghunandan M, Robbez-Masson L, Magraner-Pardo L, Alaguthurai T, Yablonovitch A, et al. Longitudinal profiling identifies co-occurring BRCA1/2 reversions, TP53BP1, RIF1 and PAXIP1 mutations in PARP inhibitor-resistant advanced breast cancer. Ann Oncol 2024;35:364–80.
87. Dias MP, Moser SC, Ganesan S, Jonkers J. Understanding and overcoming resistance to PARP inhibitors in cancer therapy. Nat Rev Clin Oncol 2021;18:773–91.
88. Li H, Liu ZY, Wu N, Chen YC, Cheng Q, Wang J. PARP inhibitor resistance: the underlying mechanisms and clinical implications. Mol Cancer 2020;19:107.
89. Gupta N, Huang TT, Horibata S, Lee JM. Cell cycle checkpoints and beyond: Exploiting the ATR/CHK1/WEE1 pathway for the treatment of PARP inhibitor-resistant cancer. Pharmacol Res 2022;178:106162.
90. Kim YN, Shim Y, Seo J, Choi Z, Lee YJ, Shin S, et al. Investigation of PARP inhibitor resistance based on serially collected circulating tumor DNA in patients with BRCA-mutated ovarian cancer. Clin Cancer Res 2023;29:2725–34.
91. Jacobson DH, Pan S, Fisher J, Secrier M. Multi-scale characterisation of homologous recombination deficiency in breast cancer. Genome Med 2023;15:90.
92. Muyas F, Sauer CM, Valle-Inclan JE, Li R, Rahbari R, Mitchell TJ, et al. De novo detection of somatic mutations in high-throughput single-cell profiling data sets. Nat Biotechnol 2024;42:758–67.
93. Gao T, Soldatov R, Sarkar H, Kurkiewicz A, Biederstedt E, Loh PR, et al. Haplotype-aware analysis of somatic copy number variations from single-cell transcriptomes. Nat Biotechnol 2023;41:417–26.
94. Baysoy A, Bai Z, Satija R, Fan R. The technological landscape and applications of single-cell multi-omics. Nat Rev Mol Cell Biol 2023;24:695–713.

Article information Continued

Fig. 1.

Schematic summary of signature analysis in the context of homologous recombination deficient (HRD). (A) The process of double-strand break and repair. In HRD cells, homologous recombination—a high-fidelity repair mechanism—is dysfunctional, and thus errorprone mechanisms dominate. (B) Different types of genomic alterations, SBS, insertions/deletions (ID), CNV, and SV can be used to detect HRD. Spectra indicates typical mutational patterns associated with each alteration type, and among these spectra, HRD-associated signatures and their underlying etiology has been suggested. (C) Schematic description of mutational signature approach, which involves non-negative matrix factorization (NMF) on the mutation type-by-sample matrix, to identify the building blocks (i.e., signatures) that best summarize the data. (D) HRD detection based on CNV and SV-based raw traits that do not involve signature analysis. BIR, break-induced repair; LOH, loss of heterozygosity; LST, large-scale state transitions; MMEJ, microhomology-mediated end joining; NHEJ, nonhomologous end joining; rDel, reciprocal deletion; rDup, reciprocal duplication; SSA, single-strand annealing; tAI, telomeric allelic imbalance.

Table 1.

The pros and cons of different sequencing methods that allow clinical HRD detection

Sequencing method
HRD detection method
Pros Cons Building blocks Component Example algorithms/commercial tests
Whole-genome sequencing (~30× germline) (~60-90× tumor; 1-5× low coverage) Uniform and unbiased genome coverage Low VAF variants will missed due to low coverage SBS, ID, SV Spectra components (BRCA1-like vs. BRCA2-like vs. HRP) CHORD [58]
Comprehensive identification of variants: SNVs, CNVs, SVs Signature exposure
Purity and ploidy can be inferred Difficulty with FFPE or low tumor purity samples CNV Genomic instability score (GIS), LOH+TAI+LST HRDetect [50]
Accurate signature analysis Large-scale CNVs from low (~1×) coverage sequencing ShallowHRD [59]
Panel sequencing (~50-500 genes) (~500-2,000×) High depth allows for effective detection of SNVs in regions of interest Variants only in prespecified regions SBS Signature 3 exposure (“Sig3” - HRD signature) SigMA [57]
CNVs are difficult to assess CNV (> 1 Mb) LOH (%) Foundation Medicine (FoundationOne CDx)
Data size easy to manage Signature analysis challenging due to low mutation counts
SNP array (~3-10 kb between probes) Robust CNV calling due to dense SNP loci coverage No detection of somatic mutations (other than CNVs) CNV Genomic instability score (GIS) Myriad (MyChoice CDx)

CNV, copy number variation; FFPE, formalin-fixed paraffin-embedded; HRD, homologous repair deficiency; HRP, homologous repair proficient; ID, indel; LOH, loss of heterozygosity; LST, large scale translocation; SBS, single base substitution; SNP, single nucleotide polymorphism; SNV, single nucleotide variant; SV, structural variation; TAI, telomeric allelic imbalance; VAF, variant allele fraction.

Table 2.

Summary of FDA approvals in PARP inhibitors in respective cancer types, and corresponding clinical trials and HRD testing platforms used

Trials Setting Drug HRD test Cohort PFS overall PFS in HRDa) PFS in HRP
Ovarian cancer:
 - Newly diagnosed setting for advanced stage, high grade: olaparib (gBRCA mutated), olaparib+bevacizumab (g/sBRCA mutated or HRD), and niraparib (any BRCA or HRD status)
 - Second-line or greater as maintenance: olaparib (any BRCA or HRD status), niraparib (gBRCA mutated), and rucaparib (g/sBRCA mutated)
 - Heavily pre-treated setting: retraction of approval
SOLO-1 [18] Newly diagnosed advanced Olaparib vs. placebo Not done gBRCAmut (100%) Not reached vs. 13.8 (HR 0.3, 95% CI 0.23-0.41) NA NA
PAOLA-1 [67] Newly diagnosed advanced Olaparib+bevacizumab vs. placebo Myriad myChoice HRD HRD (48.0%) 22.1 vs. 16.6 (HR 0.59, 95% CI 0.49-0.72) 37.2 vs. 17.7 (HR 0.33, 95% CI 0.25-0.45) 16.6 vs. 16.2 (HR 1.00, 95% CI 0.75-1.35)
HRP (34.4%)
Unknown (17.6%)
PRIMA [68] Newly diagnosed advanced Niraparib vs. placebo Myriad myChoice HRD HRD (50.9%) 13.8 vs. 8.2 (HR 0.62, 95% CI 0.5-0.76) 21.9 vs. 10.4 (HR 0.43, 95% CI 0.31-0.59) 8.1 vs. 5.4 (HR 0.68, 95% CI 0.49-0.94)
HRP (49.1%)
Study 19 [20] Platinum-sensitive recurrent Olaparib vs. placebo Not done gBRCAmut (22.8%) 8.4 vs. 4.8 (HR 0.35, 95% CI 0.25-0.49) NA NA
gBRCAwt (13.2%)
Unknown (64.0%)
SOLO-2 [69] Platinum-sensitive recurrent Olaparib vs. placebo Not done gBRCAmut (96.9%) 19.1 vs. 5.5 (HR 0.3, 95% CI 0.22-0.41) NA NA
Unknown (3.1%)
NOVA [70] Platinum-sensitive recurrent Niraparib vs. placebo HRD Myriad myChoice gBRCAmut (36.7%) Not reported 12.9 vs. 3.8 (HR 0.38, 95% CI 0.24-0.59)b) 6.9 vs. 3.8 (HR 0.58, 95% CI 0.36-0.92)
gBRCAwt (63.3%)
ARIEL3 [71] Platinum-sensitive recurrent Rucaparib vs. placebo Foundation T5 NGS (s) g/sBRCAmut (35%) 10.8 vs. 5.4 (HR 0.36, 95% CI 0.30-0.45) 13.6 vs. 5.4 (HR 0.44, 95% CI 0.29-0.66)c) 6.7 vs. 5.4 (HR 0.58, 95% CI 0.4-0.85)d)
Foundation % LOH BRCAwt (65%)
LOH-H (28%)
LOH-L (29%)
LOH-indeterminate (9%)
Breast cancer:
 - Early stage, HER2-negative breast cancer with high risk of recurrence: olaparib (gBRCA mutated)
 - Metastatic breast cancer (HER2-negative) with prior treatment using chemotherapy in neoadjuvant, adjuvant, or metastatic setting: olaparib (gBRCA mutated)
 - Locally advanced or metastatic breast cancer (HER2-negative): talazoparib (gBRCA mutated)
OlympiA [72] HER2 negative, early breast cancer Olaparib (1 year) vs. placebo NA gBRCAmut (100%) 3-Year invasive disease-free survival 85.9% vs. 77.1% (HR 0.58, 95% CI 0.41-0.82) NA NA
OlympiADe) [73] HER2, negative recurrent metastatic Olaparib vs. physician’s choice single-agent chemotherapy NA gBRCAmut (100%) 7.0 vs. 4.2 (HR 0.58, 95% CI 0.43-0.80) NA NA
EMBRACAe) [74] Locally advanced or Talazoparib vs. physician’s choice single-agent chemotherapy NA gBRCAmut (100%) 8.6 vs. 5.6 (HR 0.54, 95% CI 0.43-0.80) NA NA
Prostate cancer:
 - Metastatic castration-resistant prostate cancer (mCRPC)
 - With progression on prior treatment with enzalutamide or abiraterone: olaparib (g/s HRR gene mutated)
 - Who have been treated with androgen receptor-directed therapy and a taxane-based chemotherapy: rucaparib (g/s BRCA mutated)
 - First-line setting for mCRPC: olaparib+abiraterone+prednisone (HRR mutated) and talazoparib+enzaluta mide (BRCA mutated)
PROfounde) [61] mCRPC, failed prior treatment with new hormonal agent Olaparib vs. physician’s choice single-agent chemotherapy FoundationOne CDxf) Cohort A: BRCA1, BRCA2, ATM (34.8%) Cohort A: 7.4 vs. 3.6 (HR 0.34, 95% CI 0.25-0.47) NA NA
Cohort B: 12 other prespecified genes (63.3%)g) Cohort A+B: 5.8 vs. 3.5 (HR 0.49, 95% CI 0.38-0.63)
TRITON-2e) [75] mCRPC, failed prior treatment with hormonal agent and 1 prior taxane-based chemotherapy Rucaparib (phase II single arm) NA g/s mutation in BRCA1, BRCA2, ATM, or 12 other prespecified genesh) ORR 43.5% (31.0-56.7%), DOR not reached (6.4 months to not reached) NA NA
TALAPRO-2 [76] First-line mCRPC on androgen deprivation therapy Talazoparib+enzalutamide vs. placebo+enzalutamide FoundationOne CDx HRRmut (20.7%)i) Not reached vs. 21.9 (HR 0.63, 95% CI 0.51-0.78) 27.9 vs. 16.4 (HR 0.46, 95% CI 0.30-0.70) Not reached vs. 22.5 (HR 0.70, 95% CI 0.54-0.89)j)
HRRwt (52.9%)
Unknown (26.4%)
PROpel [77] First-line mCRPC with no previous systemic treatment Olaparib+abiraterone vs. placebo+abiraterone FoundationOne CDx HRRmut (27.8%)k) 24.8 vs. 16.6 (HR 0.66, 95% CI 0.54-0.81) Not reached vs. 13.9 (HR 0.5, 95% CI 0.34-0.73) 24.1 vs. 19.0 (HR 0.76, 95% CI 0.60-0.97)
HRRwt (69.9%)
Unknown (2.3%)
Pancreatic cancer
 - Metastatic pancreatic adenocarcinoma who have not progressed on at least 16 weeks of 1st line platinum chemotherapy: olaparib (gBRCA mutated)
POLO [78] Metastatic, not progressed during first-line platinum-based chemotherapy Olaparib vs. placebo NA gBRCAmut (100%) 7.4 vs. 3.8 (HR 0.53, 95% CI 0.35-0.82) NA NA

CI, confidence interval; DOR, duration of response; FDA, Food and Drug Administration; g, germline; g/sBRCA, germline or somatic BRCA; s, somatic; HR, hazard ratio; HRD, homologous recombination deficiency; HRP, homologous recombination proficient; HRR, homologous recombination repair gene; LOH, loss of heterozygosity; mut, mutated; NA, not available; NGS, next generation sequencing; PARP, poly(ADP-ribose) polymerase; PFS, progression-free survival; wt, wildtype.

a)

HRD indicates HRD (without further genomic stratification) or HRR gene mutation based on the respective HRD testing platform that was utilized,

b)

PFS in HRD+gBRCA wildtype,

c)

HRD defined as g/sBRCA mut or LOH-high (LOH-H),

d)

PFS in g/sBRCA wildtype and LOH-low (LOH-L),

e)

Note that PARP inhibitor was given as “treatment” and compared with physicians’ choice single-agent chemotherapy or hormonal therapy. This is in contrast with cases in ovarian cancer where PARP inhibitor was given as “maintenance” (i.e., immediately preceded by chemotherapy),

f)

Germline vs. somatic distinction and analysis are not reported,

g)

12 genes: BARD1, BRIP1, CDK12, CHEK1, CHEK2, FANCL, PALB2, PPP2R2A, RAD51B, RAD51C, RAD51D, and RAD54L,

h)

12 genes: BARD1, BRIP1, CDK12, CHEK2, FANCA, NBN, PALB2, RAD51, RAD51B, RAD51C, RAD51D, and RAD54L,

i)

BRCA1, BRCA2, PALB2, ATM, ATR, CHEK2, FANCA, RAD51C, NBN, MLH1, MRE11A, CDK12,

j)

HRR mutation status proficient or unknown,

k)

BRCA1, BRCA2, PALB2, ATM, BARD1, BRIP1, CDK12, CHEK1, CHEK2, PANCL, RAD51B, RAD51C, RAD51D, RAD54L.