Document 6422425
Transcription
Document 6422425
9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm P1: IKH LaTeX2e(2002/01/18) 10.1146/annurev.genet.37.110801.143717 Annu. Rev. Genet. 2004. 38:819–45 doi: 10.1146/annurev.genet.37.110801.143717 c 2004 by Annual Reviews. All rights reserved Copyright First published online as a Review in Advance on August 30, 2004 INTEGRATION OF ADENO-ASSOCIATED VIRUS (AAV) AND RECOMBINANT AAV VECTORS Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. Douglas M. McCarty,1,2 Samuel M. Young Jr.,3 and R. Jude Samulski4,2 1 School of Pharmacy, University of North Carolina, Chapel Hill, North Carolina 27599; Gene Therapy Center, University of North Carolina, Chapel Hill, North Carolina 27599; 3 Salk Institute, San Diego, California; 4Department of Pharmacology, University of North Carolina, Chapel Hill, North Carolina 27599; email: [email protected], [email protected], [email protected] 2 Key Words AAV, integration, vector, site-specific, gene therapy ■ Abstract The driving interest in adeno-associated virus (AAV) has been its potential as a gene delivery vector. The early observation that AAV can establish a latent infection by integrating into the host chromosome has been central to this interest. However, chromosomal integration is a two-edged sword, imparting on one hand the ability to maintain the therapeutic gene in progeny cells, and on the other hand, the risk of mutations that are deleterious to the host. A clearer understanding of the mechanism and efficiency of AAV integration, in terms of contributing viral and host-cell factors and circumstances, will provide a context in which to evaluate these potential benefits and risks. Research to date suggests that AAV integration in any context is inefficient, and that the persistence of AAV gene delivery vectors in tissues is largely attributable to episomal genomes. CONTENTS INTRODUCTION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . SITE-SPECIFIC AAV INTEGRATION . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . AAV Genome Structure . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . AAV As a Latent Integrating Virus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Site-Specific Integration in Human Chromosome 19 . . . . . . . . . . . . . . . . . . . . . . . . Mechanism of Rep-Mediated Integration in Chromosome 19 . . . . . . . . . . . . . . . . . Unique Nature of the AAVS1 Integration Site . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Elements Within the AAV Genome that Contribute to Specific Integration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Structure of AAV Proviral Junctions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Models for AAV Targeted Integration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Efficiency of AAV Integration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Hybrid Gene Delivery Systems Taking Advantage of Rep-Mediated Integration . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 0066-4197/04/1215-0819$14.00 820 821 821 822 822 823 824 825 826 827 828 829 819 9 Oct 2004 0:15 820 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. Chromosome 19 Disruptions and Consequences for the Cell . . . . . . . . . . . . . . . . . . Evolutionary Considerations of AAV Integration . . . . . . . . . . . . . . . . . . . . . . . . . . . INTEGRATION OF AAV VECTORS . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Integration Efficiency of rAAV Vectors In Vitro . . . . . . . . . . . . . . . . . . . . . . . . . . . . Episomal Conformation of rAAV Genomes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Efficiency of rAAV Integration In Vivo . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Contribution of TRs to Persistence and Integration . . . . . . . . . . . . . . . . . . . . . . . . . Structure of rAAV Integration Junctions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Role of Double-Strand DNA Breaks (DSB) in rAAV Integration . . . . . . . . . . . . . . Integration into Active Genes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Safety of rAAV Vectors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 830 831 831 832 833 834 835 836 837 838 838 INTRODUCTION The advent of viral gene therapy has brought adeno-associated virus (AAV) into the limelight of virus research. The previous obscurity of this parvovirus had been rooted in two important features of its biology. First, AAV does not appear to cause any disease in humans, and none of its close zootropic relatives cause any known diseases in animals. Second, it tends to remain quiescent in the absence of a helper virus, most commonly, adenovirus (Ad). These are the same two features that initially attracted the attention of gene therapy researchers, and have driven the idea that AAV can be used for safe and stable gene delivery. However, the stealthy nature of AAV has also hindered the understanding of its biology in terms of epidemiology, persistence, and potential occult effects on the host cell, all essential elements in a safe and effective gene delivery vector. Although the ability of AAV to establish a latent infection in the absence of a helper virus was recognized early on, the nature of the provirus was elusive, and has only relatively recently been characterized. The recognition that one of the viral proteins, the replication protein (Rep), was a key component in establishing the latent integrated state meant that subsequent research in AAV persistence would diverge in two different directions. One line of inquiry would follow the mechanism by which the AAV Rep protein mediates integration of the viral genome into a specific region of the human host chromosome. The second would elucidate the mechanism of persistence in the absence of viral proteins, which better reflects the situation envisioned for the AAV-derived gene therapy vectors. The role of integration in the biology of AAV, whether Rep-mediated targeted integration or Rep-independent recombinant AAV (rAAV) vector integration, is highly relevant to the future use of this virus as a gene delivery tool. Whereas most rAAV vectors will not include Rep and will not integrate specifically, the potential for mutation and oncogenesis due to random integration may still exist. The evaluation of the frequency of rAAV vector integration and its propensity for targeting transcriptionally active regions of the genome is therefore an area of research being pursed with some sense of urgency (60, 63, 85). On the other hand, many novel gene delivery systems are being devised to take advantage of the 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH AAV INTEGRATION 821 targeted integration properties of the AAV Rep protein. These include hybrid viral vectors as well as nonviral systems for delivering Rep protein and naked plasmids. The ultimate success of these systems may largely depend on the intrinsic efficiency of the AAV integration mechanism, how it is affected by cell type or replication status, and the long- and short-term consequences for the targeted cell. All of these issues are certain to have an impact on the safety and efficacy of rAAV-mediated gene therapy. Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. SITE-SPECIFIC AAV INTEGRATION AAV Genome Structure Like all parvoviruses, AAV packages a single-stranded DNA genome, of approximately 4700 nucleotides, with palindromic regions at each end. In the case of AAV, the palindromes are arranged such that each terminus can fold into a Tshaped DNA secondary structure through base-pairing between palindromic subregions of the terminal repeats (TR). Though the sequence content at each end of the genome is essentially identical, the palindromic region of each TR can be in either of two orientations. The four subregions of the AAV TR are denoted A, B, C, and D, wherein B and C are asymmetric small internal palindromes forming the arms of the T structure. The symmetric A palindromes flank B and C, and form the stem of the T structure when folded and base-paired. The D sequence is present only in one copy at each end of the genome, and therefore remains single-stranded when the TR is in its hairpin configuration. By folding into a hairpin, the TR at the 3 end of the genome serves as a primer for host-cell– mediated DNA synthesis. The TRs are therefore essential for conversion of the single-stranded virion DNA to a double-strand DNA template for transcription and replication. The AAV TRs also serve as origins for subsequent DNA replication through interaction with the large AAV Rep proteins (Rep78 and Rep68). The minimal Rep binding element (RBE) within the A palindrome is a tetranucleotide repeat region (GAGC)3,but additional specific sequences within the arm of the T-structure also contribute to the stability of the Rep-TR complex (50, 81, 98). After binding to the RBE, Rep mediates an ATP-dependent isomerization of the A-region exposing the terminal resolution site (trs) at the junction between the A- and Dregions. A sequence-specific, strand-specific endonuclease activity intrinsic to the Rep protein nicks the trs on one strand, becoming covalently attached to the newly generated 5 end (88). The 3 end then serves as primer for replication of a new TR end. The meticulous characterization of the steps involved in Rep-mediated resolution of the AAV TR provided the essential background for understanding the ability of AAV to integrate site specifically, which is unique among mammalian viruses. As discussed below, it is the recapitulation of these steps on a sequence located on human chromosome 19 (Ch19) that leads to AAV site-specific integration. 9 Oct 2004 0:15 822 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. AAV As a Latent Integrating Virus The ability of AAV to establish a latent proviral state in the absence of helper virus was recognized soon after its discovery as a contaminant in Ad stocks (26, 27). The latent AAV could be rescued from infected cells after 100 passages by superinfecting with Ad helper virus (28). Although this demonstrated that the virus could persist in a latent state, it did not establish its ability to integrate into chromosomal DNA. This ability was made clear, however, when Cheung et al. analyzed a single clone of latently infected Detroit 6 cells at early and late passage (10). The AAV genome was found in a tandem array, joined to cellular sequences at the viral terminal repeats. Thus, the latent state involved stable integration of the intact viral genome into host cellular DNA, rather than an autonomously replicating episome. The head-to-tail organization of the AAV tandem array provided the first suggestion that the genome underwent limited replication before, during, or after the process of integration, perhaps involving a circular intermediate. These headto-tail tandem arrays have been a common feature of AAV proviral structures in many different contexts. In the initial experiments to create and characterize rAAV transduction vectors, the rep gene was retained in the recombinant vector while the neoR gene was substituted for the capsid coding sequences (24, 94). Like the wtAAV, these vectors integrated into the host chromosome and transduced cells to geneticin resistance. The ability to co-opt this important property of AAV, the establishment of latency by integration in the host chromosome, represented the first step in creating a usable gene delivery vector. Subsequent comparisons were made between vector transduction in the presence and absence of the rep gene, either in cis or in trans. The results showed that the rep gene inhibited integration in some experiments and slightly enhanced integration in others (52–54, 82). Generally, integration was enhanced in HeLa cells and inhibited in HEK 293 and other cell lines. In retrospect, some of these effects could be attributed to the toxic effects of Rep expression, which is relatively low in HeLa cells but high in HEK 293 cells, due to the effect of the Ad E1a gene product on the AAV p5 promoter (8, 9). Site-Specific Integration in Human Chromosome 19 The initial characterizations of AAV proviral structures revealed a great deal of heterogeneity. This property was also observed in cells carrying integrated copies of early recombinant AAV (rAAV) gene delivery vectors (52). Restriction fragments carrying AAV sequences had molecular weights (Mr) that did not readily correlate with the observed number of AAV copies. In the absence of cloned proviral-cellular junctions, this led to the interpretation that AAV integrated randomly within the genome. This view changed, however, as AAV proviral-cellular junction sequences became available. The isolation of AAV proviral sequences, either by direct cloning or by enrichment for rAAV sequences containing a specific protein binding site, provided the key to revealing the specific integration properties of AAV and AAV vectors 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. AAV INTEGRATION 823 containing the rep gene (40, 42, 83). When the associated cellular sequences from these provirus isolates were used to probe DNA from independently isolated, latently infected cell lines, it became clear that most of the provirus was integrated within the same sequence context at a specific region of human chromosome 19 (Ch19) (19q13.3-qter). Further, examination of cells with integrated rAAV revealed that vectors containing the AAV Rep gene were mostly integrated within this specific region of Ch19, while those lacking Rep were integrated elsewhere, apparently at random. The Southern blot results were corroborated in studies using PCR and fluorescence in situ hybridization (37, 71). Metaphase spreads of IB3 cells revealed that AAV sequences associated with Ch19 accounted for up to 94% of the detected wtAAV provirus, but none of the rAAV sequences lacking the rep gene. Mechanism of Rep-Mediated Integration in Chromosome 19 Sequencing of the Ch19 preintegration site, termed AAVS1, and comparison with the sequence of the cloned AAV provirus, revealed a great deal about the integration process (41). No large regions of homology were found between the AAVS1 and the virus, suggesting that integration occurred through a nonhomologous recombination pathway. Small (4–5-bp) homologies at the junctions between host cell and viral DNA were consistent with illegitimate recombination products. Partial deletion of sequences within the AAV TRs, as well as large-scale rearrangements of the host sequences around the integration site, suggested that the process was both complex and imprecise. Concurrent with advances in the understanding of the biochemical activities and DNA-binding properties of AAV Rep, Weitzman et al. found a sequence within the AAVS1 region that could also bind specifically to the Rep protein (98). This comprised the same tetranucleotide repeat (4 copies) that mediated Rep binding in the AAV terminal repeat. Further, Rep protein was able to simultaneously bind the AAVS1 RBE and the AAV TR sequence. This immediately suggested a mechanism for AAV site-specific integration wherein the Rep protein tethers the AAV genome to the AAVS1 chromosomal sequence. The biochemical interactions between AAV Rep and the TR that led to resolution and replication of the genome ends had previously been characterized. These included secondary structure and sequence-specific DNA binding, ATP-dependent DNA helicase, and site-specific, strand-specific endonuclease (32, 33, 90). Each of the steps leading to terminal resolution had been recapitulated in vitro using purified Rep protein. Indeed, the entire AAV rolling hairpin replication process could be reconstituted by mixing Rep protein with HeLa cell nuclear extracts (64). The recognition that the AAVS1 fragment contained an active RBE suggested the possibility of a human chromosomal version of the AAV replication origin, which could participate in the specific integration process. When a cloned subfragment of the AAVS1 sequence, containing the RBE, was incubated in nuclear 9 Oct 2004 0:15 Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. 824 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI extracts containing Rep protein, specific DNA replication products were readily detected (95). Not surprisingly, given that Rep nicks specifically on one strand of the AAV TR, the replication observed from the plasmid carrying the AAVS1 was specific for only one strand. This observation of asymmetric replication was consistent with the known activities of the Rep protein, in that AAV replication is mediated entirely by leading-strand DNA synthesis. The initiation site for asymmetric replication from the AAVS1 origin mapped to a sequence that conserved 5 of 6 bases relative to the trs associated with the AAV TR, and included a central dithymidine (TT), which is the sequence cleaved in the AAV TR site. Although this putative chromosomal nicking site was spaced 5 bases closer to the RBE than the TR site, and the spacing can affect the efficiency of trs endonuclease activity in the context of the AAV TR (7, 89), the AAVS1 nicking site homologue was cleaved predominantly at the analogous site, between the two thymidine residues. Further, the products of the cleavage reaction were a free DNA 3 OH end and a covalent 5 DNA-Rep complex, as is produced from the endoclease reaction with the AAV terminal repeat. Taken together, the characterization of the AAVS1 site on human chromosome 19 revealed a functional homologue to the replication origin in the AAV terminal repeat. The presence of this AAV Rep-dependent replication origin near the integration site implied that integration was associated with limited DNA replication of cellular sequences. This was consistent with the chromosomal rearrangements, including duplications and inversions, which were present at the host-virus junctions from the latently infected cells (41). Deeper understanding of the AAV site-specific integration process was achieved through the use of episomal cognates of the chromosomal integration site. Giraud et al. cloned the AAVS1 fragment into Epstein-Barr virus-based vectors, which could be rescued from mammalian cells and grown in bacteria (19, 20). This system allowed both quantification of AAV integration events and characterization of the products. At 48 h postinfection, approximately 1.5%–3.0% of the rescued episomes contained AAV sequences. Consistent with the hypothesis that the RBE and nicking site were involved in the process, a subcloned 510-bp fragment containing these elements was both necessary and sufficient to direct targeted integration of the AAV genome into the episome. The system was later refined to show that the two cis-acting elements, the RBE and the trs homolgue, contained within a 33-bp sequence, were the mediators of integration through interaction with the AAV Rep protein (47, 48). Unique Nature of the AAVS1 Integration Site Although the RBE in AAVS1 and the AAV TR is a (GAGC)3 repeat sequence, Rep protein is somewhat permissive in its binding specificity, and substitutions within the core sequence are tolerated (11, 81). Substantial deviations from the consensus repeat are still specifically bound and have biological function. The AAV p5 promoter binds Rep to mediate transactivation and repression of viral gene expression, but contains only one complete (GAGC) and one degenerate (GAGT) copy of the 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. AAV INTEGRATION 825 repeat (43, 49, 67). The sequence of the human inmunodeficiency virus (HIV) long terminal repeat (LTR) contains a degenerate sequence similar to the AAV TR Rep binding site, but with only one intact GAGC motif, yet the transcriptional activity of the HIV LTR is strongly repressed by the AAV Rep protein (5, 39, 65, 76). Additionally, a spurious sequence within the plasmid pBR322, as well as most derivatives of this bacterial vector, also binds specifically to the Rep protein (49). The relatively nonstringent requirements for Rep binding and the simplicity of its consensus binding site suggest that there are a great many potential interactions within the human genome. At least 18 Rep binding sequences have been located within or flanking human genes (99). Based on a consensus GAGYGAGC sequence, there may be up to 2 × 105 Rep binding sites in the human genome (107). However, no secondary preferred integration sites have been observed. Clearly, the close proximity of the essential nicking site plays a major role in this specificity. Other factors that may contribute are elements of chromosomal context including a nearby CpG island, insulator sequences, and numerous transcription factor binding sites (41, 66). The AAVS1 also contains a DNase hypersensitive region, suggesting that it is situated within an open chromatin structure (46). The AAVS1 sequence has been found only in humans and higher primates. Integration of AAV and rAAV vectors carrying the rep gene is apparently random in normal rodent cells. However, in transgenic mice and rats carrying the human AAVS1, on an 8.2-kb, 3.5-kb, or 2.7-kb fragment, AAV integration is specific for the exogenous sequence (3, 77, 107). This suggests that all of the essential cis elements for specific integration are located within a 1.6-kb region common to all of the exogenous fragments used to make these transgenic animals. The DNase hypersensitve region of the exogenous AAVS1 is also maintained in the open chromatin conformation in the animal models, suggesting that cis-acting signals associated with the transgenic fragment direct specific chromatin modeling. This further suggests that open chromatin is likely to be a prerequisite for Rep access to the RBE and nicking site. Elements Within the AAV Genome that Contribute to Specific Integration As stated above, the Rep protein bound to the AAVS1 RBE can form a complex with the AAV RBE located within the terminal repeat. This suggests a mechanism for bringing the AAV genome into close proximity to the chromosomal integration site. While the presence of the nicking site in chromosomal or episomal AAVS1 is essential for specific integration, this does not appear to be the case for the trs of the viral sequences. Young et al. were able to achieve specific integration using an AAV construct with mutant TRs, which contained an insertion between the RBE and trs (108). These TRs had previously been shown to be defective for Rep-mediated endonuclease activity (89). Other investigators have observed Rep-dependent specific integration of plasmids lacking the AAV TR, though at an efficiency at least tenfold lower than plasmids containing at least one copy 9 Oct 2004 0:15 Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. 826 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI of the terminal repeat (93). This may have been mediated through the spurious Rep binding site within the plasmid bacterial sequences, as mentioned above. Alternatively, the DNA damage induced by Rep-mediated nicking of the AAVS1, and its subsequent repair, may create a hotspot for illegitimate recombination. In addition to the RBE contained within the AAV TR, sequences within the AAV p5 promoter region also appear to have a role in mediating or enhancing Rep-mediated specific integration. Junction breakpoints between viral and cellular sequences frequently cluster around the p5 promoter in both the EBV episomal model system and in proviral structures isolated from latent cell lines (20, 105). Additionally, the presence of p5 on a rAAV construct serves to enhance the rate of site-specific integration by 10–100-fold in the presence of complementing Rep protein (68, 69). The sequence responsible for this enhancement, termed the p5 integration enhancer element (p5IEE), is contained within a 138-bp fragment that also encompasses the functional p5 promoter region, including the RBE (9, 49). Further, in plasmid constructs, this sequence alone can mediate specific integration more efficiently than constructs containing both the p5IEE and the terminal repeats. Philpott et al. suggest that the TRs can introduce boundaries to the exogenous integrated sequence, which may lead to integration of only half of a circular plasmid containing two TRs rather than the whole plasmid (69). Thus, the p5-associated RBE, which is essential for the regulation of p5 promoter activity as well as the activities of the two downstream AAV promoters, is likely to be the primary mediator of targeting in these constructs, through its interaction with the Rep protein.Other transcription factors bound within the promoter region may also contribute. Structure of AAV Proviral Junctions The organization of the AAV proviral structures, either rescued from EBV shuttle vectors or cloned from cell lines, is complex, with large-scale deletions, rearrangements, and duplications. The EBV-rescued sequences present the greatest sampling of proviral junctions, which generally incorporate the following features. First, most of the crossover points between AAV and cellular sequences were within 100 bp from the AAVS1 nicking site. This is somewhat different from junctions cloned from latent cell lines, which are more typically farther away, but within 1000 bp of the RBE and nicking site (41, 83, 105). In either case, the junction between viral and cellular DNA was always downstream from the nicking site, consistent with nicking and initiation of DNA synthesis on the chromosome prior to incorporation of AAV sequences. This limited DNA synthesis has the effect of moving the crossover point to a location downstream from the initial nicking event. Second, all of the integrated AAV sequences had one end associated with the AAVS1 sequence. The other end, however, was linked to unrelated sequences within the shuttle vector. In several of the EBV clones, however, it was later found that an insertion of a human Line-1 element into the EBV episome had occurred 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. AAV INTEGRATION 827 independently of AAV integration (15). This EBV-Line-1 recombinant had then propagated and become the target for several AAV integration events, and some of these had formed junctions between AAV and the Line-1 sequence. Other junctions were associated with EBV vector sequence. Taken together, this demonstrates that AAV integration is not a simple insertion process, and that large segments of target sequence can be deleted. This was also noted in a careful analysis of a proviral structure from a latent cell line, along with duplications and inversions of the AAVS1 sequence (41). Third, the crossover points within the AAV sequences were usually near a Rep binding site, either within the TR or the p5 promoter. In contrast, crossover points at the AAV trs were not seen, suggesting that nicking of the AAV molecule by Rep was not an important feature for integration, consistent with the observation discussed above, that AAV genomes with defective nicking sites, or lacking TRs, are still integrated into the AAVS1 site. Lastly, the AAV sequences frequently included a head-to-tail viral junction, suggesting that multiple copies in a tandem array had integrated. This was also common to many of the proviral junctions previously characterized in cell lines. This suggested that a circular form of the viral genome was a precursor to the integrated form and that some degree of DNA replication of the viral sequence had occurred before or during integration. Models for AAV Targeted Integration The interpretation of early events of AAV integration, including binding of both the AAVS1 sequence and the integrating genome by Rep protein, and the consequent formation of protein-DNA complexes tethering both structures together, are well supported by observations both in vivo and in vitro. This complex formation is likely to be followed by nicking of the AAVS1 site, but not necessarily the AAV sequence. Again, the necessity of this nicking event in the target sequence has been confirmed in vitro, and in the EBV episomal system using mutant target sites. Exactly what sequence of events takes place after the nicking step is not entirely clear. However, the general complexity of the junction sequences, and the rearrangements in both target and viral DNA, suggest that the mechanism is not precise and has little in common with cut-and-paste type integration systems such as retrovirus or lambda phage. Most of the observations regarding AAV proviral junctions have been incorporated into a general model of AAV integration that is dependent on initiation of DNA replication, followed by several rounds of template switching by the host polymerase complex. Some features of the AAV proviral junctions that point to a DNA replication-dependent mechanism are the locations of the breakpoints at a distance downstream from the AAVS1 nicking site, and the duplications of AAVS1 sequences. Whereas these imply replication of host chromosomal DNA, the frequent presence of head-to-tail junctions of tandem AAV sequences, joined via TR structures, suggest that the AAV genome has also been replicated. Linden et al. 9 Oct 2004 0:15 Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. 828 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI have proposed that integration begins with the formation of a Rep-DNA complex involving the AAVS1 site and the AAV genome, either through binding at the TR or through the RBE associated with the p5 promoter (47, 48). The presence of head-to-tail AAV junctions within the proviral sequence suggests that the genome is circularized prior to this complex formation. The Rep protein then creates a nick at the chromosomal site, with concomitant covalent linkage to the 5 end of the nicked chromosomal DNA strand. Ensuing replication from the 3 end of the nick displaces the original chromosomal DNA strand, along with the attached Rep protein complex and circular viral DNA. At some point after limited replication (up to approximately 1000 bp), the host DNA polymerase complex transfers from the chromosomal template strand to the displaced strand, now returning back toward the Rep complex. Again, the host DNA polymerase complex switches templates, jumping from the host DNA to the associated viral genome. After one or more times around the viral genome, creating the head-to-tail concatemers, the polymerase complex switches templates once more to return to the host chromosomal sequence. Clearly, given the multiple template switching required to create the observed structures, such a mechanism would also entail a great deal of gap filling and repair activity subsequent to the synthesis of the leading strand. Whether this activity is the cause of the multiple duplications, rearrangements, and insertions frequently observed following AAV integration into the AAVS1 site, or whether they are created through the meanderings of a DNA polymerase complex as it is tripped-up by the AAV Rep protein, remains to be determined. Efficiency of AAV Integration There are several different perspectives from which to evaluate the efficiency of AAV integration into Ch19. The initial estimates were based on the number of rescue-competent, latent infections that could be derived from a pool of infected cells (6). This is similar to measurements of the percentage of transduced cells using rep+ rAAV vectors, and is always dependent on the multiplicity of infection of the virus or vector (52, 53, 82). In these studies, integration efficiencies of 20%–80% of infected cells were typically achieved. In contrast, the EBV episomal target integration studies discussed above measured the percent of rescued plasmids containing AAV sequences, and these were in the range of 0.01%–0.05%. Again, the multiplicity of infection was a critical factor in these studies, as was the copy number of EBV episomes. To account for these variables, and to best describe the behavior of the AAV virus, the measurement of integration efficiency in terms of integrations per infectious unit of AAV probably has the greatest relevance. The evaluation of many of the earlier studies on AAV integration in the presence of Rep protein in terms of integrations per infectious unit generally yielded frequencies in the range of 0.1%–0.5% (reviewed in 51). More recent studies have supported this conclusion. Huser et al. developed a rapid and efficient assay for Ch19specific integration based on polymerase chain reaction (PCR) amplification using 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. AAV INTEGRATION 829 primers specific for the AAV and chromosomal sequences (30). The frequency of integration was consistent with earlier results at approximately 0.1% of infecting viral genomes. The time course of integration was also evaluated in this study and found to plateau at approximately 24–48 h postinfection. This was also consistent with the EBV episomal studies, where most integration occurred between 24–48 h postinfection, and few integration events were observed during the first 10 h (19). The PCR-based assay also provided a large sampling of integration events that led to the conclusion that there was little or no bias between the left and right ends of the AAV genome in terms of its orientation in the AAVS1 site (29). It is not clear how this observation relates to the effect of the p5 promoter-associated RBE and its effect on integration efficiency, as noted above. The RBEs are possibly favored sites for template switching, whether a polymerase complex is entering the AAV DNA sequence, or leaving it to reassociate with the chromosomal DNA sequence. The integration efficiency of wtAAV vectors in vivo has not been well characterized. Because rodents do not contain the Ch19 integration sequence, these studies must be performed in primates or in the transgenic rodent models. To date, only one such study has been reported, using rhesus macaques (25). Chromosome 19 integration-specific PCR signals were detected from the nasal tissue of one of two nasally infected animals, and weakly from the liver tissue of both of two intravenously infected animals. This suggested that the integration frequency for wt AAV in vivo was low. Hybrid Gene Delivery Systems Taking Advantage of Rep-Mediated Integration Although most AAV-based vectors are not envisioned to include the Rep gene, and thus are likely to integrate randomly and at a low frequency, the ability to take control of the integration mechanism remains attractive. Such vectors resemble the first of the constructs used in AAV gene transfer, which included the rep gene and the neomycin phosphotransferase gene (52). More recently, many researchers have created hybrid systems that include the Rep gene and the AAV TRs but are delivered using either nonviral or chimeric virus constructs. These possibilities were first explored by cotransfecting plasmids containing AAV rep, and the transgene plus TRs on separate molecules (4, 31, 38, 70, 93, 101). While successful in directing targeted integration, this method retains an active episomal rep gene in the transfected cells, which could lead to chromosomal instability or mobilization of the transgene. These problems have been addressed using methods such as direct transfection of the Rep protein rather than the rep gene (45), or using a conditionally inactivated rep gene containing Lox-Cre recombination elements (84). Another strategy has been to fuse the rep gene to a hormone-dependent ligandbinding domain such that it is only transported to the nucleus in the presence of a hormone analogue (75). Other hybrid systems for Rep-mediated Ch19-specific integration have utilized chimeric virus constructs to deliver both the transgene sequences and the rep gene. 9 Oct 2004 0:15 Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. 830 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI Recchia et al. created helper-dependent adenovirus constructs, one containing the rep gene, the other containing the transgene and TR sequences (74). Coinfection with the two virus constructs allowed efficient targeting of the transgene to the AAVS1 site. An important advantage of this system is the large transgene capacity of the helper-dependent Ad system, though it is not clear whether the imprecise nature of AAV integration can maintain the integrity of very large transgenes. Chimeric herpes simplex virus (HSV) vectors utilizing Rep-mediated targeted integration have also been created and tested in several different contexts (36, 97). In each case, the presence of the rep gene in the chimeric vector resulted in efficient integration of the construct into AAVS1 as well as increased stability of transgene expression. In contrast, similar vectors used to infect glioma cell lines resulted in decreased stability of transgene expression over time, presumably due to toxicity of the Rep protein (44). These studies have more recently been extended to a transgenic animal model, wherein the exogenous human AAVS1 sequence served as a target for integration of the HSV/AAV hybrid vector (3). There is still a great deal of potential for the use of these hybrid systems for gene delivery, whether in a research or a clinical setting. Future developments are likely to include more efficient means of direct protein delivery using nonviral vectors, nonviral delivery of rep RNA transcripts, or RNA virus-mediated delivery of the rep gene. In any of these cases, rep expression would be transient, thus allowing for long-term stability of the integrated transgene and minimal toxicity from Rep protein. Chromosome 19 Disruptions and Consequences for the Cell The propensity for rep-containing AAV vectors, and hybrid vector systems derived from them, to integrate into human Ch19 has potential benefits for gene therapy applications, but potential side effects as well. In addition to the deletions, insertions, and duplications of the AAVS1 locus upon AAV integration, as noted above, the nonintegrated AAVS1 allele can also suffer amplifications and rearrangements (58, 108). These rearrangements are dependent only upon the expression of the Rep protein, and do not require delivery of a construct containing the AAV terminal repeats. While no specific disease syndrome has been linked to AAV infection, the loss of both AAVS1 alleles would probably be a relatively rare circumstance, making it difficult to evaluate a putative detrimental effect epidemiologically. Early characterization of the AAVS1 sequence suggested that it was part of an actively transcribed region (41). In addition to the presence of a CpG island and multiple transcription factor binding sites, part of the AAVS1 sequence was represented in a cDNA library. The DNase hypersensitive region associated with this region in human cells, and its maintenance in transgenic animals, was also consistent with an overlapping transcription unit (46). Dutheil et al. have characterized the transcript from this region as part of a muscle cell-specific gene called the slow skeletal troponin T gene (TNNT1), which also maps to 19q13.4 in humans (14). Because AAV is generally considered to be an upper respiratory virus, based 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH AAV INTEGRATION 831 on the tissue tropism of its helper Ad, it is not clear whether disruption of both alleles of TNNT1 would have any potential pathogenic effect. However, rAAV vectors can efficiently infect muscle cells when directly injected, and a phenotype from disruption of AAVS1 in this tissue might be more readily detectable. A clear understanding of the role of the TNNT1 gene product will be essential before AAV-derived targeting vectors can be applied in humans. Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. Evolutionary Considerations of AAV Integration As stated above, the AAVS1 sequence, with active RBE and nicking site, is found only in humans and higher primates. Whether AAV benefits from the ability to integrate into this site is still a matter of speculation. However, establishing a stable proviral state in the absence of a helper virus would allow for long-term maintenance of the AAV sequence in dividing cells, until conditions were favorable for replication and packaging of progeny virus. In favor of the hypothesis that AAV has adapted the ability to integrate specifically is the observation of what appears to be coevolution of the AAVS1 site and AAV serotype 4 from monkeys. Amiss et al. noted that the AAV-4 RBE contains an extra copy of the GAGC repeat unit (2). Upon sequencing the AAVS1 homologue from simian CV-1 cells, it was also found to contain an additional copy of the repeat unit. Subsequent experiments revealed that binding of the AAV-4 Rep to the simian sequence had a higher affinity than that of AAV-2 or AAV-4 to the human sequence. Further, the frequency of integration into the AAVS1 homologue was also greater using AAV-4 in simian cells. This suggests an evolving relationship between the virus and the preferred host, based on interactions at the AAVS1 site, which would be difficult to explain unless there is some benefit to the virus. On the other hand, AAV integration is relatively inefficient compared with obligate integrating viruses, such as retrovirus. With an observed frequency of less than 0.5%, it is not clear how integration would be biologically relevant. Additionally, the integration mechanism is imprecise, often leading to rearrangements of the proviral sequences as well as the target sequence. It would appear that AAV integration is largely a consequence of host-cell–mediated illegitimate recombination. The fact that the recombination can be initiated by the action of AAV Rep protein at a serendipitous juxtaposition of an RBE and a functional nicking site may be essentially an accident of nature. However, evolution is driven by such accidents and their consequences, and AAV integration might well be regarded as an evolutionary work in progress. INTEGRATION OF AAV VECTORS The concept of using AAV as a gene therapy vector was built largely on the idea that it was an integrating virus, an immensely desirable property in that infected cells would pass the transgene to daughter cells, and readministration of the vector would unnecessary. While the advantages of vector integration may still apply to 9 Oct 2004 0:15 832 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI rapidly dividing cell types, such as bone marrow–derived cells, they are not so clear for most tissues in the body, which divide slowly if at all. The maturing consensus in gene therapy now includes the recognition that the risks inherent in an integrating vector, oncogenic transformation and gene inactivation, outweigh the benefits of permanent transduction in many tissues. Fortunately for adherents of rAAV gene therapy, the intervening years have also seen a change in our assessment of the integration properties of rAAV vectors. Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. Integration Efficiency of rAAV Vectors In Vitro Some of the earliest experiments with rAAV demonstrated that vectors lacking the Rep protein were still competent for chromosomal integration (52, 54, 82). Although these vectors were initially thought to integrate more efficiently than those retaining rep, this does not appear to be the case (see above). Most of the early studies, as well as many current reports, focused on the percentage of cells that could be stably transformed in the presence of a relatively high multiplicity of infection with vector. Such experiments frequently resulted in transduction frequencies over 80%; i.e., up to 80% of the cells in the culture expressed the transgene. When a selectable marker gene such as neo was used, this translated to an 80% integration frequency. While informative, a more generally useful measure of integration efficiency is the number of transformation events per infectious unit of vector. This could be calculated from the data in many studies by estimating the infectious dose based on typical particle to infectious unit ratios, which range from approximately 10-100:1 in cultured cells (reviewed in 51). These calculations yield numbers in the range of 0.1–0.5 integrations per infectious unit, either with or without selection. Although this is far from the efficiency typically associated with obligate integrating virus vectors such as retrovirus, it is still higher than has been observed from a nonintegrating virus vectors such as adenovirus, which exhibits integration frequencies ranging from 10-3 to 10-5 per cell at a multiplicity of infection of 10 PFU per cell (23). This translates to approximately 10-4 to 10-6 per infectious vector genome, compared with 0.2–1.0 × 10-3 for rAAV vectors. Further, unlike retroviruses or retrovirus vectors, which must integrate to express their genes, rAAV vectors clearly do not require integration for gene expression (18). The efficiency of Rep-independent rAAV vector integration is within the same range as Rep-mediated integration. Although Rep clearly has no role in the integration of these vector genomes, some of the same host-cell factors that mediate Rep-independent integration may also participate in the specific integration mediated by Rep protein. Very little is known about host-cell factors involved in Rep-independent integration of rAAV vectors, or whether any factors specifically recognize and operate through interaction with the AAV terminal repeats. Indeed, the view that the AAV TRs mediate integration at all has recently come into question. However, a body of evidence has accumulated over the years, supporting the idea that the presence of the AAV TRs enhances the rate of integration, at least under the experimental conditions applied. A further consideration is the possibility 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH AAV INTEGRATION 833 that the AAV TRs enhance the persistence of the vector DNA episomally, by resisting degradation, promoting the formation of circular and concatemeric molecules, or by providing a replication origin. An increased probability of chromsomal integration would then be a secondary consequence of this episomal persistence. For this reason, it is important to consider the behavior of episomal rAAV genomes, as these are the likely precursors for integration. Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. Episomal Conformation of rAAV Genomes Many recent studies have focused on the fate of rAAV genomes after transport to the nucleus, uncoating, and second-strand synthesis. These have often been performed in an in vivo setting, partly because of the relevance to gene therapy, but also because the episomes are maintained for much longer periods of time in the nondividing cells of animal tissues than they are in cell cultures. Generally, these studies point to processing of the rAAV genomes by intramolecular and intermolecular recombination to form circles and concatemers, respectively. Several studies using split-gene vector coinfections were contrived to increase the gene delivery capacity of rAAV vectors (92, 104). In these constructs, half the gene is in one vector and the other half in a second vector, such that reconstitution of the reporter gene requires the joining of two different vectors in the correct orientation. These and earlier experiments demonstrating rescue of mixed vector concatemers in bacteria clearly demonstrate that some, if not all, concatemers result from intermolecular recombination (106). Concatemers created by amplification of circular episomes would not lead to gene expression from these constructs. The circularization of rAAV genomes has generally been evaluated by either rescue of the circularized molecules after transfection into bacteria or visualization of circular forms in Southern blots. These studies have demonstrated that monomeric linear rAAV DNA is converted to circular forms over time, and that these circular forms are then slowly converted into large Mr concatamers (>12 kb) (12, 13, 55, 96). Whereas some concatemers result from the joining of two different vector molecules early after infection, rolling circle replication from the circular monmomeric genomes may also contribute to the high Mr forms. Rolling circle replication from plasmids containing the AAV TR in the absence of Rep protein has been reported in cultured cells, and in cell-free extracts, after subjecting the cells to genotoxic stress (102, 103). This would be consistent with the predominantly head-to-tail arrangement of genomes in these structures. In order to determine whether single-stranded rAAV DNA had a role in recombination between AAV genomes, Yue et al. performed sequential infections with split-gene vectors in mouse muscle tissue at times separated by several weeks (109). The second vector was added after the DNA of the first vector had been either converted to double-strand or cleared. Indeed, most of the vector DNA had been converted to monomeric circles and large concatemers before the second vector was added. Reconstitution of gene expression after infection with the second vector demonstrated that single-strand DNA was not essential for the formation of heteroconcatemers. Further, it implied that the second vector interacted with DNA 9 Oct 2004 0:15 834 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. that had already circularized or joined into a higher Mr structure. However, it is not clear whether the circular forms recombine directly or through a linear intermediate. Conversion of circular plasmids containing a modified AAV TR sequence, resembling that often found at the junction between joined AAV ends, has been observed in cultured cells (101). This raises the possibility that episomal forms of rAAV exist in a dynamic balance between circular and linear forms, with circularization predominating. Transient linearization would create an opportunity for recombination with a second linear genome, or a larger concatemer. Such a mechanism of transient availability of rAAV ends could also contribute to integration into the host chromosome. Efficiency of rAAV Integration In Vivo The safe and efficacious use of rAAV vectors for gene therapy will rely on a thorough understanding of its propensity for integration into the host chromosome and the potential consequences of integration. Early experiments with rAAV vectors in cell culture, with or without selection, had established that integration was relatively inefficient, at approximately 0.1%–0.5% of infectious vector genomes. However, it was not clear that this would be the case in the nondividing or slowly dividing cells of animal tissues. Several recent studies have addressed the question of integration efficiency in vivo, with results that are somewhat inconsistent. The first studies designed to estimate the fraction of rAAV vector DNA integrated into the host chromosome were performed in transduced mouse liver tissue. It had already been noted that the rAAV vector genomes are converted from low Mr episomes to higher Mr forms in this and other tissues over the course of several weeks (16, 91, 100). In a subsequent study, Miao et al. assessed the integration status of these high Mr forms using both pulsed-field electrophoreses and in situ hybridization (56). Southern blots of the pulsed-field gels suggested that all of the high Mrvector DNA was associated with chromosomal DNA, or at least with DNA segments greater than 1 megabase in size. Based on their previous estimates of an average rAAV vector content in transduced mouse liver of 3.5 copies per cell, and their observation that only 5% of liver cells were expressing the transgene, the authors suggested that most vector DNA was within cells containing an average 70 vector genome copies. If these were arrayed in concatemers, the average size would be in the range of only 275 kilobases. The authors then looked at interphase and metaphase nuclei for rAAV sequences associated with chromosomes. Consistent with their transgene expression observations, they found vector in 4.8% and 5.5% of interphase and metaphase nuclei, respectively. The vector sequences were detected on sister chromatids in the metaphase nuclei, supporting the conclusion that they were integrated. These results were put in doubt, however, when subsequent studies revealed a population of extrachromosomal vector DNA within the transduced hepatocytes, in the form of monomeric circular and linear molecules (62). Further evidence that the preponderance of rAAV DNA remained episomal came from studies employing two thirds partial hepatectomy in transduced mice to induce liver tissue regenera- 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. AAV INTEGRATION 835 tion (63). After one to two cycles of hepatocyte cell division, the transgene expression would decrease if the vector DNA were episomal, and remain stable if it were integrated. Mice subjected to partial hepatectomy at 12 weeks or 12 months postinfection displayed decreases in vector gene expression of approximately 85%–95%, and similar decreases in the average number of vector genomes per cell. There was no difference between mice hepatectomized at 12 weeks and 12 months postinjection, implying that the balance between integrated and episomal vector DNA was relatively stable. Because the limited number of cell divisions did not ensure that all of the extrachromosomal vector DNA was lost, this suggests that the observed 5%–15% integrated vector represented a maximum value in mouse hepatocytes. Chen et al. also observed integration of rAAV in liver tissue using an in vivo selection model (92). In this case, a population of transduced hepatocytes representing 0.1%–0.5% of the liver could be expanded to approximately 50%–90% of the liver after selection was applied. While this does not provide quantitative information about the frequency of vector integration, it does corroborate the presence of integrated rAAV genomes in liver tissue. Whether these genomes were integrated before or after the induction of hepatocyte cell division is not known. The evaluation of rAAV integration frequency in mouse muscle tissue has yielded far more conservative estimates. Using three different assays, Schnepp et al. were unable to detect integrated vector DNA to a sensitivity of <0.5% of total vector DNA (85). First, genomic DNA from transduced muscle tissue was size selected and cloned into phage. Screening of 1.5 × 106 clones, representing twice the cellular genome content, revealed no vector-specific sequence. A second method of quantitation relied on a carefully calibrated PCR assay to amplify integrated rAAV sequences. At a sensitivity of 75 integrated vector genomes per 1.5 × 104 total vector genomes, no integrated rAAV sequences could be detected. In a third assay, a DNase that was specific for linear double-strand, and linear or circular single-strand DNA, but not double-strand circular DNA, was used to digest DNA from transduced mouse muscle tissue. The DNase treatment did not significantly reduce the amount of vector sequence in these samples, suggesting that all of the vector was in the form of double-stranded circular DNA molecules. Contribution of TRs to Persistence and Integration Recombinant AAV genomes have long been viewed as special substrates for chromosomal integration because of the presence of the hairpin TR structures. However, the role of these hairpins in the recombination mechanism that leads to integration has been difficult to verify or characterize. Direct comparisons between rAAV genomes and plain DNA are difficult due to the qualitative and quantitative differences in delivery mechanisms. One study that attempted to normalize for these problems compared naked linear DNA, either with or without the AAV TR sequences, in mouse liver after in vivo hydrodynamic delivery (61). Although the TRs of the transfected DNA were not in the hairpin configuration at the outset, it remained possible that isomerization of the TRs within the cell would create a molecule resembling the rAAV genome, and potentially provide a primer for 9 Oct 2004 0:15 Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. 836 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI at least one round of replication. The integration status of the DNA was then evaluated by measuring transgene expression after a two thirds partial hepatectamy. The authors found that induction of liver regeneration reduced expression from linear molecules lacking the AAV TR sequences by approximately tenfold, whereas expression from linear molecules containing the TR sequences dropped by only fivefold. Expression from circular molecules containing one copy of the TR dropped by approximately 20-fold. This suggested that the TRs did have a role in integration, albeit small, and further, that the linear form was more likely to integrate than the circular form. Isolation and characterization of some of the hostchromosome junctions with vector DNA showed that the recombination usually occurred within the TR sequence, again implying a role for the hairpin, at least to the extent of providing a free end as a substrate for recombination. Structure of rAAV Integration Junctions Like the Rep-mediated integration described above, a great deal may be learned about rAAV vector integration from the structures of the vector-host cell DNA junctions. Examples of these structures have been characterized from both cell culture and animal tissues (59, 80, 105). Each study revealed similar junction structures, whether the source was liver tissue in animals, rapidly dividing cells in culture, or recombination reactions carried out in cell-free lysates. They were also similar to junctions recovered from Rep-mediated integration, except that they were not located near the AAVS1 site. The vector-cellular DNA breakpoints were predominantly found within the AAV TR, though all had some degree of deletion of TR sequences. Small, 2–5-bp microhomologies between vector and host sequence were generally found at the crossover points, but no large regions of homology were observed. This suggested that rAAV vector integration was mediated by nonhomologous recombination, either specifically enhanced by the TR sequence or secondary structure, or utilizing a free DNA end formed by the hairpin structure. More recent studies of rAAV vector integration have benefited from the availability of the human genome sequence in characterizing the chromosomal integration site and the changes that took place at the junctions (58). In addition to deletions within the TRs, small deletions (9–71 bp) and small insertions (1–13 bp) were found within the cellular sequences. Larger deletions, up to 2 kb, were also noted in a separate study (60). A segment of the plasmid vector pBR322 was also found at one junction, implying that it had been packaged and transduced with the vector. An unexpectedly large proportion (4 of 9) of the junctions were found within genes. Also, four of the junctions were located within a highly transcribed, 22-Mb region of Ch19, though at great distances from the AAVS1 site. These observations suggested that rAAV preferentially integrates into actively transcribed regions. One junction spanned a chromosome 9 to chromosome 17 translocation. The association with actively transcribed regions, which are hot spots for DNA damage, and instances of chromosomal rearrangement at the junctions suggested the possibility that the rAAV genome was interacting with previously formed 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH AAV INTEGRATION 837 double-strand DNA breaks. This further suggested a model for rAAV integration based on nonhomologous end-joining between the hairpin TR and the broken chromosome ends (78). Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. Role of Double-Strand DNA Breaks (DSB) in rAAV Integration The possibility that rAAV integrates through the cellular nonhomologous endjoining DNA repair pathway was further explored using an induced DSB model (57). The authors used a retrovirus vector to create a cell line with a unique I-SceI restriction site within a bacterial shuttle construct. A second retrovirus construct was used to introduce the I-SceI enzyme gene, such that a fraction (approximately 5%) of the cells would have a DSB at the specific site. The cells were then infected with an rAAV vector containing a selectable marker, and integration at the induced DSB was assayed in integration-positive cells. One of the 12 positive clones contained the rAAV at the induced DSB site. When the DSB site was more specifically examined by rescue in bacteria, 8 of 190 clones (4.2%) contained the rAAV within the site, suggesting that the DSB created a target for rAAV integration. A converse experiment, assaying the percentage of rAAV integrations occurring at this specific DSB, utilized an rAAV shuttle vector for replication in bacteria. This resulted in the finding that 7.4% of the integrated rAAV had targeted the induced double-strand break. Selection with the target site marker gene revealed that 0.59% of the I-SceI sites contained the rAAV vector. The authors examined rAAV integration at DSBs induced by treatment of cells with etoposide or γ -irradiation. Both treatments resulted in an increased frequency of rAAV integration. The structures of the chromosomal integration sites from cells with induced DSBs were then compared with sites rescued from uninduced cells. Both types of junctions showed a preference for integration into transcribed regions, and both showed similar aberrations of chromosomal sequence at the junction sites. Together, these results suggest that rAAV integrates into pre-existing chromosome breaks by nonhomologous end-joining, and that the aberrations associated with rAAV junctions may be the result of inaccurate repair of previously generated deletions and insertions. A distinctly different role for DSB was characterized in a study of rAAV vectormediated gene repair by homologous recombination. Previous reports had shown that rAAV could mediate gene correction by homologous recombination driven by flanking homologous sequences in the vector genome, and that the singlestranded rAAV genome was likely to participate directly in this interaction (34, 35, 79). Porteus et al. created an I-SceI endonuclease model for gene correction at an induced DSB site (73). An rAAV vector with sequences flanking the inserted I-SceI site could correct a GFP gene by deletion of the intervening sequences. Induction of a DSB at this site increased the frequency of gene repair by >100-fold. Importantly, the enhancement of homologous recombination by induced DSBs can also be mediated by plasmid (72). This suggests that while rAAV integrates randomly by interacting with nonhomologous end-joining repair pathways, it can also interact with repair pathways involving homologous recombination. This points to the 9 Oct 2004 0:15 838 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI possibility that the rAAV genome integrates into the host chromosome as a passive bystander rather than an initiator of recombination. Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. Integration into Active Genes The study discussed above on the effects of rAAV integration into host chromosomal DNA also implied that rAAV preferentially integrates into actively transcribed genes in cultured cells (58). These results have been confirmed and extended in a recent report demonstrating preferential integration into active genes in mouse liver tissue (60). Of the integration sites examined, 21 of 29 (72%) were in genes, and all of these genes were actively expressed in hepatocytes. These events were accompanied by deletions of up to 2 kb, though smaller deletions predominated. Safety of rAAV Vectors The possibility that rAAV integration preferentially targets transcribed genes raises implications for its use as a clinical gene therapy vector, particularly in light of instances of oncogenic transformation in a recent clinical trial as a result of retrovirus insertional activation (22). However, these concerns should be tempered by considering the underlying mechanisms of this phenomenon and factors that contribute to it (78). First, the preferential integration into active regions of the chromosome and transcribed genes is not unique to rAAV vectors. Vectors derived from HIV are also prone to integrate into active genes, though it is not clear to what extent this is due to the local chromatin structure or to interaction with transcription factors (86). A preference toward transcribed regions was also noted in the characterization of adenovirus integration, even though this is generally considered to be a nonintegrating virus (87). Thus, exogenous DNA delivered by any means may carry a similar risk. A second factor that must be considered in evaluating the safety of rAAV with respect to disruption of active genes is the absolute integration frequency of the vector in the target tissue. As discussed above, this frequency is low in liver tissue and not yet detected in muscle tissue. Further, the instances of integration in liver tissue were all recovered from tissue that had been induced to regenerate through injury, which may affect the interaction between vector and host genomes. Finally, the relationship between actively transcribed regions and DNA damage repair is well established, and up to one third of DSBs in transcribed regions result in mutations (1). Thus, if the suggestion that rAAV integrates as a bystander to DNA damage repair is correct, it is not clear that the integration event would cause significantly more damage than would have occurred in that cell in any event, at least in terms of gene inactivation. On the other hand, the possibility of insertional activation of an oncogene, as seen with retroviruses, must also be considered. Unlike retroviruses, however, neither rAAV nor its parent virus has ever been associated with malignancies in humans or animals, or with transformation of cells. Insertional activation from retrovirus is associated with the strong promoter activity of its downstream long terminal repeat. Although there is a weak promoter 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. AAV INTEGRATION 839 activity associated with the rAAV TR, due to its position at the A/D sequence junction, it is likely limited to transcription inward through vector sequence rather than outward into chromosomal DNA (17, 21). Although the safety of rAAV gene therapy will ultimately be determined in controlled clinical trials, the theoretical risks and their underlying mechanisms will continue to be the subject of active research. Foremost among the questions to be answered will be the actual integration frequency in any given tissue or cell type, under growth conditions that reflect the clinical situation. Additionally, an assessment of the consequences of rAAV integration-mediated mutagenesis in vivo, in terms of cell or tissue damage or transformation, will provide a useful framework for the evaluation of rAAV vector safety. The Annual Review of Genetics is online at http://genet.annualreviews.org LITERATURE CITED 1. Allen C, Miller CA, Nickoloff JA. 2003. The mutagenic potential of a single DNA double-strand break in a mammalian chromosome is not influenced by transcription. DNA Repair 2:1147–56 2. Amiss TJ, McCarty DM, Skulimowski A, Samulski RJ. 2003. Identification and characterization of an adeno-associated virus integration site in CV-1 cells from the African green monkey. J. Virol. 77:1904–15 3. Bakowska JC, Di Maria MV, Camp SM, Wang Y, Allen PD, Breakefield XO. 2003. Targeted transgene integration into transgenic mouse fibroblasts carrying the fulllength human AAVS1 locus mediated by HSV/AAV rep(+) hybrid amplicon vector. Gene Ther. 10:1691–702 4. Balague C, Kalla M, Zhang WW. 1997. Adeno-associated virus Rep78 protein and terminal repeats enhance integration of DNA sequences into the cellular genome. J. Virol. 71:3299–306 5. Batchu RB, Hermonat PL. 1995. The trans-inhibitory Rep78 protein of adenoassociated virus binds to TAR region DNA of the human immunodeficiency virus type 1 long terminal repeat. FEBS Lett. 367:267–71 6. Berns KI, Cheung A, Ostrove J, Lewis M. 7. 8. 9. 10. 11. 12. 1982. Adeno-associated virus latent infection. In Virus Persistance, ed. BWJ Mahy, AC Minson, GK Darby, pp. 249–65. Cambridge: Cambridge Univ. Press Brister JR, Muzyczka N. 2000. Mechanism of Rep-mediated adeno-associated virus origin nicking. J. Virol. 74:7762–71 Chang L-S, Shenk T. 1990. The adenovirus DNA binding protein stimulates the rate of transcription directed by adenovirus and adeno-associated virus promoters. J. Virol. 64:2103–9 Chang L-S, Shi Y, Shenk T. 1989. Adenoassociated virus p5 promoter contains an adenovirus EIA inducible element and a binding site for the major late transcription factor. J. Virol. 63:3479–88 Cheung AK, Hoggan MD, Hauswirth WW, Berns KI. 1980. Integration of the adeno-associated virus genome into cellular DNA in latently infected human Detroit 6 cells. J. Virol. 33:739–48 Chiorini JA, Yang L, Safer B, Kotin RM. 1995. Determination of adeno-associated virus Rep68 and Rep78 binding sites by random sequence oligonucleotide selection. J. Virol. 69:7334–38 Duan D, Sharma P, Yang J, Yue Y, Dudus L, et al. 1998. Circular intermediates of recombinant adeno-associated virus have 9 Oct 2004 0:15 840 13. Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. 14. 15. 16. 17. 18. 19. 20. 21. AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI defined structural characteristics responsible for long-term episomal persistence in muscle tissue. J. Virol. 72:8568–77. Erratum. 1999. J. Virol. 73(1):861 Duan D, Yan Z, Yue Y, Engelhardt JF. 1999. Structural analysis of adenoassociated virus transduction circular intermediates. Virology 261:8–14 Dutheil N, Shi F, Dupressoir T, Linden RM. 2000. Adeno-associated virus site-specifically integrates into a musclespecific DNA region. Proc. Natl. Acad. Sci. USA 97:4862–66 Dyall J, Szabo P, Berns KI. 1999. Adenoassociated virus (AAV) site-specific integration: formation of AAV-AAVS1 junctions in an in vitro system. Proc. Natl. Acad. Sci. USA 96:12849–54 Fisher KJ, Jooss K, Alston J, Yang Y, Haecker SE, et al. 1997. Recombinant adeno-associated virus for muscle directed gene therapy. Nat. Med. 3:306–12 Flotte TR, Afione SA, Solow R, Drumm ML, Markakis D, et al. 1993. Expression of the cystic fibrosis transmembrane conductance regulator from a novel adenoassociated virus promoter. J. Biol. Chem. 268:3781–90 Flotte TR, Afione SA, Zeitlin PL. 1994. Adeno-associated virus vector gene expression occurs in nondividing cells in the absence of vector DNA integration. Am. J. Respir. Cell Mol. Biol. 11:517–21 Giraud C, Winocour E, Berns KI. 1994. Site-specific integration by adenoassociated virus is directed by a cellular DNA sequence. Proc. Natl. Acad. Sci. USA 91:10039–43 Giraud C, Winocour E, Berns KI. 1995. Recombinant junctions formed by sitespecific integration of adeno-associated virus into an episome. J. Virol. 69:6917– 24 Haberman RP, McCown TJ, Samulski RJ. 2000. Novel transcriptional regulatory signals in the adeno-associated virus terminal repeat A/D junction element. J. Virol. 74:8732–39 22. Hacein-Bey-Abina S, Von Kalle C, Schmidt M, McCormack MP, Wulffraat N, et al. 2003. LMO2-associated clonal T cell proliferation in two patients after gene therapy for SCID-X1. Science 302:415– 19 23. Harui A, Suzuki S, Kochanek S, Mitani K. 1999. Frequency and stability of chromosomal integration of adenovirus vectors. J. Virol. 73:6141–46 24. Hermonat PL, Muzyczka N. 1984. Use of adeno-associated virus as a mammalian DNA cloning vector: transduction of neomycin resistance into mammalian tissue culture cells. Proc. Natl. Acad. Sci. USA 81:6466–70 25. Hernandez YJ, Wang J, Kearns WG, Loiler S, Poirier A, Flotte TR. 1999. Latent adeno-associated virus infection elicits humoral but not cell-mediated immune responses in a nonhuman primate model. J. Virol. 73:8549–58 26. Hoggan MD. 1970. Adeno-associated viruses. Prog. Med. Virol. 12:211–39 27. Hoggan MD, Blacklow NR, Rowe WP. 1966. Studies of small DNA viruses found in various adenovirus preparations: physical, biological, and immunological characteristics. Proc. Natl. Acad. Sci. USA 55:1457–71 28. Hoggan MD, Thomas GF, Thomas FB, Johnson FB. 1972. Continuous carriage of adenovirus associated virus genome in cell culture in the absence of helper adenovirus. Presented at Proc. Lepetite Colloquium, 4th, Cocoyac, Mexico 29. Huser D, Heilbronn R. 2003. Adenoassociated virus integrates site-specifically into human chromosome 19 in either orientation and with equal kinetics and frequency. J. Gen. Virol. 84:133–37 30. Huser D, Weger S, Heilbronn R. 2002. Kinetics and frequency of adeno-associated virus site-specific integration into human chromosome 19 monitored by quantitative real-time PCR. J. Virol. 76:7554–59 31. Huttner NA, Girod A, Schnittger S, Schoch C, Hallek M, Buning H. 2003. 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH AAV INTEGRATION 32. Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. 33. 34. 35. 36. 37. 38. 39. 40. Analysis of site-specific transgene integration following cotransduction with recombinant adeno-associated virus and a rep encoding plasmid. J. Gene Med. 5: 120–29 Im D-S, Muzyczka N. 1989. Factors that bind to the AAV terminal repeats. J. Virol. 63:3095–104 Im D-S, Muzyczka N. 1990. The AAV origin binding protein Rep68 is an ATPdependent site-specific endonuclease with DNA helicase activity. Cell 61:447–57 Inoue N, Dong R, Hirata RK, Russell DW. 2001. Introduction of single base substitutions at homologous chromosomal sequences by adeno-associated virus vectors. Mol. Ther. 3:526–30 Inoue N, Hirata RK, Russell DW. 1999. High-fidelity correction of mutations at multiple chromosomal positions by adeno-associated virus vectors. J. Virol. 73:7376–80 Johnston KM, Jacoby D, Pechan PA, Fraefel C, Borghesani P, et al. 1997. HSV/AAV hybrid amplicon vectors extend transgene expression in human glioma cells. Hum. Gene Ther. 8:359–70 Kearns WG, Afione SA, Fulmer SB, Pang MC, Erikson D, et al. 1996. Recombinant adeno-associated virus (AAV-CFTR) vectors do not integrate in a site-specific fashion in an immortalized epithelial cell line. Gene Ther. 3:748–55 Kogure K, Urabe M, Mizukami H, Kume A, Sato Y, et al. 2001. Targeted integration of foreign DNA into a defined locus on chromosome 19 in K562 cells using AAV-derived components. Int. J. Hematol. 73:469–75 Kokorina NA, Santin AD, Li C, Hermonat PL. 1998. Involvement of proteinDNA interaction in adeno-associated virus Rep78-mediated inhibition of HIV1. J. Hum. Virol. 1:441–50 Kotin RM, Berns KI. 1989. Organization of adeno-associated virus DNA in latently infected Detroit 6 cells. Virology 170:460–67 841 41. Kotin RM, Linden RM, Berns KI. 1992. Characterization of a preferred site on human chromosome 19q for integration of adeno-associated virus DNA by nonhomologous recombination. EMBO J. 11: 5071–78 42. Kotin RM, Siniscalco M, Samulski RJ, Zhu XD, Hunter L, et al. 1990. Sitespecific integration by adeno-associated virus. Proc. Natl. Acad. Sci. USA 87: 2211–15 43. Lackner DF, Muzyczka N. 2002. Studies of the mechanism of transactivation of the adeno-associated virus p19 promoter by Rep protein. J. Virol. 76:8225–35 44. Lam P, Hui KM, Wang Y, Allen PD, Louis DN, et al. 2002. Dynamics of transgene expression in human glioblastoma cells mediated by herpes simplex virus/adenoassociated virus amplicon vectors. Hum. Gene Ther. 13:2147–59 45. Lamartina S, Roscilli G, Rinaudo D, Delmastro P, Toniatti C. 1998. Lipofection of purified adeno-associated virus Rep68 protein: toward a chromosome-targeting nonviral particle. J. Virol. 72:7653–58 46. Lamartina S, Sporeno E, Fattori E, Toniatti C. 2000. Characteristics of the adenoassociated virus preintegration site in human chromosome 19: open chromatin conformation and transcription-competent environment. J. Virol. 74:7671–77 47. Linden RM, Ward P, Giraud C, Winocour E, Berns KI. 1996. Site-specific integration by adeno-associated virus. Proc. Natl. Acad. Sci. USA 93:11288–94 48. Linden RM, Winocour E, Berns KI. 1996. The recombination signals for adenoassociated virus site-specific integration. Proc. Natl. Acad. Sci. USA 93:7966–72 49. McCarty DM, Pereira DJ, Zolotukhin I, Zhou X, Ryan JH, Muzyczka N. 1994. Identification of linear DNA sequences that specifically bind the adeno-associated virus Rep protein. J. Virol. 68:4988–97 50. McCarty DM, Ryan JH, Zolotukhin S, Zhou X, Muzyczka N. 1994. Interaction of the adeno-associated virus Rep 9 Oct 2004 0:15 842 51. Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. 52. 53. 54. 55. 56. 57. 58. 59. 60. AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI protein with a sequence within the A palindrome of the viral terminal repeat. J. Virol. 68:4998–5006 McCarty DM, Samulski RJ. 1997. Adenoassociated viral vectors. In Concepts in Gene Therapy, ed. M Strauss, JA Barranger, pp. 61–78. Berlin: Walter de Gruyter McLaughlin SK, Collis P, Hermonat PL, Muzyczka N. 1988. Adeno-associated virus general transduction vectors: analysis of proviral structures. J. Virol. 62: 1963–73 Mendelson E, Miller IL, Carter BJ. 1988. Expression and rescue of a nonselected marker from an integrated AAV vector. Virology 166:154–65 Mendelson E, Smith MG, Miller IL, Carter BJ. 1988. Effect of a viral rep gene on transformation of cells by an adeno-associated virus vector. Virology 166:612–15 Miao CH, Nakai H, Thompson AR, Storm TA, Chiu W, et al. 2000. Nonrandom transduction of recombinant adenoassociated virus vectors in mouse hepatocytes in vivo: Cell cycling does not influence hepatocyte transduction. J. Virol. 74: 3793–803 Miao CH, Snyder RO, Schowalter DB, Patijn GA, Donahue B, et al. 1998. The kinetics of rAAV integration in the liver [letter]. Nat. Genet. 19:13–15 Miller DG, Petek LM, Russell DW. 2004. Adeno-associated virus vectors integrate at chromosome breakage sites. Nat. Genet. 36:767–73 Miller DG, Rutledge EA, Russell DW. 2002. Chromosomal effects of adenoassociated virus vector integration. Nat. Genet. 30:147–48 Nakai H, Iwaki Y, Kay MA, Couto LB. 1999. Isolation of recombinant adenoassociated virus vector-cellular DNA junctions from mouse liver. J. Virol. 73: 5438–47 Nakai H, Montini E, Fuess S, Storm TA, Grompe M, Kay MA. 2003. AAV serotype 61. 62. 63. 64. 65. 66. 67. 68. 2 vectors preferentially integrate into active genes in mice. Nat. Genet. 34:297– 302 Nakai H, Montini E, Fuess S, Storm TA, Meuse L, et al. 2003. Helper-independent and AAV-ITR-independent chromosomal integration of double-stranded linear DNA vectors in mice. Mol. Ther. 7:101– 11 Nakai H, Storm TA, Kay MA. 2000. Recruitment of single-stranded recombinant adeno-associated virus vector genomes and intermolecular recombination are responsible for stable transduction of liver in vivo. J. Virol. 74:9451–63 Nakai H, Yant SR, Storm TA, Fuess S, Meuse L, Kay MA. 2001. Extrachromosomal recombinant adeno-associated virus vector genomes are primarily responsible for stable liver transduction in vivo. J. Virol. 75:6969–76 Ni TH, Zhou X, McCarty DM, Zolotukhin I, Muzyczka N. 1994. In vitro replication of adeno-associated virus DNA. J. Virol. 68:1128–38 Oelze I, Rittner K, Sczakiel G. 1994. Adeno-associated virus type 2 rep genemediated inhibition of basal gene expression of human immunodeficiency virus type 1 involves its negative regulatory functions. J. Virol. 68:1229–33 Ogata T, Kozuka T, Kanda T. 2003. Identification of an insulator in AAVS1, a preferred region for integration of adenoassociated virus DNA. J. Virol. 77:9000– 7 Pereira DJ, McCarty DM, Muzyczka N. 1997. The adeno-associated virus (AAV) Rep protein acts as both a repressor and an activator to regulate AAV transcription during a productive infection. J. Virol. 71:1079–88 Philpott NJ, Giraud-Wali C, Dupuis C, Gomos J, Hamilton H, et al. 2002. Efficient integration of recombinant adenoassociated virus DNA vectors requires a p5-rep sequence in cis. J. Virol. 76:5411– 21 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. AAV INTEGRATION 69. Philpott NJ, Gomos J, Berns KI, FalckPedersen E. 2002. A p5 integration efficiency element mediates Rep-dependent integration into AAVS1 at chromosome 19. Proc. Natl. Acad. Sci. USA 99:12381– 85 70. Pieroni L, Fipaldini C, Monciotti A, Cimini D, Sgura A, et al. 1998. Targeted integration of adeno-associated virus-derived plasmids in transfected human cells. Virology 249:249–59 71. Ponnazhagan S, Erikson D, Kearns WG, Zhou SZ, Nahreini P, et al. 1997. Lack of site-specific integration of the recombinant adeno-associated virus 2 genomes in human cells. Hum. Gene Ther. 8:275– 84 72. Porteus MH, Baltimore D. 2003. Chimeric nucleases stimulate gene targeting in human cells. Science 300:763 73. Porteus MH, Cathomen T, Weitzman MD, Baltimore D. 2003. Efficient gene targeting mediated by adeno-associated virus and DNA double-strand breaks. Mol. Cell Biol. 23:3558–65 74. Recchia A, Parks RJ, Lamartina S, Toniatti C, Pieroni L, et al. 1999. Sitespecific integration mediated by a hybrid adenovirus/adeno-associated virus vector. Proc. Natl. Acad. Sci. USA 96:2615–20 75. Rinaudo D, Lamartina S, Roscilli G, Ciliberto G, Toniatti C. 2000. Conditional site-specific integration into human chromosome 19 by using a ligand-dependent chimeric adeno-associated virus/Rep protein. J. Virol. 74:281–94 76. Rittner K, Heilbronn R, Kleinschmidt JA, Sczakiel G. 1992. Adeno-associated virus type 2-mediated inhibition of human immunodeficiency virus type 1 (HIV-1) replication: involvement of p78rep/p68rep and the HIV-1 long terminal repeat. J. Gen. Virol. 73:2977–81 77. Rizzuto G, Gorgoni B, Cappelletti M, Lazzaro D, Gloaguen I, et al. 1999. Development of animal models for adenoassociated virus site-specific integration. J. Virol. 73:2517–26 843 78. Russell DW. 2003. AAV loves an active genome. Nat. Genet. 34:241–42 79. Russell DW, Hirata RK. 1998. Human gene targeting by viral vectors. Nat. Genet. 18:325–30 80. Rutledge EA, Russell DW. 1997. Adenoassociated virus vector integration junctions. J. Virol. 71:8429–36 81. Ryan JH, Zolotukhin S, Muzyczka N. 1996. Sequence requirements for binding of Rep68 to the adeno-associated virus terminal repeats. J. Virol. 70:1542–53 82. Samulski RJ, Chang LS, Shenk T. 1989. Helper-free stocks of recombinant adenoassociated viruses: Normal integration does not require viral gene expression. J. Virol. 63:3822–28 83. Samulski RJ, Zhu X, Xiao X, Brook JD, Housman DE, et al. 1991. Targeted integration of adeno-associated virus (AAV) into human chromosome 19. EMBO J. 10:3941–50. Erratum. 1992. EMBO J. 11(3):1228 84. Satoh W, Hirai Y, Tamayose K, Shimada T. 2000. Site-specific integration of an adeno-associated virus vector plasmid mediated by regulated expression of rep based on Cre-loxP recombination. J. Virol. 74:10631–38 85. Schnepp BC, Clark KR, Klemanski DL, Pacak CA, Johnson PR. 2003. Genetic fate of recombinant adeno-associated virus vector genomes in muscle. J. Virol. 77: 3495–504 86. Schroder AR, Shinn P, Chen H, Berry C, Ecker JR, Bushman F. 2002. HIV-1 integration in the human genome favors active genes and local hotspots. Cell 110:521–29 87. Schulz M, Freisem-Rabien U, Jessberger R, Doerfler W. 1987. Transcriptional activities of mammalian genomes at sites of recombination with foreign DNA. J. Virol. 61:344–53 88. Snyder RO, Im D-S, Muzyczka N. 1990. Evidence for covalent attachment of the adeno-associated virus Rep protein to the ends of the AAV genome. J. Virol. 64: 6204–13 9 Oct 2004 0:15 Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. 844 AR AR230-GE38-27.tex MCCARTY YOUNG AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH SAMULSKI 89. Snyder RO, Im DS, Ni T, Xiao X, Samulski RJ, Muzyczka N. 1993. Features of the adeno-associated virus origin involved in substrate recognition by the viral Rep protein. J. Virol. 67:6096–104 90. Snyder RO, Samulski RJ, Muzyczka N. 1990. In vitro resolution of covalently joined AAV chromosome ends. Cell 60:105–33 91. Snyder RO, Spratt SK, Lagarde C, Bohl D, Kaspar B, et al. 1997. Efficient and stable adeno-associated virus-mediated transduction in the skeletal muscle of adult immunocompetent mice. Hum. Gene Ther. 8:1891–900 92. Sun L, Li J, Xiao X. 2000. Overcoming adeno-associated virus vector size limitation through viral DNA heterodimerization. Nat. Med. 6:599–602 93. Surosky RT, Urabe M, Godwin SG, McQuiston SA, Kurtzman GJ, et al. 1997. Adeno-associated virus Rep proteins target DNA sequences to a unique locus in the human genome. J. Virol. 71:7951– 59 94. Tratschin JD, Miller IL, Smith MG, Carter BJ. 1985. Adeno-associated virus vector for high-frequency integration, expression, and rescue of genes in mammalian cells. Mol. Cell Biol. 5:3251–60 95. Urcelay E, Ward P, Wiener Sm, Safer B, Kotin Rm . 1995. Asymmetric replication in vitro from a human sequence element is dependent on adeno-associated virus Rep protein. J. Virol. 69:2038–46 96. Vincent-Lacaze N, Snyder RO, Gluzman R, Bohl D, Lagarde C, Danos O. 1999. Structure of adeno-associated virus vector DNA following transduction of the skeletal muscle. J. Virol. 73:1949–55 97. Wang Y, Camp SM, Niwano M, Shen X, Bakowska JC, et al. 2002. Herpes simplex virus type 1/adeno-associated virus rep(+) hybrid amplicon vector improves the stability of transgene expression in human cells by site-specific integration. J. Virol. 76:7150–62 98. Weitzman MD, Kyostio SR, Kotin RM, 99. 100. 101. 102. 103. 104. 105. 106. 107. Owens RA. 1994. Adeno-associated virus (AAV) Rep proteins mediate complex formation between AAV DNA and its integration site in human DNA. Proc. Natl. Acad. Sci. USA 91:5808–12 Wonderling RS, Owens RA. 1997. Binding sites for adeno-associated virus Rep proteins within the human genome. J. Virol. 71:2528–34 Xiao X, Li J, Samulski RJ. 1996. Efficient long-term gene transfer into muscle tissue of immunocompetent mice by adenoassociated virus vector. J. Virol. 70:8098– 108 Xiao X, Xiao W, Li J, Samulski RJ. 1997. A novel 165-base-pair terminal repeat sequence is the sole cis requirement for the adeno-associated virus life cycle. J. Virol. 71:941–48 Yalkinoglu AO, Heilbronn R, Burkle A, Schlehofer JR, zur-Hausen H. 1988. DNA amplification of adeno-associated virus as a response to cellular genotoxic stress. Cancer Res. 48:3123–32 Yalkinoglu AO, Zentgraf H, Hubscher U. 1991. Origin of adeno-associated virus DNA replication is a target of carcinogeninducible DNA amplification. J. Virol. 65:3175–84 Yan Z, Zhang Y, Duan D, Engelhardt JF. 2000. Trans-splicing vectors expand the utility of adeno-associated virus for gene therapy. Proc. Natl. Acad. Sci. USA 97: 6716–21 Yang CC, Xiao X, Zhu X, Ansardi DC, Epstein ND, et al. 1997. Cellular recombination pathways and viral terminal repeat hairpin structures are sufficient for adenoassociated virus integration in vivo and in vitro. J. Virol. 71:9231–47 Yang J, Zhou W, Zhang Y, Zidon T, Ritchie T, Engelhardt JF. 1999. Concatamerization of adeno-associated virus circular genomes occurs through intermolecular recombination. J. Virol. 73: 9468–77 Young SM Jr, McCarty DM, Degtyareva N, Samulski RJ. 2000. Roles of 9 Oct 2004 0:15 AR AR230-GE38-27.tex AR230-GE38-27.sgm LaTeX2e(2002/01/18) P1: IKH AAV INTEGRATION Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. adeno-associated virus Rep protein and human chromosome 19 in site-specific recombination. J. Virol. 74:3953–66 108. Young SM Jr, Samulski RJ. 2001. Adenoassociated virus (AAV) site-specific recombination does not require a Repdependent origin of replication within the 845 AAV terminal repeat. Proc. Natl. Acad. Sci. USA 98:13525–30 109. Yue Y, Duan D. 2003. Double strand interaction is the predominant pathway for intermolecular recombination of adeno-associated viral genomes. Virology 313:1–7 P1: JRX October 18, 2004 14:55 Annual Reviews AR230-FM Annual Review of Genetics Volume 38, 2004 Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. CONTENTS MOBILE GROUP II INTRONS, Alan M. Lambowitz and Steven Zimmerly THE GENETICS OF MAIZE EVOLUTION, John Doebley GENETIC CONTROL OF RETROVIRUS SUSCEPTIBILITY IN MAMMALIAN CELLS, Stephen P. Goff LIGHT SIGNAL TRANSDUCTION IN HIGHER PLANTS, Meng Chen, Joanne Chory, and Christian Fankhauser 1 37 61 87 CHLAMYDOMONAS REINHARDTII IN THE LANDSCAPE OF PIGMENTS, Arthur R. Grossman, Martin Lohr, and Chung Soon Im THE GENETICS OF GEOCHEMISTRY, Laura R. Croal, Jeffrey A. Gralnick, Davin Malasarn, and Dianne K. Newman CLOSING MITOSIS: THE FUNCTIONS OF THE CDC14 PHOSPHATASE AND ITS REGULATION, Frank Stegmeier and Angelika Amon RECOMBINATION PROTEINS IN YEAST, Berit Olsen Krogh and Lorraine S. Symington 119 175 203 233 DEVELOPMENTAL GENE AMPLIFICATION AND ORIGIN REGULATION, John Tower 273 THE FUNCTION OF NUCLEAR ARCHITECTURE: A GENETIC APPROACH, Angela Taddei, Florence Hediger, Frank R. Neumann, and Susan M. Gasser 305 GENETIC MODELS IN PATHOGENESIS, Elizabeth Pradel and Jonathan J. Ewbank 347 MELANOCYTES AND THE MICROPHTHALMIA TRANSCRIPTION FACTOR NETWORK, Eirı́kur Steingrı́msson, Neal G. Copeland, and Nancy A. Jenkins EPIGENETIC REGULATION OF CELLULAR MEMORY BY THE POLYCOMB AND TRITHORAX GROUP PROTEINS, Leonie Ringrose and Renato Paro REPAIR AND GENETIC CONSEQUENCES OF ENDOGENOUS DNA BASE DAMAGE IN MAMMALIAN CELLS, Deborah E. Barnes and Tomas Lindahl 365 413 445 MITOCHONDRIA OF PROTISTS, Michael W. Gray, B. Franz Lang, and Gertraud Burger 477 v P1: JRX October 18, 2004 vi 14:55 Annual Reviews AR230-FM CONTENTS METAGENOMICS: GENOMIC ANALYSIS OF MICROBIAL COMMUNITIES, Christian S. Riesenfeld, Patrick D. Schloss, and Jo Handelsman 525 GENOMIC IMPRINTING AND KINSHIP: HOW GOOD IS THE EVIDENCE?, David Haig 553 MECHANISMS OF PATTERN FORMATION IN PLANT EMBRYOGENESIS, Viola Willemsen and Ben Scheres Annu. Rev. Genet. 2004.38:819-845. Downloaded from arjournals.annualreviews.org by Medical Library of the Chinese PLA on 02/25/08. For personal use only. DUPLICATION AND DIVERGENCE: THE EVOLUTION OF NEW GENES AND OLD IDEAS, John S. Taylor and Jeroen Raes GENETIC ANALYSES FROM ANCIENT DNA, Svante Pääbo, Hendrik Poinar, David Serre, Viviane Jaenicke-Despres, Juliane Hebler, Nadin Rohland, Melanie Kuch, Johannes Krause, Linda Vigilant, and Michael Hofreiter 587 615 645 PRION GENETICS: NEW RULES FOR A NEW KIND OF GENE, Reed B. Wickner, Herman K. Edskes, Eric D. Ross, Michael M. Pierce, Ulrich Baxa, Andreas Brachmann, and Frank Shewmaker 681 PROTEOLYSIS AS A REGULATORY MECHANISM, Michael Ehrmann and Tim Clausen 709 MECHANISMS OF MAP KINASE SIGNALING SPECIFICITY IN SACCHAROMYCES CEREVISIAE, Monica A. Schwartz and Hiten D. Madhani 725 rRNA TRANSCRIPTION IN ESCHERICHIA COLI, Brian J. Paul, Wilma Ross, Tamas Gaal, and Richard L. Gourse 749 COMPARATIVE GENOMIC STRUCTURE OF PROKARYOTES, Stephen D. Bentley and Julian Parkhill 771 SPECIES SPECIFICITY IN POLLEN-PISTIL INTERACTIONS, Robert Swanson, Anna F. Edlund, and Daphne Preuss 793 INTEGRATION OF ADENO-ASSOCIATED VIRUS (AAV) AND RECOMBINANT AAV VECTORS, Douglas M. McCarty, Samuel M. Young Jr., and Richard J. Samulski 819 INDEXES Subject Index ERRATA An online log of corrections to Annual Review of Genetics chapters may be found at http://genet.annualreviews.org/errata.shtml 847