Boxwood (Buxus L. spp., Buxaceae) are popular woody landscape shrubs grown for their diverse forms and broad-leaved evergreen foliage (Batdorf, 2004). The genus contains approximately 90 species originating in Africa, Eurasia, the Caribbean, and Central America (Batdorf, 2004). Boxwood plants grown in temperate zones are increasingly threatened by a destructive new blight disease caused by the ascomycete fungus Calonectria pseudonaviculata Henricot (syn. Cylindrocladium pseudonaviculatum, Cylindrocladium buxicola). First identified from the United Kingdom in 1994, the disease has spread throughout continental Europe, parts of western Asia, and into North America. (Ivors et al., 2012; Elmhirst et al., 2013; Gehesquière et al., 2013; Malapi-Wight et al., 2014). To date, all tested cultivated Buxus taxa are affected by boxwood blight, although some taxa appear to be more susceptible to the fungus than others (Henricot et al., 2008; Douglas, 2012; Lamondia, 2014). There is an urgent need to develop blight-tolerant boxwood cultivars because of the impact this disease has on landscapes and commercial growers.
The National Boxwood Collection at the U.S. National Arboretum (USNA) contains more than 700 Buxus accessions, making it one of the most complete collections in the world and a valuable genetic resource for developing blight-tolerant varieties. However, genetic relationships and diversity among these accessions have not been determined. Although morphological features can be useful in determining phylogenetic relationships in Buxaceae (Carlquist, 1982; Köhler and Brückner, 1990), molecular markers are needed to distinguish among closely related accessions and to assess diversity. Van Laere et al. (2011) used amplified fragment length polymorphism (AFLP) markers to characterize and differentiate between European and Asian boxwood. In the current study, we developed and characterized 23 polymorphic genic simple sequence repeat (genic-SSR) markers to facilitate genetic diversity analysis of boxwood taxa from the National Boxwood Collection and elsewhere. Compared to AFLP markers, SSRs are multiallelic, codominant, transferable between related species, and can be used to reproducibly fingerprint organisms in different laboratories. Our objective was to generate a suite of polymorphic genic-SSRs from coding regions of the Buxus genome, as these markers may also be useful in analyzing the functional diversity in germplasm collections.
Table 1.
Characteristics of 23 polymorphic genic-SSRs developed for Buxus spp.
METHODS AND RESULTS
Total RNA was extracted from frozen leaf tissue of B. sempervirens L. ‘Vardar Valley’ (Appendix 1) using the QIAGEN RNeasy Plant Mini Kit (QIAGEN, Valencia, California, USA). RNA was quantified using the Qubit 2.0 Fluorometer (Invitrogen, Carlsbad, California, USA), and quality was evaluated using the QIAxcel capillary electrophoresis system (QIAGEN). cDNA libraries were constructed using the TruSeq RNA Sample Preparation LS kit (Illumina, San Diego, California, USA) following the manufacturer's protocol. Validated pooled cDNA libraries were prepared for sequencing following the Illumina protocol and sequenced using the MiSeq system on a 300-cycle MiSeq sequencing cartridge (Illumina). From a single sequencing run. 3,506,048 reads containing 0.5 Gbp of data with an average length of 140 bp per read were generated. Reads were trimmed of adapters and for quality, then assembled and mapped using the CLC Genomics Workbench version 6 software (CLC Bio, Boston, Massachusetts, USA). The analysis yielded 2,370,726 mapped paired-end reads with an average length of 164 bp, which were assembled into a partial transcriptome of 12,027 contigs (11,912,857 bp).
The partial transcriptome assembly was mined for microsatellites using the PrimerPro Perl pipeline ( http://webdocs.cs.ualberta.ca/~yifeng/primerpro/), which used the MISA algorithm (Coello Coello and Cortés, 2005) to detect tandem repeats of two to six nucleotides for at least five perfect repeat core motifs. A total of 845 SSR motifs were identified, including 469 dinucleotide, 360 trinucleotide, seven tetranucleotide, one pentanucleotide, and eight hexanucleotide repeats (sequences available from the authors). PCR primer pairs were designed using the Primer3 algorithm in the PrimerPro pipeline, with the following settings: primer length of 20 ± 2 nucleotides, GC content of 40–60%, and a PCR product size ranging from 100 to 300 bp. Trinucleotide motifs possessing unique PCR priming sites within the genome, as determined by BLASTN searches using PrimerPro, were evaluated visually for heterozygosity and mutation consistent with stepwise evolution. A total of 71 candidate markers were selected for testing from the trinucleotide SSR sites meeting these in silico criteria. PCR primers were manufactured by Integrated DNA Technologies (Coralville, Iowa, USA). The forward primers had an additional M13(−21) universal sequence (TGTAAAACGACGGCCAGT) attached to the 5′ end to allow indirect fluorescent labeling of PCR products using just one universal FAM (6-carboxy-fluorescine)–labeled M13 primer (Schuelke, 2000). These 71 primer pairs were used to amplify SSR loci in 18 boxwood accessions representing diverse species and cultivars (Appendix 1). Twenty-three of these primer pairs proved to be polymorphic and resulted in expected amplification profiles (Table 1). In addition, eight primer pairs amplified monomorphic loci, 33 primer pairs amplified multiple regions or an unexpected size product, and seven primer pairs did not amplify a product at all (data not shown).
Genomic DNA was extracted from frozen leaf tissue of 18 boxwood accessions using the QIAGEN DNeasy Plant Mini Kit and quantified using the NanoDrop 1000 Spectrophotometer (Thermo Fisher Scientific, Wilmington, Delaware, USA). PCR was carried out in a Bio-Rad iCycler (Bio-Rad Laboratories, Hercules, California, USA). The 20-µL PCR reaction mixture contained 10 ng of template genomic DNA, 0.25 µM of each reverse and universal FAM-labeled M13(−21) primer, and 0.0625 µM of the forward primer with 1× Bioline MangoMix and 2.5 mM Bioline MgCl2 (Bioline, Taunton, Massachusetts, USA). PCR profiles consisted of initial denaturation at 94°C for 5 min; followed by 30 cycles of 94°C for 30 s, optimized annealing temperature of each primer pair (Table 1) for 45 s, and 72°C for 45 s; followed by eight cycles of 94°C for 30 s, 53°C for 45 s, and 72°C for 45 s; and a final extension at 72°C for 10 min. Products were analyzed on an ABI 3730xl DNA Analyzer (Applied Biosystems, Foster City, California, USA) using 1 µL of PCR product, 10 µL of formamide (Applied Biosystems), and 0.3 µL of GeneScan 500 LIZ Size Standard (Applied Biosystems). Allele sizes and number of alleles per locus were determined using GeneMarker version 2.6.3 (SoftGenetics, State College, Pennsylvania, USA). Number of alleles per locus ranged from two to 10 with a mean of 4.34 (Table 1). The boxwood population we used included 10 diploids, four triploids, two tetraploids, and two mixoploids, as determined by flow cytometry in our laboratory (Appendix 1). Thus, we cannot report expected heterozygosities (He) for this population unless segregation analysis was performed to confirm dosage patterns of alleles for each locus (Dufresne et al., 2014). Instead, we treated the 10 diploids as one population and calculated the observed heterozygosity (Ho) and He using GenAlEx software (version 6.5; Peakall and Smouse, 2012) (Table 1). Excluding the monomorphic locus BSVV64, Ho and He ranged from 0.000 to 1.000 and 0.185 to 0.840 with means of 0.377 and 0.495, respectively. The contig sequences of the 23 SSR loci were subjected to a BLAST search against the National Center for Biotechnology Information (NCBI) nonredundant protein database using the BLASTX program to identify putative functions. With a threshold E-value of 1.0E-6, all 23 SSR sequences shared homology to protein sequences from dicots from diverse families (Table 1).
LITERATURE CITED
Notes
[1] This project was supported in part by funds from the Floral and Nursery Research Initiative administered through the United States Department of Agriculture, Agricultural Research Service (USDA-ARS), and the 2013 USDA Farm Bill. Mention of commercial products in this publication is solely for the purpose of providing specific information and does not imply recommendation or endorsement by the USDA. The authors thank Dr. Dapeng Zhang, Dr. Yazmin Rivera, and Dr. Catalina Salgado-Salazar (USDA-ARS Beltsville) for helpful advice on calculating simple sequence repeat heterozygosity in polyploid populations.