Create baits by Bait Tiling (SureSelect Target Enrichment) |
|
This topic describes the Bait Tiling tool that is available within the SureSelect Target Enrichment application type. Another topic describes the Bait Tiling tool that is available within the SureSelect RNA Enrichment application type—see Bait Tiling (SureSelect RNA Enrichment).
Bait Tiling creates baits that evenly cover specified genomic regions of a given species. You can customize the tiling process for specific sequencing technologies and protocols, as well as select the density of tiling. Additional details about bait tiling appear at the end of this topic under Additional Bait Tiling details. If you like, you can use a wizard to guide you through the process of designing and submitting a bait library with Bait Tiling—see the Bait Tiling wizard (SureSelect Target Enrichment).
Note:
•
Consider using Interval
Finder or Exon
Interval Finder to retrieve a list of genomic intervals to use
as input for the Bait Tiling process. These tools find intervals of
interest based on annotations that you enter.
Click the Workspace tab or enter a collaboration.
Set the application type to SureSelect Target Enrichment.
Click Baits
> Bait Tiling.
The Bait Tiling page appears.
Set the following parameters. All are required.
Parameter |
Instructions/Details |
Job Info |
|
Job Name |
Type a name that will later enable you to identify this Bait Tiling job. |
Sequencing Technology |
From the list, select the type of DNA sequencer that you will use to analyze your target DNA fragments. |
Sequencing Protocol |
Select the desired protocol. In the list, eArray displays the available protocols for your chosen DNA sequencing technology. |
Design Options |
|
Use Optimized Parameters |
In general, mark Use Optimized Parameters. With this option, based on your selected sequencing technology and protocol, eArray automatically sets the best values for Design Strategy, Bait Length, Bait Tiling Frequency, and Allowed Overlap into Avoided Regions. |
Design Strategy |
(Available if you clear Use Optimized Parameters.) In Design Strategy, select one of these options:
|
Bait Length |
(Available if you clear Use Optimized Parameters.) Currently, eArray supports a bait length of 120 nucleotides.
|
Bait Tiling Frequency |
(Available if you clear Use Optimized Parameters.) Select the desired tiling frequency. This setting controls the density of tiling. The density can also be affected by your choice of Design Strategy. Availability of options for tiling density varies by sequencing technology.
|
Allowed Overlap into Avoided Regions
|
(Available if you clear Use Optimized Parameters.) To change the value, select it, then type the desired distance in base pairs. eArray generates baits that may extend by this distance into the regions specified in Genomic Avoid Intervals (see below). For best results, set this distance to a maximum of 20 bp. |
Strand |
Select one of these options:
|
Target Details |
|
Species |
Select the desired species. This list contains the names of all species currently supported in eArray for Bait Tiling. |
Genomic Target Intervals |
You can enter the desired target intervals, or upload them as a file. To enter target intervals
To upload intervals as a text or BED format file
|
Extend Interval Boundaries 3' and 5' |
eArray can tile baits into the regions that flank your entered genomic target intervals.
|
Avoid Standard Repeat Masked Regions |
By default, eArray excludes a standard set of repetitive genomic regions from the Bait Tiling process. These are regions of the genome that generally produce poor quality baits. If you want to include repeat regions in the Bait Tiling process, clear this option. |
Avoid Defined Custom Intervals |
eArray can exclude from the Bait Tiling process genomic intervals that you define. You can either enter the intervals, or upload a text file that contains them. To enter custom exclusion intervals
To upload a text file that contains custom exclusion intervals
|
Click Submit.
eArray submits your Bait Tiling job to Agilent for processing. A message
tells you that you will receive an e-mail when the job is completed.
Click Close.
The Bait Tiling page reappears. Your Bait Tiling job appears in the
Search Results pane at the bottom of the page, where you can monitor
its status. The list of jobs does not automatically update once it
appears. Click Refresh to
update the list to its most current form.
Status designations and what they mean
Status |
Description |
In Queue |
The job has been submitted to Agilent, but no action has been taken on it yet. |
Designing |
The job has been submitted to Agilent, and processing has begun. |
Completed |
The job has been completed by Agilent, and the results are available for your use. |
ERROR |
The job has been submitted to Agilent, but there are problems. You must resubmit your job. eArray sends you an e-mail informing you of the error. This e-mail can contain additional information about the error. |
It can take up to one day or more for eArray to finish your Bait Tiling job, depending on the number of jobs ahead of yours in the queue, and the size of your uploaded file. When eArray completes your Bait Tiling job, you receive an e-mail, and the status of the job changes to Completed. You can then view or download the results, and you can also create a new bait group from them.
What design guidance can Agilent offer regarding target enrichment libraries?
When you generate custom SureSelect Target Enrichment Bait Libraries:
In Bait Tiling, avoid repeat regions with fairly high stringency (at most 20 bp overlap).
Follow the "Optimized" bait design strategy (Mark the Use Optimized Parameters check box). These parameters work well under standard conditions and are optimized for each given sequencing technology.
How much genomic space can be targeted?
Agilent can create libraries with up to 57,750 baits per kit. The default bait length is 120 nucleotides, and the default tiling frequency is 2X. Given these parameters, you can target up to 3.3 Mb. As you increase the tiling frequency, this limit decreases. You can also create a library that has more than 57,750 baits, which causes eArray to create a library set, which distributes the requested baits over multiple libraries.
What is the largest region that can be targeted?
Using a single library, assuming 120-mer baits and 2X tiling, the longest single region that can be captured is 3.3 Mb. However, you can identify regions for eArray to avoid in the tiling process. These regions can include the default standard set of repeat regions, as well as custom regions that you enter. This can increase the size of the region that can be captured. You can also let eArray create a library set, which can accommodate many more baits over multiple libraries. Library sets give you additional capture capacity.
Why is 2X tiling helpful?
Baits tiled at a density of 2X cover target regions in a staggered manner. A target sequence that is represented at the end of a given bait is represented toward the middle of the alternate bait that covers the same region. For sequencing technologies that use end-sequencing, such as the Illumina technology, 2X tiling can help to sequence regions in the middle of a DNA segment more effectively.
If I have not reached the capacity of the library, should I increase the tiling frequency?
Yes. In general, the more tiling the better.
At a bait length setting of 120 nucleotides, what is the active capture size?
120 nucleotides.
Is DNA sequencing potential driven by bait length or by shear size?
Shear size. If a 120-mer bait pulls down a 500 bp DNA segment, the entire 500 bp segment is available for sequencing, without regard to the part that aligns with the 120-mer bait.
See Also
Overview of working with baits
SureSelect Target Enrichment libraries