Create a microarray design from target transcripts (Wizard) |
Use this wizard to design microarrays with probes created by GE Probe Design. GE Probe Design creates expression type probes based on target sequence data. To specify this target sequence data, you upload either a FASTA format file of target sequences, or a TDT file of GenBank accessions. This wizard is available for the Expression application type.
Note: To create an Exon array, you must use a different wizard. See Create design from existing probe groups (Wizard). To include probes from GE Probe Design in an Exon array, first use GE Probe Design to separately create a probe group, then include this probe group in an Exon array.
The GE Wizard leads you through six steps. Click one of these links to jump to a specific section of this help topic:
Step 3 – Define Target File and Transcriptome
Step 6 – Create Microarray Design
You must be an eArray registered user, with a valid login name and password.
You must have either a FASTA format file that contains your cDNA target sequence data, or a .txt or .tdt file that contains a list of GenBank accessions. For GenBank accessions, the file must contain the accession numbers separated by new line (return) characters. The file names must not contain any spaces. Create a compressed (*.zip format) file for upload purposes.
(Optional) Prepare an optional FASTA format file that contains custom transcriptome data for use as a similarity database in the probe design process. Create a compressed (*.zip format) file for upload purposes.
Note: A custom transcriptome data file is optional. Agilent strongly encourages you to use one of the available Agilent-constructed species transcriptome databases as a similarity database. You can also use your target sequence data for this purpose.
Click the Workspace tab. This wizard is not available in a collaboration.
Set the application type to Expression.
Click
Home.
The workspace home page appears.
In the Design Wizards pane, select Create a Microarray Design from Target Transcripts.
Click
Next.
The wizard opens in a new window.
In this step, you specify the basic methodology eArray uses to create probes from your target sequences.
Select one of these probe design methods:
Method |
Description |
Base Composition Methodology |
Creates probes that are all the same length, based on an empirically-determined optimal base composition profile. It is the standard GE probe design methodology. This method works best with Agilent protocols and most eukaryotic organisms. |
Tm Matching Methodology |
Creates probes to have similar predicted Tm. This is a good option when you design probes for prokaryotic organisms. |
Click
Next.
The next step of the wizard appears.
In this step you specify the details regarding the probes that eArray generates.
Specify the following parameters in the Probe Details pane. Some of these parameters may not be available for your chosen design methodology.
Parameter |
Instructions/Details |
Design Job Name |
Type a name that will allow you to later identify this specific GE probe design job. |
Probe Length |
Type the maximum length for the generated probes. The allowable length is from 25 to 60 bp. Agilent recommends a probe length of 60 bp, as this has been determined to provide the optimal balance between sensitivity and specificity for most applications on the Agilent in situ microarray platform. |
Probes per target |
Select from 1 to 10 probes per target from the drop-down list. This is the maximum number of probes the probe design process returns for each uploaded target sequence. The probe design process may return fewer probes than you specify. Because of the length and high quality of the generated probes, Agilent recommends that you create one probe per target sequence. However, if you design multiple probes per target sequence, you will have the opportunity to select the best probe after a validation process. |
Masking |
eArray always uses both of these options in the probe design process, and they cannot be disabled. Vector – Identifies and ignores contaminant segments during probe design. Target sequences can contain contaminant segments not actually found in the sample under study. These segments are often artifacts from cloning vectors (e.g. plasmid, phage, BAC, YAC) used in cloning and amplification processes. Repeat – Identifies and ignores repetitive sequences within your target sequences during probe design. The genome of any given organism contains interspersed repeats and low complexity DNA sequences. These sequences, which are unique at a species level, are replicated many times throughout the genome, and are found in the transcriptome as well. Replicate regions are poor candidates for unique probes. |
Probe Orientation |
Select one of these options: Sense – Produces probes in the sense or "coding strand" orientation, similar in sequence to the mRNA targets. Use this option if the sample preparation methodology yields cDNA or cRNA molecules. Antisense – Select this option if you want probes in antisense or "template" orientation, complementary in sequence to the mRNA targets. This is the best option if your samples are directly labeled RNA. |
Design Options |
Select your preferred design process. These options are only meaningful if you request more than one probe per target sequence. Best Probe Methodology – If there are multiple probes for each target sequence, the probe design process favors production of the highest quality probes, rather than even coverage of each target sequence. Best Distribution Methodology – The probe design process favors even coverage of each target sequence, rather than production of the highest quality probes. |
Design with 3' bias |
Mark this option if you want probes derived mainly from the first 1,000 bp from the 3' end of each of your target sequences. If you use an Agilent (or other) labeling protocol that uses linear amplification, it is important to select probes from the 3 end of the sequence. Linear amplification generates sequences that are shorter than the initial template due to the attenuation of the polymerase reaction. Because of this, most of the labeled product represents only the first 1,000 bp from the 3' end of each target sequence. It is important to design probes that represent this region. |
Allow Probes to be Trimmed |
(Available only for Tm-Matching methodology.) Mark this option to allow bases to be removed from candidate probes to increase compliance with the Preferred Probe Tm. eArray will not trim probes to shorter than 45 bases. In concept, a shorter probe has less complementary sequence available, which can reduce its specificity, or infringe on its ability to form a stable duplex with the desired target. However, the risk of this occurring to a significant extent is very low. |
Preferred Probe Tm |
(Available only for Tm-Matching methodology.) Type the target Tm for the probe design process. The Tm is the temperature at which equal populations of a probe and its target sequence would exist as a 50:50 mixture of duplex and single-stranded forms. Select a probe Tm based on two factors:
In practice, the target Tm should be ~20°C higher than the hybridization temperature. For example, if the hybridization temperature is 60°C, then the target probe Tm should be 80°C. |
Click
Next.
The next page of the wizard appears.
In this step, you select target and optional transcriptome files to upload, and submit a GE probe design job. File names should not contain spaces.
In Target File Details, enter the following:
Species – Select the desired species. Species is required.
Select one of these options:
Upload Target File in FASTA format – Click Browse. Select the desired FASTA format target sequence file, then click Open. The location of your file appears in the box. Prepare a compressed (*.zip format) version of the file for upload purposes.
Upload Target File as GenBank Accessions – Click Browse. Select the desired file of GenBank accessions, then click Open. The location of the file appears in the text box. eArray resolves the GenBank accessions in the file to actual sequence data before it starts the probe design process. Prepare a compressed (*.zip format) version of the file for upload purposes.
In Transcriptome Details, select the desired type of transcriptome data for the probe design process. The probe design process uses transcriptome data as a similarity database to eliminate potential probe sequences that would have significant cross-hybridization with targets other than the one of interest.
Use Target File as Transcriptome – Uses the file you specified in Upload Target File as the species transcriptome database. This option works for uploaded target files containing either actual sequence data, or GenBank accessions.
Select Agilent-Provided Transcriptome (by Species) – Uses one of Agilent's available species transcriptome databases. Agilent recommends that you select this option, as these databases have been specifically constructed for use in GE probe design. Select the desired species.
Note: The species that you select here overrides the species that you chose previously.
Upload Transcriptome File – Uses a transcriptome file that you provide as the species transcriptome database. To use this option, click Browse. Select the desired transcriptome file, then click Open. The location of the file appears in the box. Prepare a compressed (*.zip format) version of the file for upload purposes.
Click
Next.
A message indicates that you have submitted your design job to eArray,
and that you will receive an e-mail when the job is completed.
Click Close.
Note: GE probe design is a computation-intensive process. It can take several days for your design job to finish, depending on the size of your files and the number of jobs ahead of yours in the queue.
eArray pauses the wizard. On the workspace or collaboration
home page, the job appears in the Search Results area of the Design Wizards
pane with a status of GE Design Submit. Click Refresh
to update the pane to its most recent version.
When the job is complete, you receive an e-mail from Agilent, the status
of the job changes to Probes Uploaded, and a Continue link appears in
the Action column.
When your probe design job is complete, you can return to the wizard. In this step of the wizard, you define general attributes of the microarray design or set, such as its name, format, and location. Also, if you want to add linkers to "stilt" the active hybridizing sequences of probes further off the glass microarray substrate, you specify the details in this step.
Set the application type to Expression, if needed.
On the
applicable workspace home page, in the Search Results section of the
Design Wizards pane, next to the desired wizard, click Continue.
If the desired wizard is not visible, you may need to click View All to see the full list.
The wizard opens in a new window. It shows that you have progressed
to Step 4 – Define
Design.
Specify the following parameters. All are required unless otherwise noted.
Parameter |
Instructions/Details |
Define Design |
|
Microarray Name |
Type a name for the microarray design. eArray uses this name as one of the search keys for microarray designs, and as a way to refer to the design in search results, lists, and the like. |
Design Format |
Select a design format. For details about the selected design format, click Show Details. When you select or change the design format, an appropriate control grid appears in Control Grid. |
Control Grid |
The name of the control grid appears as a link. Click the link to view details about the control grid. eArray automatically selects a control grid appropriate to your design format, application type, and, for CGH and ChIP applications, species. If appears in Control Grid, you can choose an alternate control grid. To choose an alternate control grid:
|
Folder |
Select a location for your new microarray design. The folders to which you have access appear in the list. |
Description |
(Optional) Type a brief description for the design. |
Species |
(Read-only) Reflects the species (if any) that you selected in the previous step of the wizard. |
Keywords |
(Optional) Type search keywords to associate with the design, separated by commas. Keywords can help you search for the microarray design later. In addition, you |
Attachment |
(Optional) Attachments are files or links that are related your microarray design. Attached files should not exceed 32 Mb in size. To add one or more attachments, follow these steps:
Name – Type a name for the attachment. This name becomes the link you click to access the attachment. Type – Select the desired type of attachment from the list. An attachment can be either a file or a URL. Locale – Select the appropriate locale from the list. eArray uses this option to select content that has been localized for your region, when it is available. File – (Available only if you selected File for the type of attachment) Click Browse. In the dialog box that appears, select the desired attachment, then click Open. URL – (Available only if you selected URL for the type of attachment) Type the full URL of the Internet resource, including the protocol specifier. For example, http://www.agilent.com.
|
Comments |
(Optional) Type comments to include with the microarray design. |
Linker Details |
|
Append linker to 3' end |
Mark this check box to add linker sequences to your probes. Linkers are nucleic acid molecules that are added to the 3' ends of probes. They move the "active" (hybridizing) sequence farther from the glass microarray substrate. This decreases steric hindrance and makes the sequence more available for hybridization. Linker sequences themselves are designed to avoid hybridization to any sequence in the target sample. Agilent provides a default linker sequence that you can use, or you can specify your own custom linker sequence. If you mark this option, also set the Linker Length and Linker Sequence parameters. |
Linker length |
(Available if you select Append linker to 3' end) Select one of these options:
|
Linker Sequence |
(Available if you select Append linker to 3' end) Select one of these options:
|
Click
Next.
The next step of the wizard, Layout
probes, appears.
In this step of the wizard, you can add probe group(s) to your microarray design/set.
Under Select Probes and Layout Options, specify the probes that you want to include in your microarray design/set using the tasks described in the table below. You may need to scroll down to see some of these options.
Task |
Instructions/Details |
You can add or remove a biological or user control probe group from your microarray design. To add a biological or user control probe group
To remove a biological or user control probe group
|
|
(CGH and Expression arrays only) A replicate probe group is a special control probe group that Feature Extraction and DNA Analytics can use to calculate the QC metric Reproducibility. These replicate probe groups are distinct from the user probe groups that you can include in designs in multiple copies. When you create an array for certain species and design formats, eArray automatically adds a default Agilent replicate probe group. To add a replicate probe group
To remove a replicate probe group
|
|
(When available) Under Probe Group Details, next to the applicable probe group, in the Control Type column, select an option. Positive or negative user control probe groups must collectively occupy 50% or fewer of the available features in your design/set.
|
|
View a probe group |
In any available Probe
Group Name column, click the name of the probe group that
you want to view. |
Change number of copies of probe group |
(Available for user non-control probe groups and for replicate probe groups.) In the Replicate column, next to the desired probe group, type the number of copies of the probe group that you want to include in the microarray design. |
Allow eArray to create a microarray set |
Mark Enable Microarray Set. If you select this option, and the number of features required to accommodate your microarray design exceeds the number of features available on one microarray slide, eArray adds as many additional slides as are needed. The Percentage Filled statistic in the Microarray Statistics pane can help you to monitor how full your microarray is. If the percentage filled exceeds 100%, you must mark Enable Microarray Set to accommodate all the probes of your design, or choose a different design format with more feature locations. For positive or negative user control probe groups in a microarray set, you must also select a probe distribution option.
|
Change distribution of positive and negative control probe groups in a microarray set |
(Available for microarray sets that have probe groups with a control type of pos or neg) eArray always puts a copy of each positive and negative user control probe group onto each microarray within a microarray set. Within each microarray in the set, the probes in the control probe groups are always assigned to random feature positions. However, you have some control over how probes are assigned to these randomly-selected feature positions from one array to another in the set.
|
Fill unused features |
You can fill the unused features of a microarray with the probe group of your choice. eArray may add only part of the probe group or more than one copy of some or all of the probe group, depending on the number of available empty features in your design, and the number of probes in the probe group. To add a whole probe group to your design, use the procedure described above in Add a user control or non-control probe group, instead. Follow these steps to fill unused features:
|
Note: When you use a Design Wizard to create a microarray design, only Randomized feature layout is available. To create a design with a Customer specified probe order, you must use another method to create the design.
Click
Next.
The next step of the wizard appears.
In this step, you save your design with a specific status.
In How do you want to save and create your Design?, select one of the following:
Draft – eArray saves the design with a status of Draft. Only you can edit the design.
Review – eArray saves the design with a status of Review. The users in your workgroup can edit the design and save versions of it.
Complete – eArray saves the design with a status of Complete. The design can be submitted to Agilent Manufacturing, and no one can edit the design.
Submit – eArray saves the design with a status of Submitted, and submits the design to Agilent Manufacturing. You can then request a quote for the design. If you select this option, you must also click Design Checklist, then read and mark all of the items that appear.
Click
Save.
eArray creates and saves your design. A success message appears.
Click
Close.
Another success message appears.
Click Close.
See also
About Gene Expression (GE) probe design
Get custom microarray design guidance