Create a microarray design from target transcripts (Wizard)

Use this wizard to design microarrays with probes created by GE Probe Design. GE Probe Design creates expression type probes based on target sequence data. To specify this target sequence data, you upload either a FASTA format file of target sequences, or a TDT file of GenBank accessions. This wizard is available for the Expression application type.

Note: To create an Exon array, you must use a different wizard. See Create design from existing probe groups (Wizard). To include probes from GE Probe Design in an Exon array, first use GE Probe Design to separately create a probe group, then include this probe group in an Exon array.

The GE Wizard leads you through six steps. Click one of these links to jump to a specific section of this help topic:

Before you use the wizard

To start the wizard

Step 1 – Select Method

Step 2 – Select Parameters

Step 3 – Define Target File and Transcriptome

Step 4 – Define Design

Step 5 – Layout Probes

Step 6 – Create Microarray Design

Before you use the wizard

You must be an eArray registered user, with a valid login name and password.
You must have either a FASTA format file that contains your cDNA target sequence data, or a .txt or .tdt file that contains a list of GenBank accessions. For GenBank accessions, the file must contain the accession numbers separated by new line (return) characters. The file names must not contain any spaces. Create a compressed (*.zip format) file for upload purposes.
(Optional) Prepare an optional FASTA format file that contains custom transcriptome data for use as a similarity database in the probe design process. Create a compressed (*.zip format) file for upload purposes.
- Note: A custom transcriptome data file is optional. Agilent strongly encourages you to use one of the available Agilent-constructed species transcriptome databases as a similarity database. You can also use your target sequence data for this purpose.

To start the wizard

Click the Workspace tab. This wizard is not available in a collaboration.
Set the application type to Expression.
Click Home.

The workspace home page appears.
In the Design Wizards pane, select Create a Microarray Design from Target Transcripts.
Click Next.

The wizard opens in a new window.

Step 1 – Select Method

In this step, you specify the basic methodology eArray uses to create probes from your target sequences.

Select one of these probe design methods:

Method	Description
Base Composition Methodology	Creates probes that are all the same length, based on an empirically-determined optimal base composition profile. It is the standard GE probe design methodology. This method works best with Agilent protocols and most eukaryotic organisms.
Tm Matching Methodology	Creates probes to have similar predicted Tm. This is a good option when you design probes for prokaryotic organisms.

Click Next.

The next step of the wizard appears.

Step 2 – Select Parameters

In this step you specify the details regarding the probes that eArray generates.

Specify the following parameters in the Probe Details pane. Some of these parameters may not be available for your chosen design methodology.

Parameter	Instructions/Details
Design Job Name	Type a name that will allow you to later identify this specific GE probe design job.
Probe Length	Type the maximum length for the generated probes. The allowable length is from 25 to 60 bp. Agilent recommends a probe length of 60 bp, as this has been determined to provide the optimal balance between sensitivity and specificity for most applications on the Agilent in situ microarray platform.
Probes per target	Select from 1 to 10 probes per target from the drop-down list. This is the maximum number of probes the probe design process returns for each uploaded target sequence. The probe design process may return fewer probes than you specify. Because of the length and high quality of the generated probes, Agilent recommends that you create one probe per target sequence. However, if you design multiple probes per target sequence, you will have the opportunity to select the best probe after a validation process.
Masking	eArray always uses both of these options in the probe design process, and they cannot be disabled. Vector – Identifies and ignores contaminant segments during probe design. Target sequences can contain contaminant segments not actually found in the sample under study. These segments are often artifacts from cloning vectors (e.g. plasmid, phage, BAC, YAC) used in cloning and amplification processes. Repeat – Identifies and ignores repetitive sequences within your target sequences during probe design. The genome of any given organism contains interspersed repeats and low complexity DNA sequences. These sequences, which are unique at a species level, are replicated many times throughout the genome, and are found in the transcriptome as well. Replicate regions are poor candidates for unique probes.
Probe Orientation	Select one of these options: Sense – Produces probes in the sense or "coding strand" orientation, similar in sequence to the mRNA targets. Use this option if the sample preparation methodology yields cDNA or cRNA molecules. Antisense – Select this option if you want probes in antisense or "template" orientation, complementary in sequence to the mRNA targets. This is the best option if your samples are directly labeled RNA.
Design Options	Select your preferred design process. These options are only meaningful if you request more than one probe per target sequence. Best Probe Methodology – If there are multiple probes for each target sequence, the probe design process favors production of the highest quality probes, rather than even coverage of each target sequence. Best Distribution Methodology – The probe design process favors even coverage of each target sequence, rather than production of the highest quality probes.
Design with 3' bias	Mark this option if you want probes derived mainly from the first 1,000 bp from the 3' end of each of your target sequences. If you use an Agilent (or other) labeling protocol that uses linear amplification, it is important to select probes from the 3 end of the sequence. Linear amplification generates sequences that are shorter than the initial template due to the attenuation of the polymerase reaction. Because of this, most of the labeled product represents only the first 1,000 bp from the 3' end of each target sequence. It is important to design probes that represent this region.
Allow Probes to be Trimmed	(Available only for Tm-Matching methodology.) Mark this option to allow bases to be removed from candidate probes to increase compliance with the Preferred Probe Tm. eArray will not trim probes to shorter than 45 bases. In concept, a shorter probe has less complementary sequence available, which can reduce its specificity, or infringe on its ability to form a stable duplex with the desired target. However, the risk of this occurring to a significant extent is very low.
Preferred Probe Tm	(Available only for Tm-Matching methodology.) Type the target Tm for the probe design process. The Tm is the temperature at which equal populations of a probe and its target sequence would exist as a 50:50 mixture of duplex and single-stranded forms. Select a probe Tm based on two factors: The mean and standard deviation of the Tm of all potential probes that could be generated for the target transcriptome. The hybridization temperature identified in the hybridization protocol. In practice, the target Tm should be ~20°C higher than the hybridization temperature. For example, if the hybridization temperature is 60°C, then the target probe Tm should be 80°C.

Click Next.

The next page of the wizard appears.

Step 3 – Define Target File and Transcriptome

In this step, you select target and optional transcriptome files to upload, and submit a GE probe design job. File names should not contain spaces.

In Target File Details, enter the following:

- Species – Select the desired species. Species is required.
- Select one of these options:

Upload Target File in FASTA format – Click Browse. Select the desired FASTA format target sequence file, then click Open. The location of your file appears in the box. Prepare a compressed (*.zip format) version of the file for upload purposes.

Upload Target File as GenBank Accessions – Click Browse. Select the desired file of GenBank accessions, then click Open. The location of the file appears in the text box. eArray resolves the GenBank accessions in the file to actual sequence data before it starts the probe design process. Prepare a compressed (*.zip format) version of the file for upload purposes.

In Transcriptome Details, select the desired type of transcriptome data for the probe design process. The probe design process uses transcriptome data as a similarity database to eliminate potential probe sequences that would have significant cross-hybridization with targets other than the one of interest.
- Use Target File as Transcriptome – Uses the file you specified in Upload Target File as the species transcriptome database. This option works for uploaded target files containing either actual sequence data, or GenBank accessions.
- Select Agilent-Provided Transcriptome (by Species) – Uses one of Agilent's available species transcriptome databases. Agilent recommends that you select this option, as these databases have been specifically constructed for use in GE probe design. Select the desired species.
- - Note: The species that you select here overrides the species that you chose previously.
- Upload Transcriptome File – Uses a transcriptome file that you provide as the species transcriptome database. To use this option, click Browse. Select the desired transcriptome file, then click Open. The location of the file appears in the box. Prepare a compressed (*.zip format) version of the file for upload purposes.
Click Next.

A message indicates that you have submitted your design job to eArray, and that you will receive an e-mail when the job is completed.
Click Close.
- Note: GE probe design is a computation-intensive process. It can take several days for your design job to finish, depending on the size of your files and the number of jobs ahead of yours in the queue.

eArray pauses the wizard. On the workspace or collaboration home page, the job appears in the Search Results area of the Design Wizards pane with a status of GE Design Submit. Click Refresh to update the pane to its most recent version.

When the job is complete, you receive an e-mail from Agilent, the status of the job changes to Probes Uploaded, and a Continue link appears in the Action column.

Step 4 – Define Design

When your probe design job is complete, you can return to the wizard. In this step of the wizard, you define general attributes of the microarray design or set, such as its name, format, and location. Also, if you want to add linkers to "stilt" the active hybridizing sequences of probes further off the glass microarray substrate, you specify the details in this step.

Set the application type to Expression, if needed.

On the applicable workspace home page, in the Search Results section of the Design Wizards pane, next to the desired wizard, click Continue. If the desired wizard is not visible, you may need to click View All to see the full list.

The wizard opens in a new window. It shows that you have progressed to Step 4 – Define Design.

Specify the following parameters. All are required unless otherwise noted.

Parameter	Instructions/Details
Define Design
Microarray Name	Type a name for the microarray design. eArray uses this name as one of the search keys for microarray designs, and as a way to refer to the design in search results, lists, and the like.
Design Format	Select a design format. For details about the selected design format, click Show Details. When you select or change the design format, an appropriate control grid appears in Control Grid.
Control Grid	The name of the control grid appears as a link. Click the link to view details about the control grid. eArray automatically selects a control grid appropriate to your design format, application type, and, for CGH and ChIP applications, species. If appears in Control Grid, you can choose an alternate control grid. To choose an alternate control grid: In Control Grid, click . A list of available control grids appears in a new window. Select the desired control grid, then click Done.
Folder	Select a location for your new microarray design. The folders to which you have access appear in the list.
Description	(Optional) Type a brief description for the design.
Species	(Read-only) Reflects the species (if any) that you selected in the previous step of the wizard.
Keywords	(Optional) Type search keywords to associate with the design, separated by commas. Keywords can help you search for the microarray design later. In addition, you
Attachment	(Optional) Attachments are files or links that are related your microarray design. Attached files should not exceed 32 Mb in size. To add one or more attachments, follow these steps: Click . A page appears in a new window. In the Attach pane, specify the following: Name – Type a name for the attachment. This name becomes the link you click to access the attachment. Type – Select the desired type of attachment from the list. An attachment can be either a file or a URL. Locale – Select the appropriate locale from the list. eArray uses this option to select content that has been localized for your region, when it is available. File – (Available only if you selected File for the type of attachment) Click Browse. In the dialog box that appears, select the desired attachment, then click Open. URL – (Available only if you selected URL for the type of attachment) Type the full URL of the Internet resource, including the protocol specifier. For example, http://www.agilent.com. Click Add. The attachment appears in the Attachment Header List at the top of the window. A success message appears. Click Close. Add additional attachments, if desired. If you add an attachment in error, select the check box next to its name in the Attachment Header List, then click Remove. Click Done. The window closes. The names of your attachments appear as links in Attachment.
Comments	(Optional) Type comments to include with the microarray design.
Linker Details
Append linker to 3' end	Mark this check box to add linker sequences to your probes. Linkers are nucleic acid molecules that are added to the 3' ends of probes. They move the "active" (hybridizing) sequence farther from the glass microarray substrate. This decreases steric hindrance and makes the sequence more available for hybridization. Linker sequences themselves are designed to avoid hybridization to any sequence in the target sample. Agilent provides a default linker sequence that you can use, or you can specify your own custom linker sequence. If you mark this option, also set the Linker Length and Linker Sequence parameters.
Linker length	(Available if you select Append linker to 3' end) Select one of these options: Make probes of length – Adds nucleotides to the 3' ends of probes so that the resulting probes (active sequence plus linker) have the length specified. Type a number of nucleotides from 20 and 60 in the box. If an active probe sequence in your microarray design exceeds the length that you specify, eArray leaves it alone. It is not trimmed, and no linker is appended to it. Add linker of length – Adds the specified number of nucleotides to the 3' ends of probes. Type a number of nucleotides from 1 to 49. With this option, eArray can produce probes of up to 60 nucleotides in length. If the active probe sequences have different lengths, the resultant probes will also have different lengths after linkers are appended. If the linker sequence is shorter than the length that you specify, eArray replicates the linker sequence to fill in the length.
Linker Sequence	(Available if you select Append linker to 3' end) Select one of these options: Use Agilent linker sequence – You cannot edit the sequence of the Agilent-provided linker. Use Customer linker sequence – Type a DNA base sequence for the linker. Provide a linker sequence that does not hybridize to any sequences in the target sample. Note: eArray appends the Agilent linker sequence to Agilent probes, even if you select Use Customer linker sequence.

Click Next.

The next step of the wizard, Layout probes, appears.

Step 5 — Layout Probes

In this step of the wizard, you can add probe group(s) to your microarray design/set.

Under Select Probes and Layout Options, specify the probes that you want to include in your microarray design/set using the tasks described in the table below. You may need to scroll down to see some of these options.

Task	Instructions/Details
Add or remove a biological or user control probe group	You can add or remove a biological or user control probe group from your microarray design. To add a biological or user control probe group Click Biological Expression Probe Group Details, then click Add. You may need to scroll down to see this button. A probe group selection page appears in a new window. Select one ore more probe groups as described in Select probe groups for probe searches. To add more probe groups, click Add again. Select the desired control type for each probe group. See below, Change control type of probe group. To remove a biological or user control probe group Click Biological Expression Probe Group Details, then mark the check box next to the probe group that you want to remove. Click Remove. Note: Separate tasks below describe how to add or remove a replicate probe groups.
Add or remove a replicate probe group	(CGH and Expression arrays only) A replicate probe group is a special control probe group that Feature Extraction and DNA Analytics can use to calculate the QC metric Reproducibility. These replicate probe groups are distinct from the user probe groups that you can include in designs in multiple copies. When you create an array for certain species and design formats, eArray automatically adds a default Agilent replicate probe group. To add a replicate probe group Click Replicate Probe Group Details, then click Add. A probe group selection page appears. Select one or more probe groups as described in Select probe groups for probe searches. The selected probe group(s) appear in Probe Group name. In Replicate, type the number of copies of the probe group to be included in the design. Agilent recommends a value of 5. Note: When you add a replicate probe group to a microarray set, eArray places the specified number of copies of this probe group on every array within the set. To remove a replicate probe group Click Replicate Probe Group Details, then mark the check box next to the probe group that you want to remove. Click Remove.
Change control type of probe group	(When available) Under Probe Group Details, next to the applicable probe group, in the Control Type column, select an option. Positive or negative user control probe groups must collectively occupy 50% or fewer of the available features in your design/set. neg – Designates the probe group as a negative user control. Negative control groups are intended to have no hybridization. The control grid that is assigned automatically to each design contains an adequate number of negative controls. If you assign your own additional group of negative controls, these negative controls are used by Feature Extraction for background determination (whether or not they have only background signal). pos – Designates the probe group as a positive user control. Although positive controls are excluded from many of the statistical QC metrics in Feature Extraction, they are available for downstream analysis. Positive controls generally have predictable signals, but this is not a requirement. An example of positive controls present on the Agilent control grid is the Agilent spike-in probes, which are used in the gene expression application for calculating QC metrics following addition of spike-in controls to the sample. ignore – Omits the probe group from Feature Extraction analyses and output. Once an array design is submitted, the Control Types cannot be changed, so the only way to "re-activate" them, if desired, is to modify the ControlType field of the design file. biological – Designates that the probe group is not a control (condition = FALSE). It is the default choice for biological probes, which should comprise at least 50% of your design. Note: In a microarray set, if you change the control type of a probe group to either neg or pos, you must also select a probe distribution option.
View a probe group	In any available Probe Group Name column, click the name of the probe group that you want to view. The View Probe Group page appears in a new window. For details about this page, see View probe group.
Change number of copies of probe group	(Available for user non-control probe groups and for replicate probe groups.) In the Replicate column, next to the desired probe group, type the number of copies of the probe group that you want to include in the microarray design.
Allow eArray to create a microarray set	Mark Enable Microarray Set. If you select this option, and the number of features required to accommodate your microarray design exceeds the number of features available on one microarray slide, eArray adds as many additional slides as are needed. The Percentage Filled statistic in the Microarray Statistics pane can help you to monitor how full your microarray is. If the percentage filled exceeds 100%, you must mark Enable Microarray Set to accommodate all the probes of your design, or choose a different design format with more feature locations. For positive or negative user control probe groups in a microarray set, you must also select a probe distribution option. Note: Once you create an individual microarray design, you cannot subsequently convert it into a microarray set, even if you add additional probes and exceed the capacity of the selected design format. Similarly, once you create a microarray set, you cannot convert it into an individual microarray design.
Change distribution of positive and negative control probe groups in a microarray set	(Available for microarray sets that have probe groups with a control type of pos or neg) eArray always puts a copy of each positive and negative user control probe group onto each microarray within a microarray set. Within each microarray in the set, the probes in the control probe groups are always assigned to random feature positions. However, you have some control over how probes are assigned to these randomly-selected feature positions from one array to another in the set. Click Biological <type> Probe Group(s). In the Control Probe Positioning column, select one of these options for each positive or negative user control probe group: Variable Random – Each positive or negative control probe appears on each microarray in the set in a different position. Example: eArray assigns Probe A randomly to feature position 4330 in one microarray in the set. In another microarray in the set, it might assign Probe A to feature position 961. Fixed Random – A given positive or negative control probe appears on each microarray in the set in the same position. Example: eArray assigns Probe A randomly to feature position 4330 in one microarray in the set. In all of the other microarrays in the set, Probe A is also assigned to feature position 4330. Note: For all other types of probe groups, eArray always assigns them to feature positions in a specific way: • Probe groups with a control type of biological or ignore – For each replicate requested, eArray distributes a single copy of the probe group randomly over all of the microarrays in the set. • Probe groups in the Replicate Probe Group Details pane or in the Normalization Probe Group Details pane – When these type(s) of probe groups are included in a microarray set, eArray places a copy of each one on every microarray in the set using Variable Random positioning. • Agilent quality control grid – With the exception of the SureSelect Capture Array application type, a set of these probes appears on every microarray in the set. The placement of these probes is the same on every array, and is defined by the control grid specifications.
Fill unused features	You can fill the unused features of a microarray with the probe group of your choice. eArray may add only part of the probe group or more than one copy of some or all of the probe group, depending on the number of available empty features in your design, and the number of probes in the probe group. To add a whole probe group to your design, use the procedure described above in Add a user control or non-control probe group, instead. Follow these steps to fill unused features: Mark Fill Microarrays. In Probe Group To Fill Microarray, click . A probe group selection page appears in a new window. For instructions on how to use this page, see Select probe groups for probe searches.

Note: When you use a Design Wizard to create a microarray design, only Randomized feature layout is available. To create a design with a Customer specified probe order, you must use another method to create the design.

Click Next.

The next step of the wizard appears.

Step 6 – Create Microarray Design

In this step, you save your design with a specific status.

In How do you want to save and create your Design?, select one of the following:
- Draft – eArray saves the design with a status of Draft. Only you can edit the design.
- Review – eArray saves the design with a status of Review. The users in your workgroup can edit the design and save versions of it.
- Complete – eArray saves the design with a status of Complete. The design can be submitted to Agilent Manufacturing, and no one can edit the design.
- Submit – eArray saves the design with a status of Submitted, and submits the design to Agilent Manufacturing. You can then request a quote for the design. If you select this option, you must also click Design Checklist, then read and mark all of the items that appear.
Click Save.

eArray creates and saves your design. A success message appears.
Click Close.

Another success message appears.
Click Close.