FlyChip
Functional Genomics for Drosophila
Cambridge Systems Biology Centre, Tennis Court Road, Cambridge, CB2 1QR, UK
Tel: +44 (0)1223-760280.   Fax: +44 (0)1223-760241.

A key to standard microarray spot identity tracking file formats

Overview

Each microarray spot has a unique position, and each spot corresponds to a specific probe DNA from a specific well of a microtitre plate. There are typically thousands to tens of thousands of probe DNAs distributed between tens of source plates, and thousands to tens of thousands of spots on each microarray. A single microarray experiment can consist of anything from a few to a hundred microarray hybridisations. Tracking where and what each probe DNA is on each microarray is therefore an important issue.

Most robotic spotters are supplied with a data tracking program. It uses one input file that describes the position of each probe DNA within the microtitre plates, and another file that defines how the microarray was printed, to produce a description of where each probe DNA is within each microarray. These spot identities can then be imported into a spot finding and quantification tool that will 'append' the fluorescence signal for each spot. The resulting data are then analysed to determine what effect any given experimental condition or treatment has had on the gene expression of the samples being compared.

For a definition of all terminology used below please refer to "Introduction to printing microarrays".

The microarray layout used in the following examples

1. Meta-grid Layout:

        A single 2x2 (X and Y axis) meta-grid printed on each slide
        Numbers below refer to the pins that printed the sub-grids

        01 02
        03 04

        Array position from bottom left of slide: 9.24 mm (X-axis), 34.19 mm (Y-axis)
        Meta-grid has an array area of 6.10 mm (X-axis) by 6.10 mm (Y-axis)

2. Sub-grid Layout:

        Each sub-grid has 4x4 (16) spots
        Centre-to-centre spot distance is 400 microns
        Numbers below refer to 'Source Visits'

        13 14 15 16
        09 10 11 12
        05 06 07 08
        01 02 03 04

        Key to Source Visits
        Source '1' to '16' = Imaginary clones 

3. Comments:
	
        This is a hypothetical microarray
        The array contains 64 spots per slide
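
The spot count follows directly from the layout: a 2x2 meta-grid of 4x4 sub-grids. The Python sketch below reproduces that arithmetic and the 'Source Visit' numbering of the sub-grid diagram. It assumes visits are numbered row by row starting from the bottom left spot, as drawn above; the constant and function names are illustrative only.

```python
PINS_X, PINS_Y = 2, 2      # meta-grid: one sub-grid per pin
SPOTS_X, SPOTS_Y = 4, 4    # spots per sub-grid


def spots_per_slide():
    """Total spots = number of sub-grids x spots per sub-grid."""
    return PINS_X * PINS_Y * SPOTS_X * SPOTS_Y


def source_visit_grid():
    """Source visit numbers, listed top row first as in the diagram.

    Assumption: visit 1 is the bottom left spot and numbering runs
    left to right, then upwards.
    """
    rows = []
    for r in range(SPOTS_Y - 1, -1, -1):
        rows.append([r * SPOTS_X + c + 1 for c in range(SPOTS_X)])
    return rows


for row in source_visit_grid():
    print(" ".join(f"{visit:02d}" for visit in row))
print(spots_per_slide(), "spots per slide")
```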

The microarray input file used in the following examples

The following is a small section of a file that describes a fictitious source plate. Well position is defined by source plate barcode (e.g. TST101), row (A to P) and column (1 to 24). CloneID is a cDNA clone accession code and UniqueID is a well-specific identifier. The columns do not need to be in the order shown. The example shown is a 'tab separated value' (TSV) text file.

CloneID	UniqueID	Row	Column
FC1234	TST101  	A	1
FC1235	TST101  	A	2
FC1236	TST101  	A	3
FC1237	TST101  	A	4
Empty	Empty   	A	5
Empty	Empty   	A	6
Empty	Empty   	A	7
Empty	Empty   	A	8
Empty	Empty   	A	9
Empty	Empty   	A	10
Empty	Empty   	A	11
Empty	Empty   	A	12
Empty	Empty   	A	13
Empty	Empty   	A	14
Empty	Empty   	A	15
Empty	Empty   	A	16
Empty	Empty   	A	17
Empty	Empty   	A	18
Empty	Empty   	A	19
Empty	Empty   	A	20
Empty	Empty   	A	21
Empty	Empty   	A	22
Empty	Empty   	A	23
Empty	Empty   	A	24
FC1238	TST101  	B	1
FC1239	TST101  	B	2
FC1240	TST101  	B	3
FC1241	TST101  	B	4
Empty	Empty   	B	5
Empty	Empty   	B	6
Empty	Empty   	B	7
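
Because the columns may appear in any order, a reader for this file should look fields up by header name rather than by position. The following is a minimal Python sketch of that idea using the standard csv module. The sample text and the function name read_source_plate are illustrative, and stray padding around values is stripped as a precaution.

```python
import csv
import io

# Illustrative sample using the header names shown above.
SAMPLE = (
    "CloneID\tUniqueID\tRow\tColumn\n"
    "FC1234\tTST101\tA\t1\n"
    "FC1235\tTST101\tA\t2\n"
    "Empty\tEmpty\tA\t5\n"
)


def read_source_plate(handle):
    """Return {(row, column): record}, reading fields by header name."""
    reader = csv.DictReader(handle, delimiter="\t")
    wells = {}
    for rec in reader:
        # Strip padding; skip any unnamed surplus fields.
        rec = {k.strip(): (v or "").strip()
               for k, v in rec.items() if k is not None}
        wells[(rec["Row"], int(rec["Column"]))] = rec
    return wells


wells = read_source_plate(io.StringIO(SAMPLE))
print(wells[("A", 1)]["CloneID"])
```

Keying the result on (row, column) mirrors how the text describes a well position; a real file covering several plates would also need the plate barcode in the key.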

This input file was used to create a series of clone tracking files using the MicroGrid II so that the format structures could be explained using worked examples. These are shown in the following sections. An asterisk denotes an explanation rather than a component of the named file format.

Example microarray spot identity file formats

BioRobotics 'TAM' format

[FileInformation]
FileFormat=,1.0 *Version number of the file format
FormatName=,TAM *File format extension
GeneratedBy=,TAS2.1.5.16 *Arrayer software and version
BlockCount=,4 *Number of sub-grids
SpotSize=,180 *Estimated mean spot diameter


[Block1] *Sub-grid being examined
MetaGridX=,1 *Meta-grid X-axis co-ordinate
MetaGridY=,1 *Meta-grid Y-axis co-ordinate
OriginX=,9000 *X-axis distance from top left edge of slide to centre of top left spot
OriginY=,35300 *Y-axis distance from top left edge of slide to centre of top left spot
BlockSizeX=,4 *Number of spots in each row
BlockSizeY=,4 *Number of spots in each column
SpacingX=,400 *X-axis centre-to-centre spot spacing
SpacingY=,400 *Y-axis centre-to-centre spot spacing
*Information is repeated for each sub-grid on the slide
			
[mapping] *'comma separated value' spreadsheet that maps clones to wells
1,1,1,1,,1,9,4,"FC1269","1036",1, {},FC1269,1036,TST101,I,4
1,1,1,2,,1,1,4,"FC1237","1004",1, {},FC1237,1004,TST101,A,4
1,1,1,3,,1,9,2,"FC1267","1034",1, {},FC1267,1034,TST101,I,2
1,1,1,4,,1,1,2,"FC1235","1002",1, {},FC1235,1002,TST101,A,2
1,1,2,1,,1,11,4,"FC1277","1044",1, {},FC1277,1044,TST101,K,4

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is top left of slide for all measurements.

*Column 01: Meta-grid X-axis co-ordinate
*Column 02: Meta-grid Y-axis co-ordinate
*Column 03: Sub-grid Y-axis co-ordinate
*Column 04: Sub-grid X-axis co-ordinate
*Column 05: Plate Barcode
*Column 06: Plate Number
*Column 07: Row Number
*Column 08: Column Number
*Column 09: Sample Name
*Column 10: Sample ID
*Column 11: Block number
*Column 12: Termination of additional fields
*Column 13 to End: Source data from the operator (See Example Array Input File)
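
The structure described above (bracketed section names, 'Key=,value' settings and a comma separated [mapping] table) can be read with a few lines of Python. This is a sketch only: it assumes the file contains no asterisk annotations (those are explanations added on this page), and the field names in MAPPING_FIELDS are labels invented here from the column key above.

```python
import csv

# Cut-down sample without the explanatory asterisk annotations.
SAMPLE_TAM = """\
[FileInformation]
FileFormat=,1.0
FormatName=,TAM
BlockCount=,4
SpotSize=,180

[Block1]
OriginX=,9000
OriginY=,35300
SpacingX=,400
SpacingY=,400

[mapping]
1,1,1,1,,1,9,4,"FC1269","1036",1, {},FC1269,1036,TST101,I,4
1,1,1,2,,1,1,4,"FC1237","1004",1, {},FC1237,1004,TST101,A,4
"""

# Hypothetical labels for columns 1 to 11, following the key above.
MAPPING_FIELDS = ["meta_x", "meta_y", "sub_y", "sub_x", "barcode", "plate",
                  "row", "column", "sample_name", "sample_id", "block"]


def parse_tam(text):
    """Return (settings by section, list of spot records)."""
    settings, spots, section = {}, [], None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith("[") and line.endswith("]"):
            section = line[1:-1]
            continue
        if section is not None and section.lower() == "mapping":
            row = next(csv.reader([line]))
            spot = dict(zip(MAPPING_FIELDS, row[:11]))
            spot["extra"] = row[12:]   # operator data after the {} marker
            spots.append(spot)
        elif "=," in line:
            key, value = line.split("=,", 1)
            settings.setdefault(section, {})[key] = value
    return settings, spots


settings, spots = parse_tam(SAMPLE_TAM)
print(settings["Block1"]["OriginX"], spots[1]["sample_name"])
```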

Axon 'GAL' format

ATF	1.0
10	5 *The first term is the number of header lines, not counting this line, the
         previous line or the column header row. The second term is the number of data columns
"Type=GenePix ArrayList V1.0" *Version number of the file format
"BlockCount=4" *Number of sub-grids
"BlockType=0"
"Block1=9000, 35300, 180, 4, 400, 4, 400" *Mapping information for sub-grid 1 
"Block2=13500, 35300, 180, 4, 400, 4, 400" *Mapping information for sub-grid 2 
"Block3=9000, 39800, 180, 4, 400, 4, 400" *Mapping information for sub-grid 3 
"Block4=13500, 39800, 180, 4, 400, 4, 400" *Mapping information for sub-grid 4 
"Supplier=BioRobotics" *Arrayer manufacturer
"ArrayerSoftwareName=TAS Application Suite (MicroGrid II)" *Arrayer software name
"ArrayerSoftwareVersion=2.1.5.16" *Arrayer software version
"Block"	"Column"	"Row"	"ID"	"Name"
*Header for clone mapping 'tab separated value' (TSV) spreadsheet 
1	1	1	1036	FC1269
1	1	2	1004	FC1237
1	1	3	1034	FC1267
1	1	4	1002	FC1235
1	2	1	1044	FC1277
1	2	2	1012	FC1245
1	2	3	1042	FC1275
1	2	4	1010	FC1243
1	3	1	1052	FC1285
1	3	2	1020	FC1253
1	3	3	1050	FC1283
1	3	4	1018	FC1251
1	4	1	1060	FC1293
1	4	2	1028	FC1261
1	4	3	1058	FC1291
1	4	4	1026	FC1259
2	1	1	1040	FC1273
2	1	2	1008	FC1241
2	1	3	1038	FC1271
2	1	4	1006	FC1239
2	2	1	1048	FC1281
2	2	2	1016	FC1249
2	2	3	1046	FC1279
2	2	4	1014	FC1247
2	3	1	1056	FC1289
2	3	2	1024	FC1257

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is top left of slide for all measurements.

*Block: Sub-grid identification
*Column: Sub-grid Y-axis co-ordinate
*Row: Sub-grid X-axis co-ordinate
*ID: Sample ID (See Example Array Input File)
*Name: Sample Name (See Example Array Input File)
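
A GAL file can be read by using the header count on the second line to separate the quoted header records from the clone table. The Python sketch below does this for a cut-down sample. The order assumed for the seven 'BlockN' values (X origin, Y origin, spot diameter, spots in X, X spacing, spots in Y, Y spacing) is an inference from matching the numbers against the [Block1] section of the TAM example, and should be checked against the GenePix documentation before being relied on.

```python
# Cut-down sample: three header records instead of ten.
SAMPLE_GAL = "\n".join([
    "ATF\t1.0",
    "3\t5",
    '"Type=GenePix ArrayList V1.0"',
    '"BlockCount=4"',
    '"Block1=9000, 35300, 180, 4, 400, 4, 400"',
    '"Block"\t"Column"\t"Row"\t"ID"\t"Name"',
    "1\t1\t1\t1036\tFC1269",
    "1\t1\t2\t1004\tFC1237",
])

# Assumed field order for a BlockN record (see the note above).
BLOCK_FIELDS = ("x_origin", "y_origin", "diameter",
                "x_features", "x_spacing", "y_features", "y_spacing")


def parse_gal(text):
    """Return (header records, block geometry, clone records)."""
    lines = text.splitlines()
    n_headers, n_columns = (int(v) for v in lines[1].split())
    headers, blocks = {}, {}
    for line in lines[2:2 + n_headers]:
        key, _, value = line.strip().strip('"').partition("=")
        headers[key] = value
        # BlockCount and BlockType are skipped: only BlockN has a number.
        if key.startswith("Block") and key[5:].isdigit():
            numbers = [int(v) for v in value.split(",")]
            blocks[int(key[5:])] = dict(zip(BLOCK_FIELDS, numbers))
    columns = [c.strip('"') for c in lines[2 + n_headers].split("\t")]
    records = [dict(zip(columns, row.split("\t")))
               for row in lines[3 + n_headers:] if row.strip()]
    return headers, blocks, records


headers, blocks, records = parse_gal(SAMPLE_GAL)
print(blocks[1]["x_origin"], records[1]["Name"])
```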

Molecularware 'MWBR' format

[FileInformation]
FileFormat=,1.0.2 *Version number of the file format
FormatName=,MwBr *File format extension
GeneratedBy=,TAS2.1.5.16 *Arrayer software and version	
BlockCount=,4 *Number of sub-grids
SpotSize=,180 *Estimated mean spot diameter

[Source]
Comment=,Genetix384 *Source plate type
XWells=,16 *Number of wells in X-axis
YWells=,24 *Number of wells in Y-axis
XPitch=,4.5 *Well spacing in X-axis
YPitch=,4.5 *Well spacing in Y-axis
PlateCount=,1 *Number of source plates
	
[Tool]
Description=,2x2 (384 well) split pin *MicroSpot II pin number and type
PinsX=,2 *Number of MicroSpot pins in X-axis
PinsY=,2 *Number of MicroSpot pins in Y-axis
PinPitch=,4500 *Pin-to-pin spacing in the pin-tool
	
[Target]
TargetWidth=,25000 *Width of the glass microscope slide 
TargetHeight=,75000 *Height of the glass microscope slide 
LeftMargin=,9239 *Meta-grid distance from left of slide	
RightMargin=,9239 *Meta-grid distance from right of slide 
TopMargin=,34189 *Meta-grid distance from top of slide
BottomMargin=,34189 *Meta-grid distance from bottom of slide 
XSpacing=,0 *Spacing between meta-grids in X-axis
YSpacing=,0 *Spacing between meta-grids in Y-axis
NumberOfCopies=,27 *Number of slides being printed
	
[slides] *Mapping of the slide the microarray was printed on
Slide1=,1,20000,243000, *List abbreviated to save space
	
[Block1] *Sub-grid being examined
MetaGridX=,1 *Meta-grid X-axis co-ordinate
MetaGridY=,1 *Meta-grid Y-axis co-ordinate
OriginX=,9000 *X-axis distance from top left edge of slide to centre of top left spot
OriginY=,35300 *Y-axis distance from top left edge of slide to centre of top left spot
BlockSizeX=,4 *Number of spots in each row
BlockSizeY=,4 *Number of spots in each column
SpacingX=,400 *X-axis centre-to-centre spot spacing
SpacingY=,400 *Y-axis centre-to-centre spot spacing
*Information is repeated for each sub-grid on the slide
	
[mapping] *As for TAM format except columns 3 and 4 are in the reverse order
1,1,1,1,,1,9,4,"FC1269","1036",1, {},FC1269,1036,TST101,I,4
1,1,1,2,,1,1,4,"FC1237","1004",1, {},FC1237,1004,TST101,A,4

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is top left of slide for all measurements.

Applied Precision Instruments arrayWoRx 'REF' format

#ArrayWoRx Reference File *File Format
#Tag 0x00FF040C
#Version: 2.10 *File format version
#GridOrigin: 34190 9660
#GridRotation: 0
#GridType: Irregular
#GridColumnsRows: 64 1
#GridWidthHeight: 6100 6100 *Array size: X-axis, Y-axis
#NomSpotSpacingColumnRow: 400 400 *X-axis Y-axis centre-to-centre spot distance
#OddRowOffset: 0
#Number_UniqueID_Types: 2
#UniqueID_Type 1: 0
#UniqueID_Type 2: 0
#Column 1: Spot number *Key to the clone spreadsheet
#Column 2: Spot name 1 [gene name] *Key to the clone spreadsheet
#Column 3: Spot name 2 [chromosome] *Key to the clone spreadsheet
#Column 4: Spot type [1=experiment; 2=ratio control] *Key to the clone spreadsheet
#Column 5: Unique ID 1 *Key to the clone spreadsheet
#Column 6: Unique ID 2 *Key to the clone spreadsheet
#Column 7: X coordinate *Key to the clone spreadsheet  
#Column 8: Y coordinate *Key to the clone spreadsheet  
#Column 9: Spot size 1 [width in um] *Key to the clone spreadsheet
#Column 10: Spot size 2 [height in um] *Key to the clone spreadsheet
#Column 11: Bounding box width [ROI about the spot] *Key to the clone spreadsheet
#Column 12: Bounding box height [ROI about the spot] *Key to the clone spreadsheet
#Column 13: Reserved column 1 *Key to the clone spreadsheet
#Column 14: Reserved column 2 *Key to the clone spreadsheet
#Column 15: Reserved column 3 *Key to the clone spreadsheet
#Column 16: Reserved column 4 *Key to the clone spreadsheet
#Column 17: Reserved column 5 *Key to the clone spreadsheet
#Column 18: Reserved column 6 *Key to the clone spreadsheet
#Column 19: Reserved column 7 *Key to the clone spreadsheet
#Column 20: Reserved column 8 *Key to the clone spreadsheet
#Column 21: Include flag [0=exclude; 1=include] *Key to the clone spreadsheet
#Column 22: Description [optional] *Key to the clone spreadsheet

*Tab separated value (TSV) clone spreadsheet

1 FC1262 unknown 1 1029 Unknown 34190 9660 180 180 200 200 * * * * * * * * 1 Plate 1, Well H1
2 FC1294 unknown 1 1061 Unknown 34590 9660 180 180 200 200 * * * * * * * * 1 Plate 1, Well P1
3 FC1264 unknown 1 1031 Unknown 34990 9660 180 180 200 200 * * * * * * * * 1 Plate 1, Well H3
4 FC1296 unknown 1 1063 Unknown 35390 9660 180 180 200 200 * * * * * * * * 1 Plate 1, Well P3

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is bottom left of slide for all measurements.
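
Since the REF header names its own columns in the '#Column N:' lines, a reader can build the column key from the file itself. The Python sketch below illustrates this on a cut-down sample that declares only the first four columns (the real file declares 22). It assumes the clone table is tab separated, as stated above, and the function name parse_ref is illustrative.

```python
import re

# Cut-down sample: first four columns only, no asterisk annotations.
SAMPLE_REF = "\n".join([
    "#ArrayWoRx Reference File",
    "#Version: 2.10",
    "#GridOrigin: 34190 9660",
    "#Column 1: Spot number",
    "#Column 2: Spot name 1 [gene name]",
    "#Column 3: Spot name 2 [chromosome]",
    "#Column 4: Spot type [1=experiment; 2=ratio control]",
    "1\tFC1262\tunknown\t1",
    "2\tFC1294\tunknown\t1",
])

COLUMN_RE = re.compile(r"^#Column (\d+): (.*)$")


def parse_ref(text):
    """Return (header settings, column names by number, spot records)."""
    header, columns, rows = {}, {}, []
    for line in text.splitlines():
        if not line.strip():
            continue
        if line.startswith("#"):
            match = COLUMN_RE.match(line)
            if match:
                # Drop the bracketed hint, e.g. "[gene name]".
                name = match.group(2).split("[")[0].strip()
                columns[int(match.group(1))] = name
            elif ":" in line:
                key, value = line[1:].split(":", 1)
                header[key.strip()] = value.strip()
            continue
        values = line.split("\t")
        rows.append({columns[i + 1]: v
                     for i, v in enumerate(values) if i + 1 in columns})
    return header, columns, rows


header, columns, rows = parse_ref(SAMPLE_REF)
print(header["GridOrigin"], rows[0]["Spot name 1"])
```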

Quantarray file format

*'Tab separated value' (TSV) text file

1	1	1	1	1036
1	1	2	1	1004
1	1	3	1	1034
1	1	4	1	1002
1	1	1	2	1044
1	1	2	2	1012
1	1	3	2	1042
1	1	4	2	1010
1	1	1	3	1052
1	1	2	3	1020
1	1	3	3	1050
1	1	4	3	1018

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is top left of slide for all measurements.

*Column 01: Meta-grid X-axis co-ordinate
*Column 02: Meta-grid Y-axis co-ordinate
*Column 03: Sub-grid Y-axis co-ordinate
*Column 04: Sub-grid X-axis co-ordinate
*Column 05: 'UniqueID' (See Example Array Input File)

Imagene file format

*'Tab separated value' (TSV) text file

1	1	1	1	1036
1	1	1	2	1004
1	1	1	3	1034
1	1	1	4	1002
1	1	2	1	1044
1	1	2	2	1012
1	1	2	3	1042
1	1	2	4	1010
1	1	3	1	1052
1	1	3	2	1020
1	1	3	3	1050
1	1	3	4	1018

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is top left of slide for all measurements.

*Column 01: Meta-grid X-axis co-ordinate
*Column 02: Meta-grid Y-axis co-ordinate
*Column 03: Sub-grid X-axis co-ordinate
*Column 04: Sub-grid Y-axis co-ordinate
*Column 05: 'UniqueID' (See Example Array Input File)

Layout Map XYxy

*'Tab separated value' (TSV) text file

1	1	1	1	1036
1	1	1	2	1004
1	1	1	3	1034
1	1	1	4	1002
1	1	2	1	1044
1	1	2	2	1012
1	1	2	3	1042
1	1	2	4	1010
1	1	3	1	1052
1	1	3	2	1020
1	1	3	3	1050
1	1	3	4	1018

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is top left of slide for all measurements.

*Column 01: Meta-grid X-axis co-ordinate
*Column 02: Meta-grid Y-axis co-ordinate
*Column 03: Sub-grid X-axis co-ordinate
*Column 04: Sub-grid Y-axis co-ordinate
*Column 05: 'UniqueID' (See Example Array Input File)

Layout Map XYyx

*'Tab separated value' (TSV) text file

1	1	1	1	1036
1	1	2	1	1004
1	1	3	1	1034
1	1	4	1	1002
1	1	1	2	1044
1	1	2	2	1012
1	1	3	2	1042
1	1	4	2	1010
1	1	1	3	1052
1	1	2	3	1020
1	1	3	3	1050
1	1	4	3	1018

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is top left of slide for all measurements.

*Column 01: Meta-grid X-axis co-ordinate
*Column 02: Meta-grid Y-axis co-ordinate
*Column 03: Sub-grid Y-axis co-ordinate
*Column 04: Sub-grid X-axis co-ordinate
*Column 05: 'UniqueID' (See Example Array Input File)

Layout Map YXxy

*'Tab separated value' (TSV) text file

1	1	1	1	1036
1	1	1	2	1004
1	1	1	3	1034
1	1	1	4	1002
1	1	2	1	1044
1	1	2	2	1012
1	1	2	3	1042
1	1	2	4	1010
1	1	3	1	1052
1	1	3	2	1020
1	1	3	3	1050
1	1	3	4	1018

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is top left of slide for all measurements.

*Column 01: Meta-grid Y-axis co-ordinate
*Column 02: Meta-grid X-axis co-ordinate
*Column 03: Sub-grid X-axis co-ordinate
*Column 04: Sub-grid Y-axis co-ordinate
*Column 05: 'UniqueID' (See Example Array Input File)

Layout Map YXyx

*'Tab separated value' (TSV) text file

1	1	1	1	1036
1	1	2	1	1004
1	1	3	1	1034
1	1	4	1	1002
1	1	1	2	1044
1	1	2	2	1012
1	1	3	2	1042
1	1	4	2	1010
1	1	1	3	1052
1	1	2	3	1020
1	1	3	3	1050
1	1	4	3	1018

*Spreadsheet abbreviated for brevity.
*Each row of data relates to each spot on the microarray.
*Origin is top left of slide for all measurements.

*Column 01: Meta-grid Y-axis co-ordinate
*Column 02: Meta-grid X-axis co-ordinate
*Column 03: Sub-grid Y-axis co-ordinate
*Column 04: Sub-grid X-axis co-ordinate
*Column 05: 'UniqueID' (See Example Array Input File)
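
The four Layout Map formats differ only in the order of their first four columns, so one can be converted into another by permuting those columns. The Python sketch below treats the format name itself as the column key (X and Y for meta-grid co-ordinates, x and y for sub-grid co-ordinates). The function name reorder_layout is illustrative; this is not part of any arrayer software.

```python
def reorder_layout(rows, source, target):
    """Convert rows between layout maps such as 'XYyx' and 'XYxy'.

    Each letter names one of the first four columns. The fifth column
    (UniqueID) and anything after it is carried over unchanged.
    """
    if sorted(source) != sorted(target) or len(set(source)) != 4:
        raise ValueError("layouts must be permutations of the same four letters")
    picks = [source.index(letter) for letter in target]
    return [[row[i] for i in picks] + list(row[4:]) for row in rows]


# Two rows taken from the XYyx example above.
xyyx = [["1", "1", "2", "1", "1004"],
        ["1", "1", "1", "2", "1044"]]
xyxy = reorder_layout(xyyx, "XYyx", "XYxy")
for row in xyxy:
    print("\t".join(row))
```

With the rows above, the output matches the corresponding lines of the XYxy example (1004 at 1, 1, 1, 2 and 1044 at 1, 1, 2, 1).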

R. Auburn (17-02-2006).