Clustering of all known and predicted open reading frames of Escherichia coli K12

Takeshi Itoh [1]
Minoru Yano [1]
Keiko Takemoto [2]
Miwako Kajihara [1]
Hirotada Mori [1]

[1] Research and Education Center for Genetic Information,
Nara Institute of Science and Technology
8916-5 Takayama, Ikoma, Nara 630-01, Japan
[2] Institute for Virus Research, Kyoto Univ.
Syougoin-Kawahara, Sakyo, Kyoto 606-01, Japan


At present, non-redundant contig sequences of E. coli covering about 70% of the whole chromosome have been constructed. We predicted open reading frames (ORFs) from 2,554,518 bp of contig sequence on the basis of the Shine-Dalgarno (ribosome-binding) sequence. All ORFs were classified according to their structural similarities. Detailed examination of the homology among the ORFs in each group revealed several structural units.
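
As a rough illustration of ORF prediction guided by a Shine-Dalgarno sequence, the following Python sketch scans the forward strand for start codons that have an SD-like motif a short distance upstream and extends each one in frame to the first stop codon. It is not the procedure used in this work: the motif variants, the upstream window, the minimum length, and the function name find_orfs are all assumptions chosen for the example.

```python
import re

STOP_CODONS = {"TAA", "TAG", "TGA"}
START_CODONS = {"ATG", "GTG", "TTG"}
# Assumed SD-like core variants (illustrative only, not taken from the abstract).
SD_MOTIF = re.compile(r"AGGAG|GGAGG|AAGGA")


def find_orfs(seq, min_codons=30, sd_window=(5, 20)):
    """Return (start, end) pairs of forward-strand ORFs whose start codon
    has an SD-like motif between sd_window[0] and sd_window[1] nt upstream.

    Coordinates are 0-based; end is exclusive and includes the stop codon.
    For each stop codon only the most upstream qualifying start is kept.
    """
    seq = seq.upper()
    near, far = sd_window
    orfs = []
    used_ends = set()
    for start in range(len(seq) - 2):
        if seq[start:start + 3] not in START_CODONS:
            continue
        # Upstream region in which the SD-like motif must lie entirely.
        upstream = seq[max(0, start - far):max(0, start - near)]
        if not SD_MOTIF.search(upstream):
            continue
        # Walk codon by codon until the first in-frame stop.
        for pos in range(start, len(seq) - 2, 3):
            if seq[pos:pos + 3] in STOP_CODONS:
                end = pos + 3
                n_codons = (pos - start) // 3  # codons before the stop
                if n_codons >= min_codons and end not in used_ends:
                    used_ends.add(end)
                    orfs.append((start, end))
                break
    return orfs


if __name__ == "__main__":
    toy = "CCCC" + "AGGAGG" + "TTTTTT" + "ATG" + "GCT" * 5 + "TAA" + "CCC"
    print(find_orfs(toy, min_codons=3))
```

A real analysis would also scan the reverse complement and would probably score the SD match and spacing rather than apply a fixed pattern. The sketch only shows how a ribosome-binding signal can be used to decide which start codons to accept.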