XI.6.2.4 The sequence of some motifs prevents random start of DNA transcription outside the gene area
Susomo Ohno also noticed that the sequences of oligonucleotides are not entirely arbitrary (Ohno 1982).There is a large amount of DNA that does not encode any gene in practically every eukaryotic cell.The chromosome metazoa is frequently compared to a desert with scattered oases of meaningful genes.The human genome with a length of 3000 Mb contains approximately 30 thousand genes (Lander et al. 2001).This means that the individual genes are, on an average, separated by DNA sections with a length of the order of 100 thousand nucleotides.If the DNA sequences were random in these “stuffings” , then there would necessarily have to occasionally occur a target sequence for the RNA-polymerase II enzyme, which synthesizes mRNA in the eukaryotic cell.The cell would then completely unnecessarily begin to synthesize senseless mRNA at sites outside of the actual gene.In order to reduce, as far as possible, the occurrence of this undesirable phenomenon, the sequences of non-coding DNA in the genome are frequently formed by repetition of the AGCTG AGCTG AGCTG GGGTG sequence.The same motif also appears inside some genes.Comparison of this sequence with the sequence of target sites for RNA-polymerase II, TATAAATA, reveals that they are extremely far apart and that a great many mutations would be required for the formation of a target site for RNA-polymerase II.In addition, the oligonucleotide of blind sites contains three termination codons TGA (underlined), one in each reading frame.As a consequence, even if a polymerase target site is formed, it is probable that the length of the synthesized meaningless proteins will not be very great.