Beckman Research Institute of the City of Hope
1450 E. Duarte Road, Duarte, CA 91010-0269
As far as the English-speaking world is concerned, it seems as
though history began with a palindrome. The first self introduction
of Adam to Eve in the Garden of Eden was likely to have been "Madam
I'm Adam.", which is a perfect palindrome. It should also be noted
that EVE, itself, is a palindrome. The beauty of palindromes is
found in their perfect symmetry, and because of this symmetry, peptide
palindromes appear to have contributed to the functional refinements
of diverse proteins (1). In any human language, a perfect and sensible
palindrome made of more than 10 words is extremely difficult to
compose. The longest that I have come up with, thus far, is: "Wolf's
deeds lived was dog DNA, and God saw devil's deeds flow." Two prior
knowledge are required for understanding the meaning of the above
palindrome. The ancestor of domesticated dogs were wolves and in
folklore of the Dark Ages, dogs were often depicted as messengers
of the devil. In proteins, on the other hand, peptide palindromes
longer than decapeptidic in length are not at all rare.
A. 30 Peptidic Palindromes in the 2I2-ResidueLong Mouse HI Histone
Variety
As shown in Fig. 1, HI histone and other DNA binding proteins tend
to be very rich in peptide palindromes. It would be seen that this
particular variety known as mouse HI histone variety-1 (2) contained
31 often overlapping peptide palindromes which involved 115 of the
212 residues (54%). The longest palindrome occupied the positions
190-to-203, its tetradecapeptidic palindrome being Lys-Ala-Val-Lys-Pro-Lys-Ala-Ala-Lys-Pro-Lys-Val-Ala-Lys
(2nd row from the bottom of Fig. 1). These 31 peptide palindromes,
however are not of 31 varieties, for a number of palindromes are
found in two or more copies. For example, Lys-Lys-Ala-Ala-Lys-Lys
hexapeptidic palindrome occurred twice in the 2nd and 9th row of
Fig. 1, while two tripeptidic palindromes made four appearances
each; more often than not as parts of longer palindromes. They were
Lys-Ala-Lys and Lys-Pro-Lys. The rule of protein construction is
that those with unusual amino acid compositions are invariably comprised
of oligopeptidic repeats; repeating units often being palindromic
(3). In the average amino acid composition deduced from 18,383 entries
in DATABASE, the four dominant residues, leu, Ala, Gly and Ser,
in the above order, comprise 32% of the total (4). In the case of
HI histones, on the contrary, two residues, lys and Ala, in the
above order, comprise 48% of the total. With so much deviation from
the average, a protein has no choice but to become oligopeptidic
repeats. Analogous situations can be found in human languages. Were
one asked to compose an essay using a rarely used syllable repeatedly,
an essay automatically becomes verse instead of prose. Let me illustrate
this point, with a syllable "ze" which is surely one of the seldom
used syllables in English. "Zero in Zen cannot be
characterized as a void .Rather Zenī s zero
is akin t? the absolute zero in temperature, thus representing
the symbolized zenith."
B. 36 Peptide Palindromes in a Protein Encodable By a Complementary
Strand
of Mouse HI Histone Variety-I Coding Sequence
Soon after the unveiling of the universal codon assignment, it
became clear that 32 pairs of complementary codons are designed
to encode amino acids of contrasting properties. For example, the
DNA codon CTC (CUC in RNA) encodes leu which is a hydrophobic residue
of substantial size, whereas itS complementary codon GAG encodes
Glu; an acidic therefore, hydrophilic residue. Similarly, the codon
AAG complementary to another Leu codon, CTT, encodes Lys which this
time is a basic but still hydrophilic residue. In fact, contrasts
between amino acids encoded by complementary codons are so striking,
independent suggestions have been made on several occasions, that
each pair of a Peptide hormone and itS receptor must have originally
been encoded by two complementary strands of the same DNA. The disappointing
fact is that complementary strands of coding sequences are, as a
rule, untranslatable in the corresponding reading frame, due to
frequent interruptions by three chain terminators, TAG, TAA and
TGA. Proteins with unusual amino acid compositions are often poor
in leu and Ser and for this reason, complementary strands of coding
sequences encoding these proteins become frequent exceptions to
this rule in that they can be translated in the corresponding reading
frame,to yield proteins that are as long as proteins encoded by
coding sequences. This is because three chain terminators in a complementary
strand are represented in the coding strand as two leu codons, CTA,
TTA and one Ser codon TCA Observing Fig. 1, it would be noted that
the 'coding sequence of mouse HI histone variety-1 is entirely free
of the above three leu and Ser codons. Accordingly, its complementary
strand when translated in .the ,corresponding reading frame, is
capable ?f yielding a protein longer than HI histone, itself. In
Fig. 1, a complementary strand corresponding to HI histone part
are aligned 5'to-3' below the coding sequence and translated in
the corresponding reading frame. Whereas, HI histone was Lys, Ala-rich,
its complementary protein was dominated by Leu, Gly and Arg; the
above three comprising 61% of the total. Yet, this complementary
protein was as rich in peptide palindromes as HI histone itself.
Fig. 1 shows that this complementary protein contained 36 often
overlapping palindromes which involved 115 of the 212 resi,dues
(54%) .Thus, in spite of their contrasting properties, HI histone
and its complementary protein were identical in the portion devoted,
to peptidic palindromes. As to the lengths of palindromes, two octapeptidic
ones, Leu-Arg-Gly-Leu-Leu-Gly-Arg-Leu in the 4th row and Gly Arg-Arg-Leu-Leu-Arg-Arg-Gly
in the 12th row of Fig. 1 were the longest in this complementary
protein.
Fig. 1 The entire coding sequence accompanied by the amino
acid sequence of mouse HI histone variety-l is shown in large capital
letters in 13 rows. Shown below each row in small capital letters
is its complementary sequence translated in the corresponding reading
frame aligned 5'-to-3'. Peptide palindromes in H-I histone are marked
by solid bars, whereas those in its complementary protein are marked
by open bars. The solid vertical bar in the 7th row indicates the
center of symmetry. The 180 degree pivot around this center by the
complementary sequence would produce DNA double helix.
This protein, too, was oligopeptidic repeats. There were four copies
of Leu-Gly-Leu tripeptidic palindrome; three becoming a part of
Leu-Gly-Leu-Gly repeating tetrapeptides. Another tripeptidic palindrome,
Leu-Arg-Leu was also in four copies; two of them becoming a part
of the repeating pentapeptidic palindrome, Leu-Leu-Arg-Leu-Leu.
Thus, we have learned a few things about a pair of proteins encoded
by complementary strands of the same DNA. Ion spite of markedly
contrasting properties evidently reflected in their very different
amino acid compositions, two proteins share the two basic features:
If one is rich in peptide palindromes, so would the other be to
an identical degree. If one is made mostly of oligopeptidic repeats,
the other would also be of oligopeptidic repeats.
C. Simultaneous Musical Transformation of Two Complementary Strands
of HI Histone Gene
The symmetrical beauty of palindromes was well appreciated by great
composers of the past, notably by Mozart and Haydn. The simultaneous
musical transformation of mouse HI histone variety-l coding sequence
and its open reading frame complementary sequence as the treble
and base clef scores should enable us to appreciate not only the
symmetrical beauty of successive peptide palindromes in HI histone
but also its interplay with equally numerous palindromes in its
complementary protein. This has been done in accordance with the
previously set rule (5). An excerpt of it is shown in Figure 2 that
represented positions I-to-3I followed by positions I48-to-161.
It can be played either on a piano or as a duet between violin and
viola.
D.Summary
Peptide palindromes are invariably found in all proteins, and long
palindromes exceeding 10 residues in length are not rare. They are
particularly abundant in DNA-binding proteins such as HI histone.
When a complementary strand of the coding sequence is translatable
being free of a chain terminator, a complementary protein encode
by it becomes equally abundant in peptide palindromes. The simultaneous
musical transformation of both strands of mouse H1 histone variety-1
DNA enable us to appreciate the symmetrical beauty of successive
palindromes appearing in both H1 histone and its complementary protein.
1.Ohno S (1989) Intrinsic evolution of proteins: the role of peptidic
palindromes. Revista di Bio 82:341-343
2.Cheng G, Nandy A, Clerk S, Skoultchi AI (1989) Different 3'-end
processing produces two independently regulated mRNAS from a single
H1 histone gene. Proc Natl Acad Sci USA 86:7002 7006
3.Ohno S (1991) The grammatical rule of DNA language: messages in
palindromic verses. "Evolution of Life" Fossils, Molecules, and
Culture, Springer-Verlag, Tokyo, Eds S Osawa, T Honjo, 97-108
4.Seto y (1989) Formation of proteins on the primitive earth. Evidence
for the oligoglycine hypothesis. Viva Origino 17:153-163
5.Ohno S, Ohno M (1986) The all pervasive principle of repetitious
recurrence governs not only coding sequence construction but also
human endeavor in musical composition. Immunogen 24:71-78
|