Electronic Dissertations Library

Identification of β-sheet motifs in three-dimensional protein structures, using a subgraph isomorphism algorithm: an update of a 1992 study, by Ruth V. Spriggs

Appendix C
Table 9) Data for Ten Stranded Sheets

Key at base of page


Number of strands | Motif searched for | Inversion Coefficient | Motif Code | Number of matches | In number of files
10 1010101010 0.9 g i k 1916 31
1111111111 0.00 h j l 64 11
1101010101 0.8 g i k 29 8
1010101001 0.8 g i k 33 7
1111111110 0.1 h j l 12 7
1011010101 0.8 g i k 23 6
1001010110 0.7 g i k 22 3
1010000101 0.6 b f g i 22 3
1010100101 0.8 g i k 29 3
1011111110 0.3 b g h j l 6 3
1101010100 0.7 g i k 16 3
1010110101 0.8 g i k 30 2
1011011010 0.7 g i 6 2
1011111101 0.4 b g h j l 8 2
1101010011 0.6 g i k 3 2
1101101010 0.7 g i k 4 2
1101111101 0.4 b g h j l 2 2
1110101000 0.5 b f g h i k 4 2
1111111101 0.2 b g h j l 6 2
1111111001 0.2 h j l 2 2
1000000001 0.2 - 2 1
1001001001 0.6 g 2 1
1010000001 0.4 b d f g i 1 1
1010001001 0.6 b g i 2 1
1010010001 0.6 g i 2 1
1010011010 0.7 g i 2 1
1010100110 0.7 g i k 2 1
1010110110 0.7 g i k 1 1
1010111010 0.7 b f g h i k 1 1
1010111110 0.5 b d f g h i j k l 1 1
1011001010 0.7 g i k 2 1
1011011110 0.5 b g h j 1 1
1011101010 0.7 b f g h i k 1 1
1011101101 0.6 g h 1 1
1011110101 0.6 b d f g h i j k 2 1
1011110110 0.5 b g h j 2 1
1100000001 0.2 - 1 1
1100000010 0.3 b g 4 1
1100101010 0.7 g i k 1 1
1101000010 0.5 b f g i 1 1
1101010010 0.7 g i k 16 1
1101100101 0.6 g i 1 1
1101101101 0.6 g 4 1
1101101110 0.5 g h 1 1
1101110110 0.5 g h 2 1
1101111010 0.5 b f g h i j 1 1
1101111100 0.3 b g h j l 1 1
1110000001 0.2 a e h 2 1
1110000011 0.2 a e h 1 1
1110100001 0.4 b f g h i 1 1
1110110110 0.5 g h 1 1
1111010000 0.3 b f g h i j 4 1
1111011010 0.5 b g h i j 1 1
1111100000 0.1 a c e h j l 2 1
1111110000 0.1 a c e h j l 2 1
1111110010 0.3 g h j l 1 1
1111111010 0.3 b d f g h i j l 1 1
1111111011 0.2 b g h j l 1 1
1000011110 0.3 a c e h j 0 0
1000100001 0.4 b g 0 0
1000101110 0.5 b g h i 0 0
1000110001 0.4 - 0 0
1000111110 0.3 a e h j l 0 0
1001000001 0.4 b g 0 0
1001001110 0.5 g h 0 0
1001010001 0.6 b f g i k 0 0
1001011110 0.5 b f g h i j 0 0
1001100001 0.4 - 0 0
1001100110 0.5 - 0 0
1001101001 0.6 g i 0 0
1001101110 0.5 g h 0 0
1001110001 0.4 a h 0 0
1001110110 0.5 g h 0 0
1001111001 0.4 h j 0 0
1001111110 0.3 h j l 0 0
1010000110 0.5 b f g i 0 0
1010001110 0.5 a b g h i 0 0
1010010110 0.7 g i 0 0
1010011001 0.6 g i 0 0
1010011110 0.5 g h i j 0 0
1010100001 0.6 b d f g i k 0 0
1010101110 0.7 b f g h i k 0 0
1010110001 0.6 g i k 0 0
1010111001 0.6 b f g h i k 0 0
1011000001 0.4 g 0 0
1011000101 0.6 b g i 0 0
1011000110 0.5 g 0 0
1011001001 0.6 g 0 0
1011001101 0.6 g 0 0
1011001110 0.5 g h 0 0
1011010001 0.6 b g i 0 0
1011010010 0.7 g i 0 0
1011010110 0.7 g i k 0 0
1011011001 0.6 g 0 0
1011100001 0.4 a e g h 0 0
1011100010 0.5 a g h 0 0
1011100101 0.6 g h i 0 0
1011100110 0.5 g h 0 0
1011101001 0.6 b g h i 0 0
1011101110 0.5 g h 0 0
1011110001 0.4 a b e g h j 0 0
1011110010 0.5 b g h j 0 0
1011111001 0.4 b g h j l 0 0
1011111010 0.5 b d f g h i j l 0 0
1100000011 0.2 - 0 0
1100000101 0.4 b d f g i 0 0
1100000110 0.3 - 0 0
1100001001 0.4 b g 0 0
1100001010 0.5 b d f g i k 0 0
1100001101 0.4 g 0 0
1100001110 0.3 a e h 0 0
1100010001 0.4 g 0 0
1100010010 0.5 g 0 0
1100010101 0.6 b f g i k 0 0
1100010110 0.5 b g i 0 0
1100011001 0.4 - 0 0
1100011010 0.5 g i 0 0
1100011100 0.3 a h 0 0
1100011101 0.4 a g h 0 0
1100011110 0.3 a e h j 0 0
1100100001 0.4 b g 0 0
1100100010 0.5 g 0 0
1100100011 0.4 g 0 0
1100100101 0.6 g i 0 0
1100100110 0.5 g 0 0
1100101001 0.6 g i k 0 0
1100101100 0.5 g i 0 0
1100101101 0.6 g i 0 0
1100101110 0.5 b g h i 0 0
1100110001 0.4 - 0 0
1100110010 0.5 g 0 0
1100110011 0.4 - 0 0
1100110101 0.6 g i k 0 0
1100110110 0.5 g 0 0
1100111001 0.4 h 0 0
1100111010 0.5 b g h i 0 0
1100111100 0.3 h j 0 0
1100111101 0.4 b g h j 0 0
1100111110 0.3 h j l 0 0
1101000001 0.4 b d f g i 0 0
1101000011 0.4 b f g i 0 0
1101000101 0.6 b g i 0 0
1101000110 0.5 b g i 0 0
1101001001 0.6 g i 0 0
1101001010 0.7 g i k 0 0
1101001011 0.6 g i 0 0
1101001100 0.5 g i 0 0
1101001101 0.6 g i 0 0
1101001110 0.5 g h i 0 0
1101010001 0.6 b f g i k 0 0
1101010110 0.7 g i k 0 0
1101011001 0.6 g i k 0 0
1101011010 0.7 g i k 0 0
1101011100 0.5 b f g h i k 0 0
1101011101 0.6 b f g h i k 0 0
1101011110 0.5 b d f g h i j k 0 0
1101100001 0.4 g 0 0
1101100010 0.5 g 0 0
1101100011 0.4 g 0 0
1101100100 0.5 g 0 0
1101100110 0.5 g 0 0
1101101001 0.6 g i 0 0
1101101011 0.6 g i k 0 0
1101101100 0.5 g 0 0
1101110001 0.4 a g h 0 0
1101110010 0.5 g h 0 0
1101110011 0.4 g h 0 0
1101110100 0.5 b g h i 0 0
1101110101 0.6 b f g h i k 0 0
1101111001 0.4 b g h j 0 0
1101111011 0.4 b g h j 0 0
1101111110 0.3 b g h j l 0 0
1110000010 0.3 a b e g h 0 0
1110000100 0.3 a b e g h 0 0
1110000101 0.4 a b e f g h i 0 0
1110000110 0.3 a e h 0 0
1110000111 0.2 a e h 0 0
1110001001 0.4 a g h 0 0
1110001010 0.5 a b f g h i k 0 0
1110001011 0.4 a b g h i 0 0
1110001100 0.3 a h 0 0
1110001101 0.4 a g h 0 0
1110001110 0.3 a h 0 0
1110010001 0.4 g h 0 0
1110010010 0.5 g h 0 0
1110010011 0.4 g h 0 0
1110010100 0.5 g h i k 0 0
1110010101 0.6 g h i k 0 0
1110010110 0.5 g h i 0 0
1110011000 0.3 h 0 0
1110011001 0.4 h 0 0
1110011010 0.5 g h i 0 0
1110011011 0.4 g h 0 0
1110011100 0.3 h 0 0
1110011101 0.4 g h 0 0
1110011110 0.3 h j 0 0
1110100010 0.5 b g h i 0 0
1110100011 0.4 b g h i 0 0
1110100100 0.5 b g h i 0 0
1110100101 0.6 b g h i 0 0
1110100110 0.5 b g h i 0 0
1110100111 0.4 b g h i 0 0
1110101001 0.6 b f g h i k 0 0
1110101010 0.7 b f g h i k 0 0
1110101011 0.6 b f g h i k 0 0
1110101100 0.5 b f g h i k 0 0
1110101101 0.6 b f g h i k 0 0
1110101110 0.5 b f g h i k 0 0
1110110001 0.4 g h 0 0
1110110010 0.5 g h 0 0
1110110011 0.4 g h 0 0
1110110100 0.5 g h i 0 0
1110110101 0.6 g h i k 0 0
1110110111 0.4 g h 0 0
1110111000 0.3 a g h 0 0
1110111001 0.4 g h 0 0
1110111010 0.5 b g h i 0 0
1110111011 0.4 g h 0 0
1110111100 0.3 b g h j 0 0
1110111101 0.4 b g h j 0 0
1110111110 0.3 b g h j l 0 0
1111000001 0.2 a c e h j 0 0
1111000010 0.3 a b c e g h j 0 0
1111000011 0.2 a c e h j 0 0
1111000100 0.3 a e g h j 0 0
1111000101 0.4 a b e g h i j 0 0
1111000110 0.3 a e h j 0 0
1111000111 0.2 a e h j 0 0
1111001000 0.3 g h j 0 0
1111001001 0.4 g h j 0 0
1111001010 0.5 g h i j k 0 0
1111001011 0.4 g h i j 0 0
1111001100 0.3 h j 0 0
1111001101 0.4 g h j 0 0
1111001110 0.3 h j 0 0
1111001111 0.2 h j 0 0
1111010001 0.4 b f g h i j 0 0
1111010010 0.5 b f g h i j 0 0
1111010011 0.4 b f g h i j 0 0
1111010100 0.5 b d f g h i j k 0 0
1111010101 0.6 b d f g h i j k 0 0
1111010110 0.5 b d f g h i j k 0 0
1111010111 0.4 b d f g h i j k 0 0
1111011000 0.3 b g h j 0 0
1111011001 0.4 b g h j 0 0
1111011011 0.4 b g h j 0 0
1111011100 0.3 b g h j 0 0
1111011101 0.4 b g h j 0 0
1111011110 0.3 b g h j 0 0
1111100001 0.2 a c e h j l 0 0
1111100010 0.3 a e g h j l 0 0
1111100011 0.2 a e h j l 0 0
1111100100 0.3 g h j l 0 0
1111100101 0.4 g h i j l 0 0
1111100110 0.3 h j l 0 0
1111100111 0.2 h j l 0 0
1111101000 0.3 b d f g h i j l 0 0
1111101001 0.4 b d f g h i j l 0 0
1111101010 0.5 b d f g h i j k l 0 0
1111101011 0.4 b d f g h i j k l 0 0
1111101100 0.3 b g h j l 0 0
1111101101 0.4 b g h j l 0 0
1111101110 0.3 b g h j l 0 0
1111101111 0.2 b g h j l 0 0
1111110001 0.2 a e h j l 0 0
1111110011 0.2 h j l 0 0
1111110100 0.3 b d f g h i j l 0 0
1111110101 0.4 b d f g h i j k l 0 0
1111110110 0.3 b g h j l 0 0
1111110111 0.2 b g h j l 0 0
1111111000 0.1 a e h j l 0 0
1111111100 0.1 h j l 0 0
Totals 2312 141
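
The data rows above are whitespace-separated, with the strand count given only on the first row and the motif code occasionally fused to the inversion coefficient. The following is a minimal Python sketch of one way to read such rows; the names `Row`, `ROW_RE`, `parse_row` and `adjacent_change_ratio` are illustrative assumptions, not part of the dissertation. The sketch also compares each tabulated coefficient with the number of adjacent digit changes divided by the number of strands. The sampled rows agree with that ratio, but this is an observation about the tabulated values only; the definition of the inversion coefficient is the one given in the Results chapter.

```python
import math
import re
from typing import NamedTuple, Tuple


class Row(NamedTuple):
    motif: str               # ten-digit binary motif
    coefficient: float       # value from the Inversion Coefficient column
    codes: Tuple[str, ...]   # motif code letters; empty where the table shows "-"
    matches: int             # number of matches
    files: int               # number of files the matches were found in


# Optional leading strand count, motif, coefficient, codes, two integer counts.
# The \s* after the coefficient tolerates rows where the code is fused to it.
ROW_RE = re.compile(r"^(?:\d+\s+)?([01]{10})\s+(\d\.\d+)\s*(.*?)\s+(\d+)\s+(\d+)$")


def parse_row(line: str) -> Row:
    """Split one table row into typed fields (hypothetical helper)."""
    m = ROW_RE.match(line.strip())
    if m is None:
        raise ValueError(f"unrecognised row: {line!r}")
    motif, coeff, codes, matches, files = m.groups()
    letters = tuple(c for c in codes.split() if c != "-")
    return Row(motif, float(coeff), letters, int(matches), int(files))


def adjacent_change_ratio(motif: str) -> float:
    """Adjacent digit changes divided by motif length (an assumed check,
    not the dissertation's stated definition)."""
    changes = sum(a != b for a, b in zip(motif, motif[1:]))
    return changes / len(motif)


# A few rows copied verbatim from the table, including two fused ones.
SAMPLE = [
    "10 1010101010 0.9 g i k 1916 31",
    "1111111111 0.00h j l 64 11",
    "1010000101 0.6 b f g i 22 3",
    "1000000001 0.2 - 2 1",
    "1001100001 0.4- 0 0",
]

rows = [parse_row(line) for line in SAMPLE]
for row in rows:
    print(row.motif, row.coefficient, adjacent_change_ratio(row.motif), row.codes)
```

Summing the `matches` and `files` fields over every row of the full table, rather than over this sample, is one way to check the Totals line.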


Key


Number of strands: Number of strands in the sheets that were searched for.
Motif searched for: The motifs possible with this number of strands, using the binary notation.
Inversion Coefficient: See the explanation under Statistical analysis in the Results chapter.
Number of matches: The total number of occurrences retrieved for that motif.
In number of files: The number of protein files the matches were found in.
Motif Code:
A motif in the table above carries a code when it contains at least one of the motif units listed next to that code below.
a: 000111. Three stranded parallel unit antiparallel to three stranded parallel unit
b: 000101, 000010, 101000, 010000, 010111, 101111. Three stranded parallel unit adjacent to three stranded antiparallel unit
c: 00001111. Four stranded parallel unit antiparallel to four stranded parallel unit
d: 00001010, 01010000, 10101111, 00000101, 10100000, 01011111. Four stranded parallel unit adjacent to four stranded antiparallel unit
e: 1110000, 0000111, 0001111, 1111000. Three or four stranded parallel unit antiparallel to three or four stranded parallel unit
f: 1110101, 1010111, 0001010, 0101000, 1111010, 0000101, 1010000, 0101111. Three or four stranded parallel unit adjacent to three or four stranded antiparallel unit
g: 101, 010. Three stranded antiparallel unit
h: 111. Three stranded parallel unit
i: 1010, 0101. Four stranded antiparallel unit
j: 1111. Four stranded parallel unit
k: 10101, 01010. Five stranded antiparallel unit
l: 11111. Five stranded parallel unit
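
The code assignment described in the key can be sketched as a substring test: a motif receives a code when any unit listed for that code occurs within its binary string. The Python below is a minimal sketch under that reading; `MOTIF_UNITS` and `motif_codes` are hypothetical names, and the unit lists are copied from the key as printed. It may not reproduce the Motif Code column exactly, because the printed lists do not appear to be exhaustive: for example, 1110000001 is coded a e h in the table although it contains 111000 rather than 000111.

```python
from typing import Dict, List

# Unit patterns for each code, transcribed from the key as printed.
MOTIF_UNITS: Dict[str, List[str]] = {
    "a": ["000111"],
    "b": ["000101", "000010", "101000", "010000", "010111", "101111"],
    "c": ["00001111"],
    "d": ["00001010", "01010000", "10101111",
          "00000101", "10100000", "01011111"],
    "e": ["1110000", "0000111", "0001111", "1111000"],
    "f": ["1110101", "1010111", "0001010", "0101000",
          "1111010", "0000101", "1010000", "0101111"],
    "g": ["101", "010"],
    "h": ["111"],
    "i": ["1010", "0101"],
    "j": ["1111"],
    "k": ["10101", "01010"],
    "l": ["11111"],
}


def motif_codes(motif: str) -> List[str]:
    """Return the codes whose listed units occur as substrings of `motif`.

    Assumption: plain substring containment on the printed patterns only;
    reversed or complemented forms are not added.
    """
    return [
        code
        for code, units in MOTIF_UNITS.items()
        if any(unit in motif for unit in units)
    ]


for motif in ("1010101010", "1010000101", "1011111110", "1000000001"):
    codes = motif_codes(motif)
    print(motif, " ".join(codes) if codes else "-")
```

For these four motifs the sketch gives the same codes as the table (g i k; b f g i; b g h j l; and no code).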




MSc in Information Management, 1998/1999
© University of Sheffield - Department of Information Studies (All Rights Reserved)