RaTG13 Is Too Good To Be True.
---- WIV1 pk RaTG13 (2).
Zhiyan-Le, 2021-04-03.
https://sites.google.com/site/zhiyanleback/2021-1/z20210404-wiv1xratg13
https://zhiyanleback.blogspot.com/p/ratg13-is-too-good-to-be-true.html
WHO China trip and its SARS-2 origin report indicated RaTG13 as the SARS-2
origin from nature. Their conclusion was base on: 1]: RaTG13 has 96.2% identity,
which was done on the single gene alignment level. 2]: There is only one
Amino-Acid difference, which was done on the protein alignment level.
Such conclusion may not stand true. Reason: Alignment and analysis on Codon
level are totally ignored, which may lead to different conclusions. Let’s take
an example, we have two genomic sequences with the same Am-Acid but different
Codons:
First: | ||||||||||||||
Arg | Phe | Glu | Arg | Arg | Ser | Leu | Gly | Ser | Ser | Arg | Pro | Thr | Cys | Cys |
AGG | TTC | GAG | CGC | CGG | AGT | CTC | GGC | TCA | TCC | CGA | CCG | ACT | TGC | TGT |
Second: | ||||||||||||||
Arg | Phe | Glu | Arg | Arg | Ser | Leu | Gly | Ser | Ser | Arg | Pro | Thr | Cys | Cys |
CGT | TTT | GAA | AGG | CGT | TCC | TTA | GGA | TCC | TCA | CGG | CCC | ACC | TGT | TGC |
Their identities on the Am-Acid or protein level is 100%. And their identities
on the single gene alignment level is 56%. See below:
Identities | 25/45(56%) |
Query 1 AGGTTCGAGCGCCGGAGTCTCGGCTCATCCCGACCGACTTGCTGT 45 | |
| || || | || | || || || || || || || || | |
Sbjct 1 CGTTTTGAAAGGCGTTCCTTAGGATCCTCACGGCCCACCTGTTGC 45 |
However, their identities on the Codon level: 0%. More over, their structures
are different:
Which is accurate? In my view, the one on Codon level is. The reason is simple:
When the virus and ACE2 are running, they work on the Codon level. The given
samples may have the same Am-Acid, however, their energy level or power can be
very different due to their different Codons. This can be well explained by the
binary-image Codon Table and by the Buckyball system.
RaTG13 Reality on the Codon Level.
Now using Codon study to see the identities among relevant nCov samples (with
GenBank ID):
• NC_045512: WH-01, basic sample, collected from a patient in Wuhan hospital in
PRC.
• MN996532: bat RaTG13, said as the most possible origin of SARS-2.
• KF367457: WIV1, the first one of lab-product series of SARS-like nCov.
• NC_028824, Bat-2012, natural bat, collected in PRC Yunnan, 2012.
• NC_030886, Bat-2014. natural bat, collected in PRC Yunnan, 2014
If RaTG13 were a bat from nature in 2013, it should behave as the same or very
closely to the two natural bat sample, Bat-2012 and Bat-2014.
Aligning with the basic sample, WH-01 (NC_45512), on the global Codon level, the
RaTG13 has a matching score of 184 and WIV1 has 190. And here is the alignment
of their Codon base-gene occurring frequency & distribution:
Table 01:
Global Codon BP Freq.Distribution (raw data:NIH GenBank, by 2021-02) | |||||
NC_045512 | WIV1 | RaTG13 | Bat-12 | Bat-14 | |
A | 3038.33 | 2884.67 | 2975.33 | 2342.33 | 2552.33 |
C | 1868.00 | 2020.67 | 1836.67 | 1462.67 | 2135.67 |
G | 1983.67 | 2098.00 | 1948.67 | 1936.00 | 2419.00 |
T | 3267.00 | 3099.67 | 3190.33 | 3250.00 | 2946.00 |
total: | 10157 | 10103 | 9951 | 8991 | 10053 |
avrg: | 2539.25 | 2525.75 | 2487.75 | 2247.75 | 2513.25 |
total gap w NC_45512 | -54 | -206 | -1166 | -104 | |
avrg gap w NC_45512 | -14 | -52 | -292 | -26 |
The aligning result: WIV1 has the total gap [-54] and average gap [-14] , and
RaTG13 has that of [-206] and [-53]. Both Bat-2012 and Bat-2014 have much
bigger gaps.
Clearly, WIV1 is the closest to the basic sample, meaning an obvious greater
possibility to be the SARS-2 origin than that of RaTG13, which behaves very
differently from natural bat samples.
Ratg13: Too Good To Be True As The SARS-2 Origin.
This indicator, Codon and its genetic frequencies and distribution, is
important, according to PRC-PLA doctor Chen Wei (a top virology scientist and
Covid-vaccine developer in China; also, she is a leader in charge of medical
treatment during the early stage of the pandemic in PRC Wuhan City), because it
directly tells the similarity or difference regarding affinity, stability and
mutation status and trend, especially genes C & G and their quantity and
distribution.
S-Gene is a key factor re SARS-2 interacts with human body. Let’s do some
studies on the Codon level by borrowing Dr. Chen Wei suggestion. Below are
similarities among the taken samples.
Fig. 01:
Indeed, as Dr. Chen Wei suggested, gene C & G, as well as their quantity and
distribution, play an important role, particularly in the Codon 3rd BP genes
(where base-gene A & T are 0).
It seems that RaTG13 is in a good position to be the SARS-2 origin: Regarding
single gene-aligning closeness to the basic sample NC_045512, WIV1 has a ratio
of 0.9866, and RaTG13 has a ratio of 0.9969. However, comparing with natural
bats (Bat-2012 and Bat-2014), of which one has the ratio of 0.8892 and another
has it of 1.0134, far enough to be role out as the SARS-2 origin by the said
sample. That is, sample RaTG13 behaves very differently from natural bats, but
it does not.
Further, below is the Codon-leveled aligning result:
Table 02:
Codon-Leveled Similarities to NC-045512 | ||||
WIV1 | RaTG13 | Bat-2012 | Bat-2014 | |
Total Occurring | 1260 | 1256 | 1269 | 1132 |
similarities | 78 | 693 | 66 | 94 |
Ratio of similarity/total | 0.0619 | 0.5518 | 0.0520 | 0.0830 |
The sample RaTG13 has a matching score of 693 occurring aligning similarities,
others have it from 87 to 96. The gap between RaTG13 and natural bats is too big
to believe that it comes from nature. In a lab field, however, it is pretty easy
to reach or even to go beyond 693 similarities score. Besides, RaTG13 similarity
ratio is too close to the basic sample but too far from the natural samples. In
fact, when using NIH-BLAST to search RaTG13’s all possible similarities, the
result has no natural bats but in three categories: synthetic construct, clones,
and vaccines, all are lab/man-made work.
In sum, the indicator on Codon level should be in the must-do-list when
searching the SARS-2 origin(s). By using it, the picture is very different from
what the WHO report said. That is, the sample RaTG13 is too perfect to be a
truthful SARS-2 origin from natural bats, rather, it is very likely a
lab-product. In contrast, and by all indicators, particularly on the Codon
level, WIV1 has the closest relations with basic sample NC_044512, that is, WIV1
is the most possible SARS-2 origin.
Reference:
Message from PLA Vaccine Patent. 2021-03-15.
https://sites.google.com/site/zhiyanleback/2021-1/z20210315-patent-message-en
Data Availability
Table 01-02 (supplement)
SARS-2: Global Codon Freq. & Distribution (raw data: NIH GenBank, by 2021-02-22) | |||||||||||
WH-01 | WIV1 | RaTG13 | Bat-12 | Bat-14 | WH-01 | WIV1 | RaTG13 | Bat-12 | Bat-14 | ||
AAA | 284 | 264 | 312 | 132 | 180 | CAA | 230 | 207 | 243 | 129 | 157 |
AAC | 219 | 186 | 200 | 129 | 210 | CAC | 192 | 151 | 152 | 98 | 125 |
AAG | 252 | 239 | 116 | 114 | 213 | CAG | 184 | 165 | 84 | 129 | 183 |
AAT | 188 | 233 | 264 | 150 | 179 | CAT | 181 | 163 | 165 | 125 | 130 |
ACA | 276 | 283 | 235 | 175 | 219 | CCA | 101 | 141 | 113 | 79 | 150 |
ACC | 127 | 135 | 145 | 89 | 162 | CCC | 44 | 43 | 43 | 31 | 75 |
ACG | 62 | 52 | 46 | 60 | 89 | CCG | 27 | 21 | 30 | 28 | 72 |
ACT | 165 | 246 | 247 | 155 | 239 | CCT | 93 | 115 | 107 | 69 | 142 |
AGA | 201 | 149 | 266 | 137 | 106 | CGA | 36 | 27 | 37 | 28 | 33 |
AGC | 109 | 86 | 141 | 92 | 143 | CGC | 29 | 44 | 39 | 23 | 100 |
AGG | 131 | 108 | 113 | 124 | 112 | CGG | 24 | 29 | 28 | 18 | 40 |
AGT | 161 | 143 | 201 | 146 | 162 | CGT | 50 | 79 | 63 | 65 | 108 |
ATA | 180 | 168 | 106 | 191 | 166 | CTA | 274 | 215 | 119 | 243 | 211 |
ATC | 127 | 131 | 104 | 83 | 81 | CTC | 120 | 147 | 84 | 79 | 93 |
ATG | 298 | 357 | 122 | 319 | 311 | CTG | 271 | 248 | 79 | 211 | 203 |
ATT | 295 | 281 | 210 | 220 | 182 | CTT | 276 | 256 | 213 | 200 | 178 |
WH-01 | WIV1 | RaTG13 | Bat-12 | Bat-14 | WH-01 | WIV1 | RaTG13 | Bat-12 | Bat-14 | ||
GAA | 100 | 199 | 182 | 91 | 141 | TAA | 289 | 100 | 319 | 248 | 76 |
GAC | 76 | 145 | 133 | 57 | 143 | TAC | 201 | 183 | 252 | 169 | 214 |
GAG | 97 | 178 | 78 | 69 | 161 | TAG | 211 | 106 | 123 | 142 | 102 |
GAT | 64 | 173 | 185 | 79 | 159 | TAT | 135 | 163 | 279 | 230 | 174 |
GCA | 99 | 187 | 118 | 107 | 191 | TCA | 193 | 215 | 183 | 120 | 130 |
GCC | 47 | 78 | 69 | 53 | 131 | TCC | 79 | 61 | 72 | 66 | 98 |
GCG | 29 | 47 | 19 | 58 | 107 | TCG | 40 | 63 | 33 | 50 | 55 |
GCT | 89 | 261 | 179 | 114 | 260 | TCT | 139 | 187 | 199 | 133 | 165 |
GGA | 74 | 95 | 124 | 66 | 98 | TGA | 258 | 87 | 305 | 251 | 92 |
GGC | 49 | 73 | 89 | 62 | 137 | TGC | 193 | 150 | 244 | 183 | 193 |
GGG | 51 | 50 | 50 | 43 | 70 | TGG | 195 | 110 | 253 | 208 | 179 |
GGT | 62 | 195 | 155 | 118 | 226 | TGT | 286 | 204 | 358 | 288 | 215 |
GTA | 180 | 153 | 129 | 198 | 192 | TTA | 362 | 248 | 220 | 425 | 214 |
GTC | 70 | 106 | 99 | 91 | 115 | TTC | 180 | 157 | 207 | 141 | 102 |
GTG | 279 | 212 | 97 | 266 | 315 | TTG | 366 | 291 | 188 | 425 | 302 |
GTT | 159 | 237 | 215 | 220 | 283 | TTT | 298 | 277 | 368 | 349 | 259 |
Table 02-02 (supplement)
S-Gene: Global Codon Matching Score (raw data: NIH GenBank, 2021-02-22) | |||||||||
WIV1 | RaTG13 | Bat-12 | Bat-14 | WIV1 | RaTG13 | Bat-12 | Bat-14 | ||
AAA | 0 | 0 | 0 | 0 | CAA | 0 | 0 | 0 | 0 |
AAC | 3 | 51 | 4 | 9 | CAC | 0 | 8 | 0 | 0 |
AAG | 2 | 31 | 0 | 0 | CAG | 4 | 24 | 1 | 2 |
AAT | 0 | 0 | 0 | 0 | CAT | 0 | 0 | 0 | 0 |
ACA | 0 | 0 | 0 | 0 | CCA | 0 | 0 | 0 | 0 |
ACC | 7 | 57 | 8 | 14 | CCC | 2 | 37 | 2 | 5 |
ACG | 0 | 0 | 0 | 0 | CCG | 0 | 0 | 0 | 0 |
ACT | 0 | 0 | 0 | 0 | CCT | 0 | 0 | 0 | 0 |
AGA | 0 | 0 | 0 | 0 | CGA | 0 | 0 | 0 | 0 |
AGC | 11 | 52 | 11 | 8 | CGC | 0 | 0 | 0 | 0 |
AGG | 2 | 24 | 2 | 1 | CGG | 0 | 0 | 0 | 0 |
AGT | 0 | 0 | 0 | 0 | CGT | 0 | 0 | 0 | 0 |
ATA | 0 | 0 | 0 | 0 | CTA | 0 | 0 | 0 | 0 |
ATC | 5 | 35 | 3 | 3 | CTC | 0 | 0 | 0 | 0 |
ATG | 1 | 5 | 2 | 2 | CTG | 7 | 59 | 7 | 14 |
ATT | 0 | 0 | 0 | 0 | CTT | 0 | 0 | 0 | 0 |
WIV1 | RaTG13 | Bat-12 | Bat-14 | WIV1 | RaTG13 | Bat-12 | Bat-14 | ||
GAA | 0 | 0 | 0 | 0 | TAA | 0 | 0 | 0 | 0 |
GAC | 2 | 32 | 3 | 1 | TAC | 2 | 33 | 2 | 4 |
GAG | 1 | 22 | 0 | 1 | TAG | 0 | 0 | 0 | 0 |
GAT | 0 | 0 | 0 | 0 | TAT | 0 | 0 | 0 | 0 |
GCA | 0 | 0 | 0 | 0 | TCA | 0 | 0 | 0 | 0 |
GCC | 7 | 38 | 5 | 9 | TCC | 0 | 0 | 0 | 0 |
GCG | 0 | 0 | 0 | 0 | TCG | 0 | 0 | 0 | 0 |
GCT | 0 | 0 | 0 | 0 | TCT | 0 | 0 | 0 | 0 |
GGA | 0 | 0 | 0 | 0 | TGA | 0 | 0 | 0 | 0 |
GGC | 4 | 49 | 4 | 5 | TGC | 5 | 23 | 0 | 3 |
GGG | 0 | 0 | 0 | 0 | TGG | 0 | 7 | 1 | 0 |
GGT | 0 | 0 | 0 | 0 | TGT | 0 | 0 | 0 | 0 |
GTA | 0 | 0 | 0 | 0 | TTA | 0 | 0 | 0 | 0 |
GTC | 0 | 0 | 0 | 0 | TTC | 5 | 46 | 6 | 2 |
GTG | 8 | 60 | 5 | 11 | TTG | 0 | 0 | 0 | 0 |
GTT | 0 | 0 | 0 | 0 | TTT | 0 | 0 | 0 | 0 |
Comments
Post a Comment