[1]
NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA].
STRAIN=O6:H1 / CFT073 / ATCC 700928 / UPEC;
DOI=10.1073/pnas.252529799; PubMed=12471157
Welch R.A., Burland V., Plunkett G. III, Redford P., Roesch P., Rasko D., Buckles E.L., Liou S.-R., Boutin A., Hackett J., Stroud D., Mayhew G.F., Rose D.J., Zhou S., Schwartz D.C., Perna N.T., Mobley H.L.T., Donnenberg M.S., Blattner F.R.;
"Extensive mosaic structure revealed by the complete genome sequence of uropathogenic Escherichia coli.";
Proc. Natl. Acad. Sci. U.S.A. 99:17020-17024(2002).
Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms.
Distributed under the Creative Commons Attribution-NoDerivs License.
Length: 474 AA [This is the length of the unprocessed precursor]
Molecular weight: 50859 Da [This is the MW of the unprocessed precursor]
CRC64: 876ED25B0BE019B9 [This is a checksum on the sequence]
        10         20         30         40         50         60
MQHKLLINGE LVSGEGEKQP VYNPATGDVL LEIAEASAEQ VNAAVRAADA AFAEWGQTTP
        70         80         90        100        110        120
KARAECLLKL ADVIEENGQV FAELESRNCG KPLHSAFNDE IPAIVDVFRF FAGAARCLNG
       130        140        150        160        170        180
LAAGEYLEGH TSMIRRDPLG VVASIAPWNY PLMMAAWKLA PALAAGNCVV LKPSEITPLT
       190        200        210        220        230        240
ALKLAELAKD IFPAGVINVL FGRGKTVGDP LTGHPKVRMV SLTGSIATGE HIISHTAPSI
       250        260        270        280        290        300
KRTHMELGGK APVIVFDDAD IEAVVEGVRT FGYYNAGQDC TAACRIYAQK GIYDTLVEKL
       310        320        330        340        350        360
GAAVATLKSG SPDDESTELG PLSSLAHLER VSKAVEEAKA TGHIKVITGG EKRKGNGYYY
       370        380        390        400        410        420
APTLLAGALQ DDAIVQKEVF GPVVSVTLFD NEEQVVNWAN DSQYGLASSV WTKDVGRAHR
       430        440        450        460        470
VSARLQYGCT WVNTHFMLVS EMPHGGQKLS GYGKDMSLYG LEDYTVVRHV MVKH
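The length and checksum fields can be checked against the sequence block above. The following is a minimal sketch, not the database's own tooling: it assumes the CRC64 field uses the ISO 3309 polynomial x^64 + x^4 + x^3 + x + 1 in reflected form with a zero initial value, and the names `ROWS`, `SEQUENCE`, `POLY`, `build_table`, and `crc64` are hypothetical. The printed checksum should be compared with the CRC64 field rather than assumed to match.

```python
# Residues copied from the sequence block, one string per 60-residue row.
ROWS = [
    "MQHKLLINGE LVSGEGEKQP VYNPATGDVL LEIAEASAEQ VNAAVRAADA AFAEWGQTTP",
    "KARAECLLKL ADVIEENGQV FAELESRNCG KPLHSAFNDE IPAIVDVFRF FAGAARCLNG",
    "LAAGEYLEGH TSMIRRDPLG VVASIAPWNY PLMMAAWKLA PALAAGNCVV LKPSEITPLT",
    "ALKLAELAKD IFPAGVINVL FGRGKTVGDP LTGHPKVRMV SLTGSIATGE HIISHTAPSI",
    "KRTHMELGGK APVIVFDDAD IEAVVEGVRT FGYYNAGQDC TAACRIYAQK GIYDTLVEKL",
    "GAAVATLKSG SPDDESTELG PLSSLAHLER VSKAVEEAKA TGHIKVITGG EKRKGNGYYY",
    "APTLLAGALQ DDAIVQKEVF GPVVSVTLFD NEEQVVNWAN DSQYGLASSV WTKDVGRAHR",
    "VSARLQYGCT WVNTHFMLVS EMPHGGQKLS GYGKDMSLYG LEDYTVVRHV MVKH",
]

# Drop the spaces that separate the 10-residue display blocks.
SEQUENCE = "".join(row.replace(" ", "") for row in ROWS)

# Assumption: reflected representation of x^64 + x^4 + x^3 + x + 1.
POLY = 0xD800000000000000


def build_table():
    """Precompute the 256-entry lookup table for byte-wise CRC updates."""
    table = []
    for i in range(256):
        crc = i
        for _ in range(8):
            crc = (crc >> 1) ^ POLY if crc & 1 else crc >> 1
        table.append(crc)
    return table


TABLE = build_table()


def crc64(seq):
    """Return the CRC-64 of an ASCII sequence as 16 uppercase hex digits.

    Assumption: initial value 0 and no final XOR.
    """
    crc = 0
    for byte in seq.encode("ascii"):
        crc = (crc >> 8) ^ TABLE[(crc ^ byte) & 0xFF]
    return f"{crc:016X}"


print("Length:", len(SEQUENCE))   # compare with the Length field
print("CRC64:", crc64(SEQUENCE))  # compare with the CRC64 field
```

A mismatch in either value would suggest that the sequence was altered in copying or that the assumed checksum variant differs from the one used for the entry.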
Q8FHK7 in FASTA format