nih-gov/www.ncbi.nlm.nih.gov/CBBresearch/Lakshmin/UB/prok_ub_supplement_file_1.html
2025-03-17 02:05:34 +00:00

1955 lines
579 KiB
HTML

<html>
<head>
<!-- -->
<!-- generated by hotgi using the following command -->
<!-- hotgi prok_ub_supplemental_material_for_html.txt -->
<!-- -->
</head>
<body bgcolor="#FFFFFF" link="#003366" vlink="#003366" alink="#003366">
<basefont size="2">
<pre>
Supplementary material- File 1
The prokaryotic antecedents of the Ubiquitin signaling system
and the early evolution of ubiquitin-like ß-grasp domains
Lakshminarayan M. Iyer, A. Maxwell Burroughs and L. Aravind
Presented below are the domain architectures and operon contexts of the different
systems reported in the study. The different groups are represented by the gi of one of the
components from the operons (marked with an asterisk). The operons are usually shown next to the organism name
where "->" signifies gene order from the 5'to 3' direction. Domain architectures are shown with
a '+' separating the domains. Also shown are the species names and the evolutionary group to which
a particular species belongs.
The general order of the major subgroups/operon types follows the order in Table 1.
We also provide alignments of various families described in the study.
--------------------------------------------------------------------------------------------------------------
1A. Classical Thiamine biosynthesis pathway
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
# The gis shown below are ThiE, ThiD and ThiG like proteins (marked with an asterisk); also shown are the length of the protein
GI LENGTH Operon ORGANISM Classification Protein description (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13879931">13879931</a> 222 <-ThiE*||ThiO->ThiS->ThiG-> Mycobacterium tuberculosis CDC1551; actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=66869375">66869375</a> 237 <-ThiG<-ThiS<-ThiO||ThiE*-> Arthrobacter sp. FB24 actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=62424304">62424304</a> 220 <-ThiF<-ThiG<-ThiS<-ThiO||ThiE*-> Brevibacterium linens BL2 actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13092621">13092621</a> 235 ThiC->ThiE-><-ThiG<-ThiS<-ThiO||ThiE*-> Mycobacterium leprae actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71915182">71915182</a> 218 <-ThiG<-ThiS<-ThiO||?->ThiE*-> Thermobifida fusca YX; actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=41409995">41409995</a> 223 <-ThiE*||ThiO->ThiS->ThiG-> Mycobacterium avium subsp. paratuberculosis K-10 actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68263528">68263528</a> 218 <-ThiF<-ThiG<-ThiS<-ThiO<-?<-ThiE*<-ThiC Corynebacterium jeikeium K411 actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86741756">86741756</a> 237 <-ThiE*<-ThiS<-?<-?<-PDOR<-ThiH<-ThiG Frankia sp. CcI3 actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68230362">68230362</a> 229 <-ThiE*<-ThiS<-?<-?<-PDOR<-ThiH<-ThiG Frankia sp. EAN1pec actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=54018822">54018822</a> 232 <-Mopterin_binding_protein<-ThiG<-ThiS<-ThiO||ThiE*-> Nocardia farcinica IFM 10152 actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23493774">23493774</a> 216 ThiE*->ThiO->ThiS->ThiG->ThiF-> Corynebacterium efficiens YS-314 actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=38198930">38198930</a> 222 ThiC->ThiE*->ThiO->ThiS->ThiG->ThiF->ThiD-> Corynebacterium diphtheriae actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13092617">13092617</a> 279 ThiC->ThiD*-><-ThiG<-ThiS<-ThiO||ThiE-> Mycobacterium leprae actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46191094">46191094</a> 289 <-ThiF<-ThiG*<-ThiS Bifidobacterium longum DJO10A actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=5689919">5689919</a> 264 <-PDOR||ThiO->ThiS->ThiG*-> Streptomyces coelicolor A3(2) actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85666191">85666191</a> 304 <-ThiG*<-ThiF<-ThiS Bifidobacterium adolescentis actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71367701">71367701</a> 197 ThiE->ThiO->ThiS->ThiG->ThiE*->ThiD->ThiC-> Nocardioides sp. JS614 actinobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71481458">71481458</a> 259 moaA-><-?||?->OAHSH->ThiS->ThiG*->ThiH->ThiF-> Prosthecochloris vibrioformis DSM 265 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=34541689">34541689</a> 259 <-ThiH<-ThiG*<-?<-ThiC<-ThiS Porphyromonas gingivalis W83 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68550329">68550329</a> 259 <-ThiF<-ThiH<-ThiG*<-ThiS<-OAHSH<-Cysteine_synthase<-permease Pelodictyon phaeoclathratiforme BU-1 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67939245">67939245</a> 259 ThiS->ThiG*->ThiH-> Chlorobium phaeobacteroides BS1 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21646637">21646637</a> 259 <-ThiF<-ThiH<-ThiG*<-ThiS<-Cysteine_synthase<-OAHSH Chlorobium tepidum TLS bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67935885">67935885</a> 259 <-ThiF<-ThiH<-ThiG*<-ThiS<-OAHSH||?-><-moaA Chlorobium phaeobacteroides DSM 266 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78167321">78167321</a> 275 OAHSH->ThiS->ThiG*->ThiH->ThiF-> Pelodictyon luteolum DSM 273 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78171432">78171432</a> 256 OAHSH->?->ThiS->ThiG*->ThiH->?->ThiF-> Chlorobium chlorochromatii CaD3 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67919395">67919395</a> 259 moaA-><-?||?->OAHSH->OAHSH->ThiS->ThiG*->ThiH->ThiF-> Chlorobium limicola DSM 245 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=60493472">60493472</a> 204 ThiS->ThiE*->ThiG->ThiC->?->ThiH-> Bacteroides fragilis NCTC 9343 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52216685">52216685</a> 204 ThiS->ThiE*->ThiG->ThiC->?->ThiH-> Bacteroides fragilis YCH46 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83758120">83758120</a> 210 ThiO->ThiS->ThiG->ThiE*->ThiE-> Salinibacter ruber DSM 13855 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48855690">48855690</a> 203 ThiS->ThiC->ThiD->ThiE*->ThiG->ThiH-> Cytophaga hutchinsonii bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83755862">83755862</a> 290 ThiO->ThiS->ThiG->ThiE->ThiE*-> Salinibacter ruber DSM 13855 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29337956">29337956</a> 209 <-ThiF<-ThiH<-ThiC<-ThiG<-ThiE*<-ThiS Bacteroides thetaiotaomicron VPI-5482 bacteroidetes/chlorobi
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33238326">33238326</a> 346 ThiE*->ThiS-> Prochlorococcus marinus subsp. marinus str. CCMP1375 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=87125481">87125481</a> 348 ThiE*->ThiS-> Synechococcus sp. RS9917 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33639113">33639113</a> 349 ThiE*->ThiS-> Synechococcus sp. WH 8102 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86605751">86605751</a> 257 <-ThiG*<-ThiS<-ThiO Synechococcus sp. JA-3-3Ab cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17130690">17130690</a> 379 ThiE*->ThiS-> Nostoc sp. PCC 7120; cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=35210964">35210964</a> 366 <-ThiS<-ThiE* Gloeobacter violaceus PCC 7421; cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67922607">67922607</a> 338 ThiE*->ThiS-> Crocosphaera watsonii WH 8501 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72002529">72002529</a> 350 ThiE*->ThiS-> Prochlorococcus marinus str. NATL2A; cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33634552">33634552</a> 353 <-ThiS<-ThiE* Prochlorococcus marinus str. MIT 9313 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71674938">71674938</a> 360 <-ThiS<-ThiE* Trichodesmium erythraeum IMS101; cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33640198">33640198</a> 351 ThiE*->ThiS-> Prochlorococcus marinus subsp. pastoris str. CCMP1986 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78713251">78713251</a> 365 ThiE*->ThiS-> Prochlorococcus marinus str. MIT 9312; cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84512362">84512362</a> 343 <-ThiS<-ThiE* Prochlorococcus marinus str. MIT 9211 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56685459">56685459</a> 343 <-ThiS<-ThiE* Synechococcus elongatus PCC 6301 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78169363">78169363</a> 346 ThiE*->ThiS-> Synechococcus sp. CC9902 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78196899">78196899</a> 352 <-ThiS<-ThiE* Synechococcus sp. CC9605 cyanobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=66797755">66797755</a> 221 ThiC->ThiE*->ThiS->ThiG->?->ThiD-> Deinococcus geothermalis DSM 11300 deinococci
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=55772056">55772056</a> 206 ThiE*->ThiS->ThiG->?->ThiC->?->ThiD-> Thermus thermophilus HB8 deinococci
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=6460491">6460491</a> 280 <-permease<-?<-?<-ThiD<-ThiG<-ThiS<-ThiE*<-ThiC Deinococcus radiodurans R1 deinococci
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82744798">82744798</a> 256 Mopterin_binding_protein->?->ThiS->ThiG*->ThiH-> Clostridium beijerincki NCIMB 8052 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72496362">72496362</a> 218 <-ThiF<-ThiG<-ThiS<-ThiO<-ThiE* Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83590499">83590499</a> 255 ThiS->ThiG*->ThiH-> Moorella thermoacetica ATCC 39073 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68055200">68055200</a> 198 <-ThiF<-ThiG<-ThiS<-ThiO<-ThiE* Exiguobacterium sp. 255-15 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15025970">15025970</a> 195 <-ThiE*<-ThiH<-ThiG<-ThiF<-ThiS Clostridium acetobutylicum ATCC 824; firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77996134">77996134</a> 215 ThiS->ThiG->ThiH->ThiF->ThiE*-> Carboxydothermus hydrogenoformans Z-2901 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82499658">82499658</a> 219 ThiS->ThiG->ThiH->ThiF->ThiE*->ThiC-> Caldicellulosiruptor saccharolyticus DSM 8903 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2633520">2633520</a> 205 ThiE*->ThiO->ThiS->ThiG->ThiF->ThiD-> Bacillus subtilis subsp. subtilis str. 168; firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52002880">52002880</a> 203 ThiE*->ThiO->ThiS->ThiG->ThiF->ThiD-> Bacillus licheniformis ATCC 14580 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10174048">10174048</a> 211 ThiE*->ThiS->ThiG->ThiO->ThiD-> Bacillus halodurans C-125 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68446290">68446290</a> 197 <-ThiF<-ThiG<-ThiS<-ThiO<-ThiE* Staphylococcus haemolyticus JCSC1435 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57865486">57865486</a> 152 ThiE*->ThiO->ThiS->ThiG->ThiF-> Staphylococcus epidermidis RP62A firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23023751">23023751</a> 212 <-ThiG<-ThiF<-ThiS<-ThiE* Leuconostoc mesenteroides subsp. mesenteroides ATCC 8293 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56909741">56909741</a> 209 ThiE*->ThiS->ThiG->ThiO->ThiD-> Bacillus clausii KSM-K16 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77683441">77683441</a> 196 ThiS->ThiF->ThiG->ThiH->ThiC->ThiE*-> Alkaliphilus metalliredigenes QYMF firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67875149">67875149</a> 356 <-ThiC<-ThiE*<-ThiF<-ThiH<-ThiG<-ThiS Clostridium thermocellum ATCC 27405 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18145262">18145262</a> 193 <-ThiE*<-ThiH<-ThiG<-ThiF<-ThiS Clostridium perfringens str. 13 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=47501161">47501161</a> 206 Mopterin_binding_protein->?->?->ThiE*->ThiO->ThiS->ThiG->ThiF->ThiD-> Bacillus anthracis str. 'Ames Ancestor' firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56378999">56378999</a> 201 ThiE*->ThiO->ThiS->ThiG->ThiF-> Geobacillus kaustophilus HTA426 firmicutes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=19712999">19712999</a> 206 <-ThiE<-ThiH<-ThiG<-ThiF<-ThiS<-ThiC<-ThiE*<-ThiD Fusobacterium nucleatum subsp. nucleatum ATCC 25586; fusobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=32397912">32397912</a> 287 <-ThiG*||?-><-ThiS Rhodopirellula baltica SH 1 planctomycetes
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=27354938">27354938</a> 208 ThiO->ThiS->ThiG->ThiE*->ThiC-> Bradyrhizobium japonicum USDA 110; proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78495123">78495123</a> 202 <-ThiC<-ThiE*<-ThiG<-ThiS<-ThiO Rhodopseudomonas palustris BisB18 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69299787">69299787</a> 198 ThiD->ThiO->ThiS->ThiG->ThiE*->ThiF-> Silicibacter sp. TM1040 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56676713">56676713</a> 198 ThiD->ThiO->ThiS->ThiG->ThiE*->ThiF-> Silicibacter pomeroyi DSS-3 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83751112">83751112</a> 201 <-ThiD<-ThiE*<-ThiG<-ThiS<-ThiO<-ThiC Bartonella bacilliformis KC583; proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=49238087">49238087</a> 252 <-ThiD*<-ThiE<-ThiG<-ThiS<-ThiO<-ThiC||?-><-Mopterin_binding_protein Bartonella henselae str. Houston-1 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84501018">84501018</a> 198 <-ThiF<-ThiE*<-ThiG<-ThiS<-ThiO<-ThiD Oceanicola batsensis HTCC2597 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85705980">85705980</a> 198 <-ThiF<-ThiE*<-ThiG<-ThiS<-ThiO<-ThiD Roseovarius sp. 217 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83952604">83952604</a> 203 ThiC->ThiO->ThiS->ThiG->ThiE*->ThiF->ThiD-> Roseovarius nubinhibens ISM proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86137738">86137738</a> 196 ThiC->ThiO->ThiS->ThiG->ThiE*->ThiF->ThiD-> Roseobacter sp. MED193 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=39650494">39650494</a> 202 ThiO->ThiS->ThiG->ThiE*->ThiC-> Rhodopseudomonas palustris CGA009 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17983764">17983764</a> 203 ThiD->ThiO->ThiS->ThiG->ThiE*->ThiC Brucella melitensis 16M; proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83954398">83954398</a> 198 <-ThiF<-ThiE*<-ThiG<-ThiS<-ThiO<-ThiD Sulfitobacter sp. NAS-14.1 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71062546">71062546</a> 257 ThiS->ThiG*-> Candidatus Pelagibacter ubique HTCC1062 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69926560">69926560</a> 208 <-ThiC<-?<-ThiE*<-ThiG<-ThiS<-ThiO Nitrobacter hamburgensis X14 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68192290">68192290</a> 206 ThiC->ThiO->ThiS->ThiG->ThiE*->ThiD-><-OmpA Mesorhizobium sp. BNC1 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=14025575">14025575</a> 201 <-ThiD<-ThiE*<-ThiG<-ThiS<-ThiO<-ThiC Mesorhizobium loti MAFF303099 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23011961">23011961</a> 189 ThiO->ThiS->ThiG->ThiE*-> Magnetospirillum magnetotacticum MS-1; proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13423327">13423327</a> 269 <-ThiG*<-ThiS Caulobacter crescentus CB15 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84703985">84703985</a> 259 <-phosphatidylglycerophosphate_synthase<-?<-?||ThiS->ThiG*-> Parvularcula bermudensis HTCC2503 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17741060">17741060</a> 257 <-ThiG*<-ThiS<-ThiO<-ThiC Agrobacterium tumefaciens str. C58 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=58417134">58417134</a> 266 <-ThiG*<-ThiS Ehrlichia ruminantium str. Gardel proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56416397">56416397</a> 264 ThiS->ThiG*-> Anaplasma marginale str. St. Maries proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83858498">83858498</a> 262 <-ThiG*<-ThiS Oceanicaulis alexandrii HTCC2633 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68538042">68538042</a> 256 <-ThiG*<-ThiS Sphingopyxis alaskensis RB2256 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78698311">78698311</a> 202 <-ThiC<-ThiE*<-ThiG<-ThiS<-ThiO Bradyrhizobium sp. BTAi1 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72394551">72394551</a> 261 <-ThiG*<-ThiS Ehrlichia canis str. Jake proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74022860">74022860</a> 312 ThiE-><-?||?-><-ThiE*<-ThiG<-ThiS<-ThiO Rhodoferax ferrireducens DSM 15236 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74019423">74019423</a> 374 ThiO->ThiS->ThiG->ThiE*->Mopterin_binding_protein-> Burkholderia ambifaria AMMD; proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=7227331">7227331</a> 205 ThiO->ThiE*->ThiS->ThiG-> Neisseria meningitidis MC58 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84713028">84713028</a> 270 ThiC->ThiO->ThiS->ThiG->ThiE*-> Polaromonas naphthalenivorans CJ2 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72117331">72117331</a> 290 ThiC->ThiO->ThiS->ThiG->ThiE->?-><-?<-ThiD*||?->?->?->?->ThiS-> Ralstonia eutropha JMP134 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=30138189">30138189</a> 268 <-methylase<-ThiG*<-ThiS Nitrosomonas europaea ATCC 19718 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82701205">82701205</a> 264 <-methylase<-ThiG*<-ThiS Nitrosospira multiformis ATCC 25196 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71849093">71849093</a> 260 <-methylase<-ThiG*<-ThiS<-ADH Dechloromonas aromatica RCB proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68554870">68554870</a> 276 ThiC->ThiO->ThiS->ThiG->ThiE->?-><-?<-ThiE*||?->?->?->?->ThiS-> Ralstonia metallidurans CH34 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=34499221">34499221</a> 264 <-ThiG*<-ThiS Chromobacterium violaceum ATCC 12472 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=47571796">47571796</a> 176 ThiC->ThiO->ThiS->ThiG->ThiD*-> Rubrivivax gelatinosus PM1 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17427116">17427116</a> 383 <-ThiE*<-ThiG||?-><-ThiS<-ThiO<-ThiC Ralstonia solanacearum; proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74318144">74318144</a> 262 <-methylase<-ThiG*<-ThiS Thiobacillus denitrificans ATCC 25259 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68212742">68212742</a> 259 <-ThiG*<-ThiS Methylobacillus flagellatus KT proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77544040">77544040</a> 206 ThiC->ThiS->ThiG->ThiH->?->ThiE*-> Pelobacter carbinolicus DSM 2380 proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78219006">78219006</a> 214 ThiS->ThiG->ThiH->ThiF->ThiE*-> Desulfovibrio desulfuricans G20; proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86158938">86158938</a> 203 <-ThiE*||?-><-ThiG<-ThiS Anaeromyxobacter dehalogenans 2CP-C proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71836232">71836232</a> 223 <-ThiE*<-ThiG<-ThiS Pelobacter propionicus DSM 2379 proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71545062">71545062</a> 222 ThiF->ThiG->ThiH->ThiS->ThiE*->?-><-?<-Mopterin_binding_protein Syntrophobacter fumaroxidans MPOB proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85859826">85859826</a> 229 <-ThiF<-ThiH<-ThiG<-ThiS<-ThiE*||?-><-?||?->Cysteine_synthase-> Syntrophus aciditrophicus SB proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=39982458">39982458</a> 213 <-ThiE*<-ThiG<-ThiS Geobacter sulfurreducens PCA proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78195386">78195386</a> 213 ThiS->ThiG->ThiE*-> Geobacter metallireducens GS-15 proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68178162">68178162</a> 203 ThiF->ThiS->ThiG->ThiH->ThiE*-> Desulfuromonas acetoxidans DSM 684; proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=50876628">50876628</a> 263 ThiS->ThiG*->ThiH-> Desulfotalea psychrophila LSv54 proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77544304">77544304</a> 208 <-ThiE*<-ThiH<-ThiG<-ThiS<-ThiF Pelobacter carbinolicus DSM 2380 proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46449915">46449915</a> 226 <-ThiE*<-ThiF<-ThiH<-ThiG<-ThiS Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough proteobacteria>deltaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57166733">57166733</a> 201 <-ThiE*<-ThiH<-ThiG<-ThiF<-ThiS Campylobacter jejuni RM1221 proteobacteria>epsilonproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57240558">57240558</a> 200 ThiS->ThiF->ThiG->ThiH->ThiE*-> Campylobacter lari RM2100 proteobacteria>epsilonproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86155162">86155162</a> 253 ThiS->ThiF->ThiG*->ThiH-> Campylobacter fetus subsp. fetus 82-40 proteobacteria>epsilonproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56178885">56178885</a> 504 ThiC->ThiD+ThiE*->ThiF->ThiS->ThiG->ThiH-> Idiomarina loihiensis L2TR; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83643050">83643050</a> 487 ThiC->ThiO->ThiS->ThiG->ThiD+ThiE*-> Hahella chejuensis KCTC 2396; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=45437723">45437723</a> 229 ThiC->ThiE*->ThiF->ThiS->ThiG->ThiH-> Yersinia pestis biovar Medievalis str. 91001 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=51587926">51587926</a> 215 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC Yersinia pseudotuberculosis IP 32953 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=12518922">12518922</a> 211 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC Escherichia coli O157:H7 EDL933; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=49609718">49609718</a> 213 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC Erwinia carotovora subsp. atroseptica SCRI1043 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77960646">77960646</a> 217 ThiC->ThiE*->ThiF->ThiS->ThiG->ThiH-> Yersinia mollaretii ATCC 43969 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75855406">75855406</a> 471 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC<-CcrB Vibrio sp. Ex25; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68542221">68542221</a> 650 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiD+ThiE*<-ThiC Shewanella baltica OS155; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29540946">29540946</a> 479 ThiC->ThiO->ThiS->ThiG->ThiD+ThiE*-> Coxiella burnetii RSA 493; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71145222">71145222</a> 529 ThiC->ThiO->ThiS->ThiG->ThiD+ThiE*-> Colwellia psychrerythraea 34H; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69953446">69953446</a> 559 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiD+ThiE*<-ThiC Shewanella frigidimarina NCIMB 400 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=53751266">53751266</a> 488 ThiO->ThiS->ThiG->ThiD+ThiE*->ThiF-> Legionella pneumophila str. Paris; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=36783918">36783918</a> 216 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC Photorhabdus luminescens subsp. laumondii TTO1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=87119893">87119893</a> 203 <-ThiE*<-ThiG<-ThiS<-ThiO<-?<-?<-?<-Mopterin_binding_protein Marinomonas sp. MED121 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76874359">76874359</a> 508 ThiC->ThiO->ThiS->ThiG->ThiD+ThiE*->ThiF-> Pseudoalteromonas haloplanktis TAC125; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78362775">78362775</a> 218 <-ThiE<-ThiE*<-ThiG<-ThiS<-ThiO<-ThiC Thiomicrospira crunogena XCL-2 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=28808052">28808052</a> 444 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC<-CcrB Vibrio parahaemolyticus RIMD 2210633 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77977810">77977810</a> 226 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC Yersinia intermedia ATCC 29909 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77972243">77972243</a> 226 ThiC->ThiE*->ThiF->ThiS->ThiG->ThiH-> Yersinia frederiksenii ATCC 33641 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68514852">68514852</a> 525 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC Shewanella amazonensis SB2B proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77957006">77957006</a> 216 ThiC->ThiE*->ThiF->ThiS->ThiG->ThiH-> Yersinia bercovieri ATCC 43970 ; proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=37200142">37200142</a> 444 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC<-CcrB Vibrio vulnificus YJ016 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69156747">69156747</a> 613 ThiC->ThiE*->ThiF->ThiS->ThiG->ThiH-> Shewanella denitrificans OS217 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84393668">84393668</a> 430 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC<-CcrB Vibrio splendidus 12B01 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78366585">78366585</a> 581 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiD+ThiE*<-ThiC Shewanella sp. PV-4 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78362774">78362774</a> 281 <-ThiD*<-ThiE<-ThiG<-ThiS<-ThiO<-ThiC Thiomicrospira crunogena XCL-2 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9654457">9654457</a> 440 CcrB->ThiC->ThiE*->ThiF->ThiS->ThiG->ThiH-> Vibrio cholerae O1 biovar eltor str. N16961 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=16422721">16422721</a> 211 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiE*<-ThiC Salmonella typhimurium LT2 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=24373991">24373991</a> 526 <-ThiH<-ThiG<-ThiS<-ThiF<-ThiD+ThiE*<-ThiC Shewanella oneidensis MR-1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76793993">76793993</a> 276 <-ThiF<-?<-ThiG*<-ThiS<-ThiO<-ThiC Pseudoalteromonas atlantica T6c proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=49530466">49530466</a> 261 <-ThiG*<-ThiS Acinetobacter sp. ADP1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=26991780">26991780</a> 270 <-methylase<-ThiG*<-ThiS<-?||?-><-?<-?<-Mopterin_binding_protein Pseudomonas putida KT2440 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71555612">71555612</a> 264 <-methylase<-ThiG*<-ThiS<-?||?-><-?<-?<-Mopterin_binding_protein Pseudomonas syringae pv. phaseolicola 1448A proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67677083">67677083</a> 266 ThiS->ThiG*->methylase-> Chromohalobacter salexigens DSM 3043 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68347434">68347434</a> 264 <-methylase<-ThiG*<-ThiS<-?||?-><-?<-?<-Mopterin_binding_protein Pseudomonas fluorescens Pf-5 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77953947">77953947</a> 269 <-ThiG*<-ThiS Marinobacter aquaeolei VT8 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67154906">67154906</a> 264 <-methylase<-ThiG*<-ThiS<-?||?-><-JAB Azotobacter vinelandii AvOP proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78701989">78701989</a> 262 ThiS->ThiG*->methylase-> Alkalilimnicola ehrlichei MLHE-1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48862780">48862780</a> 269 ThiO->ThiS->ThiG*->?->?->Mopterin_binding_protein-> Microbulbifer degradans 2-40 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21109645">21109645</a> 264 ThiS->ThiG*->methylase-> Xanthomonas axonopodis pv. citri str. 306 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9105679">9105679</a> 275 ThiS->ThiG*->methylase-> Xylella fastidiosa 9a5c proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71900706">71900706</a> 275 <-methylase<-ThiG*<-ThiS Xylella fastidiosa Ann-1 proteobacteria>gammaproteobacteria
Bacterial ThiSs fused to ThiG (Gis are of the ThiS+ThiG protein-marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68512207">68512207</a> 252 ThiE->ThiS+ThiG*-> Rubrobacter xylanophilus DSM 9941; actinobacteria Thiamine monophosphate synthase [Rubrobacter xylanophilus DSM 9941]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=79039407">79039407</a> 331 ThiS+ThiG* Novosphingobium aromaticivorans DSM 12444; proteobacteria>alphaproteobacteria similar to Uncharacterized enzyme of thiazole biosynthesis [Novosphingobium aromaticivorans DSM 12444]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56551634">56551634</a> 331 ThiS+ThiG* Zymomonas mobilis subsp. mobilis ZM4; proteobacteria>alphaproteobacteria thiazole biosynthesis protein [Zymomonas mobilis subsp. mobilis ZM4]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84788478">84788478</a> 332 ThiS+ThiG* Erythrobacter litoralis HTCC2594; proteobacteria>alphaproteobacteria thiazole biosynthesis protein [Erythrobacter litoralis HTCC2594]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85709842">85709842</a> 333 ThiS+ThiG* Erythrobacter sp. NAP1; proteobacteria>alphaproteobacteria thiazole biosynthesis protein [Erythrobacter sp. NAP1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83576681">83576681</a> 334 ThiS+ThiG* Rhodospirillum rubrum ATCC 11170; proteobacteria>alphaproteobacteria ThiS, thiamine-biosynthesis [Rhodospirillum rubrum ATCC 11170]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76883424">76883424</a> 347 ThiS+ThiG* Nitrosococcus oceani ATCC 19707; proteobacteria>gammaproteobacteria ThiS, thiamine-biosynthesis [Nitrosococcus oceani ATCC 19707]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68246504">68246504</a> 326 ThiS+ThiG* Magnetococcus sp. MC-1; proteobacteria ThiS, thiamine-biosynthesis [Magnetococcus sp. MC-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46202840">46202840</a> 330 ThiS+ThiG* Magnetospirillum magnetotacticum MS-1; proteobacteria>alphaproteobacteria COG2022: Uncharacterized enzyme of thiazole biosynthesis [Magnetospirillum magnetotacticum MS-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=53758359">53758359</a> 326 ThiS+ThiG* Methylococcus capsulatus str. Bath; proteobacteria>gammaproteobacteria thiamine biosynthesis protein ThiS [Methylococcus capsulatus str. Bath]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82701206">82701206</a> 162 ThiS->ThiG*-> Nitrosospira multiformis ATCC 25196; proteobacteria>betaproteobacteria thiamine biosynthesis protein ThiS [Nitrosospira multiformis ATCC 25196]
Archaeal ThiS solos (Gis are for the ThiS protein -marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon (no particularly conserved operons were detected) ORGANISM (gis are of the ThiS protein) Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48425680">48425680</a> 77 Pyrococcus furiosus DSM 3638 euryarchaeota A Chain A, Backbone Solution Structure Of Mixed AlphaBETA PROTEIN Pf1061
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33359535">33359535</a> 71 Pyrococcus furiosus DSM 3638 euryarchaeota sulfur carrier protein ThiS [Pyrococcus furiosus DSM 3638]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18893126">18893126</a> 69 Pyrococcus furiosus DSM 3638 euryarchaeota hypothetical protein [Pyrococcus furiosus DSM 3638]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=19916735">19916735</a> 77 Methanosarcina acetivorans C2A euryarchaeota predicted protein [Methanosarcina acetivorans C2A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=19916952">19916952</a> 77 Methanosarcina acetivorans C2A euryarchaeota predicted protein [Methanosarcina acetivorans C2A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13540947">13540947</a> 68 Thermoplasma volcanium GSS1 euryarchaeota hypothetical protein TVN0116 [Thermoplasma volcanium GSS1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=14324330">14324330</a> 64 Thermoplasma volcanium GSS1 euryarchaeota hypothetical protein [Thermoplasma volcanium GSS1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10581690">10581690</a> 174 Halobacterium sp. NRC-1 euryarchaeota Vng<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2279">2279</a>h [Halobacterium sp. NRC-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18893610">18893610</a> 73 Pyrococcus furiosus DSM 3638 euryarchaeota hypothetical protein [Pyrococcus furiosus DSM 3638]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2622875">2622875</a> 70 Methanothermobacter thermautotrophicus str. Delta H euryarchaeota unknown [Methanothermobacter thermautotrophicus str. Delta H]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=19917335">19917335</a> 70 Methanosarcina acetivorans C2A euryarchaeota predicted protein [Methanosarcina acetivorans C2A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21226239">21226239</a> 70 Methanosarcina mazei Go1 euryarchaeota hypothetical protein MM0137 [Methanosarcina mazei Go1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68211447">68211447</a> 70 Methanococcoides burtonii DSM 6242 euryarchaeota hypothetical protein MburDRAFT_0612 [Methanococcoides burtonii DSM 6242]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72398144">72398144</a> 70 Methanosarcina barkeri str. fusaro euryarchaeota conserved hypothetical protein [Methanosarcina barkeri str. fusaro]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33356745">33356745</a> 69 Pyrococcus abyssi GE5 euryarchaeota sulfur carrier protein ThiS [Pyrococcus abyssi GE5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88951090">88951090</a> 69 Methanosaeta thermophila PT euryarchaeota conserved hypothetical protein [Methanosaeta thermophila PT]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48430257">48430257</a> 64 Picrophilus torridus DSM 9790 euryarchaeota hypothetical protein PTO0537 [Picrophilus torridus DSM 9790]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=44920975">44920975</a> 64 Methanococcus maripaludis S2 euryarchaeota hypothetical protein [Methanococcus maripaludis S2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10640784">10640784</a> 67 Thermoplasma acidophilum euryarchaeota hypothetical protein [Thermoplasma acidophilum]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=11498344">11498344</a> 67 Archaeoglobus fulgidus DSM 4304 euryarchaeota hypothetical protein AF0737 [Archaeoglobus fulgidus DSM 4304]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=14591747">14591747</a> 67 Pyrococcus horikoshii OT3 euryarchaeota sulfur carrier protein ThiS [Pyrococcus horikoshii OT3]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57159352">57159352</a> 67 Thermococcus kodakarensis KOD1 euryarchaeota sulfur transfer protein involved in thiamine biosynthesis [Thermococcus kodakarensis KOD1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=55379215">55379215</a> 66 Haloarcula marismortui ATCC 43049 euryarchaeota hypothetical protein rrnAC2563 [Haloarcula marismortui ATCC 43049]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76801103">76801103</a> 66 Natronomonas pharaonis DSM 2160 euryarchaeota homolog to thiamine biosynthesis protein ThiS (probable sulfur donor) [Natronomonas pharaonis DSM 2160]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84489151">84489151</a> 66 Methanosphaera stadtmanae DSM 3091 euryarchaeota hypothetical protein Msp_0330 [Methanosphaera stadtmanae DSM 3091]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68141055">68141055</a> 64 Ferroplasma acidarmanus Fer1 euryarchaeota conserved hypothetical protein [Ferroplasma acidarmanus Fer1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15622711">15622711</a> 68 Sulfolobus tokodaii str. 7 crenarchaeota 68aa long conserved hypothetical protein [Sulfolobus tokodaii str. 7]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68568033">68568033</a> 68 Sulfolobus acidocaldarius DSM 639 crenarchaeota conserved Archaeal protein [Sulfolobus acidocaldarius DSM 639]
B. Variant Thiamine biosynthesis pathway (Gis are for the ThiS+ThiF protein- marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM (gis are of the ThiS+ThiF protein) Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57240561">57240561</a> 265 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiE-> Campylobacter lari RM2100 proteobacteria>epsilonproteobacteria HesA/MoeB/ThiF family protein [Campylobacter lari RM2100]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57168916">57168916</a> 266 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiE-> Campylobacter coli RM2228 proteobacteria>epsilonproteobacteria HesA/MoeB/ThiF family protein [Campylobacter coli RM2228]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57166736">57166736</a> 267 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiE-> Campylobacter jejuni RM1221 proteobacteria>epsilonproteobacteria thiamine biosynthesis protein ThiF [Campylobacter jejuni RM1221]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86152451">86152451</a> 267 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiE-> Campylobacter jejuni subsp. jejuni HB93-13 proteobacteria>epsilonproteobacteria thiamine biosynthesis protein ThiF [Campylobacter jejuni subsp. jejuni HB93-13]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86150511">86150511</a> 267 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiE-> Campylobacter jejuni subsp. jejuni CF93-6 proteobacteria>epsilonproteobacteria thiamine biosynthesis protein ThiF [Campylobacter jejuni subsp. jejuni CF93-6]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86150854">86150854</a> 267 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiE-> Campylobacter jejuni subsp. jejuni 260.94 proteobacteria>epsilonproteobacteria thiamine biosynthesis protein ThiF [Campylobacter jejuni subsp. jejuni 260.94]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=87132835">87132835</a> 267 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiE-> Campylobacter jejuni subsp. jejuni 84-25 proteobacteria>epsilonproteobacteria COG0476: Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Campylobacter jejuni subsp. jejuni 84-25]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71837115">71837115</a> 267 OAHShyd->OAHShyd->Cyssynthase->ThiS+ThiF*-> (operon gene displacement) Pelobacter propionicus DSM 2379 proteobacteria>deltaproteobacteria UBA/THIF-type NAD/FAD binding fold [Pelobacter propionicus DSM 2379]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77544308">77544308</a> 268 ThiS+ThiF*->ThiS->ThiG->ThiH->ThiE-> Pelobacter carbinolicus DSM 2380 proteobacteria>deltaproteobacteria molybdopterin biosynthesis protein MoeB [Pelobacter carbinolicus DSM 2380]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68178158">68178158</a> 272 ThiS+ThiF*->ThiS->ThiG->ThiH->ThiE-> Desulfuromonas acetoxidans DSM 684 proteobacteria>deltaproteobacteria UBA/THIF-type NAD/FAD binding fold [Desulfuromonas acetoxidans DSM 684]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18145265">18145265</a> 269 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiE-> Clostridium perfringens str. 13 firmicutes probable molybdopterin biosynthesis protein [Clostridium perfringens str. 13]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82748786">82748786</a> 267 ThiS+ThiF*->ThiE-> Clostridium beijerincki NCIMB 8052 firmicutes UBA/THIF-type NAD/FAD binding fold [Clostridium beijerincki NCIMB 8052]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=28203841">28203841</a> 267 ThiD->ThiM->ThiE->ThiS+ThiF*->ThiG->ThiH-> Clostridium tetani E88 firmicutes molybdopterin biosynthesis protein moeB [Clostridium tetani E88]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77683437">77683437</a> 268 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiC->ThiE-> Alkaliphilus metalliredigenes QYMF firmicutes UBA/THIF-type NAD/FAD binding fold [Alkaliphilus metalliredigenes QYMF]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15025973">15025973</a> 266 ThiS->ThiS+ThiF*->ThiG->ThiH->ThiE-> Clostridium acetobutylicum ATCC 824 firmicutes AE007789_11 Dinucleotide-utilizing enzyme involved in molybdopterin/thiamine biosynthesis [Cl ostridium acetobutylicum ATCC 824]
Thiamine biosynthesis pathways in operons with a Cys synthase (gis are for the Cys synthase (Cys syn)- marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85859830">85859830</a> 295 ThiF->ThiH->ThiG->ThiS<-?->?<-UbiA->Cys synthase*-> Syntrophus aciditrophicus SB proteobacteria>deltaproteobacteria cysteine synthase [Syntrophus aciditrophicus SB]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21646639">21646639</a> 310 trans sulf->Cys synthase*->ThiS->ThiG->ThiH-> Chlorobium tepidum TLS bacteroidetes/chlorobi cysteine synthase [Chlorobium tepidum TLS]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77545399">77545399</a> 308 Cys syn*->OAHSH->ThiF->ThiS solo-> (probably molybdenum biosynthesis?) Pelobacter carbinolicus DSM 2380 proteobacteria>deltaproteobacteria cysteine synthase [Pelobacter carbinolicus DSM 2380]
Miscellaneous pathway
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67938818">67938818</a> 328 Rrf2 (often fused to NifS)->Cys Synthase*->ThiS Chlorobium phaeobacteroides BS1 bacteroidetes/chlorobi Cysteine synthase K/M:Cysteine synthase A [Chlorobium phaeobacteroides BS1]
-------------------------------------------------------------------------------------------------------------
2. Classical pathway: Molybdopterin cofactor biosynthesis and related pathways
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
a.Bacterial versions of classical MOCO factor biosynthesis pathway (The gis represent the MoaE protein- marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=32033744">32033744</a> 151 moaA->MoaC->MoaD->MoaE*-> Actinobacillus pleuropneumoniae serovar 1 str. 4074; proteobacteria>gammaproteobacteria COG0314: Molybdopterin converting factor, large subunit [Actinobacillus pleuropneumoniae serovar 1 str. 4074]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75431112">75431112</a> 159 moaA->MoaC->MoaD->MoaE*-> Actinobacillus succinogenes 130Z proteobacteria>gammaproteobacteria molybdopterin converting factor, large subunit [Actinobacillus succinogenes 130Z]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=45435690">45435690</a> 152 <-MoaE*<-MoaD<-MoaC<-moaA Yersinia pestis biovar Medievalis str. 91001 proteobacteria>gammaproteobacteria molybdopterin [mpt] converting factor, subunit 2 [Yersinia pestis biovar Medievalis str. 91001]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71038168">71038168</a> 184 MoeA->moaA->MoaB->?->MoaC->MoaD->MoaE*->Mo_transporter->permease-> Psychrobacter arcticus 273-4 proteobacteria>gammaproteobacteria probable molybdopterin converting factor, large subunit [Psychrobacter arcticus 273-4]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67156788">67156788</a> 148 MoaC->MoaD->MoaE*-> Azotobacter vinelandii AvOP proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Azotobacter vinelandii AvOP]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=28868460">28868460</a> 148 MoaC->MoaD->MoaE*-> Pseudomonas syringae pv. tomato str. DC3000 proteobacteria>gammaproteobacteria molybdenum cofactor biosynthesis protein E [Pseudomonas syringae pv. tomato str. DC3000]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21107238">21107238</a> 146 MoaC->MoaD->MoaE*-> Xanthomonas axonopodis pv. citri str. 306 proteobacteria>gammaproteobacteria molybdopterin-converting factor chain 2 [Xanthomonas axonopodis pv. citri str. 306]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=26988029">26988029</a> 148 MoaC->MoaD->MoaE*-> Pseudomonas putida KT2440 proteobacteria>gammaproteobacteria molybdenum cofactor biosynthesis protein E [Pseudomonas putida KT2440]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77381196">77381196</a> 150 MoaC->MoaD->MoaE*-> Pseudomonas fluorescens PfO-1 proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Pseudomonas fluorescens PfO-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68345385">68345385</a> 152 <-MoaB||MobA->?->?-><-MoaE*<-MoaD<-MoaC<-?<-moaA Pseudomonas fluorescens Pf-5 proteobacteria>gammaproteobacteria molybdenum cofactor biosynthesis protein E [Pseudomonas fluorescens Pf-5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77382375">77382375</a> 153 <-MoaE*<-MoaD<-MoaC<-?<-?||MoaB->MoeA-> Pseudomonas fluorescens PfO-1 proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Pseudomonas fluorescens PfO-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68342721">68342721</a> 150 MoaC->MoaD->MoaE*-> Pseudomonas fluorescens Pf-5 proteobacteria>gammaproteobacteria molybdopterin converting factor, subunit 2 [Pseudomonas fluorescens Pf-5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84319277">84319277</a> 150 <-MoeA<-MoaB<-MoaE*<-MoaD Pseudomonas aeruginosa C3719 proteobacteria>gammaproteobacteria COG0314: Molybdopterin converting factor, large subunit [Pseudomonas aeruginosa C3719]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76791053">76791053</a> 154 <-permease<-Mo_transporter<-MoaE*<-MoaD<-MoaC<-MoaB<-?<-moaA Pseudoalteromonas atlantica T6c proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Pseudoalteromonas atlantica T6c]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=36784881">36784881</a> 150 moaA->MoaC->MoaD->MoaE*-> Photorhabdus luminescens subsp. laumondii TTO1 proteobacteria>gammaproteobacteria molybdopterin [MPT] converting factor, subunit 2 (molybdenum cofactor biosynthesis protein E) (molybdopterin converting factor large subunit) [Photorhabdus luminescens subsp. laumondii TTO1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=37198129">37198129</a> 151 moaA->MoaB->MoaC->MoaD->MoaE*->?->?->?->Mopterin_binding_protein-> Vibrio vulnificus YJ016 proteobacteria>gammaproteobacteria molybdenum cofactor biosynthesis protein E [Vibrio vulnificus YJ016]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46912748">46912748</a> 149 moaA->MoaC->MoaD->MoaE*-> Photobacterium profundum SS9 proteobacteria>gammaproteobacteria putative molybdenum cofactor biosynthesisprotein E [Photobacterium profundum SS9]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67676069">67676069</a> 152 <-moaA<-MoaE*<-MoaD<-MoeA<-mobB<-MobA<-Mopterin_binding_protein<-permease Chromohalobacter salexigens DSM 3043 proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Chromohalobacter salexigens DSM 3043]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71144053">71144053</a> 156 <-MoeA||moaA->?->MoaB->MoaC->MoaD->MoaE*->Mo_transporter->permease->Mopterin_binding_protein->MoeB-> Colwellia psychrerythraea 34H proteobacteria>gammaproteobacteria molybdopterin converting factor, subunit 2 [Colwellia psychrerythraea 34H]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=12720896">12720896</a> 150 <-MoaE*<-MoaD<-MoaC<-moaA Pasteurella multocida subsp. multocida str. Pm70 proteobacteria>gammaproteobacteria MoaE [Pasteurella multocida subsp. multocida str. Pm70]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=49612263">49612263</a> 150 <-MoaE*<-MoaD<-MoaC<-MoaB<-moaA Erwinia carotovora subsp. atroseptica SCRI1043 proteobacteria>gammaproteobacteria molybdopterin converting factor subunit 2 [Erwinia carotovora subsp. atroseptica SCRI1043]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84393379">84393379</a> 157 <-Mopterin_binding_protein<-?<-?<-?<-MoaE*<-MoaD<-MoaC<-MoaB<-moaA Vibrio splendidus 12B01 proteobacteria>gammaproteobacteria Molybdenum cofactor biosynthesis protein E [Vibrio splendidus 12B01]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=28807085">28807085</a> 151 <-Mopterin_binding_protein<-?<-?<-?<-MoaE*<-MoaD<-MoaC<-MoaB<-moaA Vibrio parahaemolyticus RIMD 2210633 proteobacteria>gammaproteobacteria molybdenum cofactor biosynthesis protein E [Vibrio parahaemolyticus RIMD 2210633]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=26107156">26107156</a> 150 moaA->MoaB-><-?||MoaC->MoaD->MoaE*-> Escherichia coli CFT073 proteobacteria>gammaproteobacteria AE016757_244 Molybdopterin converting factor subunit 2 [Escherichia coli CFT073]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=59711550">59711550</a> 148 moaA->MoaC->MoaD->MoaE*-> Vibrio fischeri ES114 proteobacteria>gammaproteobacteria molybdopterin converting factor, large subunit [Vibrio fischeri ES114]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33148682">33148682</a> 151 <-MoaE*<-MoaD<-MoaC<-moaA Haemophilus ducreyi 35000HP proteobacteria>gammaproteobacteria molybdopterin converting factor subunit 2 [Haemophilus ducreyi 35000HP]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=1574523">1574523</a> 150 <-MoaE*<-MoaD<-MoaC<-moaA Haemophilus influenzae Rd KW20 proteobacteria>gammaproteobacteria molybdopterin converting factor, subunit 2 (moaE) [Haemophilus influenzae Rd KW20]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23467045">23467045</a> 150 moaA->MoaC->MoaD->MoaE*-> Haemophilus somnus 129PT proteobacteria>gammaproteobacteria COG0314: Molybdopterin converting factor, large subunit [Haemophilus somnus 129PT]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68545075">68545075</a> 152 <-Mopterin_binding_protein<-permease<-Mo_transporter<-MoaE*<-MoaD<-MoaC<-MoaB<-moaA Shewanella amazonensis SB2B proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Shewanella amazonensis SB2B]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69158642">69158642</a> 156 <-Mopterin_binding_protein<-permease<-Mo_transporter<-MoaE*<-MoaD<-MoaC<-moaA Shewanella denitrificans OS217 proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Shewanella denitrificans OS217]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75819544">75819544</a> 153 <-MoaE*<-MoaD<-MoaC<-MoaB<-moaA Vibrio cholerae V51 proteobacteria>gammaproteobacteria COG0314: Molybdopterin converting factor, large subunit [Vibrio cholerae V51]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48861422">48861422</a> 145 MoaC->MoaD->MoaE*-> Microbulbifer degradans 2-40 proteobacteria>gammaproteobacteria COG0314: Molybdopterin converting factor, large subunit [Microbulbifer degradans 2-40]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52307131">52307131</a> 161 moaA->MoaC->MoaD->MoaE*-> Mannheimia succiniciproducens MBEL55E proteobacteria>gammaproteobacteria MoaE protein [Mannheimia succiniciproducens MBEL55E]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77951749">77951749</a> 148 MobA-><-MoaE*<-MoaD<-MoeA<-MoaB Marinobacter aquaeolei VT8 proteobacteria>gammaproteobacteria molybdenum cofactor biosynthesis protein E [Marinobacter aquaeolei VT8]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=87121810">87121810</a> 151 <-MoaE*<-MoaD<-MoaC Marinomonas sp. MED121 proteobacteria>gammaproteobacteria molybdenum cofactor biosynthesis protein E [Marinomonas sp. MED121]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78362791">78362791</a> 155 moaA->MoaD->MoaE*->MoeA->MoaC-> Thiomicrospira crunogena XCL-2 proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Thiomicrospira crunogena XCL-2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=53756579">53756579</a> 151 MoaD->MoaE*-> Methylococcus capsulatus str. Bath proteobacteria>gammaproteobacteria molybdopterin converting factor, subunit 2 [Methylococcus capsulatus str. Bath]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=24375927">24375927</a> 155 <-Mopterin_binding_protein<-permease<-Mo_transporter<-MoaE*<-MoaD<-MoaC<-moaA Shewanella oneidensis MR-1 proteobacteria>gammaproteobacteria molybdenum cofactor biosynthesis protein E [Shewanella oneidensis MR-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69951943">69951943</a> 172 <-Mopterin_binding_protein<-permease<-Mo_transporter<-MoaE*<-MoaD<-MoaC<-moaA Shewanella frigidimarina NCIMB 400 proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Shewanella frigidimarina NCIMB 400]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71364350">71364350</a> 163 <-permease<-Mo_transporter<-MoaE*<-MoaD<-MoaC<-?<-MoaB<-moaA<-MoeA Psychrobacter cryohalolentis K5 proteobacteria>gammaproteobacteria Molybdopterin biosynthesis MoaE [Psychrobacter cryohalolentis K5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86154629">86154629</a> 148 <-MoeA<-?<-?<-MoaE*<-MoaD Campylobacter fetus subsp. fetus 82-40 proteobacteria>epsilonproteobacteria molybdopterin converting factor, subunit 2 [Campylobacter fetus subsp. fetus 82-40]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78776455">78776455</a> 145 MoaD->MoaE*->MoeA-> Thiomicrospira denitrificans ATCC 33889 proteobacteria>epsilonproteobacteria possible molybdopterin converting factor, subunit 2 [Thiomicrospira denitrificans ATCC 33889]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15645419">15645419</a> 145 <-MoaC<-MoaB<-MoaE*<-MoaD Helicobacter pylori 26695 proteobacteria>epsilonproteobacteria molybdopterin converting factor, subunit 2 (moaE) [Helicobacter pylori 26695]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=32261594">32261594</a> 157 MoeA->MoaD->MoaE*->mobB->MoaB-> Helicobacter hepaticus ATCC 51449 proteobacteria>epsilonproteobacteria molybdopterin converting factor [Helicobacter hepaticus ATCC 51449]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57167345">57167345</a> 147 MoaD->MoaE*->?->MoeA-> Campylobacter jejuni RM1221; proteobacteria>epsilonproteobacteria molybdopterin converting factor, subunit 2 [Campylobacter jejuni RM1221]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=34483283">34483283</a> 145 MoeA->MoaD->MoaE*->mobB->MoaB->MoaC-> Wolinella succinogenes proteobacteria>epsilonproteobacteria POSSIBLE MOLYBDOPTERIN CONVERTING FACTOR, SUBUNIT 2 [Wolinella succinogenes]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57505250">57505250</a> 151 MoaD->MoaE*->MoeA-> Campylobacter upsaliensis RM3195 proteobacteria>epsilonproteobacteria molybdopterin converting factor, subunit 2 [Campylobacter upsaliensis RM3195]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=34495640">34495640</a> 158 <-MoaE*<-MoaD Chromobacterium violaceum ATCC 12472 proteobacteria>betaproteobacteria molybdopterin converting factor subunit 2 [Chromobacterium violaceum ATCC 12472]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18076268">18076268</a> 172 MoeA->MoaD->MoaE*->CcrB-> Cupriavidus necator proteobacteria>betaproteobacteria molybdopterin synthase large subunit [Cupriavidus necator]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67907156">67907156</a> 226 <-MoaE*||?-><-MoaD<-MoeA<-mobB<-Threonine_synthase Polaromonas sp. JS666 proteobacteria>betaproteobacteria Molybdopterin biosynthesis MoaE [Polaromonas sp. JS666]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74022613">74022613</a> 163 <-MoaE*<-MoaD<-MoeA<-mobB<-Threonine_synthase Rhodoferax ferrireducens DSM 15236 proteobacteria>betaproteobacteria Molybdopterin biosynthesis MoaE [Rhodoferax ferrireducens DSM 15236]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=47573809">47573809</a> 159 Threonine_synthase->mobB->MoeA->MoaD->?->MoaE*->CcrB-> Rubrivivax gelatinosus PM1 proteobacteria>betaproteobacteria COG0314: Molybdopterin converting factor, large subunit [Rubrivivax gelatinosus PM1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74317045">74317045</a> 151 <-moaA||mobB->MoeA->MoaD->MoaE*-> Thiobacillus denitrificans ATCC 25259 proteobacteria>betaproteobacteria molybdenum cofactor biosynthesis protein E [Thiobacillus denitrificans ATCC 25259]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83719603">83719603</a> 166 Threonine_synthase->MoeA->MoaD->MoaE*-> Burkholderia thailandensis E264 proteobacteria>betaproteobacteria molybdopterin converting factor, subunit 2 [Burkholderia thailandensis E264]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77964629">77964629</a> 189 <-MoaD<-MoaE*<-moaA<-MoeA Burkholderia sp. 383 proteobacteria>betaproteobacteria Molybdopterin biosynthesis MoaE [Burkholderia sp. 383]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67664216">67664216</a> 190 MoeA->moaA->MoaE*->MoaD-> Burkholderia cenocepacia HI2424 proteobacteria>betaproteobacteria Molybdopterin biosynthesis MoaE [Burkholderia cenocepacia HI2424]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74018016">74018016</a> 187 MoeA->moaA->MoaE*->MoaD-> Burkholderia ambifaria AMMD; proteobacteria>betaproteobacteria Molybdopterin biosynthesis MoaE [Burkholderia ambifaria AMMD]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84713091">84713091</a> 157 Threonine_synthase->mobB->MoeA->MoaD->MoaE*-> Polaromonas naphthalenivorans CJ2 proteobacteria>betaproteobacteria moaE, RSc1332; probable molybdopterin mpt converting factor (subunit 2) protein [Polaromonas naphthalenivorans CJ2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33563746">33563746</a> 163 ModE->moaA-><-MoeA<-MoaB<-MoaE*<-MoaD<-MoaC Bordetella pertussis Tohama I proteobacteria>betaproteobacteria molybdopterin converting factor [Bordetella pertussis Tohama I]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56315291">56315291</a> 161 <-MoaE*<-MoaD<-MoeA<-mobB Azoarcus sp. EbN1 proteobacteria>betaproteobacteria Molybdenum cofactor biosynthesis protein E [Azoarcus sp. EbN1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68212269">68212269</a> 149 <-MoaE*<-MoaD Methylobacillus flagellatus KT proteobacteria>betaproteobacteria Molybdopterin biosynthesis MoaE [Methylobacillus flagellatus KT]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68557891">68557891</a> 163 <-CcrB<-MoaE*<-MoaD<-MoeA<-Threonine_synthase Ralstonia metallidurans CH34 proteobacteria>betaproteobacteria Molybdopterin biosynthesis MoaE [Ralstonia metallidurans CH34]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17428347">17428347</a> 176 Threonine_synthase->?->MoeA->MoaD->MoaE*->CcrB-> Ralstonia solanacearum proteobacteria>betaproteobacteria PROBABLE MOLYBDOPTERIN MPT CONVERTING FACTOR (SUBUNIT 2) PROTEIN [Ralstonia solanacearum]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86357114">86357114</a> 153 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase<-Excinuclease<-ADH||OmpA-> Rhizobium etli CFN 42 proteobacteria>alphaproteobacteria molybdopterin converting factor subunit 2 protein [Rhizobium etli CFN 42]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77389070">77389070</a> 146 <-ADH||?->?->?-><-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase<-Excinuclease<-ADH Rhodobacter sphaeroides 2.4.1 proteobacteria>alphaproteobacteria Molybdopterin converting factor subunit 2 [Rhodobacter sphaeroides 2.4.1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=27355756">27355756</a> 160 <-OmpA||Excinuclease->phosphatidylglycerophosphate_synthase->MoaD->MoaE*-> Bradyrhizobium japonicum USDA 110 proteobacteria>alphaproteobacteria molybdopterin converting factor large subunit [Bradyrhizobium japonicum USDA 110]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23347497">23347497</a> 163 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase<-Excinuclease<-ADH||OmpA-> Brucella suis 1330 proteobacteria>alphaproteobacteria molybdopterin converting factor, subunit 2 [Brucella suis 1330]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=39648091">39648091</a> 155 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase<-Excinuclease||OmpA-> Rhodopseudomonas palustris CGA009 proteobacteria>alphaproteobacteria molybdopterin converting factor, subunit 2 [Rhodopseudomonas palustris CGA009]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78494766">78494766</a> 152 <-OmpA||Excinuclease->phosphatidylglycerophosphate_synthase->MoaD->MoaE*-> Rhodopseudomonas palustris BisB18 proteobacteria>alphaproteobacteria Molybdopterin biosynthesis MoaE [Rhodopseudomonas palustris BisB18]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83577061">83577061</a> 162 MobA->MoaC->MoaD->MoaE*-> Rhodospirillum rubrum ATCC 11170 proteobacteria>alphaproteobacteria Molybdopterin biosynthesis MoaE [Rhodospirillum rubrum ATCC 11170]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85705895">85705895</a> 147 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase<-?<-?<-?<-Excinuclease Roseovarius sp. 217 proteobacteria>alphaproteobacteria molybdopterin converting factor, subunit 2 [Roseovarius sp. 217]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84705082">84705082</a> 155 <-MoaB<-MoaE*<-MoaD<-moaA Parvularcula bermudensis HTCC2503 proteobacteria>alphaproteobacteria molybdopterin converting factor, subunit 2 [Parvularcula bermudensis HTCC2503]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69936171">69936171</a> 146 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase Paracoccus denitrificans PD1222 proteobacteria>alphaproteobacteria Molybdopterin biosynthesis MoaE [Paracoccus denitrificans PD1222]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69926308">69926308</a> 155 <-OmpA||Excinuclease->phosphatidylglycerophosphate_synthase->MoaD->MoaE*-> Nitrobacter hamburgensis X14 proteobacteria>alphaproteobacteria Molybdopterin biosynthesis MoaE [Nitrobacter hamburgensis X14]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13421104">13421104</a> 150 <-MoaC<-MoaB<-MoaE*<-MoaD<-moaA Caulobacter crescentus CB15;(Note MoaB related to MoeA) proteobacteria>alphaproteobacteria molybdopterin converting factor, subunit 2 [Caulobacter crescentus CB15]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84786468">84786468</a> 156 moaA->MoaD->MoaE*-> Erythrobacter litoralis HTCC2594 proteobacteria>alphaproteobacteria molybdopterin converting factor, subunit 2 [Erythrobacter litoralis HTCC2594]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15074100">15074100</a> 155 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase<-Excinuclease<-ADH||OmpA-> Sinorhizobium meliloti proteobacteria>alphaproteobacteria PROBABLE MOLYBDOPTERIN MPT CONVERTING FACTOR, SUBUNIT 2 PROTEIN [Sinorhizobium meliloti]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68538766">68538766</a> 146 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase Sphingopyxis alaskensis RB2256 proteobacteria>alphaproteobacteria Molybdopterin biosynthesis MoaE [Sphingopyxis alaskensis RB2256]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68193705">68193705</a> 154 <-OmpA||ADH->Excinuclease->phosphatidylglycerophosphate_synthase->MoaD->MoaE*-> Mesorhizobium sp. BNC1 proteobacteria>alphaproteobacteria Molybdopterin biosynthesis MoaE [Mesorhizobium sp. BNC1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=14027319">14027319</a> 159 <-OmpA||ADH->Excinuclease->phosphatidylglycerophosphate_synthase->MoaD->MoaE*-> Mesorhizobium loti MAFF303099 proteobacteria>alphaproteobacteria molybdopterin converting factor, subunit 2 [Mesorhizobium loti MAFF303099]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23016727">23016727</a> 158 ADH->Excinuclease->phosphatidylglycerophosphate_synthase->mobB->MoeA->MoaD->MoaE*-> Magnetospirillum magnetotacticum MS-1 proteobacteria>alphaproteobacteria COG0314: Molybdopterin converting factor, large subunit [Magnetospirillum magnetotacticum MS-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83854897">83854897</a> 147 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase<-Excinuclease<-ADH Sulfitobacter sp. NAS-14.1 proteobacteria>alphaproteobacteria molybdopterin converting factor, subunit 2 [Sulfitobacter sp. NAS-14.1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85707988">85707988</a> 147 moaA->MoaD->MoaE*-> Erythrobacter sp. NAP1 proteobacteria>alphaproteobacteria molybdopterin converting factor, subunit 2 [Erythrobacter sp. NAP1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68180109">68180109</a> 147 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase<-Excinuclease<-ADH Jannaschia sp. CCS1 proteobacteria>alphaproteobacteria Molybdopterin biosynthesis MoaE [Jannaschia sp. CCS1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=58001332">58001332</a> 170 <-MoaE*<-MoaD<-MoaC<-moaA<-MoeA Gluconobacter oxydans 621H proteobacteria>alphaproteobacteria Molybdopterin (MPT) converting factor, subunit 2 [Gluconobacter oxydans 621H]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15156159">15156159</a> 155 <-MoaE*<-MoaD<-phosphatidylglycerophosphate_synthase<-Excinuclease<-ADH||OmpA-> Agrobacterium tumefaciens str. C58; proteobacteria>alphaproteobacteria AGR_C_2084p [Agrobacterium tumefaciens str. C58]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=32444388">32444388</a> 170 MoaD->MoaE*-> Rhodopirellula baltica SH 1 planctomycetes molybdopterin converting factor, large subunit [Rhodopirellula baltica SH 1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=28271029">28271029</a> 133 <-Mopterin_binding_protein<-?<-?||MoaE*->MoaD->moaA-> Lactobacillus plantarum WCFS1 firmicutes molybdopterin biosynthesis protein, E chain [Lactobacillus plantarum WCFS1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=16410446">16410446</a> 140 <-permease||?->MoeA->mobB->MoaE*->MoaD->MoaC->moaA-><-MoaB<-MoeB Listeria monocytogenes firmicutes lmo1044 [Listeria monocytogenes]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56379151">56379151</a> 155 moaA-><-?||MoeA->mobB->MoaE*->MoaD-> Geobacillus kaustophilus HTA426 firmicutes molybdopterin converting factor (subunit 2) [Geobacillus kaustophilus HTA426]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72494466">72494466</a> 148 MoaB-><-MoaC||MoeA->mobB->MoaE*->MoaD->MobA->moaA-> Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305 firmicutes molybdopterin converting factor large subunit [Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29898351">29898351</a> 139 <-ADH<-?<-?<-MoaD<-MoaE*<-mobB<-MoeA||MoaC-><-MoeB Bacillus cereus ATCC 14579 firmicutes Molybdopterin (MPT) converting factor, subunit 2 [Bacillus cereus ATCC 14579]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29895811">29895811</a> 156 moaA->MoeB->MoeA->MoaE*->MoaD-> Bacillus cereus ATCC 14579 firmicutes Molybdopterin (MPT) converting factor, subunit 2 [Bacillus cereus ATCC 14579]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56908909">56908909</a> 142 <-MoaB||moaA->MoeA->mobB->MoaE*->MoaD-> Bacillus clausii KSM-K16 firmicutes molybdopterin converting factor subunit 2 MoaE [Bacillus clausii KSM-K16]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10175641">10175641</a> 156 MobA-><-MoaD<-MoaE*<-mobB<-MoeA<-MoaB||MoaC-> Bacillus halodurans C-125 firmicutes molybdopterin converting factor (subunit 2) [Bacillus halodurans C-125]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52003240">52003240</a> 164 MobA->MoeB->MoeA->mobB->MoaE*->MoaD-> Bacillus licheniformis ATCC 14580 firmicutes molybdopterin converting factor (subunit 2) [Bacillus licheniformis ATCC 14580]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2633801">2633801</a> 157 MobA->MoeB->MoeA->mobB->MoaE*->MoaD->?->?->?->?->Mopterin_binding_protein-> Bacillus subtilis subsp. subtilis str. 168; firmicutes molybdopterin converting factor (subunit 2) [Bacillus subtilis subsp. subtilis str. 168]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75760852">75760852</a> 165 <-ADH<-?<-?<-MoaD<-MoaE*<-mobB<-MoeA||MoaC-><-MoeB Bacillus thuringiensis serovar israelensis ATCC 35646 firmicutes Molybdopterin converting factor, large subunit [Bacillus thuringiensis serovar israelensis ATCC 35646]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75762420">75762420</a> 157 moaA->?->MoeB->MoeA->MoaE*->MoaD-> Bacillus thuringiensis serovar israelensis ATCC 35646 firmicutes Molybdopterin converting factor, large subunit [Bacillus thuringiensis serovar israelensis ATCC 35646]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=49242615">49242615</a> 148 <-moaA<-MobA<-MoaD<-MoaE*<-mobB<-MoeA||MoaC-><-MoaB Staphylococcus aureus subsp. aureus MRSA252 firmicutes putative molybdopterin-synthase large subunit [Staphylococcus aureus subsp. aureus MRSA252]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=3955206">3955206</a> 150 MoaB-><-MoaC||MoeA->mobB->MoaE*->MoaD->MobA->moaA-> Staphylococcus carnosus firmicutes MoaE [Staphylococcus carnosus]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57867759">57867759</a> 150 <-moaA<-MobA<-MoaD<-MoaE*<-mobB<-MoeA||MoaC-><-MoaB Staphylococcus epidermidis RP62A firmicutes molybdenum cofactor biosynthesis protein E [Staphylococcus epidermidis RP62A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68446506">68446506</a> 149 MoaB-><-MoaC||MoeA->mobB->MoaE*->MoaD->MobA->moaA-> Staphylococcus haemolyticus JCSC1435 firmicutes molybdopterin converting factor moa [Staphylococcus haemolyticus JCSC1435]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78704014">78704014</a> 132 MoaD->MoaE*-> Methanospirillum hungatei JF-1 euryarchaeota Molybdopterin biosynthesis MoaE [Methanospirillum hungatei JF-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78705135">78705135</a> 135 MoeB->MoaD->MoaE*-><-?||?->permease->Mopterin_binding_protein-> Methanospirillum hungatei JF-1 euryarchaeota Molybdopterin biosynthesis MoaE [Methanospirillum hungatei JF-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86604897">86604897</a> 161 <-MoaE*<-MoaD<-?<-moaA<-MoeA Cyanobacteria bacterium Yellowstone A-Prime cyanobacteria molybdopterin converting factor, subunit 2 [Cyanobacteria bacterium Yellowstone A-Prime]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=35214942">35214942</a> 149 MoaD->MoaE*-> Gloeobacter violaceus PCC 7421; MoaD->MoaE cyanobacteria molybdopterin converting factor subunit 2 [Gloeobacter violaceus PCC 7421]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=1001213">1001213</a> 145 Ferr-nitrite_reductase->cyanate_lyase->MoeA->moaA->MoaC+MobA->MoaD->MoaE*-> Synechocystis sp. PCC 6803 cyanobacteria molybdopterin (MPT) converting factor, subunit 2 [Synechocystis sp. PCC 6803]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33639603">33639603</a> 142 MoaC->MoeA-><-?||?-><-MoaE*<-MoaD||MoaB-> Synechococcus sp. WH 8102 cyanobacteria molybdenum cofactor biosynthesis protein E (molydbopterin converting factor large subunit) [Synechococcus sp. WH 8102]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78170140">78170140</a> 148 MoaC->MoeA->sugar_epimerase-><-MoaE*<-MoaD||MoaB-> Synechococcus sp. CC9902 cyanobacteria molybdenum cofactor biosynthesis protein E [Synechococcus sp. CC9902]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=22295084">22295084</a> 148 LysR<-MoaE*<-MoaD<-MoaC+MobA<-moaA<-MoeA Thermosynechococcus elongatus BP-1 cyanobacteria molybdopterin (MPT) converting factor, subunit 2 [Thermosynechococcus elongatus BP-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76261575">76261575</a> 137 ADH->MoaD->MoaE*-> Chloroflexus aurantiacus J-10-fl chloroflexi Molybdopterin biosynthesis MoaE [Chloroflexus aurantiacus J-10-fl]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86134371">86134371</a> 140 <-MoaE*||?-><-MoaD Tenacibaculum sp. MED152 bacteroidetes/chlorobi molybdopterin converting factor, subunit 2 [Tenacibaculum sp. MED152]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67937986">67937986</a> 130 <-MoaE*<-MoaD<-MoeA<-MoaC+MoeA Chlorobium phaeobacteroides BS1; bacteroidetes/chlorobi Molybdopterin biosynthesis MoaE [Chlorobium phaeobacteroides BS1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86143256">86143256</a> 142 <-moaA<-MoaC+MoeA<-MoaE*<-MoeB<-MoaD<-MobA<-ModE<-MoeA Flavobacterium sp. MED217; bacteroidetes/chlorobi molybdopterin converting factor, subunit 2 [Flavobacterium sp. MED217]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68553533">68553533</a> 130 <-MoaE*<-MoaD<-MoeA<-?<-moaA Prosthecochloris aestuarii DSM 271 bacteroidetes/chlorobi Molybdopterin biosynthesis MoaE [Prosthecochloris aestuarii DSM 271]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68562527">68562527</a> 146 MoaD->MoaE*-> Rubrobacter xylanophilus DSM 9941; MoaD->MoaE actinobacteria Molybdopterin biosynthesis MoaE [Rubrobacter xylanophilus DSM 9941]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=54017798">54017798</a> 145 <-MoaD<-moaA||MoeA->?->MoaE*-> Nocardia farcinica IFM 10152 actinobacteria putative molybdopterin biosynthesis protein [Nocardia farcinica IFM 10152]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13880439">13880439</a> 141 MoaC->MoaB->MoaE*-><-?<-MoaD<-moaA Mycobacterium tuberculosis CDC1551 actinobacteria molybdopterin cofactor biosynthesis protein E [Mycobacterium tuberculosis CDC1551]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=62425449">62425449</a> 140 <-MoaE*<-MoaC<-MoeA||moaA->MoaD-><-MoeB+Rhod<-MoeA Brevibacterium linens BL2 actinobacteria COG0314: Molybdopterin converting factor, large subunit [Brevibacterium linens BL2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=25169125">25169125</a> 155 <-MoaD<-MoaD<-MoaD<-moaA||MoeA->MoaC->MoaE*-> Arthrobacter nicotinovorans actinobacteria molybdopterin synthase (large subunit moaE) [Arthrobacter nicotinovorans]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=41406902">41406902</a> 141 MoaC->MoaB->MoaE*-><-?<-MoaD<-moaA Mycobacterium avium subsp. paratuberculosis K-10 actinobacteria MoaE2 [Mycobacterium avium subsp. paratuberculosis K-10]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=12620120">12620120</a> 150 moaA->MoaB->MoaC->MoaD->MoaE*-> uncultured bacterium pCosHE1 AF250774_5 putative molybdopterin converting factor subunit 2 [uncultured bacterium pCosHE1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=40062751">40062751</a> 148 <-MoaB<-MoaD<-moaA<-MoaC<-MoeA||MobA-><-MoaE* uncultured bacterium 439 molydopterin converting factor, subunit 2 [uncultured bacterium 439]
Example of a MoaC fused to a MoaD
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84319278">84319278</a> 243 PhoH(PIN+ATPase)->MoaC+MoaD->MoaE->MoaB->MoeA Pseudomonas aeruginosa C3719 proteobacteria>gammaproteobacteria COG0315: Molybdenum cofactor biosynthesis enzyme [Pseudomonas aeruginosa C3719]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67676070">67676070</a> 262 permease->ABC ATPAse->MobA->MobB->MoeA->MoaC+MoaD->MoaE->MoaA-> Chromohalobacter salexigens DSM 3043 proteobacteria>gammaproteobacteria Molybdopterin cofactor biosynthesis protein MoaC [Chromohalobacter salexigens DSM 3043]
Bacterial MoaDs that are fused to MoaE (Gis are for the MoaD+MoaE protein- marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67927210">67927210</a> 229 MoaD+MoaE* Solibacter usitatus Ellin6076 fibrobacteres/acidobacteria Molybdopterin biosynthesis MoaE:ThiamineS [Solibacter usitatus Ellin6076]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46200249">46200249</a> 223 MoaD+MoaE* Thermus thermophilus HB27 deinococci molybdopterin (MPT) converting factor, subunit 2 [Thermus thermophilus HB27]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=66799395">66799395</a> 273 MoaD+MoaE* Deinococcus geothermalis DSM 11300 deinococci Molybdopterin converting factor, subunit 1 [Deinococcus geothermalis DSM 11300]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=6460436">6460436</a> 229 MoaD+MoaE* Deinococcus radiodurans R1 deinococci AE002090_1 molybdenum cofactor biosynthesis protein D/E [Deinococcus radiodurans R1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=51858004">51858004</a> 230 MoaD+MoaE* Symbiobacterium thermophilum IAM 14863 actinobacteria molybdopterin converting factor-like protein [Symbiobacterium thermophilum IAM 14863]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13883249">13883249</a> 221 MoaA->dehydratase-> MoaC->MoaD+MoaE*-> Mycobacterium tuberculosis CDC1551 actinobacteria (dehydratase-pterin-4-alpha-carbinolamine dehydratase) molybdopterin cofactor biosynthesis protein D/E [Mycobacterium tuberculosis CDC1551]
b. Archaeal pathways involved in MOCO biosynthesis and related pathways (MoaD gis)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
- Molybdenum pathway (Basic construction with minor elaboration)
Gis are for the MoaD containing protein (marked with an asterisk)
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15621527">15621527</a> 236 MoaD+MoaE*->ThiD+X->HD->InPP+X->Glucosaminyltransferase-> Sulfolobus tokodaii str. 7 crenarchaeota 236aa long hypothetical molybdopterin converting factor [Sulfolobus tokodaii str. 7]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68567385">68567385</a> 235 MoaD+MoaE*->ThiD+X->HD->InPP+X->Glucosaminyltransferase-> Sulfolobus acidocaldarius DSM 639 crenarchaeota molybdenum cofactor biosynthesis protein D/E [Sulfolobus acidocaldarius DSM 639]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13815697">13815697</a> 231 MoaD+MoaE*->ThiD+X->HD->InPP+X->Glucosaminyltransferase-> Sulfolobus solfataricus P2 crenarchaeota Molybdenum cofactor biosynthesis protein E (moaE) [Sulfolobus solfataricus P2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18159566">18159566</a> 229 MoaD+MoaE* Pyrobaculum aerophilum str. IM2 crenarchaeota molybdenum cofactor biosynthesis protein D/E [Pyrobaculum aerophilum str. IM2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88950646">88950646</a> 130 MoaD*->MoaE Methanosaeta thermophila PT euryarchaeota MoaD, archaeal [Methanosaeta thermophila PT]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88603453">88603453</a> 92 MoaD*->MoaE Methanospirillum hungatei JF-1 euryarchaeota thiamineS [Methanospirillum hungatei JF-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88601825">88601825</a> 91 MoeB->MoaD*->MoaE-> Methanospirillum hungatei JF-1 euryarchaeota thiamineS [Methanospirillum hungatei JF-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48430776">48430776</a> 75 MoaC->MoaB->MoaE-><-Sugar_transporter<-MoaA<-MoaD* Picrophilus torridus DSM 9790 euryarchaeota molybdopterin (MPT) converting factor, subunit 1 [Picrophilus torridus DSM 9790]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57160377">57160377</a> 88 MoaD*->MoeB<-?->MoaE-> Thermococcus kodakarensis KOD1 euryarchaeota molybdopterin converting factor, subunit 1 [Thermococcus kodakarensis KOD1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33356787">33356787</a> 94 MoeA->MoaD*-> Pyrococcus abyssi GE5 euryarchaeota molybdopterin converting factor, subunit 1 [Pyrococcus abyssi GE5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=5458838">5458838</a> 89 MoeA->MoaD*-> Pyrococcus abyssi GE5 euryarchaeota moaD molybdopterin synthase, small subunit [Pyrococcus abyssi GE5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33359306">33359306</a> 89 MoeA->MoaD*-> Pyrococcus horikoshii OT3 euryarchaeota putative molybdopterin converting factor, subunit 1 [Pyrococcus horikoshii OT3]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18892532">18892532</a> 90 MoeA->MoaD*-> Pyrococcus furiosus DSM 3638 euryarchaeota molybdopterin converting factor, subunit 1 ; (moaD) [Pyrococcus furiosus DSM 3638]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10640334">10640334</a> 85 MoaD*->?->TFIIB<-MoeA+PBPII<-MoeA Thermoplasma acidophilum euryarchaeota MoaD (involved in molybdopterin synthesis) related protein [Thermoplasma acidophilum]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=14324783">14324783</a> 90 MoeA->MoeA+PPBII-><-WcaG->MoaD*-> Thermoplasma volcanium GSS1 euryarchaeota molybdopterin converting factor subunit 1 [Thermoplasma volcanium GSS1]
Archaeal MoaD Solos (Gis are of the MoaD protein-marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=11499216">11499216</a> 86 MoaD* Archaeoglobus fulgidus DSM 4304 euryarchaeota molybdopterin converting factor, subunit 1 (moaD) [Archaeoglobus fulgidus DSM 4304]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=55378770">55378770</a> 133 MoaD*-><-MoaD Haloarcula marismortui ATCC 43049 euryarchaeota hypothetical protein rrnAC2058 [Haloarcula marismortui ATCC 43049]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=55379974">55379974</a> 92 MoaD* Haloarcula marismortui ATCC 43049 euryarchaeota hypothetical protein rrnAC3439 [Haloarcula marismortui ATCC 43049]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10581293">10581293</a> 100 MoaD*-><-MoaD Halobacterium sp. NRC-1 euryarchaeota Vng<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=1848">1848</a>h [Halobacterium sp. NRC-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68210071">68210071</a> 112 MoaD*->CrcB->CrcB-> Methanococcoides burtonii DSM 6242 euryarchaeota MoaD, archaeal [Methanococcoides burtonii DSM 6242]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=19918186">19918186</a> 97 MoaD*->MoeA->CrcB->CrcB-> Methanosarcina acetivorans C2A euryarchaeota molybdopterin converting factor, subunit 1 [Methanosarcina acetivorans C2A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21226933">21226933</a> 97 MoaD*->MoeA->CrcB->CrcB-> Methanosarcina mazei Go1 euryarchaeota Molybdopterin converting factor small subunit [Methanosarcina mazei Go1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72395205">72395205</a> 97 MoaD*->MoeA->CrcB->CrcB-> Methanosarcina barkeri str. fusaro euryarchaeota molybdopterin converting factor small subunit [Methanosarcina barkeri str. fusaro]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76801893">76801893</a> 93 MoaD* Natronomonas pharaonis DSM 2160 euryarchaeota probable molybdopterin converting factor, small subunit 2 [Natronomonas pharaonis DSM 2160]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76803138">76803138</a> 92 MoaD* Natronomonas pharaonis DSM 2160 euryarchaeota probable molybdopterin converting factor, small subunit 1 [Natronomonas pharaonis DSM 2160]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76802608">76802608</a> 97 MoaD* Natronomonas pharaonis DSM 2160 euryarchaeota homolog to molybdopterin converting factor, small subunit [Natronomonas pharaonis DSM 2160]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18160633">18160633</a> 93 MoaD* Pyrobaculum aerophilum str. IM2 crenarchaeota conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18160535">18160535</a> 90 MoaD* Pyrobaculum aerophilum str. IM2 crenarchaeota conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18161603">18161603</a> 94 MoaD* Pyrobaculum aerophilum str. IM2 crenarchaeota conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33356700">33356700</a> 78 MoaD*->CBS-> Pyrococcus abyssi GE5 euryarchaeota hypothetical protein PAB1981.1n [Pyrococcus abyssi GE5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18893753">18893753</a> 79 MoaD*->CBS-> Pyrococcus furiosus DSM 3638 euryarchaeota hypothetical protein [Pyrococcus furiosus DSM 3638]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33359416">33359416</a> 75 MoaD*->CBS-> Pyrococcus horikoshii OT3 euryarchaeota hypothetical protein PH1595.1n [Pyrococcus horikoshii OT3]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68567124">68567124</a> 84 MoaD* Sulfolobus acidocaldarius DSM 639 crenarchaeota conserved Archaeal protein [Sulfolobus acidocaldarius DSM 639]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10640172">10640172</a> 90 MoaD* Thermoplasma acidophilum euryarchaeota conserved hypothetical protein [Thermoplasma acidophilum]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=42557747">42557747</a> 448 MoaD* uncultured crenarchaeote crenarchaeota putative molybdopterin biosynthesis protein [uncultured crenarchaeote]
Archaeal operons that have the MoaE protein and do not include the MoaD protein
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
--- Archaeal MoaEs without MoaD; MobB+MoaE (MobB: Nitrogenase like GTPase); note many of these have a MoaD solo elsewhere in the genome
--gis are of the MoaE protein (marked with an asterisk)
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=55377991">55377991</a> 275 MoaC->MoaE*<-CysT<-ModA->ThiC-> Haloarcula marismortui ATCC 43049 euryarchaeota molybdenum cofactor biosynthesis protein [Haloarcula marismortui ATCC 43049]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10579734">10579734</a> 297 <-MobB+MoaE*<-MoeA-->MoaA<-MoeA+PPBDII Halobacterium sp. NRC-1 euryarchaeota (Molybd binding domain)- molybdenum cofactor biosynthesis protein; MoaE [Halobacterium sp. NRC-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68211451">68211451</a> 276 <-MobB+MoaE*<-RadC Methanococcoides burtonii DSM 6242 euryarchaeota Molybdopterin-guanine dinucleotide biosynthesis protein [Methanococcoides burtonii DSM 6242]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72397307">72397307</a> 213 <-MoeA<-MobB+MoaE* Methanosarcina barkeri str. fusaro euryarchaeota molybdopterin converting factor, subunit 2 [Methanosarcina barkeri str. fusaro]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21228894">21228894</a> 278 MobB+MoaE*->MobA<-MoeA+PPBDII Methanosarcina mazei Go1 euryarchaeota Molybdopterin converting factor, subunit 2 [Methanosarcina mazei Go1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=19915893">19915893</a> 279 <-RadC<-MobB+MoaE*->MobA-> Methanosarcina acetivorans C2A euryarchaeota molybdopterin-guanine dinucleotide biosynthesis protein B/molybdopterin converting factor, large subunit [Methanosarcina acetivorans C2A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72396955">72396955</a> 285 MobB+MoaE*->MobA-> Methanosarcina barkeri str. fusaro euryarchaeota molybdopterin converting factor, subunit 2 [Methanosarcina barkeri str. fusaro]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76801780">76801780</a> 262 MobB+MoaE*->?->metalloprotease<-ThiL Natronomonas pharaonis DSM 2160 euryarchaeota molybdopterin converting factor, large subunit [Natronomonas pharaonis DSM 2160]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=5104258">5104258</a> 249 MoaE* Aeropyrum pernix K1 crenarchaeota 249aa long hypothetical molybdopterin (mpt) converting factor, subunit 2 [Aeropyrum pernix K1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=11499761">11499761</a> 239 <-RecB<-phosphoesterase->MoaE*-> Archaeoglobus fulgidus DSM 4304 euryarchaeota molybdopterin converting factor, subunit 2 (moaE) [Archaeoglobus fulgidus DSM 4304]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2833554">2833554</a> 119 MoaE* Methanocaldococcus jannaschii euryarchaeota Y717_METJA Hypothetical protein MJ0717
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2621190">2621190</a> 143 MoaE*->HD hydrolase->Flavoprotein-> Methanothermobacter thermautotrophicus str. Delta H euryarchaeota molybdenum cofactor biosynthesis protein MoaE [Methanothermobacter thermautotrophicus str. Delta H]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=5457600">5457600</a> 148 TPR<-MoaE*->FeS oxidoreductase->KaiC-> Pyrococcus abyssi GE5 euryarchaeota moaE molybdopterin synthase, large chain [Pyrococcus abyssi GE5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68139846">68139846</a> 136 MoaE* Ferroplasma acidarmanus Fer1 euryarchaeota Molybdopterin biosynthesis MoaE [Ferroplasma acidarmanus Fer1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=45047664">45047664</a> 141 MoaE* Methanococcus maripaludis S2 euryarchaeota Molybdopterin biosynthesis MoaE [Methanococcus maripaludis S2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72397308">72397308</a> 60 MoaE* Methanosarcina barkeri str. fusaro euryarchaeota hypothetical protein Mbar_A2676 [Methanosarcina barkeri str. fusaro]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18892013">18892013</a> 145 MoaE* Pyrococcus furiosus DSM 3638 euryarchaeota molybdopterin converting factor (subunit 2) [Pyrococcus furiosus DSM 3638]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10640820">10640820</a> 135 MoaE* Thermoplasma acidophilum euryarchaeota molybdopterin-synthase large subunit related protein [Thermoplasma acidophilum]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=14324300">14324300</a> 137 MoaE* Thermoplasma volcanium GSS1 a_b_hydrolase->MoaE euryarchaeota molybdopterin converting factor subunit 2 [Thermoplasma volcanium GSS1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52549594">52549594</a> 130 MoaE* uncultured archaeon GZfos28G7 molybdopterin converting factor subunit 2 [uncultured archaeon GZfos28G7]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52550228">52550228</a> 130 MoaE* uncultured archaeon GZfos36D8 molybdopterin converting factor large subunit [uncultured archaeon GZfos36D8]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52548569">52548569</a> 134 MoaE* uncultured archaeon GZfos17C7 molybdopterin converting factor large subunit [uncultured archaeon GZfos17C7]
Miscellaneous pathways
^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=11498162">11498162</a> 88 MoaD->MoeB->SirA->?->SirA-> Archaeoglobus fulgidus DSM 4304; euryarchaeota hypothetical protein AF0552 [Archaeoglobus fulgidus DSM 4304]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68550331">68550331</a> ** 317 ModA->ModC->Cys synthase->cystathione gamme synthase->ThiS->ThiG-> Pelodictyon phaeoclathratiforme BU-1 bacteroidetes/chlorobi Cysteine synthase K/M:Cysteine synthase A [Pelodictyon phaeoclathratiforme BU-1]
----------------------------------------------------------------------------------------------- --------------
3. Tungsten cofactor biosynthesis
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Abbreviations: 4Fe-S:4fe-SFerredoxin; AOR: Aldehyde ferredoxin oxidoreductase,
PDOR : Pyridine disulfide oxidoreductase, ADH: Alcohol dehydrogenase
Always have AOR and MoaD, often MoeB, occasionally MoeA and MoaA, MoaE
a. Archaeal operons (Gis are for the MoaD protein)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=5458384">5458384</a> 84 AOR->MoaD*->MoaA-> Pyrococcus abyssi GE5 euryarchaeota moaD-like molybdopterin converting factor related, subunit 1 [Pyrococcus abyssi GE5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18892299">18892299</a> 82 AOR->MoaD*->MoaA-> Pyrococcus furiosus DSM 3638 euryarchaeota molybdopterin converting factor, subunit 1; (moaD) [Pyrococcus furiosus DSM 3638]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=11497643">11497643</a> 91 AOR->MoaD*-> Archaeoglobus fulgidus DSM 4304 euryarchaeota hypothetical protein AF0022 [Archaeoglobus fulgidus DSM 4304]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=19915596">19915596</a> 94 AOR->MoaD*-> Methanosarcina acetivorans C2A euryarchaeota predicted protein [Methanosarcina acetivorans C2A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21228746">21228746</a> 94 AOR->MoaD*-> Methanosarcina mazei Go1 euryarchaeota putative molybdopterin converting factor [Methanosarcina mazei Go1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=736275">736275</a> 69 AOR->MoaD*->MoaA-> Pyrococcus furiosus DSM 3638 euryarchaeota unnamed protein product [Pyrococcus furiosus DSM 3638]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33359354">33359354</a> 84 AOR->MoaD*-><-?->MoaA-> Pyrococcus horikoshii OT3 euryarchaeota putative molybdopterin converting factor, subunit 1 [Pyrococcus horikoshii OT3]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=57159324">57159324</a> 82 AOR->MoaD*->MoaA-> Thermococcus kodakarensis KOD1 euryarchaeota molybdopterin converting factor, subunit 1 [Thermococcus kodakarensis KOD1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=14325024">14325024</a> 91 <-MoaD*<-AOR Thermoplasma volcanium GSS1 euryarchaeota molybdopterin converting factor subunit 1 [Thermoplasma volcanium GSS1]
Possibly involved in tungsten cofactor biosynthesis (as the MoaAs, typically retrievethe tungsten cofactor protein in blast searches)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
gis are of the MoaD protein (marked with an asterisk)
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=11499688">11499688</a> 89 MoaA->MoaD*-> Archaeoglobus fulgidus DSM 4304 euryarchaeota hypothetical protein AF2105 [Archaeoglobus fulgidus DSM 4304]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68140833">68140833</a> 78 MoaA->MoaD*-> Ferroplasma acidarmanus Fer1 euryarchaeota MoaD, archaeal [Ferroplasma acidarmanus Fer1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76801420">76801420</a> 145 MoaA->?->MoaA->MoaD*-> Natronomonas pharaonis DSM 2160 euryarchaeota pterin cluster protein [Natronomonas pharaonis DSM 2160]
b. Bacterial operons (gis are for the AOR gene- marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78700229">78700229</a> 616 4Fe-S->AOR*->PDOR->MoaD-> Alkalilimnicola ehrlichei MLHE-1 proteobacteria>gammaproteobacteria aldehyde:ferredoxin oxidoreductase,tungsten-containing [Alkalilimnicola ehrlichei MLHE-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=34482541">34482541</a> 575 AOR*->MoaD->MoeB Wolinella succinogenes proteobacteria>epsilonproteobacteria ALDEHYDE OXIDOREDUCTASE [Wolinella succinogenes]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68178064">68178064</a> 576 Dehyd->AOR*->MoaD->MoeB->FeS_assembly?->MoaA Desulfuromonas acetoxidans DSM 684 proteobacteria>deltaproteobacteria IMP dehydrogenase/GMP reductase:Aldehyde ferredoxin oxidoreductase [Desulfuromonas acetoxidans DSM 684]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77543953">77543953</a> 576 MoeA<-dehyd->AOR*->MoaD->MoeB->dehyd Pelobacter carbinolicus DSM 2380 proteobacteria>deltaproteobacteria aldehyde ferredoxin oxidoreductase [Pelobacter carbinolicus DSM 2380]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71838535">71838535</a> 576 MoeA<-dehyd->AOR*->MoaD->MoeB->permease->ABC ATPase Pelobacter propionicus DSM 2379 proteobacteria>deltaproteobacteria Aldehyde ferredoxin oxidoreductase [Pelobacter propionicus DSM 2379]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77544154">77544154</a> 577 AOR*->MoaD-> Pelobacter carbinolicus DSM 2380 proteobacteria>deltaproteobacteria aldehyde ferredoxin oxidoreductase [Pelobacter carbinolicus DSM 2380]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71544346">71544346</a> 609 AOR*->MoaD-> Syntrophobacter fumaroxidans MPOB proteobacteria>deltaproteobacteria Aldehyde ferredoxin oxidoreductase [Syntrophobacter fumaroxidans MPOB]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46449005">46449005</a> 576 AOR*-><-MoaD Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough proteobacteria>deltaproteobacteria aldehyde:ferredoxin oxidoreductase, tungsten-containing [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78193518">78193518</a> 576 dehyd->AOR*->MoaD->MoeB Geobacter metallireducens GS-15 proteobacteria>deltaproteobacteria Aldehyde ferredoxin oxidoreductase [Geobacter metallireducens GS-15]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78219908">78219908</a> 577 MoaD->MoeB<-AOR* Desulfovibrio desulfuricans G20 proteobacteria>deltaproteobacteria aldehyde:ferredoxin oxidoreductase, tungsten-containing [Desulfovibrio desulfuricans G20]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68178220">68178220</a> 576 AOR*->MoaD->MoeB Desulfuromonas acetoxidans DSM 684 proteobacteria>deltaproteobacteria IMP dehydrogenase/GMP reductase:Aldehyde ferredoxin oxidoreductase [Desulfuromonas acetoxidans DSM 684]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=50877365">50877365</a> 575 MoeA->MoeA+PPBPII->AOR*->MoaD Desulfotalea psychrophila LSv54 proteobacteria>deltaproteobacteria related to tungsten-containing aldehyde ferredoxin oxidoreductase (AOR) [Desulfotalea psychrophila LSv54]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=39982778">39982778</a> ** 601 4Fe-S->AOR*->PDOR->MoaD-> MoeB-> Geobacter sulfurreducens PCA proteobacteria>deltaproteobacteria aldehyde:ferredoxin oxidoreductase, tungsten-containing [Geobacter sulfurreducens PCA]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74023041">74023041</a> 617 4Fe-S->AOR*->PDOR->MoaD-> Rhodoferax ferrireducens DSM 15236 proteobacteria>betaproteobacteria Aldehyde ferredoxin oxidoreductase [Rhodoferax ferrireducens DSM 15236]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84716937">84716937</a> 615 4Fe-S->AOR*->PDOR->MoaD-> Polaromonas naphthalenivorans CJ2 proteobacteria>betaproteobacteria aldehyde:ferredoxin oxidoreductase,tungsten-containing [Polaromonas naphthalenivorans CJ2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=47572159">47572159</a> 592 4Fe-S->AOR*->PDOR->MoaD-> Rubrivivax gelatinosus PM1 proteobacteria>betaproteobacteria COG2414: Aldehyde:ferredoxin oxidoreductase [Rubrivivax gelatinosus PM1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56314521">56314521</a> ** 774 AOR*+MoaD Azoarcus sp. EbN1 proteobacteria>betaproteobacteria putative tungsten-containing aldehyde ferredoxin oxidoreductase (AOR-1)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23015426">23015426</a> 616 4Fe-S->AOR*->PDOR->PDOR->MoaD-> Magnetospirillum magnetotacticum MS-1 proteobacteria>alphaproteobacteria COG2414: Aldehyde:ferredoxin oxidoreductase [Magnetospirillum magnetotacticum MS-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83589574">83589574</a> 599 4Fe-S->AOR*->MoaD->PDOR-> Moorella thermoacetica ATCC 39073 firmicutes Aldehyde ferredoxin oxidoreductase [Moorella thermoacetica ATCC 39073]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77996039">77996039</a> 597 AOR*->MoaA->MoaD->MoaE-> Carboxydothermus hydrogenoformans Z-2901 firmicutes aldehyde ferredoxin oxidoreductase, tungsten-containing [Carboxydothermus hydrogenoformans Z-2901]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76795288">76795288</a> ** 599 AOR*->MoaA->MoaD->MoeB->MobB->PPBPII->permease->ABC ATPase-> Thermoanaerobacter ethanolicus ATCC 33223 firmicutes Aldehyde ferredoxin oxidoreductase [Thermoanaerobacter ethanolicus ATCC 33223]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77995801">77995801</a> 629 4Fe-S->AOR*->MoaD-> Carboxydothermus hydrogenoformans Z-2901 firmicutes aldehyde ferredoxin oxidoreductase, tungsten-containing [Carboxydothermus hydrogenoformans Z-2901]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77995423">77995423</a> 597 ADH->AOR*->MoaD-> Carboxydothermus hydrogenoformans Z-2901 firmicutes aldehyde ferredoxin oxidoreductase, tungsten-containing [Carboxydothermus hydrogenoformans Z-2901]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71540750">71540750</a> 597 MoaD->MoeB<-?->4Fe-S->AOR*-> Syntrophomonas wolfei str. Goettingen firmicutes Aldehyde ferredoxin oxidoreductase [Syntrophomonas wolfei str. Goettingen]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46200136">46200136</a> 608 AOR*->MoaD-> Thermus thermophilus HB27 deinococci tungsten-containing aldehyde ferredoxin oxidoreductase [Thermus thermophilus HB27]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=51858106">51858106</a> 604 AOR*->MoaD-> Symbiobacterium thermophilum IAM 14863 actinobacteria aldehyde ferredoxin oxidoreductase [Symbiobacterium thermophilum IAM 14863]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=51857711">51857711</a> 603 AOR*->MoaD-> Symbiobacterium thermophilum IAM 14863 actinobacteria aldehyde ferredoxin oxidoreductase [Symbiobacterium thermophilum IAM 14863]
----------------------------------------------------------------------------------------------- --------------
4. Uncharacterized operons with ThiS/ThiF+Rhodanese containing proteins (sulfur metabolism)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
4a. Siderophore biosynthesis (Gis are of the E1+Rhodanese- marked with an asterisk)
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=28192388">28192388</a> 387 Hist_phosphate_NH2transferase->E1+Rhodanese*->JAB->ThiS/MoaD->Trp-dioxygenase->hydroxybenzoate hydroxylase-> Pseudomonas fluorescens proteobacteria>gammaproteobacteria QbsC [Pseudomonas fluorescens]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83645618">83645618</a> 390 E1+Rhodanese->JAB*->ThiS/MoaD->+CaiB-like coA transferase->AMP-acid ligase-> Hahella chejuensis KCTC 2396 proteobacteria>gammaproteobacteria Dinucleotide-utilizing enzyme involved in molybdopterin and thiamine biosynthesis family 2 [Hahella chejuensis KCTC 2396]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82702101">82702101</a> 390 E1+Rhodanese->JAB*->ThiS/MoaD->+CaiB-like coA transferase-> Nitrosospira multiformis ATCC 25196 proteobacteria>betaproteobacteria UBA/THIF-type NAD/FAD binding fold [Nitrosospira multiformis ATCC 25196]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=30181075">30181075</a> 390 E1+Rhodanese->JAB*->ThiS/MoaD->+CaiB-like coA transferase->AMP-acid ligase-> Nitrosomonas europaea ATCC 19718 proteobacteria>betaproteobacteria Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Nitrosomonas europaea ATCC 19718]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83748714">83748714</a> 389 E1+Rhodanese->JAB*->ThiS/MoaD->+CaiB-like coA transferase->AMP-acid ligase-> Ralstonia solanacearum UW551 proteobacteria>betaproteobacteria Molybdopterin biosynthesis MoeB protein [Ralstonia solanacearum UW551]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83748714">83748714</a> 389 E1+Rhodanese->JAB*->ThiS/MoaD-> Ralstonia solanacearum UW551 proteobacteria>betaproteobacteria Molybdopterin biosynthesis MoeB protein [Ralstonia solanacearum UW551]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=5070639">5070639</a> 391 E1+Rhodanese->JAB*->ThiS/MoaD->+CaiB-like coA transferase->AMP-acid ligase-> Pseudomonas stutzeri KC proteobacteria>gammaproteobacteria AF149851_6 MoeB-like protein [Pseudomonas stutzeri KC]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84994030">84994030</a> 390 E1+Rhodanese(PdtF)*->JAB(PdtG)->ThiS/MoaD(PdtH)->+CaiB-like coA transferase(PdtI)->AMP-acid ligase(PdtJ)-> Pseudomonas putida proteobacteria>gammaproteobacteria PdtF [Pseudomonas putida]
----------------------------------------------------
4b. Uncharacterized operon encoding a ThiS/MoaD, a JAB peptidase and E1-like enzyme (gis are of E1+Rhod- marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88807869">88807869</a> 389 JAB->E1+Rhod*-> Synechococcus sp. WH 7805 cyanobacteria gll3412 [Gloeobacter violaceus PCC 7421]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86607093">86607093</a> 387 JAB->ThiS/MoaD->E1+Rhod*-> Cyanobacteria bacterium Yellowstone A-Prime cyanobacteria UBA/THIF-type NAD/FAD binding, MoeZ/MoeB fmaily protein [Anaeromyxobacter dehalogenans 2CP-C]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86609523">86609523</a> 389 JAB->ThiS/MoaD->E1+Rhod*-> Cyanobacteria bacterium Yellowstone B-Prime cyanobacteria putative molybdopterin biosynthesis protein MoeB [Synechococcus sp. JA-2-3B'a(2-13)]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=81298969">81298969</a> 391 JAB->E1+Rhod*-> Synechococcus elongatus PCC 7942 cyanobacteria putative molybdopterin biosynthesis protein MoeB [Synechococcus sp. JA-3-3Ab]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=87300927">87300927</a> 390 JAB->E1+Rhod*-> Synechococcus sp. WH 5701 cyanobacteria Rhodanese-like [Alkalilimnicola ehrlichei MLHE-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=35213984">35213984</a> 395 JAB->ThiS/MoaD->E1+Rhod*-> Gloeobacter violaceus PCC 7421 cyanobacteria Rhodanese-like [Synechococcus elongatus PCC 7942]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78700359">78700359</a> 142 JAB->ThiS/MoaD+Rhodanese+E1*-> Alkalilimnicola ehrlichei MLHE-1 proteobacteria>gammaproteobacteria molybdopterin biosynthesis MoeB protein [Synechococcus sp. WH 5701]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86159911">86159911</a> 390 ThiS/MoaD->E1+Rhod*->JAB-> Anaeromyxobacter dehalogenans 2CP-C proteobacteria>deltaproteobacteria molybdopterin biosynthesis protein [Synechococcus sp. WH 7805]
- JABs in operons with E1+Rhod (No Ub_like) (Gis of E1 containing protein-marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75700942">75700942</a> 390 JAB->E1+Rhod*-> Anabaena variabilis ATCC 29413 cyanobacteria Rhodanese-like MoeZ/MoeB [Anabaena variabilis ATCC 29413]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17132000">17132000</a> 390 JAB->E1+Rhod*-> Nostoc sp. PCC 7120 cyanobacteria molybdopterin biosynthesis protein [Nostoc sp. PCC 7120]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56686316">56686316</a> 391 JAB->E1+Rhod*-> Synechococcus elongatus PCC 6301 cyanobacteria molybdopterin biosynthesis MoeB protein [Synechococcus elongatus PCC 6301]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71676726">71676726</a> 391 JAB->E1+Rhod*-> Trichodesmium erythraeum IMS101 cyanobacteria UBA/THIF-type NAD/FAD binding fold:Rhodanese-like:MoeZ/MoeB [Trichodesmium erythraeum IMS101]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23124399">23124399</a> 390 JAB->E1+Rhod*-> Nostoc punctiforme PCC 73102 cyanobacteria COG0476: Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Nostoc punctiforme PCC 73102]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=87124948">87124948</a> 389 JAB->E1+Rhod*-> Synechococcus sp. RS9917 cyanobacteria Rhodanese-like [Synechococcus sp. RS9917]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78169800">78169800</a> 388 JAB->E1+Rhod*-> Synechococcus sp. CC9902 cyanobacteria Rhodanese-like [Synechococcus sp. CC9902]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72002829">72002829</a> 381 JAB->E1+Rhod*-> Prochlorococcus marinus str. NATL2A cyanobacteria rhodanese-like [Prochlorococcus marinus str. NATL2A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33238703">33238703</a> 379 JAB->E1+Rhod*-> Prochlorococcus marinus subsp. marinus str. CCMP1375 cyanobacteria Prochlorococcus marinus subsp. marinus str. CCMP1375 complete genome
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33635570">33635570</a> 409 JAB->E1+Rhod*-> Prochlorococcus marinus str. MIT 9313 cyanobacteria molybdopterin biosynthesis protein [Prochlorococcus marinus str. MIT 9313]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84513874">84513874</a> 379 JAB->E1+Rhod*-> Prochlorococcus marinus str. MIT 9211 cyanobacteria Dinucleotide-utilizing enzyme [Prochlorococcus marinus str. MIT 9211]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78196401">78196401</a> 378 JAB->E1+Rhod*-> Synechococcus sp. CC9605 cyanobacteria Rhodanese-like [Synechococcus sp. CC9605]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33633363">33633363</a> 377 JAB->E1+Rhod*-> Synechococcus sp. WH 8102 cyanobacteria molybdopterin biosynthesis protein [Synechococcus sp. WH 8102]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76882207">76882207</a> 257 E1*->JAB-> Nitrosococcus oceani ATCC 19707 proteobacteria>gammaproteobacteria Adenylyltransferase [Nitrosococcus oceani ATCC 19707]
----------------------------------------------------
4c. Uncharacterized operon with a ThiS/MoaD, E1-like enzyme, a JAB and a Cysteine synthase (gis are of the cys synthase- marked with an asterisk)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68563152">68563152</a> 305 Cys synthase*->JAB->ThiS/MoaD->E1+Rhodanese-> Rubrobacter xylanophilus DSM 9941 actinobacteria Cysteine synthase K/M [Rubrobacter xylanophilus DSM 9941]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83815753">83815753</a> 317 Cys syn*->JAB->ThiS/MoaD->E1+Rhodanese-> Salinibacter ruber DSM 13855 bacteroidetes/chlorobi cysteine synthase B [Salinibacter ruber DSM 13855]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83757147">83757147</a> 317 Cys syn*->JAB->ThiS/MoaD->E1+Rhod-> Salinibacter ruber DSM 13855 bacteroidetes/chlorobi cysteine synthase B [Salinibacter ruber DSM 13855]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76258730">76258730</a> 308 Cys synthase*->JAB->ThiS/MoaD->E1+Rhodanese-> Chloroflexus aurantiacus J-10-fl chloroflexi Cysteine synthase K/M [Chloroflexus aurantiacus J-10-fl]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67932284">67932284</a> 319 Cys syn*->JAB->ThiS/MoaD->E1+Rhodanese-> Solibacter usitatus Ellin6076 fibrobacteres/acidobacteria Cysteine synthase K/M [Solibacter usitatus Ellin6076]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78493973">78493973</a> 304 JAB->E1+Rhodanese->Cys synthase*-> Rhodopseudomonas palustris BisB18 proteobacteria>alphaproteobacteria Pyridoxal-5'-phosphate-dependent enzyme, beta subunit [Rhodopseudomonas palustris BisB18]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9948117">9948117</a> 392 Cys synthase*->E1+Rhodanese-> Pseudomonasaeruginosa PAO1; proteobacteria>gammaproteobacteria AE004638_1 probable molybdopterin biosynthesis protein MoeB [Pseudomonas aeruginosa PAO1]
----------------------------------------------------
4d. Uncharacterized operon with a ThiS/MoaD/MoaD, JAB, Cysteine synthase and ClpS (gis of Cys synthases)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13880986">13880986</a> 323 ClpS->alpha_helical_domain->JAB->ThiS/MoaD->Cys synthase*-> Mycobacterium tuberculosis CDC1551; actinobacteria cysteine synthase [Mycobacterium tuberculosis CDC1551]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=54014566">54014566</a> 320 ClpS->alpha_helical_domain->dmpA_peptidase->JAB->ThiS/MoaD->Cys synthase*-> Nocardia farcinica IFM 10152; actinobacteria putative cysteine synthase [Nocardia farcinica IFM 10152]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29608823">29608823</a> 316 ClpS->alpha_helical_domain->permease->JAB->ThiS/MoaD->Cys synthase*-> Streptomyces avermitilis; MA-4680 actinobacteria putative cysteine synthase [Streptomyces avermitilis MA-4680]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68231907">68231907</a> 315 ClpS->alpha_helical_domain->JAB->ThiS/MoaD->Cys synthase*-> Frankia sp. EAN1pec; actinobacteria Cysteine synthase K/M [Frankia sp. EAN1pec]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86739581">86739581</a> 315 ClpS->alpha_helical_domain->JAB->ThiS/MoaD->Cys synthase*-> Frankia sp. CcI3; actinobacteria cysteine synthases [Frankia sp. CcI3]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71916499">71916499</a> 315 ClpS->alpha_helical_domain->JAB->ThiS/MoaD->Cys synthase*-> Thermobifida fusca YX; actinobacteria cysteine synthase K/M [Thermobifida fusca YX]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71366891">71366891</a> 320 alpha_helical_domain->MutT->JAB->ThiS/MoaD->Cys synthase*-> Nocardioides sp. JS614; actinobacteria Cysteine synthase K/M [Nocardioides sp. JS614]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=5531359">5531359</a> 316 JAB->alpha_helical_domain->ThiS/MoaD->Cys Syn*->alpha_helical_domain<-MBL Streptomyces coelicolor A3(2); actinobacteria putative cysteine synthase [Streptomyces coelicolor A3(2)]
Alignment of rapidly diverging alpha helical protein
ALIGN -------------EE--HHHHHHHHHHHHHHHHHHH-------------------HHHHH-------HH------------------------------EEE-----------------------HHHHHHHHH--HHHHHHHHHHHHHHHHH---------------------HH-HEEE--HHHHHHHHHHHHHHHHHHHHHHH-------------------------HHHHHHHHHHHHHHHHHH---
HMM -----------HEEEEHHHHHHHHHHHHHHHHHHHHH---------H-------HHHHHH-------HE----------------------------EEEEEE---------------------HHHHHHHHHH-HHHHHHHHHHHHHHHHH----------------------EEEEEEE--HHHHHHHHHHHHHHHHHHHH-EEE----HHHH-----H-------HHHHHHHHHHHHHHHHHHHHHH--
FREQ ---HHH------HEE---HHHHHHHHHHHHHHHHHHH------------------HHHHH-------H------------------------------H-H------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHH----------------H-HHH----HHEHHHHHHHHHHHHHHHHHHHHHHHHH-------------------------HHHHHHHHHHHHHHHHHHHH--
PSSM ------------EEEEEHHHHHHHHHHHHHHHHHHH--------------------HHHH-------H------------------------------EEEE-----------------------HHHHHHHH----HHHHHHHHHHHHHHHH------------------------EEEEE-HHHHHHHHHHH---EEEEEEE-----------------------HHHHHHHHHHHHHHHHHHHHHH---
FINAL ------------EEEE-HHHHHHHHHHHHHHHHHHH-------------------HHHHH-------H------------------------------EEEE----------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------------EEEEE-HHHHHHHHHHHHHHHHHHHHH------------------------HHHHHHHHHHHHHHHHHHHHHH--
NocaDRAFT_2640_Nsp._71366887 MSGFQRHRRSKLIIANFTGFEADLLRSLAGQLVELLRNEAAVPRDPV-------DPFEAM-------MDF------------------SGPTQEPEDPVLARLFPTAYPGD-------------QEAASEFRRFTEGTLRDGKAAAAVAIIDGL--------EEAGLPPELTEDGLMIDIELDEATAETWMRSFTDLRLALATRLEVEEGDDAYW-----HSLPDDDPRAQAHDIYEWVGYLQETLVQALSG
Lxx13320_Lxyl_50951464 MRPFRRTRDGT-LRARFEPDEAEILARLAAETAELAV-----------------DAA-------------------------------SGAGDPREDPAFIRLLPDAYSGD-------------AEASAEFRRFTAGGLAERKALTAQVVMETL--------GGGSG---------AIEVRLDAPQAAAWLRTLTDIRLVLAARLGIVQDGDEG-------DIHDAD-SAFRRAVYDWLAGVQESLVLALRS
BlinB01002436_Blin_62424056 --MAAIDARGDDVVLKLEDNERSLMLTVFTDLAALLAEDDNEDGRPD------SENWEARLG--------------------------LVERPRPQDPALLRLFPDVDPLDE-------------ERSREFRRLTEFDLQQAKAHNVRIVLNGL---------AKGS-----------SITLNHDEVLAWMKGLNDLRLVLAVRMGIDTEEAQEEKYAQREDL--DESEELTLTLYDFLTWIQDRLTTTLLS
clpS_Jsp._84494379 AFARKGKGKNLRYAAKLDAVERAVVAGLMEQVHDLVAPEPEEAVATGPSGASDHDDDFAAIVSGLGGLGMGVSISAEDQVADDRPVPADARSFGDRDPALERLLPAGNRAD-------------DQVSAEFRRLTEHGLRQRKAGHLESAITSL--------RAPGS-----------GVELDERAAIDMVIALTDVRLVLGERLGLREDADVDRLEEELADVDDDDPRGHAMSVYDFLTWLQETLATAMLP
cg<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2770">2770</a>_Cglu_41326695 WKKKKGLMRQARYAVVFEPMEREVLGDLSAAVSEALIQRAQS--VPK-------DPLAEMTGMT------------------------SGHKEAPTDPALARLLPDFQHEGD---------EEYDGDNSFLRSLHEGDITRAKLENLRVINDAL--------GPDGN----------VAVTASEEEAHAWLAALNDIRLYVASG-DVRGGEAAE---------------EDRENLVQWLAYNQESLLEAMMN
_Ceff_23494252 WKRRKALMRSARYTCVLEPMEREVLGNLSAVVLEALIHRAQD--APK-------DPLAELTGIP------------------------SGHKEAPRDPALARLLPDFQQEGD---------EEYDGDNSLLRSLHENDITRQKIANLQVINSAL--------GPDGG----------VAVSIPEEEAHAWLAGLNDIRLYLASG-ELKGGEAAE---------------EDRENLVQWLAYNQESLLEAMMG
DIP1856_Cdip_38200689 WKKKKGLFKGARYQCTLEPIEREVLGNLAANISEVLISRAQS--APK-------DELAELTGMG------------------------GGHTEAPEDPGLARLLPDFEMQGD---------EEFDGDNSLLRSLHENDITRAKLANLQTIGQAL--------GPDGS----------VFVTVTEEEAQAWVAGLNDIRLYLASSE-VQDTEDRD-------------------ALVEWLAFAQESLLTAMMG
jk0494_Cjei_68263163 WTKKNSLLRGTRFNTQLEPLEREMLGDSAVAVSDKLMERART--APK-------DELAEMTGMA------------------------SGHADAPKDPGLARLLPSFFREGD---------EEVDGDAALTRQLNETDIIKTKLSNLRFVVDYL--------GPNGS----------VNVSLTQDEVHPWLSAINDIRLYHSAQYEEFKKELL-------EGEENSDQATAAQNYLDWLGYHQDSLLSAMMG
nfa10870_Nfar_54014562 KWTRKNSLGGLKLRAEMDAHEAEVLRSLVGAVSGLLAERAQS--APE-------DELSALTGLR------------------------TGNTAPPDDPRLARLLPDFHRSEPGSPDADRA-----GLNSALRALHEPEIIDAKLAAGSVVLDTV--------PARGG-----------KIVLTPEQADAWLSALTDVRLALGTVLGIDAETP--------DQLDPDDPRAPHLDVYHWLTWMQDSLLQALAP
SCO2915_Scoe_5531364 MPGQFEPLPGGGAAVALDDVEISIIRSLAVQLLELIGPGPAED-ASD-------DPLAELFA--------------------------EGPSEPPSDPVLRRLFPDAYGDPEGAPQAREA-EEQRAHSAEFRRYTENDLRAGKRDNALAVVRTLDTLSSASAGEEGA-----------VLKLSPQESQQWLRALNDLRLAIGSRLEIADEDDTDLLYR----LPDEDPRKPMVMAYLWLGGLQESLVATLMP
SAV5160_Save_29608819 MPGHFEPLPGGGAAVALDEVEISIIRSLAVQLLELIGPGPAED-AAA-------DPLAELFA--------------------------EGPSEPPSDPVLQRLFPDAYGGPGGEGGSPEEAEEQRAHSSEFRRFTENDLRAGKRENALVVIRTL--DGMTVAGEGGA-----------VLKLSPEESRQWLGSLNDLRLAIGSRLDVVDEEDTDLLYR----LPDEDPRKPMVMAYLWLGGLQETLIETLMS
Francci3_0865_Fsp._86739578 DVADGFRRTRAGIELRLPRLEAALLIELVGQIESLLEPPP-----VE-------DPLEALVGLR------------------------DTAPPPPDDPAIARLLPDPYPDD-------------PMASGDFRRRRTDDLLARKRDAARRVLSAV--------PAPGR-----------ALLLDEEAAQDWLTTLNDLRLVLGTRLGLTDDDSTAEL----EHLDPDDSRRPLVAVYAFLTELLDDLTRALG-
Franean1DRAFT_3648_Fsp._68231910 --MNGFRRTRAGIELRLPRLESSLLTELLGQVDALLEAPP-----VD-------DPLEALVGLR------------------------DTAPPPPEDPAVARLLPDPYPDD-------------PLASGDFRRRRTDEALARKRDAARRVLAAV--------PAPGA-----------VLVLDEDAAQDWLTVLNDLRLVLGTRLGLTDDESTAELEN----LTPEDPRRPVAAVYAFLTELLDELTRALL-
KradDRAFT_2533_Krad_67987809 -MATFRRTRNGHFSLTLHAAEADLLASLAREVLELLEVPAAAPPRPV-------DPLQAELGLS----------DLPGFDTPLDDLAGDGPVAPPEDEVLRRLLPDAYGDD-------------PDASADFRRFTERGLRERKAAAASGLLAGL-----APVEGQGG-----------RVQLDADGARTWLAALNDIRLALGTRLGVSEDADPD------ADLAEDDPARWAWAVYDFTTHLQETLVRSLS-
Tfu_2371_Tfus_71916502 MTAKIRSAPHGGARITIGPDEAQLLRSMADFLLRVVEEPEQ-----Q-------DELAALVGIS-------------------------SSATQPEDPALARLFPDAYTDD-------------AEAAADFRRYTESDLRRHKRENARRVASAI--------PEWGG-----------EIVLDAEDVQAWLQTLTDVRLYLGVRLGIETEEDADAL---RAAAVRDESLAAAMHVYEWFTYVQDSLVRAVWQ
ArthDRAFT_1846_Asp._66965396 -MAKAFKYGIKGITGYLEPAERELLRSLIDDVISMLQPAES---ASE-------DPLTALIGLD-------------------------MNVREPSDRALRRLLPNVTKDD-------------DAASLEFRQLTERSLRENKIGALRAAALGL----------DTN-----------ELVLSQADARHWSQALNDVRLVLAERLDIRDDADAEHVHTMQDWSQAEDVESYLALVYNFTTWLQESLVQAMLQ
MT1374_Mtub_13880982 WKRVET-RDGPRFRSSLAPHEAALLKNLAGAMIGLLDDRDSS--SPS-------DELEEITGIK------------------------TGHAQRPGDPTLRRLLPDFYRPDDLDDDDPTAVDGSESFNAALRSLHEPEIIDAKRVAAQQLLDTV--------PDNGG-----------RLELTESDANAWIAAVNDLRLALGVMLEIGPRGP--------ERLPGNHPLAAHFNVYQWLTVLQEYLVLVLMG
MtubF_01001398_Mtub_76784817 WKRVET-RDGPRFRSSLAPHEAALLKNLAGAMIGLLDDRDSS--SPS-------DELEEITGIK------------------------TGHAQRPGDPTLRRLLPDFYRPDDLDDDDPTAVDGSESFNAALRSLHEPEIIDAKRVAAQQLLDTV--------PDNGG-----------RLELTESDANAWIAAVNDLRLALGVMLEIGPRGP--------ERLPGNHPLAAHFNVYQWLTVLQEYLVLVLMG
MAP2428c_Mavi_41408526 WKRVET-AEGPRFRSALASHEAALLKNLATAMIGLLDERESS--SPA-------DELEEITGIK------------------------TGNAQPPKDPTLRRLLPDFYRPDDNGDESPDAAE---SLNAALRSLHEPGIVNAKRVAAQRLLGTV--------PDDGG-----------RFELTEDDANAWIAAVNDIRLTLGVMLEIGPDGP--------ERLPADHPLAVHFDVYQWLTVLQEYLVLVLMG
_Mlep_466922 WKRVET-ANGPRFRSVVAPHEVALLKHLVGALLGLLNERESS--SPL-------DELEVITGIK------------------------AGNAQRPEDPTLRRLLPDFYTPDDKDQLDPAALDAVDSLNAALRSLHEPEIVDAKRSAAQQLLDTL--------PESDG-----------RLELTEASANAWIAAVNDLRLALGVILEIDRPAP--------ERVPAGHPLSVHFDVYQWLTVLQEYLVLALMA
ML1166_Mlep_13093139 WKRVET-ANGPRFRSVVAPHEVALLKHLVGALLGLLNERESS--SPL-------DELEVITGIK------------------------AGNAQRPEDPTLRRLLPDFYTPDDKDQLDPAALDAVDSLNAALRSLHEPEIVDAKRSAAQQLLDTL--------PESDG-----------RLELTEASANAWIAAVNDLRLALGVILEIDRPAP--------ERVPAGHPLSVHFDVYQWLTVLQEYLVLALMA
consensus/100% ............h...h...E..hh..........h..................-.........................................D..h.RLhPs.....................s...R.bp...h...K......h...l...........s.............h.hs...s..h..shsDlRL..u................................hh.a.s..b-.L..sh..
consensus/90% ............h...h...E..ll.plhs.h..hl.........s........D.h..b.s...........................s....PpDPsl.RLhPs....s................su.hRphpp..l...K..sh..l..sl...........ss............l.ls...s..Wh.slsDlRLhlus.b.l....s......................hh.ahs..Q-.L..sh..
Species abbreviations: Asp. : Arthrobacter sp.; Blin : Brevibacterium linens; Cdip : Corynebacterium diphtheriae; Ceff : Corynebacterium efficiens; Cglu : Corynebacterium glutamicum; Cjei : Corynebacterium jeikeium; Fsp. : Frankia sp.; Jsp. : Janibacter sp.; Krad : Kineococcus radiotolerans; Lxyl : Leifsonia xyli; Mavi : Mycobacterium avium; Mlep : Mycobacterium leprae; Mtub : Mycobacterium tuberculosis; Nfar : Nocardia farcinica; Nsp. : Nocardioides sp.; Save : Streptomyces avermitilis; Scoe : Streptomyces coelicolor; Tfus : Thermobifida fusca
Miscellaneous operons
Rhodanese+E1 (no JABs in operons- gis are of the Rhodanese+E1 protein)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71898141">71898141</a> 379 Rhodanese+E1 Xylella fastidiosa Ann-1 proteobacteria>gammaproteobacteria UBA/THIF-type NAD/FAD binding fold:Rhodanese-like:MoeZ/MoeB [Xylella fastidiosa Ann-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71900908">71900908</a> 386 Rhodanese+E1 Xylella fastidiosa Ann-1 proteobacteria>gammaproteobacteria UBA/THIF-type NAD/FAD binding fold:MoeZ/MoeB [Xylella fastidiosa Ann-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9105314">9105314</a> 379 Rhodanese+E1 Xylella fastidiosa 9a5c proteobacteria>gammaproteobacteria AE003897_1 molybdopterin biosynthesis protein [Xylella fastidiosa 9a5c]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77747707">77747707</a> 379 Rhodanese+E1 Xylella fastidiosa Temecula1 proteobacteria>gammaproteobacteria molybdopterin biosynthesis protein MoeB [Xylella fastidiosa Temecula1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78036060">78036060</a> 401 MoeA-><-Rhodanese+E1 Xanthomonas campestris pv. vesicatoria str. 85-10; proteobacteria>gammaproteobacteria molybdopterin biosynthesis protein MoeB [Xanthomonas campestris pv. vesicatoria str. 85-10]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=58426731">58426731</a> 472 Rhodanese+E1 Xanthomonas oryzae pv. oryzae KACC10331 proteobacteria>gammaproteobacteria molybdopterin biosynthesis protein [Xanthomonas oryzae pv. oryzae KACC10331]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21108248">21108248</a> 380 Rhodanese+E1 Xanthomonas axonopodis pv. citri str. 306 proteobacteria>gammaproteobacteria molybdopterin biosynthesis protein [Xanthomonas axonopodis pv. citri str. 306]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84367975">84367975</a> 379 Rhodanese+E1 Xanthomonas oryzae pv. oryzae MAFF 311018 proteobacteria>gammaproteobacteria molybdopterin biosynthesis protein [Xanthomonas oryzae pv. oryzae MAFF 311018]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21113107">21113107</a> 378 Rhodanese+E1 Xanthomonas campestris pv. campestris str. ATCC 33913 proteobacteria>gammaproteobacteria molybdopterin biosynthesis protein [Xanthomonas campestris pv. campestris str. ATCC 33913]
----------------------------------------------------
4e. Operons with genes for sulfur metabolism proteins
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Domain abbreviations:
SirA-like redox proteins IF3C-fold, regulator of disulfide bond formation?, (note in some instances this protein is fused to a Rhodanese)
OAHShyd: typically O-acetylhomoserine/serine sulfhydrylase/Methionine lyase; PLP dependent transferase superfamily
DsrE/H: ancient family, Conserved cysteine, often fused and solo versions, also in archaea,
involved in sulfur reduction, YchN-like fold, perhaps a breakaway Rossmannoid, DsrH like proteins
are involved in oxidation of intracellular sulfur (pdb: 1l1s ): solo gi:67938822
PAPSR: Phosphoadenosine phosphosulfate reductase
ATP_sulf: ATP sulfurylase
Gis are of the SirA or MoaD/ThiS protein -marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67938823">67938823</a> 82 ThiS/MoaD->OAHShyd->E1 solo->JAB->DsrE/H->SirA*-> Chlorobium phaeobacteroides BS1; bacteroidetes/chlorobi SirA-like [Chlorobium phaeobacteroides BS1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68208690">68208690</a> 80 PAPSR->ATP_sulf->Sulf_adenyltransf_large->ThiS/MoaD->E1->JAB->Sulf_reductase(Fe-S binding protein)->SirA*-> Desulfitobacterium hafniense DCB-2; firmicutes SirA-like [Desulfitobacterium hafniense DCB-2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77996033">77996033</a> 72 sulfite_reductase->E1->ThiS/MoaD*->Sulf_adenylyltransferase->4Fe-S->Adenylylsulfate_reductase->?->Adenylylsulfate_kinase Carboxydothermus hydrogenoformans Z-2901; firmicutes thiamine biosynthesis protein ThiS [Carboxydothermus hydrogenoformans Z-2901]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67873788">67873788</a> 81 PAPSR->ATP_sulf->Sulf_adenyltransf_large->ThiS/MoaD->E1->JAB->Sulf_reductase(Fe-S binding protein)->SirA*-> Clostridium thermocellum ATCC 27405; firmicutes SirA-like [Clostridium thermocellum ATCC 27405]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29894496">29894496</a> 77 SirA+Rhodanese->Hydroxyacylglutathione hydrolase->SirA*->Rhod->Rhod-> Bacillus cereus ATCC 14579; firmicutes Molybdopterin biosynthesis MoeB protein [Bacillus cereus ATCC 14579]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82499134">82499134</a> 82 ABC sulfate transporter->ThiS/MoaD->E1->JAB->sulf_reductaseFe-S binding protein)->SirA*->OAHShyd->Adenylylsulfreduct->Ferredoxin->ATP_sulf->PAPSR-> Caldicellulosiruptor saccharolyticus DSM 8903; firmicutes conserved hypothetical protein [Caldicellulosiruptor saccharolyticus DSM 8903]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78194036">78194036</a> 74 OAHShyd->ThiS/MoaD->E1solo->JAB->Sulf_reductase(Fe-S binding protein)->SirA*-> Geobacter metallireducens GS-15; proteobacteria>deltaproteobacteria conserved hypothetical protein [Geobacter metallireducens GS-15]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=18160982">18160982</a> 88 <-PAPSR<-?<-Sulfite_reductase<-?->ThiS/MoaD*->Rhod+Rhod-> Pyrobaculum aerophilum str. IM2; crenarchaeota conserved hypothetical protein [Pyrobaculum aerophilum str. IM2]
Operons lacking sirA (gis are of the ThiF/E1-like protein-marked with an asterisk)
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=34483109">34483109</a> 272 PAPSR->ATP_sulf->Sulf_adenyltransf_large->ThiS/MoaD->E1*->JAB->Sulf_reductase(Fe-S binding protein)-> Wolinella succinogenes; proteobacteria>epsilonproteobacteria MOLYBDOPTERIN BIOSYNTHESIS PROTEIN MOEB [Wolinella succinogenes]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77686500">77686500</a> 269 CysTRNAsyn_deacylase->ThiS/MoaD->E1*->JAB->Sulf_reductase(Fe-S binding protein)-> Alkaliphilus metalliredigenes QYMF firmicutes UBA/THIF-type NAD/FAD binding fold:MoeZ/MoeB [Alkaliphilus metalliredigenes QYMF]
ThiS/MoaD+Sulf_reductase containing operon subtype (Gis of ThiS/MoaD)
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71366157">71366157</a> 648 Sulf_reductase+ThiS/MoaD*->PAPSR-> Nocardioides sp. JS614 actinobacteria Ferredoxin--nitrite reductase [Nocardioides sp. JS614]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88931571">88931571</a> 639 Sulf_reductase+ThiS/MoaD*->PAPSR-> Acidothermus cellulolyticus 11B actinobacteria Ferredoxin--nitrite reductase [Acidothermus cellulolyticus 11B]
-------------------------------------------------------------------------------------------------------------
5. Phage Tail assembly associated Ub
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
gp: Phage containing Ub domain; also called I-tail component; gpK-JAB
J: host specificity protein J; STF : lambda side tail fiber protein
- Operons of the type JAB+NlpC->Ub->gpJ (Gis are of the JAB protein- Marked with an asterisk)
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=38707909">38707909</a> 194 JAB+NlpC*->Ub->gpJ-> Bacteriophage phi1026b bacteriophages gp19 [Bacteriophage phi1026b]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76556246">76556246</a> 226 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Phage BP-4795 bacteriophages putative tail component [Phage BP-4795]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71834086">71834086</a> 191 gpM->gpL->Ub->gpJ-> Bacteriophage JK06 bacteriophages hypothetical tail assembly protein I [Bacteriophage JK06]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77864688">77864688</a> 187 gpL->JAB+NlpC*->Ub->gpJ-> Burkholderia cepacia phage Bcep176 bacteriophages gp63 [Burkholderia cepacia phage Bcep176]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=80750693">80750693</a> 190 gpL->HNH->JAB+NlpC*->Ub->gpJ-> Bacteriophage RTP bacteriophages putative tail assembly protein [Bacteriophage RTP]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46402106">46402106</a> 197 gpL->JAB+NlpC*->?->Ub->gpJ+X-> Bacteriophage phiKO2 bacteriophages Gp20 [Bacteriophage phiKO2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17975181">17975181</a> 194 gpL->JAB+NlpC*->Ub->gpJ-> Bacteriophage phiE125 bacteriophages putative tail component protein [Bacteriophage phiE125]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=11877308">11877308</a> 240 gpL->JAB+NlpC*->Ub->gpJ-> Neisseria meningitidis phage 2120 bacteriophages putative protein I [Neisseria meningitidis phage 2120]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9630484">9630484</a> 192 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Enterobacteria phage N15 bacteriophages gp20 [Bacteriophage N15]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=215124">215124</a> 223 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Enterobacteria phage lambda bacteriophages I (tail component;223) [bacteriophage lambda]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=51773733">51773733</a> 180 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Bacteriophage CP-1639 bacteriophages putative tail fiber component I [Bacteriophage CP-1639]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9634139">9634139</a> 202 gpL->JAB+NlpC*->?->Ub->?->?<-?->gpJ-> Enterobacteria phage HK022 bacteriophages gp21 [Enterobacteria phage HK022]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=45686326">45686326</a> 199 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Enterobacteria phage T1 bacteriophages putative tail assembly protein [Enterobacteria phage T1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84357775">84357775</a> 150 gpM->gpL->JAB+NlpC*->Ub solo->gpJ-> Burkholderia cenocepacia PC184 proteobacteria>betaproteobacteria COG4723: Phage-related protein, tail component [Burkholderia cenocepacia PC184]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83717443">83717443</a> 194 gpL->JAB+NlpC*->Ub->gpJ-> Burkholderia thailandensis E264 proteobacteria>betaproteobacteria Bacteriophage lambda tail assembly protein I [Burkholderia thailandensis E264]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76579036">76579036</a> 188 gpL->JAB+NlpC*->Ub->gpJ->lysozyme-> Burkholderia pseudomallei 1710b proteobacteria>betaproteobacteria Bacteriophage lambda tail assembly protein I [Burkholderia pseudomallei 1710b]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=16419562">16419562</a> 234 JAB+NlpC*->Ub->gpJ->STF-> Salmonella typhimurium LT2 proteobacteria>gammaproteobacteria Gifsy-2 prophage probable tail assembly protein [phage Gifsy-2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83587164">83587164</a> 190 gpM->gpL->JAB+NlpC*->Ub->gpJ(N)->gpJ(C)-> Escherichia coli 101-1 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli 101-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75208766">75208766</a> 180 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Escherichia coli B171 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli B171]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75210818">75210818</a> 182 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Escherichia coli B171 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli B171]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75208867">75208867</a> 144 JAB+NlpC*->Ub->gpJ-> Escherichia coli B171 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli B171]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75211970">75211970</a> 190 gpM->gpL->JAB+NlpC*->Ub->gpJ(N)->gpJ(C)-> Escherichia coli B171 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli B171]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75229909">75229909</a> 190 gpL->JAB+NlpC*->Ub->gpJ(N)->gpJ(C)-> Escherichia coli B7A proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli B7A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=26107858">26107858</a> 210 gpL->JAB+NlpC*->Ub->gpJ(N)->gpJ(C)-> Escherichia coli CFT073 proteobacteria>gammaproteobacteria AE016759_331 Putative tail component of prophage [Escherichia coli CFT073]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=26107735">26107735</a> 204 gpL-><-?->JAB+NlpC*->Ub->gpJ-> Escherichia coli CFT073 proteobacteria>gammaproteobacteria AE016759_208 Putative tail assembly protein of cryptic prophage [Escherichia coli CFT073]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=26109404">26109404</a> 210 gpL<-?->JAB+NlpC*->Ub->gpJ-> Escherichia coli CFT073 proteobacteria>gammaproteobacteria AE016765_9 Putative tail component of prophage [Escherichia coli CFT073]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75239817">75239817</a> 180 JAB+NlpC*->Ub->gpJ-> Escherichia coli E110019 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E110019]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75235846">75235846</a> 193 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Escherichia coli E110019 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E110019]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=16421139">16421139</a> 215 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Salmonella typhimurium LT2 proteobacteria>gammaproteobacteria Gifsy-1 prophage protein [Salmonella typhimurium LT2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75255450">75255450</a> 193 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Escherichia coli E22 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E22]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75255278">75255278</a> 193 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Escherichia coli E22 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E22]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=24374467">24374467</a> 209 NlpC(fragment)->?->Ub->gpJ-> Shewanella oneidensis MR-1 proteobacteria>gammaproteobacteria prophage LambdaSo, tail assembly protein I [Shewanella oneidensis MR-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75258709">75258709</a> 130 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Escherichia coli E22 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E22]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75257430">75257430</a> 180 Bro-NJAB+NlpC*->Ub->gpJ(N->gpJ(middle)->gpJ(C)-> Escherichia coli E22 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E22]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75259495">75259495</a> 182 gpL->JAB+NlpC*->Ub->gpJ->gpM-> Escherichia coli E22 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E22]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75175531">75175531</a> 193 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Shigella boydii BS512 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Shigella boydii BS512]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75239568">75239568</a> 182 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Escherichia coli F11 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli F11]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75239670">75239670</a> 190 gpM->gpL->JAB+NlpC*->Ub->gpJ(N)-> Escherichia coli F11 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli F11]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=12514222">12514222</a> 300 gpL->JAB+NlpC*->JAB+Ub->gpJ(N)->gpJ(Fn3+C)-> Escherichia coli O157:H7 EDL933 proteobacteria>gammaproteobacteria AE005290_12 putative tail component encoded by cryptic prophage CP-933M; partial [Escherichia coli O157:H7 EDL933]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=12515098">12515098</a> 225 gpM->gpL->JAB+NlpC*->Ub solo->gpJ->gpM-> Escherichia coli O157:H7 EDL933 proteobacteria>gammaproteobacteria AE005349_14 putative tail component of prophage CP-933O [Escherichia coli O157:H7 EDL933]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=12516097">12516097</a> 178 gpM->gpL->JAB+NlpC*->Ub solo->gpJ->gpM-> Escherichia coli O157:H7 EDL933 proteobacteria>gammaproteobacteria AE005420_1 putative tail fiber component I of prophage CP-933U [Escherichia coli O157:H7 EDL933]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46143649">46143649</a> 181 gpM->gpL->JAB+NlpC*->Ub solo->gpJ-> Actinobacillus pleuropneumoniae serovar 1 str. 4074 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Actinobacillus pleuropneumoniae serovar 1 str. 4074]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82543715">82543715</a> 204 gpL->JAB+NlpC*->Ub->gpJ-> Shigella boydii Sb227 proteobacteria>gammaproteobacteria putative tail component [Shigella boydii Sb227]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=32043835">32043835</a> 200 gpM->gpL->JAB+NlpC*->Ub solo->gpJ-> Pseudomonas aeruginosa UCBPP-PA14 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Pseudomonas aeruginosa UCBPP-PA14]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75259293">75259293</a> 193 gpM->gpL->JAB+NlpC*->Ub solo->gpJ-> Escherichia coli E22 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E22]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75234649">75234649</a> 193 gpM->gpL->JAB+NlpC*->Ub solo->gpJ-> Escherichia coli E110019 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E110019]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75176997">75176997</a> 172 gpM->gpL->JAB+NlpC*->Ub solo->gpJ-> Shigella boydii BS512 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Shigella boydii BS512]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75238944">75238944</a> 190 gpM->gpL->JAB+NlpC*->Ub solo->gpJ-> Escherichia coli E110019 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E110019]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9946516">9946516</a> 200 gpM->gpL->JAB+NlpC*->Ub solo->gpJ-> Pseudomonas aeruginosa PAO1 proteobacteria>gammaproteobacteria AE004499_8 probable bacteriophage protein [Pseudomonas aeruginosa PAO1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75820383">75820383</a> 200 gpM->gpL->JAB+NlpC*->Ub solo->gpJ-> Vibrio cholerae V51 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Vibrio cholerae V51]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74312870">74312870</a> 210 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Shigella sonnei Ss046 proteobacteria>gammaproteobacteria putative tail component of prophage [Shigella sonnei Ss046]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13361702">13361702</a> 226 gpM->gpL->JAB+NlpC*->Ub->gpJ(N)->gpJ(C)-> Escherichia coli O157:H7 proteobacteria>gammaproteobacteria putative tail assembly protein [Escherichia coli O157:H7 str. Sakai]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13361111">13361111</a> 223 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Escherichia coli O157:H7 proteobacteria>gammaproteobacteria tail assembly protein [Escherichia coli O157:H7 str. Sakai]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13362414">13362414</a> 225 gpM->gpL->JAB+NlpC*->Ub->gpJ(N)->gpJ(C)-> Escherichia coli O157:H7 proteobacteria>gammaproteobacteria putative tail assembly protein [Escherichia coli O157:H7 str. Sakai]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13360300">13360300</a> 215 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Escherichia coli O157:H7 proteobacteria>gammaproteobacteria putative tail assembly protein [Escherichia coli O157:H7 str. Sakai]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74312266">74312266</a> 180 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Shigella sonnei Ss046 proteobacteria>gammaproteobacteria putative tail component of prophage [Shigella sonnei Ss046]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84318835">84318835</a> 200 gpM->gpL->JAB+NlpC*->Ub->gpJ(N)-> Pseudomonas aeruginosa C3719 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Pseudomonas aeruginosa C3719]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56383531">56383531</a> 180 gpM->gpL->JAB+NlpC*->Ub->gpJ->gpM-> Shigella flexneri 2a str. 301 proteobacteria>gammaproteobacteria putative tail component [Shigella flexneri 2a str. 301]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68345404">68345404</a> 188 ** Bro-N->KilA-N+C->Ub->gpJ->P5-> Pseudomonas fluorescens Pf-5 proteobacteria>gammaproteobacteria prophage LambdaSo, tail assembly protein I [Pseudomonas fluorescens Pf-5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=24050968">24050968</a> 191 gpM->gpL->JAB+NlpC*->Ub->gpJ->gpM-> Shigella flexneri 2a str. 301 proteobacteria>gammaproteobacteria putative tail component [Shigella flexneri 2a str. 301]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71037999">71037999</a> 187 gpL->JAB+NlpC*->Ub->gpJ-> Psychrobacter arcticus 273-4 proteobacteria>gammaproteobacteria probable phage protein tail protein [Psychrobacter arcticus 273-4]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52788057">52788057</a> 195 gpM->gpL->JAB+NlpC*->Ub->STF (distinct tail fiber protein)-> Yersinia pestis proteobacteria>gammaproteobacteria phage lambda tail assembly protein I [Yersinia pestis]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75254904">75254904</a> 226 Ub->gpJ(N->) EscherichiacoliE22 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E22]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2996351">2996351</a> 183 gpM->gpL->JAB+NlpC*->Ub->host_specificity_J-> Yersinia pestis KIM proteobacteria>gammaproteobacteria unknown [Yersinia pestis KIM]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=66046010">66046010</a> 192 JAB+NlpC*<-?->Ub<-?->gpJ-> Pseudomonas syringae pv. syringae B728a proteobacteria>gammaproteobacteria Bacteriophage lambda tail assembly I [Pseudomonas syringae pv. syringae B728a]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=16506034">16506034</a> 195 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Salmonella enterica subsp. enterica serovar Typhi str. CT18 proteobacteria>gammaproteobacteria putative phage tail protein [Salmonella enterica subsp. enterica serovar Typhi str. CT18]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=62179803">62179803</a> 168 gpM->gpL->JAB+NlpC*->Ub->gpJ-> Salmonella enterica subsp. enterica serovar Choleraesuis str. proteobacteria>gammaproteobacteria Gifsy-1 prophage VtiI [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=16419434">16419434</a> 225 JAB+NlpC*->gpM->gpL->Ub-><-superoxide_dismutase->host_specificity_J-> Salmonella typhimurium LT2 proteobacteria>gammaproteobacteria putative Fels-1 prophage tail assembly protein [phage Fels-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75208698">75208698</a> 193 gpM->gpL->JAB+NlpC*->Ub-><-superoxide_dismutase->host_specificity_J-> Escherichia coli B171 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli B171]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75235151">75235151</a> 193 gpM->gpL->JAB+NlpC*->Ub-><-superoxide_dismutase->host_specificity_J-> Escherichia coli E110019 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E110019]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75214996">75214996</a> 193 gpM->gpL->JAB+NlpC*->Ub-><-superoxide_dismutase->host_specificity_J-> Escherichia coli E110019 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E110019]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75233804">75233804</a> 193 JAB+NlpC*->gpM->gpL->JAB+NlpC*->Ub-><-superoxide_dismutase->host_specificity_J-> Escherichia coli E110019 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli E110019]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13361452">13361452</a> 226 gpM->gpL->JAB+NlpC*->Ub->?<-Superoxide_dismutase->host_specificity_J-> Escherichia coli O157:H7 proteobacteria>gammaproteobacteria putative tail assembly protein [Escherichia coli O157:H7 str. Sakai]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13360578">13360578</a> 226 gpM->gpL->JAB+NlpC*->Ub->?<-Superoxide_dismutase->host_specificity_J-> Escherichia coli O157:H7 proteobacteria>gammaproteobacteria putative tail assembly protein [Escherichia coli O157:H7 str. Sakai]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84327632">84327632</a> 60 gpH->gpL->JAB+NlpC*->Ub solo->gpJ->Lysozyme-> Pseudomonas aeruginosa 2192 fragment proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Pseudomonas aeruginosa 2192]
Operons with no gpJ in vicinity (Ub gis)
Gis are of the JAB or NlpC protein- Marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13361023">13361023</a> 94 gpM->gpL->JAB+NlpC*->Ub->YjbI->?-> Escherichia coli O157:H7 proteobacteria>gammaproteobacteria putative tail assembly protein [Escherichia coli O157:H7 str. Sakai]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82533244">82533244</a> 194 gpL->JAB+NlpC*->Ub solo (perhaps incomplete assembly)-> Burkholderia pseudomallei 1106b proteobacteria>betaproteobacteria hypothetical protein Bpse110_02005448 [Burkholderia pseudomallei 1106b]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=62179570">62179570</a> 221 gpL->JAB+NlpC*->Ub->?->lambda p27(distinct tail fiber)-> Salmonella enterica subsp. enterica serovar Choleraesuis str. proteobacteria>gammaproteobacteria Gifsy-2 prophage probable tail assembly protein [Salmonella enterica subsp. enterica serovar Choleraesuis str. SC-B67]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75207839">75207839</a> 180 NlpC*(fragmented?)->Ub-> Escherichia coli B171 proteobacteria>gammaproteobacteria COG4723: Phage-related protein, tail component [Escherichia coli B171]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17428712">17428712</a> 200 gpL->JAB+NlpC*->Ub-> Ralstonia solanacearum proteobacteria>betaproteobacteria probable phage hk022 gp20-related protein [Ralstonia solanacearum]
Operons without JABs in the operon
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
gis are of the Ub protein-marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82741527">82741527</a> 216 BroN->Ub->gpJ-> Shewanella sp. W3-18-1; proteobacteria>gammaproteobacteria prophage LambdaSo, tail assembly protein I [Shewanella sp. W3-18-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82743846">82743846</a> 238 Ub solo Shewanella sp. W3-18-1; proteobacteria>gammaproteobacteria prophage LambdaSo, tail assembly protein I [Shewanella sp. W3-18-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15980136">15980136</a> 206 Bro-N->ribbon->?->Ub->?->host_specificty_J-> Yersinia pestis CO92; proteobacteria>gammaproteobacteria putative phage tail assembly protein [Yersinia pestis CO92]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71558268">71558268</a> 152 HTH->Ub->gpJ(N)-> Pseudomonas syringae pv. phaseolicola 1448A; proteobacteria>gammaproteobacteria prophage PSPPH03, putative tail assembly protein I [Pseudomonas syringae pv. phaseolicola 1448A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84780140">84780140</a> 198 HNH->Ub->gpJ->lysozyme-> Sodalis glossinidius str. 'morsitans'; proteobacteria>gammaproteobacteria putative phage tail assembly protein [Sodalis glossinidius str. 'morsitans']
B. Note the Domain_Z protein.. (Domain Z: an all beta domain)
(Gis are of the Ub+gpJ protein- marked with an asterisk)
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=31788497">31788497</a> 1574 Domain_Z->NlpC->Ub+gpJ*(N+FN3+C) [1-173 Ubl+ 173-673 (N)]-> Xanthomonas campestris phage Xp10 22R [Xanthomonas oryzae bacteriophage Xp10]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84570663">84570663</a> 1571 Domain_Z->NlpC->Ub+gpJ*(N+FN3+C)-> Xanthomonas oryzae phage OP1 putative tail component protein [Xanthomonas oryzae phage OP1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23013869">23013869</a> 775 NlpC solo->Ub+gpJ*(N)->P5->?Y-> (Note no JAB) Magnetospirillum magnetotacticum MS-1 proteobacteria>alphaproteobacteria COG4733: Phage-related protein, tail component [Magnetospirillum magnetotacticum MS-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85716602">85716602</a> 1267 NlpC->Ub+gpJ*(N+FN3+distinct_C)-> (Note no JAB)********* Nitrobacter sp. Nb-311A proteobacteria>alphaproteobacteria tail fiber protein, putative [Nitrobacter sp. Nb-311A]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23016384">23016384</a> 508 Domain_Z->NlpC solo->Ub+gpJ*(N)->gpJ(fragment of C)-> Magnetospirillum magnetotacticum MS-1 proteobacteria>alphaproteobacteria COG0001: Glutamate-1-semialdehyde aminotransferase [Magnetospirillum magnetotacticum MS-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82944335">82944335</a> 775 Domain_Z->NlpC->Ub+gpJ*(N)->P5->?Y-> Magnetospirillum magneticum AMB-1 proteobacteria>alphaproteobacteria Phage-related protein [Magnetospirillum magneticum AMB-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82945132">82945132</a> 775 Domain_Z->NlpC->Ub+gpJ*(N)->P5->?Y-> Magnetospirillum magneticum AMB-1 proteobacteria>alphaproteobacteria Phage-related protein [Magnetospirillum magneticum AMB-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33568295">33568295</a> 1268 Domain_Z->NlpC->Ub+gpJ*(N+distinct_C)-> Bordetella bronchiseptica RB50 proteobacteria>betaproteobacteria phage-related hypothetical protein [Bordetella bronchiseptica RB50]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33564325">33564325</a> 1318 NlpC_solo->Ub+gpJ* (N+FN3+C)-> Bordetella pertussis Tohama I proteobacteria>betaproteobacteria phage-related conserved hypothetical protein [Bordetella pertussis Tohama I]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67545284">67545284</a> 767 Domain_Z->NlpC solo->Ub+gpJ*(N)->(Note no JAB)********* Burkholderia vietnamiensis G4 proteobacteria>betaproteobacteria phage-related conserved hypothetical protein [Burkholderia vietnamiensis G4]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68212786">68212786</a> 1171 Domain_Z->NlpC->Ub+gpJ*(N)-> Methylobacillus flagellatus KT proteobacteria>betaproteobacteria similar to Phage-related protein tail component [Methylobacillus flagellatus KT]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33576899">33576899</a> 1318 Domain_Z->NlpC->Ub+gpJ*(N+FN3+distinct C)-> Bordetella bronchiseptica RB50 proteobacteria>betaproteobacteria phage-related conserved hypothetical protein [Bordetella bronchiseptica RB50]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46449977">46449977</a> 1346 NlpC_solo->Ub+gpJ*(N+FN3+distinct_C)-> Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough proteobacteria>deltaproteobacteria tail fiber protein, putative [Desulfovibrio vulgaris subsp. vulgaris str. Hildenborough]
Versions of above without NLpC or JAB
Gis are of the Ub+gpJ protein-marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23015894">23015894</a> 766 ?H->Ub+gpJ(N)*->Lysozyme->?Y-> Magnetospirillum magnetotacticum MS-1 proteobacteria>alphaproteobacteria COG4733: Phage-related protein, tail component [Magnetospirillum magnetotacticum MS-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78033450">78033450</a> 766 Domain_Z <-PIN<-YoeB->Ub+gpJN)*->X->?Y-> (note toxin-antitoxin insert) Magnetospirillum gryphiswaldense proteobacteria>alphaproteobacteria phage-related protein [Magnetospirillum gryphiswaldense]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71548099">71548099</a> 1644 Ub+gpJ*(distinctN+gp44+distinct_C) (NlpC in genome but not in vicinity) Syntrophobacter fumaroxidans MPOB proteobacteria>deltaproteobacteria similar to Phage-related protein tail component [Syntrophobacter fumaroxidans MPOB]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46916380">46916380</a> 1294 Ub+gpJ(N)* Photobacterium profundum SS9 proteobacteria>gammaproteobacteria hypothetical protein [Photobacterium profundum SS9]
Domain_Z alignment:
FINAL -HHHHHHH-------EEEEEEEEE------------EEEEEE---EEEEE------------EEEEEEEEEEE-------------EEEEE----HHHHHHHHHH-------EEEEEEEEEE-------------EEEE---EEE--EEEEEEEEE-HHHH--------------------
ALIGN ----HHHH-------EEEEHEEE-------------EEEEEE----HEHHHH----------EEEEEE-----------------EEEEEEE-----HHHHHHHHH------HEEEEEEEE--------------EEEEE---------EEEHHHHHHH----------------------
HMM -HHHHHHH-------EEEEEEEEE-----E------EEEEEE--EEEEEE--H---------EEEEEE--EEEE-----------EEEEEEE---HHHHHHHHHH-------EEEEEEEEEE-----------E-EEEE---EEE--EEEEEEEEE-HHHHHH-H---EEEE---------
FREQ -HHHHHHHH------EEEEEEEE-------------EEEEE-----HHHHHHH---------EEEEEEE-----------------EEEEE-----HHHHHHHHH-------EEEEEEEEE---------------EE----------HHHHHHHHHH-----------------------
PSSM -HHHHHH------------EEEE-------------EEEEE----EEEEE------------EEEEEE-EEEE-------------EEEEE----HHHHHHHHH--------EEEEEEEEE------------E-EEEE---EE----EEEEEEEE-------------------------
mgI418_Mgry_78033453 SQALKEAFASAPAGTVILDTLEIWHPTFDE------PIRVVRDHADLTARLEAGAPRDG-GKRVTFAALAFEFSPPPVDT-APVPEITVTLDNVGSDITDALEGAAV-SQQVIEITWRPYLSTDLNGPHMDPPI-TMTLTDVEAD--TMRVTGRARMLDAGNK-SFPSITYTARRFPGLAR
BB3488_Bbro_33576901 EQALKEAYASAPQDRVVFDTLELRHPAFVDPHGEPTAVRVVLGYEDIRARLETEAPLDG-GQDVMFQAGAFRFRLPGFEE-GQVPSLLIAIDGASEQIVDHVEAAVQ-SRFPIYVTYRPYLSTDLSMPQMNPPI-TMELNKVTVT--GSSVSGTATLSDVHNW-AFPHERYVRERFPGLFR
BP3364_Bper_33564327 EKALKEAYASAPQDRVVFDTLELRHPAFVDEHGERTAVRVVLGYEDIYARLEAEAPLDG-GKEVLFQAGAFRLRLPGFEE-GQVPSLLITIDGASEKIVDHVEAAVQ-SRYPIYATYRPYVSTDLSRPQMNPPI-TMELNKVTVT--GASVSGTATLADVHNW-AFPHQRYMRERFPGLFR
Bcep1808DRAFT_4080_Bvie_67545282 SEAIKEAYASAPSQQIILHTLELRHPAFVDEDGQQVAIRVVRDTGDLWARLESQAPLQA-GERVQFVAMGFELDLPPVDT-MPVPEITVTLDNVSREIVRHLDAAAE-SQSVIEVTYRPYLSTDLEGPQMDPPI-HLVLTEVEAD--IFRVTGRARMLDVGNK-AFPGVSYTAKTFPGLTR
amb1190_Mmag_82945130 SQALKEAFASAPAGTVILDTLEIWHPTFIE------PIRVVRDHADLTARLEAGAPRDG-GKRVTFAALAFEFSPPPVDT-APVPEITVTLDNVGSDITDALEGAAI-SQQVIEITWRPYLSTDLNGPHMDPPI-TMTLTEVEAD--TMRVTGRARMLDAGNK-SFPSITYTARRFPGLAR
amb0393_Mmag_82944333 SQALKEAFASAPAGTVVLDTLEIWHPTFDE------PIRVVRDHADLTARLEAGAPRDG-GKRVTFAALAFEFSPPPVDT-APVPEITVTLDNVGSDITDALEGAAI-SQQVIEITWRPYLSTDLNGPHMDPPI-TMTLTEVEAD--TMRVTGRARMLDAGNK-SFPSITYTARRFPGLAR
Magn03007629_Mmag_23013169 SQALKEAFASAPAGTVILDTLEIWHPTFDE------PIRVVRDHADLTARLETGAPRDG-GKRVTFAALAFEFSPPPVDT-APVPEITVTLDNVGSDITDALEGAAI-SQQVIEITWRPYLSTDLNGPHMDPPI-TMALTEVEAD--TMRVTGRARMLDAGNK-SFPSITYTARRFPGLAR
Magn03010833_Mmag_46200892 SQALKEAFASAPAGTVVLDTLEIWHPSFTT------PIRVVRDHADLTARLEAGAPRDG-GKRVTFAALAFEFSPPPVDT-APVPEITVTLDNVGSDITDALEGAAI-SQQVIEITWRPYLSTDLNGPHMDPPI-TMALTEVEAD--TMRVTGRARMLDAGNK-SFPSITYTARRFPGLAR
Magn03010336_Mmag_46201139 ---MREAFAAAPTNTVILHTLEIWHPTFSE------PIRVVRDHADLTARLEAGAPRGG-GQKVTFIALAFDLDLPPVDT-APVPEITVTMDNVGQEIVDALEAAAI-SQDKIDIIYRPFLSTDLEGPHMDPPI-TLTLAEVEAD--TLRVTGRARMLDVGNK-AFPSITYTAKRFPGLAR
MflaDRAFT_2307_Mfla_68212788 EEAIKEAYASNPVGEVELNTLEFRHPNFVDQNGDPSAIRVVLDNVDHYLTLEDDAPLNP-GESVLFVRMAFELTKPEVDS-VAGPAMDITLNNITPEIETQIRAATR-SPYPVIGMYRLYLLSDKTQPQNNPPM-EFQLDNVNAD--DESITARATFGNEAQR-PFPNENYTATRFPGLSR
PputDRAFT_2895_Pput_82737129 MTALEVVYAS--GGDDIVPTLEISCPAWDK------TLYLVQDFEDFRATTEA-------GKTVTFLASAIDVALPAKDN-SGAQTLTFVIDNVTGEAQQLIDASLE-AEARVTIVYREYLYSIPGEPA-DRPY-RMTSFGGTMD--GPTIQIEAGYYDLINM-MWNRFRYTTDFAPGLTY
DaceDRAFT_2556_Dace_68177301 TTAYKEAIAYANPETTIWEAIRITHSSWLE------SILLVNSYEVFTANL---------G---SFIPVQWSMKLPEVEA-ETRGELTLKIDLLPLSIKRTLFSGAS-KTDAMKL--YYYEYTDTTDPAGQLPA-ALEISKVEMDEDNQVTTIKALYADLVNI-VFPRRRMTTTLIPGGLV
BB1708_Ppro_46916381 KNARINLNATT-ADEPFLILVEIHHQSFSE------PARIVADTQDITHA----------G--YRYTALPIDVTLPDEGE-GKLPQAKLIIDNVGRVLTDEIDGTRG-FEGGTCVI-MQVMRSNPS--HVEWGI-ELDVLDVSID--QLKISATLGYEDMLNK-PAVTMRFTPERSPGLF-
BB1708_Bbro_33568293 TQAKRNVNATS-ADEPLLELIEITHPDLAV------PARFVNDTQDIQVE----------G--HAFLACRFDLSIPDDQA-EQVPGARLEVDNIGRELTQWLEYSQG-GKGAKC---RLILLLRSNPSNIELDM-TMDLTGLEIT--NFRVSGDLGFKNTLMQ-SGVAMRFDPLTAPGVF-
NB311A_12117_Nsp._85716598 SLNFRQELFGQESGEVPILLVTITHPELPE------PIYLSTDPTERFSTDPLMYRTRS-R-GIDFLYAGIDVTLPDEQD-KSPPASKLTIANVTRGLIPLARSVS--TPPAVKIEV--VLASDPDTVE-MTWP-AMDMTNLTYD--ASFLTFDLTIDALVTE-PYPSGTFSPAYFPGLFY
RB2654_16431_Rbac_84684053 -MPWLDAINDAETAEVVLTLVTLDHADWAA------PVRLVNDVADFEHD----------G--ETYTAAGFQVAMPDQAE-DRNAAMRWTLNDVDHDVAVLLRTTN--DVIDIEVSY--VLASDPDTVQ-AGPF-EAEIRQADLR--YGSVSGALVVYPVMEEVANASFRFSTGDFPGLI-
_BPMB78_4455819 EAAYRRKLASNPDGEMDFITLEIYHPLLSK------RWLLVRGVKDLTATLET-------GEVVTFEGTPMEAKNAANNN-DMDQTASFSLPDVLNILDEEMDRIPYDNKELPKFIFRRYVSTDLTYP-CDGPV-VYELQTLTQE----KGVFTAETGTPMLNQRATGILMTPEEIPLLRG
_BPKS7_62327363 EAAYRRKLASNPDGEMDFITLEIYHPLLSK------RWLLVRGADDLTATLET-------GEVVTFEGTPMEAKNAANNN-DMDQTASFSLPDVLNILDEEMDRIPYDNKELPKFIFRRYVSTDLTYP-CDGPV-VYELQTLTQE----KGVFTAETGTPMLNQRATGILMTPEEIPLLRG
mgI418_Mgry_78033453 SQALKEAFASAPAGTVILDTLEIWHPTFDE------PIRVVRDHADLTARLEAGAPRDG-GKRVTFAALAFEFSPPPVDT-APVPEITVTLDNVGSDITDALEGAAV-SQQVIEITWRPYLSTDLNGPHMDPPI-TMTLTDVEAD--TMRVTGRARMLDAGNK-SFPSITYTARRFPGLAR
D3p22_BPD3_9635614 ATALERFYAS-DGPDLPIATIEITRPSRPH------PIFICQGFKDLTCMTED-------GRLLTFIAGAIDVSIPKRDN-SGNQNVGFAIDNVTGFAQQYIAEAID-AGEPVTLVLRIYLESDLTAPA-ERPY-RMRVKGADFE--SLTVQVEAGYYDLINT-AALRHIYNVSEFPGLKY
YintA_01000766_Yint_77979284 MTILNRLYASG-GSEVIIQTLEIAVGDK--------TYWLTKGWEDITAVLES-------GESATFTACGIDIALPARNS-DGTQDLQFAISNIDGIVSTAIRGALD-YLSTALLTYRYYVSTDLSAPA-AKPY-TLIVKSGYWT--ATEVQITAGYMNVLDT-AWPRYRYTLPNYPGLRY
PP1578_Pput_26988310 MSILKRLYASS-GPEIIHEVLEITDGIT--------TYWMTKGWDELTITLET-------GQVVVCTPCGMDLALPARND-DGTQDLTFALSNIDGIASGFVRAALR-DGRRMSLVYRAYTSDDLGAPA-HAPH-RFKIKGGSVT--AAQVSVTAGYFDLLDT-RWPRNTYNLNEFPGLRY
PputDRAFT_4718_Pput_82734887 MSLIEECYASGRGE--LVDTIEARKEGGTV------SHLYCSGWEDRVCTTED-------GRTLTFVAMAMDLALPKNDN-SAFQNLVLGLDNVTGEVQEVVEEAKA-ADDRFIITFRRYLAEDLTFPQ--ERY-RMTLLSREYE--DDVAKLTAGFFDLLNT-NGLRTVLTTTLAPGLKY
_BPXp10_31788495 SFVSNRQRLTDYSG--ILQVLEISAAYLPD------TLRLVKDVKDWTIN----------G--QDYIGLEFTITLPEDRS-GSNGVLEIKMSNVGRDVTEDLEKRPPDQMMTAVLK----LSDRETPGEFYRII-PMPIDRVSID--AQTVTLTASMDSIMRQ-QACRLRFTPFITPGLF-
_BPOP1_84570661 SFVSNRQRLTDYSG--ILQVLEISAAYLPD------TLRLVKDVKDWTIN----------G--QDYIGLEFTITLPEDRS-GSNGVLEIKMSNVGRDVTEDLEKRPPDQMMTAVLK----LSDRQTPGEFYRII-PMPIDRVSID--AQTVTLTASMDSIMRQ-QACRLRFTPFITPGLF-
_BPXp15_66392125 MSTFKERKQRVRDPSGLLILMELSANSFQE------TLRIANDTDNWTSN----------G--LLYYGFPFKFTGPDDSD-GSNASSKIVIDNTGRGMSDDLESLQPNEIILVKL-----MITDFYNPSA-IIR-TLYLPMMGATIRVTQMEGRCGV-DYIMRQRSVQLASSPYTAPGSY-
SfumDRAFT_2313_Sfum_71544667 VLVREKNKLATPDPWIVLLDIELDATH---------KLYFCSNNQNVTWS----------G--RVYTAFPFLLEPTEENSKGEIPSVSLKVANVTQVIHAYLEQLDGAVGATVTI--RVVNAGYLSEDASELDM-TFTVVSTSAD--AEWIVFTLGAPNPLRR-RFPPFRFIAKHCHWEFK
DVU_2155_Dvul_46449979 -----------------------MHPSLAA------PLRISSDPTQRTVVTDEEVVYGTVSRAETFVFVPFSISLPNDSA-EETPQTSITIDNVGREMVPTIRALTSAPEITLEMV----MASTPDVVEAVFP--GFALSSVTYD--AMSISGTLSVTEFTTE-PCPAGTFNPAEFPGMF-
consensus/100% ....................hph...............bhs.s................. ....h....h.h..s............h.hs.....h...h..................................h.......p.........h...s......................
consensus/80% ..shcc.bhss.ssp.hh.slEl.ps.h........shblsps..Dhphp......... G....a.u.shchp.P..ps.s.s..hph.lssls..l.p.lc.........h.l....hl.sc.s.s..p.sh..h.l.phphs..s.pls.ph.h.sh.pp..hspbpass..hPGL..
Species abbreviations: BPD3 : Pseudomonas phage D3; BPKS7 : Bacteriophage KS7; BPMB78 : Bacteriophage MB78; BPOP1 : Xanthomonas oryzae phage OP1; BPXp10 : Xanthomonas campestris phage Xp10; BPXp15 : Xanthomonas campestris pv. pelargonii phage Xp15; Bbro : Bordetella bronchiseptica; Bper : Bordetella pertussis; Bvie : Burkholderia vietnamiensis; Dace : Desulfuromonas acetoxidans; Dvul : Desulfovibrio vulgaris; Mfla : Methylobacillus flagellatus; Mgry : Magnetospirillum gryphiswaldense; Mmag : Magnetospirillum magneticum; Mmag : Magnetospirillum magnetotacticum; Nsp. : Nitrobacter sp.; Ppro : Photobacterium profundum; Pput : Pseudomonas putida; Rbac : Rhodobacterales bacterium; Sfum : Syntrophobacter fumaroxidans; Yint : Yersinia intermedia
-------------------------------------------------------------------------------------------------------------
6. OPERONS WITH E2-like domains
6a. Uncharacterized operon with a triple module protein containing an E2-like, E1-like and JAB domains (Metallo beta lactamase neighbor)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Gis are of the E2+E1 containing protein- marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21110358">21110358</a> 750 MBL->E2+E1+JAB*-> Xanthomonas axonopodis pv. citri str. 306 proteobacteria>gammaproteobacteria conserved hypothetical protein [Xanthomonas axonopodis pv. citri str. 306]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48864353">48864353</a> 735 MBL->E2+E1+JAB*-> Microbulbifer degradans 2-40, 48864354 proteobacteria>gammaproteobacteria COG0476: Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Microbulbifer degradans 2-40]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=58038271">58038271</a> 741 MBL->E2+E1+JAB*-> Gluconobacter oxydans 621H, proteobacteria>alphaproteobacteria hypothetical protein GOX2518 [Gluconobacter oxydans 621H]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68246513">68246513</a> 495 MBL->E2+E*1-> (E2+E1) only (JAB perhaps displaced by transposon); Magnetococcus sp. MC-1 proteobacteria UBA/THIF-type NAD/FAD binding fold [Magnetococcus sp. MC-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68559822">68559822</a> 751 MBL->E2+E1+JAB*->MBL-> Ralstonia metallidurans CH34 proteobacteria>betaproteobacteria UBA/THIF-type NAD/FAD binding fold [Ralstonia metallidurans CH34]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74421923">74421923</a> 223 (E2 only)\ MBL->E2->E1->JAB-> Nitrobacter winogradskyi Nb-255 proteobacteria>alphaproteobacteria hypothetical protein Nwi_2872 [Nitrobacter winogradskyi Nb-255]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74421925">74421925</a> 352 (JAB only)| Nitrobacter winogradskyi Nb-255 proteobacteria>alphaproteobacteria hypothetical protein Nwi_2874 [Nitrobacter winogradskyi Nb-255]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74421924">74421924</a> 235 / Nitrobacter winogradskyi Nb-255 ThiF solo, 74421925: JAB, proteobacteria>alphaproteobacteria hypothetical protein Nwi_2873 [Nitrobacter winogradskyi Nb-255]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77387013">77387013</a> 601 E2+E1->JAB-> Rhodobacter sphaeroides 2.4.1 (E2+E1, JAB neighbor) proteobacteria>alphaproteobacteria ThiF family protein [Rhodobacter sphaeroides 2.4.1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77955313">77955313</a> 851 MBL->E2+E1+JAB*-> Marinobacter aquaeolei VT8 proteobacteria>gammaproteobacteria conserved hypothetical protein [Marinobacter aquaeolei VT8]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77955723">77955723</a> 725 MBL->E2+E1+JAB*-> Marinobacter aquaeolei VT8 proteobacteria>gammaproteobacteria hypothetical protein MaquDRAFT_3270 [Marinobacter aquaeolei VT8]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84502025">84502025</a> 761 MBL->E2+E1+JAB*-> Oceanicola batsensis HTCC2597 proteobacteria>alphaproteobacteria hypothetical protein OB2597_18097 [Oceanicola batsensis HTCC2597]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84717800">84717800</a> 751 MBL->E2+E1+JAB*-> Polaromonas naphthalenivorans CJ2 proteobacteria>betaproteobacteria conserved hypothetical protein [Polaromonas naphthalenivorans CJ2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85859492">85859492</a> 1158 MBL->E2+E1+JAB+Calcineurin*-> (C-terminal calcineurin) Syntrophus aciditrophicus SB proteobacteria>deltaproteobacteria hesA/moeB/thiF type protein [Syntrophus aciditrophicus SB]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86559649">86559649</a> 760 MBL->E2+E1+JAB*-> Clostridium perfringens, l firmicutes ThiF [Clostridium perfringens]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88705878">88705878</a> 751 E2+E1+JAB* gamma proteobacterium KT 71 proteobacteria>gammaproteobacteria conserved hypothetical protein [gamma proteobacterium KT 71]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=90019857">90019857</a> 735 E2+E1+JAB* Saccharophagus degradans 2-40 proteobacteria>gammaproteobacteria hypothetical protein Sde_0208 [Saccharophagus degradans 2-40]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=90419011">90419011</a> 746 MBL->E2+E1+JAB* Aurantimonas sp. SI85-9A1 proteobacteria>alphaproteobacteria conserved hypothetical protein [Aurantimonas sp. SI85-9A1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86475921">86475921</a> 760 MBL->E2+E1+JAB* Clostridium perfringens firmicutes ThiF [Clostridium perfringens]
---------------------------- ------------------------
6b. Uncharacterized operon coding a multidomain protein with E2 and E1 domains (This version of the JAB is closer to the E2+E1+JAB type)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Gis are of the E2+E1 protein- marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71038912">71038912</a> 589 Patatin->nuct_transferase->E2+E1*->JAB-> Psychrobacter arcticus 273-4 proteobacteria>gammaproteobacteria (in the vicinity of transposase)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9654584">9654584</a> 584 Patatin->nuct_transferase->E2+E1*->JAB-> Vibrio cholerae O1 biovar eltor str. N16961 proteobacteria>gammaproteobacteria (transposase in vicinity)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=37927532">37927532</a> 538 Patatin->nuct_transferase->E2+E1*->JAB-> Escherichia coli proteobacteria>gammaproteobacteria (Integrative conjugative element)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84786718">84786718</a> 558 nuct_transferase->E2+E1*->JAB-> Erythrobacter litoralis HTCC2594 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85706659">85706659</a> 550 nuct_transferase->E2+E1*-> Roseovarius sp. 217 proteobacteria>alphaproteobacteria (in the vicinty of transposase)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=66965723">66965723</a> 592 E2+E1*->JAB-> Arthrobacter sp. FB24 actinobacteria UBA/E1-type NAD/FAD binding fold [Arthrobacter sp. FB24]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86357617">86357617</a> 562 Nuct_transferase->E2+E1*->JAB-> Rhizobium etli CFN 42 proteobacteria>alphaproteobacteria hypothetical protein RHE_CH01997 [Rhizobium etli CFN 42]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84499281">84499281</a> 557 Nuct_transferase->E2+E1*->JAB-> Oceanicola batsensis HTCC2597 proteobacteria>alphaproteobacteria hypothetical protein OB2597_05120 [Oceanicola batsensis HTCC2597]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86475968">86475968</a> 567 Nuct_transferase->E2+E1*->JAB-> Clostridium perfringens firmicutes conserved hypothetical protein [Clostridium perfringens]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88937743">88937743</a> 576 Nuct_transferase->E2+E1*->JAB-> Geobacter uraniumreducens Rf4 proteobacteria>deltaproteobacteria similar to Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 1 [Geobacter uraniumreducens Rf4]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=22726448">22726448</a> 572 Nuct_transferase->E2+E1*-> Ruegeria sp. PR1b proteobacteria>alphaproteobacteria RC170 [Ruegeria sp. PR1b]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78684828">78684828</a> 575 Nuct_transferase->E2+E1*-> Shewanella sp. ANA-3 proteobacteria>gammaproteobacteria similar to Dinucleotide-utilizing enzymes involved in molybdopterin and thiamine biosynthesis family 2 [Shewanella sp. ANA-3]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84701417">84701417</a> 589 nuct_transferase->E2+E1*-> Parvularcula bermudensis HTCC2503 proteobacteria>alphaproteobacteria hypothetical protein PB2503_00627 [Parvularcula bermudensis HTCC2503]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2496738">2496738</a> 583 E2+E1*->JAB-> Rhizobium sp. NGR234 proteobacteria>alphaproteobacteria Y4QC_RHISN Hypothetical 63.6 kDa protein y4qC
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2496721">2496721</a> 593 E2+E1*-> Rhizobium sp. NGR234 proteobacteria>alphaproteobacteria Y4OA_RHISN Hypothetical 65.2 kDa protein y4oA
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=92915671">92915671</a> 591 Patatin->nuct_transferase->E1+E2* Mycobacterium sp. KMS actinobacteria
----------------------------------------------------
6c. Uncharacterized operon coding a distinctive multidomain protein with E2 and E1 related domains
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Gis are of the E2+E1 protein- marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2496664">2496664</a> 519 ?->Metal?->JAB->E2+E1-> Rhizobium sp. NGR234 proteobacteria>alphaproteobacteria Y4JF_RHISN Hypothetical 55.4 kDa protein y4jF
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=14025925">14025925</a> 519 ?->Metal?->JAB->E2+E1-> Mesorhizobium loti MAFF303099 proteobacteria>alphaproteobacteria mll6192 [Mesorhizobium loti MAFF303099]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=20803932">20803932</a> 485 ?->Metal?->JAB->E2+E1-> ; note part of symbiosis island (Integrated element) Mesorhizobium loti proteobacteria>alphaproteobacteria HYPOTHETICAL CONSERVED TRANSMEMBRANE PROTEIN [Mesorhizobium loti]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86359719">86359719</a> 514 ?->Metal?->JAB->E2+E1-> (plasmid p42a) Rhizobium etli CFN 42 proteobacteria>alphaproteobacteria hypothetical protein RHE_PA00014 [Rhizobium etli CFN 42]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=23011188">23011188</a> 110 Metal?->JAB->N+E1-> Magnetospirillum magnetotacticum MS-1 proteobacteria>alphaproteobacteria hypothetical protein Magn03005843 [Magnetospirillum magnetotacticum MS-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77690158">77690158</a> 455 Ubl+Ubl+Ubl->Metal?->JAB->N+E1-> Rhodopseudomonas palustris BisB5 proteobacteria>alphaproteobacteria UBA/THIF-type NAD/FAD binding fold [Rhodopseudomonas palustris BisB5]
-Alignment of potential metal binding domain
FINAL ---HHHHHHHHHHHHH-----HHHHHHHHHHHH--EEEE----EEEEEE----------EEEEEEEE---------EEE----------------------------------HHHHHHHHHHHHH--------EEEEE---EE---------------------HHHHHHHHHHHH---HHHHHHHHHHHHH---E--------
ALIGN ---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----EEEEE-----------EEEEE-----------EEEE-------HHHHHHHHHHH------------HHHHHHHHHHHHHHHH--------EEEEE----------------EEEE-------HHHHHHHHHHH----HHHHHHHHHHHH------------
HMM ----HHHHHHHHHHHHHHHHH-----HHHHHHH-HHEHH---EEEEEEE-------HHHHHEEEEEE---------EEEE--------HHHHHHHHH-----------HHHHEEEHHHHHHHHHHE--------EEEEEE--EEEE--------HHHHEE-----HHHHHHHHHHHHHHHHHHHHHHHHEEE----E--------
FREQ --------HHHHHH-------HHHHHHHHHHHH-EEEEE-----EEEEE-----------EEEEEE----------EE----------------------------------HHHHHHHHHHHHHHH--------EEEE----------------EEE-------HHHHHHHHHHH-----HHHHHHHHHHHH------------
PSSM ---HHHHHHHHHHHHHHHHHHH------HHHH---EEEE----EEEEE---------EEEEEEEEEE---------EEEE-------------------------------------------------------EEE---------------------------HHHHHHHHHHHH---HHHHHEEEEEEEE---E--------
RHE_PA00016_Retl_86359721 MTESQFVDEAVSRRKFEREVAQYRELEDSYRRRGWFLLDATFPTVLVLFVALKVTPRSLVCAVRLDFTNYDLEPPSVTFVDPSTGTALPAKSLGFKMLRLNGLKEASPETVTTLAQQQRLSVQELLQAHSPDETPFLCLPGVREYHDHPAHTGDLWLLHRRSGEGSLHFILEQIWASGINPIRMLEYQIQMNFSGFQMDAAALPR
NGR234_174_Rsp._2182463 MPELQTVDPKVSRAKFDREISRFRAYADAYRMQGCFLIEESFPSAFFIFASPKVKPRVIGAAIEIDFTNYDLRPLSVVFVDPFTRQPIARKDLPLNMLRRPQLPGTPTEMISNLIQQNAVSLTDFIQANSLEDQPFLCMAGVREYHDNPAHSGDPWLLHRGSGEGCLAFILDKIIKYGTGPAEQLQIHLQVALGGLLVPPQAIPE
msi103_Mlot_20803930 MPEIQTVDPAVSRAKFDRQIGWFQTQAGAYRAQGCFLIEARFPTAFFIFAPPKIRPQIIGAAVEIDFSNYDLRPPSVVFVDPFTRRPVARKDLLLSMLRRPHLPGTPPGMISVLMQQKALSLSDFLQANSAEHTPFLCMAGVREYHDNPAHSGDSWLLHRGSGEGCLAFILDKIIKYGTGPVEQIQYQFQISVGAMVVPPSAIPE
mll8758_Mlot_14025927 MPEIQTVDPAVSRAKFDRQIGWFQTQAGAYRAQGCFLIEARFPTAFFIFAPPKIRPQIIGAAVEIDFSNYDLRPPSVVFVDPFTRRPVARKDLLLSMLRRPHLPGTPPDMISVLMQQKALSLADFLQANSAEHTPFLCMAGVREYHDNPAHSGDSWLLHRGSGEGCLAFILDKIIKYGTGPVEQIQYQFQISVGAMVVPPSAIPE
RPDDRAFT_1997_Rpal_77690160 ------MLEALSKATFDRDIGRIDPRS--VRMYDWAIVQANYPVFDVIFNHAQVAP----LRLRLVCDDWDEIPPSIELLNK----------------EGQPLATAPPNVGNVFNG----------STHPNTGRPFVCMRGAREYHTHGSHTSDLWDNYRGQSGMDLGGIVVQLWRAWKRSVG----------------------
Magn03005843_Mmag_23011188 -------------------------------------------MLDVILGHPTAAP----LRLRFTCVDWDDLPPSVELLDAA----------------GQHLSQAPPGAGGIFHP----------SPHPVTGRMFVCMRGTREYHTHFSHVGERWDGYRGQSGLDLLGILDQIWRCWKRAVG----------------------
consensus/100% ......h...lS+.pF-Rplubhp.b...hR.bshhllp.paPsh.hlFs..bl.P....h.lclshssaDbbP.Sl.hls.................c...L..sss...ssh............psps.p.pPFlCh.GsREYHspsuHouD.W..aR.pu..sL..Il.blh.....sh.......................
-Alignment of domain marked with a ?:
FINAL -HHHHHHHHEEE-HHHHH-----------E------EEEEEEE-------------EEEE-E----EEE----EE--------E-------------EEEEEEEE----EEEE------HHHHHHHHHH--------------------EEEEE-------E--HHHHHHHHH------
mll6195_Mlot_14025928 MYRQYFRIALIDYSCEAQFQPVYLPLKSRIKEGSTDSVAYPLSFAYSRPVAPSGRLKIAG-LTSRWAQAPGAGWQATGVGQMSKDSGKGD-HGG---KIEITVVVNGQPTQVEANPNQPLHVVRAKALENTQNVAQPAENWEFKDEAGNLLDVDKKVGDFGFANIVTLFLSLKAGVAGA
msi102_Mlot_20803929 MYRQYFRIALIDYSCEAQFQPVYLPLKSRIKEGSTDSVAYPLSFAYSRPVAPSGRLKIAG-LTSGWAQAPGAGWQATGVGQMSKDSGKGD-HGGGPGKIEITVVVNGQPTQVEANPNQPLHVVRAKALENTQNVAQPAENWEFKDEAGNLLDVDKKVGDFGFANIVTLFLSLKAGVAGA
y4jI_Rsp._2496667 -------------------------------------------------MAPSGRSKTASPLTGRSAVVPWGRLASHWSMTMSKEAGKGDNHGGGPGKIEIIVVVNGQPTQVEANPNQPLHVVRTKALENTQNVAQPAENWEFKDEAGTLLDADKKIGDFGFANTGTLFLSLKAGVAGA
RHE_PA00017_Retl_86359722 --------------------------------------------------------------TLDTNRSIDGVSMAKSPNTAPEAAGK---KTGSKNKITLTIVVNGEPVSVEANVNAPLHTAIAKALEESGNVGQPPENWELKDENGTVFDASKKIEDLGITAGQKLFLSLKAGAAG-
Alignment of N-terminal domain fused to E1
FINAL ---HHHHHHHHHHHHH----HHHHHHH--EEEEE------HHHHHHHHHHH---EEEEEE--------EEEEE--------HHHHHHH------EEEEE------------HHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHH-----
ALIGN ---HHHHHHHHHHHHH----HHHHHHHHHEEEEE------HHHHHHHHHHH--EEEEEE---------EEEEE--------HHHHHHHH-----EE--------------HHHHHHHHHHHHHHHHHHHH----------HHHHHH------
HMM ---HHHHHHHHHHHH-----HHHHHHHH-EEEEE------HHHHHHHHHH----EEEEEEE-------EEEEEEE------HHHHHHHHH----EEEEEE----------HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHH-----
FREQ ----HHHHHHHHHHHH---HHHHHHHH--EEEEE-------HEEHHHHHHHH---EEEE-------------------HH-HHHHHH-------EEEE-------------HHHHHHH------HHEE----HHHHHHHHHHHHHHHH----
PSSM ---------EEEEEEE------------EEEEEE--------HHHHHHHHH---EEEEEE--------EEEEE----------HEEH-------EEEEE--------------------HHHHHHHHHHH----HHHHHHHHHH--------
RPDDRAFT_1995_Rpal_77690158 MNKATQQNAMMLASLLGVGEAEAGERLARTVLITAAPGWKSGWAVEVGELIG-RTVQVSHQQEPTDPDLELVIGDVTPRTSARRVYADLGSEGAAASLEPVAKLAG-EPHGLYAAAAACAVSAVVVHAVIDAADLPQARLPMRLDYAQLGVP
Magn03005841_Mmag_46203362 MITPAQENARMLAAILGSDEDDASERLNRAVLVTAPPGGADAAWAAEVAALLARTVGVV-TSPAEEAQLELVIGEAAARTDLPRLHAAIDAGGATVDVRPVGRTGGPPPHPLLAAVAACPAAAATLRMLLDDPALPAVAYPLRLDFDQLGVP
----------------------------------------------------
6d. Uncharacterized operon coding a Ub-like protein, a JAB, an E1-like protein and an E2-like protein
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Gis are of the E2-like protein- marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84717439">84717439</a> 265 Ub->alpha_helical_2->E2*->JAB->E1-> Polaromonas naphthalenivorans CJ2 proteobacteria>betaproteobacteria conserved hypothetical protein [Polaromonas naphthalenivorans CJ2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67910471">67910471</a> 259 Ub->alpha_helical_2->E2*->JAB->E1-> Polaromonas sp. JS666 proteobacteria>betaproteobacteria hypothetical protein BproDRAFT_0623 [Polaromonas sp. JS666]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71847775">71847775</a> 246 Ub->alpha_helical_2->E2*->JAB->E1-> Dechloromonas aromatica RCB proteobacteria>betaproteobacteria conserved hypothetical protein [Dechloromonas aromatica RCB]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67543573">67543573</a> 242 Ub->alpha_helical_2-->E2*->JAB->E1-> Burkholderia vietnamiensis G4 proteobacteria>betaproteobacteria conserved hypothetical protein [Burkholderia vietnamiensis G4]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=38637969">38637969</a> 241 Ub->alpha_helical_2->E2*->JAB->E1-> Cupriavidus necator proteobacteria>betaproteobacteria hypothetical protein PHG308 [Cupriavidus necator]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17428675">17428675</a> 240 Ub->alpha_helical_2->E2*->JAB->E1-> Ralstonia solanacearum proteobacteria>betaproteobacteria conserved hypothetical protein [Ralstonia solanacearum]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56315656">56315656</a> 239 E2*->JAB->E1-> Azoarcus sp. EbN1 proteobacteria>betaproteobacteria conserved hypothetical protein [Azoarcus sp. EbN1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56410324">56410324</a> 239 E2*->JAB->E1-> Ralstonia metallidurans CH34 proteobacteria>betaproteobacteria hypothetical protein [Ralstonia metallidurans CH34]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68559357">68559357</a> 239 E2*->JAB->E1-> Ralstonia metallidurans CH34 proteobacteria>betaproteobacteria conserved hypothetical protein [Ralstonia metallidurans CH34]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=44004435">44004435</a> 214 E2*->alpha_helical_2->JAB->alpha_helical_2->E1-> Bacillus cereus ATCC 10987 firmicutes hypothetical protein BCE_A0096 [Bacillus cereus ATCC 10987]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67908644">67908644</a> 255 E2*->JAB->E1-> Polaromonas sp. JS666 proteobacteria>betaproteobacteria hypothetical protein BproDRAFT_4305 [Polaromonas sp. JS666]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74024822">74024822</a> 241 E2*->JAB->E1-> Rhodoferax ferrireducens DSM 15236 proteobacteria>betaproteobacteria hypothetical protein RferDRAFT_4144 [Rhodoferax ferrireducens DSM 15236]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75705484">75705484</a> 240 E2*->JAB->E1-> Anabaena variabilis ATCC 29413 cyanobacteria conserved hypothetical protein [Anabaena variabilis ATCC 29413]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17134644">17134644</a> 240 E2*->JAB->E1-> Nostoc sp. PCC 7120 cyanobacteria alr7559 [Nostoc sp. PCC 7120]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29339960">29339960</a> 233 Ub->alpha_helical_2->E2*->E1-> Bacteroides thetaiotaomicron VPI-5482 bacteroidetes/chlorobi hypothetical protein BT_2648 [Bacteroides thetaiotaomicron VPI-5482]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71839550">71839550</a> 243 Ub->alpha_helical_2->E2*->E1 ->SFI helicase(note connection to F-box)->JAB-> Pelobacter propionicus DSM 2379 proteobacteria>deltaproteobacteria conserved hypothetical protein [Pelobacter propionicus DSM 2379]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75758403">75758403</a> 241 E2*->alpha_helical_2->E1-> Bacillus thuringiensis serovar israelensis ATCC 35646 firmicutes hypothetical protein RBTH_06715 [Bacillus thuringiensis serovar israelensis ATCC 35646]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75758953">75758953</a> 253 E2*->alpha_helical_2->E1-> Bacillus thuringiensis serovar israelensis ATCC 35646 firmicutes hypothetical protein RBTH_07326 [Bacillus thuringiensis serovar israelensis ATCC 35646]
**: note: all the JABs in these operons have an N-terminal domain, whose alignment is provided below:
Domain found N-terminal to the JABs (JAB-N)
FINAL -HHHHHH---EEEE--------------EEEEEE----EEEE--HHHHHHHH---------E-----HHHHHHH-
ALIGN -----------EE----------------EEEEE---EEEEEE---HHHEEH----------------HH-----
HMM -HHHHH----EEEEE--------H----EEEEEE--HEEEEEHHHHHHHHHHH-------EEEEE--HHHHHH--
FREQ -HHHHHH----EE----------------EEEEHH---EEEEE--HHHHHE----------------HHHHHHH-
PSSM -HHHHHH-----E----------------EEEEE---HEHHH----HHHHHH---------------HHHH----
PproDRAFT_0259_Ppro_71839552 MDAILQEQFPTVMVPRYGD-FVPLAHNGRRFLSASDGLWLEEKNQWLHILWPLALQN--QVAMPYGSLQKKVDFL
RSc1658_Rsol_17428674 ---KLWDSAPTVAVPKFAE-FKQLEDVGHRFLATAEGLFVEVRRPWLHVIQPVAPLNGQTVRPPYGTVKQKVDLA
BproDRAFT_0622_Psp._67910470 LDSIIQGMFPTVIMPREGT-IAPATKNGTRYVVAGDGLWREVVLPWVTVMHKIANS---DFMLPYGAAEEAVVIK
RmetDRAFT_6239_Rmet_68559358 ADMALQQSFPSVMVPRHGA-LPALEQVGERLLIAANGVFLEIVRPWLRVVRRLGEFQH-QTAIPYGDATEVTELR
RMe0063_Rmet_56410325 --MALQQSFPSVMVPRHGA-LPALEQVGERLLIAANGVFLEIVRPWLRVVRRLGEFQH-QTAIPYGDATEVTELR
PHG307_Cnec_38637968 ADAALQQSFPSVMVPRFGA-LAPMERSGERLLIAANGVFLEIVRPWLSVVRHLGAFQH-RTAIPYGEAAETTDLR
Bcep1808DRAFT_6254_Bvie_67543574 LDTVLQQSFPAVMVPSRET-VVPMTRSGERLLIASDGVYLEVLRPWVRVVRRIAQY-AVSIAVPYGKVEETTALL
p1B74_Asp._56315655 RDMALQALTPTVMVPRFGC-FEPLSQPGHRFLVGQNGEWLEVRRAWMYARVQLTQP--SPVVKPYGVVTACLEWL
BproDRAFT_4306_Psp._67908645 MDAIIQSQFPTVLAPRFEA-LSPLETTGDRFILTRHQVLMEVSRPWLHAIQAISAP--FARQTPYGAGPRLGIKL
Daro_2537_Daro_71847774 RDLALQAVCPVIAAPRFGP-LPDM-ANGQRIILAANGVFVQVKLDWLDCIQRLSPA--LPITLPYGGIEERLAFT
RferDRAFT_4145_Rfer_74024823 -DRFLATDCPVITMPHDSEVFEPLKTPGHRLIVAAGGLYKEIRRAWLHAIVHVAR-----AQTPFGELQTTLSM-
PnapDRAFT_0123_Pnap_84717438 LDQITMGVFPLLAASQTGV-LQDPEKHGVRYVAASDGMWRAIDTAWLKA--------------------------
PproDRAFT_0259_Ppro_71839552 MDAILQEQFPTVMVPRYGD-FVPLAHNGRRFLSASDGLWLEEKNQWLHILWPLALQN--QVAMPYGSLQKKVDFL
consensus/100% ....l....Psl.hPp....h..h..sGpRhl.s....h.b....Wh.h...ls.........PaG.......b.
consensus/95% ....l....Psl.hPp....h..h..sGpRhl.s....h.b....Wh.h...ls.........PaG.......b.
consensus/90% ...hlb..hPslhhP+....h.sh.psGpRhlhs.pGla.El.bsWlphh..lu.........PYG.h.p...h.
consensus/85% ...hlb..hPslhhP+....h.sh.psGpRhlhs.pGla.El.bsWlphh..lu.........PYG.h.p...h.
consensus/80% .D.hLQ..hPsVhhP+.us.h.shppsGcRhlluusGlahEl.bsWlphl..lu.........PYG.hpp...h.
consensus/75% .D.hLQ..hPsVhhP+.us.h.shppsGcRhlluusGlahEl.bsWlphl..lu.........PYG.hpp...h.
consensus/70% .D.hLQpphPoVhhPRaus.hsshppsGcRhllAusGlalEl.RsWLchlbplu.........PYGshpc.s.hb
Species abbreviations
Asp. : Azoarcus sp.; Bvie : Burkholderia vietnamiensis; Cnec : Cupriavidus necator; Daro : Dechloromonas aromatica; Pnap : Polaromonas naphthalenivorans; Ppro : Pelobacter propionicus; Psp. : Polaromonas sp.; Rfer : Rhodoferax ferrireducens; Rmet : Ralstonia metallidurans; Rsol : Ralstonia solanacearum
Alignment of alpha helical domain-2
FINAL ----------------------------------------------------------HHHHHH-HHHHHHHH--------------HHHHHHHHHHHHHHHH---------------------------------------HHHHHHH------EEEEEE------HHHHHHH----HHHHHHHHHHH----------HHHHHHHH-EEE---------HHHHHHHHH-------HHHHHHH-------------------------------------HHHHHHHHHHHH----HHHHHHHHHHHHHHHHH---------------------------------EEEEEEE---HHHHHHHHHHHHH---------EEEEEEE-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--
Daro_2539_Daro_71847776 ------------------------------------MMALVLPRIGPQVPRSIAPGPLLAANAM-VSRFLIEAEAFDEADIPVTWSDSLDACRQALDGWLKCQIGALHCLTPRF--------ALHMVSRDGESYRYYGSQPPKDFDFNAVEASWCEYHEQEWPVGAGLEALSARLHGLGTVVLHVLCRQSAFV-YPLFTPDIACDVATYLYWCGED---DEEAALDMNCGEDEEEREAMRAEMVTKSMLEA------------------SFPAWTRRWPRGLELAQCARFLRRATNRLSDPGAKATAEDALALATLEIDDSFRP---DMEGE-----------FVGFGAVLSWRDGDVTTRIYDDLLELAHQGEYC-EHMGEVQVPLD-DPAAFGAWQQAMASRFAAIRLIDRLIHHLSAG
BproDRAFT_0624_Psp._67910472 -----------------------MKSIALERPVPKAHGTFVLPQISSEVPLVIGGESIAHQTLAKFSLAAEKCGMELPGG---DIPKLESIVQMQLQGWLDKQVGAN--------------------ARACLGGQPLISANSSEIEFFMRAVSNLELLKL----KPVIEALEAKVPGLGWYVVDVIERSNGRG-ISIYSPAAM-GYHSFSQLQGAESDEDFVKEMQAMEGEDEPSPEELAELIEQARSDYAYLPSKVLESVEGHAHLLGWASPNAKHGPKRLKTKQAAYLLKTAELPDGLKQCVTDAIALDCLYGK--DKGAYTWDNSQDEE-----------QIGAACFIAWNDAEMLFELVQHYEEDTYNSGTAMECLCRLKVATGGTPAEFEQMARLMRAYFDQWNALGNLLVHFLDQ
PnapDRAFT_0133_Pnap_84717448 MLDRYEHGREFECDPESASLVTTSVRNLGVVAQGNQGGMWVLPTFSPEIPLEISRADAEASNLADFILLAHKKGIRIPDSI---YTTTSELMTQQFANYARSQVKN--------------------VRVELPLDVSIVAIDKKIEFAANATDRFQGIYQL----KDSVERLNAASPGLGWFITDTIRKGHGVG-LTTYDPCRIANCVQLIWFDSET---DQEAAAEVLDIDEKNVTEAHIEQARDERTFM---PSDFLASVGGHKHLLSWSQTKKEKAACRSMSASRVRACIHKLKLVEADRALV-MAALEFHDSIKVRKANAAIAPNGWFEHENLHEFECLDALGSLAFIVWDDSEFAREAITHYEEYAMNGEGSHSQLVGLFVEL-DEPASWGPFIDAYKLYIKRYAAFSNFIGALPEE
RSc1660_Rsol_17428676 ------------------------------------MTALALPRLAA-MPTRYRTRDDGAAWCTPALLGLVDADALSADDVRRDPATPAELLQHTLQRHWDEITAGARIFDW--------HLSANPSQLGWWIPTTTSKNLWLAITPHNNNRVDVPLYYL----GPTITTLENIRKGLGQTVLAVFYDALRLL-PNTLTPADTYGHASWVHWHGET---DETMAIQWLYDEGDFETMEQAAAAYDGPTREA---------------LFEYMPEWAAYPRRVLSDRQVRRIARSHPFVAKVVDAVDGIWNHVHATHATGGYADCRVDADGD-------------SITWIAIFRWHPEDLALRIADDFTEFVTQGEY-QDASTLVCVE--SESDSLARWLHQMRANGQLARLVENLVDLIAMP
BcenP_01000005_Bcen_84357756 -----------------------------------------------------------MLTIEDAQLADDHRNERELARIALTRTWQELTDAHSIFEWSLRLSSD---------------------SCGPSYYRTGDDNSVWVSIHSDGGAGTAPVRFL----RGSISHLESVMPGLGQTVLAVLYEACAHYLPSVLTPSETISIAGYMYWQGHA---NEIEALPELRMHYDDVDEATPEEFFEACSIPR------------RTEFFRDAPDWLVNPQQVLNTFDVHRAAEQDEIAALAVSACDEIYSLIAHGGPFARVDHFDSNAG--------------PGIDFSLFLLWDHDDGTGRVIDDFLEHEMQG-DALEAACAVSLS--LAGKAVGNWFARVRNTSRLALAVEHLLDVIALR
p1B76_Asp._56315657 ------MRGRTYAVSRKLTEEFGVSGSASASIKRHPNDPLRLPR-KCLAPGAYVEHASAGLPLANLALALYEEGLITEADPDWGLAEVVKLGLMRLTEGTLGDLVFVAPVDL--------AVSSTLEGCEGFSVEESDPVPQTYWLALELTNALEPCFA-----GKRLLELEKAVPGLGKTALDVAQSAGART-TGCWSPLFVRDLSSYIYWRGAD---TQEEWLEELEASGEDPDDYGFSPKQYEEGFE--------------------VDWACSAQMELDGFALVQALDHPDPAVGDVAEKLCELMCLLNN-----RRSAFPDASPTDRE-----------SVYRGCLIRWDKNDPIEQVIDDHIEYANQGADCYTTLCSVWDVK-ITREDFSEWLKSYRLGLQLYKSLDQLLAMLHTS
PproDRAFT_0256_Ppro_71839549 -------------------------------MATSPPSFLSLPNIPKSVPRLY-EFDTASTCVANIALHLLDLGVVTESE---AIMPLQDIVKQSLNRWCRSKTKDLECFSPILMVSDTFAGIGGYAVDADTVLEQESITPETSIVALGITFDNTKCFTL----KDKCDTLHTVEPDFIEFVIQKLYH-SLCV-MFAVTPELAHDTADMFYWDCYD---DSDEEPYVTKE-----------------------------------EFYKIIPEWVANPTYKEGWVKEFDRCLEHDNENIRKIAQHILTWQDIEHTRKSDVLPYYPDQVSDDG---------CTTIQNGTWISWDENELFDRIIDDWGEYHYQTST-TDLNNFFVVP--ATKQGIEKGLTLLEYYFVRLEWADKLLRLVGKI
RmetDRAFT_6237_Rmet_68559356 --MLFDPRSFVPALDGGQPG-WSFARQHPAARHRPSHGFLTLPAIAAETPGRAFLSFGDEPDALELARAQFETGVLRASDV-VNPTSAADAFAQAMFAWLAARMPTCRRLNFSFS-------LVDLNAAKDQLMQFGWDDQVDASLYLAIDLPGDEVYFIG---KARADALRAVHPYLLYTAMSLINLASSKS-LHLRTPDVLLDLFARWHWEYDCTLANDDDAREFLKNGCGM-DEGDIARYLPSAVRP------ELAPDDVLPPFCHAYPE-SRKLKTVGSRKLYELARSQHGWLKDVCVALAELNLAVKRQRDR-----SAVADSQWAE-----------PAHSAATLAYAESDYVTQVLDDLYDGYANSGDATLFQCFIPIA--VEPKAIRQQFEDLSGMFKIIAALDRVLTLISD-
PHG309_Cnec_38637970 --MLFDPRSFVPEVDAGQPA-WTPARQHPIARRRPAHDFLTLPAIPAGVPNSGLLTFGDEVDVLGLVRAQFATGVLRANDVS-TPTGAGDAFAQAMFAWLRARTPECKRLSFGFS-------LIDIGAAKDQLMQFGWEDEIEAPLYLAIDMPGDDVYFIG---EARASALRAVHPHLLYTAMSLINTASAKS-LFLRTPEALLDLFARWHWEYDSTLADDKNAREFLAESCDM-DEGDIERYLPSVVRP------ELAPDDVLPPACHGYPA-SSKLRAFGSRKLYELSRANNGWVKDLCVALAELNLTLKRQGDR-----SAVAGSQWAE-----------PAYSAATLAYRQSDYVTQVLDDLYDGLNCSGDATLFQCFIPIA--GEPKAIRQQFRDLDGMLKTIAALDRVLTLISD-
Bcep1808DRAFT_6252_Bvie_67543572 --MFFDPALPDSSIAAGSAARWQPPRAAP-ARRRPAADLLTLPSFSTEVPGAVRLKWREDVNLSDLVLKHFQYGPLRAGDVH-DPADAGDAFQQAFHAWTRRQYGRLSRLRFTPH-----LFDAHAVRDVLDGLGNGNNDDDPTPLFFGFGLEDEWVYSL----EGAIETLRSTHPLLFRTVMGALYRASART-MFIRLPDWFMYEFSCWYWDGDPHISDKDADEALKERFDDDT-E-TRSAYLPSVVRP------QLCPDDADPCVFSGGKWRYRSALTAPELMRLR--ARSRGMPRRVCTEVLKLRALMRRSRSRD-----LLHVNYAAN-----------PAYALCSVIVEDNQFVGDLLDCHFENESQSGDATTYSGFSRLA--STPKAIRRQYADLALAFRILTHLDRLLALVSQS
BproDRAFT_4304_Psp._67908643 --------------------------------------------------------------MAKLARALCNVHPELLDLVTLSEQDLPKSCIEIVERWQASLRSFLPKDALAI-----------QPEVTGYRSGNNPEFGGDLLTVQLFLDCPEPIYMK-------EFMKRCRNKVLAHDAAKAIDQVAYLG-LEIWAPEVIRDMYGSMNWYHCDNDADILEEFAMNHWEGEGEDIPAMKPEDFP----------YVLPSKWDAHMKKLGYKKPGPKPLASIQQLREMAKGRSQKDAALATAILKLRKVIKRGHLRCSDDEDRWGC----------------VEPSFVFLWDTDSAQLRHALDEAVEDRHNAGVSRENVLQVSVRPESALQQVEDDVRAVEHLLAMQIAVGDLHTAMKTF
consensus/100% ..............................................................h..h.b..........s................h............................................................h.............bp.....h...h...h.............P...........h.......s......................................................................................h...........................................h.........phhpp..-...ps....................h......h.........h.ph...h...
consensus/95% ..............................................................h..h.b..........s................h............................................................h.............bp.....h...h...h.............P...........h.......s......................................................................................h...........................................h.........phhpp..-...ps....................h......h.........h.ph...h...
consensus/90% ..............................................................h..h.b...p...b..s....s.....p.....h..a...b.....................................................hb.b.......h..Lps....L...hh..h.p.........bsP....s..s.h.ap..s...s........................................................................p.............h....................s................h...s.l.hppsp...phhsc..-...pu.......s.h.l........h...h..h......h..h.pll..h...
consensus/85% ..............................................................h..h.b...p...b..s....s.....p.....h..a...b.....................................................hb.b.......h..Lps....L...hh..h.p.........bsP....s..s.h.ap..s...s........................................................................p.............h....................s................h...s.l.hppsp...phhsc..-...pu.......s.h.l........h...h..h......h..h.pll..h...
----------------------------------------------------
6e. Uncharacterized operons coding a protein with tandem repeats of a ubiquitin-like domain (polyUbl) (First evidence of polyubiquitins in bacteria)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Abbreviations: Y: Metal_binding domain_1; X Unknown domain
Gis are of the E1-like protein: Marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17134589">17134589</a> 416 Ubl+Ubl->E2l->JAB+E1*->Y-> Nostoc sp. PCC 7120 cyanobacteria alr7504 [Nostoc sp. PCC 7120]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=38423902">38423902</a> 471 Ubl+Ubl->E2l->JAB+E1*->Y-> Synechocystis sp. PCC 6803 cyanobacteria sll6053 [Synechocystis sp. PCC 6803]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67547439">67547439</a> 468 Ubl+Ubl+E2l->JAB+E1*-> Burkholderia vietnamiensis G4 proteobacteria>betaproteobacteria UBA/THIF-type NAD/FAD binding fold [Burkholderia vietnamiensis G4]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84711629">84711629</a> 469 Ubl+Ubl+E2l->JAB+E1*-> Polaromonas naphthalenivorans CJ2 proteobacteria>betaproteobacteria unknown protein [Polaromonas naphthalenivorans CJ2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69928900">69928900</a> 458 Ubl+Ubl+Ubl?+E2l->JAB+E1*-> Nitrobacter hamburgensis X14 proteobacteria>alphaproteobacteria UBA/THIF-type NAD/FAD binding fold [Nitrobacter hamburgensis X14]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86159351">86159351</a> 604 (JAB+E1+thioredoxin-like*?) Anaeromyxobacter dehalogenans 2CP-C proteobacteria>deltaproteobacteria UBA/THIF-type NAD/FAD binding protein [Anaeromyxobacter dehalogenans 2CP-C]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86742694">86742694</a> 476 Ub->X+E1*->Y-> Frankia sp. CcI3 actinobacteria UBA/THIF-type NAD/FAD binding fold [Frankia sp. CcI3]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=14025879">14025879</a> 396 Ubl+Ubl+Ubl->X+E1*->Y-> Mesorhizobium loti MAFF303099 proteobacteria>alphaproteobacteria mlr6140 [Mesorhizobium loti MAFF303099]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68554445">68554445</a> 389 Ubl+Ubl+Ubl->X+E1*->Y-> Ralstonia metallidurans CH34 proteobacteria>betaproteobacteria conserved hypothetical protein [Ralstonia metallidurans CH34]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=28806072">28806072</a> 392 Ubl+Ubl+Ubl->X+E1*->Y-> Vibrio parahaemolyticus RIMD 2210633 proteobacteria>gammaproteobacteria hypothetical protein [Vibrio parahaemolyticus RIMD 2210633]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=39651044">39651044</a> 400 Ubl+Ubl+Ubl->X+E1*->Y-> Rhodopseudomonas palustris CGA009 proteobacteria>alphaproteobacteria hypothetical protein [Rhodopseudomonas palustris CGA009]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77690161">77690161</a> 239 Ubl+Ubl+Ubl->Metal?->JAB->N+E1*-> Rhodopseudomonas palustris BisB5 proteobacteria>alphaproteobacteria hypothetical protein RPDDRAFT_1998 [Rhodopseudomonas palustris BisB5]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82740919">82740919</a> 398 X+E1*->Y-> Shewanella sp. W3-18-1 proteobacteria>gammaproteobacteria conserved hypothetical protein [Shewanella sp. W3-18-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88795472">88795472</a> 291 Ub->E1*-> Alteromonas macleodii 'Deep ecotype' proteobacteria>gammaproteobacteria hypothetical protein MADE_08186 [Alteromonas macleodii 'Deep ecotype']
***Note operon fusion: (The polyub is next to another operon that has been cited in a different context)
Alignment of Y: metal binding domain_1
FINAL HH---------EEEE-----------EEEEEE-----EEEEEE------EEEEEE--------EEEE---------------------EEEEEE--EEEEE-------
ALIGN HHHHH------EEEE-----------EEEEE-----EEEEEEE-------EEEEE--------EEEE----------------------EEEEE--EEEE--------
HMM HH--------EEEEEE---------EEEEEEE-HH--EEEEEE------EEEEEE--------EEEEE-------EEE----------EEEEEE--EEEEE-------
FREQ HHHHHH-----EEEE-----------EEEEE-----EEEEEEE-------EEE----------EEEE----------------------EEEE---EEEEE-------
PSSM HH---------EE-------------EEEEE-------EEEE--------EEEEE-------EEEEEE--------------------EEEEEE--EEEEE-------
RES LGFLKKSELTSRMVNFHPAPEEIMSGEVVIVGDRNHKKWACFRCPSGCGELILLSLNKNQHPSWRVDCDWLNRPTLHPSVRQLN-HCQCHFWIKRGVTQWCADSRHNK
sll6052_Ssp_38423901 LGFLKKSELTSRMVNFHPAPEEIMSGEVVIVGDRNHKKWACFRCPSGCGELILLSLNKNQHPSWRVDCDWLNRPTLHPSVRQLN-HCQCHFWIKRGVTQWCADSRHNK-----------------------------------------------------------
alr7505_Ana_17134590 LRFLPQPDLSARIVPTHPAPENIKPGEILVVGDAEYQKWACFRCPGGCGENILLSLNQKRHPCWAIAIDSLGRPTLNPSVRQLN-ECHCHFWVRQGVVEWCADSGQK------------------------------------------------------------
mlr6141_Mlot_14025880 -MMARVDCLTTVFVED--IPEQLDDGVLYV--SRQCHV-ALHNCACGCGEEVSTPLVPTE---YDLVMED-EGASIWPSIGNHDFPCGSHYIVKRGRIHWAGKMSREQIEAGRAYDRLLKRG--------------AQPKGLRAILAWIKRLWI-KFIG--------
Sputw3181DRAFT_3760_Ssp._82740918 ---MAVHYITPVFVEF--IPENIEQGKLYI--SETYKT-AIHKCCCGCGEEVVTPLSPAD---WQLKNGV-NTVSLYPSIGNWNYKCKSHYFINNNRIIWAPKFSPEQIQAVQVRDRVDKLNYIA-------DKNKAGPIAWSNFIGWLVKSWR-F---IRSLFSLR
RPA4125_Rpal_39651043 RKSMKLDQIKLQRVEF--MPKQLEPGILYV--SEKYRA-VAHLCACGCGAKIRTPLGITE---WAFTDNT-AGPSLWPSVGNWQQACKSHYIIDGGEIIWCGTWTPEQIMAGRRAEQARRKAHY--------DAMYVKR-------GLFNRVWQ-W---LKSLFGG-
Francci3_0886_Fsp._86566461 --MTRLDAVRHEFVEC--IPETLIQGVVYV--SIAYAT-VAHSCCCGCGNVAYTPLAPGR---WALTFDG-RSISLDPSIGNWSFPCQSHYWIERNRVHWHAAWTAEKIQKGRARTLQMI------------NKDIERTDGAKSATTAVQTRWRGWFARLRRRFK--
VP1086_Vpar_28806073 SLVLKHTHLAHKFVRS--IPKQLEPGILYV--SMEYAT-AIHSCCCGCGNQVVTPITPTD---WQLMFDG-DSISLSPSIGNWGFKCRSHYFIRKGMIVEAGQWDKKTITAGRDNDKHNKAHYYQ-------AKPKGDDNTYSHRVGLFKRVWH-WFLGKREFAKKR
RmetDRAFT_5044_Rmet_68554446 --MMRYKELEPRFVTT--VPRQLEPGVLYV--SMEYGT-VVHSCCCGCGEKVVTPLTPTD---WSITFNG-ESVSLWPSIGSWNLPCQSHYVIKGNRVLESGRWNRQMIDAEISRDNEAKAKYYKRTVSNETEPSLAHPIDIETGSQTYARQSF-WKTILSRLLR--
consensus/95% ........l...bV....hPcpl..G.lhl..s......sha.CssGCG..h..sls..p...a.h........ol.PSl.p....C.sHahlp.s...b.s....p............................................................
consensus/90% ........l...bV....hPcpl..G.lhl..s......sha.CssGCG..h..sls..p...a.h........ol.PSl.p....C.sHahlp.s...b.s....p............................................................
consensus/85% ...h..p.lp..hVp...hPcplb.G.lhl..sbpa...shapCssGCGp.l..sLs.sc...W.l..ss....oL.PSl.phs..CpsHahlc.s.l.bsup.s.p............................................................
consensus/80% ...h..p.lp..hVp...hPcplb.G.lhl..sbpa...shapCssGCGp.l..sLs.sc...W.l..ss....oL.PSl.phs..CpsHahlc.s.l.bsup.s.p............................................................
consensus/75% ..hh+.splp.bhVp...hPcplcsG.lYV..SbpY.s.shHpCsCGCGpblhTPLs.sc...W.lshss.pssSL.PSlGsasb.CpSHYhIcpsbl.Wsuphs.cbI...b..p...b............................h.p....b...........
consensus/70% ..hh+.splp.bhVp...hPcplcsG.lYV..SbpY.s.shHpCsCGCGpblhTPLs.sc...W.lshss.pssSL.PSlGsasb.CpSHYhIcpsbl.Wsuphs.cbI...b..p...b............................h.p....b...........
consensus/100% ........l...bV....hPcpl..G.lhl..s......sha.CssGCG..h..sls..p...a.h........ol.PSl.p....C.sHahlp.s...b.s....p............................................................
Species abbreviations:
Ana : Nostoc sp.; Fsp. : Frankia sp.; Mlot : Mesorhizobium loti; Rmet : Ralstonia metallidurans; Rpal : Rhodopseudomonas palustris; Ssp : Synechocystis sp.; Ssp. : Shewanella sp.; Vpar : Vibrio parahaemolyticus
Alignment of domain X: fused to E1: Perhaps a novel protease that displaced the JAB
FINAL --HHHH-----HHHHHHH--EEEEE--EEE-EEEEEEE-----EEEEEEEEEEE--------------EEEEE---------HHHHHHH-----EE------EEEEEEEE---------EE--------HHHHHHHHHHHHHHH------EE--EE--------HHHHHHHHHHHHHHHHHHHHHH--- EEEEEE---HHHHHHHHHHH----EEEE--------------HHHHHHHHHHH--HHHHHHHHHHH-----EEEEE---HHHHHHH---EEEEEE---HHHHHHHHHHHHH----EEEEEEEE---------EEEEEEEEE-----HHHHHHHHHEE---------EEEEEEEHHHHHHHHHHHHHHHHHHHH------------EEEEE-------------
ALIGN -----------HHHHHH---HHHHH--EEE-EEE-------------EEEEEEEE-------------EEEE------------EEEEE------E------EEEEEEEE------------EE----EEEEEEEE-H---E--------EE--EE---------EEE------HHHHHHHHHHH---- EEEEE-----EEEEHHH----EEEEEE------------------HHHHHHHH--HHHHHHHHHHHHHH----------H---------EEEEEEE------HEHHHHHH----EEEEE---EE------------EEEEE-------HHHH----------------HHHHHHHHHHHHHHHHHHHHHHH----E---------EEE----EE---------
HMM --HHHH---HHHHHHHHH-EEEEEE--EEE-EEEEEEE-----EEEEEEEEHHHH------------EEEEEE--------HHHHHHHHH----EEEEEE-HHEEEEEEEE--------EEEEEHHEEEHHHHHHHHH--HEE------EEE--EEEE------EEEEE-----HHHHHHHHHHH---- EEEEEE---HHHHHHHHHHHHHHHHH--------H--E-------HHHHHHHH--HHHHHHHHHHHHH-------HEHHHHHHHHH---EEEEEEEE-----EEEEEEE-----EEEEEEEEEEEE-------EEEEEEEEE----HHHHHH---E----------E--HEEEHHHHHHHHHHHHHHHHHHHH--HHHH--HEEEEEHHHHHHHH--------
FREQ --HHHHH----HHHHHHH---EEE---EEE-EE-----------EEEEEEEEE---------------EEEE-------------EEEE-------------EEEEEEE--------------------HHHHHHHHHHHHHH---------------------HHHHHHHHHHHH--HHHHHHHH--- EEEEE---HHHHHHHHHHHH----EEEE----EE---HHHHHEHH-------H--HHHHHHHHHH------EEEEE--------E-----EEEE---HHHHHHHHHHHHHH----EEEE-------------EEEEEE--------HHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHH--E---------HEEE--------------
PSSM --HHHH-----HHHHHH---EEEE---EEE-EEE---E------EEEEEEEEEEE-------------EEEEE---------HHHHHHH-----HH-----HEEEEE----------------------EEEHHHHHH------------EE--EE-----------HHHH-H----HHHHHHHHHH-- -EEEE-----HHHHHHHHH----EEEEE-----------------HHHHHHHH--HHHHHHHHHHH----EEEE-----HHHHHH----EEEEEE---HHHHHHHHHHHHH----EEEEE-------------E-EEEEEE------------EEEE---------EEE--------HHHHHHHHHHHHHHH---------H----EEE--------------
RPA4126_Rpal_39651044 MFQKLVSHNDDIKRLVDKGYAVGFDSNYMI-VRDIPYLDAQGSECWGAIVTKLVAT--DQGHVIQDDHQIFFAGSSPYNTDGTAIANLSDRPTALGLSEAAADVAVQRQFSNKPRIDGQLVGFNSFFDKIESYVGIISGPARAKFGSNWLTY--RSVEKVANDSVFKIHDTMTSRAEITALSAKFKDEV IAIIGLGGTGAYILDFMVKTPVKEIRGFDLDPFHVHNAFRSPGRFEDSEFKRS--KADVYQTRYDNFRHGLTLKAKFIDASCASDFDGVTFAFVCVDKGSSRAGIFEVLMAKGIPFIDVGMGLNRKRGP---LAGMMRATYYDPANAQAMKDKGFSELSDRPN--DEYRVNIQIGELNALNATLAVIKYKKLKGFYIETNPDFNFLFDLSDCKITRRSKIDEA
mlr6140_Mlot_14025879 MSADLISRDPHLKRLLDEGFELEMRELVLLLVHSVPYVKRDKSLGRGTLVCTLSLDTQGLTASPQTDHTMWFTGETPCHRDGAPMTNIIHNSNEATVGS---DIKVHHYFSSKPEGTGQ---YANIYDKVVTYESHLGAAARSHDKTANART-GVTLASAQDDSPFAIPDSASARYGIVAANRKLRG-R VAIIGLGGTGAYLLDLAAKTRVAEIHLYDDDQLLNHNLFRSPGAPEPVLAKNFPRKVDYYAALYARMHKGVKPHPTRVKADNIDEFAGYDFVFVCVDKGSSRRVIAEGLVRLGIPFVDTGIGLGLEHNT---LDGCARATFIAPGTPWAE-VATHLSFGDDDEEADVYGTEIQTAELNSLNAIMAIMRWKRWLTFYRDERNERNATYMIEGNNITNRGA----
Sputw3181DRAFT_3761_Ssp._82740919 MSSKLTVHNPSILRLIEEGFEIDIVRQHLL-VHSIPYLNQSGEVKFATLACPFVEN--GEQDTRPQDHTMWFKGEYPHDGKGRPMTEVVNSPNQHVLFD---EFGVDFYLSNKPNGQD----FSNFYDKVVHYHTLFVSQARLVDSNADGRT-GIVHGQRDESSVFCYPDTASSRAGITAITQKLEGSR IAIVGVGGTGSFILDLLAKTPIAEIHLFDADDFEPHNAFRAPGAASLEQLQSAPKKVDYFFDVYSAMRHGVVAHPYFLDEQNVYELDSFTFVFVAVDNGQARRVVTQHLVNRGIPFIDVGMGIEIVEDASLQLRGTCRVTLVTNEKNLHL--AQRANLHDDDDE-ALYKSNIQVADLNAMNAALAVMRWKQYMGFYLDQGQAHNLNYTLSLQSLTRDDGPEED
Francci3_0885_Fsp._86566460 MSQRLIVRSADLGRLREEGYHLETRGNVLL-VHDVPYVNPSREVLRGTLVTELELA--GDMTIQPSNHVAQFIGQTPSDSEGHPLSKLINSGAASLVG----SVHVNFTFSKKPMG-GDQ-RYRDYHHKVTTYVALLLMHAQVLDPTVAATTFPVITPDEDDDSPFEYLDTASVRAGISEVTKKLRLGP IAIIGLGGTGAYTLDLVAKTPVREIHLFDGDRYLQHNAFRSPGAPSIEELATVPKKVDYFAARYAKMRKKIVPHGDFVTEANVDELRGMTFVFLALDDGPARKLIVTKLEEYGIDFIDVGIGVEHVDNS---LTGLVRTTLSTVDSRKHLDADHRLPFGKANDA-NDYNRNIQIADLNALNAALAVIKWKKLAGFYLDLEREHYSAYAVNGNTLINEDLG---
VP1085_Vpar_28806072 MSLQLINLNSDLKRLRDEGYFIQVKNGFLI-MRDVPYVNSNRHVCRGTIISSLSLA--GDRTRIPDTHVVHFDGDMPCNAEGEALNAVVLQSSIFDLGR---GITAKHMFSSKPKS-G----YTDYYHKMTTYASILSGHAEVLNSGISPKV--FSTPEDEEDSVFNYTETASGRVGIGALSDLLTEES VAIIGLGGTGSYILDLVAKTPVREILLFDSDEFLQHNAFRAPGAPTLEALRDAEKKVEYFKSIYSNMHKRISTSSTYIDEENLELLNGVTFAFICIDAGTSKKSIVQKLEELDIPFVDVGMGVELTDGS---LGGILRVTASTSGKRQHV-HEGRVSFGGGEGN-DVYSSNIQVADLNALNAALAVIKWKKIRGFYRDLEQEHHSTYTTDGNLLLNGESCA--
RmetDRAFT_5043_Rmet_68554445 MSAALFNRNSDLKRLWDEGYRMRVEGGSLV-MLNVPYVNAKGEVKEGKIISPLLLA--GDVTQKPEPHTVHFEGEFPCDAGGKPLQAISACGVPADL-----HAVAQYYLSTKPDANG----YTDYHQKMATYAAIISGHATVLDREASPRK--VWQPLDDEESVFNYVENASGRAGIDKLTALLAGDC VAIIGLGGTGSYVLDFVAKTPVREIRLIDGDDFLQHNAFRAPGAPTAEQLREVPKKVDHFRSIYANMHRGIAAHAVALDASTVGLLTGVTFAFLCMDAGHGKRIAIDQLESLGVPFVDVGMGLELSNGT---LGGILRTSLSTPDCRDIA--RSTISFDEPDRD-GIYSSNIQVADLNAMNAVMAVMRWKRYRNFYRDFEGEFHSSFTTDVNMLLNGEPK---
consensus/100% M...L.s.sspl.RL.-cGa.h......hl.h.slPYlp.p.p..bu.lhs.h..s..s......psH.h.F.Gp.P.p..G.sh..l..pss...l......h.sp..hSpKP...s....a.shapKh.pY.s.h...Ap........p..........ppSsF.h.-shosRh.Is.hs.bh.... lAIlGlGGTGua.LDhhsKT.l.EI..hD.D.h..HNhFRuPG..p...h.p...Ks-.a.s.Ys.h++.l..ps..lp..sh..h.uhsFsFlshD.G.u+..h.p.L..bslsFlDsGhGl...pss...L.G.hRsoh.ss.p...........h.......s.Y..pIQ.u-LNuhNA.hAlh+aKbh.sFYb-........a..p.p.l.p.......
consensus/95% M...L.s.sspl.RL.-cGa.h......hl.h.slPYlp.p.p..bu.lhs.h..s..s......psH.h.F.Gp.P.p..G.sh..l..pss...l......h.sp..hSpKP...s....a.shapKh.pY.s.h...Ap........p..........ppSsF.h.-shosRh.Is.hs.bh.... lAIlGlGGTGua.LDhhsKT.l.EI..hD.D.h..HNhFRuPG..p...h.p...Ks-.a.s.Ys.h++.l..ps..lp..sh..h.uhsFsFlshD.G.u+..h.p.L..bslsFlDsGhGl...pss...L.G.hRsoh.ss.p...........h.......s.Y..pIQ.u-LNuhNA.hAlh+aKbh.sFYb-........a..p.p.l.p.......
consensus/90% M...L.s.sspl.RL.-cGa.h......hl.h.slPYlp.p.p..bu.lhs.h..s..s......psH.h.F.Gp.P.p..G.sh..l..pss...l......h.sp..hSpKP...s....a.shapKh.pY.s.h...Ap........p..........ppSsF.h.-shosRh.Is.hs.bh.... lAIlGlGGTGua.LDhhsKT.l.EI..hD.D.h..HNhFRuPG..p...h.p...Ks-.a.s.Ys.h++.l..ps..lp..sh..h.uhsFsFlshD.G.u+..h.p.L..bslsFlDsGhGl...pss...L.G.hRsoh.ss.p...........h.......s.Y..pIQ.u-LNuhNA.hAlh+aKbh.sFYb-........a..p.p.l.p.......
consensus/85% M...L.s.sspl.RL.-cGa.h......hl.h.slPYlp.p.p..bu.lhs.h..s..s......psH.h.F.Gp.P.p..G.sh..l..pss...l......h.sp..hSpKP...s....a.shapKh.pY.s.h...Ap........p..........ppSsF.h.-shosRh.Is.hs.bh.... lAIlGlGGTGua.LDhhsKT.l.EI..hD.D.h..HNhFRuPG..p...h.p...Ks-.a.s.Ys.h++.l..ps..lp..sh..h.uhsFsFlshD.G.u+..h.p.L..bslsFlDsGhGl...pss...L.G.hRsoh.ss.p...........h.......s.Y..pIQ.u-LNuhNA.hAlh+aKbh.sFYb-........a..p.p.l.p.......
consensus/80% MS.pLhs+ssclbRLb-EGa.lphc...Ll.h+slPYls.p.pl.bGslls.L.hs..Gp.s.b.psHshaF.Gp.PpsscGpshs.l..pss..sl.....ph.spabhSsKPpu.G....assaacKhsoY.ullsu.Aps.s.ssssp...h..sp.p--SsFph.-oASuRhGIs.lo.bLp... lAIIGLGGTGuYlLDhhAKTPV.EI+LaDsDpab.HNAFRuPGAsp.pbhbps.+KVDaa.sbYupM++.lss+s.hlc.psl.bhsGhTFsFlslD.Gpu++.lhp.L.pbGIPFlDVGhGlpb.css...LsGhhRsTh.sssp.b.h....phshscssp..s.YpsNIQlA-LNAhNAshAVh+WK+hbsFYbDbp.-ap.sas.sss.l.p.p.....
Fsp. : Frankia sp.; Mlot : Mesorhizobium loti; Rmet : Ralstonia metallidurans; Rpal : Rhodopseudomonas palustris; Ssp. : Shewanella sp.; Vpar : Vibrio parahaemolyticus
-------------------------------------------------------------------------------------------------------------
7. Ub fused to Mut7-C (Operons uninformative)
^^^^^^^^^^^^^^^^^^^^
Gis are of the Ub+Mut7C protein
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=41410171">41410171</a> 200 Ub+Mut7C Mycobacterium avium subsp.paratuberculosisK-10 actinobacteria hypothetical protein MAP4073 [Mycobacterium avium subsp. paratuberculosis K-10]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=20520977">20520977</a> 242 Ub+Mut7C Streptomyces coelicolor A3(2) actinobacteria conserved hypothetical protein [Streptomyces coelicolor A3(2)]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71915653">71915653</a> 241 Ub+Mut7C Thermobifida fusca YX actinobacteria conserved hypothetical protein [Thermobifida fusca YX]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=54016307">54016307</a> 251 Ub+Mut7C Nocardia farcinica IFM 10152 actinobacteria hypothetical protein [Nocardia farcinica IFM 10152]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76785598">76785598</a> 252 Ub+Mut7C Mycobacterium tuberculosisF11 actinobacteria COG1656: Uncharacterized conserved protein [Mycobacterium tuberculosis F11]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13880123">13880123</a> 236 Ub+Mut7C Mycobacterium tuberculosisCDC1551 actinobacteria conserved hypothetical protein [Mycobacterium tuberculosis CDC1551]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29606942">29606942</a> 241 Ub+Mut7C Streptomyces avermitilis MA-4680 actinobacteria hypothetical protein [Streptomyces avermitilis MA-4680]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=53688960">53688960</a> 250 Ub+Mut7C Nostoc punctiforme PCC 73102 cyanobacteria COG1656: Uncharacterized conserved protein [Nostoc punctiforme PCC 73102]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67930484">67930484</a> 226 Ub+Mut7C Solibacter usitatus Ellin6076 fibrobacteres/acidobacteria Protein of unknown function DUF82 [Solibacter usitatus Ellin6076]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56311907">56311907</a> 265 Ub+Mut7C Azoarcus sp. EbN1 proteobacteria>betaproteobacteria conserved hypothetical protein [Azoarcus sp. EbN1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74318176">74318176</a> 264 Ub+Mut7C Thiobacillus denitrificansATCC25259 proteobacteria>betaproteobacteria hypothetical protein Tbd_2158 [Thiobacillus denitrificans ATCC 25259]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68554875">68554875</a> 266 Ub+Mut7C Ralstonia metallidurans CH34 proteobacteria>betaproteobacteria Protein of unknown function DUF82 [Ralstonia metallidurans CH34]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74019866">74019866</a> 268 Ub+Mut7C Burkholderia ambifaria AMMD proteobacteria>betaproteobacteria Protein of unknown function DUF82 [Burkholderia ambifaria AMMD]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17431582">17431582</a> 257 Ub+Mut7C Ralstonia solanacearum proteobacteria>betaproteobacteria hypothetical protein of unknown function duf82 [Ralstonia solanacearum]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77965627">77965627</a> 254 Ub+Mut7C Burkholderia sp. 383 proteobacteria>betaproteobacteria protein of unknown function DUF82 [Burkholderia sp. 383]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84363527">84363527</a> 252 Ub+Mut7C Burkholderia dolosa AUO158 proteobacteria>betaproteobacteria COG1656: Uncharacterized conserved protein [Burkholderia dolosa AUO158]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48782379">48782379</a> 251 Ub+Mut7C Burkholderia fungorum LB400 proteobacteria>betaproteobacteria COG1656: Uncharacterized conserved protein [Burkholderia fungorum LB400]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83719003">83719003</a> 251 Ub+Mut7C Burkholderia thailandensisE264 proteobacteria>betaproteobacteria Protein of unknown function family [Burkholderia thailandensis E264]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67908809">67908809</a> 251 Ub+Mut7C Polaromonas sp. JS666 proteobacteria>betaproteobacteria Protein of unknown function DUF82 [Polaromonas sp. JS666]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82701281">82701281</a> 251 Ub+Mut7C Nitrosospira multiformis ATCC25196 proteobacteria>betaproteobacteria Protein of unknown function DUF82 [Nitrosospira multiformis ATCC 25196]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83745680">83745680</a> 277 Ub+Mut7C Ralstonia solanacearum UW551 proteobacteria>betaproteobacteria Zinc finger protein [Ralstonia solanacearum UW551]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67759108">67759108</a> 251 Ub+Mut7C Burkholderia pseudomallei S13 proteobacteria>betaproteobacteria hypothetical protein BpseS_02004453 [Burkholderia pseudomallei S13]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72117336">72117336</a> 247 Ub+Mut7C Ralstonia eutropha JMP134 proteobacteria>betaproteobacteria Protein of unknown function DUF82 [Ralstonia eutropha JMP134]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67738583">67738583</a> 251 Ub+Mut7C Burkholderia pseudomallei 668 proteobacteria>betaproteobacteria COG1656: Uncharacterized conserved protein [Burkholderia pseudomallei 668]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67666690">67666690</a> 259 Ub+Mut7C Burkholderia cenocepacia HI2424 proteobacteria>betaproteobacteria Protein of unknown function DUF82 [Burkholderia cenocepacia HI2424]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67669903">67669903</a> 251 Ub+Mut7C Burkholderia pseudomallei 1655 proteobacteria>betaproteobacteria hypothetical protein Bpse1_02004518 [Burkholderia pseudomallei 1655]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67154055">67154055</a> 308 Ub+Mut7C Azotobacter vinelandii AvOP proteobacteria>gammaproteobacteria Protein of unknown function DUF82 [Azotobacter vinelandii AvOP]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=4981308">4981308</a> 247 Ub+Mut7C Thermotoga maritima MSB8 thermotogae AE001747_4 conserved hypothetical protein [Thermotoga maritima MSB8]
Alignment of Ubl fused to Mut7-C
<------------------------------Ub-like-------------------------------------------><------NYN/PIN------------------------------------------------------------------------------------------>|<Zn-ribbon------------------------------------------------------>
FINAL ---EEEEEHHHHHH---------EEEE------HHHHHHHEE-----EEEEEEE----------------EEEE-------|---------------EEEHHHHHHHHHHHHHHH------------HHHHHHHHHH--EEEEE-HHHHHHHHH----EEEE---HHHHHHHHHHHH--H----------------------------HHHHH----EEEEE---EEEEE---HHHHHHHHHHHHH------
ALIGN ----HHHHHHHHHH--------------------EEEEEEE------EEEEEEE---------------EEEEE-------|---------------EEEHHHHHHHHHHHHHHH-------------HHHHHHHHH---EEEEHHHHHHHHHHHH----EE----HHHHHHHHHHH--H-------E---------HH--HH---------------EE-----EEEEE----HHHHHHHHHHHH------
HMM ----EEEEHH-HHH--------EEEEEEE-----EEEEEEEEE---EEEEEEEEE--------------EEEEEE------|---------------HEEEHHHHHHHHHHHHHHHHHHHH------HHHHHHHHH---EEEEEHHHHHHHHHHH-EEEEEE----HHHHHHHHHHH--H-HHHHHHHHHH----HHHHHHHHHHH---HHHHHEEEEEEEE---EEEEE--HHHHHHHHHHHHHHH-----
FREQ ---HHHHHHHHH------------HE-------HHHHHHHH------EEEEEEE----------------EEEE-------|--------------HEEHHHHHHHHHHHHHHHH----E--------HHHHHHHHHH--EEEE--HHHHHHHHH---EEEE---HHHHHHHHHHHH--H-------------------------------------EEEEE----EEEE----HHHHHHHHHHH-------
PSSM ---EEEE----------------EEE-------HHHHHH---------HHHEE-----------------EE---------|---------------EEEEHHHHHHHHHHHH----E---------HHHHHHHHH----EEEE---HHHHH------EEE-----HHHHHHHHHHH-------------------EE------------HHH----EEEEE-----EEE---HHHHHHHHHHHHH------
Tfu_1519_Tfus_71915653 HASITLRFDPTLRPLLAPRNRTDLLHVNHDPAASLSHVVESLGVPLTEIGELRINGTTASPSQHPQPGDLIEVLTVPKPQP|---------VPFSPIRFILDVHLGTLARRLRLLGVDTVYYT-HRDDPALVQQANEEQRILLTRDRGILYRKNLRAGGHIYASNPDEQLFEVLDRY--APPLAPWTRCLTCNGPLAQVDKDNIADQLPAGTRATYDTFVQCTECRQIYWPGAHHARLTQIIEAAQKRVAAI-
SCO4976_Scoe_20520977 GPEIHVEFAPELHLFVPRARPTGVASAATDGVSTLGHLVESLGVPLTEVGALLVDGREVPPGHIPAGGESVRVRPVRHPQR|---------VPGAPLRFLLDVHLGTLARRLRLLGVDTAYESTDLGDPALAALSAAEKRVLLSRDRGLLRRRELWAGAYVYSTRPEEQLQEVLDRF--RPALSPWTRCTACNGLLRTATKEEVAEQLEGGTRRSYDVFAQCTACGRAYWRGAHHEQLEAIVERAVSSTRDA-
SAV3291_Save_29606942 GPEIHVAFAPELRLFVPHERRSGTTAVGTDGASTLGHVVESLGVPLPEVGALVVNGRETPVSYIPAAGDSVEVRPVERPQR|---------VPGAPLRFLLDVHLGTLARRLRLLGVDTAYESTDLGDPALAALSAAEKRVLLSRDRGLLRRRELWAGAYIYSTRPDDQLRDVLDRF--APGLAPWTRCTACNGVLEKATKEQVADQLEGGTQRSYDVFAQCEECGRAYWKGAHHDRLEAIVERALAEFGA--
Npun02000115_Npun_53688960 MAIAYFYFHAELNHFLPRHHKQVKISHFFEEKASIKDMIESLGVPHPEVDFINVNGKYVNFSYIVSDGDAINVYPISARSV|IIPSISVFPEPLSIIHFVVDIHLGKLATSLRLLGFDTLYRN-DYEDEKLAQISSSQGRILLTRDKGLLMRSLVTHGYYVRNTNPQEQIIEVLQRFDLFKLITPFKRCLRCNGLLEWVDKQSIIEQVPEKVRSQIDQFQRCQDCDRIYWKGSHYERLQQFIDGVLNSQKGE-
TM_0779_Tmar_4981308 EKIAFFRFFGRLNDFFRNSERIK--THRFTGFQTVKDRIEALGVPHVEVSLITLNGKPVGFDHMVEDGELFFVYPEFQNIE|IPEDWLVTPRYIGEPRFVLDIHLGKLARLLRMLGFEAVFGE-E-SDEKLCWMAVKKKAILLSRDTGLLKRKELVFGYYVRNTDPKEQLVEVVERYDLKKWMKPFTRCIECGVELEEVPKEAVKNRVPPKVYGFFNEFARCPVCGRIYWKGSHYDHMVEFIKSNINKG----
AcidDRAFT_4098_Susi_67930484 MPDGRFYFEGDLSLFLLPSLRGREVKRTWSDTDTLMHVIESIGVPHTEV------------ARIERDGSLIRVYPRTREIL|------------QDPRFVLDQHLGRLAAYLRMLGFDVLHTV-PAPDQHLAAASSREDRVLLTRDVGLLKRKEVRRGYFVRATDPRAQLLEVLKRFGLVDAIAPFTRCFLCNTPLESVDKAVIARQLPERIADLHNHFMRCPSCGRVYWKGSHYDRMRELIEDIKKRALFD-
nfa28300_Nfar_54016307 ASGIELRLYAELNDFLPPQDRQDALWRPVRPHQTVKDIVEAAGVPHTEIDLLLVNGESVGFEHHPRPGDRLAAYPMFESLD|ISGLTRVRPHPLREPRMLIDVNLGGLARLLRLMGQDVRCDF-DATDARLAEISAEDHRILLTRDRGLLARRIVSHGVYVRADRPFEQIVEVIGRLDLADQLAPFTRCLRCGAVLADVAKDEIVHELSPGTRENYDTFRRCTGCGRIYWAGAHQRRLDDLVTQILAAVRR--
MtubF_01000602_Mtub_76785598 VGYVDVRAYAELNEFVELQARGLTVRRPFRSHQTVKDVLEAMGIPHTEVDLILVNGDPADFSYRPVAGDRIAAYPMFEALD|IGSTARLRPAPLRNPRFVVDVNLGQLARLLRLLGFDTRWSS-AADDPTLADISLGEQRILLTRDRGLLKRRAITHGLFVHSQHPEEQALEVLRRLDLNGRLAPLSRCLRCNGELAAVSKDEVIGQLEPLTRRYYESFSRCFGCGRIYWPGSHHARLVRLVERLRDQLTTST
RRSL_04745_Rsol_83745680 MPTLLFTFDASLTPLLPLTQRQRPAARAWPEGATLKHAIETFGVPHTEVGAVHVDGCAAPLESLLPARGAVAVAGVQAALP|-----------QAPLHFLCDAHLGATARLLRMAGFDTAYDN-NYADATIEALADTEDWIVLSRDRELLKRRGIRRGAFVRAREPQAQMREIVARFKLAEAARPFSRCLECNAPLRLLSAEEAASSVPPRVRERQHLFSTCDVCRRVYWPGSHWARMNTALARMLAPHQEDG
RSp1109_Rsol_17431582 MPTLLFTFDASLTPLLPVAQRERPAARAWPEGATLKHAIETFGVPHTEVGVVQVDGHAALLDALLPARGAVAVAGVRAALP|-----------DAPLHFLCDAHLGATARLLRMAGFDTAYDN-NYADATIEALADTEDWIVLSRDRELLKRRGIRRGAFIRAREPQAQMREIVARFRLAEAARPFSRCLECNAPLRLLSAEEAAASVPPRVRERQHLFSTCDVCRRVYWPGSHWARMNTSLARMLAPHPDGA
BdolA_01000029_Bdol_84363527 MATATFRFHGELNAFVARTQRDRAFAHACARDATLKHAIEALGVPHTEIGQLTVNGAAAGLDRPVGDGDRIDVYPERAREP|--AAAPPATPRSEQWRFVADAHLGGLAQLLRLAGFDTCYDN-HYRDDEIAALAEREGRLVLTRDRELLKRRAVARGCYLHALQPADQLRELFSRLALAPYMRPFRLCLRCNAPLHALDADAAAPRVPAGVRQRHRRFVECDVCRRVFWEGSHWRRMRALVDSMRTAAVPDE
BambDRAFT_0385_Bamb_74019866 MATATFRFHGELNAFLARAQRGCAFAHVFARDATVKHAVEALGVPHTEIGRLCVNGAPAALDRPLGDGDRVDIHPERARPA|---IESPVQPQPESWRFIADAHLGGLAQLLRLAGFDTCYDN-HYRDDELVALAAREGRIVLTRDRELLKRRAVVRGCYLHAQQPDAQLHELFARLDLAPHMRPFRLCLRCNAPLHALDAADAAPRVPAGVRQRHRRFAACDVCRRVFWEGSHWRRMRAVVDAMRALPPVAP
Bcen2424DRAFT_1951_Bcen_67666690 MATASLRVVVELNAFLASQQRDRAFAHACARDATVKHAIEALGVPHTEIGRLYVNDAPAALDRPLDDGDRVEVLPERAGPA|---ANGATGPPPAAWRFVADAHLGGLAQLLRLAGFDTCYDN-HYRDDELAALAEREQRIVLTRDRELLKRRAVVRGCYLHALQPADQLRELFERLDLAPHMRPFRLCLRCNAPLHPLDAAAAAPSVPAGVRLRHRRFAACDVCRRVFWEGSHWRRMRAVVDAMRTPSPVRR
Bcep18194_A3405_Bsp._77965627 MATATFRFHDELNAFLPRAQRDRAFGHACARDATLKHAIEALGVPHTEIGRLCVNDAPATLDRPLDDGDRVEAFPERAQPA|---ANGATVPPSAHWRFAADAHLGGLAQLLRLAGFDTCYDN-HYRDDELAALAAREGRIVLTRDRELLKRRAVERGCYLHALQPADQLRELFERLDLAPHMRPFRLCLRCNAPLHPLDAAAAAPRVPAGVRLRHRRFAACDVCRRVFWEGSHWRRMRTVVDAMRAPPPPAP
AvinDRAFT_7917_Avin_67154055 MVSVTFRFYEELNDFLPSERRRQAFACDCARAATVKHMIEALGVPHTEVELVLLNGESVDFSRPLHDGDRVAVYPRFEALD|IGPLLKVRDHPLRELRFIADAHLGGLASLLRMCGFDTLYDN-HYEDRQIAALAAEQRRIVLSRDRELLKRRIVTHGCYLHALKPALQLRELFERLDLAGSARPFSRCLHCNLPLHEVTVEQARPRLPPRIAALYSRFFGCDACQRLYWEGSHWRSMRSLLAPLLDDRPPER
Bpse1_02004518_Bpse_67669903 MVTVTFRFYEELNDFLARPLRRREFAHACMRGASVKHAIEALGVPHTEVELILVNGESTPFSHVLEEGDRVAVYPSFEAID|IRPLLRVRAAPLRVTRFIADAHLGGLAQLLRLAGFDTLYDN-HYPDKLIETIAAREARIVLTRDRELLKRRTITHGCYVRALKPQAQLQELFDRLDLAGSARPFRLCLSCNAPLRRIDPAEAAGRAPQGVLQRHTRFVTCDVCRRVFWEGSHWRRMRALIEHVSQPKPPPG
Bcep02006224_Bfun_48782379 MVTATFRFYEELNDFLARPLRRRAFTYACAPGATAKHMIEALGVPHTEVELILVNGESVGFNHPLSDGDRLAVYPKFEALD|IHPLLRVRERPLRVVRFIADAHLGGLAPLLRLAGFDTLYDN-HYPDADIEALAAAQQRIVLTRDRELLKRRNITHGCYVRTLRPREQLREVFERLDLAGSAQPFRLCLMCNVPLRRIPKEEVGTRAPDGVLERHAQFVTCDVCRRVFWEGTHWQRMRALMDSVAAAPDRSA
Tbd_2158_Tden_74318176 MVIATFRFYEELNDFLAPDRRKREFTVPCARAATTKHMIEALGVPHTEVELILVNGESAGFDRRLQDGDRVAVYPRFEAMD|VSPLLRVRERPLRETRFVADAHLGGLAHMLRMLGFDTLYDN-HFHDDAIVAICEHDGRIVLTRDRELLKRRSVTHGCYIHALKSEAQLREVVARLDLARSARPFTRCLHCNVPLRTVDKASVLDRLPPKVREHYAHFPTCDSCGRIYWAGSHWRNMRRLLDDVLSGERDSG
Nmul_A0146_Nmul_82701281 MVTATFRFYEELNDFLVPERRKREFSCPCARAATTKHMIEALGVPHTEVELVLVNGESVGFDRILEHGDRVAVFPKFEMVD|VAPLLRVREHPLRVTRFIADAHLGGLAHLLRMTGFDTLYDN-NYHDRQIELLAAQEKRIVLTRDRELLKRRSITHGCYVRTLKPPEQLCEIFDRLDLAHSIKPFTLCLNCNAPLRPVEKSVVLERLPPSVRERFDHFSTCDICHRVFWEGSHWQRMRTMLEECIKPNRFGG
BproDRAFT_2323_Psp._67908809 MVMASFRFYEELNEFLAPERRGREFACPCARAATTKHMIEALGVPHTEVELVLVNGESVGFDRQLREGDRVAVYPKFEALD|VTPLLRVRGQTLRVTRFVADAHLGGLAHLLRMAGFDTLYDN-HFRDEEIERIAAEQGRIVLTRDRDLLKRRTITHGCYVHALRTELQLREIFGRLDLARSARPFTLCLHCNAPLHAIEKMRVATMLPPQVREHYQRFSACDVCHRVFWEGSHWRRMRLMLDGLLS------
ebA822_Asp._56311907 MVTATFRFYEELNDFLAPARRRREFDAPCARAATVKHMVEALGVPHTEVELVLVNGESVDFGRLLRDGDRVAVYPKFESLD|ITPLLRVRSHPLRVMRFVADAHLGGLAHLLRMTGFDTLYDN-HFDDGEIEIIAGRDARIVLTRDRELLKRRTLTHGCYVRALKPAQQVREIFDRLDLAGSAKPFTLCLDCNAPLRPIGKAQVEDRLPPGVRASHTRFSTCDVCRRVFWEGSHWRRMRVLVDELLAGSPPLP
RmetDRAFT_5449_Rmet_68554875 MVTATFRFYEELNDFLAPAQRRRDLSCPCARAATVKHMIEALGVPHTEVELILVNGESSPFERIVCDGDRIAVYPKFESFD|IAPLLRVREQPLREIRFVADAHLGGLAHLLRMTGFDTLYDN-HFEDCEIARIASDEKRIVLTRDRELLKRRGITHGCYVRAIRSSLQVREIFSRLDLARSARPFSLCLDCNVPLRRIGKTDVDGRVPEGVFERHEHFVTCPHCHRVFWEGSHWRKMRTLVEELMSAQADQV
Reut_A0217_Reut_72117336 MVTATFRFYEELNDFLAPDQRRRDLSCPCARAATVKHMIEALGVPHTEVELILVNGESSGFDRMLEDGDRVSVYPKFESLD|VSPLLRVRAHPLRIMRFVADAHLGGLAHLLRMMGFDTLYDN-HFEDSEIERIAEREGRIVLTRDRELLKRRGITHGCYVRAIKSTPQVREIFQRLDLARSARPFSLCLDCNVPLQPVARDVVADRVPPAVLERHDRFVTCDGCRRVFWEGSHWRCMRALVDELVCAG----
consensus/100% .....h.h..pLp.hh.................o..c.lEshGlP.sEl.....................h.h.s......|...............+hhhD.pLG..A..LRh.G.-s........D..l...s..p..llLoRD..lL.Rp.l..G.al.s.ps..Qh.-lh.Rh.....h.PhpbC..Cs..L..h....h...h..........F..C..C.bhaW.GsH..ph...h...........
consensus/95% .s.h.h.h..pLp.hl....+............o..chlEshGVP.sEl..l.lss..s...........l.h.s......|...............+FlhDhpLG..A.bLRh.GhDs.a......D..l..hu..p.bllLoRDp.LLbR+.l..Ghal.s.ps..Qh.Elh.Rh.....h.PapbC..Css.L..h....h..ph..........F..C..C.RhaW.GuHa.ph..hl..h........
consensus/90% .s.h.h.h..pLp.hl....R...h..s.....o..chlEuhGVP.sEl..l.lss..ss.......Gs.l.h.s......|...............+FlhDhHLG.LApbLRh.GhDshaps.ph.D..l..hu..p.bllLoRD+.LLbR+.l..Ghal.s.ps..Qh.Elh.Rh.....h.PapbCh.CNs.L..ls...h..pl...sb...p.F..C..C.RlaW.GuHa.php.hl..h........
consensus/85% .s.h.hbF..cLs.Fls...R...h..sh...sThbchlEuhGVPHTEl..l.lsG..sshs....sG-.l.lhP......|...............RFlhDhHLG.LApbLRhhGhDThaps.ph.D..l..lu..p.RllLoRDR.LL+R+.l..Ghal+s.ps..Ql.Elh.Rh.Ls..h.PapbCl.CNs.Lp.ls...h..pls..sb..ap.F..Cs.C.RlaW.GuHa.php.hlp.h........
consensus/80% hs.hphbF..-Ls.Fls..bR.p.hs.shs.suTlKHhlEulGVPHTEl..l.VNG.ssshsp...sG-.l.VhP.b....|......s...s....RFlhDhHLGsLAphLRhhGFDThYcs.ph.D..l..lu.pc.RIlLoRDR.LL+RR.l.+Ghal+u.pP..QlbElh.Rh.Ls..h.PFpbCLpCNssLc.ls...h..plPs.sb.pappFs.CssC.RlaW.GoHap+hp.hlc.hbs......
consensus/75% hs.hphcF..ELs.FLs..bRpp.hs.shs.suTlKHhlEuLGVPHTEl.bl.VNGpssshsp...sGDbl.VhP.b....|......s...P....RFlhDhHLGsLAphLRhhGFDThY-s.ch.D.pl..lusp-.RIlLoRDR.LLKRR.l.+GhYl+u.pP..QlbElh.RhcLA..hpPFpbCLpCNssLc.ls...hhsplPs.lb.pappFspCssC.RlaW.GSHap+hp.hl-.hbs......
consensus/70% MsphpFRF..ELNsFLs..bRpc.hspshspsATlKHhlEuLGVPHTEV.hl.VNGpssshs+.l.sGDbl.VaPbb....|......sp..P....RFlhDhHLGsLApLLRhhGFDThY-s.ch.D.pl..lusp-.RIlLTRDR.LLKRR.lp+GhYl+u.cP..QlbElhpRhcLA..hpPFpbCLpCNsPLc.ls...hhsplPs.lbbpappFspCssC.RlaWcGSHacRMc.ll-.hbs......
Species abbreviations:
Asp. : Azoarcus sp.; Avin : Azotobacter vinelandii; Bamb : Burkholderia ambifaria; Bcen : Burkholderia cenocepacia; Bdol : Burkholderia dolosa; Bfun : Burkholderia fungorum; Bpse : Burkholderia pseudomallei; Bsp. : Burkholderia sp.; Mtub : Mycobacterium tuberculosis; Nfar : Nocardia farcinica; Nmul : Nitrosospira multiformis; Npun : Nostoc punctiforme; Psp. : Polaromonas sp.; Reut : Ralstonia eutropha; Rmet : Ralstonia metallidurans; Rsol : Ralstonia solanacearum; Save : Streptomyces avermitilis; Scoe : Streptomyces coelicolor; Susi : Solibacter usitatus; Tden : Thiobacillus denitrificans; Tfus : Thermobifida fusca; Tmar : Thermotoga maritima
-------------------------------------------------------------------------------------------------------------
8. Uncharacterized operon encoding a Ub-like (RnfH) family protein
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Abbreviations: c/d: aromatic cyclase/dehydrase (c/d)--82
Gis are of the Ub/RnfH protein
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13362951">13362951</a> 102 <-SmpB-c/d->Ub*-><-SmpA- Escherichia coli O157:H7 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67549235">67549235</a> 107 <-SmpB-c/d->Ub*-> Burkholderia vietnamiensis G4 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=24373049">24373049</a> 111 <-SmpB-c/d->Ub*-><-SmpA Shewanella oneidensis MR-1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9655302">9655302</a> 103 <-SmpB-c/d->Ub*-><-SmpA- Vibrio cholerae O1 biovar eltor str. N16961 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71898356">71898356</a> 84 <-SmpB-c/d->Ub*-><-SmpA- Xylella fastidiosa Ann-1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=45435745">45435745</a> 94 <-SmpB-c/d->Ub*-><-SmpA- Yersinia pestis biovar Medievalis str. 91001 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=75857320">75857320</a> 117 <-SmpB-c/d->Ub*-><-SmpA- Vibrio sp. Ex25 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46156590">46156590</a> 110 -c/d->Ub*-> Haemophilus somnus 2336 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67638926">67638926</a> 107 <-SmpB-c/d->Ub*-> Burkholderia mallei 10399 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84387679">84387679</a> 103 <-SmpB-c/d->Ub*-><-SmpA- Vibrio splendidus 12B01 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46202194">46202194</a> 97 Ub* Magnetospirillum magnetotacticum MS-1 proteobacteria>alphaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=16421233">16421233</a> 96 <-SmpB-c/d->Ub*-><-SmpA- Salmonella typhimurium LT2 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77958607">77958607</a> 94 <-SmpB-c/d->Ub*-><-SmpA- Yersinia bercovieri ATCC 43970 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=7379707">7379707</a> 92 FtsJ->FtsH->c/d->Ub* Neisseria meningitidis Z2491 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=58581648">58581648</a> 91 <-SmpB-c/d->Ub*-><-SmpA- Xanthomonas oryzae pv. oryzae KACC10331 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52627727">52627727</a> 90 -c/d->Ub*-><-SmpA- Legionella pneumophila subsp. pneumophila str proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76579340">76579340</a> 269 <-SmpB-Ub*-> Burkholderia pseudomallei 1710b proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68245723">68245723</a> 165 Ub* Magnetococcus sp. MC-1 proteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71362697">71362697</a> 157 Ub*-><-SmpA- Psychrobacter cryohalolentis K5 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71037825">71037825</a> 134 Ub*-><-SmpA- Psychrobacter arcticus 273-4 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71143975">71143975</a> 131 <-SmpB-c/d->Ub*-><-SmpA- Colwellia psychrerythraea 34H proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84717489">84717489</a> 129 <-SmpB-c/d->Ub*-> Polaromonas naphthalenivorans CJ2 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56179014">56179014</a> 125 <-SmpB-c/d->Ub*-> Idiomarina loihiensis L2TR proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68559691">68559691</a> 121 Hjlc<-SmpB-c/d->Ub*-> Ralstonia metallidurans CH34 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=17428441">17428441</a> 118 Hjlc<-SmpB-c/d->Ub*-> Ralstonia solanacearum proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83749959">83749959</a> 118 Hjlc<-SmpB-c/d->Ub*-> Ralstonia solanacearum UW551 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=37197746">37197746</a> 117 <-SmpB-c/d->Ub*-><-SmpA- Vibrio vulnificus YJ016 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67158728">67158728</a> 117 <-SmpB-c/d->Ub*-><-SmpA- Azotobacter vinelandii AvOP proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33576043">33576043</a> 116 <-SmpB-c/d->Ub*-> Bordetella bronchiseptica RB50 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67677127">67677127</a> 115 <-SmpB-c/d->Ub*-><-SmpA- Chromohalobacter salexigens DSM 3043 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74020818">74020818</a> 115 -c/d->Ub*-> Rhodoferax ferrireducens DSM 15236 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=85711977">85711977</a> 115 -c/d->Ub*-><-SmpA-SmpB-> Idiomarina baltica OS145 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68545933">68545933</a> 112 <-SmpB-c/d->Ub*-><-SmpA- Shewanella amazonensis SB2B proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=74317772">74317772</a> 112 <-SmpB-c/d->Ub*-><-SmpA- Thiobacillus denitrificans ATCC 25259 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69157448">69157448</a> 111 <-SmpB-c/d->Ub*-><-SmpA- Shewanella denitrificans OS217 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=69952904">69952904</a> 111 <-SmpB-c/d->Ub*-><-SmpA- Shewanella frigidimarina NCIMB 400 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72118961">72118961</a> 111 <-SmpB-c/d->Ub*-> Ralstonia eutropha JMP134 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48787671">48787671</a> 110 Hjlc<-SmpB-c/d->Ub*-> Burkholderia fungorum LB400 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52306665">52306665</a> 110 Ub* Mannheimiasucciniciproducens MBEL55E proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68212386">68212386</a> 110 <-SmpB-c/d->Ub*-><-SmpA- Methylobacillus flagellatus KT proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78366485">78366485</a> 110 <-SmpB-c/d->Ub*-><-SmpA- Shewanella sp. PV-4 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67907298">67907298</a> 109 <-SmpB-c/d->Ub*-><-SmpA- Polaromonas sp. JS666 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46912321">46912321</a> 108 <-SmpB-c/d->Ub*-><-SmpA- Photobacterium profundum SS9 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48861843">48861843</a> 108 -c/d->Ub*-><-SmpA- Microbulbifer degradans 2-40 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71847580">71847580</a> 108 <-SmpB-c/d->Ub*-><-SmpA- Dechloromonas aromatica RCB proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76791575">76791575</a> 108 <-SmpB-c/d->Ub*-> Pseudoalteromonas atlantica T6c proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56315277">56315277</a> 107 <-SmpB-c/d->Ub*-> Azoarcus sp. EbN1 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76874703">76874703</a> 107 <-SmpB-c/d->Ub*-><-SmpA- Pseudoalteromonas haloplanktis TAC125 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=47571785">47571785</a> 106 <-SmpB-c/d->Ub*-> Rubrivivax gelatinosus PM1 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71548925">71548925</a> 105 Cox15-><-SmpB--c/d->Ub*-> Nitrosomonas eutropha C71 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=28871646">28871646</a> 104 DC3000;<-SmpB--X->-c/d->Ub*->-X->-X-><-SmpA- Pseudomonas syringae pv. tomato str. proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=66047427">66047427</a> 104 <-SmpB--X->-c/d->Ub*-><-SmpA- Pseudomonas syringae pv. syringae B728a proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68348616">68348616</a> 104 <-SmpB--X->-c/d->Ub*-><-SmpA- Pseudomonas fluorescens Pf-5 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71558661">71558661</a> 104 <-SmpB--X->-c/d->Ub*-><-SmpA- Pseudomonas syringae pv. phaseolicola 1448A proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77380988">77380988</a> 104 <-SmpB-X->c/d->Ub*-><-SmpA- Pseudomonas fluorescens PfO-1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=30138331">30138331</a> 103 <-SmpB-c/d->Ub*-> Nitrosomonas europaea ATCC 19718 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33572188">33572188</a> 103 <-SmpB-c/d->Ub*-> Bordetella pertussis Tohama I proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=49530092">49530092</a> 103 ->Ub*-><-SmpA Acinetobacter sp. ADP1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=59712607">59712607</a> 103 <-SmpB-c/d->Ub*-><-SmpA- Vibrio fischeri ES114 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68057197">68057197</a> 102 Ub* Haemophilusinfluenzae 86-028NP proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29541871">29541871</a> 101 <-SmpB-c/d->Ub*-><-SmpA- Coxiella burnetii RSA 493 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=33149060">33149060</a> 100 -c/d->Ub*-> Haemophilus ducreyi 35000HP proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=34498917">34498917</a> 100 -c/d->Ub*-><-SmpA- Chromobacterium violaceum ATCC 12472 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10038928">10038928</a> 99 <-SmpB-Ub*-> Buchnera aphidicola str. APS (Acyrthosipho proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=12720385">12720385</a> 99 -c/d->Ub*-> Pasteurella multocida subsp. multocida str proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76883017">76883017</a> 99 <-SmpB-c/d->Ub*-><-SmpA- Nitrosococcus oceani ATCC 19707 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=87120236">87120236</a> 99 <-SmpB--X->-c/d->Ub*-><-SmpA- Marinomonas sp. MED121 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21112537">21112537</a> 98 -serinepeptidase-><-SmpB--c/d->Ub*-><-SmpA- Xanthomonas campestris pv. campestris str. ATC proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=32035426">32035426</a> 98 -c/d->Ub*-> Actinobacillus pleuropneumoniae serovar 1 str proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78035544">78035544</a> 98 -serinepeptidase-><-SmpB--c/d->Ub*-><-SmpA- Xanthomonas campestris pv. vesicatoria str proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=36786684">36786684</a> 97 <-SmpB-c/d->Ub*-><-SmpA- Photorhabdus luminescens subsp. laumondii TTO1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78701201">78701201</a> 97 <-SmpB-c/d->Ub*-><-SmpA- Alkalilimnicola ehrlichei MLHE-1 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=82702330">82702330</a> 96 -c/d->Ub*-><-SmpA- Nitrosospira multiformis ATCC 25196 proteobacteria>betaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=83644081">83644081</a> 96 <-SmpB--X->-c/d->Ub*-><-SmpA- Hahella chejuensis KCTC 2396 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21623148">21623148</a> 95 <-SmpB--Ub*-> Buchnera aphidicola str. Sg (Schizaphi proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77978427">77978427</a> 94 <-SmpB-c/d->Ub*-><-SmpA- Yersinia intermedia ATCC 29909 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84780300">84780300</a> 94 <-SmpB-c/d->Ub*-><-SmpA- Sodalis glossinidius str. 'morsitans' proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=49610307">49610307</a> 93 <-SmpB-c/d->Ub*-><-SmpA- Erwinia carotovora subsp. atroseptica SCRI1043 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77953016">77953016</a> 92 <-SmpB--X->-c/d->Ub*-><-SmpA- Marinobacter aquaeolei VT8 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78364037">78364037</a> 92 <-SmpB-c/d->Ub*-><-SmpA- Thiomicrospira crunogena XCL-2 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21107692">21107692</a> 87 -serinepeptidase-><-SmpB--c/d->Ub*-><-SmpA- Xanthomonas axonopodis pv. citri str. 306 proteobacteria>gammaproteobacteria
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=27904127">27904127</a> 86 Ub*-><-SmpA- Buchneraaphidicola str. Bp (Baizongi proteobacteria>gammaproteobacteria
-------------------------------------------------------------------------------------------------------------
9. Mobile RnfH operon (electron transport chain--9)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Gis are of the RnfH protein (marked with an asterisk)
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56312934">56312934</a> 101 rnfB->rnfC->rnfD->rnfG->rnfE->(Ub)rnfH*-> Azoarcus sp. EbN1; proteobacteria>betaproteobacteria Protein rnfH [Azoarcus sp. EbN1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71846749">71846749</a> 90 rnfB->rnfC->rnfD->rnfG->rnfE->(Ub)rnfH*-> Dechloromonas aromatica RCB; proteobacteria>betaproteobacteria Protein of unknown function UPF0125 [Dechloromonas aromatica RCB]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56552704">56552704</a> 88 rnfB->rnfC->rnfD->rnfG->rnfE->(Ub)rnfH*-> Zymomonas mobilis subsp. mobilis ZM4; proteobacteria>alphaproteobacteria hypothetical protein ZMO1808 [Zymomonas mobilis subsp. mobilis ZM4]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=53756757">53756757</a> 95 rnfB->rnfC->rnfD->rnfG->rnfE->(Ub)rnfH*-> Methylococcus capsulatus str. Bath; proteobacteria>gammaproteobacteria electron transport complex, H subunit [Methylococcus capsulatus str. Bath]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=9843879">9843879</a> 86 rnfB->rnfC->rnfD->rnfG->rnfE->(Ub)rnfH*-> Pseudomonas stutzeri; proteobacteria>gammaproteobacteria RnfH protein [Pseudomonas stutzeri]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=77389630">77389630</a> 85 rnfB->rnfC->rnfD->rnfG->rnfE->(Ub)rnfH*-> Rhodobacter sphaeroides 2.4.1; proteobacteria>alphaproteobacteria probable rnfH protein [Rhodobacter sphaeroides 2.4.1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67158346">67158346</a> 86 rnfB->rnfC->rnfD->rnfG->rnfE->(Ub)rnfH*-> Azotobacter vinelandii AvOP; proteobacteria>gammaproteobacteria Protein of unknown function UPF0125 [Azotobacter vinelandii AvOP]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=1905814">1905814</a> 85 rnfB->rnfC->rnfD->rnfG->rnfE->(Ub)rnfH*-> Rhodobacter capsulatus; proteobacteria>alphaproteobacteria RnfH protein [Rhodobacter capsulatus]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46202216">46202216</a> 84 rnfB->rnfC->rnfD->rnfG->rnfE->(Ub)rnfH*-> Magnetospirillum magnetotacticum MS-1; proteobacteria>alphaproteobacteria COG2914: Uncharacterized protein conserved in bacteria [Magnetospirillum magnetotacticum MS-1]
-------------------------------------------------------------------------------------------------------------
10. Aromatic amino acid hydroxylase; TolueneO-Xylene Monooxygenase Hydroxylase protein B
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
Gis are of the TmoB/Ub protein- marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=78693154">78693154</a> 81 TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF Bradyrhizobium sp. BTAi1 proteobacteria>alphaproteobacteria hypothetical protein BradDRAFT_6557 [Bradyrhizobium sp. BTAi1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=48094248">48094248</a> 86 TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF Pseudomonas sp. OX1 proteobacteria>gammaproteobacteria toluene o-xylene monooxygenase component [Pseudomonas stutzeri]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68556036">68556036</a> 102 4OCDC->4OCTT->TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF Ralstonia metallidurans CH34 proteobacteria>betaproteobacteria Toluene-4-monooxygenase system B [Ralstonia metallidurans CH34]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=5911739">5911739</a> 94 TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF Rhodococcus sp. AD45 actinobacteria putative isoprene monooxygenase gamma subunit [Rhodococcus sp. AD45]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=45479222">45479222</a> 84 TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF Pseudomonas mendocina proteobacteria>gammaproteobacteria gammahydroxylase [Pseudomonas mendocina]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=1754624">1754624</a> 88 TmoA->TmoB/Ub*->TmoC->TmoD Pseudomonas aeruginosa proteobacteria>gammaproteobacteria bmoB[Pseudomonas aeruginosa]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71849051">71849051</a> 88 TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF->TodX Dechloromonas aromatica RCB proteobacteria>betaproteobacteria Toluene-4-monooxygenase system B [Dechloromonas aromatica RCB]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=86565792">86565792</a> 82 TmoA->TmoB/Ub*->TmoC->TmoD->TmoE Frankia sp. CcI3 actinobacteria Toluene-4-monooxygenase system B [Frankia sp. CcI3]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=4210875">4210875</a> 88 TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF Xanthobacter autotrophicus Py2 proteobacteria>alphaproteobacteria oxygenase gamma subunit [Xanthobacter sp. Py2]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=72122837">72122837</a> 86 Note:phenol hydroxylase operon<-TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF Ralstonia eutropha JMP134 proteobacteria>betaproteobacteria Toluene-4-monooxygenase system B [Ralstonia eutropha JMP134]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=44893909">44893909</a> 86 TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF Ralstonia pickettii proteobacteria>betaproteobacteria gamma hydroxylase subunit [Ralstonia pickettii]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2150114">2150114</a> 89 4OCDC->TmoA->TmoB/Ub*->TmoC->TmoD->TmoE->TmoF Burkholderia cepacia proteobacteria>betaproteobacteria TbhB[Burkholderia cepacia]
Abbreviations:
TmoA: Toluene-4-monooxygenase hydroxylase; Ferritin-like
TmoD: hydroxylase/monooxygenase regulatory protein; Ferritin-like
TmoE: Toluene-4-monooxygenase hydroxylase
TmoB: Ubiquitin fold
TmoC: Rieske 2Fe-S protein
TmoF: NADH-ferredoxin oxidoreductase
4OCDC: 4-oxalocrotonate decarboxylase
4OCTT: 4-oxalocrotonate tautomerase
TodX: Aromatic amino acid transporter, Porin like beta-barrel
* Note The ribonucleotide large and small subunits also correspond to the TmoA/D pair
-------------------------------------------------------------------------------------------------------------
11. YukD-like proteins
Abbreviations:
YukD: YukD like ubiquitin
S/TK: serine/threonine kinase;
gis are of the YukD-like Ub protein protein- marked with an asterisk
GI LENGTH Operon ORGANISM Classification Protein descriptions (if any)
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15026816">15026816</a> 90 <-FtsK<-S/TK<-yukD*<-?<-ESAT-6 Clostridium acetobutylicum ATCC 824 firmicutes AE007866_9 Hypothetical protein [Clostridium acetobutylicum ATCC 824]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=15022859">15022859</a> 81 yukD*->FtsK->ESAT-6-> Clostridium acetobutylicum ATCC 824 firmicutes E007517_5 Hypothetical protein [Clostridium acetobutylicum ATCC 824]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=52004898">52004898</a> 79 <-Mem_prot<-FtsK<-S/TK<-yukD*<-ESAT-6 Bacillus licheniformis ATCC 14580 firmicutes conserved protein YukD [Bacillus licheniformis ATCC 14580]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2635685">2635685</a> 79 <-Mem_prot<-FtsK<-FtsK<-S/TK<-yukD*<-ESAT-6 Bacillus subtilis subsp. subtilis str. 168 firmicutes yukD [Bacillus subtilis subsp. subtilis str. 168]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=56908701">56908701</a> 79 ESAT-6->yukD*->S/TK->FtsK->Mem_prot-> Bacillus clausii KSM-K16 firmicutes conserved hypothetical protein [Bacillus clausii KSM-K16]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=10173588">10173588</a> 80 <-Mem_prot||ESAT-6->yukD*->S/TK->FtsK->?->?->transp-> Bacillus halodurans C-125 firmicutes BH0973 [Bacillus halodurans C-125]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=67875114">67875114</a> 82 yukD* Clostridium thermocellum ATCC 27405 firmicutes hypothetical protein CtheDRAFT_2497 [Clostridium thermocellum ATCC 27405]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76563722">76563722</a> 80 <-FtsK<-S/TK<-yukD*<-?<-Mem_prot<-ESAT-6<-ESAT-6 Streptococcus agalactiae A909 firmicutes conserved hypothetical protein [Streptococcus agalactiae A909]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=88194066">88194066</a> 93 ESAT-6->Mem_prot->?->yukD*->S/TK->FtsK-> Staphylococcus aureus subsp. aureus NCTC 8325 firmicutes hypothetical protein SAOUHSC_00260 [Staphylococcus aureus subsp. aureus NCTC 8325]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=49482522">49482522</a> 80 ESAT-6->Mem_prot->?->yukD*->S/TK->FtsK->?->?->transp-> Staphylococcus aureus subsp. aureus MRSA252 firmicutes hypothetical protein SAR0282 [Staphylococcus aureus subsp. aureus MRSA252]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=22776996">22776996</a> 84 ESAT-6->Mem_prot->?->yukD*->S/TK->FtsK->?->?->transp-> Oceanobacillus iheyensis HTE831 firmicutes hypothetical conserved protein [Oceanobacillus iheyensis HTE831]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=16412473">16412473</a> 83 ESAT-6->Mem_prot->?->yukD*->S/TK->FtsK-> Listeria innocua firmicutes lin0052 [Listeria innocua]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=46906292">46906292</a> 83 ESAT-6->Mem_prot->?->yukD*->S/TK->FtsK-> Listeria monocytogenes str. 4b F2365 firmicutes hypothetical protein LMOf2365_0070 [Listeria monocytogenes str. 4b F2365]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=89203070">89203070</a> 83 <-FtsK<-S/TK<-yukD*<-?<-Mem_prot<-ESAT-6 Bacillus cereus subsp. cytotoxis NVH 391-98 firmicutes conserved hypothetical protein [Bacillus cereus subsp. cytotoxis NVH 391-98]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=49329053">49329053</a> 83 ESAT-6->Mem_prot->?->yukD*->S/TK->FtsK-> Bacillus thuringiensis serovar konkukian str. 97-27 firmicutes conserved hypothetical protein [Bacillus thuringiensis serovar konkukian str. 97-27]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13093361">13093361</a> 503 <-FtsK<-?<-subtilisin<-Ub+12xTM*<-?<-FtsK<-memb_associated Mycobacterium leprae actinobacteria probable membrane protein [Mycobacterium leprae]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=41407608">41407608</a> 503 memb_associated->FtsK-><-?||?->PPE_family->PPE_family->PE_family->ESAT-6->?->Ub+12xTM*->subtilisin->?->FtsK->PE_family->PPE_family->PPE_family->?->PPE_family->PPE_family-> Mycobacterium avium subsp. paratuberculosis K-10 actinobacteria hypothetical protein MAP1510 [Mycobacterium avium subsp. paratuberculosis K-10]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13881491">13881491</a> 503 PPE_family->PE_family->PPE_family->?->PPE_family-><-?||PE_family->ESAT-6->?->Ub+12xTM*->subtilisin->?->FtsK->?->PPE_family-><-?||PPE_family->PPE_family-> Mycobacterium tuberculosis CDC1551 actinobacteria hypothetical protein MT1844 [Mycobacterium tuberculosis CDC1551]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=31618574">31618574</a> 503 PPE_family->PE_family->PPE_family->PPE_family->PE_family->ESAT-6->ESAT-6->?->Ub+12xTM*->subtilisin->?->FtsK->?->PPE_family->PPE_family->PPE_family-><-?<-PE_family Mycobacterium bovis AF2122/97 actinobacteria CONSERVED HYPOTHETICAL MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=76784314">76784314</a> 481 PPE_family->PE_family->PPE_family->PPE_family->PE_family->ESAT-6->ESAT-6->?->Ub+12xTM*->subtilisin->?->FtsK->?->PPE_family->PPE_family->PPE_family-><-?<-?||PE_family-> Mycobacterium tuberculosis F11 actinobacteria hypothetical protein MtubF_01001866 [Mycobacterium tuberculosis F11]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=41406262">41406262</a> 509 PE_family->?-><-?||?->?->?->FtsK->Ub+12xTM*->subtilisin->?->FtsK-><-?<-?||?-><-FtsK Mycobacterium avium subsp. paratuberculosis K-10 actinobacteria hypothetical protein MAP0164 [Mycobacterium avium subsp. paratuberculosis K-10]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=1944601">1944601</a> 509 <-subtilisin<-FtsK<-?<-subtilisin<-?*<-FtsK<-?<-?<-?<-?<-PE_family<-FtsK<-memb_associated Mycobacterium tuberculosis H37Rv actinobacteria PROBABLE CONSERVED TRANSMEMBRANE PROTEIN [Mycobacterium tuberculosis H37Rv]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=31620222">31620222</a> 467 <-ESAT-6<-ESAT-6<-?<-FtsK||Ub+12xTM*->subtilisin-><-memb_associated||cutinase->cutinase-> Mycobacterium bovis AF2122/97 actinobacteria PROBABLE CONSERVED INTEGRAL MEMBRANE PROTEIN [Mycobacterium bovis AF2122/97]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13883386">13883386</a> 467 <-ESAT-6<-ESAT-6<-?<-FtsK||Ub+12xTM*->subtilisin-><-memb_associated||cutinase->cutinase-> Mycobacterium tuberculosis CDC1551 actinobacteria hypothetical protein MT3554 [Mycobacterium tuberculosis CDC1551]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=41410338">41410338</a> 452 <-cutinase<-cutinase||memb_associated-><-subtilisin<-Ub+12xTM*||FtsK->?->ESAT-6->ESAT-6-> Mycobacterium avium subsp. paratuberculosis K-10 actinobacteria hypothetical protein MAP4240c [Mycobacterium avium subsp. paratuberculosis K-10]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=92916372">92916372</a> 475 <-ESAT-6<-ESAT-6<-?<-FtsK||Ub+12xTM*->subtilisin-><-memb_associated Mycobacterium sp. KMS actinobacteria conserved hypothetical protein [Mycobacterium sp. KMS]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=92911534">92911534</a> 475 <-ESAT-6<-ESAT-6<-?<-FtsK||Ub+12xTM*->subtilisin-><-memb_associated||?-><-?||?->cutinase->cutinase-> Mycobacterium sp. JLS actinobacteria conserved hypothetical protein [Mycobacterium sp. JLS]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=89338189">89338189</a> 434 <-ESAT-6<-ESAT-6<-?<-FtsK||Ub+12xTM*->subtilisin-><-memb_associated<-?||?->cutinase->cutinase->cutinase-> Mycobacterium flavescens PYR-GCK actinobacteria conserved hypothetical protein [Mycobacterium flavescens PYR-GCK]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=90203437">90203437</a> 447 Ub+12xTM*->subtilisin-><-memb_associated<-?||?->cutinase->cutinase->cutinase-> Mycobacterium vanbaalenii PYR-1 actinobacteria conserved hypothetical protein [Mycobacterium vanbaalenii PYR-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=92917561">92917561</a> 472 FtsK->memb_associated->FtsK->PE_family->PPE_family->ESAT-6->ESAT-6-><-?<-subtilisin<-Ub+12xTM*<-FtsK<-DNA_binding Mycobacterium sp. KMS actinobacteria Protein of unknown function DUF571 [Mycobacterium sp. KMS]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13093791">13093791</a> 485 <-subtilisin<-Ub+12xTM*<-DNA_binding<-ESAT-6<-ESAT-6<-PE_family<-FtsK<-memb_associated<-FtsK Mycobacterium leprae actinobacteria conserved membrane protein [Mycobacterium leprae]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=31617055">31617055</a> 472 FtsK->memb_associated->FtsK->PE_family->PPE_family->ESAT-6->ESAT-6->DNA_binding->Ub+12xTM*->subtilisin-> Mycobacterium bovis AF2122/97 actinobacteria PROBABLE CONSERVED TRANSMEMBRANE PROTEIN [Mycobacterium bovis AF2122/97]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13879797">13879797</a> 472 FtsK->memb_associated->FtsK->PE_family->PPE_family->ESAT-6->ESAT-6->DNA_binding->Ub+12xTM*->subtilisin-> Mycobacterium tuberculosis CDC1551 actinobacteria hypothetical protein MT0303 [Mycobacterium tuberculosis CDC1551]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=41409884">41409884</a> 480 FtsK->memb_associated->FtsK->PE_family->PPE_family->ESAT-6->ESAT-6->DNA_binding->Ub+12xTM*->subtilisin-> Mycobacterium avium subsp. paratuberculosis K-10 actinobacteria hypothetical protein MAP3786 [Mycobacterium avium subsp. paratuberculosis K-10]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=92910002">92910002</a> 476 <-subtilisin<-Ub+12xTM*<-DNA_binding<-ESAT-6<-ESAT-6<-PPE_family<-PE_family<-FtsK<-memb_associated<-FtsK Mycobacterium sp. JLS actinobacteria Protein of unknown function DUF571 [Mycobacterium sp. JLS]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=92915201">92915201</a> 476 <-subtilisin<-Ub+12xTM*<-DNA_binding<-ESAT-6<-ESAT-6<-PPE_family<-PE_family<-FtsK<-memb_associated<-FtsK Mycobacterium sp. KMS actinobacteria Protein of unknown function DUF571 [Mycobacterium sp. KMS]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=89343513">89343513</a> 532 <-subtilisin<-Ub+12xTM*<-DNA_binding<-ESAT-6<-ESAT-6<-PPE_family Mycobacterium flavescens PYR-GCK actinobacteria Protein of unknown function DUF571 [Mycobacterium flavescens PYR-GCK]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=90205295">90205295</a> 533 <-subtilisin<-Ub+12xTM*<-DNA_binding<-ESAT-6<-ESAT-6||?-><-PPE_family<-PE_family<-FtsK<-memb_associated<-FtsK Mycobacterium vanbaalenii PYR-1 actinobacteria Protein of unknown function DUF571 [Mycobacterium vanbaalenii PYR-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=92917997">92917997</a> 505 <-subtilisin<-Ub+12xTM*<-FtsK<-memb_associated<-?<-ESAT-6<-?<-PPE_family<-PE_family<-FtsK Mycobacterium sp. KMS actinobacteria Protein of unknown function DUF571 [Mycobacterium sp. KMS]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=89338337">89338337</a> 473 <-subtilisin<-Ub+12xTM*<-FtsK<-ESAT-6<-ESAT-6<-PPE_family<-PE_family<-FtsK<-memb_associated<-FtsK Mycobacterium flavescens PYR-GCK actinobacteria hypothetical protein MflvDRAFT_5459 [Mycobacterium flavescens PYR-GCK]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=13092444">13092444</a> 512 subtilisin->?->?-><-Ub+12xTM*<-FtsK<-ESAT-6<-ESAT-6<-PPE_family<-FtsK<-FtsK<-memb_associated<-FtsK Mycobacterium leprae actinobacteria putative membrane protein [Mycobacterium leprae]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2370277">2370277</a> 480 subtilisin->?->?->?->?-><-Ub+12xTM*<-FtsK<-ESAT-6<-ESAT-6<-PPE_family<-FtsK<-FtsK<-memb_associated<-FtsK Mycobacterium leprae actinobacteria hypothetical protein [Mycobacterium leprae]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=90202132">90202132</a> 508 ESAT-6->?->?->?->?-><-?<-?<-Ub+12xTM*<-FtsK<-ESAT-6<-?<-?<-PE_family<-FtsK<-FtsK<-memb_associated<-FtsK Mycobacterium vanbaalenii PYR-1 actinobacteria Protein of unknown function DUF571 [Mycobacterium vanbaalenii PYR-1]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=89340379">89340379</a> 549 FtsK->memb_associated->FtsK->FtsK->PE_family->PPE_family->ESAT-6->ESAT-6->FtsK->Ub+12xTM*->?->?-><-?<-?<-?<-?<-?<-?<-subtilisin Mycobacterium flavescens PYR-GCK actinobacteria Protein of unknown function DUF571 [Mycobacterium flavescens PYR-GCK]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=92915077">92915077</a> 509 FtsK->memb_associated->FtsK->FtsK->PE_family->PPE_family->ESAT-6->ESAT-6->FtsK->Ub+12xTM*->?-><-?<-?<-?<-subtilisin Mycobacterium sp. KMS actinobacteria Protein of unknown function DUF571 [Mycobacterium sp. KMS]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=92909344">92909344</a> 509 FtsK->memb_associated->FtsK->FtsK->PE_family->PPE_family->ESAT-6->ESAT-6->FtsK->Ub+12xTM*->?-><-?<-?<-?<-subtilisin Mycobacterium sp. JLS actinobacteria Protein of unknown function DUF571 [Mycobacterium sp. JLS]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=2960229">2960229</a> 511 FtsK->memb_associated->FtsK->FtsK->PE_family->PPE_family->ESAT-6->ESAT-6->FtsK->?*->?-><-?<-?<-?<-?<-subtilisin<-FtsK<-?<-subtilisin Mycobacterium tuberculosis H37Rv actinobacteria PROBABLE CONSERVED TRANSMEMBRANE PROTEIN [Mycobacterium tuberculosis H37Rv]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=81252663">81252663</a> 487 FtsK->memb_associated->FtsK->FtsK->PE_family->PPE_family->ESAT-6->ESAT-6->FtsK->Ub+12xTM*->?-><-?<-?<-?<-?<-?<-subtilisin<-FtsK Mycobacterium tuberculosis C actinobacteria COG0477: Permeases of the major facilitator superfamily [Mycobacterium tuberculosis C]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=54014302">54014302</a> 493 DNA_binding->ESAT-6->ESAT-6-><-?||memb_associated-><-FtsK||Ub+12xTM*->subtilisin->FtsK->?-><-FtsK Nocardia farcinica IFM 10152 actinobacteria hypothetical protein [Nocardia farcinica IFM 10152]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=54014325">54014325</a> 488 memb_associated-><-?<-subtilisin<-Ub+12xTM*||FtsK->?-><-ESAT-6<-ESAT-6 Nocardia farcinica IFM 10152 actinobacteria hypothetical protein [Nocardia farcinica IFM 10152]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=68264440">68264440</a> 383 <-ESAT-6<-ESAT-6<-?<-FtsK||Ub+12xTM*->?-><-memb_associated Corynebacterium jeikeium K411 actinobacteria putative membrane protein [Corynebacterium jeikeium K411]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=84494284">84494284</a> 443 <-FtsK||?-><-?<-?<-?<-ESAT-6<-ESAT-6||Ub+12xTM*-> Janibacter sp. HTCC2649 actinobacteria putative integral membrane protein [Janibacter sp. HTCC2649]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29831983">29831983</a> 451 Ub+12xTM*->?->FtsK-> Streptomyces avermitilis MA-4680 actinobacteria hypothetical protein SAV5440 [Streptomyces avermitilis MA-4680]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=71369935">71369935</a> 451 <-Ub+12xTM*<-ESAT-6<-ESAT-6 Nocardioides sp. JS614 actinobacteria hypothetical protein NocaDRAFT_4675 [Nocardioides sp. JS614]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=21224082">21224082</a> 491 ESAT-6->?->?-><-?<-?<-?<-?<-?<-?<-FtsK||Ub+12xTM*-> Streptomyces coelicolor A3(2) actinobacteria integral membrane protein [Streptomyces coelicolor A3(2)]
<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=29829069">29829069</a> 502 <-Ub+12xTM*||FtsK->?->?-><-?<-ESAT-6<-ESAT-6<-?<-ESAT-6 Streptomyces avermitilis MA-4680 actinobacteria hypothetical protein SAV2527 [Streptomyces avermitilis MA-4680]
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
II. Comprehensive alignments of different protein families described in the study
1. ThiS/MoaD/Ubiquitin
FINAL ---EEEE------------------E-----HHHHHHHHHH----------------------EEEEEE-----------------EE------EEEEEEEE---
ALIGN -----EEE---------------EEE-------HHHHHHHH----------------------HHEEEE-----------------EE-------EEEEE-----
HMM ---EEEEE---------------EEEE------HHHHHHHH----------------------EEEEEEE----------------EEE-----EEEEEEE----
FREQ ---EEEE-----------------------HHHHHHHHHHH----------------------EEEEEE----E-------------E-------EEEEEEE---
PSSM ---------------------------------HHHHHHHH----------------------EEEEE--------------------------EEEEEEE----
FINAL --EEE-------------------EEE----HHHHHHHHHH----------------------EEEEEE-----EE------E--EEEE----EEEEEEEE---\ThiS
2633522 Bacillus_subtilis_subsp_subtilis_str_168 -MLQLNG---------------KDVKWKKDTGTIQDLLASYQLE-----------------NKIVIVERN--KEIIGKERYHE--VELCDRD-VIEIVHFVGGG|
67939265 Chlorobium_phaeobacteroides_BS1 ITITLNG----------------QQREIQEGSTVEDILSIIGAE-----------------KQRVAVVVN--ENIVYPEKRGS--VLLREKD-QVEVLSFVAGG|
13879933 Mycobacterium_tuberculosis_CDC1551 MIVVVNE----------------QQVEVDEQTTIAALLDSLGFG-----------------DRGIAVALN--FSVLPRSDWATKICELRKPV-RLEVVTAVQGG|
29609756 Streptomyces_avermitilis_MA_4680 MNISVNG----------------ERRRIAPGTALDTLVKTLTAA-----------------PSGVAAALN--ETVVPRAQWSS--TALSEGD-RVEVLTAVQGG|
57865488 Staphylococcus_epidermidis_RP62A MKCIING----------------DLFTFDQNQSIQEVLHSLELD-----------------PKRVIVELN--KELIKQDKYEE--YTVREDD-RLELLEIVGGG|
56909742 Bacillus_clausii_KSM_K16 MRLVVNG----------------EERIS-ESTTLSELVSEFGLA-----------------SQLVVAEVN--GTIIDRVDWEA--TSLSEGM-KIELVHFVGGG|
17130691 Nostoc_sp_PCC_7120 ITLQVNG----------------ETHNCSSPTPLPDLLQQLGFN-----------------PRLVAVEYN--GEILHRQFWEQ--TQVQSGD-RLEVVTIVGGG|
30138190 Nitrosomonas_europaea_ATCC_19718 MQLIING----------------QQQSYDGPMNVQQLVEKLSLQ-----------------NKRFAIERN--GEIIPRSRFPE--LLLNEGD-QLEIIVAVGGG/
FINAL -HHHHHHHHHHHHHH--------EEEE---HHHHHHHHHHHHHH--HHHHHHHH-------HHHHHHH-------------------------EEEEE------\MoaB
52696120 Pyrococcus_furiosus VKVKVKYFARFRQLAG----VDEEEIELPEGARVRDLIEEIKKRHEKFKEEVFGEGYDE--DADVNIAVN--GRYVSWD------EELKDGD-VVGVFPPVSGG|
10640172 Thermoplasma_acidophilum -MVTVRYYATLRPI------TKKKEETFNGISKISELLERLKVEYGSEFTKQMYDGNNL--FKNVIILVN--GNNITSMKGLD--TEIKDDD-KIDLFPPVAGG|
19915596 Methanosarcina_acetivorans_C2A MKIHVKFLATIREITG----KPEIELEILPGDTVGTALQALQARYGPEFKEATTGTTAGG-IPKVRFLVN--GRNTDFLDGFE--TELKAGD-VMVFVPPVAGG|
11499216 Archaeoglobus_fulgidus_DSM_4304 -MVRVKLFANFRE-------AAGVKEVEVEAGTVGEVLQELVRRFPKLESLFYEEGRL---RDYVNIMVN--GRNVRGDLN----YPLSHTD-EVAIFPPVSGG|
75855800 Vibrio_sp_Ex25 -MIKVLFFAQTRELI-----GIDSVELDDQFETVEAIRAHLVEEGADKNGKWDLALE----PGKLLAAVN--QSIVPLD------TEVKAGD-EVAFFPPVTGG|
26107155 Escherichia_coli_CFT073 RMINVLFFAQVRELV-----GTDATEVAADFPTVEALRQHLAAQSDRWALALE--------DGKLLAAVN--QTLVSFD------HSLTDGD-EVAFFPPVTGG|
28868459 Pseudomonas_syringae_pv_tomato_str_DC3000 MKIEVQYFARYRETL-----GIDSESVEGEFVTLEVLRQHLLQRGEAWQVLA---------EQNLMCARN--QELCKLD------EPLLDGD-EVAFFPPVTGG|
4262375 Mus_musculus CQIDVLYFAKSAEIAG----VRSETISVPQEIKASELWKELEMLHPGLADV----------RNQVIFAVR--QEYVELGDQQ---LLLQPGD-EVAIIPPISGG|
30681325 Arabidopsis_thaliana VEIKVLLFARARELTG----VPDLTLKMPSGSTTQKCLDELVLKFPSLEEV----------RSCVVLALN--EEYTTDS------AIVQHRD-ELAIIPPISGG/
FINAL -EEEEEE----------------EEEEE------HHHHHHHHHH--------------------EEEE------EEE---HHHH-HHHH-----EEEE------\Urm1
40889046 Mus_musculus VSFKITLTSDP---------RLPYKVLSVPESTPFTAVLKFAAEEFKVP------------AATSAIITND-GIGINPAQTAGN-VFLKHGS-ELRIIPRDRVG|
71074940 Giardia_lamblia_ATCC_50 IQVKIYKGFDP---------FYTYHVFNIPEASSTEKVIRLAARAFEIP------------QLEAVLINST-GDAIVPCQTILD-TCRRFGT-TLTVAHLKPII|
68352771 Theileria_parva VTFKIVLASDA---------NQPYKVLSVPEQAPFSAVIKFAAEEFRLN------------PATCAIITND-GVGINPTQTAGG-VFLKYGS-NLRLIPRDRVG|
15217447 Arabidopsis_thaliana VSFKVTLTSDP---------KLPFKVFSVPEGAPFTAVLKFAAEEFKVP------------PQTSAIITND-GIGINPQQSAGN-VFLKHGS-ELRLIPRDRVG|
72005426 Strongylocentrotus_purp VTFKITLTSDP---------KLPFKVLSVPESTPFTAVLKFAAEEFRVP------------AATSAIITND-GIGINPAQSAGN-VFLKHGS-ELRLIPRDRVG|
56112391 Chlamydomonas_incerta VTFKVTLTSDP---------KLPFRVFSVPEEAPFTAVLKFAAEEFKVP------------AQTSAIITND-GVGINPQQTAGN-VFLKHGS-ELRLIPRDRVG|
289769 Caenorhabditis_elegans VTFKITLTSDP---------KLPFKVLSVPESTPFTAVLKFAAEEFKVP------------AATSAIITND-GVGVNPAQPAGN-IFLKHGS-ELRLIPRDRVG/
FINAL --EEEEEE------------EEEEEEE----HHHHHHHHHHHHHH----------------------EEEE-EE--------------------EEEE------\RnfH
56312934 Azoarcus_sp_EbN1 MKIGVAYSEPSH--------QVWLNLEVPDGTTVGAAIERSGILAQFPHID----------LTVQKVGVF--AKVVKLD------TPLRHGD-RVEIYRPITCD|
77389630 Rhodobacter_sphaeroides_241 MIVGVAYAKPTV--------QVWKHVDVPEGTSAREAIERSGLLAQFPEID----------LAVNKVGIF--GAICPLD------RTLAEGD-RVEIYRPIHPE|
66047427 Pseudomonas_syringae_pv_syringae_B728a IQIEVVYASVQR--------QVLKTVDVPTGSSVRQALALSGIDKEFPELD----------LSQCAVGIF--GKVVTDPAA----RVLEAGE-RIEIYRLLVAD|
67549235 Burkholderia_vietnamiensis_G4 LSIEVCYALPDR--------QTLIPVSLPEGATVRAAIDASGVLALHPEID----------LAQAKTGVF--GKLAPLD------APLADHD-RVEIYRPLIVD|
68245723 Magnetococcus_sp_MC_1 MRVAVTYAQPNR--------QLLLEFEVPEGTTAQQAVERSGILSKFPDIN----------LAEQKLGIY--AKLVEND------QVLEEGD-RVEIYRPAKGK|
71846749 Dechloromonas_aromatica_RCB MQIGVAYSEPSQ--------QIWLNIEVPDESSVKEAIERSGILKQFPHID----------LSTQKVGVF--GRLVKLD------AALKPGD-RIEIYRGIIAD|
59712607 Vibrio_fischeri_ES114 IHVEVVYALPTE--------QVVFKLAVKAEQTVEEIIVQSGVLERYPEID----------LKVNKVGVF--SRNVKLD------STIRDKD-RIEIYRPLLAD/
FINAL --EEEEE-----------------EEE-------HHHEEE----------------------EEEEEEE----EE------------------EEEEEE-----\TGS
5107656 Escherichia_coli -MPVITL-------------PDGSQRHYDHAVSPMDVALDIGPGLA---------------KACIAGRVN--GELVDAC------DLIENDA-QLSIITAKDEE|
730881 Saccharomyces_cerevisiae VPLKIVLK------------DGAVKEATSWETTPMDIAKGISKSLA---------------DRLCISKVN--GQLWDLD------RPFEGEA-NEEIKLELLDF|
135177 Homo_sapiens KPIKVTLP------------DGKQVDAESWKTTPYQIACGISQGLA---------------DNTVIAKVN--NVVWDLD------RPLEEDC-TLELLKFEDEE|
2983390 Aquifex_aeolicus_VF5 EEVFVFTP------------KG-DLVVLPKGSTPVDLAYKIHTEVG---------------NHCAGAKSN--GRIVPLN------YELKSGD-VVEIITNPNKS|
1710082 Shigella_flexneri DRVYVFTP------------KG-DVVDLPAGSTPLDFAYHIHSDVG---------------HRCIGAKIG--GRIVPFT------YQLQMGD-QIEIITQKQPN|
416555 Drosophila_melanogaster RLQRIYTKPKGQLPD-----YNSPVVLHNERTSIEDFCNKLHRSIAKEFKYAL--------VWGSSVKHQ--PQKVGIE------HVLNDED-VVQIVKKV---|
2120160 Methanocaldococcus_jannaschii GFIKIYLKPQGKKPD-----FDEPLIMR-RGATVKDVCEKLHKDFVRNFRYAQ--------VWGKSAKHP--GQRVGLD------HKLEDGD-ILTIVIKR---/
FINAL ---EEEEHHHHHH---------EEEE------HHHHHHHH----------------------EEEEEEE------------------------EEEEE------\DUF82 fusion
71915653 Thermobifida_fusca_YX ASITLRFDPTLRPLLAPRNRTDLLHVNHDPAASLSHVVESLGVPL----------------TEIGELRIN--GTTASPS------QHPQPGD-LIEVLTVPKPQ|
20520977 Streptomyces_coelicolor_A32 PEIHVEFAPELHLFVPRARPTGVASAATDGVSTLGHLVESLGVPL----------------TEVGALLVD--GREVPPG------HIPAGGE-SVRVRPVRHPQ|
54016307 Nocardia_farcinica_IFM_10152 SGIELRLYAELNDFLPPQDRQDALWRPVRPHQTVKDIVEAAGVPH----------------TEIDLLLVN--GESVGFE------HHPRPGD-RLAAYPMFESL|
4981308 Thermotoga_maritima_MSB8 KIAFFRFFGRLNDFFRN--SERIKTHRFTGFQTVKDRIEALGVPH----------------VEVSLITLN--GKPVGFD------HMVEDGE-LFFVYPEFQNI|
76785598 Mycobacterium_tuberculosis_F11 GYVDVRAYAELNEFVELQARGLTVRRPFRSHQTVKDVLEAMGIPH----------------TEVDLILVN--GDPADFS------YRPVAGD-RIAAYPMFEAL|
68554875 Ralstonia_metallidurans_CH34 VTATFRFYEELNDFLAPAQRRRDLSCPCARAATVKHMIEALGVPH----------------TEVELILVN--GESSPFE------RIVCDGD-RIAVYPKFESF|
67666690 Burkholderia_cenocepacia_HI2424 ATASLRVVVELNAFLASQQRDRAFAHACARDATVKHAIEALGVPH----------------TEIGRLYVN--DAPAALD------RPLDDGD-RVEVLPERAGP/
FINAL -HHHHHHEEEEEEE---------E---------HHHHHHHHH------------------------EEEE-----------------E---E-EEEEEEEEEE-\ub-like
71847777 Dechloromonas_aromatica RCB MTIAVNEIRRVFRY------NGVQLPD-VPGMEPKEVRDLYSAQY----------------PELISAEIE--A------------GDVVNGV-QEYTFRKAVGT|
67908730 Polaromonas_sp_JS666 ILVSTTVLKRVFMS------NGNPLTDPDPSMSPAAVKDFWSAMY----------------PELLNAEVQ--G------------PVSKDGE-LTYTFHRTTGT|
84357757 Burkholderia_cenocepacia_PC184 --MEIETLAREFSY------NGAKLADPAPTFTLQQIRDFYSQTY----------------PELTNAEIE--G------------PVIKGNR-NVYTFRRAVGT|
17428677 Ralstonia_solanacearum --MQTIQLTREFRY------NGVRLADPSPQFTLEQVRDFYANTY----------------PEILNADID--G------------PSVEGTL-QVYGFRRAVGR|
29339958 Bacteroides_thetaiotaomicron_VPI_5482 MALDIKGLKRVFILKKGN--DTLTLEDPDSRMSLSEVTDFYSMNY----------------PELTTATLH--G------------PELEEDR-AIYRFKTTIGT|
71839548 Pelobacter_propionicus_DSM_2379 --MQITTLTRTFKY------NGATLRDPDPKQTPEQVKEFYSMAY----------------PELTTAVVE--G------------PEENNGQ-LQYSFRKGAGT|
38637971 Cupriavidus_necator MALEIKKLLRQFSY------NGMSFVDPGPAFTPEQVRDIYSAQY----------------PELTTASVD--G------------PEVKGEV-ASFTFVRAAGA|
84717440 Polaromonas_naphthalenivorans CJ2 MALIAKTISRTFKF------NGMTLADPSPEMDMETVKRFYANQY----------------PELLNSVVE--G------------PVTKGTV-STYTFIRAVGA/
FINAL ---EEEEE--H---------HHH-EEEEEEE--HHHHHHHHHHHHHHHHHHHH--------EEEEE-------HHHHHHHH------------EEEEEEEE---\TAPI
76556246 Phage_BP_4795 PLARICLHGDL---------QRFGRRLSLYVNTAAEAIRALSLQVPGFRRQMNEGW-----YQIRIAGDDT-APEAVYARLH---EPLGEGT-VIHIVPRLAGA|
215124 bacteriophage_lambda GMARICLYGDL---------QRFGRRIDLRVKTGAEAIRALATQLPAFRQKLSDGW-----YQVRIAGRDV-STSGLTAQLH---ETLPDGA-VIHIVPRVAGA|
11877308 Neisseria_meningitidis_phage_2120 -MITVCLYGGL---------REYGRRFVLHVETPAEALHALFTQIKGLRQRIRDGV-----YQVRFDGKDQ-SEETIGSV-----FRRPADG-VLHIVPRVQGA|
45686326 Enterobacteria_phage_T1 DVKVIKLSGSLG--------RRFGVFHRYAVDSYPEAIRALSSQVDGFKEYMQSEVGSRSKFAIFVDGVNV-GHHEE--------EKFKCAK-EIRIVPIPTGS|
71834086 Bacteriophage_JK06 NVIDVKLGLGLG--------RKFGKLHKLCVKTVPEAMRALSVNIPEFKEFMRSHVGQNTRFAVFVDGKNV-NEHKI--------NDLETVS-EIRIMPIPQGR|
46402106 Bacteriophage_phiKO2 VMTRIELSGILG--------KKFGAYHERLVSTTSEGIQALCCTIDGFEKFLNNSKEKGLTFAIFKGKKNI-GKDDL--------GFPVNGD-VIRIVPVIIGS|
9634139 Enterobacteria_phage_HK022 VMTRIELSGVLA--------KTYGRVHHRLVRTTAEAINALAKTINGFEKFLNTSKARGLTYAVYRDKKNI-GVDDL--------GFPVTGE-VIRIVPVVIGS|
17975181 Bacteriophage_phiE125 TFRTIRLYGVLG--------GRFGRVHRLAVSSTAKAVRALSVLIPGFRAFLTSARDGGLTFAVFNGRRNL-GEDEL--------EHPVGRD-EIRIAPVIVGS|
77864688 Burkholderia_cepacia_phage_Bcep176 KLREVRLYGIAG--------TRFGRVHRLAVSSTAEAVRALSVLLPGFRKFLLEARDNGLTFAVFNGRRNL-SQDDL--------TAPVGDE-AIRIAPVIIGS/
FINAL --EEEEEE--------------EEEEEEE--HHHHHHHHHH-----------------------EEEEE----EEE----EE---EEE-----EEEEEEE----\TAPI+protein J
85716602 Nitrobacter_sp_Nb_311A PAATVSVYGTTHPLNAVA--GARIHCRVPAGWSITEILGEALSHKPGWHR-----------RRDLIVRIN--DHIIPEENWSR--VRVKQGA-TVTFIPRLQDG|
66392071 Xanthomonas_campestris_pv_pelargonii_phag THQVIVSPHPVVVDD-----QKNLILAFKQGESLFEILSRSVDNFE---------------EREWVVTIN--GRRVPVEMWTK--AFPKPGH-IIEVR--GNVG|
33568295 Bordetella_bronchiseptica_RB50 MPALMVVHNPFVASEG----RKAYCAAFLPGETLGRYCERMGVALP---------------SRVVNVWHN--GRPVPLALWQR--LIPRQGD-QVVIRAKGEGG|
46449977 Desulfovibrio_vulgaris_subsp_vulgaris_str KADVVSVTGCPHPFRP----GDRVHDVVPVGGTLESIVVRGLDDMGVPEAL----------RGCGHAFVD--GEYVPRDRWAD--VTPRAGS-TVTYRLVPAGG|
67545284 Burkholderia_vietnamiensis_G4 QSAVVLLRNPFQP-------SQREVMVAHPTQTIRQWLGAQGIAEF---------------DQPTVCIKN--DAPVLRTDWAV--T-PIDG--VVLFITLPQGG|
23015894 Magnetospirillum_magnetotacticum_MS_1 TASVIIIANPFEPV------ASRSVHAIVAGVTVGELLLDCGIDPDRW-------------ADGPEIRIN--GNVVAAEIFAV--RVIGEDE-IISIIRWPLGG|
78033450 Magnetospirillum_gryphiswaldense TASIVIVTNPFEPV------ASRSVHAVESGITLGGLLQACGIAEDCW-------------SDGPEILIG--GMTVPVGIYAV--RAIVDGE-VVTVIRWPQGG/
FINAL ----EEE-----------------EEE----HHHHHHHHH-------------------------EEEEE-------------------E------EEEEEE--\fusions to E1-like proteins
57168916 Campylobacter coli RM2228 --MRIKFN--------------GKELDTKLSTSLDFFKSVSK-------------------NENDVWIIN--GFAT---------KENIKIH-ENDELFCIERN|
57166736 Campylobacter jejuni RM1221 -MMRVKFN--------------GKELDTDFKTSLEFFENISK-------------------NENDVWIIN--GFAT---------KENIALN-EDDELFCIERN|
71837115 Pelobacter propionicus DSM 2379 -MIQIRLN--------------EKTIMVDDGLTLAMLAKQRR-------------------PGADVLILN--GFPA---------EDDTQIN-DGDAVFLIKRG|
77544308 Pelobacter carbinolicus DSM 2380 --MHIWIN--------------EQPHNISEDARLFEMRDRFK-------------------PQADVVILN--GFPV---------TSDRPLS-NGDRIVLIRRG|
68178158 Desulfuromonas acetoxidans DSM 684 --MIIVLN--------------ENKIQVEENQSLFDLRDQIK-------------------PEADVLICN--GLPI---------QSDRTLQ-PFDHVILIRRG|
18145265 Clostridium perfringens str. 13 --MNIKIN--------------EKWREVKENCTVYALKNEEF-------------------PDSHVIVLN--GFPL---------VEDKKLK-DGDRIVFIKKG|
28203841 Clostridium tetani E88 --MKIYVN--------------EIFLNVEEDIDVFKLKNKIK-------------------KDADIVIYN--GFPI---------NNNIVLK-PLDRIVFIKRG|
77683437 Alkaliphilus metalliredigenes QYMF --MKLIVN--------------EDEMDVKKGTTAFEVRNKVK-------------------KDADVVVYN--GFII---------KEDVLLQ-EGDLITLIQRG/
FINAL -EEEEE--------------EEEEEEEE-----HHHHHHHHH---------------------EEEEEEE---EEE-------------------EEEEEEEE-\TmoB
48094248 Pseudomonas_sp_OX1 ATFPIMSNFERD--------FVIQLVPVDTEDTMDQVAEKCAYHSINRRVHPQP-------EKILRVRRHEDGTLFPRGMI----VSDAGLR-PTETLDIIFMD|
78693154 Bradyrhizobium_sp_BTAi1 ALFPLQANFRGD--------FVVLLVPVDDGDTMSVVADKVAQHAVGLRVAE---------KNASKCVYHN-GKELPSAIT----VAQSGIQ-PMDWIEVAYV-|
68556036 Ralstonia_metallidurans_CH34 ALFPLSSNFEGD--------FVLQLVAVDTENTMDEVAAAAAHHSVGRRVKARP-------GHILRVRQQGSKECLPRTMK----VADSGLK-PTECVEVIWEP|
45479222 Pseudomonas_mendocina SAFPVHAAFEKD--------FLVQLVVVDLNDSMDQVAEKVAYHCVNRRVAPR--------EGVMRVRKHRSTELFPRDMT----IAESGLN-PTEVIDVVFEE|
71849051 Dechloromonas_aromatica_RCB ALFPLTSNFEGD--------FVLQLVAVDSENTMDEVAAAAAHHSVGRRVRARP-------GQILRVRRQGGEEFLPRTMR----VSESGLK-PTETVEIIWEA|
86565792 Frankia_sp_CcI3 ALLPLSAVFEHD--------FVSLLVAVDDADTVEVVGQKIAHHVVGRRLPAS--------DAPVGIRHN--GQVLAREAR----IGEAGVG-PLDHVEAFFDE|
72122837 Ralstonia_eutropha_JMP134 ALFPVISNFQYD--------FVLQLVAVDTENSMDEVAAAAAHHSVGRRVAPQP-------GKVVRVRRQGGDQFYPRDAR----IGDTDIK-PMESLEFIFCD/
FINAL EEEEEE-----------------EEE-------HHHHHHHH---------------------EEEEEE------EEEE--HH---HHHH-------EEEEEE--\repeat
84711628 Polaromonas_naphthalenivorans_CJ2 VVADEQLN-------------DRHLDLRDPVPTGRQILQAAEVRPVA--------------DYSIYAILPS-GEFEDLRLDE---TYDLRGR-GAERFVIFQTD|#1
69928899 Nitrobacter_hamburgensis_X14 EVAGTDLA-------------FGPVIIRDRTPTGAQIAAAAGLTPAQ--------------DPYVLSFLPD-GELVEILASE---TVDLDE--GRRRFIVTSAD|
17134587 Nostoc_sp_PCC_7120 KHYLVRID-------------DRSYKVDDPVITGGQLLDKASKRPVD--------------EYLIFQMLNN-GQLEEIRLDE---TVELRKP-GIERFITWRSD|
38423904 Synechocystis_sp_PCC_6803 QQFRIQVD-------------QQQLMIPDPVPTGRQILEIAQKRPAD--------------EFLVFYLLPS-GQLEEIRLDE---TVDLRQT-GIERFITFRSD|
28806071 Vibrio_parahaemolyticus_RIMD_2210633 FFALDSLQ-------------FRSLSVQDPVPTGRQLIEIAGLDSFD--------------DYSLFAILPS-GDFEDIRLNE---TVDLRAR-GVERFIAFKTD|
68554444 Ralstonia_metallidurans_CH34 ------LN-------------FIKIEIDDPVPLGRQVLTAAGMHGDD--------------NYSLFNILES-GDFEDVRLDE---QIDLRRP-GAERFIAFKSD|
77690161 Rhodopseudomonas_palustris_BisB5 RGMEYPVN-------------GAMAAFPDNVVNGREVLTRSGLVPAS--------------EYRLI-LVRN-GRTRLIGTDD---DVDLDKE-HGGSFRAFLSD|
39651045 Rhodopseudomonas_palustris_CGA009 LIADESFN-------------FRSFPFDDRQVTGAQIGEVFGAHPIS--------------DFVIIQQLES-LELETLRPTE---LADLRKS--VRFFV-IRGD|
14025878 Mesorhizobium_loti_MAFF303099 TNFTFKLD-------------GRVVATNDAIISGREVRALGGLDPAS--------------DYILIQIADR-TS-RSIGLEE---AIDFREM-PHSEFLSFQGD|
77961668 Yersinia_mollaretii_ATCC_43969 LFAQENLA-------------FRAIEVNDPVPLGRQILIAAGLRAND--------------DYSLFAILET-GDFEDLRLDE---TFDLRGR-GAERFVAFQTD/
FINAL --EEEEEE---------------EEEE-----HHHHHHHH----------------------EEEEEEEE-----EEE--------EE-------EEEEE----\repeat
84711628 Polaromonas_naphthalenivorans_CJ2 RAFKFTID-------------DRQMEWGKPSISGKILKVLAGVPTD---------------TYDVYLEVRS-GGQDVLIRDTD--LIDLSKP-GIERFITLIRD|#2
69928899 Nitrobacter_hamburgensis_X14 RSYRLTVD-------------GEQYDWPARMVTGATVRKLARVPAE---------------FL-VYLERQD-EPDRLIGNQD---IVNLGDK-GVEHFHARKQT|
67547440 Burkholderia_vietnamiensis_G4 --YKIRID-------------KDYYVVDVPHMTGEQILGLAGKTSA---------------GY-LLSEKVH-GQMRPVAPAQ---TVDFTAH-GVERFATIPKE|
17134587 Nostoc_sp_PCC_7120 RSFRFVID-------------GRRFEWGAPIITGLKLKELAGVDLA---------------SYGVWLELRG-AEDRPIADNE---SVDLQAP-GVERFFTGKKT|
38423904 Synechocystis_sp_PCC_6803 RSFRFVID-------------GRRFEWGIPLISGLKLKQLAQVSPQ---------------AYGVWLEVRG-GEDRPIADHE---TVNLEAP-GVERFFTGKKT|
28806071 Vibrio_parahaemolyticus_RIMD_2210633 RDFKFSLK-------------GRQIVWGKSEIDGSDLYFLADV-AD---------------EQAIFLDVRG-GTDRLIEPDD---TVDLSEA-GIEHFVVADKP|
68554444 Ralstonia_metallidurans_CH34 RNFKLTVN-------------GSQVVWGRPTISGADLYALSKP-AD---------------GEAVFMVVSG-GEDRQIERED---DVDLAAP-GVERFENAPKR|
77690161 Rhodopseudomonas_palustris_BisB5 RDFGFTVD-------------EVGQVWGTADMEVDEFLRIWPQHPE---------------HR-WVLERDD-EPDTVLTPGG---VLSFGPK-GVEHVVSRKDA|
39651045 Rhodopseudomonas_palustris_CGA009 ATYTFIVD-------------GLTMVWPKKTITGKAVKMLTNKDED---------------DIEVLLERED-RPDKVIGDDD---DIQLAAD-GVEKLKTRYAK|
14025878 Mesorhizobium_loti_MAFF303099 RAFSFTVN-------------ERGWEWGSATISAADIYRYASIDED---------------LE-LIL--DS-AGDTVIPADG---AVTLGGQ-GVERIRSREAK/
FINAL -EEEEEEE------------------------HHHHHHEE---------------------EEEEEEEEE-----------------EEEEE--EEEEEEE---\repeat
14025878 Mesorhizobium_loti_MAFF303099 KTVVIKVN-------------GRSRTVPRRKHSYREIALLAYPDA-NFEK-----------FKYTITYLKG-VHGA-EGDLVE--GENIEVKNGMVFNVRRSDK|#3
28806071 Vibrio_parahaemolyticus_RIMD_2210633 PDYIITVN-------------SREHVLDDPNVTYEQIVSFEFQYPPSNPN-----------TCYSMTYRHA-KSKPHAGELAAG-GSVIVKKKGTVFNVTATDK|
68554444 Ralstonia_metallidurans_CH34 PKVVIIVN-------------GTKEELPAPLVTFDQLVALAYPGQPPQPG-----------ITYSITYYKV-ASYPHQGPMAP--GGSVEAKNGSIFNVGRTIQ|
39651045 Rhodopseudomonas_palustris_CGA009 TTVTIIVE-------------GTPHKWDKKKISYAEVVTLEVSDYEHHPD-----------ITYSVNFTNG-PHNRPEGDLAK--GESVKVRDGMIFSVSETGQ|
88795473 Alteromonas_macleodii_Deep_ecotype KIFEIIVN-------------GRMKSVEDKFLTFVEIVKLAFGEFKECQN-----------QIYTMTFKRG-VGKK-EGSLVL--GDKVRIKDGVIFNVTATNK|
86566459 Frankia_sp_CcI3 KTVEIIVN-------------GRRRTVVKGELSFDEVVALAFDPVPAGDN-----------VDFTITFRRG-HGDKPEGTLRP--GGTVKIKEGMIFDVTATDR|
69928899 Nitrobacter_hamburgensis_X14 QNVLIEIA--------------TPTVVVAD--AMRQAGFDPAQPWHIFLKVQDQ-------TKREVAANYV-LDLRTPGIEKLR-LIPKDVNNGEACAPR----/
consensus/100% ........................................................................................................
consensus/95% ....................................h............................................................h......
consensus/90% ....h...........................s...h............................................................h......
consensus/85% ..h.h...........................s...h................................p...........................h......
consensus/80% ..h.h...........................o..ph...h.........................h..ps.......................h..h......
consensus/75% ..h.l.h.........................o..phh..h.........................h..ps....................p..hp.h....ss
consensus/70% ..h.l.h.........................o..phhp.hs........................h..ss....................p..lp.h....ss
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
2. UBC/E2 like domain
Helix-1 Str-1 Str-2 Str-3 Str-4 | * * Helix-2 Helix-3 Helix-4
Secondary Structure -hHHHHHHHHHHHHHh--------EEEEE----------------EEEEEEE--------------EEEEEEE---------------------------EEEE-----------------------------------------------------------HHH-------------------------------HHHHHHHHHHHHH-----------------------------------HHHHHHHH--h---hhhHHHHhhHHH
1ayzA_Ubc2_Scer_3659954 TPARRRLMRDFKRMKE---DAPPGVSASP----------LPDNVMVWNAMII----GPADTPYEDGTFRLLLE------------FDEEYPNKPP-----HVKFLSE---------------------------------MFHPNVYAN---------GEICLDILQ----------------NRWTP------TYDVASILTSIQSLFN---------------DPNPASPAN-----------VEAATLFKDHK---SQYVKRVKETVE
1Q34A_Ubc_Cele_34810893 TPSRRRLMRDFKKLQE---DPPAGVSGAP----------TEDNILTWEAIIF----GPQETPFEDGTFKLSLE------------FTEEYPNKPP-----TVKFISK---------------------------------MFHPNVYAD---------GSICLDILQ----------------NRWSP------TYDVAAILTSIQSLLD---------------EPNPNSPAN-----------SLAAQLYQENR---REYEKRVQQIVE
2E2C_E2-C_Ssol_4388942 HSVSKRLQQELRTLLM---SGDPGITAFP----------DGDNLFKWVATLD----GPKDTVYESLKYKLTLE------------FPSDYPYKPP-----VVKFTTP---------------------------------CWHPNVDQS---------GNICLDILK----------------ENWTA------SYDVRTILLSLQSLLG---------------EPN-NASPL-----------NAQAADMWSNQ---TEYKKVLHEKYK
1QCQA_Ubc4_Scer_5107650 MSSSKRIAKELSDLER---DPPTSCSAGP----------VGDDLYHWQASIM----GPADSPYAGGVFFLSIH------------FPTDYPFKPP-----KISFTTK---------------------------------IYHPNINAN---------GNICLDILK----------------DQWSP------ALTLSKVLLSICSLLT---------------DANPDDPLV-----------PEIAHIYKTDR---PKYEATAREWTK
2AAK_Ubc1_Atha_2981894 TPARKRLMRDFKRLQQ---DPPAGISGAP----------QDNNIMLWNAVIF----GPDDTPWDGGTFKLSLQ------------FSEDYPNKPP-----TVRFVSR---------------------------------MFHPNIYAD---------GSICLDILQ----------------NQWSP------IYDVAAILTSIQSLLC---------------DPNPNSPAN-----------SEAARMYSESK---REYNRRVRDVVE
1PZVA_Ubc_Cele_34811307 EQSSLLLKKQLADMRR---VPVDGFSAGL---------VDDNDIYKWEVLVI----GPPDTLYEGGFFKAILD------------FPRDYPQKPP-----KMKFISE---------------------------------IWHPNIDKE---------GNVCISILH---------DPPEEEEERWLP------VHTVETILLSVISMLT---------------DPNFESPAN-----------VDAAKMQRENY---AEFKKKVAQCVR
1I7KA_Ubch10_Hsap_13786748 GPVGKRLQQELMTLMM---SGDKGISAFP----------ESDNLFKWVGTIH----GAAGTVYEDLRYKLSLE------------FPSGYPYNAP-----TVKFLTP---------------------------------CYHPNVDTQ---------GNICLDILK----------------EKWSA------LYDVRTILLSIQSLLG---------------EPN-IDSPL-----------NTHAAELWKNP---TAFKKYLQETYS
2UCZ_Ubc7_Scer_2981900 KTAQKRLLKELQQLIK---DSPPGIVAGP---------KSENNIFIWDCLIQ----GPPDTPYADGVFNAKLE------------FPKDYPLSPP-----KLTFTPS---------------------------------ILHPNIYPN---------GEVCISILHSPGDDPNMYELAEEEEERWSP------VQSVEKILLSVMSMLS---------------EPNIESGAN-----------IDACILWRDNR---PEFERQVKLSIL
1J7DB_hUbc13_Hsap_15825811 AGLPRRIIKETQRLLA---EPVPGIKAEP----------DESNARYFHVVIA----GPQDSPFEGGTFKLELF------------LPEEYPMAAP-----KVRFMTK---------------------------------IYHPNVDKL---------GRICLDILK----------------DKWSP------ALQIRTVLLSIQALLS---------------APNPDDPLA-----------NDVAEQWKTNE---AQAIETARAWTR
1JASA_Hsubc2b_Hsap_34809571 TPARRRLMRDFKRLQE---DPPVGVSGAP----------SENNIMQWNAVIF----GPEGTPFEDGTFKLVIE------------FSEEYPNKPP-----TVRFLSK---------------------------------MFHPNVYAD---------GSICLDILQ----------------NRWSP------TYDVSSILTSIQSLLD---------------EPNPNSPAN-----------SQAAQLYQENK---REYEKRVSAIVE
1KPSA_Ubc9_Hsap_20150955 GIALSRLAQERKAWRK---DHPFGFVAVP-----TKNPDGTMNLMNWECAIP----GKKGTPWEGGLFKLRML------------FKDDYPSSPP-----KCKFEPP---------------------------------LFHPNVYPS---------GTVCLSILE-----------EDDDDKDWRP------AITIKQILLGIQELLN---------------EPNIQDPAQ-----------AEAYTIYCQNR---VEYEKRVRAQAK
1KPPA_Tsg<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=101">101</a>_Hsap_21465897 YKYRDLTVRETVNVIT------LYKDLKPVLDSYVFNDGSSRELMNLTGTIP----VPYR--GNTYNIPICLW------------LLDTYPYNPP-----ICFVKPT----------------------------SSMTIKTGKHVDAN---------GKIYLPYLH-----------------EWKHP-----QSDLLGLIQVMIVVFG---------------DEPPVFSRP----I------SASYPPYQATG---PPNTSYMPGMPG
1JATA_Ubc13_Scer_14719686 ASLPKRIIKETEKLVS---DPVPGITAEP----------HDDNLRYFQVTIE----GPEQSPYEDGIFELELY------------LPDDYPMEAP-----KVRFLTK---------------------------------IYHPNIDRL---------GRICLDVLK----------------TNWSP------ALQIRTVLLSIQALLA---------------SPNPNDPLA-----------NDVAEDWIKNE---QGAKAKAREWTK
FINAL HHHHHHHHHHHHHHH------------------------------EEEEEEE----------E----EEEEEE----------------------------EEE--------------------EE--------------EE-----------------EEEEEE-------------------------------HHHHHHHHHHHHH-----------------------------------------HHHHH----HHHH--------
_Rsp._22726448 TAGEARLIRECEELAS---LAAASAWLEEP-----QFGKNADGLLTWSFVLL----------AGDRRIPLRLV------------FPALFPDLPP-----FVLPADS-----------------------------SVRLSQHQYGEG----------GELCLQYRP----------------DNWHP------DCKSADVVRSAKALLE---------------ATPKDDGFS------------DVESAHPTDL---PSLLSGCSRRFM
OB2597_05120_Obat_84499281 LVDSARLAAERRSIEQ----AAAGEWFRFA------RWTLHHGLVCVEGEIL----------AHDNTYPVRLI------------YPDQFPLVPA-----WVEPAEK------------------------------ARWSSHQYSG-----------GSLCLELRP----------------DNWIP------TATGADVLESAFNLLH-----TEDPLGEGGATAPSDHRVG------------EVQTYGDLHL---PALIGAGCLDRL
RHE_CH01997_Retl_86357617 LNNTVRVAREKEAVEN---LATETEWFVLD------RWEIHDYKFAAIGSIV----------AHGATYPIRLV------------YPDNFPLVPA-----WVEPQDP-----------------------------EAKWSYHQYGKG----------GALCLELRP----------------DNWTS------RANGADVLRSAYGLLN----LENPLGDGEKGKVTSAHNVG------------EIQKYNWGES---PVFIGQECLTRL
y4oA_Rsp._2496721 RLTEVNVLKRGSDQDN---WWQAYPGLYAR-----ELAAYEGHGASHRPLIQ----------QDGTLILEVLWP-----------MDSAGSIRLN-----VGYSPLH-------------------PFCRPSISAPELQLERHQNPFT----------RDLCLLTQDS---------------AQWYPH---QMVADFIAERLSQVLQVM-------------------T----------------LRRNEQWSEA---ASLEEQAPDPVT
y4qC_Rsp._2496738 PAGRRRLAELQKLHSA------AGESLLVD-----EEAAAAGILRIEFSWPL----------NDGRTIGLRAV------------YPDTFPRLRP-----HVFLTCD----------------------------PSEYPERHCGSE-----------GALCLLGRDT---------------RYWQAN------MSLAELLDENLAHVL---------------DGT-------------------GAEDPQGEP---IEYWWNSLGQAS
ROS217_07909_Rsp._85706659 RTAQDHSAHDFGVMDA---WERVREVLAGH-----GFTLVPGSGRDRYQGQI----------KVGSVPVSLEIE-----------IADYDFLDLP-----KVRVLKR--------------------------EALPKRLTGHIVSD-----------GTLCYADKAT---------------FLLDRY----QPDRSVVSCLEQARTTL---------------NTLLHG---------------NPSVAYMAEL---AAYWSATPYCL-
_Cper_86475968 -MVILILDLFNSLNSF---ENIKNVKEIKK-----NNDNFEVNYSKIYEFTL----------NIQKQNFDIIMC-----------IPEEWNLKLI-----DFYIKDY----------------------------KNIKFIPHLEEN-----------GKICLFDKEG---------------LLVEEN----LNGIAIESIERLNKVLY---------------EGLNDI----------------NKLDFINEF---DAYWNLLSTNNI
GuraDRAFT_0469_Gura_88937743 DESLLKEALETCLLVK---SVAELHPKRLA-----EPWAKDRFVCRSYKLVI----------ELNGVPVDFYFG-----------VKKSFPLSLP-----YIFLAQW----------------------------DSFGILPHVETD-----------GYICYAQEDG---------------SVLDFD---DVAGIAQEALSRAIQVVV---------------DGISGK----------------NHQDFLDEF---GAYWDRLKKVKF
Psyc_1372_Parc_71038912 -MMSELHQTMLSCGFK---YLKNSQRQSIS-----FFDSIPTTRPIYVKDYK----------TSEGIFNVALV------------FGDDLYTTLP-----RAQVLKK-------------------------PKKIEQVLLPHINSG-----------GYLCYVEEKE---------------ADWNPN----NLNALYRAVDEQVQNTL---------------NTAISSLQNG----------QIDQAEFEGEF---VSYWKPEQTIY-
ELI_04040_Elit_84786718 -----FRFRMMSLADR---WRAIAATLANK-----GFTEQQGASPEFRGSIN----------VHGRAVDIELV------------IPDSKFVELP-----IVRLVDR--------------------------KQLPAGAFGHISRDDIEG-------SVVCFAPATG---------------LPLDFH----DPGGSVLRVLRQTELSL---------------EKSFAGQG---------------GAEVAAEY---QEYWIEKEPNFR
_Ecol_37927532 MKDGQLHQVMTGCGYR---YTRARNLPEKS-----ILHSRERGAGYYTKEYA----------TDAGNFNVALV------------IHPDPFTELP-----TAFIIEQ-------------------------PEQFKSCLMPHVALE-----------GFLCYVEQME---------------ADWDSN----DLEATYKEVDAQIHQTL---------------IDSVSAATQG----------VNDKRELEGEF---AAYWRPSETLFL
VC0180_Vcho_9654584 -MKQELHHTLLGCGFR---YTPAKQMPKGI-----LLDTKSRRKGYYVKEYS----------TKGGVFVIALV------------LWNDPHIQLP-----FAYILQQ-------------------------PEQYKGRLLPHINFG-----------FCLCYVTQME---------------ADWNSN----DLKSTYQDVDEQIQLTL---------------DNSVASVESG----------TSNDVELEGEF---SAYWQSEEELYL
PB2503_00627_Pber_84701417 GVISEARTALADRLGA---YLLSAFDAQPF-----SASDLQAYNGKKVDRGW----------RLPGDPPLHLL------------LDPEFPYAPP-----RIALPDE----------------------------TQRLLWPHVETA-----------GLLCVFPTQ----------------TNIDAF---EPEKVATALITDARDLIT---------------RNQSGD----------------LDEEFRKEF---QSYWTLAIDDKA
Shewana3DRAFT_3199_Ssp._78684828 ------LERHRGHSVL---SEIKQHLINQG-----FNCTTSEVAGGERIVVE----------TTILNHGIQLML-----------VADPPYYRLP-----EFFLINP----------------------------DSIGRLAHVSVHEYAGIQI----GTVCVNAPES---------------LSVNFE---QPLLVVEESLRRHILLLE---------------KCITNPD--------------WNHSELLREF---SSEWLRICAPDS
ArthDRAFT_2172_Asp._66965723 WERYAGLLQSEISWLQ---DLGIACRIDET-----KRDDHQTLTMELSVPET----------VTGTAPLELTAV-----------FPDFYPLVPP-----KVFAVDL--------------------------------GMPHHWNPFS---------NEVCLLGTPS---------------EEWGTN------GSLAQLLKDQLPAAL---------------KAGMSGDEH----------ADWNEKPQAEPF---GAYYNSYANSAM
FINAL HHHHHHHHHHH----------EEE-------------------EEEEEEEE---------------EEEEEEEE--------EEE---------------------------------------------------------------------------EEEEE-HH---------------HHH------HHHHHHHHHHHHHHHHHH-------------------------------------------------------------
Mdeg<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=02000735">02000735</a>_Mdeg_48864353 IHDVIRWLDETRSVAG--IQTVTSSDDGV------------VVATNWRVDLP--IRFESEGETESGIRSIEAV---------SWVFPWEYPLRAP-QPKLREDFPLT---------------------------------LPHINPVVEGED------ISPCIAEVDL-----------TDLLHSSGI-----EAVFGAMTHWLNNAASGEL-------------LCPVQGWE------------PVRRDNASGLI---SADTYAIREELN
SYN_01833_Saci_85859492 AQEELREIEAASEGAF--EVLSVRFPEGD------------HRSAIAEISVT--CFDMPYAEGGIKLRDRERF---------LIYIPPDFPFDVPSVYTPHRRFSG----------------------------------NPHVQWQ-----------TYLCLYQSRN-----------TEWDASDGM-----FGFISRLELWLRRAALNQL-------------DMEGAPLH------------PPVAYPTERIT---------------
Mmc1DRAFT_1998_Masp_68246513 ALEQVADIVAASNGTV--ELVQIDPPTSE------------GDTLLLRVSID--TSDYTFQKGGLKFRKREGF---------HIRVSSRFPIEPPIAKFTHQRFMG----------------------------------QAHVQWG-----------NQICLYLATD-----------VEWSASDGM-----FGFIKRLDQWLGDAAQDQL-------------DPDDAPLH------------PPAVYHSSDTK---FSVEIDTPELAD
pCPF5603_46_Cper_86559649 NDDFTMFYKGLLECKN--VKNITIYKLNI------------NSVIIRLELKI--NLPSRRSLMEFDIKEFEPIK--------LLCSTNEIKYKAPLVFSDRNDFPVE--------------------------------KLPHTLAMGLNY-------SYICLHRGNI-----------DDWYIDHSV-----EDFVNRIRFWFSDAACNNL-------------IKPGDDFE------------PMINYTETGNI---VYSYNKLTKFIE
RmetDRAFT_0537_Rmet_68559822 IADALHQLQRHRGLIR--VGEPRTTGAST------------EIEVDVAVQLP--NRSRRNGISETGVRTVETC---------VLVFGSDWPLSAP-EPFLRADFPLN---------------------------------LPHINPHRQGEL------VSPCLFEGSL-----------NELLHRFGL-----DAVVDQLIDWLHKAAAGTL-------------LDLEQGWE------------PTRRDSCPSTV---VFSAEKVAAAAP
MaquDRAFT_3270_Maqu_77955723 HIQMLVAAILQHQRSE--DHQVTERENEL------------VLDVSWRVQLS--SRDVEVGQSGTGIKRLEPV---------RFLIPFAFPLRPP-DITLRSDFPRE--------------------------------FVPHIYPGSPGDP------VCPCIAEVGI-----------TDLMFQEGI-----SGVLRSLQAWLDRAAQGTL-------------MDPSQGWE------------PILFQNIAGSF---LDDKGSFLRGVR
Nwi_2872_Nwin_74421923 AERFLAAALRHPECRG--GRLISVDAGGS------------RIELDLNVEMP--LAFKVDGASPNGVRVVETV---------NVRLWPSYPWSSP-SFYLRMDFPRD---------------------------------LPHVQPGPVTEP------PRPCLIDGNQ-----------REYFFQFGLVELGIFNLVHQLVLWLQRAAEGTL-------------IHHGRGWE------------PTLRCDLNDVI---ALNAEACRAVVD
XAC3952_Xaxo_21110358 DGRMQALLRACNAHAD--INVVELRRIED------------PFIAEIIVADV--GDGAVSPGNDAGIHRIERM---------ALLYRTGARFPFE-ARPLRKTFPKA----------------------------------LHQYATGNEGP------PSLCIMEGDW-----------ELAEHRFTP-----EALLETLLAWLEKTADGTI-------------HEADRGLE------------PVFYSLGQCLM---LPPDFAEALSDP
MaquDRAFT_3597_Maqu_77955313 NLPEPLSDLADACNDN--SDFDIVEFRRI------------SKDSYALVVDA--GDGTFDAENPVGIRRIERL---------AFVLNPNLGFPWE-VRALRSDFPVT----------------------------------MHQNHVEPNSP------RSLCLYVEPW-----------SSVERTWSP-----QSFLARALWWLRETACENL-------------HQANQPLE------------QLFFEPADQFV---LPEDYFERLTDT
PnapDRAFT_0071_Pnap_84717800 RAKTLFDVVSRQRDYA--VVQLLQHCDDG------------TPKLECIVVEV--ECDGVPPKNGVGINYRERL----------ALCVSDDPKQLIEVLAMRKDFPVL----------------------------------MHQNQGILDAP------ASLCLYFESV-----------AAVMRTWTP-----QSFLRRIQWWLEKSARGEL-------------HPTDQPVE------------HLFFATRYELV---LPWNLSTLRKSA
OB2597_18097_Obat_84502025 LTSSAAASFARFVDRH--AAELAAIVALR------------RGGAGELVELA--FRTGRPQQSVVPIRRTERI-----------GVRFAGGDSMPFVYVLRSDFPDT----------------------------------AHQNLTAEGSP------RAICIDDRGW-----------AEARLTWTP-----AELVQRILAWFRRAAEGAL-------------HDARQPVD------------PLMFGTGYNII---MSRALIDNANTQ
GOX2518_Goxy_58038271 RSRLARSVIEYVCDSV--EHPYATIQEFQ------------SDGLSDIVDLE--LEIDLAQDRAVPIRHREPV---------RIVFASPDDLIAPRVLSLREDFPSG---------------------------------QVHTNLDREVDG------LCLCIWEEGW-----------HDLSRNLTG-----QALVERIRWWFAGMADGSL-------------HADDQILE------------PLVATTSDTIV---FPLGTFVGPWFI
RSP_2047_Rsph_77387013 DEEIPDVLHPVTSLLR--IGVGPVTALEG------------WKEWRRGFFSL--PLVARVTISPGQSFPAESR-----------WHLVVSSGSYPA---DIFILPDK--------------------VAGPNLT------FPHQAAVYSRDGKEPWLNGEPCLTDPTAAFGDR------HGSRPEPIAL---ADRLIWKVERFSRWCELAAA-------------GRLHNPGD------------HFELPPLSGHT---NPMTIGFHETEG
FINAL HHHHHHHHHHHHHH-------------------------------EEEEEEE---------------EEEEEE-EEEE----EE----------------E---EEE------------EEEEEEEE----------EE-----EE-------------EEEEE-----------------------H---------HHHHHH-HHHH-----------------------------------EE-HHHHHHHHH----------------
Ava_C0067_Avar_75705484 EREGKESKYKFLSPE-----AVEKAFTSK-TAAS-------GWLSSNTIWWG---------KNPEGEAIIQFYSPQKYQIQIMGQEPEVITVPMP-----AFLFAGCSS----------RYYLWAIKGRVF-KPDTQLYKPPLPNVWED---------SSICFGG-------------------NSLS----MCSAATISQVWDLFWKSPFNKDLSQGKS-----KTHPDNIC--------------NQLIKLHESKA-KSYPSSDLVPVH
alr7559_Ana_17134644 EREGKESKYKFLSPE-----AVEKAFTSK-TAAS-------GWLSSNTIWWG---------KNPEGEAIIQFYSPQKYQIQIMGQETEVITVPMP-----AFLFAGCGS----------RYYLWAVKGRVF-KPDAQLYKPPLPNVWED---------SSICFGG-------------------NSLS----MCSAATISQVWDLFWKSPFNKDLSQGKS-----KTHPDNIC--------------NQLIKLHESKA-KSYPSSDLVPVH
p1B75_Asp._56315656 TVDGLRKMFDSLDPS----RSARPVFLEP-NVLS--------QGPGWLVWWM-----------KPQTRRVWFES--------KEIKLETAEVPHP-----GLVFAVTQE----------EWRVFAVQGRSRPRPGTKLYQAPYWNVWKG---------GRICAGS-------------------ARLP----SAGLQADPSGWEESFFSSR--------------FSHPNIHEKDALVKYKGG--SAKFWNAMLSGKF-KSFPQEVLVPAE
BCE_A0096_Bcer_44004435 TFKDFYLALKEVMEQGTQDNTHYSSGVLPKGCIKH--EVLSKSGDKQAVWIE----------VPKAQWDIHFFE------------RPFQQVGFP-----RLLFRYTVYQKRVT-----NISVFAVKEDMELEEGMKLYQFPYSNVHPS---------GSVCTGR-------------------VVIP----EFRTLKDLETFHVLFFASS--------------FNHDLTHTHTEP--------VGELFKRFEN----QSFDDSILMESE
BT_2648_Bthe_29339960 TYEFMNSLVESYTES----MSGIPHGRIPGNMLLC----DSRKGRERYIWYN-----------PPQKRKMYFQD---------GLHITDGTFNVP-----GVIYVVERE----------CMDIHAFKGA-IPEERTELYLAPFFNVAG----------ANVCLGSSS-----------------PKKPQ---DMDFLEFQEYWEKRFWMSE--------------FSHLGGNRNP----------TRSNLVSVTEHARNNPFDYSELQQSG
BproDRAFT_4305_Psp._67908644 TLTSKNLKLLAQQAQQ---GLKQDFEVIPANVLV--------ANDSLLAWWM-----------PKGTQLMSFDVSMHELAGKSRLQGVSGNVPTP-----ALVFAMMRNRNAGGAFE--GLYVFALEKSERPTSDTSLYRAPLLNVGED---------GSVCWGD-------------------GVKP----AGKTVKDISAWQALFFSSV--------------FTHYNGTVPIVGDD------PYAFIADLMETEA-KEFPAAALKPMK
RferDRAFT_4144_Rfer_74024822 KKDSLMAALRQLARQQ---GISDLVWVDD-QTIA--------TSSTLQVWWT-----------PAQSRWMHFQS---------QGLQLSLPAQNP-----PLVWLACGE----------CLMVFALKENIKPGPTTALHHAPLFNVFAN---------AEVCAGS-------------------MQKP-------KDGNAKEWVESFYAAT--------------FTHANPPSRRLTTYRQG---EKALWKHLMTSKKKPAFPTDKLKPFG
BproDRAFT_0623_Psp._67910471 TEADYLAMVKVLAPQQ----RPQMEWQDH-CILA--------KGMGKMIWWT-----------PPMNRAMFFKKS---DMFGATTFSGQGICPLP-----GMVWMSDGR----------DLFVYAYRGSAMPGKETRLCQAPLFNVWAR---------GEVCVGN-------------------ASRP----DDSAKGNPQAWERFLFDSH--------------FTHPNFAQVDRLTKGVK---PAEFWKKMVAKP-AQKFPESVLVDLE
PnapDRAFT_0124_Pnap_84717439 TQSDLNELVTGLSQSQ---SLSVPSWIDT-TMLA--------LGAGRMIWYT-----------PACQRAMFFKTS----SFTKDTFEAQGQLPTP-----GLVWLVMQG----------ALYVYAYKGSGRPDKETKLYQAPFFNVWSQ---------GKVCTGN-------------------AAMP----VGDNAAIPHMWVDAFFGSN--------------FTHPNFKEKDRLVKGVC---PIDFWKAMTEKP-LPVFPEGRLVDLP
RSc1659_Rsol_17428675 SLGELSEFVEAAQTA-----TAYRGFIEP-HVLY--------LAPNTVAWWR-----------PAAPRTVWFSAE-------KPIGTRHGVTAHP-----PLVFIVHER----------QWYVFALAKNERPAPNTPLHVAPYFNVWER---------GEICTGN-------------------VSLP----DRPAPDALKAYETAFFDSR--------------FTHPNHARITRHKDG-----GGALWAHLLDHPEITEFPATALLPRK
RmetDRAFT_6238_Rmet_68559357 NRMALIHAVRQVAANA----LPKGEFLTP-NVLS--------ISATTVTWWC-----------PAASRRVFFKCE--------EFGERNAIVAHP-----ALVFQASHS----------GFSVFALQGEDRPGPETALFEPPYFNTWDH---------GRICIGS-------------------AQVP----KQIDVASISGWEEGFFNSA--------------FTHPNHGGKRVAYERG----VYAFWKDMLDGKF-PDFPKQVLVPMK
PHG308_Cnec_38637969 NRMALIHAVREVAEAS----LPNGEFLTP-NVLS--------ISPTAVTWWC-----------PAAQRRVFFDCK--------EFGKRSAVVPHP-----ALVFQASQS----------GFRVFALRGDERPVPASELCEPPYFNTWDH---------GKICIGS-------------------AHVP----KQIDVASIAGWEAGFFNSA--------------FTHPNHGSKRVTYERG----AYAFWKDMLDGQF-PDYPKQVLVPMK
Bcep1808DRAFT_6253_Bvie_67543573 DRKVLVQTLQQLAEHV----APRAEFLPA-TVLG--------VSPEAVTWWC-----------PPAMRRVFFECE--------NLGKRSAVVPHP-----GLVFQALNQ----------GFRVFAVACSDRPVRETPLFEPPYFNTWDM---------GRICIGS-------------------AQVP----KRVDVASIDGWEAGFFDSA--------------FTHPNAGGKRIEYKDG----EYAFWRDMLDGKFGETFPLNALVPMK
Daro_2538_Daro_71847775 TPRAAMDLAKALLKR-----AAHGGFLPE-TVLY--------MDGDLIVWWM-----------PPARRHIAFRVD-AEQAEAFGGQERGESVPHP-----GLVFAASSR----------VWRVWAVKGAGRPTPATALFQVPYFNVNVQ---------GNICHGN-------------------APVP----EGTTVEKIAAWNDAFLRSY--------------FTHPNGPGKLIRYRGG----AYTFWRDMLDGRF-QRFPERVLVDVK
PproDRAFT_0257_Ppro_71839550 DVEMLGTLINALGRN-----VSIGGYLPP-NILS--------VGFDSMVWWV-----------KPSKRRVFFKTN------EEIIGERSEVVPHP-----GLVFGVNGSG---------VWAVCAVKGNTRPTEDTPIWQAPYFNVWSS---------GNICTGT-------------------IETP----KSVAVTETGKWEECFFSSY--------------FSHPNAHGSRQLINSRIN--PYQFWKTVLDGKY-KTFPTQKLVQTN
RBTH_06715_Bthu_75758403 NTLFEFVQKNCYETKTNTKKLDIPVFETP-A-----------LPPGTVKYMALPDGKI-----VLFMEKKEFKHNL------TYHSTKYKQIPFP-----NLLFVFVFRPNGDKYILE-NKRCYAFRDKVF-RDTTKLYRFPFSHVQKD---------GEMCFFF---------------------LT----EMQDLAQMSSFIHNWLSAA-------------FTDHYYNLENKNKW-------GWPLRQIFSETQGQPHFNYDKLIEED
RBTH_07326_Bthu_75758953 NTNIETIQQIFMKEQA------METPLLP-------------SQWGVVKYYRKNHYEGYVLTTPPTERVVKFDIG------RSSELPTEVTLPIP-----PMLWVFEVMTDQSGKKKLTHSMTYVIKHELL-SLKDKVFHAPFCNIGIS---------HGICWGR--------------------TLP----EVPIPKSIQSIPARFFSQPFNYDLSGNRVKPFEWTHPNGNTEDTECAVYHMMNEADKLKAAKEAGEAYSYPFDSLKPAG
FINAL ---------HHHHHHH-------EEEEEE---------------EEEEE-------------------EEEEE----------------------------EEEE----------------------HHHHH-------------EE-------------EEEEEEE------------------------- ---HHHHHHHHHHHHHHHH---
PnapDRAFT_3950_Pnap_84711628 TEGLAALPEADQRYLD---SHGFTVEVVS----------DGPHTGVVLKQMQ----LPQGK-FNHPAADVLVI------------LPPGYPDVAP-----DMFFCNL--------------------WLTLVSAGRYPTCADQPHTFM----------GHNWQRWSRH--------------NNSWRP------GVDGLHTMIKRIEHALAEAK---------------------------------------------------------
sll6054_Ssp_38423903 --VMTFLPESDRQYLA---NKDYTYEEIT----------EGSRKGLIFSKFP----LPNQK-YDVSEVDLLIL------------LPNGYPDIVP-----DMFYLEP--------------------AVKLVQGNRPPRATEARQQFN----------GRSWQRWSRH--------------EREWRR------GVDGIWTMLKRVEHALEVAA---------------------------------------------------------
alr7503_Ana_17134588 --VMSFLPSNDRQYLE---NRGLPFEEVV----------DASQKGVILREFQ----LPLGR-FDTEQADILIL------------LPSGYPDAPP-----DMFYLLP--------------------WVKLVQGAKYPKAADQPHQFN----------GQKWQRWSRH--------------NNEWRP------GTDGIWTMLKRIENALEVAA---------------------------------------------------------
NhamDRAFT_1902_Nham_69928899 PRQAFALLPVDERHLD---TMGLKWETVV----------DGGRRWLLIEGYP----VPEG--YNAAVVTLALE------------IPGPYPGAQI-----DMFYVHP--------------------ALRRLVGEEIP-ATQATETVL----------GRIFQRWSRHRGP-----------NSPWSS------RLDNVMTHLTLVDGALAKEVNQ-------------------------------------------------------
Bcep1808DRAFT_3228_Bvie_67547440 VRADFTVMEEDAEFLN---SKGYTWEAVA----------SDAKR-IVVRGFE----PPQG--FAPTKVDMFVI------------LPQGYPDTQI-----DMVYFSP--------------------PLTRNDGKPI--RSLVTNEFE----------GKTWQGWSRHRTA-----------NSPWRQ------GIDNVGTHLMLVDDFLRAELSK-------------------------------------------------------
FINAL EEHHHHHHHHHHHHHH-HHHHHHHHH-------------------EEEEE------HHHHHHHHHHHHHHH-----------------------------EEEEEEE----------------------------------EEEEEE----------------EE--------------------EE--------------HHHHH--------------------------------------------HHHHHH---HHHHHHHHHHHH
y4jF_Rsp._2496664 AFDDQAASCAEGQATL-DLAVRLLARLYP----------------VLAILPL---DSASSFQAQALERLAKSI--------------------NPK----IGIRRSGKS------------------------------AMVCLVAGATRP-------SLRCTTFF------------------IGS-------------DGWAAKLSRT---------------DPVGSGSSLL----------PYGAGAASCFG---AANVFRTIFAAQ
mll6192_Mlot_14025925 AFDDQAASCAEGQATL-DLAVRLLARLYP----------------VLAILPL---GSAASFQAQALERLAKSI--------------------NPK----VGIRRSGKS------------------------------ATICVVAGVTRP-------PLRCPTFF------------------MGS-------------DGWAAKLSRT---------------DPVGSGSSLL----------PYGAGAASCFG---AANVFRTIFAAQ
msi105_Mlot_20803932 AFDDQAASCAEGQATL-DLAVRLLARLYP----------------VLAILPL---GSAASSQAQALERLAKSI--------------------NPK----VGIRRSGKS------------------------------ATICVVAGVTRP-------PLRCPTFF------------------MGS-------------DGWAAKLSRT---------------DPVGSGSSLL----------PYGAGAASCFG---AANVFRTIFAAQ
RHE_PA00014_Retl_86359719 AFDEQACA-TEGRASL-DLLVRLVARLYP----------------TICLLPS---GEEAKKLAKNLASLARSI--------------------NED----ITIARRGSS-----------------------------ALSHCLVVGSTNP-------EISCPKFF------------------LGS-------------DGWIAKFSPE---------------EPVGTAGSNN----------AFGAGAAACIA---ASNLFRHIFRDQ
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
3. The JAB domain
EE.HHHHHHHHHHHH.........EEEEEEEE......E..................EEEEEEEEEE..........................................................................EEEEEEEEE..........HHHHHHHH..................EEEEEEEE........................EEEEEEE.....EEE..E.EE....
EE.HHHHHHHHHHHH.........EEEEEE...............................EEEEEE..........................................................................EEEEEEEE...........HHHHHHHH...................EEEEEE..........................EEEE....................
AF2198_Aful_11499780 SRGLLKTILEAAKSA-----HPDEFIALLSGSK-----------------------DVMDELIFLPFVS-------------------------------------GSVSAVIHL-------------------DMLPIGMKVFGTVHSHPSPSC--RPSEEDLSLFTRFG---------------KYHIIVCY-----------PYDEN--------SWKCYNR----KGEEVELEVVEKD-
PH0451_Phor_14590365 RRELLEYLLELAKSF-----YPREVAGFLRMKDG-----------------------VFEEVLIVPKGFF------------------------------------GESSVYFDL-------------------TLMPHDESIKGTFHSHPSPFP--YPSEGDLMFFSKFG---------------GIHIIAAF-----------PYDED--------SVKAFDS----EGREVELEVID---\Archaeal JAB
MK0214_Mkan_20093654 DARLLDSLLEASDKN-----HPDEFFAMLGGSI------------------------DAETITIDSLIVVP-----------------------FEA---------SDSGAIFDL--------------------LSVHTCDVIGTFHSHPYGDP--VPSEDDLMLFKRLG---------------AVHAIAAY-----------PYTPD--------RVEFYDK----SGRNITPVVEVRYT|
MA1736_Mace_20090588 LLYMQIKGIARDTLD-----FILEASKSMAPEEFAGLL-------------------QEQDGIITEVLILP----------------------GTES---------SNTNAVIR--------------------LYMMPNVKAVGSVHSHPGANR--RPSKADLRLFSKTG---------------NCHIIAGR-----------PYGRE--------SWTCYDR----EGNVRDLPVLDVEF|
MTH971_Mthe_15678989 FKPVRRVVVDSEVMD-----EVLEIARRSHPHEFAALLEGRQ---------------EGEVLHVTGLIFLP-----------------------SET---------SDEGAVMDV-------------------LMLPPFTGAVGSVHSHPGPVN--LPSAADLHFFSKNG---------------LFHLIIAH-----------PYTME--------TVAAYTR----NGDPVDFEVVP---|
VNG0778C_Hasp_15789943 GGRPSVLGIAEDALE-----FAREAAQDSHPDEYLGLLRATPASAFDLD--------ADDGYVVTDVLVIP----------------------GTET---------NPVSATFGS-------------------TQVPNDMRNVGSIHSHPNGVL--APSDADRSMFGKG----------------QLHIILGH-----------PYGPD--------CWRAFDS----EGEPRTTTVLDVDL/
Z1657_Ecol_15801143 STRAAREWLILNMAG-----LEREEFRVLYLN-------------------------NQNQLIAGETXF-----------------------TGTINRTE------VHPREVIK--------------------RALYHNAAAVVLAHNHPSGEV--TPSKADRLITERL----------------VQALGLVDI----------RVP----------DHLIVGG----NQVFSFAEH-----\RadC
radC_Bsub_16079856 SPEDGANLVMEDMRF-----LTQEHFVCLYLN-------------------------TKNQVIHKRTVF-----------------------IGSLNSSI------VHPREVFK--------------------EAFKRSAASFICVHNHPSGDP--TPSREDIEVTRRL----------------FECGNLIGI----------ELL----------DHLVIGD----KKFVSLKEK-----|
yfjY_Ecol_16130559 STQAARDWLKLKMAG-----LEREEFMMLYLN-------------------------QQNQLIAHETLF-----------------------AGSISSTE------VHPREVVK--------------------RALYFNAAAVILAHNHPSGDT--TPSQADKTITQRL----------------VQALQLVDI----------RVP----------DHLIVGG----RQIYSFAEH-----|
radC_Mace_20090827 SPKDVYALMYPRMRE-----QKKEKFITLYLD-------------------------TKNQILKEEVVS-----------------------IGSLNASI------VHPREVFK--------------------SALLESSASVIMVHNHPSGDP--SPSREDIMVTEKL----------------VEGGKLLGI----------DIL----------DHIIIGD----GRYVSLKDE-----|
radC_Ecol_6686314 SPEMTREFLQSQLTG-----EEREIFMVIFLD-------------------------SQHRVITHRRLF-----------------------SGTLNHVE------VHPREIIR--------------------EAIKINASALILAHNHPSGCA--EPSKADKLITERI----------------IKSCQFMDL----------RVL----------DHIVIGR----GEYVSFAER-----|
RSc2620_Rsol_17547339 SPAAVKEYLRAKLAG-----FEHEVFAVLFMD-------------------------TQHRLIEYAEMF-----------------------RGTIDGAS------VYPRELVK--------------------EALRLNAAAVIVSHNHPSGNP--EPSGADRALTQRL----------------KEALGLVDV----------RVL----------DHVIVAG----TDTTSFAER-----|
VC1786_Vcho_15641789 RTENTTEYLRCKLAG-----YEHEIFAVLFLD-------------------------NQHRLIEFKELF-----------------------RGTVDAAS------VYPREVLK--------------------EALNVNAAAVIFAHNHPSGDP--EPSQADRRITQRL----------------KDALSLVDI----------RVL----------DHVVVGK----SS-VSFAER-----|
radC_Smel_15965481 SWSAVIDYCHAAMAH-----ETKEQFRILFLD-------------------------KRNTLIADEVQQ-----------------------QGTIDHTP------VYPREVVK--------------------RALELSATALILVHNHPSGDP--TPSRADIDMTKLI----------------AEAAKPLGI----------ALH----------DHVIIGK----DGHVSLKGL-----|
radC_Paer_15600512 SPQAVRDYLKARLRH-----EQHEVFACLFLD-------------------------TRHRVLSFEVLF-----------------------QGSIDGAS------VYPRQVVK--------------------RTLAHNAAALILTHNHPSGDA--RPSLADRQLTARL----------------KEALALIDV----------RVL----------DHFIIGD----GEPLSLAEY-----|
radC_Rsol_17547163 SPQSVKDFLRLTLGH-----RPQEVFACLFLD-------------------------VRHRLIAWEELF-----------------------QGTLTEAR------VYPREIAK--------------------RALHHNASALILSHNHPTGHV--EPSESDLVLTREL----------------CRALALLDV----------RVL----------DHMIVGR----AEVYSFLEH-----|
radC_Atum_17935503 SWSSVIDYCHAAMAH-----ETREQFRILFLD-------------------------KRNVLIADEVQG-----------------------QGTVDHTP------VYPREIVR--------------------RALELSSTALILIHNHPSGDP--TPSRADIEMTKTI----------------IDTAKPLGI----------TVH----------DHIIIGK----DGHASFKGL-----|
radC_Ssp_16331325 SPEAAAIALSQDLMW-----QTQEHFAIVMLD-------------------------VKNRLLATKVIT-----------------------IGTATETL------IHPREIFR--------------------EVIKQGATRLIVAHNHPSGGL--EPSPEDIRLTEFL----------------LQGAQYLQI----------PVL----------DHLILGH----GKHQSLRQC-----|
radC_Cace_15894524 SPKEAANLVMEQLRS-----FNKEHLYVIMLN-------------------------TKNIVIKISDVS-----------------------VGSLNSSI------VHPREVYV--------------------EPILKHAASIILCHNHPSGDP--KPSNEDLNITKRL----------------YECSKFIGI----------ELL----------DHIIIGD----GIYISLKEE-----|
TM1557_Tmar_15644305 DSSVKVYKYCQEMVY-----LEREIVKVICLD-------------------------TKLNVIGENTLT-----------------------VGTSDRSL------IHPRDVFR--------------------TAIRANASGVIVVHNHPSGDP--TPSKEDRLITERL----------------KQAGEILGV----------SLV----------DHVIVSR----RGYFSFREE-----|
radC_Aae_15606726 RNPQEAFEFLKDKFD-----ERRESLIALYLD-------------------------LSNRLLDWEVVA-----------------------IGNVNTVF------SKPKDILF--------------------KAVKLSANGIIIAHNHPQGEP--SPSNEDLNFTERL----------------KKACELLGF----------ELL----------DHLILSE----GRYFSFREE-----/
COPS5_Hsap_12654695 SALALLKMVMHARSG-----GNLEVMGLMLGK------------------------VDGETMIIMDSFALP--------------------VEGTETRVNAQAAAYEYMAAYIENA------------------KQVGRLENAIGWYHSHPGYGC--WLSGIDVSTQMLNQQFQ------------EPFVAVVID----------PTRTI---SA---GKVNLG-----AFRTYPK-------\Euk JAB
RRI1_Scer_6319985 SKLSCEKITHYAVRG-----GNIEIMGILMGF------------------------TLKDNIVVMDCFNLP--------------------VVGTETRVNAQLESYEYMVQYIDEMYNHNDGGDGR--------DYKGAKLNVVGWFHSHPGYDC--WLSNIDIQTQDLNQRFQ------------DPYVAIVVD----------PLKSL---ED---KILRMG-----AFRTIES-------|
PSMD14_Hsap_5031981 SSLALLKMLKHGRAG-----VPMEVMGLMLGEF-----------------------VDDYTVRVIDVFAMP--------------------QSGTGVSVE-----AVDPVFQAKMLDML---------------KQTGRPEMVVGWYHSHPGFGC--WLSGVDINTQQSFEALS------------ERAVAVVVD----------PIQSV----K---GKVVID-----AFRLINA-------|
Rpn11_Tbru_18463065 SSLALLKMLMHGRAG-----VPLEVMGLMIGEL-----------------------IDDYTVRVSDVFSMP--------------------QTATGQSVE-----AVDPEYQVHMLDKL---------------SVVGRPEKVVGWYHSHPGFGC--WLSGEDVMTASSYEQLT------------PRSVSVVID----------PIQSV----R---GKVVID-----AFRTTKD-------|
_Ddis_2104757 SSLALLKMLQHARAG-----VPLEVMGLMLGEL-----------------------IDEYTIRVIDVFAMP--------------------QSGTSVSVE-----AIDPVFQTKMLDML---------------KQTGRDEIVIGWYHSHPGFGC--WLSSVDVNTQQSFEQLQ------------SRAVAVVVD----------PLQSV----R---GKVVID-----AFRTIKT-------|
ECU11_0570_Ecun_19074857 SSLALLKMLKHGRAG-----IPLEVMGLMLGEF-----------------------VDEYTVKVVDVFAMP--------------------QSGTNVTVE-----SVDPIFQMEMMSIL---------------KATGRHETVVGWYHSHPGFGC--WLSTVDISTQQSFEKLC------------KRAVAVVVD----------PIQSV----K---GKVVID-----AFRLIDN-------|
RPN11_Scer_14318526 SSIALLKMLKHGRAG-----VPMEVMGLMLGEF-----------------------VDDYTVNVVDVFAMP--------------------QSGTGVSVE-----AVDDVFQAKMMDML---------------KQTGRDQMVVGWYHSHPGFGC--WLSSVDVNTQKSFEQLN------------SRAVAVVVD----------PIQSV----K---GKVVID-----AFRLIDT-------|
C6.1A_Hsap_1168719 ESDAFLVCLNHALST-----EKEEVMGLCIGELNDDTRSDSKFAYTGTEMRTVAEKVDAVRIVHIHSVIIL--------------------RRSDKRKDR----VEISPEQLSAASTEAERLA-----------ELTGRPMRVVGWYHSHPHITV--WPSHVDVRTQAMYQMMD------------QGFVGLIFS----------CFIEDKNTKT---GRVLYT-----CFQSIQA-------/
Stambp_Mmus_17941277 NLCSEFLQLASANTA-----KGIETCGVLCGKLMR----------------------NEFTITHVLIPR----------------------QNGGPD-------YCHTENEEEIFF------------------MQDDLGLLTLGWIHTHPTQTA--FLSSVDLHTHCSYQMM-------------LPESIAIVC----------SPKFQET------GFFKLT-----DYGLQEI-------\Euk JABs
SPAC19B12.10_Spom_19115685 LLKKVFLDVVKPNTK-----KNLETCGILCGKLRQ----------------------NAFFITHLVIPL----------------------QEATSD-------TCGTTDEASLFE------------------FQDKHNLLTLGWIHTHPTQTC--FMSSVDLHTHCSYQLM-------------LPEAIAIVM----------APSKNTS------GIFRLL----DPEGLQTI-------|
CG2224_Dmel_7301945 DTMEVFLKLALANTS-----KNIETCGVLAGHLSQ----------------------NQLYITHIITPQ----------------------QQGTPD-------SCNTMHEEQIFD------------------VQDQMQLITLGWIHTHPTQTA--FLSSVDLHTHCSYQIM-------------MPEALAIVC----------APKYNTT------GFFILT----PHYGLDYI-------|
Stambpl1_Mmus_17390801 DLCHKFLLLADSNTV-----RGIETCGILCGKLTH----------------------NEFTITHVVVPK----------------------QSAGPD-------YCDVENVEELFN------------------VQDQHGLLTLGWIHTHPTQTA--FLSSVDLHTHCSYQLM-------------LPEAIAIVC----------SPKHKDT------GIFRLT-----NAGMLEV-------|
1039_Ddis_2582351 HGEVFQEFMRLAENNTK---RSIETCGILSGTL------------------------SNDVFRITTIIIPK--------------------QEGTTD-------TCNTIEEHEIFE------------------YQLENDLLTLGWIHTHPTQDC--FLSAVDVHTHCSYQYLL------------QEAIAVVIS----------PM-----------ANPNFG-----IFRLTDP-------/
AF2198_Aful_11499780 SRGLLKTILEAAKSA-----HPDEFIALLSGSKD-----------------------VMDELIFLPFVS-------------------------------------GSVSAVIHL-------------------DMLPIGMKVFGTVHSHPSPSC--RPSEEDLSLFTRFG---------------KYHIIVCY-----------PYDEN--------SWKCYNR---KGEEVELEVVEKD--\Archaeal JABS-2
VNG1818a_Hasp_16554503 TREGYDSVLDHAQAD-----TPREACGVFVGE------------------------RDGDLRRVTAVRRVP--------------------NVADAPRV------RYELDPEATLAVFD---------------EAAAVGREVVGFYHSHPVGPG--RPSATDREHAQ------------------WPDRVYVVA----------SLAARPPILD---AWLWTGE----AFER----------|
PA2102_Paer_15597298 TEHALSVIYRHACRT-----YPRECCGFVLADA-----------------------KVKEGTNIQDELHMA-------------------DPRRYPRTAA-----NGYTFSVTDTVFLN---------------SSFKTCSPVSVIYHSHPDVGA--YFSREDIDKALYAGEPM------------LPVDYLVVD--------VAAGNVRGAKLF---AWRNGRF---ECTREFGPSSQ----|
PAE2024_Pyae_18313041 MPKAFLEEARKKCA------PEAECVALIFGISDT-----------------------ALSWRWMKNVAA-----------------------------------SPVFFKLDPEEVYKAIV------------EAEERGEELLAIFHTHPGPP---TPSWEDVRHMRL-----------------WPVTWIIAN----------VFDWHI---S---AWRIDG-----GLKTIPL-------|
APE0681_Aper_14600889 ASIGPLRQVLKLMAL-----AHNEEAGLVIGARR-----------------------GDTVYAYILYRTDN-------------------LKQSPEEFES-----DPWQVVQAHR-------------------AAEKLGLEVVGVYHTHTTCPP--SPSGKDVEGMKR-----------------WPGVWLIAC----------PGEVK--------AWTLEGE---TPVEIELE-------|
PH1488_Phor_3257912 LPKNIIEEIITRSRE-----SKIEICGFIFGTK--------------------------NGERFIGKEVE-------------------FIRNRLNSSVEFE---MDPEEMINALE------------------RAERKGLEVVTIFHSHLNCPP--YPSKKDIKGMENWR---------------IPWLIVSLK----------GD-----------MKAFILR----SNNEVEEVKI----|
SSO0111_Ssol_15897071 NRYFKINCWSRRFMD-----NLKEKCGIICNNT--------------------------FYELKNISRTE-------------------YE--------------FICDPSDFYTT------------------VKGKCSDDIQAIVHTHEESC---EPSYKDIMSMKIWN---------------IPWIIISKK----------CIKSILYLNG---SILELD----IHSLLSQELYHSLM-/
sll0864_Ssp_1652702 SQVHQDQIYRHGERC-----YPEECCGLLLGKILIGENGH-------------------RHWQVVEVQPTENCWGDVE-----------EFQQNNHQGNKLHYFAIDPKVLLSAQK------------------DCRQKGLSIIGIFHSHPHGQP--IPSEFDRAIA-------------------WPEYIYLIA-----SGENGRFNTSR-------SWYLNEA----GNFMEVDS------
YPMT1.08c_Ypes_16082790 MQEIYLTAIKR---------YPNEACGFLVRT---------------------------TGEKYRFMEARN---------------------------------VSENPENTFVMHADDI--------------IAAEDAGDVVAIWHSHTDESA--DASDADRAGCEATE---------------VPWLILAV-----------RKNVEGD------APFHFSE---MNVITPDGFEMPYL-
_Scoe_7479881 TQALYDQIVAHARED-----HPDEACGVVAGPAG-----------------------EGRPERFIPMLNAA--------------------RSPTFYEFD-------SQDLLKLYR------------------EMDDRDEEPVVIYHSHTATEA--HPSRTDVTYAN------------------EPGAHYV------------LV-----------STADTDG---AGEFQFRSFRIVAG-
DR0402_Drad_15805429 PAPLRRALWAQVRRE-----LPRECVGALGGW------------------------VRGEQVQAHALYPLP--------------------NVAADPER------EYLADPGDLLRVVR---------------AMQREGLDLVALYHSHPHGPA--APSASDRRLAA------------------YPVPYLIAD----------PAAE---------VLRAYLL---PGGEEVEV-------
_Aae_2984019 KKEVLEKMIKQAERD-----YPYETCGLLIGK-------------------------SEGGIRIAYEAFET-------------------PNANPDRKHDRYE--IAPKDYMRAED------------------YAISKGMEIVGVYHSHPDHPD--RPSQFDLQRAFP-----------------DLSYIIFSVQ------KGKVASYR--------SWELKGD---KFEEEEV--------
RPCDRAFT_2255_Rpal_78493975 NEETLALIVRHAEQA-----YPKECCGFVYADGEVRA-------------------CVNIQDDLKSID--------------------PARYRHGATAGYTL-----SVADTLALNG-----------------SFETANPASV-IYHSHPDVGA--YFSQEDSDEALFLGTPVYP----------VDYLVVDVRR------AKALEAKL--------FVWRKAG---FFCARVFPIDQSYR-\ThiF+Rhodanese
Noc_0361_Noce_76882206 PRPLVNQLLHQAQVK-----PQQEICGLISAR----------------------------NGLPSRCYP-------------------INNIAPEPQRHFFM-----DPQGQIAAMR-----------------RMREEGEELFGIYHSHPETAP--LPSKSDLAQAAYP----------------GALYLIISLN------TKGVLEMR--------GFRLQGE---VYEEIELQL------|
RRSL_01365_Rsol_83748715 LSELVDAVLAQARRD-----HPIETCGVIAGPV--------------------------GSDRPARLI--------------------PMRNAAQSIDAFRL-----DAQEQFQVWS-----------------EMDAREEEPIVLYHSHTGTNA--CPSRDDVRFAAEP----------------HAHYLIVSTD------PACGQAVR--------SFRIAEG---RAVEETIKVVARYQ-|
MlgDRAFT_2849_Aehr_78700360 PARERDRLARLGLAR-----WPEEACGLMLGCD---------------------------GRVRRLVL--------------------CRNVAARRADRYLV-----HARDFLRWDR-----------------AAHRLGLDILGVWHTHPDGGA--RPSGTDREQAWR-----------------GWSYLIAAVD------GRAITELR--------SWRLRGD---HFIEETLCLKPA---|ThiF+Rhodanese
NE2352_Neur_30181074 HTKLISAMITQSLKD-----HPIETCGIIAGLA--------------------------GSNLPLRLI--------------------PMRNVAQSENFFMF-----DPQQQLQVWK-----------------EMSARHEEPVVIYHSHTGSEA--YPSRSDVELAAEP----------------QAHYVIIPTC------SPHKEEIR--------SFRIVDQ---MVIEERVQIVRQYQ-\ThiF+S (S)
Nmul_A0971_Nmul_82702100 HAKLVEAMLAQAHKD-----HPFEICGVIAGPE--------------------------KSNLPLRLI--------------------PMRNAAQSETFFKF-----DPQEQLQVWR-----------------EMEARGEEPIVIYHSHTHTPA--YPSRTDVQYASQP----------------QSHYVIVPTD------PAYGEEIR--------SFRILDG---MVTEERIRMINSYK-|ThiF+S (S)
pdtG_Pput_84994017 TAQALEQVRHLAQAA-----HPIEACGLIAAAS--------------------------GEPLAHRVV--------------------PMRNQAASPTWFSF-----DPREQLQVWR-----------------ELDQRDEDCRVIYHSHTASEA--WPSREDIALASDP----------------QVHYLIVSTW------GEARHAAR--------SFRIIDG---RVFEEPLCVQP----|siderophore
HCH_02850_Hche_83645617 LSELVDAMVRQAQAE-----HPIETCGVIAGRE--------------------------GSDRPLRLI--------------------PMRNAAASSDMFMF-----DAREQLQIWR-----------------EMDANGEEPVVIYHSHTASRA--YPSKDDILCAAEP----------------HAHYVIIPTD------PEHGSDIR--------SFRIVNG---AVVEETIKAVEHYS-|siderophore
qbsD_Pflu_28192389 SQDIITAIFDQARQA-----HPLECCGIIAAAI--------------------------DSERATRLI--------------------PMTNSACSPVYFAF-----DPRQQLQVWR-----------------EMDARDEEPRVFYHSHTASRA--YPSATDIEFATDA----------------NAHYLIVTT-------ADYDPPLR--------SFRIAQG---CVSEEEVRVETPPY-|Siderophore
_Pstu_5070640 KRQALGQVLAQARRD-----HPLETCGIVASSL--------------------------EAQLATRVI--------------------PMRNQAASQTFFRL-----DSQEQFQVFR-----------------SLDDRNEFQRVIYHSHTASEA--YPSREDIEYAGYP----------------EAHHLIVSTW------ENAREPAR--------CFRILRG---KVIEESISIVE----|Siderophore
SAV5162_Save_29608821 TQALVDQIVAHARQD-----HPDEACGVVAGPE--------------------------GSGRPERFI--------------------PMLNAARSPTFYEF-----DSGDLLKLYR-----------------EMDDRDEEPVIIYHSHTATEA--YPSRTDISYANEP----------------GAHYVLVSTA------DADDAGPF--------QFRSFQI---VAGEVTEEEVKVVE-\Cys Syn ClpS
NocaDRAFT_2642_Nsp._71366889 ARATYDAIVAHARRD-----HPDEACGIVAGPE--------------------------GSDRPERLV--------------------EMVNAAGSPTFYEF-----DSTELLQLYK-----------------EMWARDEEPVVIYHSHTATEA--YPSRTDIGLASEP----------------GAHYVLVSTRHGADSRGGNNGGPV--------EFRSYRI---VDGEVTEEEVVVVD-|Cys Syn
RxylDRAFT_0217_Rxyl_68563153 GRGDVEHIHRHAREA-----YPEECAGALVGMDVGG--------------------GTKIVVDVWRA---------------------ENVHEEERSRRFLI-----EPEQIRRFER-----------------RAAERDMDVLGFYHSHPDHPA--EPSEYDRQHAWP-----------------YYSYVIVSVS------GEEIREMR--------SWRLRDD---RSGYDEEEIVG----|Cys synthase
SRU_2040_Srub_83814538 TPDILDQIRVHGADA-----YPEEGCGFLLGTVTDD--------------------GDNRVAALHRA---------------------TNRRSEQRTRRYEL-----TADDYRAADA-----------------AAQEQGLDVVGVYHSHPDHPA--RPSATDLEEATFP----------------GFTYVIVSVR------DGAPEALT--------AWALAPD---RSEFHREDIVRPDP-|Cys
AcidDRAFT_1958_Susi_67932292 ESAAWAAMVKHAQAS-----YPNECCGAMLGDT--------------------------DGETKLVR----------------ESIALENAFEGAQAARYEL-----RPQDLLAADK-----------------AARERNMDLIGIYHSHPDCDA--YFSKTDLQNSCP-----------------WYSFVVLSIQ------KGEFHHAN--------SWLPNFD----QTEAAKEELSY---|
MT1376_Mtub_13880984 RADLVNAMVAHARRD-----HPDEACGVLAGPE--------------------------GSDRPERHI--------------------PMTNAERSPTFYRL-----DSGEQLKVWR-----------------AMEDADEVPVVIYHSHTATEA--YPSRTDVKLATEP----------------DAHYVLVSTR------DPHRHELR--------SYRIVDG---AVTEEPVNVVEQY--|
nfa10890_Nfar_54014564 KSDLVAAMVAHARAD-----HPDEACGVIAGPE--------------------------GSDRPERFI--------------------AMTNAERSPTFYRF-----DSGEQLKVWR-----------------EMDAADEEPVVIYHSHTATEA--YPSRTDISYASEP----------------NAHYVLISTR------DPEQHELR--------SYRILDG---VVTEEPVRVVDDYD-|
Franean1DRAFT_3647_Fsp._68231909 DRTHYEAIVAHARRD-----HPDEACGVIAGPE--------------------------GSDRPERHI--------------------PMVNAARSPTFYEF-----DPAEQIKVWN-----------------EMFDRDEDPVVIYHSHTATEA--YPSRTDISIAGYP----------------EAHYVLASTR------DPETIEFR--------SFRIADG---EVTEEPVEIL-----|ClpS
Tfu_2370_Tfus_71916501 DRSIYDKIVAHARRD-----HPDEACGIVAGPE--------------------------GSDRPERFI--------------------EMINAERSPTFYRF-----DSLEQLKVWR-----------------EMEERGEEPVVIYHSHTSTEA--YPSRTDISYASEP----------------NAHYVLVSTR------DPETVEFR--------SYRIVDG---VVTEEPVEIID----|ClpS
Francci3_0866_Fsp._86739579 DRACYEAIVAHARRD-----HPDEACGIVAGSL--------------------------GSDRPKRFI--------------------PMENAERSPTFYRF-----DPMEQLKVWR-----------------EMDDRDEEPVIIYHSHTATEA--YPSRTDVSLAAEP----------------GAHYVLASTR------EPDVTEFR--------SYRIVDG---VVTEEPVEIV-----/ClpS
WS1005_Wsuc_34483108 -KALFDSIIEHAQRE-----LPLEACGYVAG----------------------------VEGEVKRLF--------------------PMRNVDASPEHFSF-----DPAEQFSAFK-----------------EAQKEGLRLIGCYHSHPSTPA--RPSDEDIRLAYDS----------------SLSYLIVS--------LAKEPVLN--------SFKIKEG---VVTPENIEVI-----\Sulfite metabolism
Gmet_1569_Gmet_78194034 -RAIHAELIAHAQAD-----APIEACGILGG----------------------------IDGAVSAIF--------------------RMANTDQSDEHFMM-----DPKEQFAVVK-----------------ELRNRGLAMLAIYHSHPETPA--RPSEEDIRLALTP----------------GVSYVIASL-------AGAEPDVK--------AFRITDG---VVEPEPIDIVE----|
Cphamn1DRAFT_2826_Cpha_67938821 CKSVYEKIIEHARRE-----TPLEACGYLGGK----------------------------GKTVIEAY--------------------CLTNIDQSREHFSF-----DPKEQFNAVL-----------------TMRSKKQLAVAVYHSHPVTPA--RPSQEDIRLAFDP----------------EIINVIVSL-------AAQEPEVN--------AFRIVKG---DVTEEPLVVIEGLC-|
CtheDRAFT_3348_Cthe_67873786 TKQQYQEILEHSRNA-----LPNEACGLLGGRI------------------------ENGVKYVEKVY--------------------LLRNIDESPEHFSM-----NPKEQFAAVK-----------------DMRNNGWELLGNFHSHPATPS--RPSEEDIRLAFDP----------------KASYLILSLK-------DDTPVLK--------SFNISSG---QATQEELSIVGEEA-|
DhafDRAFT_0037_Dhaf_68208688 TKKQMEEMLAHARQA-----LPNEACGLLGGRR------------------------DGDDRWVERVY--------------------PLNNLDQSPEHFSM-----DPREQLTAVK-----------------DMRKNGWVMLGNFHSHPATPA--RPSAEDKRLAFDP----------------SLSYLIISLA------EPQKPVCK--------SFLIKKD---GVDEEEIILKEE---|
AmetDRAFT_0932_Amet_77686499 -KENYNQIVKQAKEE-----FPLECCGLLAGVK------------------------TDDEILIKKVY--------------------ALTNIDQSSEHFSM-----DPKEQFAAIK-----------------QMRTDGDIVVGNYHSHPYTPS--RPSEEDKRLAYDP----------------KALYGILSLK-------DQEPVLN--------FFKITAN---ELVEKLELLVI----|
CsacDRAFT_2033_Csac_82499136 PKTLYEEMLNHCLNS-----LPIEACGLLGGVI------------------------EDEKRIVKKVY--------------------LLTNVDQSPEHFSM-----DPLEQFAAVK-----------------DMRKNGWVLLGNFHSHPTTPA--RPSEEDKRLAFDK----------------SLSYLILSLM------DEKNPVLK--------SFRIYES---YVEEEEIQII-----/
Syncc9902_1941_Syn_78169801 DLQCLTVLERSLLAV-----KPQEGCGLLLGTGL--------------------RTPRLRLVTLWPACNAWKKS------------DWVNDALGDLETRFVL-----DPREQIAAQR-----------------WARVHGLEVLGVCHSHPKTAP--EPSTRDCAWAEP-----------------NQLMLILS---------GMRELR---------AWWLGAD---RHPLEIPIEVWENHT\ThiF+Rhodanese
Syncc9605_0389_Syn_78196400 DHRCHTDLRRILLAP-----HPEEGCALLLGQRT--------------------NSGCLRVTTTWPCCNVWGRG------------ASGQRPVHDRCRRFLV-----DPREQLAAQR-----------------WARNRHQYCLGVAHSHPASEP--VPSPHDRQWGEA-----------------ESVMLILS---------ASLGLR---------AWWLHGD---RSVDEIPIQLWDTHK|
SYNW2054_Syn_33633364 RCGCLTILERTLLAS-----WPEEGCALLIGSQG--------------------EGSSLRLDHVWPGCNRWGRQPDLQ--------PWGAGETPGRDCNFLL-----DPREQLAAQR-----------------WSRQHQQWIIGVAHSHPHSPP--VPSAADRCRGVP-----------------HQLMLILS---------AQQGLR---------AWWLEED---RQVRPVPIDVD----|
RS9917_03068_Syn_87124949 GRQCLIVLKRTLAAP-----APEEGCALLLGSLV-IGGAS--------------TRSTWRVHRVWPCCNVWSPGLAAL--------PEPPDASLTRRHRFAL-----DPREQLHAQR-----------------WARARGLQVLGTAHSHPEGEP--EPSRRDLDWATT-----------------PSLMLILG---------GSGALG---------AWWIEAD---AASPLLLEHTDGEA-|
P9211_09717_Pmar_84513875 HGHSKEVLIKSLLIK-----FPQEGCALLLGKKKKETNLS--------------KNFFYEISLIWPCCNIWEPEMKSFSEECL--KEDLTQKPPSKTNRFAI-----DPKEQISAQK-----------------WARKKNLSLLGSAHSHSYSCA--NPSRLDLSWNFS-----------------PGLMIIVD---------GSGVIR---------AWWIGAS---KTIEPTEIPI-----|
Pro_1723_Pmar_33238702 HKDSERILSSSLLTV-----KPEEGCALLLGKSIKAENLE--------------GRNIFQIELVWPCCNIWSNSINADSERCYELNKTTLFKEGSRENRFLI-----DPIEQLLAQK-----------------WGRSKNLTVLGAAHSHPMGST--FPSQMDLSMNFS-----------------PNLMIIVN---------GNQKMR---------AWWLKSS---YMIDSLEIPVVRKSG|
PMN2A_1140_Pmar_72002828 HNRTNSVLSRFLKAA-----EPEEGCCILIGKTNSLTKDH--------------KRNIWEVTHVWNCQNIWGEEESRLIDQNIEAVSNQKNLQLSKKNCFEI-----YAKDQIASQK-----------------WARENDLEVLCCAHSHPLNEN--RPSEMDLLFHQP-----------------PGLMVISN---------KDGDLK---------AWWIKNK---LKFHSVKIEVFSL--|
Ava_0993_Avar_75700941 LPQHQQTILSHAESV-----YPEECCGLIMGYV---------------------ANKAKIVVEVIPTANAWETEADNFTQEIN--QTNITSPASTLKRRYAI-----APQVMLQVQR-----------------QARDKSLNIIGIYHSHPDHHA--VPSECDRLYAWP-----------------GYSYIIVSVQ------KGIASGIL--------SWSLDDH---HQFQSEIIDNITLN-|
Npun02007639_Npun_23124400 SPEHLQTIRAHAEST-----YPEECCGIILGYM---------------------AAEDKIVVEVMPTENAWNTEAGA--------EFSEKRTAESKRRQYAI-----APEVMLKTQK-----------------EARNRLLNIIGIFHSHPDHPA--IPSECDRLYAWQ-----------------GYSYIIVTVQ------NGKAGELQ--------SWSLDDR---HQFQAETIENIK---|
CYB_2073_Syn_86609521 SADHLHSIRQHGEQA-----FPHESCGILIGEI---------------------RGSDKIIHELWSVTNTWDQ------------AENPLADGESSRRRFLI-----DPADFKRAND-----------------HAVRKGLGILGTYHSHPNHAA--IPSEFDRQHAFPW----------------GFSCVIVSVR------EGKAEEVV--------SWVLDEQ---EQPQREPMQMLEDI-|
CYA_2469_Syn_86607091 SAEHLRAIRQHGEQA-----FPYEGCGILIGEL---------------------KGADKIVHELWAVANTWDQ------------AENPLADGESSRRRFLI-----DPADFKRAND-----------------HAVRKGLGILGTYHSHPNHAA--VPSEFDRQHAFPW----------------GFSCVIVSVR------EGKAEEVA--------SWVLDEQ---EQPQREPMQIWEDK-|
CaurDRAFT_0696_Caur_76258731 PDQAAAAIAAHAEAT-----YPDECVGLLVGTL---------------------NGEKKTVLQVVTLENRWSGQV----------QLAATDNPHSRRDRFYL-----DPRDYLRVDR-----------------ETRAAGYEIIGCYHSHPDAEA--VPSERDRIGAQAIGGS-------------GFSFVIQSVH------NGVATALH--------SWLLVNE---GTRFIAEEVRIITT-|
Adeh_3492_Adeh_86159910 GAPLLARISALCEAD-----PEREVCGFVVRRRG--------------------LLEVEPIPNAADRYHA----------------HDPLGFPRTSRDGYLM-----DPRAHLQLLQ-----------------ALDAEGGEVVAVWHSHVEVGA--SFSAKDRADALADGVPLLP----------GAEYLVFGVR------GGKVTEAR--------RFRFHGG---DFVESPLA-------/
N15p19_BPN15_9630483 RQKTIDAIMAHAAAE-----YPRECCGVVAQKS------------------------RVERYFPCRNLSA-------------------------------------EPTEHFHLSPEDY--------------AAAEDWGTVVAIVHSHPDATT--QPSELDKAQCDATL---------------LPWHIVSWPE----------G-----------DLRTIQPRG-ELPLLERPFVLGHF\JAB+NlpC
HK022p18_BPHK022_9634137 RQKTIDAIMAHAAAE-----YPRECCGVVAQKS------------------------RVEKYFPCRNLAT-------------------------------------EPTEHFHLSPEDY--------------AAAEDWGTVIAIVHSHPDATT--QPSELDKAQCDATL---------------LPWHIVSW----------PDG-----------DLRTIQPRG-ELPLLERPFVLGHF|
phiE125p19_BPphiE125_17975180 DEQIKKAIEAHALAE-----YPRECCGLVVKTA------------------------SGETYVPCRNLAA-------------------------------------APTDQFALASDDY--------------AAAEDAGEIVALVHSHPGASA--QPSEADRAMCERSGI--------------AKWVIVSLGV--QADGSIGVD-----------DWCEFAPAGYVARLVGRPFVHGVH|
VchoV5_01000735_Vcho_75820381 -MNWQSNLLLHAQTA-----YPQECCGLLIQVG------------------------SEKLYMACRNSAN-------------------------------------QPEQDFVIHPEDL--------------AMFESMGEIVGICHSHPDASS--KPSERDIYNANALKAEYPN----------ADWHIASW----------PEG-----------DIHSFTPSGEAYPLIGRPFIYGVM|
PA0639_Paer_9946515 SRSLQRAIAAHAARE-----HPRECCGLIVRGV------------------------RQRRYVACRNAAG-------------------------------------SPSEHFVIDHQDW--------------CAAEDQGEVLAIVHSHPDVPA--TPSMADRVSCELHG---------------LPWVILSW----------PEG-----------DVAHLAPEGYRAPLLGREFAHGVL|
RSc1695_Rsol_17428711 QETTLDAARRHAARE-----HPREACGLVVVVR------------------------GRERYMACRNVA--------------------------------------VGTEHFEMPAEDY--------------AAAEDLGEVLAVVHSHPNASA--EPSEADRVACEASG---------------LPWHIIAW----------PAD-----------DVRTITPCGYRAPLVGRQFAHGIL|
phi1026bp18_BPphi1026b_38707908 DEQIKKAIEAHALAE-----YPRECCGLVVKTE------------------------SGEIYVRCRNLAA-------------------------------------VPTDQFALASEDY--------------AAAEDMGEIVALVHSHPGASA--QPTDEDRTMCGRSGI--------------AKWVIVSL-----------GVQADGSIGID--DWCEFEPGGYVARLVGRQFVHGVH|
64_BPBcep176_77864689 DERIKQAIADHALAE-----YPRECCGLIVRTA------------------------AGDVYLPGRNVAP-------------------------------------TPTDQFALAPEDY--------------ADAEDMGEIVAMVHSHPNGTA--QPSMADRTVCERAGI--------------PQWVIVSL-----------GVQADGSIGVD--DWNEFGPSGYVAPLYGREFLHGVL|
_Ypes_2996362 ----MQEIYLTAIKR-----YPNEACGFLVRTTG-----------------------EKYRFMEARNVSE-------------------------------------NPENTFVMHADDI--------------IAAEDAGDVVAIWHSHTDESA--DASDADRAGCEATE---------------VPWLILAVRK---------NVEGDAPFHFS--EMNVITPDGFEMPYLGRPYVFGVF|
PA0639_Paer_11347692 SRSLQRAIAAHAARE-----HPRECCGLIVRGV--------------------------RQRRYVACRNAA-----------------------------------GSPSEHFVIDHQDW--------------CAAEDQGEVLAIVHSHPDVPA--TPSMADRVSCELHG---------------LPWVILSW-----------PEG----------DVAHLAPEGYRAPLLGREFAHGVL|
T1p35_BPT1_45686325 SAKIKLEIMTHAQEE-----YPRECCGVVTQKG------------------------RVQKYHRIDNVHR-------------------------------------DPENHFMMDAVQYAC------------IEDDAESTTIAIVHSHTGDGATTLPSAHDTCMCNEME---------------VTWIIVSV----------PEG-----------DMRFVKPE--KLPLIGRPWSLGSF|
_BP2120_11877307 SEEARREMLACAEEA-----VPSEMCGVLVFSY------------------------EGYEFLPLSNCAE-------------------------------------NPHETFEISADDW--------------MAAERVGEIVAVVHSHPRGEP--FLSGADRWMQVETG---------------LPWILVTQ-----------G------------RLKLFRP---VPHLRGRVFEYGKT|
Aple02001184_Aple_32034630 AEHIKIEILAHAKKS-----EPQESCGFVVSGQ------------------------DEFFYYPCENVAD-------------------------------------DPESFFEIAPEAY--------------IQAEYLGEIVAIVHSHPNGEP--ALSIADRQMQDLSQ---------------LDWWLVCN-----------G------------ELHIFPK---IQPLIGREFIHGTT|
SSO_1765_Sson_74312267 MTQTESAILAHARRC-----APAESCGFVVRTP------------------------EGDRYLPSENISG-------------------------------------EPEERFRMAPEDW--------------LRAQMQGEIVALVHSHPGGLP--WLSEADRRLQVQSD---------------LPWWLVCR-----------G------------AIHKFRC---VPHLTGRRFEHGVT|
_BPlambda_215123 MTQTESAILAHARRC-----APAESCGFVVSTP------------------------EGERYFPCVNISG-------------------------------------EPEAYFRMSPEDW--------------LQAEMQGEIVALVHSHPGGLP--WLSEADRRLQVQSD---------------LPWWLVCR-----------G------------TIHKFRC---VPHLTGRRFEHGVT/
PHG307_Cnec_38637968 PAQLIGEFAAMARAA-----HPKETGGWVVWNA------------------------DSASFRLVPVQ---------------------------------------ILEHSGGHLKYER--------------PPLAADDVLVVDCHSHGRHPA--FFSSTDDDDDCHDV---------------KFAFVMGNCD--------AATPSM--------ALRLCAK---GIFENVEKVPADWY\Div E2
PnapDRAFT_0123_Pnap_84717438 PASLWREFAVLAKST-----LPNEVAAAMVWNA------------------------EQDTWRLAARQ---------------------------------------SIEANPGFVRYRE--------------VELQEGEHFVVDIHSHGNHSA--FFSETDDCDDFGST---------------KVSAVLGCVG------QGATQVRI--------RLMLIDK---HI-DLKLTCSGGWE|
PproDRAFT_0259_Ppro_71839552 PVDMITRFMVEAKER-----FPLECAAWFTWDT------------------------YLKRFNYYSLH---------------------------------------ARQASLDNLDYAC--------------PVLPETECLVCDIHSHGRHPA--VFSPEDNNDDRGET---------------KIAVVLGRIN--------SSPSIA--------FRLCVAG---LKIPIKYNLAKLPF|
RSc1658_Rsol_17428674 TFPMVRAFIEAARKA-----APNEHAAWVVWDS------------------------RTGDLAYRELQ---------------------------------------ITDASPGAISYDR--------------PRLEDHESLVVDMHSHGALAA--FFSEQDNRDDAGEV---------------KISCVVGDLA------DGKTPSIQ--------FRLCVLG---MFLPLKVPADAVLG|
RferDRAFT_4145_Rfer_74024823 PENLLVEFLHQARAA-----APNETAAWIVFNE------------------------GDRSLRLLPME---------------------------------------YDGVTPVHLSILR--------------PMLAAGEHLVVDLHSHHQMSA--YFSRTDDADDIASNEV-------------KIAGVVGTIH--------QNPTWN--------FRMCLEG---VLMPDFLKLLRLKE|
BproDRAFT_0622_Psp._67910470 PTELRSKFVQDAKAA-----MPNEMAAAVIWNS------------------------NDHSWRYEMRE---------------------------------------NTTASTAHIDYRE--------------VHLGDGEFLVLDLHSHGTFSA--FFSQEDDRDDKGSM---------------KFSGVIGNLN------SGNMTSVL--------RMNML-G---QTWDANLASNGKLE|
RmetDRAFT_6239_Rmet_68559358 PPALIGEFTDMARAA-----YPNETGAFVVWNA------------------------RTQQFRLVPLR---------------------------------------ILAQGTGHLKYDR--------------PRLGADDVLVVDCHSHGRYPA--FFSATDDADDRHDV---------------KFAFVIGNCN--------AAVPSL--------ALRLCAK---GIFENVERIPHGWY|
RMe0063_Rmet_56410325 PPALIGEFTDMARAA-----YPNETGAFVVWNA------------------------RTQQFRLVPLR---------------------------------------ILAQGTGHLKYDR--------------PRLGADDVLVVDCHSHGRYPA--FFSATDDADDRHDV---------------KFAFVIGNCN--------AAVPSL--------ALRLCAK---GIFENVERIPHGWY|
alr7560_Ana_17134645 LEPYFRLKVPKVPCQ-----AIAEIINAASINP-------------------------QQEILFYLGVTNDQWWCHTP-----------------------------LQTASSTHVLSLES-------------ALDKSYTDGLVEMHSHGTLAA--YPSSADNQEEKGK----------------FRVFAIIGT----------LNNIPT-------IYTRIGI---YNHFFDINP-----|
Bcep1808DRAFT_6254_Bvie_67543574 PAALVAEFHAMARAA-----LPNEVGAWIVWNS------------------------VTNEFRIVALP---------------------------------------SLSHGPGHLVYER--------------PRLADGEWLVVDCHSHGTSPA--FFSRTDNQDDKHDV---------------KFALVLGHCD---------RTPSV--------ALRLCAK---GIFEKFERAPETWA|
p1B74_Asp._56315655 PLSLFERFAGIARES-----CPLEAAGWITWNE------------------------VSNQFAFREVG---------------------------------------VREASASRIHFDR--------------PRLDESEHLVVDLHSHGASSA--FFSGTDDFDDRGEVKLSIV----------LGRCDQRVVTAQRFCLMGMFVPMQ--------LASAVDG---LTFEPVPVERTRS-|
BproDRAFT_4306_Psp._67908645 PKALLDRFVLQARAA-----TPMETAAWIVWEA------------------------ETCQFRYLPLA---------------------------------------NISVSAGHVKFER--------------PQLEDGVHLVMDIHSHGELPA--FFSEQDNADDADQL---------------CISAVLGRVS-----------DET--------PQFVSRL---SMLGVAVNLMEVI-|
Daro_2537_Daro_71847774 PIRVIEAFIEAARRG-----LPNEVAGALIYSR------------------------RNQSLRLALCE---------------------------------------PIEVSPHQIDYRV--------------PTMDADETLAVDLHTHGYGSA--FWSAKDDGDDQGIKVAGVF----------GCLHQPKPQALFRLVVNGRFRPLP--------HPWQADT---DTACDVAPDLESGL/
XAC3952_Xaxo_21110358 DEGLCEQLLGERASS-----LPLETGGILLGVV-----------------------DFKLNTIHLVDGR-------------------------------------SAPRDSVSTEADFQCGSCGVQEDITEAQRRTAGMVLWIGAWHSHPKGVKA-VPSIQDQDLLSHLCTRLGAHGLP-------AVMLIAGE------------NG---------IDVFLQ----MKGS----------\E2+E1+JAB
MaquDRAFT_3597_Maqu_77955313 DVSIEGCMSSLRENE-----LPNETGGVLVGFI-----------------------DRKIKTISVVLAR-------------------------------------PAPEDSVSTPKEFLRGTAGVEEDIDECRRRTGGIVSYLGEWHSHPRGCHS-NPSTHDRIQLDYLEGVMARDGSP-------AISMIVSD------------ST---------ISVSLD----QQTTM---------|
PnapDRAFT_0071_Pnap_84717800 DASVELQLRDLRTKG-----FPNETGGVLIGYY-----------------------DFNVSAVIVVAAL-------------------------------------PAPPDSKSSPGSFERGIAGLAETVTEASRRTAGVVGYIGEWHSHPPGHSA-SPSRDDLVQLVHLALGMADDGLP-------AVQLIVGE------------HD---------LQVLQG----TVK-----------|
GOX2518_Goxy_58038271 SVGVIKAMRTYRQKA-----APNETGGILIGTF-----------------------DLVRNILHIVAAL-------------------------------------PAPPDSRQAPTFFVRGALDLSPLVDKYAKATVGRLQYVGEWHSHPDGIAA-RPSDDDEKVFAYLCRQLAPAGAP-------YAMLICGR------------DE---------TWLRAA----WQDRG---------|
OB2597_18097_Obat_84502025 DAGLRSRIAAMRDEC-----LPDETGGILLGVV-----------------------DIPARKIHLADAA-------------------------------------KAPADSIGSPTGFVRGTDGVQQMIDRSMAETQGQLRYVGEWHSHPQLVGV-MPSVTDLSQINWLATIFDMDTLP-------GLMLIAGE------------AE---------LAVVFM----RHSEA---------|
RSP_2048_Rsph_77387014 TARAFAKMAEFAAKR-----SQRETGGILIGHY-----------------------SEDLTIARIEAAS-------------------------------------DEPPDSRAGRTWFVRGQVGLAEILQ---RAWREGRYYLGEWHSHPGASP--APSGPDLSAIAKMARHPTFICHR-------PILVIIGG------------NF---------HQQPLL----SATLA---------|
pCPF5603_46_Cper_86559649 YRKAYERILNELNNS-----KPNETGGILLGNI-----------------------NKNNKTIYVTDI--------------------------------------YIPKDSKYGPYLFTKGSYGTKEYLEHVLKSTGNIINYVGDWHTHPESST--NMSSKDKKSLLELKEYLKEYSYP-------AHIMIFNE------------KD---------ISSYVI----S-------------|
RmetDRAFT_0537_Rmet_68559822 LSPVAQAIHADALRW-----GALETGGALIGRI-----------------------SFENRTITIAGLV-------------------------------------EAPPDSVREAARFVLGTNGLVQNLRAANAASLGYLAFIGTWHSHPKGG---AHSGIDRNTLRGIAEDA--GGLPAV----SLVWTPTGL------------TC---------AVDRW-------------------|
KT71_12390_Gpro_88705878 LYPVEQAIHADALRW-----GALETGGALVGRI-----------------------SFEDRTIIIAGLV-------------------------------------ETPPDSVREAARFILGTNGLVQSLRAANEASLGYLAFIGTWHSHPKGG---AHSGIDRNTLRSIAEDA--GGLPAV----SLIWTPTGL------------RC---------AVDRW-------------------|
MaquDRAFT_3270_Maqu_77955723 LDSVVSLMKECRASA-----GKNETGGAIIGRY-----------------------DPLLQTVHVTGLV-------------------------------------EPPAYAQAGPSLFQIDYPYLQEVDEDTLEQTANTLRVVGTWHSHIGAS---RPSSTDQTTFSDLSQNL---GLPF------PVMIVYGA------------DG---------LEILSE------------------|
SYN_01833_Saci_85859492 SQAALSEMHTWVRRSALIYGEKAETGGILFGGR-----------------------DNACRVIWVSEVI-------------------------------------GPPADSESSCAHFICGTNGVAEANEEKRQRTRGSTQYIGMWHTHPTSLP--APSETDFLAMHALVNA----DEPSTH---KHLLLILGS------------DS---------EQSVEL----LSGFL---------|
Mdeg<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=02000735">02000735</a>_Mdeg_48864353 EIRVLGNIRDQIERES--KGSSVEQGGVLAGMV-----------------------CHLSKTIYVTLVV-------------------------------------PAPDGTIRTPARLDIATTGLEEIFENIHSATNGQITLLGTWHSHTTPS---PPSLKDRVTYEKLAKNY---DLPV------VMLVYTGG------------RI---------ERV---------------------|
Nwi_2874_Nwin_74421925 SSRVMTKIAEEVARH-----PAVETGGVLVGTC-----------------------SARLRTIIVVDLI-------------------------------------EAPRDSVRSATRFVLGTAGLKAAIKARHRTSGGTLFDVGTWHSHLADQ---GPSALDRATARQLAAER---PPPS------VLLIQAPT------------RL---------YALMHN----GAAT----------/
_Ecol_37927531 SEVVRLLKSYRQMQY-----VSTEAGGVLIGER-----------------------RGPHIVITHISEP--------------------------------------GPGDIRTRNRFERKGDHHQLKVDELF-EQSNGFLVYLGEWHTHPEDFP--QPSYTDMKSWLTGLIA-----TET------MLLIIVGR------------KS---------EWVGIK----NGNDIKSIREK---\E2+E1->JAB
VC0181_Vcho_9654585 GHVVTRLLSYRQLHH-----LTPESAGVLIGER-----------------------RGQHLVVCDISEP--------------------------------------GSGDIRQRCRVDRRGVHHQSRVNEAF-ERSAGTHLYLGEWHTHPEDRP--FPSATDRHSWRRNIVS-----DES------MLLLIVGR------------KD---------FWLGKK----ERELITVFKKIES-|
Psyc_1371_Parc_71038911 IGVANILTSYRQLSD-----SSPESAGVLIGER-----------------------RDVHIVIKTVSEP--------------------------------------SPWDIRSRFMVDRVSKYHQKVVDDAF-KKNNGEWQYLGEWHTHPEDVP--KPSMTDYSSWHKNLKS-----SDP------LILIIAGR------------RD---------FWVGKK----IQDNIEVLKQV---|
y4qB_Rsp._16519909 PESVVEAMLKDASRW-----HDLETGGTFMGYWS-----------------------DANVAVITKMID-------------------------------------GGSEAIRTRKSFSPDREWEQSEIDRHY-RVSGRVDTYIGDWHTHPNAQS--EPSWTDRRCLRTIIR------SPEARAPRPVMILLCGG------------PE---------NWLPHAWI--GQLTRRALLFERV-|
y4qB_Rsp._16519909 PESVVEAMLKDASRW-----HDLETGGTFMGYWS-----------------------DANVAVITKMID-------------------------------------GGSEAIRTRKSFSPDREWEQSEIDRHY-RVSGRVDTYIGDWHTHPNAQS-GEPSWTDRRCLRTIIR------SPEARAPRPVMILLCGG------------PE---------NWLPHAWI--GQLTRRALLFERVE|
y4qB_Rsp._2496737 PESVVEAMLKDASRW-----HDLETGGTFMGYWS-----------------------DANVAVITKMID-------------------------------------GGSEAIRTRKSFSPDREWEQSEIDRHY-RVSGRVDTYIGDWHTHPNAQS-GEPSWTDRRCLRTIIR------SPEARAPRPVMILLCGG------------PE---------NWLPHAWI--GQLTRRALLFERVE|
ArthDRAFT_2189_Asp._66965740 PVPVLSDLVEQARLY-----APAETGGILVGHYTVTKPN------------------GQRDAVVTDVIG-------------------------------------PGPAATRSRIAFEPDTEWQTAELSRVY-ALRDRRVSYLGDWHTHPTGQP--VPSLRDLKTLETIAA------HTAARCPEPFMAILGKE------------GM---------EQDWNIAV--CQHEALGRIRNIIP|
GuraDRAFT_0478_Gura_88937752 VEVIASIHGYIQNDR-----HKPEGGGVMLGRYII----------------------DSQDVVIDKISF-------------------------------------PMPGDRATRTTFFRKKRAHQQVIDRAW-EASNHTCTYLGEWHTHPEPHP--SPSSIDDTNWKRKLK------NDIVDSDSLFFLIVGTS------------EM---------RMWEGHRR--SRTITMLKLL----|
_Cper_86475967 DNLIACMDSYKQLNS-----NDKEKGGILIGY-IT----------------------TDNNIIIEYITE-------------------------------------PFDSDISKRFSFIRRDINHEKVLNNIW-ESNGKMHTYIGEWHTHPEDYP--NFSSIDKKNWINLGK------KIHPSKRYYINIIIGNK------------DL---------RIWEYDVK--NEKIERIK------|
ELI_01185_Elit_84786147 EASVMDALLAETSLA-----HPLECCGILLGEH-------------------------NHITAIQP----------------------------------------AANVHPQPQTHFEIDP---QTLVDAHR-AGRNGGPQVLGYYHSHPTTVP--EPSATDAAMAAQDGS------IWAIIGQGEIIFWRDRN------------DG---------FAQLSYSI--LDG-----------|
RHE_CH01996_Retl_86357616 SPSELATLTTALRSA-----GDKEIGGQLFGEQIE----------------------PSHFRVSTMTIQ-------------------------------------ARRGTFSRFLVDILQAVRDATRFFDRT-HHQYRRYNYIGEWHSHPSFEV--RPSGVDVQSMRDLVR------DPDFKGSFAVLMIVRLR------------AD---------KFEAGAWL--FDPRGFEQNVKLEM|
OB2597_05125_Obat_84499282 ---------MALNEG-----GHREIGGQLFGEQLA----------------------PSQFLVTNLTVQ-------------------------------------ARRGSYTRFIVDLFQAARDAMRFFDST-QHDYTRHNYIGEWHSHPSFKV--RPSGTDLTTMRELVR------DPGFKGTFAVLMIVRLD------------AD---------CIAAAAWN--FDPLGREGVAQLEI/
mll6193_Mlot_14025926 HCISTVHAHLRSVGR-----EGNEGMALWVGVQ--------------------QDQHFAVTETVLPAQR-HIRTGDGVCV--------------------------MVPAEELHRLNV----------------WLYNSGLKLLAQIHSHPGRA---YHSTTDDAYAVATTVG--------------CLSLVVPNF---AREPFDFARVAAYRLDGKANWNALPS---AALSRMITITS---\E2+E1
RHE_PA00015_Retl_86359720 AAVNDVHEHLAEVGR-----SGYEGLGLWVGTV--------------------AAEIATVERALIPQQR-LIRSAAGVGV--------------------------HVDGTELHRINM----------------WLFDNGLRILAQIHSHPSDA---YHSDTDDEYALATAVG--------------SLSLVVPDF---ATGPTDLSQTAVYRLDKAGKWMAVSQ---ETVNRLIEIVD---|
msi104_Mlot_20803931 HCISTVHAHLRSVGR-----EGNEGMALWVGVQ--------------------QDQHFAVTETVIPAQR-HIRTGDGVCV--------------------------MVPAEELHRLNV----------------WLYNSGLKLLAQIHSHPGRA---YHSTTDDAYAVATTVG--------------CLSLVVPNF---AREPFDFARVAAYRLDGKANWNALPS---AALSRMITITS---|
Magn03005842_Mmag_23011187 AILAETLDRLRVGGR-----RGEERAVLWLARS--------------------ASAAPTPVQEVYEPEQ-ATAE---DYF--------------------------HLPPASMRALMG----------------HLRAHRLKIVAQVHTHPGRA---FHSEADDAWAIVRHRG--------------ALSLVLPRFAATATPGTFLEEAMVYELSDAGLWEHVRR---PGERIRIEVTP---|
RPDDRAFT_1996_Rpal_77690159 SVLERTISIIRRDGN-----RGEERVALWLATA--------------------AQRSPAAIVEVYEPEQ-VVEV---DSF--------------------------YIPPASMRALMN----------------HLRSTRRRIAAQIHTHPGRA---YHSDADAKWAIIRHSG--------------ALSLVLPHFANATTVENFLEEVMTYEYSPAGEWIHCPN---VGAGARVVVTA---/
alr7504_Ana_17134589 --QHSNYLHELLLTI-----DGKERAAYVLCGQAVINADPWDGQP--------HQKFISYEVIPVMPED-EIVSFSAKHI--------------------------TWKTDSFVRAL-----------------QAAQAKNLTLAVFHSHPEGLR--EFSIQDDTNEPDLIQLAQNRNGSDTQI---LSVILMPD------------GN-----LIGRLWVSSQE---VISLRIIRVIGQKI\JAB+E1
sll6053_Ssp_38423902 -ESHLQELRKSLWHS-----DGKEKAAYLICGEVSIQADPWTSMP--------RKKYLSVEVIPIP-DN-EIVSHSPQHI--------------------------TWSTDSFVRVL-----------------KLAQQKNLTVAIIHTHGKNGA--RFSEQDDVNEPDLVQLAQNRNGQDTKL---LSLILTAD------------GD-----LVGRCWFNPKE---YQPLDLIMCVGDRL|
Bcep1808DRAFT_3227_Bvie_67547439 SGKHRTQLRRHLSPG-----DGKEAVAIALCGQ-------ASGVR--------RNQLLVHEVVEVP-YE-ACRIREPDAV--------------------------AWSVEAVLPAL-----------------NQAIKKNLTVVKFHSHPSGYP--EFSRYDDESDRAFFSAVDNILDNVDRR---ASVVMLPD------------GR-----PFGRHVQNGIL---GEPIDLFRIAGDDF|
PnapDRAFT_3951_Pnap_84711629 -ESHEAALRALLHRE-----NGSEAAAYVLFGKAEIAADPWSKQP--------RIRLISHEVVPIT-SD-EMVSSSSVHV--------------------------TWSTQGFMRLL-----------------GLAQHRNLVPALVHTHPGAGA--FFSDQDDRNEAELARTTFNKGAQG--L---ASMVFGQH------------DA-----IVGRLWKSAKA---STKASSISIVGSKI|
NhamDRAFT_1903_Nham_69928900 PVGVHTALRAHLFPG-----DGNESAAILLCAA--------GPGR--------RLKLLARELIPVP-HE-ACSVRKPDRI--------------------------TWPGRWIEEAI-----------------DRGEKEGLHIVLVHSHPGGLF--EFSAADDASDSVVVPGLFAAYDAR--H---GTAIMTPD------------GR-----MKVRFYDHDLQ---PTVVDLVMVPGDDI|
Adeh_2929_Adeh_86159351 VADWVLRALDAELGG-----HPPERGGALLGPPG-------------------RPLLTRFEPDPGA------RASASQWA--------------------------PSAGLGARVAA-----------------LERGEGLELKGLVHSHPGALD--QPSAQDARELAAGLAHNPHLGCYLGPV---VSLAPAGA------------PG-----AHEVALPRGKL---SLFAARRSRGGGTE/
Shortened alignments used for figures
1.
UBC/E2
Helix-1 Str-1 Str-2 Str-3 Str-4 | * * Helix-2 Helix-3 Helix-4
Secondary Structure -hHHHHHHHHHHHHHh--------EEEEE----------------EEEEEEE--------------EEEEEEE---------------------------EEEE-----------------------------------------------------------HHH-------------------------------HHHHHHHHHHHHH-----------------------------------HHHHHHHH--h---hhhHHHHhhHHH
1ayzA_Ubc2_Scer_3659954 TPARRRLMRDFKRMKE---DAPPGVSASP----------LPDNVMVWNAMII----GPADTPYEDGTFRLLLE------------FDEEYPNKPP-----HVKFLSE---------------------------------MFHPNVYAN---------GEICLDILQ----------------NRWTP------TYDVASILTSIQSLFN---------------DPNPASPAN-----------VEAATLFKDHK---SQYVKRVKETVE
1Q34A_Ubc_Cele_34810893 TPSRRRLMRDFKKLQE---DPPAGVSGAP----------TEDNILTWEAIIF----GPQETPFEDGTFKLSLE------------FTEEYPNKPP-----TVKFISK---------------------------------MFHPNVYAD---------GSICLDILQ----------------NRWSP------TYDVAAILTSIQSLLD---------------EPNPNSPAN-----------SLAAQLYQENR---REYEKRVQQIVE
2E2C_E2-C_Ssol_4388942 HSVSKRLQQELRTLLM---SGDPGITAFP----------DGDNLFKWVATLD----GPKDTVYESLKYKLTLE------------FPSDYPYKPP-----VVKFTTP---------------------------------CWHPNVDQS---------GNICLDILK----------------ENWTA------SYDVRTILLSLQSLLG---------------EPN-NASPL-----------NAQAADMWSNQ---TEYKKVLHEKYK
1QCQA_Ubc4_Scer_5107650 MSSSKRIAKELSDLER---DPPTSCSAGP----------VGDDLYHWQASIM----GPADSPYAGGVFFLSIH------------FPTDYPFKPP-----KISFTTK---------------------------------IYHPNINAN---------GNICLDILK----------------DQWSP------ALTLSKVLLSICSLLT---------------DANPDDPLV-----------PEIAHIYKTDR---PKYEATAREWTK
2AAK_Ubc1_Atha_2981894 TPARKRLMRDFKRLQQ---DPPAGISGAP----------QDNNIMLWNAVIF----GPDDTPWDGGTFKLSLQ------------FSEDYPNKPP-----TVRFVSR---------------------------------MFHPNIYAD---------GSICLDILQ----------------NQWSP------IYDVAAILTSIQSLLC---------------DPNPNSPAN-----------SEAARMYSESK---REYNRRVRDVVE
1PZVA_Ubc_Cele_34811307 EQSSLLLKKQLADMRR---VPVDGFSAGL---------VDDNDIYKWEVLVI----GPPDTLYEGGFFKAILD------------FPRDYPQKPP-----KMKFISE---------------------------------IWHPNIDKE---------GNVCISILH---------DPPEEEEERWLP------VHTVETILLSVISMLT---------------DPNFESPAN-----------VDAAKMQRENY---AEFKKKVAQCVR
1I7KA_Ubch10_Hsap_13786748 GPVGKRLQQELMTLMM---SGDKGISAFP----------ESDNLFKWVGTIH----GAAGTVYEDLRYKLSLE------------FPSGYPYNAP-----TVKFLTP---------------------------------CYHPNVDTQ---------GNICLDILK----------------EKWSA------LYDVRTILLSIQSLLG---------------EPN-IDSPL-----------NTHAAELWKNP---TAFKKYLQETYS
2UCZ_Ubc7_Scer_2981900 KTAQKRLLKELQQLIK---DSPPGIVAGP---------KSENNIFIWDCLIQ----GPPDTPYADGVFNAKLE------------FPKDYPLSPP-----KLTFTPS---------------------------------ILHPNIYPN---------GEVCISILHSPGDDPNMYELAEEEEERWSP------VQSVEKILLSVMSMLS---------------EPNIESGAN-----------IDACILWRDNR---PEFERQVKLSIL
1J7DB_hUbc13_Hsap_15825811 AGLPRRIIKETQRLLA---EPVPGIKAEP----------DESNARYFHVVIA----GPQDSPFEGGTFKLELF------------LPEEYPMAAP-----KVRFMTK---------------------------------IYHPNVDKL---------GRICLDILK----------------DKWSP------ALQIRTVLLSIQALLS---------------APNPDDPLA-----------NDVAEQWKTNE---AQAIETARAWTR
1JASA_Hsubc2b_Hsap_34809571 TPARRRLMRDFKRLQE---DPPVGVSGAP----------SENNIMQWNAVIF----GPEGTPFEDGTFKLVIE------------FSEEYPNKPP-----TVRFLSK---------------------------------MFHPNVYAD---------GSICLDILQ----------------NRWSP------TYDVSSILTSIQSLLD---------------EPNPNSPAN-----------SQAAQLYQENK---REYEKRVSAIVE
1KPSA_Ubc9_Hsap_20150955 GIALSRLAQERKAWRK---DHPFGFVAVP-----TKNPDGTMNLMNWECAIP----GKKGTPWEGGLFKLRML------------FKDDYPSSPP-----KCKFEPP---------------------------------LFHPNVYPS---------GTVCLSILE-----------EDDDDKDWRP------AITIKQILLGIQELLN---------------EPNIQDPAQ-----------AEAYTIYCQNR---VEYEKRVRAQAK
1JATA_Ubc13_Scer_14719686 ASLPKRIIKETEKLVS---DPVPGITAEP----------HDDNLRYFQVTIE----GPEQSPYEDGIFELELY------------LPDDYPMEAP-----KVRFLTK---------------------------------IYHPNIDRL---------GRICLDVLK----------------TNWSP------ALQIRTVLLSIQALLA---------------SPNPNDPLA-----------NDVAEDWIKNE---QGAKAKAREWTK
1KPPA_Tsg<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=101">101</a>_Hsap_9789790 YKYRDLTVRETVNVIT------LYKDLKP----VLDSYVFNDGSSRELMNLT----GTIPVPYRGNTYNIPICLW----------LLDTYPYNPP-----ICFVKPT----------------------------SSMTIKTGKHVDAN---------GKIYLPYLH-----------------EWKHP-----QSDLLGLIQVMIVVFG---------------DEPPVFSRP-----------ISASYPPYQAT---GPPNTSYMPGMP
1UKX_RWD_Eif2ak4_Mmus_7305017 YSQRQDHELQALEAIY----GSDFQDLRP-------DARGRVREPPEINLVL--YPQGLAGEEVYVQVELRVK------------CPPTYPDVVP-----EIDLKNA---------------------------------KGLSNESVN---------LLKSHLEEL--------------AKKQCGEVM----IFELAHHVQSFLSEHN---------------KPPPKSGFH-----------EEMLERQAQEK---QQRLLEARRKEE
_Rsp._22726448 TAGEARLIRECEELAS---LAAASAWLEEP-----QFGKNADGLLTWSFVLL----------AGDRRIPLRLV------------FPALFPDLPP-----FVLPADS-----------------------------SVRLSQHQYGEG----------GELCLQYRP----------------DNWHP------DCKSADVVRSAKALLE---------------ATPKDDGFS------------DVESAHPTDL---PSLLSGCSRRFM\
OB2597_05120_Obat_84499281 LVDSARLAAERRSIEQ----AAAGEWFRFA------RWTLHHGLVCVEGEIL----------AHDNTYPVRLI------------YPDQFPLVPA-----WVEPAEK------------------------------ARWSSHQYSG-----------GSLCLELRP----------------DNWIP------TATGADVLESAFNLLH-----TEDPLGEGGATAPSDHRVG------------EVQTYGDLHL---PALIGAGCLDRL|
RHE_CH01997_Retl_86357617 LNNTVRVAREKEAVEN---LATETEWFVLD------RWEIHDYKFAAIGSIV----------AHGATYPIRLV------------YPDNFPLVPA-----WVEPQDP-----------------------------EAKWSYHQYGKG----------GALCLELRP----------------DNWTS------RANGADVLRSAYGLLN----LENPLGDGEKGKVTSAHNVG------------EIQKYNWGES---PVFIGQECLTRL|
y4oA_Rsp._2496721 RLTEVNVLKRGSDQDN---WWQAYPGLYAR-----ELAAYEGHGASHRPLIQ----------QDGTLILEVLWP-----------MDSAGSIRLN-----VGYSPLH-------------------PFCRPSISAPELQLERHQNPFT----------RDLCLLTQDS---------------AQWYPH---QMVADFIAERLSQVLQVM-------------------T----------------LRRNEQWSEA---ASLEEQAPDPVT|
y4qC_Rsp._2496738 PAGRRRLAELQKLHSA------AGESLLVD-----EEAAAAGILRIEFSWPL----------NDGRTIGLRAV------------YPDTFPRLRP-----HVFLTCD----------------------------PSEYPERHCGSE-----------GALCLLGRDT---------------RYWQAN------MSLAELLDENLAHVL---------------DGT-------------------GAEDPQGEP---IEYWWNSLGQAS|
ROS217_07909_Rsp._85706659 RTAQDHSAHDFGVMDA---WERVREVLAGH-----GFTLVPGSGRDRYQGQI----------KVGSVPVSLEIE-----------IADYDFLDLP-----KVRVLKR--------------------------EALPKRLTGHIVSD-----------GTLCYADKAT---------------FLLDRY----QPDRSVVSCLEQARTTL---------------NTLLHG---------------NPSVAYMAEL---AAYWSATPYCL-|
_Cper_86475968 -MVILILDLFNSLNSF---ENIKNVKEIKK-----NNDNFEVNYSKIYEFTL----------NIQKQNFDIIMC-----------IPEEWNLKLI-----DFYIKDY----------------------------KNIKFIPHLEEN-----------GKICLFDKEG---------------LLVEEN----LNGIAIESIERLNKVLY---------------EGLNDI----------------NKLDFINEF---DAYWNLLSTNNI|
GuraDRAFT_0469_Gura_88937743 DESLLKEALETCLLVK---SVAELHPKRLA-----EPWAKDRFVCRSYKLVI----------ELNGVPVDFYFG-----------VKKSFPLSLP-----YIFLAQW----------------------------DSFGILPHVETD-----------GYICYAQEDG---------------SVLDFD---DVAGIAQEALSRAIQVVV---------------DGISGK----------------NHQDFLDEF---GAYWDRLKKVKF|
Psyc_1372_Parc_71038912 -MMSELHQTMLSCGFK---YLKNSQRQSIS-----FFDSIPTTRPIYVKDYK----------TSEGIFNVALV------------FGDDLYTTLP-----RAQVLKK-------------------------PKKIEQVLLPHINSG-----------GYLCYVEEKE---------------ADWNPN----NLNALYRAVDEQVQNTL---------------NTAISSLQNG----------QIDQAEFEGEF---VSYWKPEQTIY-|
ELI_04040_Elit_84786718 -----FRFRMMSLADR---WRAIAATLANK-----GFTEQQGASPEFRGSIN----------VHGRAVDIELV------------IPDSKFVELP-----IVRLVDR--------------------------KQLPAGAFGHISRDDIEG-------SVVCFAPATG---------------LPLDFH----DPGGSVLRVLRQTELSL---------------EKSFAGQG---------------GAEVAAEY---QEYWIEKEPNFR|
_Ecol_37927532 MKDGQLHQVMTGCGYR---YTRARNLPEKS-----ILHSRERGAGYYTKEYA----------TDAGNFNVALV------------IHPDPFTELP-----TAFIIEQ-------------------------PEQFKSCLMPHVALE-----------GFLCYVEQME---------------ADWDSN----DLEATYKEVDAQIHQTL---------------IDSVSAATQG----------VNDKRELEGEF---AAYWRPSETLFL|
VC0180_Vcho_9654584 -MKQELHHTLLGCGFR---YTPAKQMPKGI-----LLDTKSRRKGYYVKEYS----------TKGGVFVIALV------------LWNDPHIQLP-----FAYILQQ-------------------------PEQYKGRLLPHINFG-----------FCLCYVTQME---------------ADWNSN----DLKSTYQDVDEQIQLTL---------------DNSVASVESG----------TSNDVELEGEF---SAYWQSEEELYL|
PB2503_00627_Pber_84701417 GVISEARTALADRLGA---YLLSAFDAQPF-----SASDLQAYNGKKVDRGW----------RLPGDPPLHLL------------LDPEFPYAPP-----RIALPDE----------------------------TQRLLWPHVETA-----------GLLCVFPTQ----------------TNIDAF---EPEKVATALITDARDLIT---------------RNQSGD----------------LDEEFRKEF---QSYWTLAIDDKA|
Shewana3DRAFT_3199_Ssp._78684828 ------LERHRGHSVL---SEIKQHLINQG-----FNCTTSEVAGGERIVVE----------TTILNHGIQLML-----------VADPPYYRLP-----EFFLINP----------------------------DSIGRLAHVSVHEYAGIQI----GTVCVNAPES---------------LSVNFE---QPLLVVEESLRRHILLLE---------------KCITNPD--------------WNHSELLREF---SSEWLRICAPDS|
ArthDRAFT_2172_Asp._66965723 WERYAGLLQSEISWLQ---DLGIACRIDET-----KRDDHQTLTMELSVPET----------VTGTAPLELTAV-----------FPDFYPLVPP-----KVFAVDL--------------------------------GMPHHWNPFS---------NEVCLLGTPS---------------EEWGTN------GSLAQLLKDQLPAAL---------------KAGMSGDEH----------ADWNEKPQAEPF---GAYYNSYANSAM/
Mdeg<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=02000735">02000735</a>_Mdeg_48864353 IHDVIRWLDETRSVAG--IQTVTSSDDGV------------VVATNWRVDLP--IRFESEGETESGIRSIEAV---------SWVFPWEYPLRAP-QPKLREDFPLT---------------------------------LPHINPVVEGED------ISPCIAEVDL-----------TDLLHSSGI-----EAVFGAMTHWLNNAASGEL-------------LCPVQGWE------------PVRRDNASGLI---SADTYAIREELN\
SYN_01833_Saci_85859492 AQEELREIEAASEGAF--EVLSVRFPEGD------------HRSAIAEISVT--CFDMPYAEGGIKLRDRERF---------LIYIPPDFPFDVPSVYTPHRRFSG----------------------------------NPHVQWQ-----------TYLCLYQSRN-----------TEWDASDGM-----FGFISRLELWLRRAALNQL-------------DMEGAPLH------------PPVAYPTERIT---------------|
Mmc1DRAFT_1998_Masp_68246513 ALEQVADIVAASNGTV--ELVQIDPPTSE------------GDTLLLRVSID--TSDYTFQKGGLKFRKREGF---------HIRVSSRFPIEPPIAKFTHQRFMG----------------------------------QAHVQWG-----------NQICLYLATD-----------VEWSASDGM-----FGFIKRLDQWLGDAAQDQL-------------DPDDAPLH------------PPAVYHSSDTK---FSVEIDTPELAD|
pCPF5603_46_Cper_86559649 NDDFTMFYKGLLECKN--VKNITIYKLNI------------NSVIIRLELKI--NLPSRRSLMEFDIKEFEPIK--------LLCSTNEIKYKAPLVFSDRNDFPVE--------------------------------KLPHTLAMGLNY-------SYICLHRGNI-----------DDWYIDHSV-----EDFVNRIRFWFSDAACNNL-------------IKPGDDFE------------PMINYTETGNI---VYSYNKLTKFIE|
RmetDRAFT_0537_Rmet_68559822 IADALHQLQRHRGLIR--VGEPRTTGAST------------EIEVDVAVQLP--NRSRRNGISETGVRTVETC---------VLVFGSDWPLSAP-EPFLRADFPLN---------------------------------LPHINPHRQGEL------VSPCLFEGSL-----------NELLHRFGL-----DAVVDQLIDWLHKAAAGTL-------------LDLEQGWE------------PTRRDSCPSTV---VFSAEKVAAAAP|
MaquDRAFT_3270_Maqu_77955723 HIQMLVAAILQHQRSE--DHQVTERENEL------------VLDVSWRVQLS--SRDVEVGQSGTGIKRLEPV---------RFLIPFAFPLRPP-DITLRSDFPRE--------------------------------FVPHIYPGSPGDP------VCPCIAEVGI-----------TDLMFQEGI-----SGVLRSLQAWLDRAAQGTL-------------MDPSQGWE------------PILFQNIAGSF---LDDKGSFLRGVR|
Nwi_2872_Nwin_74421923 AERFLAAALRHPECRG--GRLISVDAGGS------------RIELDLNVEMP--LAFKVDGASPNGVRVVETV---------NVRLWPSYPWSSP-SFYLRMDFPRD---------------------------------LPHVQPGPVTEP------PRPCLIDGNQ-----------REYFFQFGLVELGIFNLVHQLVLWLQRAAEGTL-------------IHHGRGWE------------PTLRCDLNDVI---ALNAEACRAVVD|
XAC3952_Xaxo_21110358 DGRMQALLRACNAHAD--INVVELRRIED------------PFIAEIIVADV--GDGAVSPGNDAGIHRIERM---------ALLYRTGARFPFE-ARPLRKTFPKA----------------------------------LHQYATGNEGP------PSLCIMEGDW-----------ELAEHRFTP-----EALLETLLAWLEKTADGTI-------------HEADRGLE------------PVFYSLGQCLM---LPPDFAEALSDP|
MaquDRAFT_3597_Maqu_77955313 NLPEPLSDLADACNDN--SDFDIVEFRRI------------SKDSYALVVDA--GDGTFDAENPVGIRRIERL---------AFVLNPNLGFPWE-VRALRSDFPVT----------------------------------MHQNHVEPNSP------RSLCLYVEPW-----------SSVERTWSP-----QSFLARALWWLRETACENL-------------HQANQPLE------------QLFFEPADQFV---LPEDYFERLTDT|
PnapDRAFT_0071_Pnap_84717800 RAKTLFDVVSRQRDYA--VVQLLQHCDDG------------TPKLECIVVEV--ECDGVPPKNGVGINYRERL----------ALCVSDDPKQLIEVLAMRKDFPVL----------------------------------MHQNQGILDAP------ASLCLYFESV-----------AAVMRTWTP-----QSFLRRIQWWLEKSARGEL-------------HPTDQPVE------------HLFFATRYELV---LPWNLSTLRKSA|
OB2597_18097_Obat_84502025 LTSSAAASFARFVDRH--AAELAAIVALR------------RGGAGELVELA--FRTGRPQQSVVPIRRTERI-----------GVRFAGGDSMPFVYVLRSDFPDT----------------------------------AHQNLTAEGSP------RAICIDDRGW-----------AEARLTWTP-----AELVQRILAWFRRAAEGAL-------------HDARQPVD------------PLMFGTGYNII---MSRALIDNANTQ|
GOX2518_Goxy_58038271 RSRLARSVIEYVCDSV--EHPYATIQEFQ------------SDGLSDIVDLE--LEIDLAQDRAVPIRHREPV---------RIVFASPDDLIAPRVLSLREDFPSG---------------------------------QVHTNLDREVDG------LCLCIWEEGW-----------HDLSRNLTG-----QALVERIRWWFAGMADGSL-------------HADDQILE------------PLVATTSDTIV---FPLGTFVGPWFI|
RSP_2047_Rsph_77387013 DEEIPDVLHPVTSLLR--IGVGPVTALEG------------WKEWRRGFFSL--PLVARVTISPGQSFPAESR-----------WHLVVSSGSYPA---DIFILPDK--------------------VAGPNLT------FPHQAAVYSRDGKEPWLNGEPCLTDPTAAFGDR------HGSRPEPIAL---ADRLIWKVERFSRWCELAAA-------------GRLHNPGD------------HFELPPLSGHT---NPMTIGFHETEG/
Ava_C0067_Avar_75705484 EREGKESKYKFLSPE-----AVEKAFTSK-TAAS-------GWLSSNTIWWG---------KNPEGEAIIQFYSPQKYQIQIMGQEPEVITVPMP-----AFLFAGCSS----------RYYLWAIKGRVF-KPDTQLYKPPLPNVWED---------SSICFGG-------------------NSLS----MCSAATISQVWDLFWKSPFNKDLSQGKS-----KTHPDNIC--------------NQLIKLHESKA-KSYPSSDLVPVH\
alr7559_Ana_17134644 EREGKESKYKFLSPE-----AVEKAFTSK-TAAS-------GWLSSNTIWWG---------KNPEGEAIIQFYSPQKYQIQIMGQETEVITVPMP-----AFLFAGCGS----------RYYLWAVKGRVF-KPDAQLYKPPLPNVWED---------SSICFGG-------------------NSLS----MCSAATISQVWDLFWKSPFNKDLSQGKS-----KTHPDNIC--------------NQLIKLHESKA-KSYPSSDLVPVH|
p1B75_Asp._56315656 TVDGLRKMFDSLDPS----RSARPVFLEP-NVLS--------QGPGWLVWWM-----------KPQTRRVWFES--------KEIKLETAEVPHP-----GLVFAVTQE----------EWRVFAVQGRSRPRPGTKLYQAPYWNVWKG---------GRICAGS-------------------ARLP----SAGLQADPSGWEESFFSSR--------------FSHPNIHEKDALVKYKGG--SAKFWNAMLSGKF-KSFPQEVLVPAE|
BCE_A0096_Bcer_44004435 TFKDFYLALKEVMEQGTQDNTHYSSGVLPKGCIKH--EVLSKSGDKQAVWIE----------VPKAQWDIHFFE------------RPFQQVGFP-----RLLFRYTVYQKRVT-----NISVFAVKEDMELEEGMKLYQFPYSNVHPS---------GSVCTGR-------------------VVIP----EFRTLKDLETFHVLFFASS--------------FNHDLTHTHTEP--------VGELFKRFEN----QSFDDSILMESE|
BT_2648_Bthe_29339960 TYEFMNSLVESYTES----MSGIPHGRIPGNMLLC----DSRKGRERYIWYN-----------PPQKRKMYFQD---------GLHITDGTFNVP-----GVIYVVERE----------CMDIHAFKGA-IPEERTELYLAPFFNVAG----------ANVCLGSSS-----------------PKKPQ---DMDFLEFQEYWEKRFWMSE--------------FSHLGGNRNP----------TRSNLVSVTEHARNNPFDYSELQQSG|
BproDRAFT_4305_Psp._67908644 TLTSKNLKLLAQQAQQ---GLKQDFEVIPANVLV--------ANDSLLAWWM-----------PKGTQLMSFDVSMHELAGKSRLQGVSGNVPTP-----ALVFAMMRNRNAGGAFE--GLYVFALEKSERPTSDTSLYRAPLLNVGED---------GSVCWGD-------------------GVKP----AGKTVKDISAWQALFFSSV--------------FTHYNGTVPIVGDD------PYAFIADLMETEA-KEFPAAALKPMK|
RferDRAFT_4144_Rfer_74024822 KKDSLMAALRQLARQQ---GISDLVWVDD-QTIA--------TSSTLQVWWT-----------PAQSRWMHFQS---------QGLQLSLPAQNP-----PLVWLACGE----------CLMVFALKENIKPGPTTALHHAPLFNVFAN---------AEVCAGS-------------------MQKP-------KDGNAKEWVESFYAAT--------------FTHANPPSRRLTTYRQG---EKALWKHLMTSKKKPAFPTDKLKPFG|
BproDRAFT_0623_Psp._67910471 TEADYLAMVKVLAPQQ----RPQMEWQDH-CILA--------KGMGKMIWWT-----------PPMNRAMFFKKS---DMFGATTFSGQGICPLP-----GMVWMSDGR----------DLFVYAYRGSAMPGKETRLCQAPLFNVWAR---------GEVCVGN-------------------ASRP----DDSAKGNPQAWERFLFDSH--------------FTHPNFAQVDRLTKGVK---PAEFWKKMVAKP-AQKFPESVLVDLE|
PnapDRAFT_0124_Pnap_84717439 TQSDLNELVTGLSQSQ---SLSVPSWIDT-TMLA--------LGAGRMIWYT-----------PACQRAMFFKTS----SFTKDTFEAQGQLPTP-----GLVWLVMQG----------ALYVYAYKGSGRPDKETKLYQAPFFNVWSQ---------GKVCTGN-------------------AAMP----VGDNAAIPHMWVDAFFGSN--------------FTHPNFKEKDRLVKGVC---PIDFWKAMTEKP-LPVFPEGRLVDLP|
RSc1659_Rsol_17428675 SLGELSEFVEAAQTA-----TAYRGFIEP-HVLY--------LAPNTVAWWR-----------PAAPRTVWFSAE-------KPIGTRHGVTAHP-----PLVFIVHER----------QWYVFALAKNERPAPNTPLHVAPYFNVWER---------GEICTGN-------------------VSLP----DRPAPDALKAYETAFFDSR--------------FTHPNHARITRHKDG-----GGALWAHLLDHPEITEFPATALLPRK|
RmetDRAFT_6238_Rmet_68559357 NRMALIHAVRQVAANA----LPKGEFLTP-NVLS--------ISATTVTWWC-----------PAASRRVFFKCE--------EFGERNAIVAHP-----ALVFQASHS----------GFSVFALQGEDRPGPETALFEPPYFNTWDH---------GRICIGS-------------------AQVP----KQIDVASISGWEEGFFNSA--------------FTHPNHGGKRVAYERG----VYAFWKDMLDGKF-PDFPKQVLVPMK|
PHG308_Cnec_38637969 NRMALIHAVREVAEAS----LPNGEFLTP-NVLS--------ISPTAVTWWC-----------PAAQRRVFFDCK--------EFGKRSAVVPHP-----ALVFQASQS----------GFRVFALRGDERPVPASELCEPPYFNTWDH---------GKICIGS-------------------AHVP----KQIDVASIAGWEAGFFNSA--------------FTHPNHGSKRVTYERG----AYAFWKDMLDGQF-PDYPKQVLVPMK|
Bcep1808DRAFT_6253_Bvie_67543573 DRKVLVQTLQQLAEHV----APRAEFLPA-TVLG--------VSPEAVTWWC-----------PPAMRRVFFECE--------NLGKRSAVVPHP-----GLVFQALNQ----------GFRVFAVACSDRPVRETPLFEPPYFNTWDM---------GRICIGS-------------------AQVP----KRVDVASIDGWEAGFFDSA--------------FTHPNAGGKRIEYKDG----EYAFWRDMLDGKFGETFPLNALVPMK|
Daro_2538_Daro_71847775 TPRAAMDLAKALLKR-----AAHGGFLPE-TVLY--------MDGDLIVWWM-----------PPARRHIAFRVD-AEQAEAFGGQERGESVPHP-----GLVFAASSR----------VWRVWAVKGAGRPTPATALFQVPYFNVNVQ---------GNICHGN-------------------APVP----EGTTVEKIAAWNDAFLRSY--------------FTHPNGPGKLIRYRGG----AYTFWRDMLDGRF-QRFPERVLVDVK|
PproDRAFT_0257_Ppro_71839550 DVEMLGTLINALGRN-----VSIGGYLPP-NILS--------VGFDSMVWWV-----------KPSKRRVFFKTN------EEIIGERSEVVPHP-----GLVFGVNGSG---------VWAVCAVKGNTRPTEDTPIWQAPYFNVWSS---------GNICTGT-------------------IETP----KSVAVTETGKWEECFFSSY--------------FSHPNAHGSRQLINSRIN--PYQFWKTVLDGKY-KTFPTQKLVQTN|
RBTH_06715_Bthu_75758403 NTLFEFVQKNCYETKTNTKKLDIPVFETP-A-----------LPPGTVKYMALPDGKI-----VLFMEKKEFKHNL------TYHSTKYKQIPFP-----NLLFVFVFRPNGDKYILE-NKRCYAFRDKVF-RDTTKLYRFPFSHVQKD---------GEMCFFF---------------------LT----EMQDLAQMSSFIHNWLSAA-------------FTDHYYNLENKNKW-------GWPLRQIFSETQGQPHFNYDKLIEED|
RBTH_07326_Bthu_75758953 NTNIETIQQIFMKEQA------METPLLP-------------SQWGVVKYYRKNHYEGYVLTTPPTERVVKFDIG------RSSELPTEVTLPIP-----PMLWVFEVMTDQSGKKKLTHSMTYVIKHELL-SLKDKVFHAPFCNIGIS---------HGICWGR--------------------TLP----EVPIPKSIQSIPARFFSQPFNYDLSGNRVKPFEWTHPNGNTEDTECAVYHMMNEADKLKAAKEAGEAYSYPFDSLKPAG/
PnapDRAFT_3950_Pnap_84711628 TEGLAALPEADQRYLD---SHGFTVEVVS----------DGPHTGVVLKQMQ----LPQGK-FNHPAADVLVI------------LPPGYPDVAP-----DMFFCNL--------------------WLTLVSAGRYPTCADQPHTFM----------GHNWQRWSRH--------------NNSWRP------GVDGLHTMIKRIEHALAEAK---------------------------------------------------------\
sll6054_Ssp_38423903 --VMTFLPESDRQYLA---NKDYTYEEIT----------EGSRKGLIFSKFP----LPNQK-YDVSEVDLLIL------------LPNGYPDIVP-----DMFYLEP--------------------AVKLVQGNRPPRATEARQQFN----------GRSWQRWSRH--------------EREWRR------GVDGIWTMLKRVEHALEVAA---------------------------------------------------------|
alr7503_Ana_17134588 --VMSFLPSNDRQYLE---NRGLPFEEVV----------DASQKGVILREFQ----LPLGR-FDTEQADILIL------------LPSGYPDAPP-----DMFYLLP--------------------WVKLVQGAKYPKAADQPHQFN----------GQKWQRWSRH--------------NNEWRP------GTDGIWTMLKRIENALEVAA---------------------------------------------------------|
NhamDRAFT_1902_Nham_69928899 PRQAFALLPVDERHLD---TMGLKWETVV----------DGGRRWLLIEGYP----VPEG--YNAAVVTLALE------------IPGPYPGAQI-----DMFYVHP--------------------ALRRLVGEEIP-ATQATETVL----------GRIFQRWSRHRGP-----------NSPWSS------RLDNVMTHLTLVDGALAKEVNQ-------------------------------------------------------|
Bcep1808DRAFT_3228_Bvie_67547440 VRADFTVMEEDAEFLN---SKGYTWEAVA----------SDAKR-IVVRGFE----PPQG--FAPTKVDMFVI------------LPQGYPDTQI-----DMVYFSP--------------------PLTRNDGKPI--RSLVTNEFE----------GKTWQGWSRHRTA-----------NSPWRQ------GIDNVGTHLMLVDDFLRAELSK-------------------------------------------------------/
y4jF_Rsp._2496664 AFDDQAASCAEGQATL-DLAVRLLARLYP----------------VLAILPL---DSASSFQAQALERLAKSI--------------------NPK----IGIRRSGKS------------------------------AMVCLVAGATRP-------SLRCTTFF------------------IGS-------------DGWAAKLSRT---------------DPVGSGSSLL----------PYGAGAASCFG---AANVFRTIFAAQ\
mll6192_Mlot_14025925 AFDDQAASCAEGQATL-DLAVRLLARLYP----------------VLAILPL---GSAASFQAQALERLAKSI--------------------NPK----VGIRRSGKS------------------------------ATICVVAGVTRP-------PLRCPTFF------------------MGS-------------DGWAAKLSRT---------------DPVGSGSSLL----------PYGAGAASCFG---AANVFRTIFAAQ|
msi105_Mlot_20803932 AFDDQAASCAEGQATL-DLAVRLLARLYP----------------VLAILPL---GSAASSQAQALERLAKSI--------------------NPK----VGIRRSGKS------------------------------ATICVVAGVTRP-------PLRCPTFF------------------MGS-------------DGWAAKLSRT---------------DPVGSGSSLL----------PYGAGAASCFG---AANVFRTIFAAQ|
RHE_PA00014_Retl_86359719 AFDEQACA-TEGRASL-DLLVRLVARLYP----------------TICLLPS---GEEAKKLAKNLASLARSI--------------------NED----ITIARRGSS-----------------------------ALSHCLVVGSTNP-------EISCPKFF------------------LGS-------------DGWIAKFSPE---------------EPVGTAGSNN----------AFGAGAAACIA---ASNLFRHIFRDQ/
--------
2. JABs
EE.HHHHHHHHHHHH.........EEEEEEEE......E..........EEEEEEEEEE................................................................EEEEEEEEE..........HHHHHHHH..................EEEEEEEE........................EEEEEEE.....EEE..E.EE....
EE.HHHHHHHHHHHH.........EEEEEE.......................EEEEEE................................................................EEEEEEEE...........HHHHHHHH...................EEEEEE..........................EEEE....................
AF2198_Aful_11499780(pdb:1oio) SRGLLKTILEAAKSA-----HPDEFIALLSGSK---------------DVMDELIFLPFVS---------------------------GSVSAVIHL-------------------DMLPIGMKVFGTVHSHPSPSC--RPSEEDLSLFTRFG---------------KYHIIVCY-----------PYDEN--------SWKCYNR----KGEEVELEVVEKD-\Archaeal JAB-1
PH0451_Phor_14590365 RRELLEYLLELAKSF-----YPREVAGFLRMKDG---------------VFEEVLIVPKGFF--------------------------GESSVYFDL-------------------TLMPHDESIKGTFHSHPSPFP--YPSEGDLMFFSKFG---------------GIHIIAAF-----------PYDED--------SVKAFDS----EGREVELEVID---|
VNG0778C_Hasp_15789943 GGRPSVLGIAEDALE-----FAREAAQDSHPDEYLGLLRATPASAFDLDADDGYVVTDVLVIP------------GTET---------NPVSATFGS-------------------TQVPNDMRNVGSIHSHPNGVL--APSDADRSMFGKG----------------QLHIILGH-----------PYGPD--------CWRAFDS----EGEPRTTTVLDVDL/
Z1657_Ecol_15801143 STRAAREWLILNMAG-----LEREEFRVLYLN-----------------NQNQLIAGETXF-------------TGTINRTE------VHPREVIK--------------------RALYHNAAAVVLAHNHPSGEV--TPSKADRLITERL----------------VQALGLVDI----------RVP----------DHLIVGG----NQVFSFAEH-----\RadC
radC_Bsub_16079856 SPEDGANLVMEDMRF-----LTQEHFVCLYLN-----------------TKNQVIHKRTVF-------------IGSLNSSI------VHPREVFK--------------------EAFKRSAASFICVHNHPSGDP--TPSREDIEVTRRL----------------FECGNLIGI----------ELL----------DHLVIGD----KKFVSLKEK-----|
radC_Aae_15606726 RNPQEAFEFLKDKFD-----ERRESLIALYLD-----------------LSNRLLDWEVVA-------------IGNVNTVF------SKPKDILF--------------------KAVKLSANGIIIAHNHPQGEP--SPSNEDLNFTERL----------------KKACELLGF----------ELL----------DHLILSE----GRYFSFREE-----/
COPS5_Hsap_12654695 SALALLKMVMHARSG-----GNLEVMGLMLGK----------------VDGETMIIMDSFALP----------VEGTETRVNAQAAAYEYMAAYIENA------------------KQVGRLENAIGWYHSHPGYGC--WLSGIDVSTQMLNQQFQ------------EPFVAVVID----------PTRTI---SA---GKVNLG-----AFRTYPK-------\Euk JAB-1
RRI1_Scer_6319985 SKLSCEKITHYAVRG-----GNIEIMGILMGF----------------TLKDNIVVMDCFNLP----------VVGTETRVNAQLESYEYMVQYIDEMYNHNDGGDGR--------DYKGAKLNVVGWFHSHPGYDC--WLSNIDIQTQDLNQRFQ------------DPYVAIVVD----------PLKSL---ED---KILRMG-----AFRTIES-------|
RPN11_Scer_14318526 SSIALLKMLKHGRAG-----VPMEVMGLMLGEF---------------VDDYTVNVVDVFAMP----------QSGTGVSVE-----AVDDVFQAKMMDML---------------KQTGRDQMVVGWYHSHPGFGC--WLSSVDVNTQKSFEQLN------------SRAVAVVVD----------PIQSV----K---GKVVID-----AFRLIDT-------/
Stambp_Mmus_17941277 NLCSEFLQLASANTA-----KGIETCGVLCGKLMR--------------NEFTITHVLIPR------------QNGGPD-------YCHTENEEEIFF------------------MQDDLGLLTLGWIHTHPTQTA--FLSSVDLHTHCSYQMM-------------LPESIAIVC----------SPKFQET------GFFKLT-----DYGLQEI-------\Euk JAB-2
SPAC19B12.10_Spom_19115685 LLKKVFLDVVKPNTK-----KNLETCGILCGKLRQ--------------NAFFITHLVIPL------------QEATSD-------TCGTTDEASLFE------------------FQDKHNLLTLGWIHTHPTQTC--FMSSVDLHTHCSYQLM-------------LPEAIAIVM----------APSKNTS------GIFRLL----DPEGLQTI-------|
CG2224_Dmel_7301945 DTMEVFLKLALANTS-----KNIETCGVLAGHLSQ--------------NQLYITHIITPQ------------QQGTPD-------SCNTMHEEQIFD------------------VQDQMQLITLGWIHTHPTQTA--FLSSVDLHTHCSYQIM-------------MPEALAIVC----------APKYNTT------GFFILT----PHYGLDYI-------/
VNG1818a_Hasp_16554503 TREGYDSVLDHAQAD-----TPREACGVFVGE----------------RDGDLRRVTAVRRVP----------NVADAPRV------RYELDPEATLAVFD---------------EAAAVGREVVGFYHSHPVGPG--RPSATDREHAQ------------------WPDRVYVVA----------SLAARPPILD---AWLWTGE----AFER----------\Archaeal JABS-2
PA2102_Paer_15597298 TEHALSVIYRHACRT-----YPRECCGFVLADA---------------KVKEGTNIQDELHMA---------DPRRYPRTAA-----NGYTFSVTDTVFLN---------------SSFKTCSPVSVIYHSHPDVGA--YFSREDIDKALYAGEPM------------LPVDYLVVD--------VAAGNVRGAKLF---AWRNGRF---ECTREFGPSSQ----|
SSO0111_Ssol_15897071 NRYFKINCWSRRFMD-----NLKEKCGIICNNT------------------FYELKNISRTE-------------YE----------FICDPSDFYTT------------------VKGKCSDDIQAIVHTHEESC---EPSYKDIMSMKIWN---------------IPWIIISKK----------CIKSILYLNG---SILELD----IHSLLSQELYHSLM-/
sll0864_Ssp_1652702 SQVHQDQIYRHGERC-----YPEECCGLLLGKILIGENGH-----------RHWQVVEVQPTENCWGDVEEF-QQNNHQGNKLHYFAIDPKVLLSAQK------------------DCRQKGLSIIGIFHSHPHGQP--IPSEFDRAIA-------------------WPEYIYLIA-----SGENGRFNTSR-------SWYLNEA----GNFMEVDS------
YPMT1.08c_Ypes_16082790 MQEIYLTAIKR---------YPNEACGFLVRT-------------------TGEKYRFMEARN-----------------------VSENPENTFVMHADDI--------------IAAEDAGDVVAIWHSHTDESA--DASDADRAGCEATE---------------VPWLILAV-----------RKNVEGD------APFHFSE---MNVITPDGFEMPYL-
_Scoe_7479881 TQALYDQIVAHARED-----HPDEACGVVAGPAG---------------EGRPERFIPMLNAA------------RSPTFYEFD-----SQDLLKLYR------------------EMDDRDEEPVVIYHSHTATEA--HPSRTDVTYAN------------------EPGAHYV------------LV-----------STADTDG---AGEFQFRSFRIVAG-
DR0402_Drad_15805429 PAPLRRALWAQVRRE-----LPRECVGALGGW----------------VRGEQVQAHALYPLP------------NVAADPEREY----LADPGDLLRVVR---------------AMQREGLDLVALYHSHPHGPA--APSASDRRLAA------------------YPVPYLIAD----------PAAE---------VLRAYLL---PGGEEVEV-------
_Aae_2984019 KKEVLEKMIKQAERD-----YPYETCGLLIGK-----------------SEGGIRIAYEAFET-----------PNANPDRKHDRYEIAPKDYMRAED------------------YAISKGMEIVGVYHSHPDHPD--RPSQFDLQRAFP-----------------DLSYIIFSVQ------KGKVASYR--------SWELKGD---KFEEEEV--------
RPCDRAFT_2255_Rpal_78493975 NEETLALIVRHAEQA-----YPKECCGFVYADGEVRA-----------CVNIQDDLKSID------------PARYRHGATAGYTL---SVADTLALNG-----------------SFETANPASV-IYHSHPDVGA--YFSQEDSDEALFLGTPVYP----------VDYLVVDVRR------AKALEAKL--------FVWRKAG---FFCARVFPIDQSYR-\ThiF+Rhodanese
Noc_0361_Noce_76882206 PRPLVNQLLHQAQVK-----PQQEICGLISAR--------------------NGLPSRCYP-----------INNIAPEPQRHFFM---DPQGQIAAMR-----------------RMREEGEELFGIYHSHPETAP--LPSKSDLAQAAYP----------------GALYLIISLN------TKGVLEMR--------GFRLQGE---VYEEIELQL------|
RRSL_01365_Rsol_83748715 LSELVDAVLAQARRD-----HPIETCGVIAGPV------------------GSDRPARLI------------PMRNAAQSIDAFRL---DAQEQFQVWS-----------------EMDAREEEPIVLYHSHTGTNA--CPSRDDVRFAAEP----------------HAHYLIVSTD------PACGQAVR--------SFRIAEG---RAVEETIKVVARYQ-|
MlgDRAFT_2849_Aehr_78700360 PARERDRLARLGLAR-----WPEEACGLMLGCD-------------------GRVRRLVL------------CRNVAARRADRYLV---HARDFLRWDR-----------------AAHRLGLDILGVWHTHPDGGA--RPSGTDREQAWR-----------------GWSYLIAAVD------GRAITELR--------SWRLRGD---HFIEETLCLKPA---|ThiF+Rhodanese
NE2352_Neur_30181074 HTKLISAMITQSLKD-----HPIETCGIIAGLA------------------GSNLPLRLI------------PMRNVAQSENFFMF---DPQQQLQVWK-----------------EMSARHEEPVVIYHSHTGSEA--YPSRSDVELAAEP----------------QAHYVIIPTC------SPHKEEIR--------SFRIVDQ---MVIEERVQIVRQYQ-\ThiF+S (S)
Nmul_A0971_Nmul_82702100 HAKLVEAMLAQAHKD-----HPFEICGVIAGPE------------------KSNLPLRLI------------PMRNAAQSETFFKF---DPQEQLQVWR-----------------EMEARGEEPIVIYHSHTHTPA--YPSRTDVQYASQP----------------QSHYVIVPTD------PAYGEEIR--------SFRILDG---MVTEERIRMINSYK-|ThiF+S (S)
pdtG_Pput_84994017 TAQALEQVRHLAQAA-----HPIEACGLIAAAS------------------GEPLAHRVV------------PMRNQAASPTWFSF---DPREQLQVWR-----------------ELDQRDEDCRVIYHSHTASEA--WPSREDIALASDP----------------QVHYLIVSTW------GEARHAAR--------SFRIIDG---RVFEEPLCVQP----|siderophore
qbsD_Pflu_28192389 SQDIITAIFDQARQA-----HPLECCGIIAAAI------------------DSERATRLI------------PMTNSACSPVYFAF---DPRQQLQVWR-----------------EMDARDEEPRVFYHSHTASRA--YPSATDIEFATDA----------------NAHYLIVTT-------ADYDPPLR--------SFRIAQG---CVSEEEVRVETPPY-|Siderophore
AcidDRAFT_1958_Susi_67932292 ESAAWAAMVKHAQAS-----YPNECCGAMLGDT------------------DGETKLVRESIAL--------ENAFEGAQAARYEL---RPQDLLAADK-----------------AARERNMDLIGIYHSHPDCDA--YFSKTDLQNSCP-----------------WYSFVVLSIQ------KGEFHHAN--------SWLPNFD----QTEAAKEELSY---|
nfa10890_Nfar_54014564 KSDLVAAMVAHARAD-----HPDEACGVIAGPE------------------GSDRPERFI------------AMTNAERSPTFYRF---DSGEQLKVWR-----------------EMDAADEEPVVIYHSHTATEA--YPSRTDISYASEP----------------NAHYVLISTR------DPEQHELR--------SYRILDG---VVTEEPVRVVDDYD-|
RxylDRAFT_0217_Rxyl_68563153 GRGDVEHIHRHAREA-----YPEECAGALVGMDVGG------------GTKIVVDVWRA-------------ENVHEEERSRRFLI---EPEQIRRFER-----------------RAAERDMDVLGFYHSHPDHPA--EPSEYDRQHAWP-----------------YYSYVIVSVS------GEEIREMR--------SWRLRDD---RSGYDEEEIVG----|Cys synthase
SAV5162_Save_29608821 TQALVDQIVAHARQD-----HPDEACGVVAGPE------------------GSGRPERFI------------PMLNAARSPTFYEF---DSGDLLKLYR-----------------EMDDRDEEPVIIYHSHTATEA--YPSRTDISYANEP----------------GAHYVLVSTA------DADDAGPF--------QFRSFQI---VAGEVTEEEVKVVE-\Cys Syn ClpS
MT1376_Mtub_13880984 RADLVNAMVAHARRD-----HPDEACGVLAGPE------------------GSDRPERHI------------PMTNAERSPTFYRL---DSGEQLKVWR-----------------AMEDADEVPVVIYHSHTATEA--YPSRTDVKLATEP----------------DAHYVLVSTR------DPHRHELR--------SYRIVDG---AVTEEPVNVVEQY--|
Tfu_2370_Tfus_71916501 DRSIYDKIVAHARRD-----HPDEACGIVAGPE------------------GSDRPERFI------------EMINAERSPTFYRF---DSLEQLKVWR-----------------EMEERGEEPVVIYHSHTSTEA--YPSRTDISYASEP----------------NAHYVLVSTR------DPETVEFR--------SYRIVDG---VVTEEPVEIID----|ClpS
SRU_2040_Srub_83814538 TPDILDQIRVHGADA-----YPEEGCGFLLGTVTDD------------GDNRVAALHRA-------------TNRRSEQRTRRYEL---TADDYRAADA-----------------AAQEQGLDVVGVYHSHPDHPA--RPSATDLEEATFP----------------GFTYVIVSVR------DGAPEALT--------AWALAPD---RSEFHREDIVRPDP-|Cys
WS1005_Wsuc_34483108 -KALFDSIIEHAQRE-----LPLEACGYVAG--------------------VEGEVKRLF------------PMRNVDASPEHFSF---DPAEQFSAFK-----------------EAQKEGLRLIGCYHSHPSTPA--RPSDEDIRLAYDS----------------SLSYLIVS--------LAKEPVLN--------SFKIKEG---VVTPENIEVI-----\Sulfite metabolism
Gmet_1569_Gmet_78194034 -RAIHAELIAHAQAD-----APIEACGILGG--------------------IDGAVSAIF------------RMANTDQSDEHFMM---DPKEQFAVVK-----------------ELRNRGLAMLAIYHSHPETPA--RPSEEDIRLALTP----------------GVSYVIASL-------AGAEPDVK--------AFRITDG---VVEPEPIDIVE----|
CsacDRAFT_2033_Csac_82499136 PKTLYEEMLNHCLNS-----LPIEACGLLGGVI----------------EDEKRIVKKVY------------LLTNVDQSPEHFSM---DPLEQFAAVK-----------------DMRKNGWVLLGNFHSHPTTPA--RPSEEDKRLAFDK----------------SLSYLILSLM------DEKNPVLK--------SFRIYES---YVEEEEIQII-----/
Syncc9902_1941_Syn_78169801 DLQCLTVLERSLLAV-----KPQEGCGLLLGTGL------------RTPRLRLVTLWPACNAWKKSDW----VNDALGDLETRFVL---DPREQIAAQR-----------------WARVHGLEVLGVCHSHPKTAP--EPSTRDCAWAEP-----------------NQLMLILS---------GMRELR---------AWWLGAD---RHPLEIPIEVWENHT\ThiF+Rhodanese
Syncc9605_0389_Syn_78196400 DHRCHTDLRRILLAP-----HPEEGCALLLGQRT------------NSGCLRVTTTWPCCNVWGRGAS----GQRPVHDRCRRFLV---DPREQLAAQR-----------------WARNRHQYCLGVAHSHPASEP--VPSPHDRQWGEA-----------------ESVMLILS---------ASLGLR---------AWWLHGD---RSVDEIPIQLWDTHK|
SYNW2054_Syn_33633364 RCGCLTILERTLLAS-----WPEEGCALLIGSQG------------EGSSLRLDHVWPGCNRWGRQPDLQPWGAGETPGRDCNFLL---DPREQLAAQR-----------------WSRQHQQWIIGVAHSHPHSPP--VPSAADRCRGVP-----------------HQLMLILS---------AQQGLR---------AWWLEED---RQVRPVPIDVD----|
RS9917_03068_Syn_87124949 GRQCLIVLKRTLAAP-----APEEGCALLLGSLV-IGGAS------TRSTWRVHRVWPCCNVWSPGLAALPEPPDASLTRRHRFAL---DPREQLHAQR-----------------WARARGLQVLGTAHSHPEGEP--EPSRRDLDWATT-----------------PSLMLILG---------GSGALG---------AWWIEAD---AASPLLLEHTDGEA-|
CYA_2469_Syn_86607091 SAEHLRAIRQHGEQA-----FPYEGCGILIGEL-------------KGADKIVHELWAVANTWDQAEN----PLADGESSRRRFLI---DPADFKRAND-----------------HAVRKGLGILGTYHSHPNHAA--VPSEFDRQHAFPW----------------GFSCVIVSVR------EGKAEEVA--------SWVLDEQ---EQPQREPMQIWEDK-|
CaurDRAFT_0696_Caur_76258731 PDQAAAAIAAHAEAT-----YPDECVGLLVGTL-------------NGEKKTVLQVVTLENRWSGQVQLA--ATDNPHSRRDRFYL---DPRDYLRVDR-----------------ETRAAGYEIIGCYHSHPDAEA--VPSERDRIGAQAIGGS-------------GFSFVIQSVH------NGVATALH--------SWLLVNE---GTRFIAEEVRIITT-|
Adeh_3492_Adeh_86159910 GAPLLARISALCEAD-----PEREVCGFVVRRRG------------LLEVEPIPNAADRYHAHD--------PLGFPRTSRDGYLM---DPRAHLQLLQ-----------------ALDAEGGEVVAVWHSHVEVGA--SFSAKDRADALADGVPLLP----------GAEYLVFGVR------GGKVTEAR--------RFRFHGG---DFVESPLA-------/
N15p19_BPN15_9630483 RQKTIDAIMAHAAAE-----YPRECCGVVAQKS----------------RVERYFPCRNLSA---------------------------EPTEHFHLSPEDY--------------AAAEDWGTVVAIVHSHPDATT--QPSELDKAQCDATL---------------LPWHIVSWPE----------G-----------DLRTIQPRG-ELPLLERPFVLGHF\JAB+NlpC
HK022p18_BPHK022_9634137 RQKTIDAIMAHAAAE-----YPRECCGVVAQKS----------------RVEKYFPCRNLAT---------------------------EPTEHFHLSPEDY--------------AAAEDWGTVIAIVHSHPDATT--QPSELDKAQCDATL---------------LPWHIVSW----------PDG-----------DLRTIQPRG-ELPLLERPFVLGHF|
phi1026bp18_BPphi1026b_38707908 DEQIKKAIEAHALAE-----YPRECCGLVVKTE----------------SGEIYVRCRNLAA---------------------------VPTDQFALASEDY--------------AAAEDMGEIVALVHSHPGASA--QPTDEDRTMCGRSGI--------------AKWVIVSL-----------GVQADGSIGID--DWCEFEPGGYVARLVGRQFVHGVH|
64_BPBcep176_77864689 DERIKQAIADHALAE-----YPRECCGLIVRTA----------------AGDVYLPGRNVAP---------------------------TPTDQFALAPEDY--------------ADAEDMGEIVAMVHSHPNGTA--QPSMADRTVCERAGI--------------PQWVIVSL-----------GVQADGSIGVD--DWNEFGPSGYVAPLYGREFLHGVL|
T1p35_BPT1_45686325 SAKIKLEIMTHAQEE-----YPRECCGVVTQKG----------------RVQKYHRIDNVHR---------------------------DPENHFMMDAVQYAC------------IEDDAESTTIAIVHSHTGDGATTLPSAHDTCMCNEME---------------VTWIIVSV----------PEG-----------DMRFVKPE--KLPLIGRPWSLGSF|
_BPlambda_215123 MTQTESAILAHARRC-----APAESCGFVVSTP----------------EGERYFPCVNISG---------------------------EPEAYFRMSPEDW--------------LQAEMQGEIVALVHSHPGGLP--WLSEADRRLQVQSD---------------LPWWLVCR-----------G------------TIHKFRC---VPHLTGRRFEHGVT/
PHG307_Cnec_38637968 PAQLIGEFAAMARAA-----HPKETGGWVVWNA----------------DSASFRLVPVQ-----------------------------ILEHSGGHLKYER--------------PPLAADDVLVVDCHSHGRHPA--FFSSTDDDDDCHDV---------------KFAFVMGNCD--------AATPSM--------ALRLCAK---GIFENVEKVPADWY\Div E2
alr7560_Ana_17134645 LEPYFRLKVPKVPCQ-----AIAEIINAASINP-----------------QQEILFYLGVTNDQWWCHTP-------------------LQTASSTHVLSLES-------------ALDKSYTDGLVEMHSHGTLAA--YPSSADNQEEKGK----------------FRVFAIIGT----------LNNIPT-------IYTRIGI---YNHFFDINP-----|
RSc1658_Rsol_17428674 TFPMVRAFIEAARKA-----APNEHAAWVVWDS----------------RTGDLAYRELQ-----------------------------ITDASPGAISYDR--------------PRLEDHESLVVDMHSHGALAA--FFSEQDNRDDAGEV---------------KISCVVGDLA------DGKTPSIQ--------FRLCVLG---MFLPLKVPADAVLG|
p1B74_Asp._56315655 PLSLFERFAGIARES-----CPLEAAGWITWNE----------------VSNQFAFREVG-----------------------------VREASASRIHFDR--------------PRLDESEHLVVDLHSHGASSA--FFSGTDDFDDRGEVKLSIV----------LGRCDQRVVTAQRFCLMGMFVPMQ--------LASAVDG---LTFEPVPVERTRS-|
PproDRAFT_0259_Ppro_71839552 PVDMITRFMVEAKER-----FPLECAAWFTWDT----------------YLKRFNYYSLH-----------------------------ARQASLDNLDYAC--------------PVLPETECLVCDIHSHGRHPA--VFSPEDNNDDRGET---------------KIAVVLGRIN--------SSPSIA--------FRLCVAG---LKIPIKYNLAKLPF|
Bcep1808DRAFT_6254_Bvie_67543574 PAALVAEFHAMARAA-----LPNEVGAWIVWNS----------------VTNEFRIVALP-----------------------------SLSHGPGHLVYER--------------PRLADGEWLVVDCHSHGTSPA--FFSRTDNQDDKHDV---------------KFALVLGHCD---------RTPSV--------ALRLCAK---GIFEKFERAPETWA|
RMe0063_Rmet_56410325 PPALIGEFTDMARAA-----YPNETGAFVVWNA----------------RTQQFRLVPLR-----------------------------ILAQGTGHLKYDR--------------PRLGADDVLVVDCHSHGRYPA--FFSATDDADDRHDV---------------KFAFVIGNCN--------AAVPSL--------ALRLCAK---GIFENVERIPHGWY|
Daro_2537_Daro_71847774 PIRVIEAFIEAARRG-----LPNEVAGALIYSR----------------RNQSLRLALCE-----------------------------PIEVSPHQIDYRV--------------PTMDADETLAVDLHTHGYGSA--FWSAKDDGDDQGIKVAGVF----------GCLHQPKPQALFRLVVNGRFRPLP--------HPWQADT---DTACDVAPDLESGL/
XAC3952_Xaxo_21110358 DEGLCEQLLGERASS-----LPLETGGILLGVV---------------DFKLNTIHLVDGR---------------------------SAPRDSVSTEADFQCGSCGVQEDITEAQRRTAGMVLWIGAWHSHPKGVKA-VPSIQDQDLLSHLCTRLGAHGLP-------AVMLIAGE------------NG---------IDVFLQ----MKGS----------\E2+E1+JAB
SYN_01833_Saci_85859492 SQAALSEMHTWVRRSALIYGEKAETGGILFGGR---------------DNACRVIWVSEVI---------------------------GPPADSESSCAHFICGTNGVAEANEEKRQRTRGSTQYIGMWHTHPTSLP--APSETDFLAMHALVNA----DEPSTH---KHLLLILGS------------DS---------EQSVEL----LSGFL---------|
RmetDRAFT_0537_Rmet_68559822 LSPVAQAIHADALRW-----GALETGGALIGRI---------------SFENRTITIAGLV---------------------------EAPPDSVREAARFVLGTNGLVQNLRAANAASLGYLAFIGTWHSHPKGG---AHSGIDRNTLRGIAEDA--GGL-------PAVSLVWTP------------TG---------LTCAVD----RW------------|
pCPF5603_46_Cper_86559649 YRKAYERILNELNNS-----KPNETGGILLGNI---------------NKNNKTIYVTDI----------------------------YIPKDSKYGPYLFTKGSYGTKEYLEHVLKSTGNIINYVGDWHTHPESST--NMSSKDKKSLLELKEYLKEYSY-------PAHIMIFNE------------KD---------ISSYVI----S-------------|
MaquDRAFT_3597_Maqu_77955313 DVSIEGCMSSLRENE-----LPNETGGVLVGFI---------------DRKIKTISVVLAR---------------------------PAPEDSVSTPKEFLRGTAGVEEDIDECRRRTGGIVSYLGEWHSHPRGCHS-NPSTHDRIQLDYLEGVMARDGS-------PAISMIVSD------------ST---------ISVSLD----QQTTM---------|
RSP_2048_Rsph_77387014 TARAFAKMAEFAAKR-----SQRETGGILIGHY---------------SEDLTIARIEAAS---------------------------DEPPDSRAGRTWFVRGQVGLAEILQ---RAWREGRYYLGEWHSHPGASP--APSGPDLSAIAKMARHPTFICH-------RPILVIIGG------------NF---------HQQPLL----SATLA---------|
Mdeg<a target=hotgi_overjumps href="http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?db=Protein&dopt=GenPept&uid=02000735">02000735</a>_Mdeg_48864353 EIRVLGNIRDQIERES--KGSSVEQGGVLAGMV---------------CHLSKTIYVTLVV---------------------------PAPDGTIRTPARLDIATTGLEEIFENIHSATNGQITLLGTWHSHTTPS---PPSLKDRVTYEKLAKNY---DLP------VVMLVYTGG------------RI---------ERV---------------------|
Nwi_2874_Nwin_74421925 SSRVMTKIAEEVARH-----PAVETGGVLVGTC---------------SARLRTIIVVDLI---------------------------EAPRDSVRSATRFVLGTAGLKAAIKARHRTSGGTLFDVGTWHSHLADQ---GPSALDRATARQLAAER---PPP------SVLLIQAPT------------RL---------YALMHN----GAAT----------/
_Ecol_37927531 SEVVRLLKSYRQMQY-----VSTEAGGVLIGER---------------RGPHIVITHISEP----------------------------GPGDIRTRNRFERKGDHHQLKVDELF-EQSNGFLVYLGEWHTHPEDFP--QPSYTDMKSWLTGLIA-----TE------TMLLIIVGR------------KS---------EWVGIK----NGNDIKSIREK---\E2+E1->JAB
VC0181_Vcho_9654585 GHVVTRLLSYRQLHH-----LTPESAGVLIGER---------------RGQHLVVCDISEP----------------------------GSGDIRQRCRVDRRGVHHQSRVNEAF-ERSAGTHLYLGEWHTHPEDRP--FPSATDRHSWRRNIVS-----DE------SMLLLIVGR------------KD---------FWLGKK----ERELITVFKKIES-|
Psyc_1371_Parc_71038911 IGVANILTSYRQLSD-----SSPESAGVLIGER---------------RDVHIVIKTVSEP----------------------------SPWDIRSRFMVDRVSKYHQKVVDDAF-KKNNGEWQYLGEWHTHPEDVP--KPSMTDYSSWHKNLKS-----SD------PLILIIAGR------------RD---------FWVGKK----IQDNIEVLKQV---|
y4qB_Rsp._16519909 PESVVEAMLKDASRW-----HDLETGGTFMGYWS---------------DANVAVITKMID---------------------------GGSEAIRTRKSFSPDREWEQSEIDRHY-RVSGRVDTYIGDWHTHPNAQS--EPSWTDRRCLRTIIR------SPEARAPRPVMILLCGG------------PE---------NWLPHAWI--GQLTRRALLFERV-|
ArthDRAFT_2189_Asp._66965740 PVPVLSDLVEQARLY-----APAETGGILVGHYTVTKPN----------GQRDAVVTDVIG---------------------------PGPAATRSRIAFEPDTEWQTAELSRVY-ALRDRRVSYLGDWHTHPTGQP--VPSLRDLKTLETIAA------HTAARCPEPFMAILGKE------------GM---------EQDWNIAV--CQHEALGRIRNIIP|
_Cper_86475967 DNLIACMDSYKQLNS-----NDKEKGGILIGY-IT--------------TDNNIIIEYITE---------------------------PFDSDISKRFSFIRRDINHEKVLNNIW-ESNGKMHTYIGEWHTHPEDYP--NFSSIDKKNWINLGK------KIHPSKRYYINIIIGNK------------DL---------RIWEYDVK--NEKIERIK------|
OB2597_05125_Obat_84499282 ---------MALNEG-----GHREIGGQLFGEQLA--------------PSQFLVTNLTVQ---------------------------ARRGSYTRFIVDLFQAARDAMRFFDST-QHDYTRHNYIGEWHSHPSFKV--RPSGTDLTTMRELVR------DPGFKGTFAVLMIVRLD------------AD---------CIAAAAWN--FDPLGREGVAQLEI/
mll6193_Mlot_14025926 HCISTVHAHLRSVGR-----EGNEGMALWVGVQ------------QDQHFAVTETVLPAQR-----------------HIRTGDGVCVMVPAEELHRLNV----------------WLYNSGLKLLAQIHSHPGRA---YHSTTDDAYAVATTVGC-------------LSLVVPNF----AREPFDFARVAAYRLDGKANWNALPS---AALSRMITITS---\E2+E1
RHE_PA00015_Retl_86359720 AAVNDVHEHLAEVGR-----SGYEGLGLWVGTV------------AAEIATVERALIPQQR-----------------LIRSAAGVGVHVDGTELHRINM----------------WLFDNGLRILAQIHSHPSDA---YHSDTDDEYALATAVGS-------------LSLVVPDF----ATGPTDLSQTAVYRLDKAGKWMAVSQ---ETVNRLIEIVD---|
msi104_Mlot_20803931 HCISTVHAHLRSVGR-----EGNEGMALWVGVQ------------QDQHFAVTETVIPAQR-----------------HIRTGDGVCVMVPAEELHRLNV----------------WLYNSGLKLLAQIHSHPGRA---YHSTTDDAYAVATTVGC-------------LSLVVPNF----AREPFDFARVAAYRLDGKANWNALPS---AALSRMITITS---|
RPDDRAFT_1996_Rpal_77690159 SVLERTISIIRRDGN-----RGEERVALWLATA------------AQRSPAAIVEVYEPEQ-----------------VVEV---DSFYIPPASMRALMN----------------HLRSTRRRIAAQIHTHPGRA---YHSDADAKWAIIRHSGA-------------LSLVLPHF-ANATTVENFLEEVMTYEYSPAGEWIHCPN---VGAGARVVVTA---/
alr7504_Ana_17134589 --QHSNYLHELLLTI-----DGKERAAYVLCGQAVINADPWDGQPHQKFISYEVIPVMPED-----------------EIVSFSAKHITWKTDSFVRAL-----------------QAAQAKNLTLAVFHSHPEGLR--EFSIQDDTNEPDLIQLAQNRNGSDTQI---LSVILMPD------------GN-----LIGRLWVSSQE---VISLRIIRVIGQKI\JAB+E1
sll6053_Ssp_38423902 -ESHLQELRKSLWHS-----DGKEKAAYLICGEVSIQADPWTSMPRKKYLSVEVIPIP-DN-----------------EIVSHSPQHITWSTDSFVRVL-----------------KLAQQKNLTVAIIHTHGKNGA--RFSEQDDVNEPDLVQLAQNRNGQDTKL---LSLILTAD------------GD-----LVGRCWFNPKE---YQPLDLIMCVGDRL|
Bcep1808DRAFT_3227_Bvie_67547439 SGKHRTQLRRHLSPG-----DGKEAVAIALCGQ-------ASGVRRNQLLVHEVVEVP-YE-----------------ACRIREPDAVAWSVEAVLPAL-----------------NQAIKKNLTVVKFHSHPSGYP--EFSRYDDESDRAFFSAVDNILDNVDRR---ASVVMLPD------------GR-----PFGRHVQNGIL---GEPIDLFRIAGDDF|
PnapDRAFT_3951_Pnap_84711629 -ESHEAALRALLHRE-----NGSEAAAYVLFGKAEIAADPWSKQPRIRLISHEVVPIT-SD-----------------EMVSSSSVHVTWSTQGFMRLL-----------------GLAQHRNLVPALVHTHPGAGA--FFSDQDDRNEAELARTTFNKGAQG--L---ASMVFGQH------------DA-----IVGRLWKSAKA---STKASSISIVGSKI|
NhamDRAFT_1903_Nham_69928900 PVGVHTALRAHLFPG-----DGNESAAILLCAA--------GPGRRLKLLARELIPVP-HE-----------------ACSVRKPDRITWPGRWIEEAI-----------------DRGEKEGLHIVLVHSHPGGLF--EFSAADDASDSVVVPGLFAAYDAR--H---GTAIMTPD------------GR-----MKVRFYDHDLQ---PTVVDLVMVPGDDI|
Adeh_2929_Adeh_86159351 VADWVLRALDAELGG-----HPPERGGALLGPPG-----------RPLLTRFEPDPGA----------------------RASASQWAPSAGLGARVAA-----------------LERGEGLELKGLVHSHPGALD--QPSAQDARELAAGLAHNPHLGCYLGPV---VSLAPAGA------------PG-----AHEVALPRGKL---SLFAARRSRGGGTE/
------------------------------------------------------------------------------------------------------------------
</pre>
</body>
</html>