NO03G04530, NO03G04530 (gene) Nannochloropsis oceanica

Overview
Name: NO03G04530
Unique Name: NO03G04530
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 4346
Alignment location: chr3:1300278..1304623 -
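
The reported sequence length is consistent with reading the alignment location as 1-based, inclusive coordinates: 1304623 - 1300278 + 1 = 4346. The sketch below checks this in Python. The coordinate convention is an assumption inferred from these numbers, not something the record states, and `parse_location` and `span_length` are hypothetical helper names used only for illustration.

```python
import re

def parse_location(loc: str):
    """Split a location such as 'chr3:1300278..1304623 -' into its parts.

    Returns (seqid, start, end, strand); strand is '.' when none is given.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])?", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4) or "."

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("chr3:1300278..1304623 -")
length = span_length(start, end)
print(seqid, strand, length)
```

The trailing `-` is taken to mean that the gene lies on the reverse strand of chr3.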

Properties
Property Name | Value
Description | Fatty-acid-ligase fadd9
Alignments
The following features are aligned
Aligned Feature | Feature Type | Alignment Location
chr3 | genome | chr3:1300278..1304623 -
Analyses
This gene is derived from or has results from the following analyses
Analysis Name | Date Performed
PRJNA769958 | 2024-08-13
GSE178672 | 2023-12-15
PRJNA933693 | 2023-10-26
PRJNA943449 | 2023-07-19
GSE149904 | 2020-10-10
NoIMET1_WT_HS | 2020-09-29
PXD010030 | 2020-03-16
GSE139615 | 2020-03-11
PRJNA241382 | 2020-03-11
BLASTP analysis of N. oceanica IMET1 genes | 2019-07-11
InterPro analysis for N. oceanica IMET1 genes | 2019-07-11
GO annotation for IMET1v2 genes | 2019-07-10
PRJNA182180 | 2019-04-25
Gene prediction for N. oceanica IMET1 | 2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term | Definition
GO:0008152 | metabolic process
Vocabulary: Molecular Function
Term | Definition
GO:0003824 | catalytic activity
GO:0031177 | phosphopantetheine binding
Vocabulary: INTERPRO
Term | Definition
IPR036291 | NAD(P)-bd_dom_sf
IPR036736 | ACP-like_sf
IPR013120 | Male_sterile_NAD-bd
IPR009081 | Acyl_carrier_prot-like
IPR000873 | AMP-dep_Synth/Lig
IPR020806 | PKS_PP-bd
Homology
BLAST of NO03G04530 vs. NCBI_GenBank
Match: EWM29929.1 (fatty-acid- ligase fadd9 [Nannochloropsis gaditana])

HSP 1 Score: 1649.8 bits (4271), Expect = 0.000e+0
Identity = 851/1296 (65.66%), Positives = 991/1296 (76.47%), Query Frame = 0
Query:   10 WTMPGPNLARLATLLPHDPELQAYQPSPSVQSLVDAAPTTIEIIQQLCQGYAERVALKWCSDGAMTSDYTYTMTYKEVWEKVQTMATVLSTKGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMDYGGEEKGQAWVAKLRSLLPASVVRVLTMQEL-------XXXXXXXXXXXXXXXXXXFVSPAIPGAEGWGEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAGEIPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQLKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFSGSADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGG----GKEKQELA-----------ERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQANPDASITEQVCAALEMTLGLGEEVGLDATFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLEAK---GEMKDGGKEDEGGGEEIYERVHGPKQAAPLIFAKDLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARLWEAVGASLLDPNEERVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSFTDGEGQAPHYDGSSVDFVAGAIVAIALEGGR--------GGGRE---KGCRLFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTPLPSRKKMEELLVLDATCFRDAVKRVTKRGDVPVLSERYLHTCLYHMSVVGLIGATPVVAEAAVWER 1270
            W MPGPN+ RLATLL HD ELQ   P P VQ+ +D   TTIE++QQ+C+GYAER ALKWC +GA T+DYT  +TY+++W++V+ MATVLS +GWL KK+FVAVCGFASPGWVL+D+L+LYLGAVLVPLPLNVPYEDL+ ML+ES ARVVF +AEE+ +L  NVL K+GACP VE VVVMDYG E  GQA V  LR +LPA+V+RVLT+ EL                         F+ PA+PG++GW +E NPLLGLMYTSGSTGRPKGAMYTE LMRALWHTGFSWG GE+PV+SLGFLPLNHIVGRVTIYQ+ +KGG VYFVRRSDMSTLFEDFA A+PTKTMMVPRIANMAYETYQLKK ELLQ  GLS +S  +   +  ++  LD VERAAR YIRD++FGGRLLFAIVGTAPSSAAVS LIQEACEMPMVEGYGSTELGGITIEN IN ATV+KWKLIDVPELGYTLKD P PRGELLV TTT IPGYYKHP+AT+ELID EGF+ TGDIMEQRG   LVWIDRRKNVLKLAQGEY+S+SRLEAL++G  D+ NI+LYGNSLRS++L VVVPSE LV  A+ G    +GG    G  + ELA            R+KPLLRQRLD++AK AG+A++EVPRDFIVQL GFTREN L+TDSNKLARARLK+ FGPRLEAMY AVE+RKEKRL  L ++P AS T+ V A LEMTLGLGEEVGLDATFAQLGGDSLSAVR+TEHLKR CGV VPVA+LLNP  T+ SL+++L+ K   G++      +E G EEIY RVHG      +IFA+DL + KFIP AVLHPP    A DWAA A +G  +    ++GVLLTGANGFLGRFLL                               C++R R++AAAK RLWEAVGASLL  +E+R+V LAGDVS+P LGL+  + Y +L E VDLVLHAGALVNHNLSYR LFGPNV+GT NI++FCL  PS PKPLH++ST+AV MGEGGPG  V E H+G  W  +R +    YAEGYGASKWAAEVLLQ++++ETGLPVVAYRCSMILPH+SL GQINV+D+FTRL+ G+IYTGVAP SFTDG G AP YDGS VDFVAGAIVAI  +G            G+E    GCRLFHV+NPH  KG SLDD++ WV+SAGY VE + PY+ W   F+SKL+ALPS+KR QSPLPVL  WQ P PSR   +E+ V+DA+ FR AVK +T+ GDVP L E YL TCLY MS VGLI ATP V  A + E+
Sbjct:   22 WKMPGPNMDRLATLLAHDRELQDAMPDPQVQARIDKTATTIELVQQVCEGYAEREALKWCVNGAPTADYTECLTYRDIWKRVRAMATVLSGRGWLLKKDFVAVCGFASPGWVLIDVLALYLGAVLVPLPLNVPYEDLRSMLEESEARVVFCAAEEAHTLVMNVLAKDGACPGVETVVVMDYGEESNGQAMVQALRPVLPATVLRVLTLGELLTDVSQGIVDSTERYGTTAAASSCGFLPPAVPGSQGWSDESNPLLGLMYTSGSTGRPKGAMYTEKLMRALWHTGFSWGKGEVPVISLGFLPLNHIVGRVTIYQALSKGGRVYFVRRSDMSTLFEDFATARPTKTMMVPRIANMAYETYQLKKMELLQTVGLSSSSLPTEEQAQGKRLKLDEVERAAREYIRDRVFGGRLLFAIVGTAPSSAAVSALIQEACEMPMVEGYGSTELGGITIENRINVATVVKWKLIDVPELGYTLKDQPCPRGELLVMTTTGIPGYYKHPKATAELIDTEGFFHTGDIMEQRGPEELVWIDRRKNVLKLAQGEYLSISRLEALYAGDPDVKNIFLYGNSLRSYVLAVVVPSEALVAAAKAGLEDGNGGNGLKGSARDELAGQGEVEEEAVLARLKPLLRQRLDSMAKEAGMASYEVPRDFIVQLAGFTRENKLVTDSNKLARARLKETFGPRLEAMYDAVEQRKEKRLEHLHSDPQASTTDMVRAVLEMTLGLGEEVGLDATFAQLGGDSLSAVRVTEHLKRLCGVGVPVAELLNPAMTLSSLIQHLDLKLKRGQVVGAEDMEETGEEEIYRRVHGLGDDDAVIFAQDLTIGKFIPTAVLHPPEAKVATDWAAGAEKGFEKREGRVRGVLLTGANGFLGRFLL---------LELLQKLCAQTDYKLGGTGRVYCLIRGRDEAAAKCRLWEAVGASLLQAHEDRIVVLAGDVSRPHLGLS-GDTYANLLENVDLVLHAGALVNHNLSYRSLFGPNVVGTANIIKFCLARPSHPKPLHYVSTIAVTMGEGGPGREVHEGHLGYCWGNQRFVRKGTYAEGYGASKWAAEVLLQNVHKETGLPVVAYRCSMILPHSSLPGQINVADVFTRLLAGVIYTGVAPASFTDGHGPAPAYDGSPVDFVAGAIVAIVWKGFHRSTGQVDVDAGQEIEASGCRLFHVINPHVQKGPSLDDMIDWVQSAGYDVERITPYRTWLETFESKLQALPSEKRNQSPLPVLKHWQDPSPSRIHAQEISVVDASRFRAAVKELTRWGDVPCLCEGYLLTCLYQMSKVGLISATPAVMLAKLAEK 1307          
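
The percentages on each HSP summary line are the identical and positive-scoring column counts divided by the alignment length; for the hit above, 851/1296 is about 65.66% and 991/1296 is about 76.47%. The sketch below recomputes them from a summary line. It assumes the line layout shown in this report, and `hsp_percentages` is a hypothetical helper name, not part of any BLAST tool.

```python
import re

# Matches the two count/length pairs on an HSP summary line.
# "Posi?tives" tolerates the spelling "Postives" seen in some report exports.
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+).*?Posi?tives\s*=\s*(\d+)/(\d+)"
)

def hsp_percentages(line: str):
    """Return (percent identity, percent positives), rounded to 2 decimals."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identical, ident_len, positive, pos_len = map(int, m.groups())
    return (
        round(100 * identical / ident_len, 2),
        round(100 * positive / pos_len, 2),
    )

stats = hsp_percentages(
    "Identity = 851/1296 (65.66%), Positives = 991/1296 (76.47%), Query Frame = 0"
)
print(stats)
```

The same function applies to the summary lines of the remaining hits, since they follow the same layout.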
BLAST of NO03G04530 vs. NCBI_GenBank
Match: XP_005651684.1 (acetyl-CoA synthetase-like protein [Coccomyxa subellipsoidea C-169] >EIE27140.1 acetyl-CoA synthetase-like protein [Coccomyxa subellipsoidea C-169])

HSP 1 Score: 767.7 bits (1981), Expect = 5.700e-218
Identity = 485/1252 (38.74%), Positives = 689/1252 (55.03%), Query Frame = 0
Query:   17 LARLATLLPHDPELQAYQ-PSPSVQSLVDAAPTTIEIIQQLCQGYAERVALKWCSDGAMTSDYTYTMTYKEVWEKVQTMATVLSTKGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMDYGGEE-KGQAWVAKLRSLLPASVVRVLTMQELXXXXXXXXXXXXXXXXXXFVSPAIPGAEGWGEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAGEIPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQLKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFSGSAD-ISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGGKEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQANPDASITEQVCAALEMTLGLGEEVGLDA---TFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLEAKGEMKDGGKEDEGGGEEIYERVHGPKQAAPLIFAKDLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARL---WEAVGASLL---DPNEERVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSF-TDGEGQAPHYDGSSVDFVAGAIVAIALEGGRGGGREKGCRLFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTPLPSRKKMEELLVLDATCFRDAVKRVTKRGDVPVLSERYLHTCLYHMSVVGLI 1256
            LAR   ++  DP+LQA +    S+Q + DA  T+IEII  + + YA R     C+ G  T    +T+TY  VWE++Q +    +  G++   +FV + GFAS  WV+ D+ +L+ G V+VPLP N+  ED++ ++DE+  R + VSAEE  ++A  +    G C SV+ V+VMD   +        A++++ LPA   ++ T+ E+                    +  IPG +  G   +PL+ LMYTSGS+GRPKGA Y E L+           A E+P + +GFLPLNH++GR T+ +    GG  +FVR +DMST F+D A  +PT+ M  PRI NM ++ +      + Q   L P  S     +   QQ  D ++R      R+   GGRL     G+AP+S  V   ++E    P V GYGSTE G I ++N I  + V  +KL+DVPELGYT KD PFPRGEL ++T   IPGYYKHPEAT++L D EGF  TGD++EQR     +W+DR KN++KL+QGEYVSVSRLE ++ G++  I  +Y+YGNSLR++++ VVVP          G  + +G          +++  LR  LD VA+   L  +E+PR+FIV++  F+++N LLTDS K AR +LK+ +   LE +YTA+EER                      ALE+TLGL EE   D    +FAQLGGDSL+A++   ++   CGV +PV+ +L+ + ++Q++ + +    E+  G    +      +E +HG       I A DL +++F+  A             AA A     E       VLLTGANGFLGRFLL                                IVR  +D  A  RL   +++  A+LL   D   + +   AGD+++P+LGL+ +  Y SL  ++D ++H GALVNH  SY +LF PNVLG+  +M+  L    + K L FIS+V V  G   P P VTE   G     +   G   YA GYG SKWA EVLL++L++  G+PV  +RC MIL HTS  GQIN +D FTRL+ G+ YTG+AP SF T   G   H+DG  +DFV+G I A             G   +HVVNPH   G SLD IV W  SAGYPV  +APY++W+  FK+ LEAL   ++ QSPLP+++QW+ P       +     DAT  R      T+  DVP L E ++H  + H++ + LI
Sbjct:   35 LARAREVIKQDPQLQAAKFDRKSLQRITDAGNTSIEIIAAMFKEYASRDLFGACTPGEST---FHTVTYGAVWERIQALVAGWTALGFVAPGDFVGISGFASVDWVVSDLATLHAGGVMVPLPTNILAEDVRAIIDEAEVRCLMVSAEELAAIAPVI----GGCASVKAVIVMDSSTDAVTSSGAYAEMQANLPAG-AKLTTIDEVLAAGRATGKQP---------ALVIPGRD--GRPADPLVNLMYTSGSSGRPKGAEYPEHLIFDFLKNSMPTDAPELPTIIMGFLPLNHLMGRFTLLKCLLTGGQNWFVRSTDMSTFFDDLATIRPTEAMFPPRIMNMLHDRF------VEQLDRLPPAPS----EAERAQQRQDLIKR-----FREVDLGGRLFTGSFGSAPASPDVIQWLEEVLGFPPVNGYGSTEGGMIMLDNKIQHSYVPAYKLVDVPELGYTTKDKPFPRGELRIKTRRMIPGYYKHPEATADLFDEEGFLKTGDVVEQRDADTFIWLDRVKNIIKLSQGEYVSVSRLEEIYVGNSKLIHQMYIYGNSLRAYLVAVVVPH------IENGACADAG----------KLRAALRTELDDVARRKALQGYEIPREFIVEMRPFSKDNHLLTDSAKPARGQLKKRYQAELEGLYTALEERXXXXXXXXXXXXXXXXXXXXXQALEVTLGLAEEDMADVASRSFAQLGGDSLAAIQFARYVGELCGVNLPVSFVLDHSHSLQAIADRVH---ELVSG----DASAGITFESIHGSDGVN--IKAADLKLDRFLSEADTA----------AAAAAAPASELPARPTHVLLTGANGFLGRFLL---------------LDLLQRGSDKNGGRVVAIVRGSSDEKAAERLRAGFDSGDATLLQRYDTLSKHLTVYAGDLAKPQLGLS-QGVYESLCAELDTIVHNGALVNHAYSYEQLFEPNVLGSVEVMRMALA--KRRKALTFISSVGVVGGLDHPQP-VTEAEDGPTLC-DVHPGDGGYAIGYGCSKWAVEVLLKELHQRWGVPVKVFRCGMILSHTSYLGQINPTDFFTRLLCGIAYTGIAPQSFYTLPHGPEEHFDGMPIDFVSGVISATT------AAERSGFDTYHVVNPHWSDGVSLDRIVDWAESAGYPVNRIAPYEQWYAQFKAALEALDHTRQQQSPLPIIYQWERPASGTSGTK----YDATQLRKRAAAYTQWKDVPHLDEAFIHQNMRHLTTLRLI 1187          
BLAST of NO03G04530 vs. NCBI_GenBank
Match: WP_104977091.1 (oxidoreductase [Sorangium cellulosum] >AUX39065.1 oxidoreductase [Sorangium cellulosum])

HSP 1 Score: 752.7 bits (1942), Expect = 1.900e-213
Identity = 490/1259 (38.92%), Positives = 691/1259 (54.88%), Query Frame = 0
Query:   16 NLARLATLLPHDPELQAYQPSP-SVQSLVDAAPTTIEIIQQLCQGYAERVALK----WCSDGAMTSDYTY-TMTYKEVWEKVQTMATVLSTKGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMDYGGEEKGQAWVAKLRSLLPASVVRVLTMQELXXXXXXXXXXXXXXXXXXFVSPAIPGAEGWGEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAGEIPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQLKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFSG-SADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGGKEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQA-NPDASITEQVCAALEMTLGLGEEVGLDATFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLEAKGEMKDGGKEDEGGGEE--IYERVHGPKQAAPLIFAKDLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARLWEAVGA--SLLDPNEE-----RVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSFTDGEGQAPHYDGSSVDFVAGAIVAIALEGGRGGGREKGCRLFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTPLPSRKKMEELLVLDATCFRDAVKRVTKRGD--VPVLSERYLHTCLYHMSVVGLI 1256
            +L R    L  D ELQ   PSP +V+ L     ++IEI+   C  +A+R AL        DG +     Y T++Y E+W +VQ  A+ L   G +   + VA+ GFAS  W++ D   LYL AV VPL   +P  DL+ +L E   R +  SAE+ +++A  +     +CP+V  ++VMD   EE+  A     R+   A+ +  L  +                     + P    +E    EP+PL+ ++YTSGSTG PKGAM+ E+L+RA W          +  +S+G++PLNH  GR  + +S  +GGV + V  SDMSTLFED   ++PT+ + VPR++ M ++ YQ   AEL++  G    S  +            R+E    A +R    G RL++ + GTAP++  + + ++   E+P+ +GYGSTE G I+ +  ++   VL +K+ DVPELGY   D P PRGEL V+    +PGYY++ +AT +L D EG+  TGDI+E  GE  +VW+DR+KNVLKLAQGE+VS SRLE +++  S  I  IY++G+SLR+++L VVVP +  VE A  G  +S G           IK L+R  LD +A+ AGL   EVPR+ +++   FTREN LLT SNK AR +LK+ +G RL+ M+  +E  + ++L  L+     AS  EQV  A+E+TLGL  +V L  TF +LGGDSL+AVRL+  L+   GV VPV  +L+PT++ Q LV ++E          E  GG      + +VHG    A ++ A DL +E+F+    L         D AA++         G++  LLTGANGFLGRFLL                               C+VRA +DA A ARL  A  A  +L    EE     R+ ALAGD+ +PRLGL+    +  L ++VD ++H GALVNH  SY +LF PNVLGT  I++  L    + K + ++STV VA G   P   V E    +   KER    S YA GY ASKWA EVLL DL +  G+PV  +RCSMI+P     G++N SD  TRL+ G++YT VAP SF  GEG A H+DG  VDFVAG+I A+  +         G   +HV NPH     SLD +V W+R+AGY V  +  Y  W+  F+ +LEAL   +R +SPLP+L QW  P+    + +   +      R+   R   R D   P L+E YLH  L  M   G+I
Sbjct:   19 SLDRYQRRLEQDLELQRSAPSPEAVERLASGDRSSIEILALACSLHADRPALGARAFTVEDGVLRYLPRYQTLSYAELWARVQQFASGLRHGGLVQPGDRVAISGFASVDWLVADFACLYLAAVSVPLQTGMPAADLQQILGEVEPRAIVCSAEQLDAIAPAL----ASCPAVRSLIVMDL--EERDLA-----RARAVAARMEALREEHGQRLALFTVADVARIGRQHGIVPHAILSEARAGEPDPLMAILYTSGSTGTPKGAMFPESLVRAQWRAQARGRVSPVASISVGYMPLNHAAGRFEVMRSIMEGGVTHLVLASDMSTLFEDIRISRPTRFLFVPRVSAMIHQHYQ---AELVR-RGAGRASGAAG-----------RIEAEIMAEMRRSFLGDRLVYGVAGTAPTAPEIIDFLERCFEIPIYDGYGSTEAGVISFDGRLSREDVLAFKIADVPELGYRATDAPHPRGELRVKMRRHVPGYYRNAQATRDLFDEEGYLQTGDIVELHGEDEIVWLDRKKNVLKLAQGEFVSTSRLEGVYAAMSPFIQQIYVHGSSLRAYLLAVVVPDQRAVE-AHLGPAASEGA----------IKQLVRGELDRIAREAGLPRWEVPREILIEASPFTRENGLLTASNKPARPKLKERYGARLDRMFEEIERTQLEKLQQLEGQRAGASAAEQVALAMEVTLGL-RDVDLRQTFVELGGDSLAAVRLSTMLEERFGVAVPVGLILDPTSSAQRLVAHVE----------ERAGGAARAVTFSQVHG--AGATVVRAADLRLERFLAPEEL---------DEAARS----AAPASGVRAALLTGANGFLGRFLL----------------LELLERAPRDGGKVFCVVRAASDAEALARLGAAYRADPALRRRFEELSADGRLEALAGDLMKPRLGLS-GAVFDRLCDEVDGIVHNGALVNHAFSYPQLFEPNVLGTAEIIRLALR--RRRKAVGYVSTVGVAGGR-DPRDPVREDEDAQSLWKERPT-DSGYAVGYAASKWAGEVLLHDLAQRFGVPVGVFRCSMIMPDRRCVGEVNTSDFLTRLLAGIVYTAVAPRSFYAGEG-AHHFDGLPVDFVAGSIAAVVTD------LRAGFATYHVTNPHWEDAVSLDTMVDWIRTAGYEVSRIDEYARWYEAFRERLEALSQAQRQRSPLPILRQWARPIGRELRFDTARLEQR--LREIAARPGARVDAAAPHLTEEYLHKYLRDMVAAGVI 1184          
BLAST of NO03G04530 vs. NCBI_GenBank
Match: APR83696.1 (Long-chain-fatty-acid--CoA ligase [Minicystis rosea])

HSP 1 Score: 743.0 bits (1917), Expect = 1.500e-210
Identity = 478/1272 (37.58%), Positives = 705/1272 (55.42%), Query Frame = 0
Query:   16 NLARLATLLPHDPELQAYQPSPSVQSLVDAAP-TTIEIIQQLCQGYAERVAL----KWCSDGAMTSDYTY-TMTYKEVWEKVQTMATVLSTKGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMDYGGEE--KGQAWVAKLRSLLPA--SVVRVLTMQELXXXXXXXXXXXXXXXXXXFVSPAIPGAEGWGEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAGEIPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQLKKAELLQAA-GLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFSG-SADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGGKEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQ-ANPDASITEQVCAALEMTLGLGEEVGLDATFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLEAKGEMKDGGKEDEGGGEEI-YERVHGPKQAAPLIFAKDLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARLWEAV--GASLLDPNEE-----RVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSFTDGEGQAPHYDGSSVDFVAGAIVAIALEGGRGGGREKGCRLFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTPLPSRKKMEELLVLDATCFRDAVKRVTKRGDV-PVLSERYLHTCLYHMSVVGLIGATPVVAEAA 1266
            +L R    +  DPELQ   P P   + + + P +TIE +   C  YA+R AL        DG   S   + T++Y ++W +VQT A+ L   G +   + VA+ GFAS  WV+ D+  LYL AV VPL   +P  DL+ +L E+  R V  SAE+ +++A  +     A P+V  ++V+D+  ++  + +A  A++ +L  A    + +LTM+++                   + P++  AEG   E +PL+ ++YTSGSTG PKGAM+ E+LMR  W          +  +++ ++PLNH  GR  + +S   GGV++FV++SDMSTLFED    +PT+ M +PR++ M Y+ YQ   AEL++ A GL+           +++++   +    R+++     G RL++ + GTAP++  + + ++ + E+P+++GYGSTE G I+ +  I    VL +KL DVPELGY   D P PRGELLV+    +PGYY++ +AT++L DA+G+  TGDI+   G   +  IDR+KNVLKL+QGE+VS +RLE L++  S  I  IY++G+S+R+++L VVVP+   V  AR G   S             +K L+R  +D +A+  GL   EVPRDFI++   FTREN LLT SNK +R +LK+ +G +L  M+  +E  + ++L  L+      S++EQV  A+E+TLG+G +V    +F  LGGDSLSAVRL+  L+   G  VPV  +L+PT+ +QSLV+Y+EA+           G    + +  +HG    A ++ A DL +++F+ A  +        +   A+              V LTGANGFLGRFLL                               C+VRA +DA A  RL       + L    +E     R+VALAGD+ +PR GL+ EE    L ++VD ++H GALVNH  SY++LF PNVLGT  I++  L    + K + ++STV VA G     P V E        KER    S YA GY  SKWA EVL+ D      +P+  +RCSMI+PH    GQ+N  D  TRL+ G++YT  AP SF +  G A H+DGS VDFVA +I ++A+   RG     G   +HV NPH   G SLD  V W+RSAGYPV  +  Y  W+  F+ +LEALP+ ++ +SPLP++ QW  P  S ++  +   L A     A +   K   V P L+E Y+H  L  M  V +I A    AE A
Sbjct:   14 SLDRHQRRIAQDPELQRSLPVPEAIAKLSSEPRSTIETVALACSLYADRPALGERASVVEDGKRRSLARFDTLSYADLWSRVQTFASGLHHGGLVEPGDRVAISGFASIDWVVADLACLYLAAVSVPLQTGMPAVDLRQILSETEPRAVVCSAEQLDAIAAAI----SASPAVRSIIVIDHDEKDSARHEALTARMEALREAHGKELVLLTMEDVARIGRQHG-----------LVPSVDPAEG-RNEADPLMAILYTSGSTGTPKGAMFPESLMREQWRIQSKERVPNVAAINICYMPLNHAAGRFEVMRSLMHGGVLHFVQKSDMSTLFEDIRIVRPTRFMFIPRVSAMIYQHYQ---AELVRRAFGLA---------DDARERMAAEISAEMRSFL-----GDRLVYGLTGTAPTAPEIVSFLERSFEIPIIDGYGSTEAGIISFDGRIAHDEVLDFKLADVPELGYRRSDKPHPRGELLVKMRQHVPGYYRNEKATNDLFDADGYLQTGDIVALHGRDEIELIDRKKNVLKLSQGEFVSTARLEGLYAAQSPFIQQIYVHGDSMRAYLLAVVVPNREAV-AARLGAAPSE----------HAMKHLIRGEVDRIAREEGLQRWEVPRDFILETAPFTRENGLLTASNKPSRPKLKERYGAKLGQMFAEIERTQIEKLEKLERERRSGSVSEQVMLAMEVTLGIG-DVDAGQSFLALGGDSLSAVRLSSILEERFGFAVPVGLILDPTSNVQSLVKYVEARA---------SGSARVVSFAEIHG--AGATVVRASDLKLDRFLTAEEIEAVARAAPVSPDARV-------------VFLTGANGFLGRFLL----------------LDLLGRLPQKGGKVVCVVRAGSDAEALERLHAGYQSDSGLHQRFQELSAKGRLVALAGDLMKPRFGLS-EEVMNKLAQEVDTIVHNGALVNHAFSYQQLFEPNVLGTVEILRLAL--QKRRKRIAYVSTVGVAAGRDERTP-VRESEDALSLWKERPT-NSGYAVGYATSKWAGEVLMHDAATRFEVPIGIFRCSMIMPHRRYVGQVNTGDFLTRLLAGVVYTHTAPRSFYE-SGGAHHFDGSPVDFVAQSIASVAVAIERGA----GVATYHVTNPHWGDGVSLDTFVDWIRSAGYPVMRIDDYGRWYEAFQERLEALPAAQKQRSPLPIIQQWARPARSDQRF-DTTELQARLRALAARPKAKVDAVFPQLTEAYMHKYLEDMMAVHIISAPERGAEVA 1189          
BLAST of NO03G04530 vs. NCBI_GenBank
Match: OJY19541.1 (hypothetical protein BGO98_14400 [Myxococcales bacterium 68-20])

HSP 1 Score: 741.1 bits (1912), Expect = 5.700e-210
Identity = 489/1274 (38.38%), Positives = 695/1274 (54.55%), Query Frame = 0
Query:   19 RLATLLPHDPELQAYQPSPSVQSLVDAAPTTIEIIQQLCQGYAERV-----ALKWCSDGAM-TSDYTYTMTYKEVWEKVQTMATVLSTKGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMDY-GGEEKGQAWVAKLRSL-LPASVVRVLTMQELXXXXXXXXXXXXXXXXXXFVSPAIPGAEGWGEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAG----EIPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQLKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFSG-SADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGGKEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQAN--PDASITEQVCAALEMTLGLGEEVGLDA-----TFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLEAKGEMKDGGKEDEGGGEEIYERVHGPKQAAPLIFAKDLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARLWEAVGASLLDPNEE----------RVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSF--TDGEGQAPHYDGSSVDFVAGAIVAIALEGGRGGGREKGCR----LFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTPLPSRKKMEELLVLDATCFRDAVKRV-TKRGDVPVLSERYLHTCLYHMSVVGLI 1256
            RL  LL HDPEL   QP     + + A+ TTIE + +  + YA+R      A +  +DG++          Y++VW +V+  A+ L+ +       F+ +CGF S  WV+ D+ ++ L AV VP+  N+   DL+ ++ E+   ++  S E+ +S+   VL +   CPSV+ +VVMD   G+   +A   + +   L  S   VLTM+E+                   V   +P + G   EP+PL+ L+YTSGSTG PKGA+ TE L+R  W  GF +  G    E+P ++LG++PLNH  GR+ +  +  +GG++ FV +SDMSTLF+DF  A+PT  M+VPR++   Y+ YQ   AEL++      T   +      + ++ DR+       +R    G RLL     TAP+   V++ ++    +P+++ YGSTE G +T+++ +  A  L+WKL+DVPELGY   D P+PRGEL V++   +PGYYK+  AT +L D EGF +TGDI+EQRG   L+WIDR +NVLKLAQGE+V+ SRLE +FS  S  I  IY++G+  RS++L VVVP+   V           G G E  + A  IK L+R  +  +A+   L  HEVPRDFI++ + FT    LLTDSNK +R RL   +G  L A YTA+E  + + L  L +     A+I E+V  A+ + LGL E   LD      +F QLGGDSLSAV L   +    GV VPV  LL+PT+++ SLVEY+E       G    +      +  VHG    A  + A+DL +EKF+    +           AA+A +   E     +  LLTGANGFLGRFL                                 +VRA +DAAA  RL     +  +DP             R+  LAGD+ +PRLGL  E+ Y  L E+VDLVLH GALVNH L Y  +F PNVLGT  +M+F L    + K + FIST+AV  G     P V E    +    ER    + YA GYG +KWA E+LL+D + + GLPV  +R S I+ H    GQ+NV D FTRL+ G++YTG+AP SF   D   +A HYDG+ VD VA +I  +++  G G       R     +HVVNPH+  G SLD IV WV++AGYP + +A Y  W+ +F+ +L +L   KR  SPL +LH W+ P     +     VLD T   + ++ +     D P +SE ++H  L  M+V+ LI
Sbjct:   26 RLKRLLAHDPELAVSQPDAEAVAAIHASATTIETVAKAFELYADRPFVAERAYECEADGSVRLLPELRRHRYRDVWARVEAFASGLTHQKLAGPGSFIGICGFGSVDWVVADLAAIRLAAVSVPMQTNMSPADLQQIIGEAELSLIVCSVEQLDSIEA-VLPR---CPSVKSLVVMDLREGDSAAEAMFERRKQAGLAESGAVVLTMREV----------EERGRSAGIVPMVLPASRG---EPDPLMTLIYTSGSTGTPKGAIVTERLLREQWQRGFFYRLGDALPELPQITLGYMPLNHAAGRMNVMMTVLRGGMMAFVAKSDMSTLFDDFRLARPTMAMLVPRVSATIYQHYQ---AELVRR-----TEDVTD--EGERARISDRIMEE----MRGSFLGDRLLLVTTSTAPTPPEVADFLRRCFLVPVIDLYGSTEAGLVTLDDRLVPAPGLEWKLVDVPELGYRTTDRPYPRGELHVKSRFLVPGYYKNERATRDLFDDEGFLNTGDIVEQRGPDRLLWIDRARNVLKLAQGEFVATSRLEGIFSAQSPYIRQIYVHGSGFRSYLLAVVVPNLPAVTAYLR------GRGIEPDDAA--IKELVRSEIHRIARDEHLRGHEVPRDFIIEREPFTIARGLLTDSNKQSRPRLAARYGKDLAARYTAIERAQIEELYGLHSKGVTTATIAERVKRAMSVVLGLPE---LDVRQSEQSFIQLGGDSLSAVSLETLIHDLTGVRVPVGFLLDPTSSVHSLVEYVE-------GALAGKVRRNVTFAEVHG--AGAKSVRAEDLRIEKFLGPEEIE----------AARASKPASELPARAEVALLTGANGFLGRFL----------------TLELLERLSGERKKVYALVRAPSDAAAFERL---ASSYRMDPALSRRFDELSAGGRLTVLAGDLMKPRLGL-AEDVYARLAEEVDLVLHNGALVNHALGYAAMFEPNVLGTVEVMRFALA--RRIKSMSFISTIAVLYGVDRTEP-VREDEDVRTLFTERPT-EAGYAAGYGGTKWAGELLLRDAHEKLGLPVAVFRPSEIMAHRRYHGQVNVPDFFTRLLAGIVYTGLAPRSFYTADAPERAKHYDGTPVDVVARSIATLSIARGGGEAARPAVRATYDTYHVVNPHQDDGISLDVIVHWVKTAGYPAKRIADYDTWYKMFRERLTSLAEPKRQHSPLAILHAWEHPQGDHGQP----VLDTTHILERLRSIEPSLADFPHVSEAFIHKELDDMAVLHLI 1210          
BLAST of NO03G04530 vs. NCBI_GenBank
Match: KYG08588.1 (hypothetical protein BE21_22840 [Sorangium cellulosum])

HSP 1 Score: 709.1 bits (1829), Expect = 2.400e-200
Identity = 485/1293 (37.51%), Positives = 690/1293 (53.36%), Query Frame = 0
Query:   16 NLARLATLLPHDPELQAYQPSPSVQSLVDAAPTTIEIIQQLCQGYAERV--------ALKWCSDGAMTSDYT---YTMTYKEVWEKVQTMATVLSTKGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMD-YGGEEKGQAWVAKLRSLLPASVVRVLTMQELXXXXXXXXXXXXXXXXXXFVSPAIPGAEGWGEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAGE----IPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQ---LKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFS-GSADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGGKEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQ----ANPDASITEQVCAALEMTLGL-GEEVGLDATFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLE--AKGEMKDGGKEDEGGGEEIYERVHGPKQAAPLIFAKDLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARLWEAVGA--SLLDPNEE-----RVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSFTDG--EGQAPHYDGSSVDFVAGAIVAIALE----GGRGGGREKGCRLFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTPLPSRKKMEELLVLDATCFRDAVKRVTKRG------DVPVLSERYLHTCLYHMSVVGLIGATPVVA 1263
            +L R + L+  D EL+   PSP+    + +  +TIE +    + YA+R         A    +DG     Y      ++Y +VW +V+  A+ L  +      +FV + GF S  WV+  +  +YL AV VPL  ++   DL+ ++ E+    V  S  +   +  ++L +   CPSV  VVVMD   G+  GQ+ + + R  L     R L  +                         +P   G   EP+PL+ LMYTSGSTG PKGAM  E+L R  W   F+         +P V L + P+NH +GR  + +S  +GG+ +FV +SDMSTLFED   A+PT   +VPRIA + ++ +Q   L++A  L A G                    R+ER   A +R  + G RLL A +G+AP+   V + ++   ++P+ EGYGSTE   +T +  ++   V ++KL+DVPELGY+  D P PRGEL +R++  +PGYYK+ +AT  L D EG  +TGDI+EQRG   +VWIDR +NVLKL+QGE+V+ SRLE L+S GS  I  I+LYGNS RS++L VVVP   L E+      S+    ++ +   E ++ LLR  +D +A+   L  +E+PRDF+++   FTR + LLT++ K ARARLK  +G RLE +Y  +E  + + L SL+    A P AS    V  ALE TLG+ G E     +FAQLGGDSLSAVRL+  ++   GV VPV  +LNPT++++++V++LE    GE               ++ VHG    A ++ A DL +++F+    L           AA+            +  LLTGANGFLGRFL                                C+VR+ +DA A  RL     +  +LL+  +      R+V LAGD+ +PR GL  ++ Y  L  +VD V+H GALVNH LSY +LF PNVLGT   ++  L    + K ++++ST+A   G    GP   +  I + W +  R  G+ YA GY  SKWA+EVLLQD +   GLPV  YR S I+ H     QINV D FTRL+ G++YTG+AP SF +G    +A HYDG  VD VA +I A+A++     G  G R +    +HVVNP+   G SLD IV WVRSAGYPVE V  Y  W+  F+ +L  L    R  SPLP+L  W+   P+R   E   V DA      ++++  RG       +P +SE  +H  L  M  +GLIG   V A
Sbjct:   29 SLERCSRLVQTDEELRRALPSPAALEKIRSCNSTIESVATAFELYADRPCVGHRPLDAAATAADGGSAPRYLPEFRAVSYADVWSRVEAFASGLQHEKLATTGDFVGISGFGSTDWVVAGLACMYLSAVSVPLQTDLTPADLELIVTEAELACVVCSVGQLARIE-DILPR---CPSVRSVVVMDLLEGDRCGQSELERARRAL-----RPLEARGRRLAVRAMHEVERLGRQQGIAPKVLPAQRG---EPDPLMTLMYTSGSTGSPKGAMVPESLWRRYWQLAFTRSQDPRLDLLPHVGLNYSPMNHFIGRSQVGRSLMRGGITHFVLKSDMSTLFEDLRLARPTTLFLVPRIAELIHQQFQAEVLRRARALGAGGGDAARR--------------RIEREIMAEMRGSLLGDRLLHATIGSAPTPPEVLSFLKRCFDVPIFEGYGSTEASSLTTDGRLDRELVTEFKLVDVPELGYSAADQPCPRGELHIRSSLMVPGYYKNEKATRALFDEEGLMNTGDIVEQRGPDTVVWIDRARNVLKLSQGEFVATSRLEVLYSAGSPFIQQIFLYGNSTRSYLLAVVVPE--LREI------SAHLRQRDVKPDGEPVRQLLRAEIDRIARENQLRGYEIPRDFLIEPAPFTRASGLLTETQKPARARLKARYGARLEELYATIERTQLEELRSLREGGGATP-ASAALAVKKALEATLGITGVEPRSARSFAQLGGDSLSAVRLSRLIEEISGVAVPVGLVLNPTSSVRAIVDHLEHALAGEAPRRAAR--------FDEVHG--AGAEVVRAADLRLDRFLGPDELA----------AARRSTPAAALPAQARVALLTGANGFLGRFL-----------------ALELLERLPEEGRLYCVVRSPDDALAFDRLRATYESDPALLERFDALSAHGRLVVLAGDLVEPRFGL-ADDLYAHLCVEVDCVVHNGALVNHALSYPQLFEPNVLGTVEAIRLSLA--HRVKSMNYVSTIAAVGGLDRSGPIREDEDIRELWPE--RALGAGYAVGYATSKWASEVLLQDAHDALGLPVNVYRPSGIMAHRLYRSQINVPDFFTRLLCGVVYTGLAPRSFYEGGRPHRAGHYDGLPVDAVARSIAAVAVDRRYPAGDAGERARRA-TYHVVNPYWDDGISLDVIVSWVRSAGYPVERVDDYAAWYAAFRDRLMQLSEPLRRHSPLPILDAWER--PARADGE---VFDAERLLARLRQLAARGGAGDLATLPHVSEPLIHKYLDDMVALGLIGPAAVRA 1238          
BLAST of NO03G04530 vs. NCBI_GenBank
Match: WP_061606881.1 (hypothetical protein [Sorangium cellulosum] >KYF71631.1 hypothetical protein BE15_40940 [Sorangium cellulosum])

HSP 1 Score: 703.7 bits (1815), Expect = 1.000e-198
Identity = 493/1300 (37.92%), Positives = 690/1300 (53.08%), Query Frame = 0
Query:   16 NLARLATLLPHDPELQAYQPSPSVQSLVDAAPTTIEIIQQLCQGYAERVAL-KWCSDGAMTS----------DYTYTMTYKEVWEKVQTMATVLSTKGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMDYG-GEEKGQAWVAKLRSLLPA-----SVVRVLTMQELXXXXXXXXXXXXXXXXXXFVSPAIPGAEGWGEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWG----AGEIPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQLKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFS-GSADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGGKEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQAN-----PDASITEQVCAALEMTLGLGEEVGLDA-TFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLE---AKGEMKDGGKEDEGGGEEIYERVHGPKQAAPLIFAKDLMVEKFI-PAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARLWEAVGA-----SLLD--PNEERVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSFTDG--EGQAPHYDGSSVDFVAGAIVAIALEGGR---GGGREKGCRLFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTPLPSRKKMEELLVLDATCFRDAVKRVTKR---GD---VPVLSERYLHTCLYHMSVVGLIGATPVVAEAA 1266
            +L R A LL  D EL+   PSP     + +  T IE +      YA+R  L +   D A                 T +Y +VW +V+  A+ L         + V + GF S  WV+ D+  LYL AV VP+  ++   DL++++ E+    +  S ++   +   VL +   CPSV  VVV+D   G+ + Q  + + R  L A       + V T QE+                   V   +P   G   E +PL+ LMYTSGSTG PKGAM  E+L R  W   F+         +P V + + P+NH +GR  + +S  +GGV +FV RSDMSTLFED   A+PT   +VPRIA M ++ +Q   AELL+              S       +R+E    A +R  + G RLL A V +AP+   V + ++   E+P+++GYGSTE   +T ++ ++   V ++KL+D+PELGY   D P+PRGEL +R+   +PGYYK+ +AT EL D EG  +TGDI+EQRG   LVWIDR +NVLKL+QGE+V+ SR+EAL+S GS  I   +LYGNS RS++L VVVP    ++ A    R     G E  + A  +K LLR+ +D +A+   L  +E+PRDF+V+   FTREN LLT++ K ARARL+  +G RLE MY A+E  + ++L  L        P A +   V  ALE TLGL +     A +FAQLGGDSL+A R +  ++   GV VPV  +L+PT+ ++++VE+LE   A G  +     DE         VHG    A ++ A D+ +++F+ P  +     +  A    A+A           +  LLTGANGFLGRFL                                C+VR+RNDA A  RL  A  +     S LD      R+V LAGD+ +PR GL  ++ Y  L  +VD V+H GALVNH LSY +LF PNVLGT   ++F +    + K ++++ST+A   G G  GP   +  +   W  ER I  S YA GY  SKWA+E+LL+D     GLPV  YR S I+ H    GQINV D FTRL+ G++YT +AP SF       +A HYDG  VD VA +I AIA++  +   G G       +HVVNPH   G SLD IV WVRSAGYP+  V PY  W+  F+ +L  L   +R  SPL +L  W+ P      + +  V DA      ++++  R   GD   +P +SE+ +H  L  M  +G+IG  P  A AA
Sbjct:   29 SLERCARLLATDEELRRALPSPEALEQIRSRQTAIESVATAFALYADRPCLGRRALDVAAAEPDGGGSPRPLPEFRTASYADVWSRVEAFASGLRHDRLADTGDLVGIMGFGSTDWVVADLACLYLSAVSVPMQTSMIPADLQHIVAEAELACIVCSVDQLARIEA-VLPR---CPSVRGVVVIDLADGDRRAQDELDRARRALRALETGGRRLAVRTQQEV----------ERLGRQHGIVPKVLPEERG---ERDPLMTLMYTSGSTGSPKGAMIPESLWRRYWQQAFTRSHDPRLDVLPHVGINYSPMNHFIGRTQVGRSLMRGGVTHFVLRSDMSTLFEDIRLARPTALFLVPRIAEMIHQQFQ---AELLKR-------------SRDLDIAQERIEHEIMADMRASLLGDRLLLATVASAPTPPEVLSFLKRCFEVPVIDGYGSTEAASLTFDDRLDREFVTEFKLVDMPELGYRTADEPYPRGELHLRSLLMVPGYYKNEQATRELFDEEGLMNTGDIVEQRGPDTLVWIDRARNVLKLSQGEFVATSRIEALYSAGSPFIHQAFLYGNSARSYLLAVVVPD---LQAASAHLRRR---GAEPDDAA--VKQLLREEIDRIARDNQLRGYEIPRDFLVEGAPFTRENGLLTETQKPARARLRARYGARLEEMYAAIERTQIEQLRGLHEELGATAPPADVV--VKRALEATLGLADVEPRGAQSFAQLGGDSLTAARFSRVVEDLSGVAVPVGLVLDPTSGVRAIVEHLERALASGAARRPATFDE---------VHG--AGADVVRAADVRLDRFLGPDELASAARSTPAAALPARA-----------RVALLTGANGFLGRFL-----------------ALELLQRLPEEGRLYCVVRSRNDALAFDRLRAAYASDPALVSELDALSGHGRLVVLAGDLMKPRFGLP-DDLYAHLCSEVDCVVHNGALVNHALSYPQLFEPNVLGTVEAIRFAVA--RRVKAMNYVSTIAAVGGLGRRGPIREDEDLRALW-PERPI-DSGYAVGYATSKWASELLLRDARDTLGLPVNVYRPSSIMAHPLCRGQINVPDFFTRLLCGLVYTALAPRSFYQDVRPDRAGHYDGLPVDVVARSIAAIAVDHQQPPAGTGERASHATYHVVNPHWDDGISLDVIVAWVRSAGYPIGRVDPYAAWYAAFRDRLMQLGERQRHLSPLAILPAWEHP-----ALVDGEVFDAERLLARLQQLAAREGSGDLAALPHVSEQLIHKHLDDMVALGVIG--PAAARAA 1234          
BLAST of NO03G04530 vs. NCBI_GenBank
Match: AGP42038.1 (hypothetical protein SCE1572_50595 [Sorangium cellulosum So0157-2])

HSP 1 Score: 702.2 bits (1811), Expect = 2.900e-198
Identity = 480/1293 (37.12%), Positives = 687/1293 (53.13%), Query Frame = 0
Query:   16 NLARLATLLPHDPELQAYQPSPSVQSLVDAAPTTIEIIQQLCQGYAERV--------ALKWCSDGAMTSDYT---YTMTYKEVWEKVQTMATVLSTKGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMD-YGGEEKGQAWVAKLRSLLPASVVRVLTMQELXXXXXXXXXXXXXXXXXXFVSPAIPGAEGWGEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAGE----IPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQ---LKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFS-GSADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGGKEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQ----ANPDASITEQVCAALEMTLGL-GEEVGLDATFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLE--AKGEMKDGGKEDEGGGEEIYERVHGPKQAAPLIFAKDLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARLWEAVGA--SLLDPNEE-----RVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSFTDG--EGQAPHYDGSSVDFVAGAIVAIALE----GGRGGGREKGCRLFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTPLPSRKKMEELLVLDATCFRDAVKRVTKRG------DVPVLSERYLHTCLYHMSVVGLIGATPVVA 1263
            +L R + L+  D EL+   PSP     + +  TTIE +    + YA+R         A    +DG     Y      ++Y ++W +V+  A+ L  +       FV + GF S  WV+  +  +YL AV VPL  ++   DL+ ++ E+    V  S  +   +  ++L +   CPSV  VVVMD   G+  G + + + R  L     R L  +                     +   +P   G   EP+PL+ LMYTSGSTG PKGAM  E+L R  W   F+         +P V L + P+NH +GR  + +S  +GG+ +FV +SDMSTLFED   A+PT   +VPRIA + ++ +Q   L++A  L A G                    R+ER   A +R  + G RLL A +G+AP+   V + ++   ++P+ EGYGSTE   +T +  ++   V ++KL+DVPELGY+  D P PRGEL +R++  +PGYYK+ +AT  L D EG  +TGDI+EQRG   +VWIDR +NVLKL+QGE+V+ SRLE L+S GS  +  I+LYGNS RS++L VVVP   L E+      S+    ++ +   E ++ LLR  +D +A+   L  +E+PRDF+++   FTR + LLT++ K ARARLK  +G RLE +Y  +E  + + L  L+    A P AS    V  ALE TLG+ G E     +FAQLGGDSLSAVRL+  ++   GV VPV  +LNPT++++++ ++LE    GE               ++ VHG    A ++ A DL +++F+    L           AA+            +  LLTGANGFLGRFL                                C+VR+ +DA A  RL     +  +LL+  +      R+V LAGD+ +PR GL  ++ Y  L  +VD V+H GALVNH LSY +LF PNVLGT   ++  L    + K ++++ST+A   G    GP   +  I + W +  R  G+ YA GY  SKWA+EVLLQD +   GLPV  YR S I+ H+    QINV D FTRL+ G++YTG+AP SF +G    +A HYDG  VD VA +I A+A++     G  G R +    +HVVNPH   G SLD IV WVRSAGYPVE V  Y  W+  F+ +L  L    R  SPLP+L+ W+   P+R   E   V DA      ++++   G       +P ++E  +H  L  M  +GLIG   V A
Sbjct:   29 SLERCSRLVQTDEELRRALPSPVALEKIRSCHTTIECVATAFELYADRPCIGHRPLDAAATAADGGSAPRYLPEFRAVSYADMWSRVEAFASGLQHEKLANTGNFVGISGFGSTDWVVAGLACMYLSAVSVPLQTDLSPADLELIVAEAELACVVCSVGQLARIE-DILPR---CPSVRSVVVMDLLEGDRCGHSELERARRAL-----RPLEARGRRLSVRPMHEVERLGRQQGILPKVLPAQRG---EPDPLMTLMYTSGSTGSPKGAMVPESLCRRYWQLAFTRSQDPRLDLLPHVGLNYSPMNHFIGRSQVGRSLMRGGITHFVLKSDMSTLFEDIRLARPTTLFLVPRIAELIHQQFQAEVLRRARALGAGG--------------DDAARRRIEREIMAEMRGSLLGDRLLHATIGSAPTPPEVLSFLKRCFDVPVFEGYGSTEASSLTTDGRLDRELVTEFKLVDVPELGYSATDQPCPRGELHIRSSLMVPGYYKNEKATRALFDEEGLMNTGDIVEQRGPDTVVWIDRARNVLKLSQGEFVATSRLEVLYSAGSPFLQQIFLYGNSTRSYLLAVVVPE--LREI------SAHLRQRDVKPDGEPVRQLLRAEIDRIAREHQLRGYEIPRDFLIEPAPFTRASGLLTETQKPARARLKARYGARLEELYATIERTQLEELRGLREGGGATP-ASAALAVKKALEATLGITGVEPRSARSFAQLGGDSLSAVRLSRLIEEISGVAVPVGLVLNPTSSVRAIADHLEHALAGEAPRRAAR--------FDEVHG--AGAEVVRAADLRLDRFLGPDELA----------AARRSTPAAALPAQARVALLTGANGFLGRFL-----------------ALELLERLPEEGRLYCVVRSPDDALAFDRLRATYESDPALLERFDALSAHGRLVVLAGDLVEPRFGL-ADDLYAHLCVEVDCVVHNGALVNHALSYPQLFEPNVLGTVEAIRLSLA--HRVKSMNYVSTIAAVGGLDRSGPIREDEDIRELWPE--RALGAGYAVGYATSKWASEVLLQDAHDALGLPVNVYRPSGIMAHSLYRSQINVPDFFTRLLCGIVYTGLAPRSFYEGGRPHRAGHYDGLPVDAVARSIAAVAVDRRPPAGDEGERARRA-TYHVVNPHWDDGISLDVIVSWVRSAGYPVERVDDYAAWYAAFRDRLMQLSEPLRRHSPLPILNAWER--PARADGE---VFDAERLLARLRQLAAHGGAGDLATLPHVTEPLIHKYLDDMVALGLIGPAAVRA 1238          
BLAST of NO03G04530 vs. NCBI_GenBank
Match: WP_080682663.1 (hypothetical protein [Sorangium cellulosum])

HSP 1 Score: 702.2 bits (1811), Expect = 2.900e-198
Identity = 480/1293 (37.12%), Positives = 687/1293 (53.13%), Query Frame = 0
Query:   16 NLARLATLLPHDPELQAYQPSPSVQSLVDAAPTTIEIIQQLCQGYAERV--------ALKWCSDGAMTSDYT---YTMTYKEVWEKVQTMATVLSTKGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMD-YGGEEKGQAWVAKLRSLLPASVVRVLTMQELXXXXXXXXXXXXXXXXXXFVSPAIPGAEGWGEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAGE----IPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQ---LKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFS-GSADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGGKEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQ----ANPDASITEQVCAALEMTLGL-GEEVGLDATFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLE--AKGEMKDGGKEDEGGGEEIYERVHGPKQAAPLIFAKDLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARLWEAVGA--SLLDPNEE-----RVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSFTDG--EGQAPHYDGSSVDFVAGAIVAIALE----GGRGGGREKGCRLFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTPLPSRKKMEELLVLDATCFRDAVKRVTKRG------DVPVLSERYLHTCLYHMSVVGLIGATPVVA 1263
            +L R + L+  D EL+   PSP     + +  TTIE +    + YA+R         A    +DG     Y      ++Y ++W +V+  A+ L  +       FV + GF S  WV+  +  +YL AV VPL  ++   DL+ ++ E+    V  S  +   +  ++L +   CPSV  VVVMD   G+  G + + + R  L     R L  +                     +   +P   G   EP+PL+ LMYTSGSTG PKGAM  E+L R  W   F+         +P V L + P+NH +GR  + +S  +GG+ +FV +SDMSTLFED   A+PT   +VPRIA + ++ +Q   L++A  L A G                    R+ER   A +R  + G RLL A +G+AP+   V + ++   ++P+ EGYGSTE   +T +  ++   V ++KL+DVPELGY+  D P PRGEL +R++  +PGYYK+ +AT  L D EG  +TGDI+EQRG   +VWIDR +NVLKL+QGE+V+ SRLE L+S GS  +  I+LYGNS RS++L VVVP   L E+      S+    ++ +   E ++ LLR  +D +A+   L  +E+PRDF+++   FTR + LLT++ K ARARLK  +G RLE +Y  +E  + + L  L+    A P AS    V  ALE TLG+ G E     +FAQLGGDSLSAVRL+  ++   GV VPV  +LNPT++++++ ++LE    GE               ++ VHG    A ++ A DL +++F+    L           AA+            +  LLTGANGFLGRFL                                C+VR+ +DA A  RL     +  +LL+  +      R+V LAGD+ +PR GL  ++ Y  L  +VD V+H GALVNH LSY +LF PNVLGT   ++  L    + K ++++ST+A   G    GP   +  I + W +  R  G+ YA GY  SKWA+EVLLQD +   GLPV  YR S I+ H+    QINV D FTRL+ G++YTG+AP SF +G    +A HYDG  VD VA +I A+A++     G  G R +    +HVVNPH   G SLD IV WVRSAGYPVE V  Y  W+  F+ +L  L    R  SPLP+L+ W+   P+R   E   V DA      ++++   G       +P ++E  +H  L  M  +GLIG   V A
Sbjct:   88 SLERCSRLVQTDEELRRALPSPVALEKIRSCHTTIECVATAFELYADRPCIGHRPLDAAATAADGGSAPRYLPEFRAVSYADMWSRVEAFASGLQHEKLANTGNFVGISGFGSTDWVVAGLACMYLSAVSVPLQTDLSPADLELIVAEAELACVVCSVGQLARIE-DILPR---CPSVRSVVVMDLLEGDRCGHSELERARRAL-----RPLEARGRRLSVRPMHEVERLGRQQGILPKVLPAQRG---EPDPLMTLMYTSGSTGSPKGAMVPESLCRRYWQLAFTRSQDPRLDLLPHVGLNYSPMNHFIGRSQVGRSLMRGGITHFVLKSDMSTLFEDIRLARPTTLFLVPRIAELIHQQFQAEVLRRARALGAGG--------------DDAARRRIEREIMAEMRGSLLGDRLLHATIGSAPTPPEVLSFLKRCFDVPVFEGYGSTEASSLTTDGRLDRELVTEFKLVDVPELGYSATDQPCPRGELHIRSSLMVPGYYKNEKATRALFDEEGLMNTGDIVEQRGPDTVVWIDRARNVLKLSQGEFVATSRLEVLYSAGSPFLQQIFLYGNSTRSYLLAVVVPE--LREI------SAHLRQRDVKPDGEPVRQLLRAEIDRIAREHQLRGYEIPRDFLIEPAPFTRASGLLTETQKPARARLKARYGARLEELYATIERTQLEELRGLREGGGATP-ASAALAVKKALEATLGITGVEPRSARSFAQLGGDSLSAVRLSRLIEEISGVAVPVGLVLNPTSSVRAIADHLEHALAGEAPRRAAR--------FDEVHG--AGAEVVRAADLRLDRFLGPDELA----------AARRSTPAAALPAQARVALLTGANGFLGRFL-----------------ALELLERLPEEGRLYCVVRSPDDALAFDRLRATYESDPALLERFDALSAHGRLVVLAGDLVEPRFGL-ADDLYAHLCVEVDCVVHNGALVNHALSYPQLFEPNVLGTVEAIRLSLA--HRVKSMNYVSTIAAVGGLDRSGPIREDEDIRELWPE--RALGAGYAVGYATSKWASEVLLQDAHDALGLPVNVYRPSGIMAHSLYRSQINVPDFFTRLLCGIVYTGLAPRSFYEGGRPHRAGHYDGLPVDAVARSIAAVAVDRRPPAGDEGERARRA-TYHVVNPHWDDGISLDVIVSWVRSAGYPVERVDDYAAWYAAFRDRLMQLSEPLRRHSPLPILNAWER--PARADGE---VFDAERLLARLRQLAAHGGAGDLATLPHVTEPLIHKYLDDMVALGLIGPAAVRA 1297          
BLAST of NO03G04530 vs. NCBI_GenBank
Match: AHH98121.1 (hypothetical protein KALB_4759 [Kutzneria albida DSM 43870])

HSP 1 Score: 677.9 bits (1748), Expect = 5.900e-191
Identity = 449/1276 (35.19%), Positives = 687/1276 (53.84%), Query Frame = 0
Query:   19 RLATLLPHDPELQAYQPSPSVQSLVDAAPTTI-EIIQQLCQGYAERVALKWCSDGAMTSDYT-----------YTMTYKEVWEKVQTMATVLSTKGW-------LCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYMLDESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMDYGGE--EKGQAWVAKLRSLLPASVVRVLTMQELXXXXXXXXXXXXXXXXXXFVSPAIPGAEGW--GEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAGEIPVVSLGFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMVPRIANMAYETYQLKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAARAYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGITIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKHPEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSRLEALFSGSADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGGKEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLLTDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQ-ANPDASITEQVCAALEMTLGLGE-EVGLDATFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLLNPTATMQSLVEYLEAKGEMKDGGKEDEGGGEEIYERVHGPKQAAPLIFAKDLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGFLGRFLLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCIVRARNDAAAKARLWEAV---GASLL----DPNEERVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLHAGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMGEGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRETGLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSF----TDGEGQAPHYDGSSVDFVAGAIVAIALEGGRGGGREKGCRLFHVVNPHRMKGASLDDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQWQTP-LPSRKKMEELLVLDATCFRDAVK--RVTKRGDVPVLSERYLHTCLYHMSVVGLI 1256
            R A L   D +++A  P  +V     +    + +++  +  GYA+R AL   +   +T   T            T++Y+E+W +V  +A+      W       L   EFV + GF S  +  +D++ L+LGAV VPL  + P   L+ ++ E+G  ++  SAE  ++     LG     P+V  +VV D   E  E+ +A  +  + L  A    V+                           A+P A  +  G + +PL  L+YTSGSTG PKGAMY E L+ +LW  G       +PV+ + ++P++H+ GR+++ ++ + GG  YF  +SD+STLFED A  +PT+  +VPR+ +M ++ YQ   +EL + A          P +S     LD V+   +  +R+   GGR++ A+  TAP SA ++  ++   ++ + +GYGSTE GG+ I+ H+    VL +KL+DVPELGY   D P PRGELL++T T IPGY+K P+AT+E+ DA+G+Y TGDIM + G   LV++DRRKNVLKL+QGE+V+VSRLEA+F+ S  +  +++YG+S R+++L VVVP+E   E  R     ++            +K  + + L  +A+ A L ++E+PRD +++ D F+ EN LL+D+ KL R RLK+ +G RLE +Y  + + +   L +L+    D  + E V  A +  LG    ++  DA F +LGGDSLSA+ L+  L+    VEVPV  +++P   ++ L  Y+E   E+  G K         +  VHG  Q +  + A DL ++KFI +A L      GA D    +G          + VLLTGANG+LGRFL                                CIVR  +  AA+ RL +A     A LL    +   E +  LAGD+ +P LGL+ E+ +  L + VDL++H  ALVNH L Y++LFGPNV+GT  +++  +T   + KP  ++STV V   +  P     ++ I +D +  RR+  S YA GYG SKWA EVLL++ +   GLP V +R  MIL H+  +GQ+NV D+FTRL++ ++ TG+AP SF    +DG  Q  HYDG   +F A AI  +      G     G R F+V+NPH   G SLD +V W+   G+P++ +  YQEWF  F + L ALP  +R    LP++H ++ P +P         V+ A  FR AV+  ++    D+P LS   +   +  +  +GL+
Sbjct:   23 RAAQLRAQDEQVRAAAPLDAVNEATSSPGQRLTQVVAAIMAGYADRPALGERARELVTDPGTGRTSIRLLPWFDTISYRELWTRVGAIAS-----DWHHHPDHPLAAGEFVGILGFTSCDYTTLDLVCLHLGAVCVPLQSSSPASQLRPIIAETGPSILATSAERLDTAVELALGS----PTVRRLVVFDSHPEVDEQREALESARQRLTEAGHPAVVDSLAAVLER----------------GRALPPAPLFTPGPDEDPLTMLIYTSGSTGTPKGAMYPERLVHSLW-DGLWRDKNALPVIGINYMPMSHLAGRISLLRALSSGGTSYFAAKSDLSTLFEDIALIRPTELNLVPRVCDMLFQRYQ---SELDRRA----------PGTSD----LDAVDAQVKQELREGFLGGRVVRAMCSTAPLSAEMAAFVESCLDLELHDGYGSTEAGGVVIDKHVLRPPVLDYKLVDVPELGYFRTDTPHPRGELLIKTRTIIPGYFKRPDATAEIFDADGYYQTGDIMAEIGPDQLVYVDRRKNVLKLSQGEFVAVSRLEAVFATSPLVRQVFVYGSSARAYLLAVVVPTE---EALRRTVTDNAA-----------LKSSISESLQRIAREAELNSYEIPRDLLIETDPFSTENGLLSDARKLLRPRLKEHYGERLEQLYAELAKGQVDELHALRVTGRDRPVLETVTRAAQALLGCASTDLSPDAHFTELGGDSLSALSLSNLLQEIFTVEVPVGVVISPANDLRQLANYVET--ELSSGAK------RPTFATVHG--QGSLEVRAADLTLDKFIDSATL-----AGAKDLPGPSGTA--------RTVLLTGANGYLGRFL----------------CLEWLRRLSQDGGKLVCIVRGSSAEAARRRLEQAFDSGDAELLRLFRELAAEHLEVLAGDIGEPDLGLD-EQTWHRLADSVDLIVHPAALVNHVLPYQQLFGPNVVGTAGLIRMAIT--KRLKPFVYLSTVGVLSAQIAPSALREDLDI-RDTSPVRRLDQS-YASGYGTSKWAGEVLLREAHEAFGLPAVVFRSDMILAHSRYTGQLNVPDMFTRLLLSLVLTGIAPKSFYRTGSDGGRQRAHYDGLPAEFTAEAITEL------GARAAAGYRTFNVLNPHD-DGISLDVLVDWLAETGHPIQRIEDYQEWFARFDTALRALPEKQRQHCLLPLMHAFEQPGVPVAGS-----VIPADEFRAAVRTAKIGPDKDIPHLSASLITKYVRDLEQLGLV 1185          
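The percentages in each HSP header follow directly from the residue counts (matched positions divided by alignment length). A minimal sketch of that arithmetic, assuming header lines shaped like the ones in the alignments above; `parse_hsp_stats` is a hypothetical helper written for illustration and is not part of BLAST or of this database.

```python
import re

# Hypothetical helper, not a BLAST API: extracts "label = matched/total" pairs.
# The pattern tolerates spelling variants of the "Positives" label.
HSP_RE = re.compile(r"(Identity|Pos\w*tives)\s*=\s*(\d+)/(\d+)")

def parse_hsp_stats(line):
    """Recompute percent identity and percent positives from an HSP header line."""
    stats = {}
    for label, matched, total in HSP_RE.findall(line):
        key = "identity" if label == "Identity" else "positives"
        # Percentage = matched positions / alignment length, to two decimals.
        stats[key] = round(100.0 * int(matched) / int(total), 2)
    return stats

header = "Identity = 480/1293 (37.12%), Positives = 687/1293 (53.13%), Query Frame = 0"
print(parse_hsp_stats(header))
```

Recomputing the values this way is a quick consistency check on the reported percentages; it says nothing about the E-value or bit score, which depend on the scoring matrix and database size.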
The following BLAST results are available for this feature:
BLAST of NO03G04530 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name       E-value     Identity (%)  Description
EWM29929.1       0.000e+0    65.66         fatty-acid- ligase fadd9 [Nannochloropsis gaditana...
XP_005651684.1   5.700e-218  38.74         acetyl-CoA synthetase-like protein [Coccomyxa sube...
WP_104977091.1   1.900e-213  38.92         oxidoreductase [Sorangium cellulosum] >AUX39065.1 ...
APR83696.1       1.500e-210  37.58         Long-chain-fatty-acid--CoA ligase [Minicystis rose...
OJY19541.1       5.700e-210  38.38         hypothetical protein BGO98_14400 [Myxococcales bac...
KYG08588.1       2.400e-200  37.51         hypothetical protein BE21_22840 [Sorangium cellulo...
WP_061606881.1   1.000e-198  37.92         hypothetical protein [Sorangium cellulosum] >KYF71...
AGP42038.1       2.900e-198  37.12         hypothetical protein SCE1572_50595 [Sorangium cell...
WP_080682663.1   2.900e-198  37.12         hypothetical protein [Sorangium cellulosum]
AHH98121.1       5.900e-191  35.19         hypothetical protein KALB_4759 [Kutzneria albida D...

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name   Unique Name   Species                                        Type
nonsL069       nonsL069      Nannochloropsis oceanica (N. oceanica IMET1)   syntenic_region
nonsL066       nonsL066      Nannochloropsis oceanica (N. oceanica IMET1)   syntenic_region
ncniR000       ncniR000      Nannochloropsis oceanica (N. oceanica IMET1)   syntenic_region
ngnoR054       ngnoR054      Nannochloropsis oceanica (N. oceanica IMET1)   syntenic_region
ngnoR048       ngnoR048      Nannochloropsis oceanica (N. oceanica IMET1)   syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name   Unique Name   Species                                        Type
NSK005434      NSK005434     Nannochloropsis salina (N. salina CCMP1776)    gene


The following polypeptide feature(s) derive from this gene:

Feature Name   Unique Name            Species                                        Type
NO03G04530.1   NO03G04530.1-protein   Nannochloropsis oceanica (N. oceanica IMET1)   polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name                Unique Name   Species                                           Type
jgi.p|Nanoce1779_2|573015   gene_1657     Nannochloropsis oceanica (N. oceanica CCMP1779)   gene
Naga_100035g43              gene747       Nannochloropsis gaditana (N. gaditana B-31)       gene


The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Species                                        Type
NO03G04530.1   NO03G04530.1   Nannochloropsis oceanica (N. oceanica IMET1)   mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO03G04530 ID=NO03G04530|Name=NO03G04530|organism=Nannochloropsis oceanica|type=gene|length=4346bp
ATGGCATCCTCCGCTTCCACTATCGAATGGACCATGCCAGGACCAAATCT
TGCCCGTCTTGCTACCCTTCTCCCGCATGATCCGGAATTGCAAGCCTACC
AACCGAGTCCATCAGTTCAATCCCTCGTCGACGCCGCGCCCACCACCATC
GAAATCATTCAGCAACTATGCCAAGGGTACGCAGAACGCGTAGCATTAAA
ATGGTGTTCGGACGGAGCAATGACTTCTGACTACACATATACTATGACCT
ATAAAGAAGTATGGGAAAAAGTGCAGACAATGGCAACGGTCTTGAGCACC
AAGGGGTGGCTGTGCAAGAAGGAGTTTGTAGCCGTATGTGGGTTTGCGAG
CCCCGGGTGGGTACTTGTGGACATCCTGAGCTTGTACCTTGGAGCAGTTC
TTGTGCCTTTACCGCTGAATGTCCCTTATGAAGACTTAAAATATATGTTG
GATGAATCAGGGGCGAGGGTCGTCTTCGTCTCTGCGGAGGAGAGCGAGAG
CCTGGCAGGTAACGTGCTTGGGAAGGAGGGGGCATGTCCGTCAGTCGAGA
TGGTGGTGGTGATGGATTATGGGGGTGAGGAGAAAGGACAGGCGTGGGTG
GCGAAGCTGCGGTCTCTGCTGCCAGCGAGTGTGGTGCGGGTCCTGACGAT
GCAGGAGTTGTTGCTGGAGCACGACAAGAAGAGCAGTAGCAGTAGCAATA
GCAGCAGCAGTACCTTTGTGTCGCCGGCGATTCCGGGGGCGGAGGGGTGG
GGGGAGGAGCCGAATCCTTTGCTGGGGTTGATGTATACCTCGGGGAGCAC
CGGACGGCCGAAAGGGGCTATGTACACGGAGACGTTGATGCGGGCTTTGT
GGCACACGGGCTTTTCGTGGGGGGCGGGGGAGATCCCGGTAGTGAGCTTG
GGGTTTCTACCGTTGAATCATATCGgtacgtgctttccgatcttttcgtc
attcctcccttcctcctaccccagccttggctgaacaacgttgacgcatg
cacctcttatccccttcctccttctctccccccctttctgtcagTTGGAA
GAGTTACGATTTACCAGAGCTTCACCAAGGGCGGCGTGGTTTACTTTGTA
AGGCGTTCCGACATGTCGACGTTATTCGAAGACTTCGCTCGGGCCAAGCC
GACCAAGACAATGATGGTGgtaagtgcatgggtgatcggcaagggaggga
gggggggagggagagctagagtattaagcgaacatgtgagccgcgcacct
acctaggccgatccatgcatacctgttgcttatatctcccgtctttcccc
ttcttttctcacctccctcactttttacagCCCCGCATTGCCAATATGGC
ATACGAAACCTATCAACTCAAGAAAGCCGAACTACTACAAGCCGCCGGCC
TCTCCCCCACCTCTTCTTGCTCCTCCCCGTACTCCTCTTCACAACAGCAG
CTGCTCGATAGAGTTGAACGCGCTGCTCGTGCGTACATCCGCGACAAGAT
ATTCGGAGGCCGTCTCCTCTTTGCGATAGTCGGGACGGCTCCTTCGTCTG
CGGCGGTGTCGAACTTGATCCAAGAGGCGTGTGAGATGCCCATGGTGGAA
GGATACGGGTCTACAGAGgtgaagacgagaaaaattgggagagagggagg
gaggagggctgcggacatgataacgttccgagagcaagctcacacggata
agcaccatctcttcgtcctttcctcttttcctcctttcacagCTCGGCGG
CATCACTATCGAGAACCACATCAACGACGCCACGGTCCTCAAATGGAAAC
TCATTGACGTACCCGAGCTAGGCTACACGCTTAAAGACCATCCCTTTCCT
AGGGGGGAGCTCCTCGTCCGAACCACCACAGCCATCCCCGGGTACTACAA
ACACCCGGAGGCCACAAGTGAGTTAATTGATGCAGAAGGTTTTTATTCCA
CGGGGGACATCATGGAGCAGAGAGGGGAGAGGATACTGGTATGGATCGAT
CGGAGGAAGAATGTGTTGAAATTGGCGCAAGGGGAGTATGTGAGTGTAAG
TCGATTGGAAGCTCTGTTTTCGGGGAGTGCCGATATTTCGAATATTTACT
TGTACGGGAATAGTCTGAGGAGCTTCATGTTGGGTGTAGTTGTACCCTCG
GAGGGATTGGTGGAGATGGCGAGGGAGGGATGGAGGAGTAGTAGTGGGGG
CGGCAAAGAAAAGCAGGAACTCGCGGAAAGGATTAAGCCGTTGTTGCGGC
AACGGTTGGATGCTGTGGCGAAAGCGGCGGGGTTGGCTGCTCATGAGGTG
CCGAGGGATTTCATCGTGCAACTGGATGGCTTTACACGGGAAAACAGCCT
CTTGACAGATAGCAATAAACTGGCTCGAGCAAGGCTTAAGCAAGgtgagt
cccctcctcattccttttagactatcgctgggtactgcaaagcttccacc
cagcaaatctgcgtcttatcagccactaatcctccgtccctcccaatctc
cctccctccacatcatcatcccggagCGTTTGGTCCAAGGCTCGAGGCTA
TGTACACGGCAGTGGAAGAACGCAAGGAGAAAAGACTTGCGTCCCTTCAG
GCCAATCCTGACGCGTCCATCACTGAACAGGTCTGCGCGGCGTTAGAAAT
GACCTTGGGATTAGGGGAGGAGGTGGGTTTGGATGCAACTTTTGCGCAGT
TGGGAGGGGATTCGTTGAGTGCCGTGCGATTGACGGAGCATTTGAAGCGA
TCGTGTGGGGTAGAGGTGCCCGTTGCCAAGTTGCTCAATCCAACAGCAAC
GATGCAGTCTTTAGTAGAGTATTTGGAGGCAAAGGGGGAAATGAAGGATG
GGGGGAAGGAGGATGAAGGAGGGGGAGAGGAGATATATGAACGAGTGCAT
GGTCCTAAGCAAGCAGCCCCTTTAATTTTTGCCAAGGACCTGATGGTGGA
GAAGTTCATTCCAGCGGCTGTGCTGCATCCACCCGGTACGGGGGGTGCAA
TGGATTGGGCCGCGAAGGCAGGGGAGGGCTTGGGGGAGGGAGGAAAAGGG
ATGAAGGGTGTATTGCTGACGGGCGCCAACGGCTTCCTGGGACGGTTTTT
GTTGTTGGAGCTACTGGGTAAGCTGCAGGGGGAAAATGAGGGAAACAGGA
ACAGCAGTCGCAGCACCAGCGTGAGCACCACCATCACCACGATCTATTGC
ATAGTACGTGCCCGTAATGATGCCGCGGCTAAAGCCCGATTATGGGAGGC
AGTGGGTGCTTCGCTGTTAGATCCAAACGAGGAACGGGTCGTCGCCTTGG
CTGGAGATGTGAGTCAGCCACGCCTAGGATTGAATTGCGAAGAAGCCTAT
GTGTCTTTGACGGAAAAAGTAGACTTGGTGTTACATGCAGGGGCGTTGGT
CAACCACAATTTATCATACCGCGAGCTCTTTGGTCCGAACGTATTGGGTA
CGACAAACATCATGCAATTCTGCCTGACCCCCCCCTCGAAACCAAAGCCC
CTTCATTTTATATCCACGGTAGCCGTGGCCATGGGGGAGGGCGGACCCGG
TCCGCATGTCACGGAAGTTCATATTGGGAAGGACTGGGCAAAGGAGAGGA
GGATCGGCGGAAGCGCGTATGCGGAGGGATATGGAGCGTCGAAATGGGCT
GCAGAGGTTTTGTTGCAGGATTTGAATAGGGAAACGGGATTGCCGGTGGT
TGCTTACAGGTGTAGTATGATTCTGCCGCACACCTCGTTGTCGGGACAGA
TTAATGTGTCTGATATTTTTACTCGACTGATAGTGGGCATGATATATACC
GGGGTGGCTCCGCTGAGCTTCACAGATGGAGAGGGGCAAGCGCCGCATTA
TGATGGTTCGTCTGTTGATTTTGTGGCGGGGGCCATAGTGGCGATTGCAC
TGGAGGGGGGGAGAGGTGGAGGGAGGGAGAAGGGATGTCGGCTTTTTCAT
GTGGTGAACCCGCATAGGATGAAGGGGGCATCGCTTGATGATATTGTTGG
GTGGGTTCGGTCAGCCGGGTATCCTGTGGAGGGAGTTGCTCCTTACCAGG
AGTGGTTTAATTTGTTCAAAAGCAAGTTGGAAGCCTTGCCGTCTGATAAA
CGTGCTCAGAGCCCTCTTCCTGTTTTACACCAATGGCAGACGCCCTTGCC
TTCGCGGAAGAAGATGGAGGAGCTGTTGGTGTTGGACGCGACTTGCTTTA
GAGATGCAGTGAAGAGGGTGACGAAAAGGGGAGACGTTCCGGTGTTGAGT
GAACGGTATTTGCACACATGTCTGTATCATATGAGTGTAGTGGGTTTGAT
TGGGGCGACACCGGTGGTGGCTGAGGCGGCGGTATGGGAAAGGTAG
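The gene sequence above mixes uppercase and lowercase runs. A common convention, assumed here rather than stated on this page, is that uppercase marks exonic sequence and lowercase marks introns. Under that assumption, the following sketch separates the two and sums the exon length; `split_by_case` and `coding_length` are hypothetical names used only for illustration, and the input is a toy string rather than the real record.

```python
import re

def split_by_case(seq):
    """Split a mixed-case sequence into uppercase runs and lowercase runs.

    Assumption: uppercase = exon, lowercase = intron (not confirmed by the page).
    """
    exons = re.findall(r"[A-Z]+", seq)
    introns = re.findall(r"[a-z]+", seq)
    return exons, introns

def coding_length(seq):
    """Total length of the uppercase (assumed exonic) portion."""
    exons, _ = split_by_case(seq)
    return sum(len(e) for e in exons)

# Toy example: two short uppercase runs around one lowercase run.
toy = "ATGGCAgtaagtcagTTGTAG"
exons, introns = split_by_case(toy)
print(exons, introns, coding_length(toy))
```

To apply this to the record, join the sequence lines (without the `>` header line) into one string before calling the functions.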

protein sequence of NO03G04530.1

>NO03G04530.1-protein ID=NO03G04530.1-protein|Name=NO03G04530.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1270bp
MASSASTIEWTMPGPNLARLATLLPHDPELQAYQPSPSVQSLVDAAPTTI
EIIQQLCQGYAERVALKWCSDGAMTSDYTYTMTYKEVWEKVQTMATVLST
KGWLCKKEFVAVCGFASPGWVLVDILSLYLGAVLVPLPLNVPYEDLKYML
DESGARVVFVSAEESESLAGNVLGKEGACPSVEMVVVMDYGGEEKGQAWV
AKLRSLLPASVVRVLTMQELLLEHDKKSSSSSNSSSSTFVSPAIPGAEGW
GEEPNPLLGLMYTSGSTGRPKGAMYTETLMRALWHTGFSWGAGEIPVVSL
GFLPLNHIVGRVTIYQSFTKGGVVYFVRRSDMSTLFEDFARAKPTKTMMV
PRIANMAYETYQLKKAELLQAAGLSPTSSCSSPYSSSQQQLLDRVERAAR
AYIRDKIFGGRLLFAIVGTAPSSAAVSNLIQEACEMPMVEGYGSTELGGI
TIENHINDATVLKWKLIDVPELGYTLKDHPFPRGELLVRTTTAIPGYYKH
PEATSELIDAEGFYSTGDIMEQRGERILVWIDRRKNVLKLAQGEYVSVSR
LEALFSGSADISNIYLYGNSLRSFMLGVVVPSEGLVEMAREGWRSSSGGG
KEKQELAERIKPLLRQRLDAVAKAAGLAAHEVPRDFIVQLDGFTRENSLL
TDSNKLARARLKQAFGPRLEAMYTAVEERKEKRLASLQANPDASITEQVC
AALEMTLGLGEEVGLDATFAQLGGDSLSAVRLTEHLKRSCGVEVPVAKLL
NPTATMQSLVEYLEAKGEMKDGGKEDEGGGEEIYERVHGPKQAAPLIFAK
DLMVEKFIPAAVLHPPGTGGAMDWAAKAGEGLGEGGKGMKGVLLTGANGF
LGRFLLLELLGKLQGENEGNRNSSRSTSVSTTITTIYCIVRARNDAAAKA
RLWEAVGASLLDPNEERVVALAGDVSQPRLGLNCEEAYVSLTEKVDLVLH
AGALVNHNLSYRELFGPNVLGTTNIMQFCLTPPSKPKPLHFISTVAVAMG
EGGPGPHVTEVHIGKDWAKERRIGGSAYAEGYGASKWAAEVLLQDLNRET
GLPVVAYRCSMILPHTSLSGQINVSDIFTRLIVGMIYTGVAPLSFTDGEG
QAPHYDGSSVDFVAGAIVAIALEGGRGGGREKGCRLFHVVNPHRMKGASL
DDIVGWVRSAGYPVEGVAPYQEWFNLFKSKLEALPSDKRAQSPLPVLHQW
QTPLPSRKKMEELLVLDATCFRDAVKRVTKRGDVPVLSERYLHTCLYHMS
VVGLIGATPVVAEAAVWER*
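The FASTA headers on this page carry their metadata as `key=value` pairs separated by `|` after the record identifier. A minimal sketch of reading one such record follows, assuming exactly that layout; `parse_header` and `read_fasta_record` are hypothetical helpers, and the trailing `*` is treated as a stop symbol to strip, which is an assumption about how the sequence was exported.

```python
def parse_header(header):
    """Split '>id key=value|key=value...' into the identifier and a field dict."""
    ident, _, rest = header.lstrip(">").partition(" ")
    fields = dict(item.split("=", 1) for item in rest.split("|") if "=" in item)
    return ident, fields

def read_fasta_record(text):
    """Parse a single FASTA record; returns (identifier, fields, sequence)."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    ident, fields = parse_header(lines[0])
    # Join wrapped sequence lines and drop a trailing '*' (assumed stop symbol).
    sequence = "".join(lines[1:]).rstrip("*")
    return ident, fields, sequence

# Toy record in the same layout as the headers above.
record = ">demo-protein ID=demo-protein|Name=demo|type=polypeptide\nMASSA\nSTIEW*\n"
ident, fields, seq = read_fasta_record(record)
print(ident, fields["type"], len(seq))
```

Counting residues from the parsed sequence, rather than trusting the `length` field in the header, is a simple way to cross-check a record before using it downstream.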
Synonyms
Publications