NO12G00440, NO12G00440 (gene) Nannochloropsis oceanica

Overview
NameNO12G00440
Unique NameNO12G00440
Typegene
OrganismNannochloropsis oceanica (N. oceanica IMET1)
Sequence length6783
Alignment locationchr12:153059..159841 +

Link to JBrowse

Properties
Property NameValue
Descriptioncycloartenol synthase 1
Mutants
Expression

Hover the mouse over a column in the graph to view expression values.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
chr12genomechr12:153059..159841 +
Analyses
This gene is derived from or has results from the following analyses
Analysis NameDate Performed
GSE1786722023-12-15
PRJNA9336932023-10-26
PRJNA9434492023-07-19
PXD0166992021-01-08
GSE1499042020-10-10
NoIMET1_WT_HS2020-09-29
PXD0100302020-03-16
GSE1396152020-03-11
PRJNA2413822020-03-11
BLASTP analysis of N. oceanica IMET1 genes2019-07-11
InterPro analysis for N. oceanica IMET1 genes2019-07-11
GO annotation for IMET1v2 genes2019-07-10
PRJNA1821802019-04-25
Gene prediction for N. oceanica IMET12017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR008930Terpenoid_cyclase/PrenylTrfase
IPR032696SQ_cyclase_C
IPR032697SQ_cyclase_N
Vocabulary: Molecular Function
TermDefinition
GO:0003824catalytic activity
Homology
BLAST of NO12G00440 vs. NCBI_GenBank
Match: GAX18680.1 (cycloartenol synthase [Fistulifera solaris])

HSP 1 Score: 851.7 bits (2199), Expect = 2.100e-243
Identity = 432/845 (51.12%), Postives = 546/845 (64.62%), Query Frame = 0
Query:   85 TGLCVLGATIYSTF-----YARP-TSYPGGPRTHRQDRRASWTKRVTTLPAG---WVFSHAEESHGVTQRHYQNMLGGEVPLTGEAAGRQMW----------YYDGKMAQALARVGKKNAEYSFSAAVNPNSADKVFRTQQISKWKGA---MPDPKLRPKTAGEAARKGMSYYQMLQCEDGHWAGDYGGPMFLMPGLIFACYI---TKTPLGKEREEAMQAYLKNHQQEDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRF-SADAAADPLLLALRHELYVTPYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYERAPWL-------WLRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVF--AEGGKEAEAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVEAFPQAMRKAYSYLDRTQIKADE----------DEASY---WFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVVE----GGVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNHRSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQTLGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGR 878
            + L +L  +IY  F     Y  P  S PGGPR HR  R+  W  R   LP     W    AEESH +T+  +     GE  L+GE  GR +W            D K    LA  G+ +    F+ +VNPNSAD  +R Q I ++  A    P  +  P++   A RK + +Y MLQ +DGHW GDYGGP FLMPGL+ A YI    K+ L  E  E M+ Y++ HQQ DGGWG H+ESPS+ FG+V+ Y++LRL+G    DP C     FI+ HGGA+M  SW KF+L + G   W+G NSVP EMWLLP WFPFHPGR+WCH RMVYLPM YLY  R+   +A  DPL+  LR ELYV PY  I W   RH  + +D Y P+S LM+  QN L+ YE   W         +R +G      Y+ AED QTN++DIGPVNK LNM+  +  A+G  E  + KRH+ R+ DY+W+AEDGMKM+GYNGSQCWDTSF VQ + +  L++ FP   ++ ++YL+R QI + E          + A+Y   ++RHISKGGWPFST+AHGWPI+DCT EGLKGVL L+ +  V+E    G +  I   RLEDA +ILL+YQN DGG+ATYENNRG+GW+E LNPSEVFG IMIDYSY E + A +TA+  F++ FP+HRS EI  AI  G  F++S+QRPDGSWYGSW  CF YG WFGIEGL+  GE P TS +I++A  FLL HQ+ NGGWGE + SC DK Y  +G  +  G+ GSGVV TAWA+L L   +C+D EA+ RGVR+LM +Q   GDW QEGI GVFNR CGITY+ YRN+FP+WALGR
Sbjct:  108 SSLFLLSLSIYLVFLLVTQYLLPAVSDPGGPRPHRIQRQPQWDNRPIPLPGSMKQWRCVIAEESHAITEATF-----GESLLSGEPFGRDIWTQKPYNSKLKAVDEKFVHELASGGRTDI---FNPSVNPNSADIPYRAQLIREYLAAGNSPPSLEKEPQSVQAALRKAVHFYSMLQRDDGHWGGDYGGPHFLMPGLVVAWYIMGKPKSMLDDESIELMKHYIRCHQQSDGGWGTHLESPSTQFGSVLMYLALRLMGADKEDPACQKGLEFIRLHGGALMTASWAKFYLCLAGCMHWDGHNSVPPEMWLLPNWFPFHPGRMWCHARMVYLPMGYLYGSRYVYTEAETDPLIKELREELYVQPYESIDWIKTRHMVAPMDNYSPISKLMETAQNLLARYE--TWAIFQPFKNLVRPRGIKLCAEYMAAEDLQTNFIDIGPVNKVLNMISAYDVAKGNLEDHSVKRHIARVADYMWVAEDGMKMKGYNGSQCWDTSFAVQGIYEADLMDEFPDVSKRVWAYLERCQILSTETSMATEAYKYESAAYRARFYRHISKGGWPFSTSAHGWPISDCTGEGLKGVLCLLKSNAVLEGIENGTLKAISTKRLEDAANILLTYQNEDGGFATYENNRGFGWYEDLNPSEVFGSIMIDYSYCECSMASLTALADFHETFPDHRSDEIRHAIDKGRDFLKSLQRPDGSWYGSWACCFCYGVWFGIEGLVKCGE-PKTSQAIKKACQFLLHHQRRNGGWGEDFTSCYDKDYAANGM-EAYGDDGSGVVNTAWALLALSVAECDDIEAIRRGVRYLMKRQLPCGDWPQEGIAGVFNRACGITYTAYRNVFPIWALGR 940          
BLAST of NO12G00440 vs. NCBI_GenBank
Match: GAX17250.1 (cycloartenol synthase [Fistulifera solaris])

HSP 1 Score: 821.6 bits (2121), Expect = 2.300e-234
Identity = 413/821 (50.30%), Postives = 525/821 (63.95%), Query Frame = 0
Query:   98 FYARPTSYPGGPRTHRQDRRASWTKRVTTLPAG---WVFSHAEESHGVTQRHYQNMLGGEVPLTGEAAG-----RQMWYYDGKMAQALARVGKKNAEYSFSAAVNPNSADKVFRTQQISKWKGA---MPDPKLRPKTAGEAARKGMSYYQMLQCEDGHWAGDYGGPMFLMPGLIFACYI---TKTPLGKEREEAMQAYLKNHQQEDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRF-SADAAADPLLLALRHELYVTPYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYERAPWL-------WLRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVF--AEGGKEAEAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVEAFPQAMRKAYSYLDRTQIKADE----------DEASY---WFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVVE----GGVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNHRSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQTLGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGR 878
            ++    S PGGPR HR  R+  W  R   LP     W    AEESH +T+  +   L                  ++   D K+   LA  G+ +   SF+ +VNPNSAD  +R Q I ++  A    P  K  P++   A RK + YY MLQ +DGHW GDYGGP FLMPGL+ A YI    K+ L  E  E M+ Y++ HQQ DGGWG H+ESPS+ FG+V+ Y++LRL+G    DPTC     FI+ HGGA+M  SW KF+L + G   W+G NSVP EMWLLP WFPFHPGR+WCH RMVYLPM YLY  R+   +A  D L+  LR ELY+ PY  I W   RH  + +D Y P+S LM+  QN L+ YE   W         +R +G  F   Y+ AED QTN++DIGPVNK LNM+  +  A G  E    KRH+ R+ DY+W+AEDGMKM+GYNGSQCWDTSF VQ + + GL++ FP   ++ ++YL+R QI + E          + A+Y   ++RHISKGGWPFST+AHGWPI+DCT EGLKGVL L+ +  V+E    G +  I   RLEDA +ILL+YQN DGG+ATYENNRG+GW+E LNPSEVFG IMIDYSY E + A +TA+ +F++ FP+HRS EI  AI  G  F++S+QRPDGSWYGSW  CF YG WFGIEGL+  GE P TS S+++A              GE + SC DK Y  +G  +  G+ GSGVV TAWA+L L A +C+D EA+ +GVR+LM +Q   GDW QEGI GVFNR CGITY+ YRN+FP+WALGR
Sbjct:  127 YFLPAVSDPGGPRPHRIQRQPQWENRPIPLPESMKQWRCVIAEESHAITEEAFGESLXXXXXXXXXXXXXXXXKSKLKAVDEKLVHELASGGRSD---SFNPSVNPNSADIPYRAQLIREYLAAGNSPPSLKKEPQSVQAALRKAVHYYSMLQRDDGHWGGDYGGPHFLMPGLVVAWYIMGKPKSMLDDESIELMKHYIRCHQQSDGGWGTHLESPSTQFGSVLMYLALRLMGADKEDPTCQKGLHFIRLHGGALMTASWAKFYLCLAGCMHWDGHNSVPPEMWLLPNWFPFHPGRMWCHARMVYLPMGYLYGSRYVYTEAETDTLIKDLRQELYIQPYESIDWIKTRHMVAPMDNYSPISKLMETAQNLLARYE--TWTVFQPLKNLVRPRGIKFCAEYMAAEDLQTNFIDIGPVNKVLNMISAYDVANGNLEDHNVKRHIARVADYMWVAEDGMKMKGYNGSQCWDTSFAVQGIYEAGLMDEFPDVSKRVWAYLERCQILSTETSMATEAYKYESAAYRARFYRHISKGGWPFSTSAHGWPISDCTGEGLKGVLCLLKSKAVLEGIENGTLKAISTRRLEDAANILLTYQNEDGGFATYENNRGFGWYEDLNPSEVFGSIMIDYSYCECSMASLTALVEFHETFPDHRSDEICHAIDKGRDFLKSLQRPDGSWYGSWACCFCYGVWFGIEGLVKCGE-PKTSQSVKKACQXXXXXXXXXXXXGEDFTSCYDKDYAANGM-EVYGDDGSGVVNTAWALLALSAAECDDMEAIRKGVRYLMKRQLPCGDWPQEGIAGVFNRACGITYTAYRNVFPIWALGR 940          
BLAST of NO12G00440 vs. NCBI_GenBank
Match: OLP86082.1 (Cycloartenol synthase [Symbiodinium microadriaticum])

HSP 1 Score: 814.7 bits (2103), Expect = 2.800e-232
Identity = 403/820 (49.15%), Postives = 529/820 (64.51%), Query Frame = 0
Query:   81 PLAFTGLCVLGATIYSTFY---ARPTSYPGG--PRTHRQDRRASWTKRVTT--LPAGWVFSHAEESHGVTQRHYQNMLGGEVPLTGEAAGRQMWYYDGKMAQALARVGKKNAEYSFSAAVNPNSADKVFRTQQISKWKGAMPDPKLRPKTAGEAARKGMSYYQMLQCEDGHWAGDYGGPMFLMPGLIFACYIT---KTPLGKEREEAMQAYLKNHQQEDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRFSADAAADPLLLALRHELY-VTPYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYER-APWLWLRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVFAEGG--------KEAEAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVEAFPQAMRKAYSYLDRTQIKADEDEASYWFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVVEGGVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNHRSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAY-PEDGTGQTLG-EGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGRY 879
            P A     V GA + S ++    RP SYPG   P T R  R  +  K V +  +  GW     EE+        +N L     L  E +G+Q+W + G     +           F    NPNSAD +FR  +++ W G  PD   +P     A  KG  +Y+MLQC+DGHWAGDYGGP FL+PG + A YIT   KT       +A+QAYL NHQQ+DGGWG HIESPS+MFGTV++YV+LRL G+ A DP C     F+Q+HGGA+ APSW KFWLA LG+++W+GI  VP EMW+LP WFP HPGR WCHCRMVYLPMC+LY RRF+ +AA DP+  ALR ELY    Y +I W    H+C+D+D Y P+  +M+ +Q+ L  YER  PW WLRK   DFA+ Y+HAED +TNY+ IGPV+K  ++L  +   G         E+  F+ H+ R+  Y+W+AEDGMK+QGYNGS  WDT+F VQA+ + G+VE++     +A+ +L + Q+++        FR   +GGW FSTA   WP++D TAE  K VL L   P +   G   +P   L D++  LLSYQN DGGWATYENNRGW W+ELLNPSEVFGDIMIDYSYVE +S+ M A+  F+++FP+HR+ EI R+IA GARFI  +QR DGSW+G WG CFTYG WFGIEGL+ AG  P T  +I++ + FLL  Q  +GGWGE + SC ++ Y P +   +  G + GS VVQTAWA+L L+A  C++ +A++RGV  L+ +Q   GDW QE I GVFNR+ GITY+ +RN+FP+WAL R+
Sbjct:    5 PGAVETAAVAGAVLASVWFLRRGRPASYPGDAVPETSRL-RADAKLKPVASRIVEGGW--RPKEEAQP------KNEL-----LPHEVSGQQVWEFVGSTETDME---------PFRPEANPNSADCIFRRAKLNAWTGPKPD-TAQPADVNAALHKGFEFYRMLQCDDGHWAGDYGGPHFLLPGFVIAAYITGRLKTMYPDSHCQAIQAYLLNHQQQDGGWGSHIESPSTMFGTVLNYVALRLAGVDADDPACKKGREFMQQHGGALYAPSWAKFWLACLGVYDWDGIAPVPPEMWMLPSWFPLHPGRFWCHCRMVYLPMCWLYARRFTFNAAQDPVAAALRSELYGGQKYSEIPWRKHVHSCADIDNYSPIHPVMRLMQDVLLVYERFGPWQWLRKVSCDFALEYMHAEDLETNYLTIGPVSKAFHILVSWVAAGGETKPKEASESRPFQAHLARVPAYMWVAEDGMKVQGYNGSMAWDTAFAVQAVVEAGMVESYQDMSTRAWGWLVKEQVRSLPHGDWRHFRQPIQGGWGFSTAEQAWPVSDTTAEAFKAVLLLRKVPSIRSAG-RAMPDEHLCDSVRFLLSYQNNDGGWATYENNRGWKWYELLNPSEVFGDIMIDYSYVECSSSAMQALAYFSEQFPDHRAGEIRRSIARGARFIEEMQRDDGSWFGCWGNCFTYGCWFGIEGLLIAGRKP-TCKAIQKCVKFLLGKQNPDGGWGEDFASCFNREYAPRE---KLYGCDSGSTVVQTAWALLALMAADCQESKAIQRGVALLLKRQLASGDWAQENIAGVFNRSIGITYTAFRNVFPIWALARF 795          
BLAST of NO12G00440 vs. NCBI_GenBank
Match: EJK52279.1 (hypothetical protein THAOC_28460 [Thalassiosira oceanica])

HSP 1 Score: 813.9 bits (2101), Expect = 4.800e-232
Identity = 397/751 (52.86%), Postives = 508/751 (67.64%), Query Frame = 0
Query:  173 MAQALARVGKKNAEYSFSAAVNPNSADKVFRTQQISKW----KGAMPDPKL----------RPKTAGEAARKGMSYYQMLQCEDGHWAGDYGGPMFLMPGLIFACYITKTP---LGKEREEAMQAYLKNHQQEDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRF-SADAAADPLLLALRHELYVTPYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYERAPWLW---------LRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVFAEGGKEA--EAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVEAFPQAMRKAYSYLDRTQIKADEDEAS-------------YWFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVV----EGGVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNHRSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQTLGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGR 878
            + ++LA  G+     +F  + NPNS D+ FR+Q I+++     G +P+             +P  A +AA++G+++Y MLQ  DGHWAGDYGGP FL+PGL+ A YI   P   +  E    M  YL+ HQQEDGGWG HIESPS+MFGT M Y+++RLLG    D        FI+  GGA+M  SW KFWL ++G  +W+G NSVP EMWLLP WFPFHPGRLWCHCRMVYLPM YLY  RF  +DA  DPL+  LR ELY   Y  I W+  RH  +++D Y P+   M F QN LS YE     W         +RKKG  F   Y+ AEDQQTN++DIGPVNK LN++  F   G +   +A + H+ R+ DYLW+AEDGMK QGYNGSQCWDTSF +QA+ +  L++ FP    K ++YL+R+QI + E   +              ++RH+SKGGWPFST+AHGWPI+DCT EGLKGVLALMD+P V+    +G +  I PSR+ DA++++L+ QN DGGWATYENNRG+GW+E LNPSEVFGDIMIDYSYVE + A +TA+ +F+++FP HRS EI  +I  G  F++SIQR DGSWYGSW  CF YG WFGIEGL  AGE   +S +IR+   FLL+ Q+ NGGWGE + SC DK Y   G  +  G+ GSGVV TAWA+L L A KC+D  A+ RGV++L+ +Q + GDW QEGI+GVFNR CGITY+ YRN+FP+WALGR
Sbjct:  459 LVESLASGGR--TPMAFDPSKNPNSCDQPFRSQMITQFLEKNGGRLPEELSYLHDENGTVPKPTCAIDAAKRGVAFYSMLQTTDGHWAGDYGGPHFLLPGLVVAWYILGRPSNMISAEHGALMLHYLRVHQQEDGGWGTHIESPSTMFGTTMIYLAVRLLGGNKHDEWVKRGREFIKNEGGAIMTSSWAKFWLCLVGCMDWKGHNSVPPEMWLLPNWFPFHPGRLWCHCRMVYLPMGYLYGYRFVYSDAETDPLIAELREELYCEHYDSIQWESTRHLVAEMDNYSPIPAFMIFAQNILSLYEN----WSIFRPFRDAVRKKGLTFCAEYMKAEDQQTNFIDIGPVNKALNLVAAFHAAGSDVNHQAVQSHIMRVPDYLWVAEDGMKAQGYNGSQCWDTSFAIQAIWECKLLDHFPLLSSKVWAYLERSQILSTETSKASPAYQYETPTSRERFYRHVSKGGWPFSTSAHGWPISDCTGEGLKGVLALMDSPVVMAAVGKGVLKSIEPSRIHDAVNVMLTLQNEDGGWATYENNRGFGWYEELNPSEVFGDIMIDYSYVECSMASLTALAEFHEKFPCHRSDEIKLSICRGKEFMKSIQREDGSWYGSWACCFCYGCWFGIEGLTKAGES-HSSATIRKCCQFLLSKQRPNGGWGEDFTSCYDKDYARKGM-ECYGDEGSGVVSTAWALLALSAAKCDDVNAVRRGVQYLIDRQLDCGDWPQEGISGVFNRACGITYTAYRNVFPIWALGR 1201          
BLAST of NO12G00440 vs. NCBI_GenBank
Match: XP_002287432.1 (cycloartenol synthase;-2,3-epoxysqualene mutase-like protein [Thalassiosira pseudonana CCMP1335] >EED94875.1 cycloartenol synthase;-2,3-epoxysqualene mutase-like protein [Thalassiosira pseudonana CCMP1335])

HSP 1 Score: 800.0 bits (2065), Expect = 7.200e-228
Identity = 379/680 (55.74%), Postives = 482/680 (70.88%), Query Frame = 0
Query:  226 EAARKGMSYYQMLQCEDGHWAGDYGGPMFLMPGLIFACYITKTP---LGKEREEAMQAYLKNHQQEDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRFS-ADAAADPLLLALRHELYVTPYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYER----APWL-WLRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVFAEGGKEAEAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVEAFPQAMRKAYSYLDRTQIKADEDEAS-------------YWFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVVE----GGVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNHRSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQTLGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGRYA 880
            E+AR+G+++Y +LQ  DGH+AGDYGGP FL+PGL+ A Y+   P   +   ++  M  YL+ HQQ DGGWG HIESPS+MFGTV+ Y++ RLLG    D        FIQ+ GGAVM  SW KFWL ++G  +W+G NSVP EMWLLP WFPFHPGRLWCHCRMVYLPM YLY  RF+  DA  D L+  LR ELY   Y  I WD  RH  + +D Y P+  LMK  QN LS YE     +P+   +RK G  + + Y+ AED QTN++DIGPVNK LNM+  F          + H+ R+ DYLW+AEDGMKMQGYNGSQCWDTSF +QA+ + GL++ FP    K ++YL+RTQI + E   S              ++RH+SKGGWPFST+AHGWPI+DCT EGLKGVLALMD+  + +    G +  I P+RL DA++++L+ QN DGGWATYENNRG+GW+E LNPSEVFGDIMIDYSYVE + A +TA+ +F+++FP+HR+KE+T AI  G  F++SIQR DGSWYGSW  CF YG WFG+EGLI  GE P +S +I++   FLL+HQ+ NGGWGE + SC DK Y E+G  ++ G+ GSGVV TAWA++ L A  C + +A+ RGV++LM +Q E GDW QEGI+GVFNR CGITY+ YRN+FP+WALGR A
Sbjct:    2 ESARRGIAFYSLLQTSDGHFAGDYGGPHFLLPGLVVAWYVMGRPAVMISPAQQALMLHYLRVHQQADGGWGTHIESPSTMFGTVVCYLAARLLGAKKDDEWIKEGRDFIQKEGGAVMTSSWAKFWLCLVGCMDWKGHNSVPPEMWLLPNWFPFHPGRLWCHCRMVYLPMGYLYGTRFTYFDAETDLLIQELREELYCESYETIEWDKTRHLVAPMDNYSPIPVLMKVAQNFLSLYENWGIFSPFRNAVRKAGLKYCLEYMRAEDLQTNFIDIGPVNKALNMVSAF--------HVRSHMMRVPDYLWVAEDGMKMQGYNGSQCWDTSFAIQAVWECGLLDKFPIMSAKVWAYLERTQILSTETSQSSPAYAYESCENRDKFYRHVSKGGWPFSTSAHGWPISDCTGEGLKGVLALMDSHVITDSVKKGVLKNIDPTRLYDAVNVILTLQNEDGGWATYENNRGFGWYEELNPSEVFGDIMIDYSYVECSMASLTALAEFHEKFPHHRTKEVTFAIRRGGEFVKSIQREDGSWYGSWACCFCYGCWFGVEGLIKTGE-PTSSSAIQKCCEFLLSHQRPNGGWGEDFTSCYDKDYAENGM-KSYGDDGSGVVNTAWALMALSAANCNNVDAIRRGVQYLMKRQLESGDWPQEGISGVFNRACGITYTAYRNVFPIWALGRCA 671          
BLAST of NO12G00440 vs. NCBI_GenBank
Match: XP_002185678.1 (acetyl-coenzyme A synthetase, partial [Phaeodactylum tricornutum CCAP 1055/1] >ACI65148.1 acetyl-coenzyme A synthetase, partial [Phaeodactylum tricornutum CCAP 1055/1])

HSP 1 Score: 782.7 bits (2020), Expect = 1.200e-222
Identity = 376/680 (55.29%), Postives = 471/680 (69.26%), Query Frame = 0
Query:  226 EAARKGMSYYQMLQCEDGHWAGDYGGPMFLMPGLIFACYITKTP---LGKEREEAMQAYLKNHQQEDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRFSAD-AAADPLLLALRHELYVTPYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYERAPWLW-----LRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVFAEGGKEA--EAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVEAFPQAMRKAYSYLDRTQIKADE----------DEASY---WFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVVE----GGVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNHRSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQTLGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGR 878
            EA RK   +Y MLQ  DGH++GDYGGP FLMPGLI   Y+   P   L   +   M+ YL  HQQ DGGWG H+ESPS+MFGT +SYV+LRLLG+ A +P C    AFI+E GGAVM  SW K +L +LG  EW+G NSVP E+WLLP WFPFHP R+WCH RMVYLPM Y+Y  R   D A  DPL+ ALR ELY  PY+ I W   RH  + +D Y PV+ +MK +QN L+ YE  P L      +RK G  F + Y+ AED QTN++DIGPVNK LNML  F   G +        H+ R+ DYLW+AEDGMKM+GYNGSQCWDTSF +QA+ + GL++ FP+   K ++YL+R QI + E          + A Y   ++RHIS+GGWPFST+AHGWPI+DCT EGLKGVL ++ A  V E    G +  I   RL+ A +ILLSYQN DGG+ TYENNRG+G++E LNPSEVFGDIMIDYSYVE + A +TA+  F++ +P+HR++EI  AI  G  F++ +QR DGSWYGSW  CF YG+WFGIEGL+  GE P +S  I +A  FLL HQ++NGGWGE + SC DK Y  +G  +  G+ GSGVV T+WA++ L   KC D EA++RGV++LM +Q   GDW QEG+ GVFNR CGITY+ YRNIFP+WALGR
Sbjct:    3 EAIRKATHFYSMLQTSDGHFSGDYGGPHFLMPGLIVVWYVMGQPSLMLNPAQTALMKHYLIVHQQADGGWGTHVESPSTMFGTTLSYVALRLLGMDAEEPVCQRGRAFIREQGGAVMTSSWAKLYLCILGCMEWDGHNSVPPELWLLPNWFPFHPSRMWCHARMVYLPMGYVYGARLKYDKAEEDPLVQALRRELYCEPYNSIEWMQTRHMVAPMDNYSPVAWMMKTVQNGLARYETWPMLQPFKNDVRKLGLAFCVDYMAAEDLQTNFIDIGPVNKVLNMLSAFHHAGNDLHHSTVMNHMIRVQDYLWVAEDGMKMKGYNGSQCWDTSFAIQAVFEAGLLDDFPELSNKVWTYLERCQILSTEVSQASPAFKYEAALYRRKFYRHISEGGWPFSTSAHGWPISDCTGEGLKGVLCMLKAKSVREGLEDGSLREISEVRLQKAANILLSYQNEDGGFPTYENNRGFGFYESLNPSEVFGDIMIDYSYVECSMASLTALADFHEDYPDHRTEEIVHAIEKGRDFLKDLQREDGSWYGSWACCFCYGSWFGIEGLVKCGE-PVSSEFIAKACKFLLQHQRSNGGWGEDFTSCYDKEYAANGM-EAYGDDGSGVVNTSWALMALSTAKCNDIEAIKRGVQYLMKRQLPCGDWPQEGVAGVFNRACGITYTAYRNIFPIWALGR 680          
BLAST of NO12G00440 vs. NCBI_GenBank
Match: OEU13816.1 (acetyl-coenzyme A synthetase [Fragilariopsis cylindrus CCMP1102])

HSP 1 Score: 764.6 bits (1973), Expect = 3.400e-217
Identity = 363/681 (53.30%), Postives = 461/681 (67.69%), Query Frame = 0
Query:  237 MLQCEDGHWAGDYGGPMFLMPGLIFACYITKTP---LGKEREEAMQAYLKNHQQEDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRFSAD-AAADPLLLALRHELYV-TPYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYER-APWLW----LRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVFAEGGKEAE-------------AFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVEAFPQAMRKAYSYLDRTQIKA-------------DEDEASYWFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVVEG----GVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNHRSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQTLGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGR 878
            MLQCEDGHWA DYGGP FL+PGL+   Y+   P   L + +   ++ Y++ HQQ DGGWG HIESPS+MFG+V+ YV+LRLLG   +D  C  A  F+ EHGGA+   SW KF+L +LG+ +W+G NS P EMWLLP W PFHPGR+WCH RMVYLPM YLY  RF+ + A +DPL+  L+ ELY   PY  I W   R   +D D Y P+  +MK LQN L+ YE  A + W    +R++G +F++ Y+ AED QTN++DIGPVNK LNML ++                       +RH+ R+ DYLW+AEDGMKMQGYNGSQCWDTSF VQA+ + G+++ FP   RK +SYL+++QI +               D    ++RHIS+GGWPFST+AHGWPI+DCT EGLK  L L+    + +      V  I   RL  A +ILL+YQN DGGWATYENNRGWGW+E LNPSEVFGDIMIDYSYVE + A +TA+  F + FP+HRS ++ R+I  G  F+++IQR DGSWYGSW  CF YG WFGIEGLI  GE P  SP I +A  +LL HQ+ NGGWGE + SC DK + +DG        GSGVV T+WA++ L   KC+D EA+ RGVR+LM +Q   GDW QEGI+GVFNR+ GITY+ YRN+FP+WA+GR
Sbjct:    1 MLQCEDGHWAADYGGPHFLLPGLVVVWYVMNRPSNLLDEGQVRMIRHYIQVHQQLDGGWGTHIESPSTMFGSVLMYVALRLLGADRNDEACVNARTFLDEHGGALFTSSWSKFYLCILGVMDWKGHNSTPPEMWLLPNWIPFHPGRMWCHARMVYLPMGYLYGSRFTYNKAKSDPLIAELQTELYAGQPYDTIPWTKTRQLIADTDNYSPIPLVMKILQNILARYENWAIFDWFRNHVRQRGLEFSMEYMKAEDLQTNFIDIGPVNKVLNMLSMYXXXXXXXXXXXXXXXXXXXELLIERHIARVRDYLWVAEDGMKMQGYNGSQCWDTSFCVQALWEAGMLDEFPDLTRKVWSYLEKSQILSTPVSKSSPAYGFETNDNRFKFYRHISEGGWPFSTSAHGWPISDCTGEGLKATLCLLKTKTIRDALNSTVVVSISNERLYKAANILLTYQNEDGGWATYENNRGWGWYEQLNPSEVFGDIMIDYSYVECSMASLTALADFAETFPDHRSDDVRRSIEKGRSFLKNIQRDDGSWYGSWACCFCYGVWFGIEGLIKCGE-PVNSPCILKACRYLLMHQRPNGGWGEDFTSCYDKDFAKDGMKAYGDSDGSGVVNTSWALMALSYAKCDDVEAIRRGVRYLMERQLPSGDWPQEGISGVFNRSVGITYTAYRNVFPIWAMGR 680          
BLAST of NO12G00440 vs. NCBI_GenBank
Match: XP_009033487.1 (hypothetical protein AURANDRAFT_19883, partial [Aureococcus anophagefferens] >EGB12462.1 hypothetical protein AURANDRAFT_19883, partial [Aureococcus anophagefferens])

HSP 1 Score: 743.0 bits (1917), Expect = 1.000e-210
Identity = 374/681 (54.92%), Postives = 453/681 (66.52%), Query Frame = 0
Query:  230 KGMSYYQMLQCEDGHWAGDYGGPMFLMPGLIFACYIT---KTPLGKEREEAMQAYLKNHQQEDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRFS-ADAAADPLLLALRHELYVT--PYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYERAPW---LWLRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVF----AEGGKEAEAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVE--AFPQAMRKAYSYLDRTQI-KADEDEAS------------YWFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVVEGGVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNHRSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAG----EDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQTLGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGRY 879
            K +++YQ LQC+DGHW GDYGGP FL PGL+   Y+T      L + +  AM  Y +NHQQ DGGWG H+ESPS+MFG+V++YV+LRLLG PA  P C A    I E GGA    SW KF L +LG  +WEG  SVP EMWLLP W PFHP R+WCH RMVYLPM YL+ +R++  +A +DP++LALR ELY     Y  I W   R   + +D Y PV  LM   Q  L  YE        + R+KG  F+  Y  AED QTNYV IGPVNK  NML  +    A+GG   EA  RH  R+ DYLW+AEDGMKMQGYNGSQCWD SF  QA+A+  L +   F    +KA+SYL+RTQI      +AS             +FRH+SKGGWPFST+AHGWPI+DCTAEGLK VLAL    CV  G    I   RL DA  ++L+ QN DGG+ATYEN RG+GW+E LNPSEVFGDIMIDYSYVE +     A+ +F +  P+HR+ EI+ A+  G  F+RSIQR DGSWYGSW  CFTY  WFGIEGL+ +G    EDP TS  + RA  FLL HQ+ NGGWGE + SC DKAY + G        G+GVV T WA+LGL+AG C D +A+ RGV +L A+Q  DGDW QEGI+GVFNR+CGITY+ YRN+FP+WAL RY
Sbjct:    3 KALAFYQQLQCDDGHWGGDYGGPHFLSPGLVVVWYVTGRRDDVLDEHQRRAMVRYYENHQQTDGGWGTHVESPSTMFGSVLTYVALRLLGEPADAPACAAGRKLILEQGGACYTSSWAKFALCLLGAMDWEGHESVPPEMWLLPCWCPFHPCRMWCHARMVYLPMGYLWGKRWTYENADSDPVVLALRDELYPASPAYGAIPWRATRSWVAPMDDYSPVHPLMVAAQRFLRVYEDLGGPLRRYARRKGLAFSADYCRAEDLQTNYVCIGPVNKVYNMLVAYDDRHADGG---EALARHALRVPDYLWVAEDGMKMQGYNGSQCWDASFATQAIAESDLGDDARFRDCAKKAWSYLERTQILSTTTSQASPAFAFEAPKLRERYFRHVSKGGWPFSTSAHGWPISDCTAEGLKSVLALRSLACV--GECAPIGYERLCDAADVVLALQNADGGYATYENTRGYGWYEWLNPSEVFGDIMIDYSYVECS----MALARFREACPDHRAAEISAALKRGNAFLRSIQRADGSWYGSWACCFTYAGWFGIEGLVDSGEDLREDPKTSEPVARACAFLLRHQRPNGGWGEDFTSCFDKAYAKHGMEAYGDAEGAGVVCTGWALLGLMAGACADADAVARGVAYLEARQLPDGDWPQEGISGVFNRSCGITYTAYRNVFPMWALARY 674          
BLAST of NO12G00440 vs. NCBI_GenBank
Match: XP_020433325.1 (cycloartenol synthase [Heterostelium album PN500] >EFA81207.1 cycloartenol synthase [Heterostelium album PN500])

HSP 1 Score: 742.7 bits (1916), Expect = 1.400e-210
Identity = 365/700 (52.14%), Postives = 459/700 (65.57%), Query Frame = 0
Query:  181 GKKNAEYSFSAAVNPNSADKVFRTQQISKWKGAMPDPKLRPKTAGEAARKGMSYYQMLQCEDGHWAGDYGGPMFLMPGLIFACYITKTPLGKEREEAMQAYLKNHQQ-EDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRFSADAAADPLLLALRHELYVTPYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYERAPWLWLRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVFAEGGKEAEAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVEAFPQAMRKAYSYLDRTQIKADEDEASYWFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVVEGGVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNH-RSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQTLGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGRY 879
            G++   YS     NP   D  F        KG    P   P+   E+  K + Y+  +Q EDGHWAGDYGGPMFL+PGL+  CY+T   L +   + +  YL N Q  +DGGWGLHIE+ S +FGT + YVSLRLLGLP   P    A  F++++GGA   PSW KFWLA L ++ W+G+N +P E WLLP   P  PGR WCHCRMVYLPM YLY RR    AA  PL+  LR ELYVTPY +I W  +R   + LD+Y P S L+K +   L+ YER    WLR K  DF   +I  ED+QT Y+DIGPVNKTLNML V+   G+    FK H  RL DYLWLA DGMKMQGYNGSQ WDT+F +QA  + G+   FP+AMR A  YLD TQ+  +  +   +FRHISKG WPFST  HGWPI+DCTAEG+K  LAL   P +V      I   R+ + ++++LS QN DGGWA+YEN RG  W EL NPSEVF +IMIDYSYVE ++A + AM  F K  P H R++EI R+I  G +FI+SIQR DGSW GSWGICFTYGTWFG+EGL+A+GE P  SP + +A  FL++ Q+ +GGWGES+ S V K Y +    Q        +V T WA+L L+A K  D+E +ERG+++L+++Q  +GD+ QE I GVFN NC I+YS Y+NIFPLWA+ RY
Sbjct:   10 GRQTWRYSKEVNPNPKPVDGTFCP------KGCNMTPAKSPQ---ESITKAVQYFTQVQTEDGHWAGDYGGPMFLLPGLVITCYVTGYKLPEPHVQEIIRYLLNRQNPKDGGWGLHIEAHSDIFGTALQYVSLRLLGLPVDHPGVERARKFLRDNGGATGIPSWGKFWLATLNVYSWDGLNPIPIEFWLLPYSVPICPGRWWCHCRMVYLPMSYLYARR--TTAAETPLIRELRKELYVTPYSEINWPAQRDHINKLDMYAPHSYLLKSVNGALNLYERMHSKWLRDKAIDFTFDHIRFEDEQTKYIDIGPVNKTLNMLVVWDREGQSPNFFK-HADRLYDYLWLASDGMKMQGYNGSQLWDTAFTIQAFVESGISHQFPEAMRMANHYLDITQVPDNAPDG--YFRHISKGAWPFSTVDHGWPISDCTAEGIKAALALRSLPNIVP-----ISLDRVAEGVNVILSLQNSDGGWASYENKRGPNWLELFNPSEVFQNIMIDYSYVECSAACIQAMSSFLKHAPEHPRAREIRRSIDRGIKFIKSIQRDDGSWLGSWGICFTYGTWFGVEGLVASGE-PLNSPHLVKACKFLISKQREDGGWGESFRSNVTKNYVQHEQSQ--------IVNTGWALLSLMAAKYPDREPIERGIKYLISKQYPNGDFPQESIIGVFNFNCMISYSNYKNIFPLWAISRY 681          
BLAST of NO12G00440 vs. NCBI_GenBank
Match: XP_012758197.1 (hypothetical protein SAMD00019534_007760 [Acytostelium subglobosum LB1] >GAM17601.1 hypothetical protein SAMD00019534_007760 [Acytostelium subglobosum LB1])

HSP 1 Score: 740.3 bits (1910), Expect = 6.800e-210
Identity = 355/659 (53.87%), Postives = 446/659 (67.68%), Query Frame = 0
Query:  222 KTAGEAARKGMSYYQMLQCEDGHWAGDYGGPMFLMPGLIFACYITKTPLGKEREEAMQAYLKNHQQ-EDGGWGLHIESPSSMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAVLGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRFSADAAADPLLLALRHELYVTPYHDIAWDGERHACSDLDVYDPVSGLMKFLQNCLSYYERAPWLWLRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCVFAEGGKEAEAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMADGGLVEAFPQAMRKAYSYLDRTQIKADEDEASYWFRHISKGGWPFSTAAHGWPIADCTAEGLKGVLALMDAPCVVEGGVPLIPPSRLEDAMHILLSYQNGDGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNKRFPNH-RSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGLIAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQTLGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEGITGVFNRNCGITYSQYRNIFPLWALGRY 879
            K+  EA  K   Y+  +Q EDGHWAGDYGGPMFL+PGL+  CY+T   L +     +  Y+ N Q  +DGGWGLHIE+ S +FGT + YVSLR+LGLPA+ P  T A  F++ +GGAV  PSW KFWLAVL ++ W+G+N +P E WL+P  FP  PGR WCHCRMVYLPM YLY RR    AA  PL+  LR ELYVT Y  I W  ++++ + LD+Y P S L+K +   L  YE     WLR K  DF   +I  ED+QT Y+DIGPVNKTLNMLCV+   G+    FK H  RL DYLWLA DGMKMQGYNGSQ WDT+F +QA  + G+   FP  MR A  YLD +Q+  +    +++FRHISKG WPFST  HGWPI+DCTAEG+K  L+L   P      +  I   R+ + ++++LS QN DGGWA+YEN RG  W E  NPSEVF +IMIDYSYVE ++A + AM  F  + PNH R KE+  +I  G RFI+SIQR +GSW GSWGICFTYGTWFG+EGL+AAGE P TSP I +A  FLL+ Q+ +GGWGES++S V K Y  +   Q        +V T WA+L L+A K   +E +ERG++FL+++Q  +GD+ QE I GVFN NC I+YS Y+NIFPLWAL RY
Sbjct:   51 KSPQEAITKAFQYFSAVQTEDGHWAGDYGGPMFLLPGLVITCYVTGYSLPEAHCREIIRYMLNRQNPKDGGWGLHIEAHSDIFGTALQYVSLRILGLPAAHPGVTRARDFLRANGGAVGIPSWGKFWLAVLNVYSWDGLNPIPIEFWLVPYAFPICPGRWWCHCRMVYLPMSYLYARR--TTAAETPLIRELRQELYVTDYSTINWPAQKNSINKLDMYAPHSTLLKGINAALGVYEGVHSKWLRDKAIDFTFDHIRYEDEQTKYIDIGPVNKTLNMLCVWDREGQSPNFFK-HADRLQDYLWLANDGMKMQGYNGSQLWDTAFTIQAFVETGIAGQFPDTMRLANHYLDISQVPDNSPNMNHYFRHISKGAWPFSTVDHGWPISDCTAEGVKAALSLRSLPF----HIAPISIDRVAEGINVILSLQNKDGGWASYENKRGPNWLEKFNPSEVFQNIMIDYSYVECSAACIQAMCAFRSQAPNHPRIKEVNGSIERGVRFIKSIQRDNGSWLGSWGICFTYGTWFGVEGLVAAGE-PLTSPHIVKACKFLLSKQRDDGGWGESFMSNVTKEYVHNDQSQ--------IVNTGWALLTLMAAKYPQREPIERGIKFLISRQYPNGDFPQESIIGVFNFNCMISYSNYKNIFPLWALARY 693          
The following BLAST results are available for this feature:
BLAST of NO12G00440 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match NameE-valueIdentityDescription
GAX18680.12.100e-24351.12cycloartenol synthase [Fistulifera solaris][more]
GAX17250.12.300e-23450.30cycloartenol synthase [Fistulifera solaris][more]
OLP86082.12.800e-23249.15Cycloartenol synthase [Symbiodinium microadriaticu... [more]
EJK52279.14.800e-23252.86hypothetical protein THAOC_28460 [Thalassiosira oc... [more]
XP_002287432.17.200e-22855.74cycloartenol synthase;-2,3-epoxysqualene mutase-li... [more]
XP_002185678.11.200e-22255.29acetyl-coenzyme A synthetase, partial [Phaeodactyl... [more]
OEU13816.13.400e-21753.30acetyl-coenzyme A synthetase [Fragilariopsis cylin... [more]
XP_009033487.11.000e-21054.92hypothetical protein AURANDRAFT_19883, partial [Au... [more]
XP_020433325.11.400e-21052.14cycloartenol synthase [Heterostelium album PN500] ... [more]
XP_012758197.16.800e-21053.87hypothetical protein SAMD00019534_007760 [Acytoste... [more]

Pages

back to top
Relationships

This gene is member of the following syntenic_region feature(s):

Feature NameUnique NameSpeciesType
nonsL124nonsL124Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ncniR070ncniR070Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ncniR066ncniR066Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR006ngnoR006Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region


This gene is orthologous to the following gene feature(s):

Feature NameUnique NameSpeciesType
NSK009932NSK009932Nannochloropsis salina (N. salina CCMP1776)gene


The following polypeptide feature(s) derives from this gene:

Feature NameUnique NameSpeciesType
NO12G00440.1NO12G00440.1-proteinNannochloropsis oceanica (N. oceanica IMET1)polypeptide


The following gene feature(s) are orthologous to this gene:

Feature NameUnique NameSpeciesType
jgi.p|Nanoce1779_2|323426gene_9947Nannochloropsis oceanica (N. oceanica CCMP1779)gene
jgi.p|Nanoce1779_2|674559gene_10109Nannochloropsis oceanica (N. oceanica CCMP1779)gene
Naga_101364g2gene4543Nannochloropsis gaditana (N. gaditana B-31)gene


The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameSpeciesType
NO12G00440.1NO12G00440.1Nannochloropsis oceanica (N. oceanica IMET1)mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO12G00440 ID=NO12G00440|Name=NO12G00440|organism=Nannochloropsis oceanica|type=gene|length=6783bp
CTCTGTTCTTCGTTTTTCGATGAGTTAACGTTTTCTTTTTGTATCTTGTC
TTTTTTCTTTTCTGCTGCGTTCGCTAGCCCTGACATGATTTTGAGCACTA
CGGCCATTGTCGAATTCGTCTTTTTTCATTTCCCCCATCAGCTCTGTGGT
CATTAATCTTTTCCAACCTCGCACACTCCTACACACTCCTCCATCGTATG
TATTCACATCTGTTCCATTCTCTTTTTTTCCCTTACTTTCCTCCATCCTT
CCCTCACCTACCCTCGCGCATTGTACCCTAACGGTATGCCTGCGTTCCTG
AGCCTAACCCGTCCTCTTCGAAATCCACCCGCCCGGCAATCAAAGCGGCG
GCGCGCGACGAAGAAGAAAAGGAAGTGGAGGAAGTCCCGGAGGTGGAGGA
GGATGTGGACCCCTCACTACTGTACTGTTGGTGCAGGTCGGTGCCCTCTT
GCTGCTCTTGTTGCTCATGCAATGGCCCCTCATCCTCGACGGCCATTCTC
TCCTCCTCGATCTCGCCTAAAGGTGGCAAGGCTGCTTGGAAGCTGTCACC
AACGCGAACGTGCAGAGGGGCCGCGGGTGGACGAAGGTACTTTTTCGATT
CTCCTCCCTCGCCTTCCGAATCACTTGTGTCGTTCGTGGTCGAGTCCACG
GCCACATTAGCTGGCGGATCCAGCTTGGAGGGAACAAGGGAGTGGAAGGG
AAAAGAGAGTCGTAGAGGAAGGTATGAGGAGACATGACATCATGGGGTGA
TCATGATCTAAATGGTAAGAGTACCGCTATGCCAGGTAGACAGTGCGGAA
TCTGACCACAAGATGAATGAATTGACATCTCACCTAATATATAAATACAT
GCTACATGCATACATAAAAGCAACGCGTGTCTCACCTCCCGTAGGACAGA
CGTTGCGGCTCCCCCTGCTGTTGGCGCCGCCGCTGCTTCTGCTTCTGCTG
CTGTTGCTGCTGCTGCGCGCATATCGTTCAGGTCGTCTAAGGCTTGTTTC
TTGGGGCAGGGGGCTTGGCGAGATGGGGATCCATCGCGTTTGTTGGGGTT
ATCCTTGCCGTTGTCGTCGGCGTTCATTCCCGAGATGGGGGGAGGAGTGA
GACAGGGAGGAGGTTTTGCCTAGTCCCTTGATGTTGGTATGCTATGCTGA
ATTACTACGGCAGCTGCGGAGGCCACTCTAGCAACGCAAATGCCAATAAG
AGTTACCTTGTGGTAGGGTGTGGAGTGTTGTGAGAGGCGTCACGACTTAC
AACCACAGAACAAACAAGAAGAAGGTGCGTGTGGATGATACCATGAATGA
TACCCAGGAATCATGAAGGGCAAAAACCTCCTCTCCGTCGCCAACTTGTT
TTTCATAGAGCGTGTCTCAAGCCCCCCACATCTGAACCATGCATTCCAGC
CTCGTCACCAACGCATGCACACGCTGAATCCTCCTATCCCCCTCCCACAG
GAATTAGGAAAATGGCGGCAGGCTTTCTGCGGGCGCTCCTGGATAAGGGC
GGCCATGACGAAGGCGGGGACGATAGATTGCTTGTTCCGTTGGCCTTCAC
GGGGCTATGCGTCTTGGGAGgtatgtacgcagaatttggccttcagaaga
acaagtaatatctgtgtgtatgcactgaccagttcagggaaaatgtgctg
tgccttagcctggtattcggaccctgtcgtctgacccctcgccctccctc
tttccgtactaccaccccacattcgcacatcccccctatcctcccgcatt
tccactttctttcgattttcacctccgtcctctttcatgcctcctgcagC
CACAATCTACTCGACTTTCTATGCCCGACCAACGTCCTACCCCGGTGGCC
CCCGCACCCACCGGCAAGACCGCCGTGCCTCGTGGACGAAACGAGTCACC
ACCCTGCCGGCCGGATGGGTGTTTAGTCATGCGGAGGAATCGCATGGAGT
GACCCAACGGCATTATCAAAACATGCTGGGGGGTGAAGTCCCCTTAACGG
GTGAGGCGGCAGGACGACAGATGTGGTACTATGACGGAAAGATGGCCCAG
GCGCTGGCAAGAGTTGGGAAGAAAAATGCCGAGTACAGCTTCAGCGCGGC
CGTTAACCCTAACAGgtacttgtcgtatctatgggcaccatagactgtac
acacactaaagtatatataggcctgcaatatatatatatatatatgtaga
tagtggaagcgaatgacatttcgtggccatttccccccctcccttcgacc
tcactctcctccctccctccctccctccctccctccctccctccttcccc
acagTGCCGACAAGGTCTTCCGCACGCAGCAGATCAGCAAGTGGAAAGGC
GCCATGCCCGACCCCAAACTCCGGCCCAAGACGGCGGGCGAGGCGGCGAG
GAAGGGAATGAGCTACTATCAGATGCTGCAGTGCGAGGATGGgtgagatg
ggggggggcgggagggaaggaggggggcagaatagaccttccatttccta
cattttaccctccttctatctatcactccctccctccccctctccgtttc
tgttttaagtcactggacagaaggacatgtttctcgtggcggggtgatat
tggtttattttgtcacatggaccccgcctgggaagacaggagaggaattc
ttgcaaccatccacccggtcctcccccactccctcccccccccctccctc
cctccctccctttcccctccagGCATTGGGCGGGGGACTACGGAGGACCG
ATGTTTCTCATGCCGGGGCTGATTTTCGCGTGCTACATCACCAAGACGCC
TTTGGGGAAGGAGAGGGAGGAGGCCATGCAGGCGTACCTGAAGAACCACC
AGCAGGAGGACGGAGGCTGGGGCCTACACATCGAAAGCCCTTCCTCAATG
TTTGGCACGGTCATGTCCTATGTCTCCCTGCGCCTGTTGGGCTTGCCAGC
Ggtgcgtcctcgtgccctccctccctccctccctccctccctccctccct
ccctccctgtcgcggaccttccctctcgataataacagcggctcgtcccc
cccctcacacccctcccttcctccctccttccctccctcccttcctcctt
cagTCTGACCCCACCTGCACTGCCGCACACGCCTTCATCCAAGAGCACGG
CGGGGCAGTGATGGCGCCCTCCTGGTGTAAGTTTTGGCTGGCCGTGCTGG
GCCTTCACGAGTGGGAGGGCATCAACAGTGTGCCGGCCGAGATGTGgtga
ggagggagggaggaagggagggagggagggagggagggaagcacagggat
tggaggcacaggggagggtgggaaggaaggaaggagggagggaggcagag
gggaaggaggataaattcaagtatccgttaaatagaaaccgcccactacc
ccccccccccctctctctgcagGCTTCTTCCCCGATGGTTCCCATTCCAT
CCCGGCCGTCTCTGGTGCCATTGCCGCATGgtacgtctcctcccctttct
cccttcctccccccctctccaggtggcttttccggatgccacctctgctg
ccaccacgtctccactgttgccgcctacgcccttctcctcctcctcgccc
tacagcgagacctttccctttctccccactcaaaccaatccaagccctcc
ttcctacttgcctccctccttccctctctcccttctggtctccttccctt
ccctccctccctccctccctccctccctccctccctccctccctcctcta
gGTGTACCTTCCCATGTGCTACCTCTACTGCCGCCGCTTCTCCGCCGATG
CCGCGGCCGACCCCCTCCTCCTGGCCCTCCGCCACGAGCTCTACGTGACT
CCCTACCACGACATTGCTTGGGACGGGGAGAGGCACGCCTGCAGCGACCT
GGACGTCTACGATCCCgtgaggagggagggagggagggagggaggaaagg
gaggagggagggagggagggatggaagggaggatggaaggacggagggaa
gggacggagggaaggaggaagggaggggaaggagggaggaaggagggagg
gaggggagggagggagggaagggagagagggaagggagggaagggaggac
cgagtgatgtaaccggtcgctggcaaacaaccgagactcacccctccttc
cctcccccctccctccccgcctccctccctcccttccagGTCTCGGGCCT
CATGAAGTTTCTCCAGAACTGCCTCTCGTACTACGAACGCGCCCCATGGC
TCTGGCTGCGGAAGAAGGGCACCGATTTTGCCATTGCCTACATCCACGCC
GAGGATCAACAGACCAACTACGTCGATATAGgtgggtgtccctccctccc
tctttcctccctcaatccctccctcaatcccttgttgtcttcctccgaga
aagcgatcctccctccctccctccctccctcccttccttccttccgaggg
atcggtcctccctttcacccttcccctccctccctctttttccagGTCCT
GTGAACAAGACGCTCAACATGCTCTGTGTTTTTGCCGAGGGCGGGAAGGA
GGCCGAGGCCTTCAAGCGCCACGTCGGCCGTCTGGATGACTATTTGTGGC
TCGCGGAGGATGGGATGAAGATGCAGgtgagggagggaaggagggaggga
gggagggacggagagagggagtcaggccaggaggtcgcttttagtgtcct
gaactctcagtcattcactcgctgagcctttgcatcaaccctctgcaata
tcctcccctaccctccctccctccctccctccctcagGGCTACAACGGGA
GCCAGTGCTGGGACACCTCTTTCCTCGTCCAGGCGATGGCTGACGGCGGT
TTGGTGGAGGCATTCCCCCAAGCCATGCGCAAAGCGTACAGgtaggtagg
taggtaggtaggtaggtaggtagggagggagggaggaaggggttgaggag
caaggagtacgtggtgatgctgaggtggcgttgcctgattttcgttcctc
cctccctccctcccttcctccttccttcaccagCTACCTAGACCGCACTC
AGATCAAGGCCGACGAAGACGAGGCCTCTTATTGGTTCCGACACATCTCC
AAAGgtgcgtatactcttgccttcccccctcctttccttcctccctcccc
tccctcccctccctccctccctccctcccttcctccctcccttcctcctg
aacctcccgcccgccctcccttcccagGTGGCTGGCCGTTCTCGACGGCG
GCCCATGGATGGCCCATAGCCGACTGCACCGCCGAGGGGTTGAAGGGAGT
GCTGGCCCTGATGGACGCCCCCTGCGTGGTGGAGGGAGgtgaggagggag
gaagggggggaggaaacgaggcagggaagaagagagggaagaaaagaggc
gcatccgtggtgcaacaagattcatcccgtgttccccctccctccccctc
tccctccccagGCGTGCCTCTCATCCCTCCCTCCCGCCTCGAAGATGCCA
TGCATATCCTCCTGTCCTACCAGAACGGCGACGGGGGATGGGCCACGTAC
GAGAATAATCGGGGATGGGGGTGGTTCGAGCTCCTCAATCCTTCGGAGgt
gggatgggagggagggaggaagggcgggagggagggcgaggcctgcaggc
aggcacaagagtcctctttctagacccacccacataccaatccttcctac
ctccctccctccctccctctctccccccttcctcctcctccccctctgca
gGTATTTGGGGACATCATGATCGACTACTCGTATGTGGAGCTCACCTCCG
CCGTTATGACCGCCATGCACAAGTTCAACAAACGGTTTCCCAACCATCGA
AGCAAGGAGgtagggcaaacagggagggagggagggagggagggaaggaa
ggaaagtgaccaccaccacaggtatgtgtgttctgacgtccctcccttcc
ttcctccttcccttcctccttcttcccagATCACGCGCGCCATCGCCAGC
GGCGCCCGTTTCATCCGTTCCATCCAACGGCCGGACGGGTCTTGGTACGG
CTCCTGGGGCATCTGCTTCACCTACGGCACGTGGTTCGGCATCGAGgtgc
gttccttccctcccttattccatccctccctccctctttccctccccccc
cttccatccgttctgccccttcagcagagtcgttgtctctccactcatcc
gtaactcttctctcgatcgctcctttcctcccccccctcactcattcctc
ctgtagGGCCTCATCGCCGCCGGGGAGGACCCTGCCACCTCCCCCTCAAT
CCGACGCGCCCTCGGCTTCCTCCTCGCCCACCAGCAGGCCAACGGCGGGT
GGGGTGAGAGTTATCTGTCCTGTGTCGACAAAGCCTACCCGGAAGATGGG
ACGGGACAGACACTGGGGGAGGGAGGGTCAGGGGTGGTGCAGACGGCGTG
GGCGGTGCTGGGGCTGCTGGCGGGGAAATGCGAGGACAAGGAAGCGATGG
AGCGAGGGGTGAGGTTTCTGATGGCGCAGCAGCAGGAGGATGGGGATTGG
GGGCAGGAGGGGATCACGGGGGTCTTTAACCGCAATTGTGGCATTACGTA
CAGTCAGTATCGCAACATCTTCCCGTTGTGGGCCTTGGGGCGATATGCAA
AGGAAGTGGAGGAGAAGGGGAGGTGACAGGGAGGGGAGGGAGAAAAGGGT
GGATGGATGGATTGATGGAGGGAGGGAGGGAGGGAGATATAGGTTGTCAG
TAAGAACCAATAAGGTTGAGAGAAAAAAAGAAA
back to top

protein sequence of NO12G00440.1

>NO12G00440.1-protein ID=NO12G00440.1-protein|Name=NO12G00440.1|organism=Nannochloropsis oceanica|type=polypeptide|length=888bp
MIPRNHEGQKPPLRRQLVFHRACLKPPTSEPCIPASSPTHAHAESSYPPP
TGIRKMAAGFLRALLDKGGHDEGGDDRLLVPLAFTGLCVLGATIYSTFYA
RPTSYPGGPRTHRQDRRASWTKRVTTLPAGWVFSHAEESHGVTQRHYQNM
LGGEVPLTGEAAGRQMWYYDGKMAQALARVGKKNAEYSFSAAVNPNSADK
VFRTQQISKWKGAMPDPKLRPKTAGEAARKGMSYYQMLQCEDGHWAGDYG
GPMFLMPGLIFACYITKTPLGKEREEAMQAYLKNHQQEDGGWGLHIESPS
SMFGTVMSYVSLRLLGLPASDPTCTAAHAFIQEHGGAVMAPSWCKFWLAV
LGLHEWEGINSVPAEMWLLPRWFPFHPGRLWCHCRMVYLPMCYLYCRRFS
ADAAADPLLLALRHELYVTPYHDIAWDGERHACSDLDVYDPVSGLMKFLQ
NCLSYYERAPWLWLRKKGTDFAIAYIHAEDQQTNYVDIGPVNKTLNMLCV
FAEGGKEAEAFKRHVGRLDDYLWLAEDGMKMQGYNGSQCWDTSFLVQAMA
DGGLVEAFPQAMRKAYSYLDRTQIKADEDEASYWFRHISKGGWPFSTAAH
GWPIADCTAEGLKGVLALMDAPCVVEGGVPLIPPSRLEDAMHILLSYQNG
DGGWATYENNRGWGWFELLNPSEVFGDIMIDYSYVELTSAVMTAMHKFNK
RFPNHRSKEITRAIASGARFIRSIQRPDGSWYGSWGICFTYGTWFGIEGL
IAAGEDPATSPSIRRALGFLLAHQQANGGWGESYLSCVDKAYPEDGTGQT
LGEGGSGVVQTAWAVLGLLAGKCEDKEAMERGVRFLMAQQQEDGDWGQEG
ITGVFNRNCGITYSQYRNIFPLWALGRYAKEVEEKGR*
back to top
Synonyms
Publications