NO03G00640, NO03G00640 (gene) Nannochloropsis oceanica

Overview
NameNO03G00640
Unique NameNO03G00640
Typegene
OrganismNannochloropsis oceanica (N. oceanica IMET1)
Sequence length7954
Alignment locationchr3:203202..211155 -

Link to JBrowse

Properties
Property NameValue
Descriptionpresequence protease 1
Mutants
Expression

Hover the mouse over a column in the graph to view expression values.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
chr3genomechr3:203202..211155 -
Analyses
This gene is derived from or has results from the following analyses
Analysis NameDate Performed
PRJNA7699582024-08-13
GSE1786722023-12-15
PRJNA9336932023-10-26
PRJNA9434492023-07-19
PXD0160542021-01-14
PXD0166992021-01-08
GSE1499042020-10-10
NoIMET1_WT_HS2020-09-29
PXD0100302020-03-16
GSE1396152020-03-11
PRJNA2413822020-03-11
BLASTP analysis of N. oceanica IMET1 genes2019-07-11
InterPro analysis for N. oceanica IMET1 genes2019-07-11
GO annotation for IMET1v2 genes2019-07-10
PXD0087212019-04-30
PRJNA1821802019-04-25
Gene prediction for N. oceanica IMET12017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0046872metal ion binding
GO:0003824catalytic activity
GO:0046872metal ion binding
GO:0003824catalytic activity
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO:0006508proteolysis
Vocabulary: INTERPRO
TermDefinition
IPR011249Metalloenz_LuxS/M16
IPR007863Peptidase_M16_C
IPR013578Peptidase_M16C_assoc
Homology
BLAST of NO03G00640 vs. NCBI_GenBank
Match: XP_005842187.1 (hypothetical protein GUITHDRAFT_83724 [Guillardia theta CCMP2712] >EKX55207.1 hypothetical protein GUITHDRAFT_83724 [Guillardia theta CCMP2712])

HSP 1 Score: 1003.8 bits (2594), Expect = 4.000e-289
Identity = 493/983 (50.15%), Postives = 699/983 (71.11%), Query Frame = 0
Query:   86 RYEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEFEAQ---ELDSQVTYQSKKTEPWTLTKTFPATEVTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKE-EDFSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDGLRFEGPLEEIKADLAANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETPVEVAQERGVTILRHALPTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKGGVIDPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVKKENLILNLTGDNKVLTDVLSPVYRFLDGMP-STGDRKLVCQDWRTDNQLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAAAHLAKSS--ITEEDLTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRSTTAVVGSKKALEDANAKLAADAK 1062
            +Y++V+ + + EYGAKV L++H+KT  E+MSV+V D+NKVFGITFRTPP DSTG+PHILEHSVLCGSR++PVKEPFV+LLKGS+ TFLNAFTYPDRTCYPVASQN KDFYNLI+VYLDAVLHP     P  L+QEGWH E+ED+ + L YKGVV+NEMKGVYSSPD +  R  Q+ALFPDN Y VDSGGDP VIP L+++ F+ FH+ +YHPSNSR+YFYG+DD  ARL+LL+ +L EFE       DS + +Q K+  PWT+ + +P+ +    K ++ +NWL+ND+ L  +D+L L +LD +LMG+  + LY+ L ESGLGESVI  GL+  L Q T+SVG+KG+ +     +V++L+L+ L     EGF   +I AS+NS+EF LREFNTG FPRGLSFMLG+++ W+YDRDP++ LRFE PL E+++ +A+ + +FED I  YL++NGHR  V+ +PD  LEEK +K EE  L  ++  +  E + K+I++T +LK  Q+AED PE  A +P L + DLDK  +  P+ V++E+GV +LRH LPTNGI+YADIG D+  + +D LPL+PLF RCL E GT   D   L   I THTGG+ +S+  + K+  G  + P  ++ +++F+RGKA  +K++E+F ++ D++T+ N  NQ +  +M+ E+KAR  +++VG+G+S+A+ R+ +RY +   + EK  GI  +  +  L  + + +W  +  +LE+IRDL+V ++NL++NL+ ++K  + + S +  ++  +P  T + K+V  DW  + + F  K EGF VPTQVNYV KG  +++ GE   G+A V++ +LR  +LWD VRV+GGAYG   S++  SG+F + SYRDPNLLQTL+ YD     L + S  ++   L  AIIG IGD+D+PM+PDQKGF+S+  +L   + E RQ+ RD+VLST++KDFAEFA+RL  +    + AV+GS  ALE+AN +L  + K
Sbjct:   71 KYDIVKEDHIDEYGAKVVLFKHKKTGAEVMSVSVPDENKVFGITFRTPPNDSTGVPHILEHSVLCGSRRYPVKEPFVELLKGSMNTFLNAFTYPDRTCYPVASQNLKDFYNLINVYLDAVLHPALT--PWTLKQEGWHYEIEDESDALKYKGVVFNEMKGVYSSPDAVHGRACQQALFPDNTYGVDSGGDPTVIPKLTWENFEGFHKKFYHPSNSRIYFYGDDDVAARLELLETFLGEFEQHPRVRKDSTIEWQQKRNAPWTIEQHYPSGQ--DGKVLMTVNWLINDQVLKPQDELALDVLDDLLMGTPVSPLYKTLRESGLGESVISDGLETVLQQATYSVGMKGIDDVAKCDQVQKLILDTLNKIANEGFDKSSIEASLNSLEFKLREFNTGGFPRGLSFMLGSLSSWLYDRDPMEPLRFEKPLAELRSRIASGEPVFEDLIKKYLINNGHRVTVKSLPDPELEEKNRKREEEELENVRKSLQKEDISKLIEETKMLKEKQQAEDPPEKLALIPSLTMDDLDKQGRNIPIAVSEEKGVKVLRHELPTNGIVYADIGLDMRVVPVDLLPLIPLFCRCLTEMGTHKRDDIALSDFIRTHTGGVYTSTSTTQKYGSGNRL-PEPEVVSNLFLRGKATYAKSAEMFEVMNDIITNTNFNNQNKFKQMVLETKARLEANIVGSGHSYAAGRIGARYMVTEFVEEKMRGIETLDFIRELAKEVDKNWEGVLAKLERIRDLLVNRKNLLINLSAEDKGFSSLQSNLEEYIQSIPLKTEESKVV--DWAMEMKKFDGKGEGFVVPTQVNYVGKGAQIFKPGEVTSGAAAVVSRHLRTTWLWDKVRVVGGAYGAMNSYNPSSGMFKYVSYRDPNLLQTLETYDQTPEFLRELSKEMSPTTLANAIIGMIGDMDAPMSPDQKGFTSMDRYLTGLTDEMRQERRDQVLSTTAKDFAEFAERLEVVTKEGSIAVIGSSSALEEANKELGLELK 1046          
BLAST of NO03G00640 vs. NCBI_GenBank
Match: XP_002290387.1 (hypothetical protein THAPSDRAFT_22863 [Thalassiosira pseudonana CCMP1335] >EED92139.1 hypothetical protein THAPSDRAFT_22863 [Thalassiosira pseudonana CCMP1335])

HSP 1 Score: 999.2 bits (2582), Expect = 9.700e-288
Identity = 542/1067 (50.80%), Postives = 724/1067 (67.85%), Query Frame = 0
Query:   19 ASAALDAARHRGGAASARALAFSSYKGRSLSASVDLGGSAVTRLDKAVDVFGEKQPKKEWDYVAHHPRYEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEF-EAQELD--SQVTYQSKKTE-PWTLTKTFPATEVTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKEEDFSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDGLRFEGPLEEIKADLA-ANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETPVEVAQ---ERGVTILRHAL-PTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKG---GVIDPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVKK----ENLILNLTGDNKVLTDVLSPVYRFLDGMP--STGDR----KLVCQDW-------RTDNQLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAA-AHLAKSSITEED---LTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRSTTAVVGSKKALEDA 1053
            A+ ++         A+A A    +  G S +A+      ++T L ++  V  +     E      HP +E++ ++VV E+GA  TLYRH+K+  E++SV  DDDNK FGITFRTPP+DSTG+PHILEHSVLCGSRK+  K+PFV LL+GSLQTFLNAFTYPDRTCY VASQNTKDFYNLI+VY DAV HPRA  DP+V  QEGWHLELED  EPLTYKGVVYNEMKGVYSSPD ++ RE Q+++FPDN Y VDSGGDP  IP+LSF+QF  FH+ +YHP+NSR++F G+DD   RL+++D YL++F E+ E    S + +Q+K  + P  +   +PA     +  M+ +NWLVNDK ++  +++ + ILDH+LMG+S++ L + L+ESGLG+++ GGGL   L+Q TFSVGLKGVK E+  KVE+LV+E L   V EGF  DAIAAS+N+IEF +REFNTG FP+GLS MLG+M  W+YDR P D L+FEGPL E+K  +A +  K+F+D I+  LL N HR+ + M P  +LEE+Q K E+ RLA IKA M++E L+ II  T  LK  Q AED+PEARA++P L+L+DL +   E P++V +   + G+T++RH L  T+GI YA +  DVS LSLDD+ LLPLF R +LETG    D   L R IG HTGG+ +S  IS  + +G   GV+     + + + I GKA + K  EL S+   +L DANL  + +++E+L++SK++  SS+ G+G++ A+ R+ SRYS  G I EK  GIS +  + AL+DQAEND+P+L  RLE IR+ +++K    + +IL+LTGD  V   +   V +FL  +P  S GD+          W        TDN   P  +EGF VPTQV+YV KGG L+++GE V GS  V++ +L  GY+WDNVRV+GGAYGGF  F    G+ SF SYRDPNL  T+D+YD AA A LA +   E D   LT AIIG+I D+D  ++PDQKG ++    L  ESPE RQ++RD+VL+T   DF EFA+RL  LK  S +AVV SK A EDA
Sbjct:  114 AARSISTVSTSRNTAAAVAFMHKNRFGASAAAASTCSSGSITALRQSTVVASD----LEKTLGVTHPGFEVISTDVVNEFGAYCTLYRHKKSGAELLSVATDDDNKCFGITFRTPPSDSTGVPHILEHSVLCGSRKYKTKDPFVQLLQGSLQTFLNAFTYPDRTCYVVASQNTKDFYNLINVYSDAVFHPRATSDPMVHAQEGWHLELEDVAEPLTYKGVVYNEMKGVYSSPDSLLQREAQQSIFPDNTYGVDSGGDPNEIPNLSFEQFADFHKKFYHPANSRIFFAGDDDVARRLEIMDEYLSDFGESPESKPASTIQWQAKNFDAPKKIRNPYPAGADQPETHMIMVNWLVNDKPMTALEEITISILDHLLMGTSSSILRKTLMESGLGDAITGGGLMSELMQGTFSVGLKGVKPENVEKVEELVMETLTKVVDEGFTEDAIAASMNTIEFDMREFNTGSFPKGLSLMLGSMREWVYDRSPTDALKFEGPLSELKETIATSGSKVFQDMINDLLLKNTHRSTIEMYPSKTLEEEQLKNEKDRLASIKASMSEEELQSIIDTTKELKKLQAAEDAPEARATIPSLELSDLKREVTEYPIDVTENEADTGITVVRHELGSTSGIAYAKLAVDVSGLSLDDVALLPLFTRMMLETGAGEYDSVALSRRIGMHTGGVSASVMISGVNAEGEDEGVVTSGEYLISKLTITGKATSDKVDELLSIFDLILRDANLDAKAKIIEILRQSKSQKESSIQGSGHATANARIRSRYSPIGYIGEKMNGISSLDTVKALLDQAENDFPSLLARLENIRNTILEKSTCRDGMILDLTGDKNVFETIQPSVEKFLLQLPGDSKGDKLQNFYTEVHPWVKHSKEEMTDNA--PIVDEGFVVPTQVSYVGKGGRLYEEGEAVSGSTAVVSRFLGTGYMWDNVRVIGGAYGGFAQFEPRGGVMSFLSYRDPNLAGTIDVYDGAADALLASAKDMENDPEALTTAIIGAIADMDGALSPDQKGSTAFSRWLSRESPEQRQKYRDQVLNTKPSDFKEFAERLKALKDPS-SAVVSSKAAFEDA 1173          
BLAST of NO03G00640 vs. NCBI_GenBank
Match: XP_002177646.1 (predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1] >EEC50460.1 predicted protein, partial [Phaeodactylum tricornutum CCAP 1055/1])

HSP 1 Score: 998.8 bits (2581), Expect = 1.300e-287
Identity = 519/984 (52.74%), Postives = 689/984 (70.02%), Query Frame = 0
Query:   87 YEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEF----EAQELDSQVTYQSKK-TEPWTLTKTFPATEVTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKEEDFSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDGLRFEGPLEEIKADLA-ANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETPVEVAQ---ERGVTILRHAL-PTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKGGVIDPTTD---MTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVKK----ENLILNLTGDNKVLTDVLSPVYRFLDGMPSTGDRKLVCQDWRTDN-----------QLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAAAHLAKSSITEED----LTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRS 1039
            Y++V  +VV EYGA  TLYRH+K+  E++SV VDDDNKVFGITFRTPP DSTG+PHILEHSVLCGSRK+  K+PFV LL+GSLQTFLNAFTYPDRTCY VASQNTKDFYNLI+VY DAV HPRA+ DP V  QEGWHLELEDK  PLTYKGVVYNEMKGVYSSPD  + R +QR++FPDN Y VDSGGDP VIP+LS++QF+ FHR +Y PSNSR+YF G+DD   RL+L+D YL EF    +A+E  SQ+ +QSK   EP    +T+PA     +  ++ +NWL+NDK ++  ++L LG+LDH+LMG++++ L + L+ESGLGE++ GGGL D LLQ TFSVGLKGV+ E   +VE+L+++ L    K+GF  D IA+S+N+IEF +REFNTG FP+GLSFMLG+M+ W+YD  P + L+FE PL E+K  +A +  KIF+D I  YL+ N HR  V + P  +LEE+  KEE  RL +IK+ ++ E L++II  T  LK  Q +EDS EARA++P L+L+DL +   E P+ V Q   + GVT++RH L  T+GI Y     D+S +S++D+PLLP+F + + +TG    D   L R IGTHTGG+  S   +  HP+G     T D   M   M I+GKA + K  ELFS++  +LTD+ L +QK+V+EMLKES++R  SSV GAG++ ++ R+ +RY + G I E + GISY+  +  L+ QAE DWP+L  R EKIR  +++K      ++L++T D KV  D+   V +FL  +P   + + +   ++  +           +  P K+EGF VPTQV+YV K G L+ +GE +PGSA V+  YLR GYLWD+VRVMGGAYGGFC+FS  SG FSF SYRDPNL +T+D+YDAAA          E+    L  AIIG+IGD+D  ++PDQKG +++   L+ ES E RQ++RDEVL+T + DF EFA+RL  LK  S
Sbjct:    1 YDVVEKDVVDEYGAYCTLYRHKKSGAELLSVAVDDDNKVFGITFRTPPEDSTGVPHILEHSVLCGSRKYKTKDPFVQLLQGSLQTFLNAFTYPDRTCYVVASQNTKDFYNLINVYADAVYHPRAIDDPNVHAQEGWHLELEDKAGPLTYKGVVYNEMKGVYSSPDSRLMRASQRSIFPDNTYGVDSGGDPRVIPELSYEQFREFHRKFYSPSNSRIYFSGDDDVYQRLELMDEYLQEFDMLPDAKE-KSQIQWQSKTYMEPKKEFETYPAGADQPETHLLTVNWLLNDKPMTSFEELTLGVLDHLLMGTTSSKLRKTLMESGLGEAITGGGLSDELLQATFSVGLKGVQGEKTGEVEKLIVDTLTGIAKDGFDEDDIASSLNTIEFQMREFNTGSFPKGLSFMLGSMSKWLYDNSPTEALKFERPLAELKERIADSGSKIFQDMIQSYLVENTHRTTVELAPSKTLEEEILKEERDRLEEIKSKLSQEDLDEIIHKTEELKRLQSSEDSVEARATIPSLELSDLKRETTEYPISVTQNESKSGVTVVRHELGSTSGIAYVSTAIDISGVSVEDIPLLPIFTKMMTQTGAGEYDSVALSRRIGTHTGGVGVSLLTTAVHPEGSDESVTGDGEHMITKMLIQGKATSEKVDELFSIMNLILTDSKLDSQKKVIEMLKESRSRLESSVQGAGHAVSNTRMKARYRVGGYIDEITSGISYLQTVKELVKQAEEDWPSLLRRFEKIRSTILEKSTCRSGMVLDITADEKVFGDIQPSVEQFLTELPGDANGEKLQNFYKEIHPWVPHAKNMMAEFAPVKDEGFVVPTQVSYVGKSGLLYDEGEHIPGSAAVVARYLRTGYLWDHVRVMGGAYGGFCTFSPFSGYFSFLSYRDPNLDKTIDVYDAAADAXXXXXXXXENNPEALATAIIGTIGDMDGALSPDQKGAAAMQRWLINESSEYRQKYRDEVLNTKASDFREFAERLKGLKLPS 983          
BLAST of NO03G00640 vs. NCBI_GenBank
Match: GAX21277.1 (presequence protease [Fistulifera solaris])

HSP 1 Score: 998.0 bits (2579), Expect = 2.200e-287
Identity = 512/1000 (51.20%), Postives = 704/1000 (70.40%), Query Frame = 0
Query:   84 HPRYEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEFEAQEL---DSQVTYQSKK-TEPWTLTKTFPATEVTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKEEDFSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDGLRFEGPLEEIKADLA-ANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETPVEVAQER---GVTILRHAL-PTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKG---GVIDPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVK----KENLILNLTGDNKVLTDVLSPVYRFLDGMPSTGDRKLVCQDWRTDNQLFP-----------NKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAAAHLAKSSITEED----LTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRSTTAVVGSKKALEDA 1053
            HP YE+++ +VV EYGA  +LYRH+K+  +++SV+ +DDNKVFGIT RTPP D TG+ HILEHSVLCGSRK+  K+PFV LLKGSLQTFLNAFTYPDRTCY VASQNTKDFYNLI+VY DAV HPRA+ DP+VL QEG HLEL+DK+EPL YKGVVYNEMKGVYSSPD ++ R  QR++FPDN YAVDSGGDP VIP+L+FDQF  FH  +YHPSNSR+YF G+DD L RL+L+D YL EF+A      DS++ +Q K+ TEP    + +PA     +   +++NWL+ND+ LS  ++L LG+LDH+L G++++ L + L+ESGLG+++ GGGL D LLQ  F +GLKGV ++D SKVE L+L+ L+   +EGF  D IA+S+N+IEFSLREFNTG FP+GLSFMLG+M+ W+Y+  P D L+FE PL+++K ++A +  KIF+D +   L+ N HR  V + P  +LEE+Q ++E+ RL+ IK  ++++ L+KII++T  L A Q A+DSPE RA++P L+L+DL + Q E P+ V++     GVT++RH L  T+GI Y  +  D+S+LSLD++PLLP+F + + ETG    D  QL R IGTHTGG+      +  +P+G    +     +M   + I GKA + K  +LFSL   +LTDA L ++ +V+E+LKES++R  SS   +G+S A+ R+ +RY + G + E + G+SY+  + +LI  AE+DWPAL  R E +R  ++     +  +++++TG+ KVL  +   +  FLD +P   + + +   ++  +   P             +EGF VPTQV+YV K G L+++GE+VPGSA V++ +LR GYLWD+VRVMGGAYGGFC+FS  SG FSF SYRDPNL +TLD+YDAAA +L   + T E+    L  AIIG+IGDLD  ++PDQKG +     ++ ESPE RQ+ RDE+L+T   DF EFA+RL  L   S  AVV SK A E+A
Sbjct:   89 HPAYEVLKRDVVTEYGAYCSLYRHKKSGAQLLSVSTEDDNKVFGITLRTPPEDGTGIAHILEHSVLCGSRKYTTKDPFVHLLKGSLQTFLNAFTYPDRTCYVVASQNTKDFYNLINVYADAVFHPRAIKDPMVLAQEGHHLELQDKEEPLVYKGVVYNEMKGVYSSPDSLLMRSAQRSIFPDNTYAVDSGGDPTVIPNLTFDQFVDFHSRFYHPSNSRIYFSGDDDVLQRLELMDEYLREFDAAPETVDDSRIEWQPKRYTEPQRTVEYYPAGGDQPETHSLSINWLLNDRPLSALEELTLGVLDHLLTGTTSSVLRKTLMESGLGDAITGGGLSDELLQAVFMIGLKGVAKDDVSKVEDLILDTLRKVSEEGFTEDDIASSLNTIEFSLREFNTGSFPKGLSFMLGSMSKWLYEESPTDALKFEEPLQQLKDEIAKSGSKIFQDMVKEMLVENMHRTTVELAPSKTLEEEQAEDEKRRLSSIKESLSNDDLDKIIEETNKLLALQSADDSPEDRATIPSLELSDLKREQAEYPIAVSENENGSGVTVVRHELVSTSGIAYVSLSVDLSSLSLDEVPLLPIFTKLMKETGAGDYDSVQLSREIGTHTGGISVGLMTTAVYPRGADESLSASGENMQTKIVISGKATSEKIDKLFSLFNLMLTDARLDSKAKVIELLKESRSRLESSAQRSGHSVANTRMKARYRVGGYVDEITGGVSYLNTVKSLIKLAEDDWPALLNRFENLRKTILNERTCRSGMLVDITGEKKVLDSIKPSLDSFLDTLPGDANGEKLIDFYKEVHPWVPEAKKRMADLSLETSEGFIVPTQVSYVGKAGLLFKEGERVPGSAQVVSRFLRTGYLWDHVRVMGGAYGGFCTFSPFSGFFSFLSYRDPNLHKTLDVYDAAADYLLSVAETLENDPEALATAIIGTIGDLDGALSPDQKGSTQFQRWIINESPEHRQRIRDEILATKPSDFREFAERLKKLSNPS-IAVVSSKAAFEEA 1087          
BLAST of NO03G00640 vs. NCBI_GenBank
Match: GAX12181.1 (presequence protease [Fistulifera solaris])

HSP 1 Score: 996.1 bits (2574), Expect = 8.300e-287
Identity = 513/1000 (51.30%), Postives = 701/1000 (70.10%), Query Frame = 0
Query:   84 HPRYEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEFEAQEL---DSQVTYQSKK-TEPWTLTKTFPATEVTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKEEDFSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDGLRFEGPLEEIKADLA-ANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETPVEVAQER---GVTILRHAL-PTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKG---GVIDPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVK----KENLILNLTGDNKVLTDVLSPVYRFLDGMPSTGDRKLVCQDWRTDN-----------QLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAAAHLAKSSITEED----LTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRSTTAVVGSKKALEDA 1053
            HP YE+++ +VV EYGA  +L+RH+K+  +++SV+ +DDNKVFGIT RTPP D TG+ HILEHSVLCGSRK+  K+PFV LLKGSLQTFLNAFTYPDRTCY VASQNTKDFYNLI+VY DAV HPRA+ DP+VL QEG HLEL+DK+EPL YKGVVYNEMKGVYSSPD ++ R  QR++FPDN YAVDSGGDP VIP+L+FDQF  FH  +YHPSNSR+YF G+DD L RL+L+D YL EF+A      DS++ +Q K+ TEP    + +PA     +   +++NWL+ND+ LS  ++L LG+LDH+L G++++ L + L+ESGLG+++ GGGL D LLQ  F +GLKGV ++D SKVE L+L+ L+   +EGF  D IA+S+N+IEFSLREFNTG FP+GLSFMLG+M+ W+Y+  P D L+FE PL+++K ++A +  KIF+D +   L+ N HR  V + P  +LEE+Q ++E+ RL+ IK  ++D  L+KII +T  L A Q A+DSPE RA++P L+L+DL + Q E P+ V++     GVT++RH L  T+GI Y  +  D+S+LSLDD+PLLP+F + + ETG    D  QL R IGTHTGG+      +  +P+G    V     +M   + I GKA + K  +LF L   +LTDA L ++ +V+E+LKES++R  SS   +G+S A+ R+ +RY + G + E + G+SY+  + +LI  AE+DWPAL  R E +R  ++     +  +++++TG+ KVL  +   +  FLD +P     + +   ++  +           +L    +EGF VPTQV+YV K G L+++GE+VPGSA V++ +LR GYLWD+VRVMGGAYGGFC+FS  SG FSF SYRDPNL +TLD+YDAAA +L   + T E+    L  AIIG+IGDLD  ++PDQKG +     ++ ESPE RQ+ RDE+L+T   DF EFA+RL  L   S  AVV SK A E+A
Sbjct:   89 HPAYEVLKRDVVTEYGAYCSLFRHKKSGAQLLSVSTEDDNKVFGITLRTPPEDGTGIAHILEHSVLCGSRKYTTKDPFVHLLKGSLQTFLNAFTYPDRTCYVVASQNTKDFYNLINVYADAVFHPRAIKDPMVLAQEGHHLELQDKEEPLVYKGVVYNEMKGVYSSPDSLLMRSAQRSIFPDNTYAVDSGGDPTVIPNLTFDQFVDFHSRFYHPSNSRIYFSGDDDVLQRLELMDEYLREFDAAPETVDDSRIEWQPKRYTEPQRTVEYYPAGGDQPETHSLSINWLLNDRPLSALEELTLGVLDHLLTGTTSSVLRKTLMESGLGDAITGGGLSDELLQAVFMIGLKGVAKDDVSKVEDLILDTLRKVSEEGFTEDDIASSLNTIEFSLREFNTGSFPKGLSFMLGSMSKWLYEESPTDALKFEEPLQQLKDEIATSGSKIFQDMVKEMLVENMHRTTVELAPSKTLEEEQAEDEKRRLSSIKESLSDSDLDKIIDETNKLLALQSADDSPEDRATIPSLELSDLKREQAEYPIAVSENENNSGVTVVRHELVSTSGIAYVSLAVDLSSLSLDDVPLLPIFTKLMKETGAGDYDSVQLSREIGTHTGGISVGLMTTAVYPRGADESVSASGENMQTKIVISGKATSEKIDKLFLLFNLMLTDARLDSKAKVVELLKESRSRLESSAQRSGHSVANTRMKARYRVGGYVDEITGGVSYLNTVKSLIKLAEDDWPALLNRFENLRKTILNERTCRSGMLVDITGEKKVLDTIKPSLDSFLDTLPGDASGEKLTDFYKEVHPWVPEAKKRMAELSSETSEGFIVPTQVSYVGKAGLLFKEGERVPGSAQVVSRFLRTGYLWDHVRVMGGAYGGFCTFSPFSGFFSFLSYRDPNLHKTLDVYDAAADYLLSVADTLENDPEALATAIIGTIGDLDGALSPDQKGSTQFQRWIINESPEHRQRIRDEILATKPSDFREFAERLKKLSNPS-IAVVSSKAAFEEA 1087          
BLAST of NO03G00640 vs. NCBI_GenBank
Match: OEU20476.1 (M16C_assoc-domain-containing protein [Fragilariopsis cylindrus CCMP1102])

HSP 1 Score: 984.6 bits (2544), Expect = 2.500e-283
Identity = 513/1009 (50.84%), Postives = 690/1009 (68.38%), Query Frame = 0
Query:   75 KKEWDYVAHHPRYEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEFE---AQELDSQVTYQSKK-TEPWTLTKTFPATEVTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKEEDFSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDGLRFEGPLEEIKADLA-ANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETPVEVAQ---ERGVTILRHAL-PTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKG---GVIDPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVKKE----NLILNLTGDNKVLTDVLSPVYRFLDGMPSTGDRKLVCQDWRTD-----------NQLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAA-AHLAKSSITEED---LTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRSTTAVVGSKKALEDA 1053
            KK+ D    HP +E++  + V+EYGA  TLYRH+K+  E++SV+ +DDNKVFGITFRTPP DSTG+PHILEHSVLCGS+K+  K+PFV LL+GSLQTFLNAFTYPDRTCY VASQN KDFYNLI VY DAV HPRA+ DP+V  QEGWHLELE K +PL YKGVV+NEMKGVYSSPD ++ RE+QR++FPDN Y VDSGGDP VIPDLSF+QF  FH+ +YHP+NSR+YF G+DD   RL ++D  L+EF+     +  S + +Q K  TEP      +P  E   +  M+ +NWL+ND+ LS  D+L LGI+DH+LMG+S++ LY+ L+ESGLGE++ GGGL D LLQ TFS+G+KG+K ED  K+EQL+++ +    KEGF AD IA+S+N+IEF LREFNTG FP+GLSFMLGAM+ W+YD  P D L+FE PL E+K  +A +  ++F+D I   L+ N HR+ + M P  + E +  KEE+ RL  IK  ++D  +++II+ T  LK  Q AEDSPE RA++P L+L DL +   E P+E  +   + GVT+LRH    T+GI YA +G D+S+LS++D+PLLPL    ++ETG    D   L R IGT TGG+  S   +  HP+G     I     +   + +RGKA +  A ELFSL+K +L+DA    + RV+EMLKE+KA   + + G+G+   ++R+ +RY + G I E   GI+ +  +  L+ QAE DWP+L  RLE +R +++ +E     + L++TGD  VL  V   V  FL  +P +   K +   ++ +           N+  P  +EGF VPTQV+YV K G L+++GE+  G++ V++ +LR GYLWD+VRVMGGAYGGFC+FS  SG FSF SYRDPNLL+TLDIYDAA  A +A +     D   L+Q IIG+IG++D  + PDQKGF+SL   L+ ESP  RQ +RD++L T  +DF  F +RL  +K  S  AVV S+ A E A
Sbjct:   13 KKKAD--VEHPAFEILNKDFVEEYGAAATLYRHKKSGAELLSVSTEDDNKVFGITFRTPPEDSTGVPHILEHSVLCGSKKYTTKDPFVQLLQGSLQTFLNAFTYPDRTCYVVASQNEKDFYNLISVYADAVFHPRAISDPMVHAQEGWHLELESKDDPLVYKGVVFNEMKGVYSSPDSLLGRESQRSIFPDNTYGVDSGGDPRVIPDLSFEQFADFHKKFYHPTNSRIYFSGDDDVATRLKMMDEVLDEFDFSPESKPGSTIVWQKKTYTEPRKEVHPYPIGEDQPETHMMNVNWLMNDEKLSSFDELTLGIMDHLLMGTSSSILYKALMESGLGEAITGGGLSDELLQATFSIGMKGIKAEDVPKLEQLIIDTIAKVAKEGFEADDIASSMNTIEFQLREFNTGSFPKGLSFMLGAMSKWLYDESPTDALKFEKPLAELKEKIAESGSQVFQDLIQKMLVDNSHRSTIEMQPSKTHESELLKEEKDRLEDIKKSLSDSDIDQIIETTNKLKELQSAEDSPEDRATIPSLELGDLKRETTEYPIEETKNENDSGVTVLRHEFGSTSGIAYAVLGIDLSSLSVEDIPLLPLMTSMMMETGAGDYDSVALSRRIGTDTGGVSVSVLNTAVHPEGADESAILEGNHLQTKLIVRGKATSDNAGELFSLMKIILSDAKFDAKSRVVEMLKETKASMEARIGGSGHMAINMRMKARYRVGGYIEEMMGGITQLETVKDLLVQAEEDWPSLLARLENMRSVILNEETCRDGMFLDITGDKSVLEKVQPSVDTFLKELPGSNTGKCLPNFYKDEHPWVAPVKKLMNEFAPIADEGFVVPTQVSYVGKSGLLFEEGEQSSGTSQVVSKFLRTGYLWDHVRVMGGAYGGFCTFSPYSGFFSFLSYRDPNLLKTLDIYDAAGDAVIAAAEQMRNDPDILSQTIIGTIGEMDGSLGPDQKGFTSLQRWLVNESPSYRQAFRDQILDTKPEDFDAFGERLKKIKDPS-VAVVSSQSAFETA 1018          
BLAST of NO03G00640 vs. NCBI_GenBank
Match: CEM29089.1 (unnamed protein product [Vitrella brassicaformis CCMP3155])

HSP 1 Score: 962.6 bits (2487), Expect = 1.000e-276
Identity = 481/999 (48.15%), Postives = 686/999 (68.67%), Query Frame = 0
Query:   84 HPRYEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEFEA--QELDSQVTYQSKKTEPWTLTKTFP-ATEVTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKEED--FSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDGLRFEGPLEEIKADLAANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETPVEVAQERGVTILRHALPTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKGGVIDPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVKKENLILNLTGDNKVLTDVLSPVYRFLDGMP----STGD-RKLVCQDWRT--DNQLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAAAHLAKSSITEEDLTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADL---KTRSTTAVVGSKKALEDANAKLAADAKLDVIEL 1068
            HP+Y++VR++ V EY +   LYRH+KT  +++SV  DD  KVFGI+FRTPP DS G+PHILEH +LCGSRKF  K+ F  L KGSL  F+NA TYPDRTCYPVAS N KDFYNL+ VYLDAVL PRAV DP VL QEGWH+E E+K + L + GVVYNEMKGVYS  + ++ R T++ LFPDN Y VDSGGDP  I DL+FDQF+ FH+ +YHP+NSR + YGNDD   RLD L  YL+EF+A    +DS VT+Q KK EPW + +TFP A   TK+K +V +NWL+ND+ L+  ++L L I++ +L+G+ +++LY+ LIESG G  + G G+  +LLQ TF++GLK V  E+  + K+E LVL  LQ+ V +GF ADAI A++NS+EF+LREFNTG FP+GLS  L  +  W+YD DP D ++FE PL ++KADL   K +F++ I  YLL N HR  + M PD  LE+K  +EE+ RL +IK  + D  ++++IK+T  LK  Q AEDSPE  A++P+++L D+++  +  P  V QE+ V ILRH LPT+G+LYA +G D+  + ++D+P LPLF R L E+GTS  D+    R IG+ TGG+ +SS I+ K    GV+    D+ +++FI GK+V  K  ++F ++KDVL DA L N++R  E+LKE K    SS++GAG+ +A  R +++Y+L G I++ + G+  +  +  LI Q E+DWP++Q RLE IR  +++++ ++L++T  + +L+     +  FLD +P    + GD        W T   + L P + EG  VPTQVNYV K   ++ +GE+VPG+A+V+  +L  GYLWDNVRV+ GAYG      + SG+  F SYRDP L++++++YD A  HL ++   ++D+T+A++G    LD+P  PDQKG  ++++HL  E+ E+RQ++RDEVL+TS +DF + ADRL  +    TRS+  VVGS KA+E+AN +L    KL  +++
Sbjct:   78 HPQYDLVRADYVTEYNSTTYLYRHKKTGAQVLSVAADDPVKVFGISFRTPPDDSKGVPHILEHGLLCGSRKFQAKDTFNQLRKGSLCCFINAMTYPDRTCYPVASTNEKDFYNLMDVYLDAVLFPRAVTDPKVLAQEGWHVEAENKSDDLKFNGVVYNEMKGVYSQVESLVFRRTKQELFPDNTYRVDSGGDPREITDLTFDQFKEFHQRFYHPANSRTFIYGNDDLKKRLDFLHTYLDEFDAPPAPVDSAVTWQKKKNEPWLVRETFPVAPGDTKNKDIVTVNWLLNDQELTPYEKLSLTIMNELLLGTPSSYLYKALIESGYGAQLAGSGVMGSLLQWTFAIGLKDVISEEGTYKKIEDLVLRTLQERVDKGFDADAIEAALNSVEFALREFNTGSFPKGLSLTLAMLEEWVYDLDPSDAVKFEKPLAQLKADLKEGKPVFQNLIKKYLLDNNHRMTLHMTPDEGLEKKWLEEEKQRLERIKDKLTDADMDRVIKETKELKEFQAAEDSPEVLAAIPKMNLEDIERKAEVIPTTVTQEKDVKILRHPLPTSGVLYATLGIDLRDMPVEDIPYLPLFTRMLTESGTSKYDEVGFSRRIGSKTGGVGASSTITSKRAPDGVVGNPMDVLSYLFITGKSVPDKVGDMFDIMKDVLYDARLDNKRRATEILKERKTALESSIIGAGHRYALQRAAAQYTLAGRINDMTGGLGQLEFVRDLIKQVESDWPSVQARLESIRKSLLRRDRMLLDITAGDNILSAAQPSIDTFLDALPPSQATAGDSSSSEKHPWSTAMSSALLPVRGEGIEVPTQVNYVAKLCKVYDEGERVPGAASVVMEHLDVGYLWDNVRVLNGAYGAMAGLGQQSGIVQFASYRDPQLIKSIEVYDNAVKHLRENPPDKDDITRAVLGIFRSLDAPKQPDQKGRQAMMQHLQGETQEERQRYRDEVLATSPEDFGKLADRLEKVIHDATRSSVTVVGSVKAIEEANPQLPDQVKLTTLKV 1076          
BLAST of NO03G00640 vs. NCBI_GenBank
Match: KOO32825.1 (hypothetical protein Ctob_011032 [Chrysochromulina sp. CCMP291])

HSP 1 Score: 959.1 bits (2478), Expect = 1.100e-275
Identity = 507/1030 (49.22%), Postives = 681/1030 (66.12%), Query Frame = 0
Query:   46 RSLSASVDLGGSAVTRLDKAVDVFGEKQPKKEWDYVAHHPRYEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEFEAQELD---SQVTYQSKKTEPWTLTKTFPATEVTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKEEDFSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIY-DRDPLDGLRFEGPLEEIKADLAANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETP-VEVAQERGVTILRHALPTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHP--KGGVIDPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANL-GNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVKKENLILNLTGDNKVLTDVLSPVYRFLDGMPSTGDRKLVCQDWRTDNQLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAAAHLAKSSITEEDLTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRSTTAVVGSKKALEDANAKLAADAKLDVIEL 1068
            R L++   +GGS ++ + +AV      +P       A HP ++++R E++ EY  K   YRH+K+  E++S   DDDNKVFGI FRTP TDSTG+PHILEHSVLCGS K+  KEPFV+LLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNL++VYLDAVLHPRA  DP VL QEGWH ELE+K +PLTYKGVV+NEMKGVYSSPD +M R  Q+  FPDN YAVDSGGDP  IP L+F QF+AFH  YYHP+NSR++FYG+D+   RL+LLD YL +F A +     S+V  Q    +P  L + +P    +    MV LNWL++++ LS  D+L LG+LDH+LMG+  + LY+ +IESGLG S++GGGL D L Q TFS+GLKGV+  D  KVE L +E L+   ++GF ADAI AS+N+IEFSLREFNTG +P+GLS MLG +  W+Y    P +GLRFE PL  +KA LA  +++FE  +   ++ N H A V +VPD +L E Q+  EE  LA +KA M+D  LE II  T  LK AQ  EDS EA  S+PR+ L DL++  K+ P V      G  +L H LP  G++YAD+  D++ + L D+PL+ LF   L E GTS M+   + R IG  TGGL  S+ + Y+ P   GG +    D+ A++ +RGKA   K+ +LFSL   +LTDANL G Q +V+E+L+E K+   ++ + +GNSFA  RL++R +L G + E ++G++Y  A+  ++ QA++DWP L  RL+K+R+ ++ +E LI+NLT D   L  V   V  F   +P T         WR    L P  +E +++ TQV+YV  G  L++ G K+ G+   +  +L  GYLWDNVRV+GGAYGG CS +  +G F+F SYRDPN+  TLDIY   A  L  + +T++ L QAI+G++GDLDSPM  +QKGF +L  HL   + E RQQ+RDEVL T+   F  FA  L     +   A+ G+K A+E AN    AD ++ + +L
Sbjct:  220 RGLNSMRKMGGS-MSAVCEAVPAVVAAEP-------ATHPAFDLLRVEMIDEYTIKCATYRHKKSGAELISAQADDDNKVFGIVFRTPVTDSTGVPHILEHSVLCGSEKYTSKEPFVELLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLVNVYLDAVLHPRATRDPTVLAQEGWHYELENKDDPLTYKGVVFNEMKGVYSSPDSLMYRAAQQLTFPDNTYAVDSGGDPTAIPSLTFGQFKAFHAAYYHPANSRIFFYGDDNLSTRLELLDSYLQKFNAADATPAASEVKTQPLIRKPRRLVEKYPTEPGSPPTHMVMLNWLMHEEPLSADDELALGVLDHLLMGTPTSALYKPMIESGLGASLMGGGLSDELKQATFSIGLKGVQPADVPKVEALAIETLKKAAEDGFEADAIEASLNTIEFSLREFNTGGYPKGLSLMLGILPRWLYGSGSPTEGLRFEAPLANLKARLAKGERVFEGLLQRMIVDNSHLATVELVPDDTLAEAQKAAEEAELAAVKAKMSDAELEGIIAATKSLKEAQLKEDSEEALKSIPRVGLADLERKVKDYPTVFDTLAGGGELLLHPLPCAGVVYADVLLDITKVPLADMPLVRLFSELLDEVGTSDMNAVAMQRKIGARTGGL--STAMIYEQPTGPGGTVADPLDLVAYLAVRGKATVDKSGDLFSLAHALLTDANLKGGQAKVVELLREKKSNLETAFISSGNSFAGARLAARNTLHGYVGELTQGVTYYEAVKEMLTQAKDDWPTLLGRLDKVRETLLSQEGLIINLTADPDALDAVRPTVDAFAAKLPKTAKADPTAVPWRKAVTLLPAVDEAYAITTQVHYVAAGMRLFEPGTKLDGAFYAVARFLSRGYLWDNVRVVGGAYGGGCSLNPRTGGFAFSSYRDPNVQGTLDIYAKTAEALENAHLTDDALEQAIVGAVGDLDSPMTSEQKGFRALTLHLTGVTTEMRQQYRDEVLGTTRASFKAFAKTLRAKPFK--VAIFGAKDAIEAANTARGADEQIAITQL 1237          
BLAST of NO03G00640 vs. NCBI_GenBank
Match: OIT07230.1 (presequence protease 1, chloroplasticmitochondrial [Nicotiana attenuata])

HSP 1 Score: 925.2 bits (2390), Expect = 1.800e-265
Identity = 494/1043 (47.36%), Postives = 678/1043 (65.00%), Query Frame = 0
Query:   21 AALDAARHRGGAASARALAFSSYKGRSLSASVDLGGSAVTRLDKAVDVFGEKQPKKEW----DYVAHHPRYEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEFEAQEL--DSQVTYQSKKTEPWTLTKTFPATE--VTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKEEDFSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDGLRFEGPLEEIKADLA--ANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETPVEVAQERGVTILRHALPTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKGGVIDPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVKKENLILNLTGDNKVLTDVLSPVYRFLDGMPSTGDRKLVCQDWRTDNQLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAAAHLAKSSITEEDLTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRSTTAVVGSKKALEDAN 1054
            A+  A RHR      R  +     GR LS S+DL         +A+      Q  +E+    D VA    +E V  + + E  +K  LY+H+KT  EIMSV+ DD+NKVFGI FRTPP DSTG+PHILEHSVLCGSRK+P+KEPFV+LLKGSL TFLNAFTYPDRTCYPVAS N KDFYNL+ VYLDAV  P+ V D    QQEGWH EL D  + +T+KGVV+NEMKGVYS PD ++ R +Q+ALFPDN Y VDSGGDP VIP LSF++F+ FHR +YHPSNSR++FYG+DDP  RL +L  YLN F+A     +S+V  Q   +EP  + + +P  E    K K MV LNWL++DK L ++ +L LG LDH+L+G+ A+ L + L+ESGLG++++GGG++D LLQ  FS+GLKGV EE+  K+E+LV+  L+   ++GF +DA+ AS+N+IEFSLRE NTG FPRGL+ ML ++  WIYD DP + L+++ PLE +KA +A   +K +F   ID Y+L N HR  V M PD     ++++ E+  L ++KA M  E L ++ + T  L+  QE  D PEA  S+P L L D+ +     P EV    GV +LRH L TN +LYA++ F++S+L  + LPL+PLF + LLE GT  +D  QL + IG  TGG+   S   +     G ++P     + + +RGKA++ +  +LF+LI  VL D  L + KR  + + +S+AR  + + G+G+S A+ R+ ++ ++ G ISE+  G+SY+  L  L DQ E DWP + + LE+IR  ++ K   ++NLT D K LT+    +  FLD +PST    +    W   N      NE   VPTQVNYV K   L++ G ++ GSA VI+NY+ N +LWD VRV GGAYGGFC F   SG+FSF SYRDPNLL+TLD+YD  +  L +  + ++ LT+AIIG+IGD+D+   PD KG+SSL+ +L+  S E+RQ+ R+E+LST   DF +F D +  +K +     V S   +E AN
Sbjct:   30 ASYSAKRHRLLQNLCRRRSLLRSNGRLLSPSLDLKRQFYPLSVRAI-ATSAPQSSQEFLGADDEVAEKYGFEKVSEQFIDECKSKAVLYKHKKTGAEIMSVSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSDDITFKGVVFNEMKGVYSQPDNLLGRTSQQALFPDNTYGVDSGGDPRVIPSLSFEEFKEFHRKFYHPSNSRIWFYGDDDPNERLRILSEYLNMFDASSAPHESRVEPQKLFSEPVRIVEKYPVGEDGDLKKKHMVCLNWLLSDKPLDLETELALGFLDHLLLGTPASPLRKILLESGLGDAIVGGGIEDELLQPQFSIGLKGVAEENIQKIEELVMSTLEGLAEKGFDSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFEPLKYQKPLEALKARIAKEGSKAVFAPLIDQYILRNPHRVTVEMQPDPKKASREEEIEKETLDKVKASMTQEDLAELARATHELRLKQETPDPPEALKSVPSLSLQDIPREPTHVPTEVGDINGVKVLRHDLFTNDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGI---SVYPFTSSVRGKVEP----CSKIIVRGKAMSQRTDDLFNLINRVLQDVQLNDHKRFKQFVSQSRARMENRLRGSGHSIAASRMGAKLNVAGWISEQMGGVSYLEFLKGLEDQIEKDWPQISSSLEEIRTSLLSKNGCLINLTADGKNLTNAEKHISNFLDLLPSTS--LVEPAAW---NAQLSRSNEAIVVPTQVNYVGKAANLYEAGYELKGSAYVISNYISNTWLWDRVRVSGGAYGGFCGFDTHSGVFSFLSYRDPNLLKTLDVYDGTSNFLKELEMDDDALTKAIIGTIGDVDAYQLPDAKGYSSLLRYLLGVSEEERQRRREEILSTRLDDFKKFGDVMEAVKDKGVVVAVASPDDVEAAN 1059          
BLAST of NO03G00640 vs. NCBI_GenBank
Match: XP_019256631.1 (PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like [Nicotiana attenuata] >XP_019256678.1 PREDICTED: presequence protease 1, chloroplastic/mitochondrial-like [Nicotiana attenuata])

HSP 1 Score: 925.2 bits (2390), Expect = 1.800e-265
Identity = 494/1043 (47.36%), Postives = 678/1043 (65.00%), Query Frame = 0
Query:   21 AALDAARHRGGAASARALAFSSYKGRSLSASVDLGGSAVTRLDKAVDVFGEKQPKKEW----DYVAHHPRYEMVRSEVVQEYGAKVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLCGSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHVYLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSPDQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNSRVYFYGNDDPLARLDLLDGYLNEFEAQEL--DSQVTYQSKKTEPWTLTKTFPATE--VTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLIESGLGESVIGGGLDDTLLQNTFSVGLKGVKEEDFSKVEQLVLEILQDCVKEGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDGLRFEGPLEEIKADLA--ANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQQKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLDLTDLDKVQKETPVEVAQERGVTILRHALPTNGILYADIGFDVSALSLDDLPLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKGGVIDPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKESKARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQAENDWPALQTRLEKIRDLVVKKENLILNLTGDNKVLTDVLSPVYRFLDGMPSTGDRKLVCQDWRTDNQLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPGSATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQTLDIYDAAAAHLAKSSITEEDLTQAIIGSIGDLDSPMAPDQKGFSSLIEHLMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRSTTAVVGSKKALEDAN 1054
            A+  A RHR      R  +     GR LS S+DL         +A+      Q  +E+    D VA    +E V  + + E  +K  LY+H+KT  EIMSV+ DD+NKVFGI FRTPP DSTG+PHILEHSVLCGSRK+P+KEPFV+LLKGSL TFLNAFTYPDRTCYPVAS N KDFYNL+ VYLDAV  P+ V D    QQEGWH EL D  + +T+KGVV+NEMKGVYS PD ++ R +Q+ALFPDN Y VDSGGDP VIP LSF++F+ FHR +YHPSNSR++FYG+DDP  RL +L  YLN F+A     +S+V  Q   +EP  + + +P  E    K K MV LNWL++DK L ++ +L LG LDH+L+G+ A+ L + L+ESGLG++++GGG++D LLQ  FS+GLKGV EE+  K+E+LV+  L+   ++GF +DA+ AS+N+IEFSLRE NTG FPRGL+ ML ++  WIYD DP + L+++ PLE +KA +A   +K +F   ID Y+L N HR  V M PD     ++++ E+  L ++KA M  E L ++ + T  L+  QE  D PEA  S+P L L D+ +     P EV    GV +LRH L TN +LYA++ F++S+L  + LPL+PLF + LLE GT  +D  QL + IG  TGG+   S   +     G ++P     + + +RGKA++ +  +LF+LI  VL D  L + KR  + + +S+AR  + + G+G+S A+ R+ ++ ++ G ISE+  G+SY+  L  L DQ E DWP + + LE+IR  ++ K   ++NLT D K LT+    +  FLD +PST    +    W   N      NE   VPTQVNYV K   L++ G ++ GSA VI+NY+ N +LWD VRV GGAYGGFC F   SG+FSF SYRDPNLL+TLD+YD  +  L +  + ++ LT+AIIG+IGD+D+   PD KG+SSL+ +L+  S E+RQ+ R+E+LST   DF +F D +  +K +     V S   +E AN
Sbjct:   87 ASYSAKRHRLLQNLCRRRSLLRSNGRLLSPSLDLKRQFYPLSVRAI-ATSAPQSSQEFLGADDEVAEKYGFEKVSEQFIDECKSKAVLYKHKKTGAEIMSVSNDDENKVFGIVFRTPPKDSTGIPHILEHSVLCGSRKYPLKEPFVELLKGSLNTFLNAFTYPDRTCYPVASTNAKDFYNLVDVYLDAVFFPKCVEDFQTFQQEGWHYELNDPSDDITFKGVVFNEMKGVYSQPDNLLGRTSQQALFPDNTYGVDSGGDPRVIPSLSFEEFKEFHRKFYHPSNSRIWFYGDDDPNERLRILSEYLNMFDASSAPHESRVEPQKLFSEPVRIVEKYPVGEDGDLKKKHMVCLNWLLSDKPLDLETELALGFLDHLLLGTPASPLRKILLESGLGDAIVGGGIEDELLQPQFSIGLKGVAEENIQKIEELVMSTLEGLAEKGFDSDAVEASMNTIEFSLRENNTGSFPRGLALMLRSIGKWIYDMDPFEPLKYQKPLEALKARIAKEGSKAVFAPLIDQYILRNPHRVTVEMQPDPKKASREEEIEKETLDKVKASMTQEDLAELARATHELRLKQETPDPPEALKSVPSLSLQDIPREPTHVPTEVGDINGVKVLRHDLFTNDVLYAEVVFNMSSLKQELLPLVPLFCQSLLEMGTKDLDFVQLNQLIGRKTGGI---SVYPFTSSVRGKVEP----CSKIIVRGKAMSQRTDDLFNLINRVLQDVQLNDHKRFKQFVSQSRARMENRLRGSGHSIAASRMGAKLNVAGWISEQMGGVSYLEFLKGLEDQIEKDWPQISSSLEEIRTSLLSKNGCLINLTADGKNLTNAEKHISNFLDLLPSTS--LVEPAAW---NAQLSRSNEAIVVPTQVNYVGKAANLYEAGYELKGSAYVISNYISNTWLWDRVRVSGGAYGGFCGFDTHSGVFSFLSYRDPNLLKTLDVYDGTSNFLKELEMDDDALTKAIIGTIGDVDAYQLPDAKGYSSLLRYLLGVSEEERQRRREEILSTRLDDFKKFGDVMEAVKDKGVVVAVASPDDVEAAN 1116          
The following BLAST results are available for this feature:
BLAST of NO03G00640 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match NameE-valueIdentityDescription
XP_005842187.14.000e-28950.15hypothetical protein GUITHDRAFT_83724 [Guillardia ... [more]
XP_002290387.19.700e-28850.80hypothetical protein THAPSDRAFT_22863 [Thalassiosi... [more]
XP_002177646.11.300e-28752.74predicted protein, partial [Phaeodactylum tricornu... [more]
GAX21277.12.200e-28751.20presequence protease [Fistulifera solaris][more]
GAX12181.18.300e-28751.30presequence protease [Fistulifera solaris][more]
OEU20476.12.500e-28350.84M16C_assoc-domain-containing protein [Fragilariops... [more]
CEM29089.11.000e-27648.15unnamed protein product [Vitrella brassicaformis C... [more]
KOO32825.11.100e-27549.22hypothetical protein Ctob_011032 [Chrysochromulina... [more]
OIT07230.11.800e-26547.36presequence protease 1, chloroplasticmitochondrial... [more]
XP_019256631.11.800e-26547.36PREDICTED: presequence protease 1, chloroplastic/m... [more]

Pages

back to top
Relationships

This gene is member of the following syntenic_region feature(s):

Feature NameUnique NameSpeciesType
nonsL065nonsL065Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ncniR000ncniR000Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR053ngnoR053Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR051ngnoR051Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region


This gene is orthologous to the following gene feature(s):

Feature NameUnique NameSpeciesType
NSK005886NSK005886Nannochloropsis salina (N. salina CCMP1776)gene


The following polypeptide feature(s) derives from this gene:

Feature NameUnique NameSpeciesType
NO03G00640.1NO03G00640.1-proteinNannochloropsis oceanica (N. oceanica IMET1)polypeptide


The following gene feature(s) are orthologous to this gene:

Feature NameUnique NameSpeciesType
jgi.p|Nanoce1779_2|701gene_1242Nannochloropsis oceanica (N. oceanica CCMP1779)gene
Naga_100802g1gene631Nannochloropsis gaditana (N. gaditana B-31)gene


The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameSpeciesType
NO03G00640.1NO03G00640.1Nannochloropsis oceanica (N. oceanica IMET1)mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO03G00640 ID=NO03G00640|Name=NO03G00640|organism=Nannochloropsis oceanica|type=gene|length=7954bp
ATGCTCCTCCTCCGGTCCTCCGCCTCTTTTCTTGTGCGCAGCAGCGGCCT
CCCTGCCTCAGCTGCCCTGGACGCCGCCCGGCACCGTGGTGGAGCAGCCT
CGGCGCGGGCGTTGGCTTTTTCTTCGTACAAGGGCCGCTCTTTGTCCGCC
TCGGTCGACTTGGGGGGGAGTGCAGTGACGAGGCTCGACAAGGCCGTAGA
CGTGTTCGGGGAGAAGCAGCCCAAGAAGGAGTGGGACTATGTGGCCCACC
ACCCTCGgtaagagagaagaggagcgaaaagggagtagagaagacaggga
aaggaaagtagatatcgttgctatcggagttgtggctgtcatggtcactc
actccgctctatccctccctcactctccctccctccttctcacttcagGT
ACGAGATGGTCAGGTCGGAGGTGGTGCAGGAATATGGCGCCAAGGTGACG
CTCTATCGCCACCGAAAGACCCAGACCGAGATCATGAGCGTCACGGTCGA
CGACGACAATAAGgtaaggagggagggatggagggagggagggagggacc
gaccggagactgtccctttcttgagctcattctgtgggttgaatgataca
ttgcacctcctcacattccttccttcctccctccctcccttcctccctcc
tcatcaagGTCTTCGGCATCACCTTTCGCACCCCCCCGACGGACTCGACG
GGCTTGCCCCATATCCTCGAGCATTCCGTCCTCTGCGGCTCCCGCAAATT
CCCCGTGAAGgtacctccctccctccctccctccctccctccctccctca
cttgattctcagatacaatcgcctccctccctccctccctccctcccttc
ctccctccctccctccctccctccctccctcccttcctccctccctccct
ccctccctccctccctccctccctccctccctccctcccttcagGAACCG
TTCGTGGACCTGCTCAAAGGCTCCCTGCAGACCTTCCTCAACGCCTTCAC
ATACCCGGATCGAACCTGCTACCCCGTGGCCAGTCAGAACACCAAGgtac
gccctccttcctccgtccctccgtccgtccctccgtccctccctccgtcc
ctccctccctccctccgtccctccgtccctccctccctcctttccctccc
tccctccgtccctctctccttccctccctttctccttttccccctccctt
cctcctccctccgtccatccctccctccttcccttcctcccttcctccct
tccttgcttcagaacggcccacctctcccctccctccctccctccctccc
tcactctctctcactcctcccttctcacatgcctgctctcccttcctccc
tccctcccttcctccctccctccctccctccctcaagGACTTCTACAACC
TGATTCACGTCTACCTGGATGCGGTCCTGCACCCTCGGGCGGTGGGGGAT
CCTTTGGTCCTGCAGCAGGAGGGGTGGCACTTGGAGCTGGAAGACAAGAA
Ggtacgtggcccgcccgcccgtcctccctccctccctccctccctcctcc
gctcctctcttccctccctcctttctctctcatctctcccgctttccccc
ctccctccctcccttccacctccacttccttcccgccccctcttttacat
tccacccgctcaacccctccctccctccctccctccctccctccctcccg
ccctccctccagGAGCCCCTGACGTACAAGGGCGTGGTGTACAACGAGAT
GAAGGGAGTCTACTCCTCCCCCGACCAGATCATGAACCGAGAAACACAGC
GCGCACTATTCCCCGACAACGCCTACGCAGTGgtacgccctccctccctc
ctttcctctctccctctctctctaccttcccccctccctccctccttcct
ttgtcgactcaaccagagactccctccctccctccctccctccctccctc
cctccctccctccctccctccctccctccctccctccctccctcactccc
tccctccctccctccctccctccctccctccctgcctccctccctccctc
cctccctccctccctccctccctccctgcctccctccctgcctccctccc
tccctccatgcctccctccctgcctccctccctgcctccctccctgcctc
cctccctccctccctccctccctccctccctccctccctccctccctccc
ttcacagGACTCTGGAGGAGACCCTCTCGTTATCCCTGACCTGTCCTTTG
ACCAATTCCAAGgtacgccctccctccctccctccctccctccctccctc
cctccttccctcacctcctctcctctcctctctcttccctccctccctcc
ttccctctctccttccctccccccctcccccttccaagCTTTCCACCGTC
ACTACTACCACCCGTCCAACTCCCGGGTATACTTTTACGGCAACGACGAC
CCCCTTGCCCGCCTCGACCTCCTCGATGGCTACCTCAATGAGTTCGAGGC
CCAGGAGCTCGACTCGCAGgtacccccctccctccctccctccttccctc
cttgacctcctttgccctctccactcctcttctctcttctcctcgtcacc
tttcctcccccccttcccccctccctccctccttccctccttgtcctcct
ttgccctcaccactcctcttctctctgctcgtcaccttccctccccccct
ccctccctccctcactgcagGTGACGTACCAGTCCAAGAAAACGGAGCCT
TGGACGCTCACTAAGACCTTCCCCGCCACGGAGgtacgcccctccctccc
tccctccctccctccctccctccctcccttccttccttccttccttcctt
cctcccttcctccctttccacaaaccaccctccctctctccctccctccc
tccctccctccctccctccctccgtccctcccccctccttccagGTGACG
AAAGACAAGGGCATGGTCGCGCTCAATTGGCTCGTGAACGACAAGGCCTT
GAGCATGAAGGACCAGCTCGTCCTCGGCATCCTCGACCATATCCTCATGG
GCTCCAGgtacgccctccctccctccctccctccctccctccctccctcc
ctccctccctccctccctccctcccttgcttcgcctcttttggccctcta
ccaaccttgtcgaatccaaccgatccctccctccctccctccctccctcc
ctccctccctccctcccttccttccttcctcagCGCCGCCTTCCTCTACC
GTCGCCTCATCGAGAGCGGGCTGGGCGAGTCCGTCATCGGGGGGGGCCTG
GATGACACCCTCCTTCAAAATACCTTCTCCGTCGGCCTCAAGGGCGTCAA
GGAAGAGGACTTCTCCAAGgtacctccctccctctctccctcccttgcgc
ctgtacttctccccttcaatattctgcgcctaaccccttcctccctccct
tccttcccttcctccctccctccctcccgctctccctcccgccctccctc
ccgcctagGTGGAGCAACTGGTGCTGGAGATCCTGCAAGACTGCGTCAAG
GAAGGCTTTCCCGCGGACGCCATCGCCGCCTCTGTCAACTCCATCGAATT
CTCCCTTCGGGAATTCAACACGGGCCGgtacgttacctcccttccttcac
tccctccctccctccctccctccctcccttcctcccttatcacatggaag
tttcttgggacccacccttccttcccctccctccctccctccccatcacc
cagCTTCCCCCGCGGTCTGTCCTTCATGCTCGGTGCGATGAACCACTGGA
TCTACGACCGAGACCCTTTGGACGGCCTCCGATTCGAGGGCCCCCTGGAG
GAGATCAAGGCTGACCTCGgtaacacaccctccctccctccttccctact
tcccctcctattggcttccttcatcccctccccctccccgccccgcccct
ccctccttccctccctccctccccttctacccccttcattcattccctcc
ctcccttcctccctccctcctccctcccccctccctccccccccagCGGC
CAACAAGAAGATCTTCGAAGACGCCATTGACTTCTACCTCCTCTCCAATG
GCCATCGAGCCGCTGTGCGCATGGTGCCTGACGTCTCTCTCGAGGAGAAG
CAGCAGAAGGAGGAGGAGGGCCGTCTTGCCCAAATCAAGGCTGGGATGAA
CGATGAGgtaagccctccctccctccctccttctctccctccctccctcc
ctccttccaggactccgagaatatatttgaagccactgtctttctctccg
cttcaccacccccaccccctccctccttcccttcctccttcctatccttc
caccctcgctaagGCCCTGGAGAAGATCATCAAGGACACGGCTCTCCTCA
AGGCCGCTCAGGAGGCAGAAGACAGCCCCGAAGCCCGGGCTTCCCTCCCT
CGTTTGGACCTGACAGACCTCGATAAAGTGCAAAAAGAAACGCCCGTGGA
AGTAGCACAAGAGCGCGgtacggccccctccctccctccctccctcccat
ctccatttcgaccgtgctccatgcctcctgccatctcacccctccctccc
tcccttcctccctccctccttgccctagGTGTGACCATCCTCCGTCACGC
CCTTCCCACCAACGGCATCCTCTACGCCGACATCGGCTTTGACGTCTCGg
tacgcccctcccccccctcttccctccctccctccctccctccctccctc
ctgtcctccgtcctttccgccactcatccttcccccctttcctccgctcc
agGCGCTGTCACTGGATGACCTCCCCCTCCTCCCCCTCTTCCTCCGCTGC
CTCCTCGAGACGGGCACGTCCACCATGGACCAGgttggccctccctcccc
cgcttcctccctccctccctccctccttccctctcaccgttccctcccca
ccccttcctccctccctcccaccctccctccagACCCAGCTCGTTCGCGC
CATCGGCACCCACACCGGTGGCCTTCGCTCGTCTTCCCGCATCTCCTACA
AGCATCCCAAGGgtcggtccctccctcccgccctccttctccctcccgcc
ctcccttttcgtcccctccgtacgtgccatcttcaaacctccctcctccc
ttcctcccccccctccctcccccagGCGGCGTCATCGACCCCACCACGGA
CATGACGGCCCACATGTTCATCCGCGGCAAGGCCGTGGCCTCTAAGGCGT
CCGAGCTCTTCTCCCTCATCAAGGACGTCCTCACCGACGCGAACCTCGGG
AACCAAAAGCGAGTTTTGGAGATGCTCAAGGAAAGCAAGGCTCGgtacgt
ccctccctccctccttccctccctccctccctccctcccttcccttccct
tcccctgtccacatcgttcacacacctcccttccattcccttccctccct
ccctccctccttcagGTACCGTTCCTCCGTGGTGGGCGCCGGCAACAGTT
TTGCCTCCATCCGTCTCTCGTCCCGATACTCCCTCCCCGGCCTCATCAGC
GAGAAGAGCGAGGGGATCTCCTACATGCTCGCCCTCGACGCGCTCATCGA
CCAGGCTGAAAATGACTGGCCCGCCCTGCAGgtactccctccctccctcc
ctccctccctccctccctctctccctccctccttgtcgtgtcgcccatgg
gcacgaaacccttctaccgcctctcatccctccctccctccctccctccc
tccctccctccctccctccctccctagACCCGCCTGGAGAAGATTCGTGA
TTTGGTGGTGAAAAAGGAAAACCTTATCCTCAATCTCACGGGCGACAACA
AGGTCTTGACCGACGTCCTGgtacgtaagccctccctccctccctccctc
cctccctccctctctccctcccttccttccgtccttcctccctcccttcc
tcccggggcatcatgcctccattacccacctctcccttcctccccccctc
cctccctccctccctccctccctccctccctccctccctccctcagTCAC
CCGTCTACCGCTTCTTGGACGGCATGCCCAGCACGGGCGATCGGAAGCTC
GTCTGTCAAGACTGGAGGACAGACAACCAGCTCTTCCCCAACAAAAACGA
AGGCTTCTCGGTCCCGACGCAGgttagccccccttccctccctccctccc
tccctccctcctttgtgtgacgcccctctgtgcctccgtcccccccccct
gggccctgcccctctatccctgacattcttctccaactcatctcttcctc
ccttcctccctccctccctccctcctcttctcctcagGTCAACTACGTAA
TCAAGGGCGGGCCTCTCTGGCAGAAGGGCGAAAAGGTACCCGGATCTGCC
ACCGgtacgtaccctccctccctccctccctctctccttccctccctcct
tccctccctccttccctcccaccctctttcccttccctccatcactgtcg
acatcccattccccccctccctccctgcagagatcaacaactacctccct
cactgccaacctctcctcccccccccctccctccctccctgcagTGATCA
ACAACTACCTCCGTAACGGCTACCTGTGGGATAACGTCCGTGTCATGGGA
GGCGCCTACGGAGGCTTCTGCTCCTTTTCCCGCATGAGgtatggacagcg
ccctccctctctccctccctccctccatccctccctctcttcctcccttc
ctcctcatcgattatctcaacctttcatcccatcccctcaccctcccctc
ccatcctccctccctcccgccctccctccagCGGCCTCTTCTCCTTTGGC
TCCTACCGAGATCCTAACCTCCTCCAGACGCTGGATATCTACGACGCCGC
CGCCGCCCACCTGGCCAAGAGTAGCATTACCGAGGAGGACCTGACGCAGg
tacggagggagggagggagggagagtggatactttaatcgtgttggcagg
ggcgtcctttcttcccaccaaatgaaatattcaacgtctgtccctcacca
agcgagccttccctcctccctctttcctcagGCCATCATCGGGAGTATCG
GCGACCTGGACTCGCCCATGGCTCCGGACCAGAAGGGCTTCTCCTCCCTT
ATTGAACACCTGATGgtaaggaggaaggggggatgataacaattaccagg
aggaaaggagggagggaaagagggagggaaagagggagggaaggagggag
ggaaggagggagggaaagagggagggagggagggagggagggagggaggg
gaagagcacaaggaagcaagatggggactttacagggaacgagggaagga
gggagggagagacggaaaggagagggaaggaagacaaaatagccaacaga
atcccctgttttggagtgccggtagcaagtctgtgagtgctcacacagcg
tacacatatctccctccctccctccctccctctctccctccctccctccc
tccctcctcatcacagGAGGAGTCCCCGGAGGACAGGCAGCAGTGGCGTG
ACGAGGTGCTCTCCACCTCCAGCAAAGATTTTGCTGAGTTCGCCGATCGT
CTCGCGGACCTTAAAACGgtatgcctcccttttattattagtgttttagg
aagaatcctactttttcttccttctttgtctctctcccctcttccctgct
tccttccctgcttccttccctccctccctttctcggcttccctccgtgga
tatctctccgtgtcctaagggaatgcgagaaacatttatgcactctctct
ctcctcttctttcctttcctccctccttccctgcctccctcccgccccat
cacagCGCTCTACCACGGCAGTGGTGGGATCAAAGAAGGCTCTGGAGGAT
GCCAACGCGAAGTTGGCGGCCGATGCCAAATTGGATGTGATTGAGCTGCT
TTAA
back to top

protein sequence of NO03G00640.1

>NO03G00640.1-protein ID=NO03G00640.1-protein|Name=NO03G00640.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1069bp
MLLLRSSASFLVRSSGLPASAALDAARHRGGAASARALAFSSYKGRSLSA
SVDLGGSAVTRLDKAVDVFGEKQPKKEWDYVAHHPRYEMVRSEVVQEYGA
KVTLYRHRKTQTEIMSVTVDDDNKVFGITFRTPPTDSTGLPHILEHSVLC
GSRKFPVKEPFVDLLKGSLQTFLNAFTYPDRTCYPVASQNTKDFYNLIHV
YLDAVLHPRAVGDPLVLQQEGWHLELEDKKEPLTYKGVVYNEMKGVYSSP
DQIMNRETQRALFPDNAYAVDSGGDPLVIPDLSFDQFQAFHRHYYHPSNS
RVYFYGNDDPLARLDLLDGYLNEFEAQELDSQVTYQSKKTEPWTLTKTFP
ATEVTKDKGMVALNWLVNDKALSMKDQLVLGILDHILMGSSAAFLYRRLI
ESGLGESVIGGGLDDTLLQNTFSVGLKGVKEEDFSKVEQLVLEILQDCVK
EGFPADAIAASVNSIEFSLREFNTGRFPRGLSFMLGAMNHWIYDRDPLDG
LRFEGPLEEIKADLAANKKIFEDAIDFYLLSNGHRAAVRMVPDVSLEEKQ
QKEEEGRLAQIKAGMNDEALEKIIKDTALLKAAQEAEDSPEARASLPRLD
LTDLDKVQKETPVEVAQERGVTILRHALPTNGILYADIGFDVSALSLDDL
PLLPLFLRCLLETGTSTMDQTQLVRAIGTHTGGLRSSSRISYKHPKGGVI
DPTTDMTAHMFIRGKAVASKASELFSLIKDVLTDANLGNQKRVLEMLKES
KARYRSSVVGAGNSFASIRLSSRYSLPGLISEKSEGISYMLALDALIDQA
ENDWPALQTRLEKIRDLVVKKENLILNLTGDNKVLTDVLSPVYRFLDGMP
STGDRKLVCQDWRTDNQLFPNKNEGFSVPTQVNYVIKGGPLWQKGEKVPG
SATVINNYLRNGYLWDNVRVMGGAYGGFCSFSRMSGLFSFGSYRDPNLLQ
TLDIYDAAAAHLAKSSITEEDLTQAIIGSIGDLDSPMAPDQKGFSSLIEH
LMEESPEDRQQWRDEVLSTSSKDFAEFADRLADLKTRSTTAVVGSKKALE
DANAKLAADAKLDVIELL*
back to top
Synonyms
Publications