NO06G03350, NO06G03350 (gene) Nannochloropsis oceanica

Overview
Name: NO06G03350
Unique Name: NO06G03350
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 4100
Alignment location: chr6:925420..929519 +
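
The sequence length and the alignment location in the overview agree with each other if the coordinates are read as 1-based and inclusive. The sketch below checks that arithmetic. The helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive convention is an assumption inferred from the numbers on this page, not something the record states.

```python
import re

def parse_location(loc: str):
    """Split a 'seqid:start..end strand' string into its four parts."""
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr6:925420..929519 +")
print(seqid, strand, span_length(start, end))  # prints: chr6 + 4100
```

Under that assumption the span 925420..929519 covers 4100 positions, matching the listed sequence length.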

Properties
Property Name    Value
Description      Unknown protein
Alignments
The following features are aligned:
Aligned Feature    Feature Type    Alignment Location
chr6               genome          chr6:925420..929519 +
Analyses
This gene is derived from or has results from the following analyses:
Analysis Name                                    Date Performed
PRJNA769958                                      2024-08-13
GSE178672                                        2023-12-15
PRJNA933693                                      2023-10-26
PRJNA943449                                      2023-07-19
GSE149904                                        2020-10-10
NoIMET1_WT_HS                                    2020-09-29
GSE139615                                        2020-03-11
PRJNA241382                                      2020-03-11
BLASTP analysis of N. oceanica IMET1 genes       2019-07-11
InterPro analysis for N. oceanica IMET1 genes    2019-07-11
GO annotation for IMET1v2 genes                  2019-07-10
PRJNA182180                                      2019-04-25
Gene prediction for N. oceanica IMET1            2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term          Definition
GO:0006508    proteolysis
GO:0019538    protein metabolic process
Vocabulary: Molecular Function
Term          Definition
GO:0008233    peptidase activity
GO:0016787    hydrolase activity
Vocabulary: INTERPRO
Term          Definition
IPR001539     Peptidase_U32
Homology
BLAST of NO06G03350 vs. NCBI_GenBank
Match: XP_005852797.1 (putative protease, partial [Nannochloropsis gaditana CCMP526] >EKU23034.1 putative protease, partial [Nannochloropsis gaditana CCMP526])

HSP 1 Score: 721.1 bits (1860), Expect = 5.400e-204
Identity = 387/563 (68.74%), Positives = 443/563 (78.69%), Query Frame = 0
Query:   38 ICLLLPLLL---LQFLLMATTQAFIVLPPGSPTAFMRSSPAVHLMATTTAATSTRLMKGVRPTAPSEPLIPPPIPTHPVPQILAPAGGREQFLAALNSGADQVFLGLKSFNARARAENFGVEDLRNMVPLAHRYGMKVLVTVNVLIKENEFSALLDVLSALEELEVDAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVLARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSGNRGECSYSCREPYKLTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLKIEGRKKDAQYVASSVALYRRRLDQLYQRPTLRPLAPPEAQQTVNSSSVRPEAFLRQDLSLSFHRGTTSFFVRGRYHENVIDLNNAGHLGVPAGRVSYVSKDGKTFRFTPEVDLERYDGIKITPPSRAFHSTPQHGSLSSFSPEEGTAPGTATTAAAAHTKLLHEKYANDLPEFSLRNFKVQGSKAFNAMAGAVVDVEIPIEIRRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREW 598
            +C +L LLL   L+ +   T  AF+      P +  R S  +  +A TTA +     K V    P  P     +P HPVPQ+LAPAGGREQFLAALN+GADQVFLGLK+FNARARAENF VEDL+ +VP+AH++GM+VLVTVNVLIKE E   L+D LSALEELEVDAIIVQDQAVG +V++FFPTL +HASTQMAVHNLQGV+KA  LGY+RVVLARE+TAKEMK+IRAGV + V +LEAF HGSLCYSYSGLC FSGETDARSGNRGEC+Y+CR+PY LTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLK+EGRKKDAQYVAS+VALYRRRLDQ+Y RPTLR  APPEA Q    +   P+A LRQDL+LSFHR TTSFFVRGRYHENVIDL+NAGHLGV AGRV  VSKDG+TF+F+  VDLERYDGIK+ PP+RAFH+TPQHG           A        AA  +LL EKYAN+LPEFSLR FKVQGSKAF AMAGA V+VE+P E+RR WEQ  +  R I  GD+VFQSRSN LKRRV+AL  VP+ YK+R W
Sbjct:   44 LCAVLLLLLQCNLRIIHARTRHAFL-----HPLSGRRHSTRLGSLAGTTALSG----KAVTDRQPPLP-TTRSVPHHPVPQVLAPAGGREQFLAALNAGADQVFLGLKNFNARARAENFCVEDLKELVPMAHKFGMQVLVTVNVLIKEEELGVLIDTLSALEELEVDAIIVQDQAVGALVREFFPTLRIHASTQMAVHNLQGVVKATGLGYQRVVLAREVTAKEMKEIRAGVPSGVAELEAFVHGSLCYSYSGLCFFSGETDARSGNRGECAYTCRQPYTLTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLKVEGRKKDAQYVASAVALYRRRLDQIYARPTLRSQAPPEAFQAAAQARHTPDAGLRQDLALSFHRRTTSFFVRGRYHENVIDLDNAGHLGVLAGRVLSVSKDGRTFKFSALVDLERYDGIKLCPPARAFHTTPQHGD----------ARAEDRGVHAARERLLQEKYANELPEFSLRQFKVQGSKAFEAMAGATVEVEVPREVRREWEQGHNGHRAIQSGDLVFQSRSNRLKRRVQALAVVPESYKARSW 586          
Match: SMF68653.1 (putative protease [Pseudobacteriovorax antillogorgiicola])

HSP 1 Score: 634.0 bits (1634), Expect = 8.600e-178
Identity = 401/1012 (39.62%), Positives = 555/1012 (54.84%), Query Frame = 0
Query:  105 PPPIPTHPVPQILAPAGGREQFLAALNSGADQVFLGLKSFNARARAENFGVEDLRNMVPLAHRYGMKVLVTVNVLIKENEFSALLDVLSALEELEVDAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVLARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSGNRGECSYSCREPYKLTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLKIEGRKKDAQYVASSVALYRRRLDQLYQRPTLRPLAPPEAQQTVNSSSVRPEAFLRQDLSLSFHRGTTSFFVRGRYHENVIDLNNAGHLGVPAGRVSYVSKDGKTFRFTPEVDLERYDGIKITPPSRAFHSTPQHGSLSSFSPEEGTAPGTATTAAAAHTKLLHEKYANDLPEFSLRNFKVQGSKAFNAMAGAVVDVEIPIEIRRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWRKVDVGVEVRREGGREGEAGGLELCVKVSKLGQVLVEDKLVWQSFEYSMKKTKEGVIADIVEVLGTYGELGFRADAVVIDGLDXXXXXXXXXXXXXGKGVPFIRRKDLKVLKAKVAETLTGAYEAFVMGRKQRAVAGLGISXXXXXXXXXXXXXXXXXXXPLPARMWDERRFAIKFDRPEYLDMLDLYLSSMTGREGGRESTVVKEVVFEPKRMYLADVKPEDGVARLMAFGARHGVRVRLALPTVVRAWDMTPLKGWVEAFVAAHXXXXXXXXXXXXXXVCFEVGNLGAWGLLEEWGXXXXXXXXXXSSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLKAHMQAWPRAMNXXXXXXXXXXXXXXXXXAVGDECLASIAPQYILYKDVPLFMAEACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTKAQSLVHRQKDLLGFGVRDFRLDFLTRKYDKKQFFEVLDAALRREEGDEEALPNTHVANFDRKLL 1117
            P  +    VP+ILAPAGGR QF AALN+GAD V+LGLK FNAR RAENF +EDL  +VPLA   GMKVLVT+N+L+K+ EF  L++ LS L+ + + A+I+QD  V  IV+ +FP+L MHASTQMAVHN+ GV +A A+G++RVV+ARELT +E+K I   +     ++EAF HGSLCYSYSGLC FSG  DARSGNRGEC+Y+CR+PYK+ +E G GFLFSM DL+++ DL+ L EA +DTLKIEGRKKDAQYVA+ V +YR+RL++++ R T    AP +       S +R     ++D++ SFHR TTSFF+ GRYHENVIDL+N  H G+  G+VS V   G+   F     +ER+DG++I P +  FH+ PQHGS       E T                  KY N + +FS+R+  + G KA        + + +           PD   +   GD +F++RSNELKR  EAL       K RE R + +  E+  +   +   G   L  K    GQ L   +  W +       T E    D+   L  +G+    A+  V   L+                  F+ R  +K LK  +   +    E +     +R++ G  ++                    LP  +  +  + +K DR EY+D L  Y    TG           E++FEPKR +L+  KP+D +  L       G  +RLA+PTV+RAWD   +K +V A+                    F++GN+GA   L+ WG               D+ +DFTLYSLN+ A+      LG  R+ LSVEDD+ ++   +Q+WP                              +  + ILYKD PLF+AEACSLTALH + CP +KVCGYRTL +EN++GE+F +AHE CKSIVY  +A S+ H ++ L   GV  FR+DFLTR YD+K F  VL++ L   +     + ++H AN+ R+LL
Sbjct:   18 PKAVQIGKVPEILAPAGGRAQFFAALNAGADAVYLGLKEFNARGRAENFSIEDLEELVPLAKDQGMKVLVTLNILLKDIEFDRLIERLSQLQWVGIHAVIIQDLGVARIVRDYFPSLRMHASTQMAVHNVHGVRQAKAMGFQRVVVARELTIQELKLIHKELAETPVEIEAFCHGSLCYSYSGLCFFSGAEDARSGNRGECAYTCRKPYKILNEPGHGFLFSMKDLNSNQDLERLVEAQVDTLKIEGRKKDAQYVATVVGMYRKRLNEIFGRNT----APQD------RSYMRD---FKKDMAFSFHRDTTSFFMNGRYHENVIDLSNPTHRGLEVGKVSQVK--GRQVVFDTMEPIERFDGLRIDPVAATFHAKPQHGSQVQGHMGEAT-----------------RKYENKICQFSVRDMYINGKKAPRGQKRQRLTISL-----------PDEVPLPKVGDPIFKTRSNELKRVTEALA------KPREARLMAL-KEIDLDIFADSHDGETTLRFKAHLRGQELATHQFSWPAERPKKAATLED---DLKRSLSLFGDFNLHANLSVEGDLNW-----------------FLPRSQVKRLKQDLGTVIQKGIETYT----RRSIQGGRLA-------------MSRQRSSLPPAL--DESYTVKIDRMEYMDFLVQYKLD-TGFSPA-------EIIFEPKRAFLSG-KPQDLMKELWLKAQHLGTTLRLAIPTVLRAWDEAVVKRYVSAY-------------WELGGRAFDLGNVGAKSALQAWGYETS-----------DMVSDFTLYSLNT-AAVEELGALGLKRVCLSVEDDETSIMTKLQSWPE---------------------------IGVQAEVILYKDTPLFIAEACSLTALH-HGCPTAKVCGYRTLEVENDEGEQFLIAHESCKSIVYGKQAYSISHYRERLQAAGVGCFRIDFLTRPYDQKGFTNVLNSCLTGSK-----VADSHAANYSRELL 873          
Match: EWM26318.1 (hypothetical protein Naga_100078g11 [Nannochloropsis gaditana])

HSP 1 Score: 629.0 bits (1621), Expect = 2.800e-176
Identity = 351/612 (57.35%), Positives = 401/612 (65.52%), Query Frame = 0
Query:  537 MAGAVVDVEIPIEIRRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWRKVDVGVEVRREG-------------------GREGEAGGLELCVKVSKLGQVLVEDKLVWQSFEYSMKKTKEGVIADIVEVLGTYGELGFRADAVVIDGLDXXXXXXXXXXXXXGKGVPFIRRKDLKVLKAKVAETLTGAYEAFVMGRKQRAVAGLGISXXXXXXXXXXXXXXXXXXXPLPARMWDERRFAIKFDRPEYLDMLDLYLSSMTGREGGRES---TVVKEVVFEPKRMYLADVKPEDGVARLMAFGARHGVRVRLALPTVVRAWDMTPLKGWVEAFVAAHXXXXXXXXXXXXXXVCFEVGNLGAWGLLEEWGXXXXXXXXXXSSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLKAHMQAWPRAMNXXXXXXXXXXXXXXXXXAVGDECLAS----------IAPQYILYKDVPLFMAEACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTKAQSLVHRQKDLLGFGVRDFRLDFLTRKYDKKQFFEVLDAALRREEGDEEALPNTHVANFDRKLL 1117
            MAGA V+VE+P E+RR WEQ  +  R I  GD+VFQSRSN LKRRV+AL  VP+ YK+R W  VDV V+V R G                   G EG  GGL L V+VSKLGQ+LVE+   W ++E + +K++E V+ D+ EVLG YGELG RA AVV+ GL              G  VPFI R+DLK LK++VA  L  AYEAF+  RK RAV  LG+                    PLP   W+ RRFA+K DR EYLD+LD YL  +    G RE      V EVVFEPKRM+LA  KP++G+ARL+AFG RH VRVRLALPTVVRAWD  P++ WVEAF AA               +CFEVGNLGAWGLLEE G          S   VDVTTDFTLYSLNSQASA+WAR+LGAS IALSVEDD  NL+AHM+AWPRAM                  A GDE +             APQ+ILYKDVPLFMAEACSLTALHGN CPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYST AQSLVHRQ+DLL  G+R FRLDFLTRKY+K+  FEVLDAALRRE  D   LP+TH ANFDR+LL
Sbjct:    1 MAGATVEVEVPREVRREWEQGHNGHRAIQSGDLVFQSRSNRLKRRVQALAVVPESYKARSWTTVDVHVQVDRAGRXXXXXXXXXXXXXXXXQKGEEGTRGGLTLRVQVSKLGQLLVEETHAWPTWESTTRKSREDVVGDMGEVLGVYGELGMRARAVVVTGL-----GEAGVEGEEGSEVPFIPRRDLKALKSRVAARLAPAYEAFLEERKGRAVRALGLG-----EKEALLVPNPPSAPPLPPGRWENRRFAVKIDRLEYLDLLDEYLDQILAPWGEREGEGRAWVDEVVFEPKRMFLAAKKPDEGIARLLAFGRRHRVRVRLALPTVVRAWDAPPVRAWVEAFAAAQ----EQAGQGDGSRLCFEVGNLGAWGLLEEHG----LLPASSSGRHVDVTTDFTLYSLNSQASAVWARSLGASLIALSVEDDAANLEAHMRAWPRAM--------ARVVGGGDVPAAGDEGVGEGRQAGSGHTHTAPQFILYKDVPLFMAEACSLTALHGNQCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTNAQSLVHRQQDLLRMGIRHFRLDFLTRKYEKQHLFEVLDAALRREADDTTPLPDTHQANFDRRLL 586          
Match: KHE91618.1 (peptidase [Candidatus Scalindua brodae])

HSP 1 Score: 281.6 bits (719), Expect = 1.100e-71
Identity = 279/1005 (27.76%), Positives = 419/1005 (41.69%), Query Frame = 0
Query:  114 PQILAPAGGREQFLAALNSGADQVFLGLKSFNARARAENFGVEDLRNMVPLAHRYGMKVLVTVNVLIKENEFSALLDVLSALEELEVDAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVLARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSGNRGECSYSCREPYKLTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLKIEGRKKDAQYVASSVALYRRRLDQLYQRPTLRPLAPPEAQQTVNSSSVRPEAFLRQDLSLSFHRGTTSFFVRGRYHENVIDLNNAGHLGVPAGRVSYVSKDGKTFRFTPEVDLERYDGIKITPPSRAFHSTPQHGSLSSFSPEEGTAPGTATTAAAAHTKLLHEKYANDLPEFSLRNFKVQGSKAFNAMAG--AVVDVEIPIEIRRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWRKVDVGVEVRREGGREGEAGGLELCVKVSKLGQVLVEDKLVWQSFEYSMKKTKEGVIADIVEVLGTYGELGFRADAVVIDGLDXXXXXXXXXXXXXGKGVPFIRRKDLKVLKAKVAETLTGAYEAFVMGRKQRAVAGLGISXXXXXXXXXXXXXXXXXXXPLPARMWDERRFAIKFDRPEYLDMLDLYLSSMTGREGGRESTVVKEVVFEPKRMYLADVKPEDGVARLMAFGARHGVRVRLALPTVVRAWDMTPLKGWVEAFVAAHXXXXXXXXXXXXXXVCFEVGNLGAWGLLEEWGXXXXXXXXXXSSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLKAHMQAWPRAMNXXXXXXXXXXXXXXXXXAVGDECLASIAPQYILYKDVPLFMAEACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTKAQSLVHRQKDLLGFGVRDFRLDFLTRKYDKKQFFEVLDAALRREEGDEEALPNTHVANFDRKLL 1117
            P++L+PAG  E F AA+++GAD V+ GLK F+ARA A+NF ++D    +  A R  +KV + +N L+K +E    +D+L ALEE+  D+II+QD  +  ++Q  FP   +HASTQMA+HNL GV +   LG++RVVLAREL++ E+K+I      ++   E F HG+LCYSYSGLC FS     RSGNRG+C+  CR+PY   S +G G+LFSM DL T   +  L +AG+D+LKIEGR K  +YVA     YR+ +D                        +  +  +   +   F R TTS +V G  + N+   N++    + A  +   S       +  EV     + I I    RA         L  F               +A   LLH           +++ KV G + F+  AG  AV++ E                + I RG  ++   S + K      T+     K     KV V +EVR E      +G         K+ Q + +        +   + T E    D+       GE  F   A++  G+              G   P      L VL     E   G   A+   R  +                            +P    DE R ++K DR   L+ LDL L+         E      +V   K   ++ ++  D +A  +    +   ++ L+LP ++R             F   +                F++ N GA  L                   V +  D+ LYSLN   S +  R LG  R  LS ED  ENLK                                  L S     ILY+D PLF +EAC + A   ++CPG   CG+  +++ NE G++F   +E C++++ +    S+ H  K  L  G RD+R+D   + Y  +   ++L          E+ + N+ + NF R LL
Sbjct:   11 PELLSPAGNMESFFAAVDNGADAVYFGLKDFSARASAQNFSLDDAGKAIAYARRKSIKVYIALNTLVKTSELGRAVDLLIALEEMRPDSIIIQDLGLLYLIQSQFPGFNIHASTQMAIHNLAGVKQLEQLGFKRVVLARELSSAEIKNIAENTSIEI---ETFIHGALCYSYSGLCFFSSMIGGRSGNRGKCAQPCRKPYHSQSGEG-GYLFSMKDLLTLSGIGDLVDAGVDSLKIEGRMKSPEYVAVVTDAYRKAID----------------------GELSDQDEIANRIKTVFSRETTSAYVMG--NNNLSVKNDSSIRQLKATDIVNPSYPANMGLYAGEVIRSDENHIVI----RAEAGIGVRDLLQVFE------------NVSAKPALLH-----------VKSIKVNGKRVFSVEAGDTAVINSE----------------QKIKRGAKLYIVSSQKTKE-----TSAQKIPKKLTSTKVPVNLEVRVEADSIAVSG---------KVMQFIFKKDYPLNLEKSINRSTNE---EDLKNCFSRLGETPFEL-AIISAGIS------------EGLFAP------LSVLNNIRREYFNGLSAAWQNNRVLKCDEVKRWLKEEFTKYGNLISEEKHLRHRIPE---DEVRLSLKIDR---LNCLDLALT---------EKIYKLYIVLTDKT--ISYLQKNDDIADTL---LKERDKIVLSLPVIMRDTG--------NGFETYNYFKESVSALIKSGFRQFQISNPGAMDLF--------------GDADVILYADYPLYSLN-PLSVIKLRELGFQRNTLSPEDGMENLKT---------------------------------LLSDNTDLILYQDTPLFTSEAC-VWANMKSACPGIDRCGFEKMTLTNEHGDRFVAINEACRTVIINEIPFSINHLIKPFLEAGHRDYRVDLCYKDYTPETISDLLSGI-----KSEKKVRNSTIGNFQRGLL 826          
Match: OQW95059.1 (hypothetical protein BWK77_08235 [Verrucomicrobia bacterium A1])

HSP 1 Score: 279.6 bits (714), Expect = 4.100e-71
Identity = 290/1012 (28.66%), Positives = 418/1012 (41.30%), Query Frame = 0
Query:  117 LAPAGGREQFLAALNSGADQVFLGLKSFNARARAENFGVEDLRNMVPLAHRYG--MKVLVTVNVLIKENEFSALLDVLSALEELEVDAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVLARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSGNRGECSYSCREPYKL--TSEDGMG-FLFSMADLDTSHDLDLLAEAGIDTLKIEGRKKDAQYVASSVALYRRRLDQLYQRPTLRPLAPPEAQQTVNSSSVRPEAFLRQDLSLSFHRGTTSFFVRGRYHENVIDLNNAGHLGVPAGRVSYVS-KDGKT-FRFTPEVDLERYDGIKITPP--SRAFHSTPQHGSLSSFSPEEGTAPGTATTAAAAHTKLLHEKYANDLPEFSLRNFKVQGSKAFNAMAGAVVDVEIPIEIRRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWRKVDVGVEVRREGGREGEAGGLELCVKVSKL--GQVLVEDKLVWQSFEYSMKKTKEGVIADIVEVLGTYGELGFRADAVVIDGLDXXXXXXXXXXXXXGKGVPFIRRKDLKVLKAKVAETLTGAYEAFVMGRKQRAVAGLGISXXXXXXXXXXXXXXXXXXXPLPARMWDERRFAIKFDRPEYLDMLDLYLSSMTGREGGRESTVVKEVVFEPKRMYLADVKPEDGVARLMAFGARHGVRVRLALPTVVRAWDMTPLKGWVEAFVAAHXXXXXXXXXXXXXXVCFEVGNLGAWGLLEEWGXXXXXXXXXXSSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLKAHMQAWPRAMNXXXXXXXXXXXXXXXXXAVGDECLASIAPQYILYKDVPLFMAE--ACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTKAQSLVHRQKDLLGFGVRDFRLDFLTRKYDKKQFFEVLDAALRREEGDEEALPNTHVANFDRKL 1116
            +APAGG +   AA   GAD V+LG++ F+ARA AENF ++ L  +   AH      +V + +N LI + E  A LD  +   +L VDA+IVQD  +   V++ FP L +HASTQMAVHN  GV     LG+ R  LARELT  E++DI      D  ++E F HG+LC  YSGLCL+S     RSGNRG C+Y CR+ +     +E G G F FSM DL     +  L  AG+D++KIEGRKK   YVA+    YRR LD         P+AP + +             + QDL   F R  T+ F  GR + +++D +  GH G P GRV  VS + G+T  +FT    +ER+DGI++  P   R F     H  L++   E+   P                                   + F + AG  ++VE+P       +  P+    +  G  V+ + S ++KRR       P  ++ R  R VDV V +  E        GL    +++    G+  VE  +       S  +   G    +       G+    A  + +D                G   P  R  DL+         LT A E     +++ A     ++                      A  W      IK DRP  LD  +         +G  E  VV ++ F       AD      V RL A   R   RVRLALP + RAWD   L+  +E  +                   +E+ NL  W  L +                +D+   + LY++N  A   W    G S +ALS ED + NL A +                           G +  A +     +++DVPLF++E  AC   A+ G + P           + ++ GE  ++     +  + +     L  R  +L   G R FR  F+ R Y+     EV D   R         P+T V +FDR L
Sbjct:    1 MAPAGGPDAAFAAFQYGADAVYLGMQEFSARADAENFSLDALNEITAFAHSLAPRRRVYLALNTLILDREMPAALDQAARAADLGVDALIVQDAGLAGTVRRHFPNLRLHASTQMAVHNRAGVEHLRDLGFARATLARELTLDEIRDI---ATVDGIEIETFVHGALCVCYSGLCLYSSLASGRSGNRGRCAYLCRDRFAAGDGAERGDGAFRFSMKDLALPGLVGDLEAAGVDSIKIEGRKKSPLYVAAVTNFYRRLLD--------GPVAPADRRT------------MEQDLQSIFSRPWTTLFAAGRDNADLVDPDYVGHRGTPVGRVEAVSRRSGRTLLQFTTGRPVERHDGIQVDVPGSDRPFGFPVDHLFLAA---EDRRPP----------------------------------REVFESHAGDRIEVELP-------DDAPE----LPVGATVYSASSQDVKRRYRFERPKPGLHRVR--RTVDVTVGLAAE--------GLSATARLAPRYPGEAAVEASISLAE-PLSPARNPHGTEEAVRRAFEKLGDTHLAAGTLTLD-------------DPSGLFAPASRLNDLR-------RQLTAALE-----KEREAALSASVARICAELAPPVAAAAAGAEG---AEFW-----RIKIDRPGVLDAFE-----PADLDGVAE--VVFDIGFSDPAAVCAD------VGRLAALVGRD--RVRLALPVIARAWDTEALQASIERLL-------------RDGWARWEISNLSGWRFLPD--------------ESLDLCGGWPLYAMNRAAVRQWL-AWGLSSVALSPEDGRNNLAALL--------------------------TGYDARADVT----VFEDVPLFLSETRAC---AIGGGTAPSECGRSCEIPLVSSKGGETLQICRRG-RMALLNRAPFCLAGRLDELRALGARRFRAAFVWRPYEPA---EVRD-LWRSLRSGHAPTPHT-VGSFDRGL 815          
Match: OFW82678.1 (hypothetical protein A2018_08210 [Alphaproteobacteria bacterium GWF2_58_20])

HSP 1 Score: 274.6 bits (701), Expect = 1.300e-69
Identity = 267/954 (27.99%), Positives = 398/954 (41.72%), Query Frame = 0
Query:  114 PQILAPAGGREQFLAALNSGADQVFLGLKSFNARARAENFGVEDLRNMVPLAH--RYGMKVLVTVNVLIKENEFSALLDVLSALEELEVDAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVLARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSGNRGECSYSCREPYKLTSEDGM--GFLFSMADLDTSHDLDLLAEAGIDTLKIEGRKKDAQYVASSVALYRRRLDQLYQRPTLRPLAPPEAQQTVNSSSVRPEAFLRQDLSLSFHRGTTSFFVRGRYHENVIDLNNAGHLGVPAGRV-SYVSKDGKTF-RFTPEVDLERYDGIKITPPSRAFHSTPQHGSLSSFSPEEGTAPGTATTAAAAHTKLLHEKYANDLPEFSLRNFKVQGSKAFNAMAGAVVDVEIPIEIRRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWRKVDVGVEVRREGGR-EGEAGGLELCVKVSKLGQVLVEDKLVWQSFEYSMKKTKEGVIADIVEVLGTYGELGFRADAVVI---DGLDXXXXXXXXXXXXXGKGVPFIRRKDLKVLKAKVAETLTGAYEAFVMGRKQRAVAGLGISXXXXXXXXXXXXXXXXXXXPLPARMWDERRFAIKFDRPEYLDMLDLYLSSMTGREGGRESTVVKEVVFEPKRMYLADVKPEDGVARLMAFGARHGVRVRLALPTVVRAWDMTPLKGWVEAFVAAHXXXXXXXXXXXXXXVCFEVGNLGAWGLLEEWGXXXXXXXXXXSSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLKAHMQAWPRAMNXXXXXXXXXXXXXXXXXAVGDECLASIAPQYILYKDVPLFMAEACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTKAQSLV 1058
            P++LAPAG  E   AA + GAD ++LGL  F+ARA A NF  E L   +  AH      KV V VN L+++ E   LL +L  LE+L  DA+IVQD  V  I++Q FP L +HASTQMA+HN +G + A ALG  RVVLARELT  E+ DI    +    + EAF HG+LCYS SGLCL+S     RS NRG C+Y CR+ +   S +G+  G +FSM D+    D+  LAEAGI +LKIEGRKK   Y A+    YR  LD                     ++S +  A     +   F R  T  ++  R  +NV+  +  G +G   GRV + V+  G+T+ RF P   +ER+DG++I          P  G    F                                F++ + +++G   FNA AG VV++ +P       E  P           V+ S S   ++        P  ++SR    + V + +    GR + EA    L    S     +V D            +T EG+     +     G+  F  + +     DGL                   F+    L  ++ ++   L  A+       +   +AGLG +                    LPA              P YL  +D         E  RE   V E++     + L    PE    + MA  A     +R++LPT+ RAW+   ++  VE+ +AA                 +++ NL  W +L E                +D++  + LY+LN+ A+      +G SRI LS ED +EN +  +  +P  +                                 +Y   PLF++E C + A  G SCP    C      + +  G    +  + C+S V S +++ ++
Sbjct:    7 PELLAPAGSPESAYAAFSHGADAIYLGLSRFSARADATNFTREALSESIGFAHAQEKPRKVYVAVNTLVQDAELPDLLPMLEMLEDLRADAVIVQDMGVARIIRQHFPGLALHASTQMAIHNREGAMAAMALGISRVVLARELTLPELSDI---AQNSGVETEAFIHGALCYSMSGLCLYSSFATGRSANRGSCAYPCRDQF---SGEGLPSGHIFSMKDMALDGDVAKLAEAGITSLKIEGRKKGPLYTAAVTDYYRNILD--------------------GTASPKELAEKAWRIKTIFSRNQTRLYLENRRAKNVVSPHVVGPMGGELGRVAAIVNAQGRTWLRFIPTHAIERHDGLQI----------PLPGEEKPFG-------------------------------FAVMDMRLKGKSVFNAPAGQVVELALPAGAPTLAESMP-----------VYVSSSQAAQKAYPFPIPRPGAFRSR----MPVSIRLSLTHGRIDAEA----LLADGSAASLSVVGD--------LPPAQTPEGMETACRKAFEKCGDTPFSLETLSFLNPDGL-------------------FVPAAQLNDIRRRLLAELEEAHAKNRAETRAAVLAGLGAT----------------QPETLPA-------------SPGYLLSVDNAADLQDFEEADREG--VSEIL-----LPLDSATPE--TLQTMAH-AWPNATLRISLPTLCRAWETQAIQKHVESLIAA-------------GQTRWDIQNLWGWEMLHE-------------KAGLDLSAGWPLYALNT-AAVSSLMEMGFSRITLSPEDSQENARTLLARFPGRIALP------------------------------VYMTPPLFISETCPVPA-SGASCPSG--CKGGISDLRSRHGNDLRLIRKGCRSFVCSVQSRCVI 748          
Match: WP_099324650.1 (U32 family peptidase [Candidatus Kuenenia stuttgartiensis] >CAJ72375.1 conserved hypothetical protein [Candidatus Kuenenia stuttgartiensis])

HSP 1 Score: 267.7 bits (683), Expect = 1.600e-67
Identity = 279/1037 (26.90%), Positives = 434/1037 (41.85%), Query Frame = 0
Query:  116 ILAPAGGREQFLAALNSGADQVFLGLKSFNARARAENFGVEDLRNMVPLAHRYGMKVLVTVNVLIKENEFSALLDVLSALEELEVDAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVLARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSGNRGECSYSCREPYKLTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLKIEGRKKDAQYVASSVALYRRRLDQLYQRPTLRPLAPPEAQQTVNSSSVRPEAFLRQDLSLSFHRGTTSFFVRGRYHENVIDLNNA-GHLGVPAGRVSYVSKDGKTFRFTPEV-DLERYDGIKITPPSRAFHS--TPQHGSLSSFSPEEGTAPGTATTAAAAHTKLLHEKYANDLPEFSL---RNFKVQGSKAFNAMAG--AVVDVEIPIEIRRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWR-KVDVGVEVRREG-GREGEAGGLELCVKVSKLGQVLVEDKLVWQSFEYSMKKTKEGVIADIVEVLGTYGELGFRADAVVIDGLDXXXXXXXXXXXXXGKGVPFIRRKDLKVLKAKVAETLTGAYEAFVMGRKQRA------VAGLGISXXXXXXXXXXXXXXXXXXXPLPARMWDERRFAIKFDRPEYLDMLDL-------------YLSSMTGREGGRESTVVKEVVFEPKRMYLA-----DVKPEDGVARLMAFGARHGVRVRLALPTVVRAWDMTPLK-GWVEAFVAAHXXXXXXXXXXXXXXVCFEVGNLGAWGLLEEWGXXXXXXXXXXSSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLKAHMQAWPRAMNXXXXXXXXXXXXXXXXXAVGDECLASIAPQYILYKDVPLFMAEACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTKAQSLVHRQKDLLGFGVRDFRLDFLTRKYDKKQFFEVLDAALRREEGDEEALPNTHVANFDRKLL 1117
            +L+PAG  E F  A+ +GAD +++GLK F+ARA A NF ++D+R  +  A +  ++V V +N LIK +E   +++ L AL+EL  DA+I+QD  +  ++Q  FP  T+HASTQM +HNL GV +   +G++RVVL+REL+  E+K+I      ++   E F HG+LCYSYSGLC FS     RSGNRG C+  CR  YK    +G G+LFSM DL T   ++ L  AGI + KIEGR K A+YVA++  +YR+ +D      TL  +         N+  +    F R+         T S+     Y +   + NN        A  +S+ S+ G   +    V D+ER+   K    S A +       G+ +    ++G    T    A    + L + + N   E +L   RN +V G + F   AG  A++D E         + QP        G  ++   S ++K        VP   KS   R  VD+ +++  +G   +G A      +  S                E S+ +T E     + E     GE  F    +  +  D                VP      L VL     E     YE +   R +R       V G  I                              R ++K D+P+YL+ + L              +  +   +   +++  K  +     +Y       ++ P +G  +++            +LP ++R   +  +  G+ +  V                   F + N GA  L E+                V +  D+  Y LN   SA+  R LG  R  LS EDDKENL                                 E L S   + I+Y+D PLF +E C + A     CPG   CG+  +++ENE G++F   +E C+++V   K  SL+     L+  G RDFR+D   + Y  +   ++        +     + N+ + NF+R LL
Sbjct:    1 MLSPAGDMECFFVAVENGADAIYVGLKDFSARASACNFSIDDVRKAIAYARKMSVRVYVAINTLIKTDELEKVVEYLIALDELRPDALIIQDLGLLFLIQSQFPQFTLHASTQMTIHNLAGVKQMERMGFKRVVLSRELSVDEIKNIALNSNMEI---EVFVHGALCYSYSGLCFFSSVMGGRSGNRGRCAQPCRMYYKSPQGEG-GYLFSMKDLRTLTHVNRLMAAGIHSFKIEGRMKSAEYVAAATHVYRQAID-----GTLEDMD--------NAIHLMNTVFSRET--------TYSYLFEETYQQGKKNSNNKYAPRPENAPFLSHPSQGGAVVKSPKFVQDIERFSNNKQVKASDAINPFYPANIGAYAGEVTKQGKGCITVRADAEIGVRDLLQFFENGAKEPALLPVRNIRVNGKRVFGIKAGDIAMIDTE--------RQYQP------GVGARLYLLSSQKIKEYF--APKVPK--KSEASRMPVDLEIKIMPDGIDIKGTARYFSFPMNFS-------------VKLEKSIHRTTER--EQVKECFSRLGETSFELKDIHTEISDELF-------------VP------LSVLNEIRREYFRIFYEEWHKDRDRRCESIKKWVKGEWIEFTHPVHRDVSAGGI---------------RLSLKVDKPDYLNHIPLETVHKIYIVLTEETIGDLLALQSHNQTSFNKRGL--SALLYTCGENNENISPSNGRDKIV-----------FSLPAIMRDTGIGCMTYGYCKKAVQELLSQGFRQ---------FHISNPGAIELFED--------------AEVQLYADYPFYCLN-PLSAIKLRELGFCRYTLSPEDDKENL---------------------------------EKLFSPHAELIIYQDTPLFTSETC-IWANMKRECPGKNWCGFSQMTLENEYGDRFTAINEDCRTVVIGQKPFSLIQYIPKLIDGGQRDFRVDLCYKDYTPEMVQDIFSKIQTMSK-----VNNSVMGNFERGLL 869          
Match: OOP56505.1 (hypothetical protein AYP45_08865 [Candidatus Brocadia caroliniensis])

HSP 1 Score: 263.8 bits (673), Expect = 2.300e-66
Identity = 276/1022 (27.01%), Positives = 422/1022 (41.29%), Query Frame = 0
Query:  114 PQILAPAGGREQFLAALNSGADQVFLGLKSFNARARAENFGVEDLRNMVPLAHRYGMKVLVTVNVLIKENEFSALLDVLSALEELEVDAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVLARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSGNRGECSYSCREPYKLTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLKIEGRKKDAQYVASSVALYRRRLD-QLY-QRPTLRPLAPPEAQQTVNSSSVRPEAFLRQDLSLSFHRGTTSFFVRGRYHENVIDLNNAGHLGVPAGRVSYVSKDGKTFRFTPEVDLERYDGIKITPPSRAFHSTPQHGSLSSFSPEEGTAPGTATTAAAAHTKLLHEKYANDLPEFSLRNFKVQGSKAFNAMAGAVVDVEIPIEIRRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWRKVDVGVEVRREGGREGEAGGLELCVKVSKLGQVLVEDKLVWQSFEYSMKKTKEGVIADIVEVLGTYGELGFRA---DAVVIDGLDXXXXXXXXXXXXXGKGVPFIRRKDLKVLKAKVAETLTGAYEAFVMGRKQRAVAGLGISXXXXXXXXXXXXXXXXXXXPLPARMWDERRFAIKFDRPEYLDMLDL------YLSSMTGR-EGGRESTVVKEVVFEPKRMYLADVKPEDGVARLMAFGARHGVRVRLALPTVVRAWDM-------TPLKGWVEAFVAAHXXXXXXXXXXXXXXVCFEVGNLGAWGLLEEWGXXXXXXXXXXSSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLKAHMQAWPRAMNXXXXXXXXXXXXXXXXXAVGDECLASIAPQYILYKDVPLFMAEACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTKAQSLVHRQKDLLGFGVRDFRLDFLTRKYDKKQFFEVLDAALRREEGDEEALPNTHVANFDRKLL 1117
            P++L+PAG  E F AA+ +GAD ++ GL+ F+ARA A+NF + D    +  AH+  +K  +T+N LIK  E   + D+L ALEE++ DA+I+QD  +  +++  FP   +HASTQM +HNL GV +   +G+RRVVLAREL+  E+ +I    +    + E F HG+LCYSYSGLC  S  T  RSGNRG C+  CR  Y+ +S DG G+LFSM DL T   ++ L  AG+ + KIEGR K  +YVA     YR+ +D +LY +  T+R +      +TV                  F R TT  ++    H+   +  N       A   +Y +  G    +  EV   R   I I    RA         L  F              A     LL+           +++ ++   + F   AG V  +           +QP     +  G  ++   S +LK   +  + VP  + S     VD+ V+VR +G          + +K S     + +D  V    E  + +  EG    I       GE  F      A + + L                   FI    L  ++    + L+ +Y+      K++A     I                           D+ R ++K D+  YL+ + L      YL       E         ++    + M  +          ++   AR   ++   LP ++R  DM          K +V+ F+A                  F++ NLGA  L +                 V    D+ LY LN   SA   R  G  R  LS EDD +N+                                   L S     ILY+D PLF +E C L A     CPG+K C ++ +++ENE G+KF   ++ CK++V   +  SL+H    LL  G RDFR+D   R Y  +   ++  +   + +     + N+ + NF+R LL
Sbjct:   11 PELLSPAGNIECFFAAIENGADAIYFGLEDFSARAGAQNFTLTDASKAIAYAHKNAVKAYITLNTLIKTCEMERVADLLIALEEIQPDALILQDLGLLHLIRSQFPHFCLHASTQMTIHNLAGVKQLERMGFRRVVLARELSVDEITNI---TQHTTMETEVFVHGALCYSYSGLCFLSSMTGGRSGNRGRCAQPCRMRYQTSSGDG-GYLFSMKDLLTISQINKLITAGVHSFKIEGRMKSPEYVAVVTNAYRQAIDGKLYDEDDTIRRM------KTV------------------FSRETTHAYLFHADHQQARNSTNHQVKAADAINPAYPANIGS---YAGEVIDSRRGLIVI----RADSDIGVRDLLQVFD------------TAQTEPSLLY-----------VKSLEINRKRVFEIHAGDVATIA---------SEQP-----LTPGSKIYLISSQKLKETFQ--SKVPKKHISTR-IPVDLEVDVRPDG----------ISIKGSTKHVTIAKDYPV--KLERGIHRVIEG--EGIRNSFSRLGETSFELRDFQAEISETL-------------------FIPLSMLNEIRRDFFQHLSVSYQ------KEKADRSQNIKKWIKKVATEYCDSGKRFSEK------DDIRLSLKIDKLHYLNHIPLEKIYKIYLVPTNETIENLMSHNDCDQIPLNKRGMKGSSGSCTQNDKHIIPSLARD--KIVFCLPAIMR--DMGNGYETYEYYKNFVQKFMAQGFRQ-------------FQLSNLGAMDLFK--------------GADVQWYADYPLYCLN-PLSASKLRESGFCRYTLSPEDDNDNMLT---------------------------------LFSANADLILYQDTPLFTSETC-LWANMKRRCPGTKQCSFKQVTVENEFGDKFMAMNDRCKTVVIGERPFSLIHNIPKLLDAGQRDFRIDLCWRDYTPEMIEDIFQSFQTKSK-----VRNSIMGNFERGLL 841          
Match: OQY98879.1 (hypothetical protein B6D35_10845 [Candidatus Brocadia sp. UTAMX2])

HSP 1 Score: 260.8 bits (665), Expect = 2.000e-65
Identity = 267/1037 (25.75%), Positives = 423/1037 (40.79%), Query Frame = 0
Query:  114 PQILAPAGGREQFLAALNSGADQVFLGLKSFNARARAENFGVEDLRNMVPLAHRYGMKVLVTVNVLIKENEFSALLDVLSALEELEVDAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVLARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSGNRGECSYSCREPYKLTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLKIEGRKKDAQYVASSVALYRRRLDQLYQRPTLRPLAPPEAQQTVNSSSVRPEAFLRQDLSLSFHRGTTSFFVRGRYHENVIDLNNAGHLGVPAGRVSYVSKDGKTFRFTPEVDLERYDGIKITPPSRAFHSTPQHGSLSSFSPEEGTAPGTATTAAAAHTKLLHEKYANDLPEFSLRNFKVQGSKAFNAMAGAVVDVEIPIEIRRSWEQQ--PDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWRKVDVGVEVRREGGREGEAGGLELCVKVSKLGQVLVEDKLVWQSFEYSMKKTKEGVIADIVEVLGTYGELGFRADAVVIDGLDXXXXXXXXXXXXXGKGVPFIRRKDLKVLKAKVAETLTGAYEAFVMGRKQRAVAGLGISXXXXXXXXXXXXXXXXXXXPLPARMW---DERRFAIKFDRPEYLDMLDL----------------YLSS---------MTGREGGRESTVVKEVVFEPKRMYLADVKPEDGVARLMAFGARHGVRVRLALPTVVR-----AWDMTPLKGWVEAFVAAHXXXXXXXXXXXXXXVCFEVGNLGAWGLLEEWGXXXXXXXXXXSSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLKAHMQAWPRAMNXXXXXXXXXXXXXXXXXAVGDECLASIAPQYILYKDVPLFMAEACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTKAQSLVHRQKDLLGFGVRDFRLDFLTRKYDKKQFFEVLDAALRREEGDEEALPNTHVANFDRKL 1116
            P++L+PAG  E F AA+ +GAD ++ GL  F+ARA AENF +ED    +  A +  +K+ + +N L+K  E   ++D+L A+EEL+ DA+I+QD  +  ++Q  FP  ++HASTQM +HNL GV +   +G++RVVLAREL   E+ +I    +    + E F HG+LCYSYSGLC FS  T  RSGNRG C+  CR  YK +S DG G+LFSM DL T   ++ L  AG+   KIEGR K  +YVA     YR+ +D             P   +T++             +   F R TT  +V   + E+     N     V +      S       +  EV   R   + I    RA +       L  F              A A   LLH           +++ +++G + F   AG V  V          EQ+  P     +     + ++ S ++ +++ + T +P          V + V++R  G       G+   V ++K   V +E  + W   E  ++ T   + A   E+   + ++  +                            FI    L  ++    + L+ AY+      K++     GI                      P + +   D  R ++K D+ +YL+ + L                 L S         ++    G +  V+K+++  P        +  D +  L+         +  +LP ++R           +K  V+  +A                  F++ NLGA  +                +  V    D+ LY LN  + A   R LG  R  LS EDDKENL                                 + L S     I+Y+D PLF +E C + A     CPG   CG+R +++ENE G++F   ++ CK++V   +  S++H    LL  G RDFR+D   R Y  +    +      R      A+    + N++R L
Sbjct:   11 PELLSPAGNMECFFAAIENGADAIYFGLPDFSARAAAENFTLEDASKAIAHARKRAVKIYIALNTLMKTQELEKIVDLLIAVEELQPDALILQDLGLLFLLQSRFPQFSLHASTQMTIHNLAGVKQLERMGFQRVVLARELPMDEITNI---ARNTAMETEVFVHGALCYSYSGLCFFSSMTGGRSGNRGRCAQPCRMRYKTSSGDG-GYLFSMKDLLTISQINKLIAAGVHAFKIEGRMKSPEYVAVVTHAYRQAIDGRL----------PGLDETMHR------------IRTVFSRETTHAYV---FPEDSRKTKNGTQYQVKSTDTINPSYPANVGSYAGEVIATRRGKVVI----RADNDIGVRDLLQVFD------------HAQAEPSLLH-----------VKSLEIEGRRVFEIHAGDVAAV--------GSEQRFMPGAKLYLISSQKIRETLSLKVPKKLIS-TRIP----------VTLEVKIRPYG---VSIKGIARHVTLAKDYPVKLEQGMHWAIGEEHIRDTFSRLGATPFELRDIHTDVSEKL---------------------------FIPFSTLNEIRRDFFQNLSVAYQ------KEKEGMSQGIKNWIKEVTLEYRN---------PCKRFSGEDGIRLSLKIDKLQYLNHVPLEKIYKIYVVLSREVLMALGSTKKTDMSPLLSASSKGSDKAVLKKLL--PVETATGSTEAYDVMNTLLPLQD----TIVFSLPAIMRDRGNGLETFGDMKIIVQKLIALGFRQ-------------FQLSNLGALDIF--------------GTKDVQWYADYPLYCLNPLSVAQ-LRKLGFCRYTLSPEDDKENL---------------------------------QTLYSADADLIIYQDTPLFTSETC-VWANMKRRCPGISECGFRQVTVENEYGDRFVAINDRCKTVVIGERPFSIIHHIPKLLEAGQRDFRIDLCWRDYTPEMIENIFSGIQNR-----TAMKYATMGNYERGL 854          
Match: OQZ02728.1 (hypothetical protein B6D34_10210 [Candidatus Brocadia sp. UTAMX1])

HSP 1 Score: 259.6 bits (662), Expect = 4.400e-65
Identity = 267/1019 (26.20%), Positives = 424/1019 (41.61%), Query Frame = 0
Query:  114 PQILAPAGGREQFLAALNSGADQVFLGLKSFNARARAENFGVEDLRNMVPLAHRYGMKVLVTVNVLIKENEFSALLDVLSALEELEVDAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVLARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSGNRGECSYSCREPYKLTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLKIEGRKKDAQYVASSVALYRRRLDQLYQRPTLRPLAPPEAQQTVNS--SSVRPEAFLRQDLSLSFHRGTTSFFVRGRYHENVIDLN-NAGHLGVPAGRVSYVSKDGKTFRFTPEVDLERYDGIKITPPSRAFHSTPQHGSLSSFSPEEGTAPGTATTAAAAHTKLLHEKYANDLPE---FSLRNFKVQGSKAFNAMAGAVVDVEIPIEIRRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWRKVDVGVEVRREGGREGEAGGLELCVKVSKLGQVLVEDKLVWQSFEYSMKK-------TKEGVIADIVEVLGTYGELGFRADAVVIDGLDXXXXXXXXXXXXXGKGVPFI----RRKDLKVLKAKVAETLTGAYEAFVMGRKQRAVAGLGISXXXXXXXXXXXXXXXXXXXPLPARMWDERRFAIKFDRPEYLDMLDLYLSSMTGREGGRESTVVKEVVFEPKRMYLADVKPEDGVARLMAFGARHGV-----RVRLALPTVVR-----AWDMTPLKGWVEAFVAAHXXXXXXXXXXXXXXVCFEVGNLGAWGLLEEWGXXXXXXXXXXSSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLKAHMQAWPRAMNXXXXXXXXXXXXXXXXXAVGDECLASIAPQYILYKDVPLFMAEACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYSTKAQSLVHRQKDLLGFGVRDFRLDFLTRKYDKKQFFEVLDAALRREEGDEEALPN 1106
            P++L+PAG  E F AAL +GAD V+ GL+ F+ARA A+NF + D    +  A +  +KV + +N LIK NE   + D+L ALEE++ DA+I+QD  +  +++  FP + +HASTQM +HNL GV +   +G+RRVVLAREL+  E+ +I    +   T++E F HG+LCYSYSGLCL S  T  RSGNRG C+  CR  Y  ++ DG G+LFSM DL T   ++ +  AG+ + KIEGR K   YVA     YR+ +D        R L   EA + V +  S     A+L   ++    RG        R   N    +   G  GV A     V K G   + +   D++  + I  + P+         G  +    +           A    + L + + +D  +     ++   + G + ++  AG V  +    + +R  +      + +N      +S + ++ +++     VP          VD+ V +R +           + +K        +  +  W S +YS+K        T+EG I D    LG    +     A + +GL                 +       R +  + +K  + E +    +        + ++ + I                     +     D  R ++K D+  Y+  + L                        +R+Y   +   DG++      A + +     ++  +LP ++R           +K  V+  +A                  F++ NLGA GL E                 V    D+ LY LN   SA   R LG  R  LS EDDKENL+A                                 L S     I+Y+D PLF +E C + A     CPG K CG++ ++++NE G++F   +E CK++V S +  SL      LL  G RDFR+D   R Y  +   ++      R      +L N
Sbjct:   11 PELLSPAGNMECFFAALENGADAVYFGLQEFSARASAQNFTLADASKAIVYARKKAVKVYIALNTLIKTNEIDRVTDLLLALEEMQPDALILQDLGLLFLLRSRFPQINLHASTQMTIHNLAGVKQLKQMGFRRVVLARELSIDEIGNI---ARNATTEIEVFVHGALCYSYSGLCLLSSMTGGRSGNRGRCAQPCRMRYTPSTGDG-GYLFSMKDLLTIPRINDIMAAGVHSFKIEGRMKSPAYVAVVTDAYRQAIDG-------RLLKEDEAIRRVMTVFSRETTHAYLFYGMNDKTKRGEKHKLSHDRAGSNCPSPSYERGERGVVAKSSPDVDKAG---QLSESNDIKAANAINSSYPANL-------GLYAGVVVKSEKGKIVIKADADIGVRDLLQVFEHDSAKPVLLHVKTITMDGKRVYSIKAGNVAALNTQQQFKRGAKLYLVSSQKVN------ESVTPKIPKKL-----VP------AKMPVDITVSIRHD----------RMMIKG-------IIRQFSW-SKDYSIKLESGINRITEEGHIRDCFSRLGETSFVLASIRADISEGLFIPLSILNDVRRDYFHNLSIAWQEERERRSREIKKWIRECV--FQDTLAQHNHNQTISHMRIPMKRCEEGNKNTPGGNEDYAGI--FFEDAMRLSVKIDKLNYIQHIPL------------------------ERIYKIYLAVSDGISFTKERDAMNALSQIKDKMVFSLPVILRDRRDGQDTYEDIKIIVQKLIAQGFRQ-------------FQLCNLGAMGLFE--------------GENVQWYADYPLYCLN-PLSAAKLRELGFCRYTLSPEDDKENLQA---------------------------------LFSADADLIIYQDTPLFTSETC-VWANMKRRCPGMKECGFKKVTVKNEYGDQFVAINERCKTVVISERPLSLFPGIPKLLEAGQRDFRIDLCWRDYTPEMIEDIFSGIQNRTRMKYSSLGN 883          
The following BLAST results are available for this feature:
BLAST of NO06G03350 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name      E-value     Identity (%)  Description
XP_005852797.1  5.400e-204  68.74         putative protease, partial [Nannochloropsis gadita...
SMF68653.1      8.600e-178  39.62         putative protease [Pseudobacteriovorax antillogorg...
EWM26318.1      2.800e-176  57.35         hypothetical protein Naga_100078g11 [Nannochlorops...
KHE91618.1      1.100e-71   27.76         peptidase [Candidatus Scalindua brodae]
OQW95059.1      4.100e-71   28.66         hypothetical protein BWK77_08235 [Verrucomicrobia ...
OFW82678.1      1.300e-69   27.99         hypothetical protein A2018_08210 [Alphaproteobacte...
WP_099324650.1  1.600e-67   26.90         U32 family peptidase [Candidatus Kuenenia stuttgar...
OOP56505.1      2.300e-66   27.01         hypothetical protein AYP45_08865 [Candidatus Broca...
OQY98879.1      2.000e-65   25.75         hypothetical protein B6D35_10845 [Candidatus Broca...
OQZ02728.1      4.400e-65   26.20         hypothetical protein B6D34_10210 [Candidatus Broca...
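
As a rough illustration of how the hit table might be screened, the sketch below keeps only hits that pass an E-value ceiling and a percent-identity floor. The rows are copied by hand from the first four entries of the table; the function name `filter_hits` and both thresholds are illustrative assumptions, not settings used by the BLASTP analysis.

```python
# Rows copied from the hit table: (match name, E-value, percent identity).
HITS = [
    ("XP_005852797.1", 5.4e-204, 68.74),
    ("SMF68653.1", 8.6e-178, 39.62),
    ("EWM26318.1", 2.8e-176, 57.35),
    ("KHE91618.1", 1.1e-71, 27.76),
]


def filter_hits(hits, max_evalue=1e-100, min_identity=50.0):
    """Keep hits with E-value <= max_evalue and identity >= min_identity.

    Both cutoffs are arbitrary example values (assumptions), chosen only
    to show the filtering step.
    """
    return [h for h in hits if h[1] <= max_evalue and h[2] >= min_identity]


if __name__ == "__main__":
    for name, evalue, identity in filter_hits(HITS):
        print(f"{name}\t{evalue:.3e}\t{identity:.2f}")
```

With these example cutoffs only the two Nannochloropsis gaditana entries among the four rows pass; looser thresholds would admit the bacterial U32-family hits as well.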

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name  Unique Name  Species                                       Type
nonsL091      nonsL091     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ncniR029      ncniR029     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ncniR028      ncniR028     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ngnoR147      ngnoR147     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name  Unique Name  Species                                     Type
NSK000143     NSK000143    Nannochloropsis salina (N. salina CCMP1776)  gene


The following polypeptide feature(s) derive from this gene:

Feature Name  Unique Name           Species                                       Type
NO06G03350.1  NO06G03350.1-protein  Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name               Unique Name  Species                                          Type
jgi.p|Nanoce1779_2|672528  gene_6658    Nannochloropsis oceanica (N. oceanica CCMP1779)  gene
Naga_100078g11             gene4201     Nannochloropsis gaditana (N. gaditana B-31)      gene


The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Species                                       Type
NO06G03350.1  NO06G03350.1  Nannochloropsis oceanica (N. oceanica IMET1)  mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO06G03350 ID=NO06G03350|Name=NO06G03350|organism=Nannochloropsis oceanica|type=gene|length=4100bp
ACTACTACTACTACTACTACTACTGCTACCAACGACACCGCCGCCTCCCA
CGATCATGGATGCCCCCGTCTTGCCGTGTGCGCTCCAGTAGGCGGCAATG
CGTGCTGACAGGGCTTCCTTCTCTTGGTCGCTCAAAATTGGGCGTCTTCG
GGAGGTAATCCGCCAGGTGTCCAGCGCTAGCGTAGTACTGTCATCGTACT
TGGTCGTCCTGGCTTGGTAGCGCTGTGATTTTATTGCAGGGAGGAAGAGA
TTGACCGACGAAGGAAGGGAGAATGGTGAGATGCGATGCGTGGAAGAAGG
GTGATGGAGATTACGCACCTCTAAGACCGCAGGTTCTGCAACCGTGGCAC
CTGGCATGGCTTCGAAGAGAGCTTCGGTAAGCGGGTTTGTCTGCTTAATA
CGGGTATGTACGTGGCTTGTATGTGAGGAGAGTCCTGATCAAGACTGCAC
CATCCCACGACGAAGCCTTTCCAACGTGTTCAACAGCGTTATGTTCTCTC
TAAAGGCCAGATGCAACCTCTCACACAACCATGCCTCCCTCCTACTTTAA
GACAGAGTACCTCGCTGTCCTCCGAAGCCGCGCCGCGATGGTGCCGCCGC
CGCGGCCAAGCTTCAGCTTCGAAATACCACAATCAAAGGGGATATGTCTG
CTGCTACCACTGCTACTGCTGCAATTTCTTCTCATGGCCACCACTCAAGC
CTTCATCGTCCTCCCTCCCGGCTCTCCCACCGCTTTTATGCGTTCCTCAC
CTGCCGTACACTTGATGGCGACCACCACTGCCGCCACGTCCACAAGACTC
ATGAAAGGCGTCCGGCCTACCGCTCCCTCGGAGCCATTGATCCCTCCCCC
TATTCCCACGCACCCGGTACCCCAAATTCTGGCACCTGCTGGCGGCCGGG
AGCAGTTTTTGGCCGCTCTGAATTCAGGCGCGGACCAGGTGTTTTTGGGT
CTCAAGTCTTTTAATGCGCGGGCGAGAGCGGAAAACTTTGGCGTGGAGGA
TTTGCGGAACATGGTGCCTCTCGCCCACCGATACGGAATGAAAGTGCTCG
TGACAGTGAATGTGTTGATCAAGGAAAATGAGTTCAGCGCGCTGCTAGAC
GTGTTGTCAGCCTTGGAGGAGCTGGAGGTGGACGCCATTATCGTACAAGA
CCAGGCTGTGGGGACAATCGTTCAGCAATTTTTCCCAACATTAACGATGC
ATGCCTCGACCCAAATGGCCGTTCATAACCTACAAGGGGTGATTAAAGCT
GCAGCGTTGGGATATCGCCGCGTGGTGCTTGCCCGCGAATTGACTGCCAA
GGAAATGAAAGATATTCGAGCAGGGGTTAAAGCAGATGTAACGCAACTAG
AGGCCTTCGCCCATGGATCTCTGTGTTATTCATACAGCGGTCTGTGTTTA
TTCAGTGGAGAAACGGACGCTCGATCAGGGAACCGAGGTGAATGTTCATA
TTCGTGTAGAGAACCGTACAAGCTGACGAGTGAGGACGGAATGGGGTTCC
TTTTTTCGATGGCCGATCTGGATACAAGCCATGACTTGGACTTGCTGGCT
GAGGCGGGCATTGACACGTTGAAGATTGAGGGGCGCAAGAAAGgtgcggg
ttttcgggtatagggagaggcgagcacaagtgcccaaattgcttattggt
ctttgaaaacgcccactcacgcccatttcattctttgcctttctccttcc
ccccctcccttgttccctccttccctccgtctttctccagACGCGCAATA
CGTCGCCTCCTCCGTCGCGCTGTACCGCCGCCGTCTCGACCAGCTCTATC
AACGCCCTACCCTTCGTCCACTGGCCCCTCCTGAGGCCCAACAAACCGTC
AACTCCTCCTCGGTCCGCCCCGAGGCATTTCTCCGGCAAGACCTCTCTCT
CTCCTTCCACCGTGGCACTACCTCCTTCTTCGTCCGTGGGCGGTACCACG
AAAACGTCATCGACCTGAACAACGCAGGCCACCTGGGCGTACCTGCCGGG
CGTGTCTCCTATGTGAGCAAAGACGGCAAGACCTTCCGGTTCACCCCCGA
AGTTGACCTCGAGCGATACGACGGCATCAAAATCACGCCCCCGTCCCGCG
CTTTCCACTCCACCCCTCAACACGGCTCCCTCTCCTCCTTTTCCCCGGAA
GAAGGCACTGCTCCTGGTACTGCTACGACTGCTGCTGCTGCTCACACCAA
GCTCCTTCACGAAAAGTACGCGAACGACCTTCCTGAATTCTCGCTACGGA
ATTTTAAGGTCCAGGGAAGCAAGGCTTTTAATGCGATGGCGGGGGCGGTG
GTCGACGTTGAGATCCCTATCGAAATTAGACGCTCATGGGAGCAACAACC
AGATCGTTTCCGGATGATCAATAGAGGAGATGTCGTCTTTCAATCTAGGA
GTAACGAGCTGAAAAGAAGGGTGGAGGCATTGACCACAGTGCCTGATGGA
TACAAAAGCAGGGAGTGGAGAAAGGTTGATGTGGGGGTGGAGGTGCGGAG
GGAGGGAGGGAGGGAGGGAGAGGCAGGGGGGTTGGAGTTGTGCGTGAAAG
TGAGCAAGCTGGGGCAAGTGTTGGTGGAGGATAAATTGGTGTGGCAGTCG
TTTGAGTATTCCATGAAGAAGACGAAGGAAGGGGTGATTGCTGATATTGT
CGAAGTGTTAGGGACGTACGGCGAGCTCGGATTTAGGGCAGATGCTGTGG
TGATTGATGGTCTTGATGACGGCCAGGACGAGGGAGCGAGGGAGGGAGGG
AAGGGTGGGAAAGGCGTGCCGTTTATCCGGAGGAAGGATCTGAAGGTGCT
CAAGGCGAAGGTGGCCGAGACATTGACCGGGGCGTACGAGGCATTTGTAA
TGGGGAGGAAACAACGAGCTGTCGCCGGGTTAGGTATTAGCGAGAAGTCC
TCCTCCTCCTCCTCCTCCTCCTCCTCCTCTGCTGCTGCTCCAGTCGTTCC
CTTGCCCGCCCGCATGTGGGACGAGCGTCGATTCGCGATCAAATTTGACC
GACCCGAGTACCTGGACATGCTCGACTTATACTTGAGCTCTATGACAGGC
AGGGAGGGAGGGAGGGAGAGCACGGTGGTGAAAGAGGTGGTGTTTGAGCC
CAAGAGGATGTACCTTGCCGACGTGAAACCCGAGGACGGAGTGGCCAGAT
TAATGGCATTTGGAGCGAGGCATGGAGTGCGTGTGCGACTGGCCTTGCCA
ACGGTAGTGCGGGCATGGGACATGACTCCCTTAAAAGGGTGGGTAGAGGC
CTTCGTCGCCGCACATGCGCAGCAGCAGCAGCAGCAGCAGCAGCAGCGAA
CACCACCTGTATGTTTTGAAGTCGGGAATCTGGGGGCTTGGGGGTTACTG
GAAGAGTGGGGTGTGCTTTCCTCCTCGTCCCCCCCCGCTGCCTCCTCGCC
ACGAGTGGATGTCACGACGGATTTCACCCTCTATTCCCTCAATTCCCAAG
CGTCGGCCATGTGGGCTCGGACGTTGGGGGCTTCCCGTATTGCTTTGTCG
GTGGAGGATGACAAGGAGAACCTCAAGGCGCACATGCAGGCATGGCCTCG
TGCGATGAATGCTGCTGGTGCTGGTGCTGGTGCTGGTGCTGGTGCTGCTG
GTACTTCGGCTGCAGTCGGCGACGAGTGCTTGGCATCGATTGCCCCGCAA
TACATCCTTTACAAAGACGTCCCTCTCTTCATGGCGGAAGCGTGCAGCCT
AACAGCGTTGCACGGGAACTCATGCCCAGGTTCAAAGGTGTGCGGATATC
GTACCTTATCCATCGAAAACGAACAGGGAGAGAAGTTTGAGGTGGCACAC
GAACACTGCAAGAGTATTGTGTACAGCACGAAAGCCCAATCCCTCGTGCA
CCGTCAGAAGGACTTGTTGGGATTCGGAGTAAGGGATTTTAGGCTTGATT
TCTTGACGAGGAAGTACGATAAAAAGCAGTTTTTCGAGGTGTTGGATGCG
GCGTTGAGGAGGGAGGAGGGAGATGAAGAGGCGTTGCCTAATACGCATGT
TGCCAATTTTGATCGAAAGCTTTTGTAGGATAGCGAAGATGGGATATAGG
AGCAATCACATACAAATGAAAATAGAATGAAAATGAATCTACGAACGATT
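
The gene sequence above contains one lowercase stretch, which begins with "gt" and ends with "ag". A minimal sketch for locating such stretches follows; it assumes that lowercase marks non-coding (presumably intronic) sequence, which this page does not state explicitly. The helper names `read_fasta_sequence` and `lowercase_runs` are hypothetical.

```python
import re


def read_fasta_sequence(text):
    """Join the sequence lines of a single-record FASTA string.

    Header lines (starting with '>') and blank lines are skipped.
    """
    return "".join(
        line.strip()
        for line in text.splitlines()
        if line.strip() and not line.startswith(">")
    )


def lowercase_runs(seq):
    """Return 1-based inclusive (start, end) coordinates of lowercase runs.

    Assumption: lowercase letters mark masked or non-coding sequence.
    """
    return [(m.start() + 1, m.end()) for m in re.finditer(r"[a-z]+", seq)]


if __name__ == "__main__":
    # Toy record, not the real gene: one lowercase run at positions 5-8.
    toy = ">toy\nACGTac\ngtACGT\n"
    seq = read_fasta_sequence(toy)
    print(len(seq), lowercase_runs(seq))
```

Running the same two functions on the full record would also allow the joined sequence length to be compared with the `length=4100bp` field in the header.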

protein sequence of NO06G03350.1

>NO06G03350.1-protein ID=NO06G03350.1-protein|Name=NO06G03350.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1117bp
MPPSYFKTEYLAVLRSRAAMVPPPRPSFSFEIPQSKGICLLLPLLLLQFL
LMATTQAFIVLPPGSPTAFMRSSPAVHLMATTTAATSTRLMKGVRPTAPS
EPLIPPPIPTHPVPQILAPAGGREQFLAALNSGADQVFLGLKSFNARARA
ENFGVEDLRNMVPLAHRYGMKVLVTVNVLIKENEFSALLDVLSALEELEV
DAIIVQDQAVGTIVQQFFPTLTMHASTQMAVHNLQGVIKAAALGYRRVVL
ARELTAKEMKDIRAGVKADVTQLEAFAHGSLCYSYSGLCLFSGETDARSG
NRGECSYSCREPYKLTSEDGMGFLFSMADLDTSHDLDLLAEAGIDTLKIE
GRKKDAQYVASSVALYRRRLDQLYQRPTLRPLAPPEAQQTVNSSSVRPEA
FLRQDLSLSFHRGTTSFFVRGRYHENVIDLNNAGHLGVPAGRVSYVSKDG
KTFRFTPEVDLERYDGIKITPPSRAFHSTPQHGSLSSFSPEEGTAPGTAT
TAAAAHTKLLHEKYANDLPEFSLRNFKVQGSKAFNAMAGAVVDVEIPIEI
RRSWEQQPDRFRMINRGDVVFQSRSNELKRRVEALTTVPDGYKSREWRKV
DVGVEVRREGGREGEAGGLELCVKVSKLGQVLVEDKLVWQSFEYSMKKTK
EGVIADIVEVLGTYGELGFRADAVVIDGLDDGQDEGAREGGKGGKGVPFI
RRKDLKVLKAKVAETLTGAYEAFVMGRKQRAVAGLGISEKSSSSSSSSSS
SAAAPVVPLPARMWDERRFAIKFDRPEYLDMLDLYLSSMTGREGGRESTV
VKEVVFEPKRMYLADVKPEDGVARLMAFGARHGVRVRLALPTVVRAWDMT
PLKGWVEAFVAAHAQQQQQQQQQRTPPVCFEVGNLGAWGLLEEWGVLSSS
SPPAASSPRVDVTTDFTLYSLNSQASAMWARTLGASRIALSVEDDKENLK
AHMQAWPRAMNAAGAGAGAGAGAAGTSAAVGDECLASIAPQYILYKDVPL
FMAEACSLTALHGNSCPGSKVCGYRTLSIENEQGEKFEVAHEHCKSIVYS
TKAQSLVHRQKDLLGFGVRDFRLDFLTRKYDKKQFFEVLDAALRREEGDE
EALPNTHVANFDRKLL*
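
The header of this record packs its metadata into `key=value` fields separated by `|`, and the sequence ends with a `*` stop symbol. The sketch below shows one way to split such a header and to count residues without the trailing `*`; the function names `parse_header` and `residue_count` are hypothetical, and the header layout is inferred only from the two FASTA records on this page.

```python
def parse_header(header):
    """Split '>name key=value|key=value|...' into (name, fields dict).

    The layout is an assumption based on the headers shown on this page.
    """
    name, _, rest = header.lstrip(">").partition(" ")
    fields = dict(
        item.split("=", 1) for item in rest.split("|") if "=" in item
    )
    return name, fields


def residue_count(seq):
    """Count residues, ignoring any trailing '*' stop symbol."""
    return len(seq.rstrip("*"))


if __name__ == "__main__":
    header = (
        ">NO06G03350.1-protein ID=NO06G03350.1-protein|Name=NO06G03350.1"
        "|organism=Nannochloropsis oceanica|type=polypeptide|length=1117bp"
    )
    name, fields = parse_header(header)
    print(name, fields["type"], fields["length"])
    # Toy sequence, not the real protein.
    print(residue_count("MKLL*"))
```

Note that the `length` field carries a `bp` suffix even though the record type is `polypeptide`, so the value is kept as a string here rather than interpreted.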
Synonyms
Publications