NO13G00140, NO13G00140 (gene) Nannochloropsis oceanica

Overview
Name: NO13G00140
Unique Name: NO13G00140
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 4775
Alignment location: chr13:55473..60247 +
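
The sequence length agrees with the alignment location if the coordinates are read as 1-based and inclusive (an assumption; the page does not state the convention): 60247 - 55473 + 1 = 4775. A minimal Python sketch of that check follows; the helper names `parse_location` and `span_length` are hypothetical and not part of any database API.

```python
import re

def parse_location(loc):
    """Split a location string such as 'chr13:55473..60247 +' into its parts."""
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr13:55473..60247 +")
print(seqid, strand, span_length(start, end))
```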

Properties
Property Name  Value
Description    Leucyl aminopeptidase
Alignments
The following features are aligned
Aligned Feature  Feature Type  Alignment Location
chr13            genome        chr13:55473..60247 +
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                   Date Performed
GSE178672                                       2023-12-15
PRJNA933693                                     2023-10-26
PRJNA943449                                     2023-07-19
PXD016054                                       2021-01-14
PXD016699                                       2021-01-08
GSE149904                                       2020-10-10
NoIMET1_WT_HS                                   2020-09-29
PXD010030                                       2020-03-16
GSE139615                                       2020-03-11
PRJNA241382                                     2020-03-11
BLASTP analysis of N. oceanica IMET1 genes      2019-07-11
InterPro analysis for N. oceanica IMET1 genes   2019-07-11
PXD008721                                       2019-04-30
PRJNA182180                                     2019-04-25
Gene prediction for N. oceanica IMET1           2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Cellular Component
Term        Definition
GO:0005622  intracellular
GO:0005737  cytoplasm
Vocabulary: Biological Process
Term        Definition
GO:0006508  proteolysis
GO:0019538  protein metabolic process
Vocabulary: Molecular Function
Term        Definition
GO:0030145  manganese ion binding
GO:0004177  aminopeptidase activity
GO:0008235  metalloexopeptidase activity
Vocabulary: INTERPRO
Term        Definition
IPR000819   Peptidase_M17_C
IPR011356   Leucine_aapep/pepB
Homology
BLAST of NO13G00140 vs. NCBI_GenBank
Match: CBJ32341.1 (Leucyl aminopeptidase [Ectocarpus siliculosus])

HSP 1 Score: 451.8 bits (1161), Expect = 3.400e-123
Identity = 248/502 (49.40%), Positives = 323/502 (64.34%), Query Frame = 0
Query:  114 ELVARDKFDAWLATQPEQTKAMLAFKAEDGLTVKHREGSLEVIPA--WGPGGKEGGR-GVEKVVLFVRKGGLTVAEPFVLSSLPNLLPPSNTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGADDALK--RKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPAECMGPPQLQQTAEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGREPRLIDLTWGEV-----DRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGG-SGSITAALYLQEFLQKRSSPGTSL---QKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVKE 602
            ++ ++  F+ WL +QP  +K+ +    +D     H  G L ++P+     GG+EGG   + +V   + +G  +   PF L SL   L P+ TY L    G     A P  AA SWA+G Y  DR+K K       G+GG  D  K  RK  L WP GADK  V A   STFLVRD++  P E MGP QL+     +A  FG   KV+ GD+LL E    +H+VGRAA +GREPRL+DLTW          PK+TLVGKGVC+DTGGLDIKP   M  MKKDM G AQ+LGLA MIM+  LP+RLRV++PAVEN++D  A+R GD++ +R+GKTSEI  TDAEGRLVLADALVEA SE PDL+ID ATLTGAARVA+GT++  ++ N E L  +L  +S  ++D  W +PL+ GYR  L S +ADL+N G GG  G+ITAALYL EF+ +++ P T     +K +  PWIHMDFM  N   RPGRPEGG++QGMRAL+ ++++
Sbjct:   46 DMNSKASFERWLGSQPPSSKSWIKALGQD----SHTAGRLVLVPSSVGDGGGQEGGEVSLSEVAFCLGEGDDSAVSPFSLCSLREKL-PTGTYAL---RGADGDVAEPDTAALSWAIGGYSFDRFKSKK------GDGGEKDKEKEDRKVVLAWPAGADKGRVTAAAASTFLVRDLITTPCEHMGPQQLEAVFASLAEEFGGTTKVVRGDELL-EGGVTVHAVGRAAGVGREPRLLDLTWAPAGSDAESLPKVTLVGKGVCFDTGGLDIKPAAGMLTMKKDMGGGAQVLGLARMIMSQGLPVRLRVMVPAVENAIDGGAFRPGDVLVSRSGKTSEIGNTDAEGRLVLADALVEASSEMPDLLIDCATLTGAARVALGTDVPVVFCNDEELAAKLHTLSGSVSDQVWRLPLWEGYRSQLSSKIADLKNVGAGGYGGAITAALYLDEFVGEKAPPSTEEGEGRKKEKLPWIHMDFMAFNTNSRPGRPEGGESQGMRALYALLED 532          
BLAST of NO13G00140 vs. NCBI_GenBank
Match: EWM22849.1 (leucyl aminopeptidase [Nannochloropsis gaditana])

HSP 1 Score: 419.9 bits (1078), Expect = 1.400e-113
Identity = 210/227 (92.51%), Positives = 218/227 (96.04%), Query Frame = 0
Query:  399 MASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGGSGSITAALYLQEFLQKRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVKEVGAKGGWPK-DGKEEEEGEEGKDE 625
            MAS +P+RLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLI+D ATLTGAARVAVGTEMSAIWSNKE LGRRLQ+ISW INDPCWMMPLY GYRKHLKSNLADLRNTG+GGSGSITAALYL+EFLQKRSSPG SLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVKEVG KGGWPK +GKEEEEG EGK+E
Sbjct:    1 MASDVPVRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIVDCATLTGAARVAVGTEMSAIWSNKEGLGRRLQEISWAINDPCWMMPLYRGYRKHLKSNLADLRNTGSGGSGSITAALYLEEFLQKRSSPGNSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVKEVGEKGGWPKEEGKEEEEGTEGKEE 227          
BLAST of NO13G00140 vs. NCBI_GenBank
Match: WP_044431654.1 (leucyl aminopeptidase family protein [Skermanella aerolata] >KJB93914.1 cytosol aminopeptidase [Skermanella aerolata KACC 11604])

HSP 1 Score: 402.1 bits (1032), Expect = 3.100e-108
Identity = 230/498 (46.18%), Positives = 306/498 (61.45%), Query Frame = 0
Query:  110 AVPLELVARDKFDAWLATQPEQTKAMLAFKAEDGLTVKHREGSLEVIPAWGPGGKEGGRGVEKVVLFVR-KGGLTVAEPFVLSSLPNLLPPSNTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGADDALKRKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPAECMGPPQLQQTAEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGREPRLIDLTWGEVDRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGG-SGSITAALYLQEFLQKRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVKEVGAK 606
            +VPL  V +D  ++WL+ QP   +A ++     G+  K   G   ++P     G +GG        F R   G+  A+ +    LP  LPP + ++  D +         T AA  WA+GTY   RYK         G G      K  A L WP+ AD+ EV+    +T+LVRD++N PA  MGP +L Q AE +   F AK KVI+GDDLLK+NYP IH+VGRA+   R PRLIDLTWG+ D P +TLVGKGVC+DTGGLD+KP + M  MKKDM GAA +LG+A MIM ++LP+RLRVLIPAVENSV   A+R  D++  R G T E+  TDAEGRLVL DAL +A ++ P++IID ATLTGAARVA+GTE+ A++SN +AL   L   +   +DP W +PL+  Y K L S +ADL N G+G  +G+I AAL+L+ F++            K TPW H+D M  N   +PGRPEGG+A GMRA++  ++   AK
Sbjct:   13 SVPLTPVTKDGLESWLSGQPATVRAWVS-----GIGFKGEPGKTALLP-----GADGG--------FARVLAGIDAADFWAYGGLPASLPPGDYHIDADLD-----RETATRAALGWALGTYAFTRYK--------SGNG------KTFATLAWPERADRAEVERIATATYLVRDLINTPASDMGPEELAQAAETLGDEFNAKVKVIVGDDLLKKNYPAIHAVGRAST--RAPRLIDLTWGDKDAPGVTLVGKGVCFDTGGLDLKPSSGMLLMKKDMGGAAHVLGVARMIMMAELPVRLRVLIPAVENSVSGDAFRPLDVLATRKGLTVEVGNTDAEGRLVLCDALAKAVNDKPEVIIDFATLTGAARVALGTELPALFSNDDALADGLLKAAESQSDPMWRLPLHQPYAKLLDSKVADLNNVGSGPFAGAILAALFLERFVE------------KTTPWAHLDVMAWNPSSKPGRPEGGEALGMRAVYSYLESRYAK 459          
BLAST of NO13G00140 vs. NCBI_GenBank
Match: XP_005826586.1 (hypothetical protein GUITHDRAFT_114335 [Guillardia theta CCMP2712] >EKX39606.1 hypothetical protein GUITHDRAFT_114335 [Guillardia theta CCMP2712])

HSP 1 Score: 399.8 bits (1026), Expect = 1.500e-107
Identity = 228/498 (45.78%), Positives = 301/498 (60.44%), Query Frame = 0
Query:  110 AVPLELVARDKFDAWLATQPEQTK---AMLAFKAEDGLTVKHREGSLEVIPAWGPGGKEGGRGVEKVVLFVRKGGLTVAEPFVLSSLPNLLP-PSNTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGADDALKRKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPAECMGPPQLQQTAEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGRE-PRLIDLTWGEVDRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNT--GNGGSGSITAALYLQEFLQKRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVK 601
            AV +  V    F  W+  Q E TK   + + F A +    KH      VIP            +EKV++ V   G    E    + LP+ LP  S  Y +V   G           A +WA+G YK D YK K  K+  G E    +      +LVWP G +++ V     + FL RD+++ PAE M P  L+  A  +A+++G K   ++GDDLL++NYP IH+VGRA   G+  PRLIDL WG+   PK+TLVGKG+ +DTGGLDIKP + M  MKKDM G+A +LGLA+MIM  KL +RLRVLI A EN+VDA ++R GDI+ ARNG T+EI  TDAEGRLV+ADALV A  E P+L+IDAATLTGA RVA+GT++ A++ N E LG  L  +S + ND  W +PL+ GY K L S +AD++N   G+G  G+ITAALYLQ+F+               T W+HMDFM  N   RPGRPEGG+A G+RALF +++
Sbjct:   14 AVSIIAVKGQSFSTWIKEQDENTKKWVSTMQFTASEADVGKH-----IVIP-------NPDLSIEKVIVIVGDDGF--GEWLSFAGLPSALPRGSGPYKIVSSAG-----VEDDKIALAWALGCYKFDVYKSKKRKESEGEEKLGFE------KLVWPSGCNEDYVLTAASAIFLTRDLISTPAEDMAPGDLESAARKLASMYGGKVTAVVGDDLLRQNYPQIHAVGRATPPGKHAPRLIDLRWGKESDPKVTLVGKGITFDTGGLDIKPASGMRQMKKDMGGSALVLGLAHMIMRLKLNVRLRVLIAAAENNVDAVSFRPGDILVARNGMTTEIGNTDAEGRLVMADALVAACEEKPELLIDAATLTGAGRVALGTDVPAVFCNNEELGSELCSLSEKENDLVWRLPLFKGYAKSLASRIADMKNVAEGSGYGGAITAALYLQKFVT------------AGTDWVHMDFMAFNTASRPGRPEGGEAMGLRALFALIQ 474          
BLAST of NO13G00140 vs. NCBI_GenBank
Match: WP_084536961.1 (leucyl aminopeptidase family protein [Azospirillum halopraeferens])

HSP 1 Score: 391.3 bits (1004), Expect = 5.400e-105
Identity = 236/516 (45.74%), Positives = 307/516 (59.50%), Query Frame = 0
Query:   92 EKTTSYLATLADEDEEGSAVPLELVARDKFDAWLATQPEQTKAMLA---FKAEDGLTVKHREGSLEVIPAWGPGGKEGGRGVEKVVLFVRKGGLTVAEP--FVLSSLPNLLPPSNTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGADDALKRKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPAECMGPPQLQQTAEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGREPRLIDLTWGEVDRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGG-SGSITAALYLQEFLQKRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVKE 602
            E  ++ LATL  +     AVP+  +      AWLA QP  T A +    + AE G T            A  PG  E GR    +V      G+  AEP  + L+ LP  LPP    +  D +  A TA     AA  WA+G+Y+  RYK             A D  K  A LVWP  AD+  V+    + +LVRD+VN PA  MGP  L   A  +AA F A  +VI+GD LLK+NYP IH+VGRAA   R PRLID+TWG  D PK+T+VGKGVC+DTGG DIKP + M  MKKDM GAA  LGLA MIM + LP+RLRVL+PAVEN +   A+R GD+++ R G + E+  TDAEGRL+L DAL EADSE P L+ID ATLTGAARVA+G ++ A+++N EAL   L   +   NDP W +PL+  YRK  +S +ADL N G GG +G+ITAAL+L+ F+ +             TPW+H+D    N   RPGRP+GG+A G+RA++ ++++
Sbjct:   39 EGRSALLATLRAK-AGADAVPITPLPGAGLAAWLAAQPPATAAWVRAVNYTAEAGAT------------ALLPG--EDGRLARVLV------GMP-AEPDLWSLAGLPGALPPGTYRIDADLDRDAATA-----AALGWALGSYRFGRYK-------------ASD--KSFANLVWPDAADRGAVERAASAIYLVRDLVNTPASDMGPADLAGAATDLAAEFEADIRVIVGDHLLKQNYPAIHAVGRAA--ARAPRLIDITWGRKDDPKVTIVGKGVCFDTGGYDIKPSSGMLLMKKDMGGAAHALGLARMIMMAGLPVRLRVLVPAVENMIAGNAFRPGDVLKTRKGLSVEVGNTDAEGRLILCDALAEADSEKPALLIDFATLTGAARVALGPDLPALFANNEALAADLLAAAAAGNDPLWRLPLWQNYRKMFESKIADLNNAGTGGFAGAITAALFLESFVSRE------------TPWVHLDVYAWNGSARPGRPDGGEAMGLRAVYALIEK 498          
BLAST of NO13G00140 vs. NCBI_GenBank
Match: WP_054169032.1 (leucyl aminopeptidase family protein [alpha proteobacterium AAP38] >KPF83945.1 cytochrome C oxidase subunit II [alpha proteobacterium AAP38])

HSP 1 Score: 389.8 bits (1000), Expect = 1.600e-104
Identity = 233/507 (45.96%), Positives = 293/507 (57.79%), Query Frame = 0
Query:   97 YLATLADEDEEGSAVPLELVARDKFDAWLATQPEQTKA---MLAFKAEDGLTVKHREGSLEVIPAWGPGGKEGGRGVEKVVLFVRKGGLTVAEPFVLSSLPNLLPPSNTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGADDALKRKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPAECMGPPQLQQTAEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGREPRLIDLTWGEVDRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGG-SGSITAALYLQEFLQKRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVV 600
            +L T A  D     VPL LV +   DAWLA       A    L FKA  G T         ++P           G +  +  V  G     + + L+ LP  LP  +  +  D +  A+TA   T  A  W + TY+  RYK+               + K  A LV P  AD   V+    + FLVRD+VN P E MGPPQL + A+ +AA FGA    I+GDDLL +NYPMIH+VGRAA   R PRLI+L WG+   P +TLVGKGVC+DTGGLDIKP   M  MKKDM GAA  LGLA +IMA+KLP+RLRVL+PAVENS+   A+R  DII  R G T EI  TDAEGRL+L DAL EADSE P L++D ATLTGAARVA+G ++ A++SN +AL   +       +DP W +PL+ GYR  L S +ADL N   GG +G+ITAALYL++F+             K T W H+D    N   RPGRPEGG+A  +RA+F V+
Sbjct:    4 HLLTAAAAD----TVPLTLVTKAGLDAWLAAASPADAAWVKRLGFKAAPGATA--------ILP-----------GADGQIARVLAGVADRIDIWSLAGLPGSLPAGSYAIEADLD--ADTA---TAVATGWVLATYQFTRYKK---------------SSKEFASLVLPAKADAAAVERFAKAAFLVRDLVNTPCEDMGPPQLAEAAQKLAAEFGATFSAIVGDDLLTQNYPMIHAVGRAA--ARPPRLIELRWGDAAHPLVTLVGKGVCFDTGGLDIKPSTGMLLMKKDMGGAAHALGLARLIMAAKLPVRLRVLVPAVENSIAGNAFRPMDIIPTRKGLTVEIGNTDAEGRLILCDALAEADSEKPALLLDFATLTGAARVALGPDLPALFSNNDALASEILVAGTATDDPLWRLPLWQGYRAQLDSKIADLNNAPGGGMAGAITAALYLEQFVS------------KDTVWAHVDLFSWNQSNRPGRPEGGEAMTLRAMFAVI 453          
BLAST of NO13G00140 vs. NCBI_GenBank
Match: WP_102113648.1 (leucyl aminopeptidase family protein [Niveispirillum cyanobacteriorum] >AUN32098.1 leucyl aminopeptidase [Niveispirillum cyanobacteriorum])

HSP 1 Score: 389.8 bits (1000), Expect = 1.600e-104
Identity = 232/507 (45.76%), Positives = 294/507 (57.99%), Query Frame = 0
Query:   97 YLATLADEDEEGSAVPLELVARDKFDAWLATQPEQTKA---MLAFKAEDGLTVKHREGSLEVIPAWGPGGKEGGRGVEKVVLFVRKGGLTVAEPFVLSSLPNLLPPSNTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGADDALKRKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPAECMGPPQLQQTAEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGREPRLIDLTWGEVDRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGG-SGSITAALYLQEFLQKRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVV 600
            +L T A  D     VPL LV +   DAWLA       A    L FKA  G T         ++P           G +  +  V  G     + + L+ LP  LP  +  +  D +  A+TA   T  A  W + TY+  RYK+               + K  A LV P  AD   V+    + F+VRD+VN P E MGPPQL + A+ +AA FGA    I+GDDLL +NYPMIH+VGRAA   REPRLI+L WG+   P +TLVGKGVC+DTGGLDIKP   M  MKKDM GAA  LGLA +IMA++LP+RLRVL+PAVENS+   A+R  DII  R G T EI  TDAEGRL+L DAL EADSE P L++D ATLTGAARVA+G ++ A++SN +AL   +       +DP W +PL+ GYR  L S +ADL N   GG +G+ITAALYL++F+             K T W H+D    N   RPGRPEGG+A  +RA+F V+
Sbjct:    4 HLLTAAAAD----TVPLTLVTKSGLDAWLAAASPTDAAWVKRLGFKAAPGATA--------ILP-----------GADGQIARVLAGVADRIDIWSLAGLPGSLPAGSYAIEADLD--ADTA---TAVATGWVLATYQFTRYKK---------------STKEFASLVLPAKADGAAVERFARAAFMVRDLVNTPCEDMGPPQLAEAAQNLAAEFGATFSAIVGDDLLTQNYPMIHAVGRAA--AREPRLIELRWGDAAHPLVTLVGKGVCFDTGGLDIKPSTGMLLMKKDMGGAAHALGLARLIMAAQLPVRLRVLVPAVENSIAGNAFRPMDIIPTRKGLTVEIGNTDAEGRLILCDALTEADSEKPALLLDFATLTGAARVALGPDLPALFSNNDALASEILAAGTAKDDPLWRLPLWQGYRAQLDSKIADLNNAPGGGMAGAITAALYLEQFVS------------KDTVWAHVDLFSWNQSNRPGRPEGGEAMTLRAMFAVI 453          
BLAST of NO13G00140 vs. NCBI_GenBank
Match: WP_012567502.1 (leucyl aminopeptidase family protein [Rhodospirillum centenum] >ACI99717.1 leucyl aminopeptidase PepB, putative [Rhodospirillum centenum SW])

HSP 1 Score: 384.4 bits (986), Expect = 6.600e-103
Identity = 229/492 (46.54%), Positives = 299/492 (60.77%), Query Frame = 0
Query:  111 VPLELVARDKFDAWL-ATQPEQTKAMLAFKAEDGLTVKHREGSLEVIPAWGPGGKEGGRGVEKVVLFVRKGGLTVAEPFVLSSLPNLLPPSNTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGADDALKRKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPAECMGPPQLQQTAEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGREPRLIDLTWGEVDRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGG-SGSITAALYLQEFLQKRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVK 601
            VPLEL+ + +  AWL   +P    A  A+    G T +   G++ +IP       EGGR    +V     G     + + L+ LP  L P  +Y L D   G       T AA  WA+G Y+  RY++               + K  A LV P GAD+  V+  V +T LVRD+VN PAE MGPP+L Q AE +A  FGAK KVI GDDLL++NYPM+H+VGRAA   + PRLIDL WG  + P + LVGKGVC+D+GGLDIK   +M  MKKDM GAA  LG+A MIM + LP+RLRVL+PAVEN+V   ++R  DI+  R G T EI  TDAEGRL+L DAL EADS+ P L+ID ATLTGAARVA+G E+ A++SN +AL   L     E +DP W +PL+ GYR+ L S +ADL N  +GG +G+ITAAL+L+ F+               TPW H+D    N   RPGRPEGG+A  +RA++ +++
Sbjct:   14 VPLELIRKAELSAWLDGAEP----AAAAWVRGTGFTAE--PGAVALIPG------EGGRLARVLV-----GVPDRLDLWSLAGLPGSL-PKGSYTLPDRLDGREA----TAAAIGWALGCYQFTRYRK---------------STKDFAHLVMPAGADRHAVERVVSATALVRDLVNTPAEDMGPPELAQAAEKLADEFGAKLKVIAGDDLLEKNYPMVHAVGRAA--AKAPRLIDLRWGNKEHPLVCLVGKGVCFDSGGLDIKGSANMLLMKKDMGGAAHALGVARMIMMAGLPVRLRVLVPAVENAVSGNSFRPMDILPTRKGLTVEIGNTDAEGRLILCDALAEADSDRPALLIDFATLTGAARVALGPELPALFSNDDALAEDLLATGRETDDPLWRLPLWPGYRRMLDSRIADLNNAPSGGFAGAITAALFLERFVAPE------------TPWAHIDLYAWNPTTRPGRPEGGEAMTLRAVYALIE 454          
BLAST of NO13G00140 vs. NCBI_GenBank
Match: WP_027299064.1 (leucyl aminopeptidase family protein [Rhodospirillales bacterium URHD0088])

HSP 1 Score: 383.6 bits (984), Expect = 1.100e-102
Identity = 229/501 (45.71%), Positives = 299/501 (59.68%), Query Frame = 0
Query:  106 EEGSAVPLELVARDKFDAWLATQPEQTKAML---AFKAEDGLTVKHREGSLEVIPAWGPGGKEGGRGVEKVVLFVRKGGLTVAEP-FVLSSLPNLLPPSNTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGADDALKRKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPAECMGPPQLQQTAEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGREPRLIDLTWGEVDRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGG-SGSITAALYLQEFLQKRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVKE 602
            ++G A+P+  V RD F AWL    E  +  L    F AE         G L ++P  G GGK     + ++++     G+   +P + L++LP+ L P   Y L   +  +   A  T A   WA+GTY   RYK+   +K G             A L+WP+ AD+ E++      FL RD++N PAE MGP +L   A+ +A   GA   VI+GDDLL  NY  +H+VGRA+   R PRLIDL WG    PK+TLVGKGVC+D+GGLD+KP   M  MKKDM GAA +LGLA  ++ +KLPIRLRVLIPAVEN+V A A+R  D+I+ R+GKT EI  TDAEGRL+L DAL EADS+ PDLI+DAATLTGAARVA+G ++ A++ N E L   L        DP W MPL+  YRK L S +ADL N  +   +GS+ AALYL EF+               TPW H+D M  N   RPGRPEGG+A G+RAL+  +K+
Sbjct:    8 DKGEAIPITPVTRDGFAAWLVAAEESDRTWLKATGFAAE--------PGKLGLLP--GAGGK-----LARILV-----GVAPDDPLWSLAALPDAL-PEGRYAL---DAASWDPATTTKAVLGWALGTYAFTRYKE---RKRG------------FATLIWPERADRAEIERLALGIFLARDLINTPAEDMGPAELAAAAKGLAERTGAACSVIVGDDLLAMNYSTVHAVGRAS--VRPPRLIDLRWGSDSAPKVTLVGKGVCFDSGGLDLKPAAGMKLMKKDMGGAAILLGLASALIDAKLPIRLRVLIPAVENAVSANAFRPMDVIRTRSGKTVEIGNTDAEGRLILCDALAEADSDKPDLIVDAATLTGAARVALGPDLPALFCNDETLAAGLLSAGIAEADPLWRMPLWQPYRKLLDSKVADLNNVADSPFAGSVIAALYLAEFVA------------PTTPWAHLDVMAWNGSARPGRPEGGEAMGLRALYAHIKK 455          
BLAST of NO13G00140 vs. NCBI_GenBank
Match: XP_009041907.1 (hypothetical protein AURANDRAFT_59635 [Aureococcus anophagefferens] >EGB03376.1 hypothetical protein AURANDRAFT_59635 [Aureococcus anophagefferens])

HSP 1 Score: 383.3 bits (983), Expect = 1.500e-102
Identity = 224/509 (44.01%), Positives = 307/509 (60.31%), Query Frame = 0
Query:   99 ATLADEDEEGSAVPLELVARDKFDAWLATQPEQTKAMLAFKAEDGLTVKHREGSLEVIPAWGPGGKEGGRGVEKVVLFVRKGGLTVAEPFVLSSLPNLLPPSNTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGADDALKRKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPA----ECMGPPQLQQTAEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGREPRLIDLTWGEVDRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMASKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVLADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEA--LGRRLQDISWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGG-SGSITAALYLQEFLQKRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVK 601
            A +A +    +A  + ++   ++DAWLA +P++T+A L    E     + +  +L ++P             E+   F+ K     ++ F  SSLP+ LP +  Y L +F      A  P  AA SW +G Y  +  K   A K         DA  + A LV P   D  +  A +  T+L RD++N PA      MGP +L+  A  +AA +GA    + GD LL   YP IH+VGRAA   + PRL+DLTWG  D P++TLVGKGVC+DTGGLD+KP  +M  MKKDM GAA +LGLA M+M + LP+RLRVLIPAVEN+V   A+R GD++ ARNGKT+EI  TDAEGRLVLADALVEA ++ P+L++D ATLTGA RVA+GT++ A+++N+E   +   L   S    D  W +PL+ GYR+ + S +ADL+N G G   G+ITAALYL EF++     G         PW+H+D M  N G +PGRPEGG+A GMRALF +++
Sbjct:    9 APMAAQGMRFAATTVRVLREAEYDAWLAARPDETRAYL----EAAGLGEFKGDALALLP-------------ERTWAFLTK---DPSKLFAFSSLPSRLPGAGPYAL-EF-----AAPPPATAALSWGLGCYSFEACKGSTAPKA--------DAPPKFASLVVPAENDYGDAAAALKGTYLCRDLINAPAGDMQRDMGPAELEAVARRLAAAYGAAVATVEGDALL-AGYPQIHAVGRAAGEKQRPRLVDLTWGAADAPRVTLVGKGVCFDTGGLDLKPAAAMLTMKKDMGGAAHVLGLAAMVMDAGLPVRLRVLIPAVENAVSGDAFRPGDVLAARNGKTTEIGNTDAEGRLVLADALVEACADAPELLVDCATLTGAGRVALGTDVPALFANEEGVDVADELVAASAGEQDQVWRLPLWPGYREQIDSKIADLKNIGAGPYGGAITAALYLAEFVEAPPDGGA------PPPWLHLDMMAYNTGSKPGRPEGGEAMGMRALFRLLE 476          
The following BLAST results are available for this feature:
BLAST of NO13G00140 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name       E-value     Identity (%)  Description
CBJ32341.1       3.400e-123  49.40         Leucyl aminopeptidase [Ectocarpus siliculosus]
EWM22849.1       1.400e-113  92.51         leucyl aminopeptidase [Nannochloropsis gaditana]
WP_044431654.1   3.100e-108  46.18         leucyl aminopeptidase family protein [Skermanella ...
XP_005826586.1   1.500e-107  45.78         hypothetical protein GUITHDRAFT_114335 [Guillardia...
WP_084536961.1   5.400e-105  45.74         leucyl aminopeptidase family protein [Azospirillum...
WP_054169032.1   1.600e-104  45.96         leucyl aminopeptidase family protein [alpha proteo...
WP_102113648.1   1.600e-104  45.76         leucyl aminopeptidase family protein [Niveispirill...
WP_012567502.1   6.600e-103  46.54         leucyl aminopeptidase family protein [Rhodospirill...
XP_009041907.1   1.500e-102  44.01         hypothetical protein AURANDRAFT_59635 [Aureococcus...
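
The identity percentages in this table can be recomputed from the counts on each HSP statistics line (for example, 248/502 gives 49.40%). The sketch below is a hedged illustration in Python, not the database's own tooling; the function name `hsp_percentages` and the regular expression are assumptions introduced for this example.

```python
import re

# Matches "<label> = <matches>/<aligned length> (<percent>%)" on an HSP statistics line.
STAT = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def hsp_percentages(line):
    """Return {label: (recomputed percent, reported percent)} for one HSP line."""
    out = {}
    for label, num, den, reported in STAT.findall(line):
        recomputed = round(100 * int(num) / int(den), 2)
        out[label] = (recomputed, float(reported))
    return out

line = "Identity = 248/502 (49.40%), Positives = 323/502 (64.34%), Query Frame = 0"
for label, (recomputed, reported) in hsp_percentages(line).items():
    print(label, recomputed, reported)
```

The pattern only requires a `label = n/m (p%)` shape, so it does not depend on the exact spelling of the labels in the source line.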

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name  Unique Name  Species                                        Type
nonsL132      nonsL132     Nannochloropsis oceanica (N. oceanica IMET1)   syntenic_region
ncniR017      ncniR017     Nannochloropsis oceanica (N. oceanica IMET1)   syntenic_region
ngnoR057      ngnoR057     Nannochloropsis oceanica (N. oceanica IMET1)   syntenic_region
ngnoR056      ngnoR056     Nannochloropsis oceanica (N. oceanica IMET1)   syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name  Unique Name  Species                                        Type
NSK001902     NSK001902    Nannochloropsis salina (N. salina CCMP1776)    gene


The following polypeptide feature(s) derive from this gene:

Feature Name  Unique Name           Species                                        Type
NO13G00140.1  NO13G00140.1-protein  Nannochloropsis oceanica (N. oceanica IMET1)   polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name               Unique Name  Species                                           Type
jgi.p|Nanoce1779_2|675694  gene_11835   Nannochloropsis oceanica (N. oceanica CCMP1779)   gene
Naga_101043g2              gene7598     Nannochloropsis gaditana (N. gaditana B-31)       gene


The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Species                                        Type
NO13G00140.1  NO13G00140.1  Nannochloropsis oceanica (N. oceanica IMET1)   mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO13G00140 ID=NO13G00140|Name=NO13G00140|organism=Nannochloropsis oceanica|type=gene|length=4775bp
ATGAACAGCATGCTTGCCCTAGCCGCCTCTGTCTCCGCCTCCTCCCTCAG
ACACCACAATCGTGTCCCTACGCGCGCCTTCGTCTCAGCCGCGGCTGCGT
TGCTCAACAAACCTCACCATTCAGGATGGATGGCTTCGAAGAGAGGTGGC
TCGGCGGGAGGAGGAGGGATATTTGGGCGGAAGGGCAGTGTGAAGATGAT
GGGAACCTCCTGCCTCATGACCTCAACGCCCAACAGgtaagtggggggat
ggggagggatggaggacggaacgacggacgactcatttcttcttattagg
ataaagcaagactagctccaagagtctctttgacacttgcctcttctctc
tgtacatctctgcacccatcacgtacattcgcacgtcttaggcggtaacc
cctccacccccccccaacacacgtacaaacgccacgccacttgattcgta
gCAATTTTAAATATCCTCCCGCCGCGGCCTCGGTATTGGAAAAGACGACC
AGCTACCTGGCCACCTTAGCCGACGAGGACGAGGAAGGAAGCGCGgtagg
taggagggaggagatgggggggaggaaggaggaagggaggagggtctcct
ccctctcagccagcgtttattcttttctttctctcctttcatcggcccct
cggctcaaacccacgcctatcctcaccaatcgctcatctccccgtgtcac
ctccctccttccctagGTCCCTCTCGAGCTGGTTGCACGGGATAAATTTG
ACGCATGGCTGGCCACCCAGCCCGAGCAGACCAAGGCGATGCTAGCCTTC
AAGgtagagggggagggagggaaggagggagggaagaaggtaaaggagag
agaatggagggaataggatcccaagaaatgcatgcactggcagtttgacc
cgtgctcacatcctcccttcccttgctccctccctccctccccctttctc
cctccccattcagGCAGAGGACGGTTTGACGGTGAAGCATCGGGAGGGAA
GCCTAGAAGTGATCCCCGCCTGGGGCCCAGGAGGGAAGGAGGGAGGGAGG
GGAGTGGAGAAAGTGGTGCTGTTCGTCCGGAAAGGCGGGCTGACGGTGGC
GGAGCCTTTCGgtgaggagggagggagggagggagggagggagcgagggg
ggaagggaaagagggagggagtgcgagaaaataaatatgctagggcctat
caagtcagatgttcatgcacttcgaactttccttcctatgcttatagtct
gccctgttttcctccctgcctccctgcctccctctctcccttcctccctc
cagTGCTCTCCTCCCTGCCCAACCTCCTACCCCCCTCCAACACCTACGTG
TTGGTCGACTTCGAGGGAGGGGCAAACACTGCCGCCAACCCCACCCTCGC
CGCCTTCTCCTGGGCCATGGGCACCTACAAgtgagggatggagggaggga
ggaaaagagggagggacggcgccacggcgacgtcctgattcctgctatcg
cctcctttctcagccggcttattctcctgctttctccctccctccctccc
tccctccctccctccctccctccctccttcgtagGATGGATCGGTACAAG
CAGAAGCCGGCCAAGAAGGAGGGCGGGGGGGAGGGCGGGGCGGACGACGC
GCTCAAGAGGAAGGCGCGGTTGGTCTGGCCCAAGGGGGCGGATAAAGAGG
AGgtagggagggagggatggagggagggagggagggagggaaacggggag
ggatggatggatggatttctcttgacttattcgtttctagacgctccatc
ctctctccccttcctcctcccctcctccccccctccctccctccctccct
ccttccgcagGTCCAAGCCAAGGTGGGCTCGACCTTCCTCGTGCGGGATA
TGGTGAACATGCCCGCCGAGTGCATGGGCCCTCCCCAGCTCCAGCAGACG
GCCGAGATGGTGGCGGCACTGTTCGGGGCGAAGgtccgcccgtgcttcgt
ttccctccctccctccttccctccctccctccctccctccctttttctcc
ttccctccttccttcactctatccacccctaccctcgtgtatttgacttg
cccttcccccctttcccccccgccttccctccctccttccctcacccagG
CCAAGGTCATCATCGGCGATGATCTCCTCAAGGAGAACTATCCAATGATC
CACTCGGTGGGCCGCGCCGCGGACATAGGGCGAGAGCCTCGGCTCATCGA
CTTGACTTGGGGGGAGGTGGATAGgtgagggggggagggagggagggggg
gagggagggagggagggggagggggatcgacatgactcgggggaggtgga
taggtgaagggggggggagggagggagggagaaagggagggagggaggga
gggagggagggaggggaagttgcctgtcattgacactcgctcacatcttg
tgccgttttttccctctccccctcactccctccctccctcccttcctccc
cccctccattcctcctttcctagacctaccattaccctcttctctgcctt
cctgtcgctcttctcacattcccctttcccctccctccccccctccctcc
ccccctccctccccagGCCCAAGATCACGCTGGTGGGTAAAGGCGTGTGC
TACGACACCGGAGGCCTTGACATTAAGCCCGGAAACAGTATGGCAGGCAT
GAAGAAGGACATGGCAGGCGCGGCGCAGgtacctccctcccgccctcccc
tcgccctccctgtcactcgcgccatttcggacatgtctccgtttgtttgc
attcattcatccctccctccctccctccctccctccctcccttccgaccg
accgacggagggatcatggcctccaagctgcccatccgtctgcgtgtcct
ccctccctccctccctccctccctccctccctccctcagatcctcggcct
cgcctacatgatcatggcctccaagctgcccatccgtctgcgtgtcctcc
ctccctccctccctccctccctccctccctccctcagatcctcggcctcg
cctacatgatcatggcctccaagctgcccatccgtctgcgtgtcctcacc
ccctccctccctccctccctccctccctccctcagATCCTCGGCCTCGCC
TACATGATCATGGCCTCCAAGCTGCCCATCCGTCTGCGTGTCCTCATCCC
GGCCGTCGAGAATAGCGTCGATGCCCGCGCTTACCGTAACGGGGACATCA
TTCAGgcatgtgccctccctccctccctccctccctccctccctccttct
ccttgcgtcctagcgccttaaactgccttccatcctcccttttactgccc
acctcattgtcacacctccctctccctccctccctccctccctcctatcc
cagGCCCGCAACGGCAAGACCAGCGAGATTATGACGACCGACGCCGAAGG
CCGCCTTGTGCTGGCAGATGCTCTGGTGGAGGCCGATAGTGAAGACCCGG
ATCTCATCATCGATGCCGCCACGCTCACGGgtaagtctatccctccctcc
ctccctccctcctccattcctactttaacttgtcatctcatccacgcgag
tgttcgctccctccctccctccctccctccctccctccctccctccctag
GCGCGGCCCGCGTGGCTGTTGGTACCGAAATGTCAGCCATTTGGAGCAAC
AAGGAGGCGCTCGGTCGACGCCTCCAGgtacgccctccctccctccctcc
ctccctccctctctatgccttcttctttatttttggtcctcctatagcgg
ctgggcggttgcctgccctccttccctccctccctccctccctcctttcc
tccctccctcccgcccctcccttctcgtctcgctcaccgtcacatttccc
gccgtccctcttctcgtttcccccctccccccctccctccctccctccca
agGACATCTCGTGGGAGATCAACGACCCGTGCTGGATGATGCCGCTCTAC
GCAGGgtagggagggatagaaggaaggagggagggagggaagaagggagg
ccagtgacgaagaagagcgatgcatccgtcgcaagcgttggactaaatgc
ttactcccccccccccttcctccttcccttcctccgtcccgccctcgtcc
ttttcctcgtgacagGTACCGCAAGCACCTCAAGAGCAACCTGGCCGACT
TACGGAACACGGGGAACGGGGGCAGgtagggagggagggagggagggagg
gagggaaggaggaaaggaaaacaagtgcaccttgttatcttttactatct
cctcatatcatgtattcgtatccattatccctctcttccttcctccctcc
ctccctccctcccttattcttcagCGGGAGCATCACTGCTGCCCTCTACC
TGCAGGAGTTCCTGCAGAAACGCTCCTCCCCTGGCACCTCCCTTCAAAAG
CATAAGGCAACTCCATGGATCCACATGGACTTCATGGGCTCCAACAATGG
AGGACGACCAGgtagaggaaggagggagagagaaagggaaggagggaggg
aagaaggggaggaaggtgggcttgggaagagacgaggagcggagtttctg
accttttcttttttccctatcgttcttttttcaccgtgacatagGGCGAC
CGGAAGGAGGGGATGCACAAGGCATGCGTGCACTGTTTGAGGTCGTGAAG
GAGGTGGGGGCGAAAGGGGGGTGGCCGAAGGATGGGAAGGAGGAGGAGGA
GGGGGAGGAGGGCAAGGACGAGTAA
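
The FASTA header above packs its metadata into `key=value` pairs separated by `|`. The following Python sketch shows one way such a header could be split into fields; `parse_fasta_header` is a hypothetical helper written for this example, and the assumed layout (record ID, one space, then the pairs) is inferred only from the headers on this page.

```python
def parse_fasta_header(header):
    """Split '>ID key=value|key=value|...' into (record_id, {key: value})."""
    body = header.lstrip(">").strip()
    record_id, _, rest = body.partition(" ")
    fields = dict(item.split("=", 1) for item in rest.split("|") if "=" in item)
    return record_id, fields

header = (">NO13G00140 ID=NO13G00140|Name=NO13G00140|"
          "organism=Nannochloropsis oceanica|type=gene|length=4775bp")
record_id, fields = parse_fasta_header(header)
# The length field carries a unit suffix; keep only the leading digits.
length = int("".join(ch for ch in fields["length"] if ch.isdigit()))
print(record_id, fields["type"], length)
```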

protein sequence of NO13G00140.1

>NO13G00140.1-protein ID=NO13G00140.1-protein|Name=NO13G00140.1|organism=Nannochloropsis oceanica|type=polypeptide|length=625bp
MNSMLALAASVSASSLRHHNRVPTRAFVSAAAALLNKPHHSGWMASKRGG
SAGGGGIFGRKGSVKMMGTSCLMTSTPNSNFKYPPAAASVLEKTTSYLAT
LADEDEEGSAVPLELVARDKFDAWLATQPEQTKAMLAFKAEDGLTVKHRE
GSLEVIPAWGPGGKEGGRGVEKVVLFVRKGGLTVAEPFVLSSLPNLLPPS
NTYVLVDFEGGANTAANPTLAAFSWAMGTYKMDRYKQKPAKKEGGGEGGA
DDALKRKARLVWPKGADKEEVQAKVGSTFLVRDMVNMPAECMGPPQLQQT
AEMVAALFGAKAKVIIGDDLLKENYPMIHSVGRAADIGREPRLIDLTWGE
VDRPKITLVGKGVCYDTGGLDIKPGNSMAGMKKDMAGAAQILGLAYMIMA
SKLPIRLRVLIPAVENSVDARAYRNGDIIQARNGKTSEIMTTDAEGRLVL
ADALVEADSEDPDLIIDAATLTGAARVAVGTEMSAIWSNKEALGRRLQDI
SWEINDPCWMMPLYAGYRKHLKSNLADLRNTGNGGSGSITAALYLQEFLQ
KRSSPGTSLQKHKATPWIHMDFMGSNNGGRPGRPEGGDAQGMRALFEVVK
EVGAKGGWPKDGKEEEEGEEGKDE*