NO03G03280, NO03G03280 (gene) Nannochloropsis oceanica

Overview
Name: NO03G03280
Unique Name: NO03G03280
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 3321
Alignment location: chr3:976010..979330 -
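
The sequence length and the alignment location above can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (an assumption that is consistent with the reported length of 3321); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re

def parse_location(loc: str):
    """Parse a location such as 'chr3:976010..979330 -'.

    Returns (seqid, start, end, strand). A missing strand is treated as '+',
    which is an assumption made for this sketch.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])?", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand or "+"

def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr3:976010..979330 -")
print(seqid, strand, span_length(start, end))
```

With inclusive coordinates, 979330 - 976010 + 1 gives 3321, which matches the sequence length listed in the overview.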

Properties
Property Name    Value
Description      Cathepsin b
Alignments
The following features are aligned
Aligned Feature    Feature Type    Alignment Location
chr3               genome          chr3:976010..979330 -
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                   Date Performed
GSE178672                                       2023-12-15
PRJNA933693                                     2023-10-26
PRJNA943449                                     2023-07-19
PXD016054                                       2021-01-14
GSE149904                                       2020-10-10
NoIMET1_WT_HS                                   2020-09-29
PXD010030                                       2020-03-16
GSE139615                                       2020-03-11
PRJNA241382                                     2020-03-11
BLASTP analysis of N. oceanica IMET1 genes      2019-07-11
InterPro analysis for N. oceanica IMET1 genes   2019-07-11
GO annotation for IMET1v2 genes                 2019-07-10
PRJNA182180                                     2019-04-25
Gene prediction for N. oceanica IMET1           2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term          Definition
GO:0006508    proteolysis
GO:0019538    protein metabolic process
Vocabulary: Molecular Function
Term          Definition
GO:0008234    cysteine-type peptidase activity
GO:0016787    hydrolase activity
Vocabulary: INTERPRO
Term         Definition
IPR014756    Ig_E-set
IPR038765    Papain_like_cys_pep_sf
IPR013128    Peptidase_C1A
IPR003172    ML_dom
IPR000668    Peptidase_C1A_C
Homology
BLAST of NO03G03280 vs. NCBI_GenBank
Match: EWM29737.1 (cathepsin b [Nannochloropsis gaditana])

HSP 1 Score: 813.1 bits (2099), Expect = 5.200e-232
Identity = 385/482 (79.88%), Positives = 433/482 (89.83%), Query Frame = 0
Query:   84 LAVTVGV-----TAAVDVPFSSCSSSDALGVTKLVVSEWPIHPGHPTTFTVVFNPTKDIVSGTKLSAAVMIKNMQVFEHSVDMCSQTSLTCPLKAGEAVQVSATQTLPEGIPPVKDLKAVVQASDATGVISCTEVEIELAVGEYTLEHLGRLPAIDSSLVTDVLMSGTTWTPHFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFG-VTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCIVHGSKDLLSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSGDSCLPYPFLTCAHHVEPTEELPACPAEDFETPKCKHSCSEEEFEKDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAGSV 560
            L V VG      + AVDVPF++CS++DALGV+K+V+SEWP+HPGHP T +V F+P +DIV GT+L+A V I+NMQVFEHSVDMCS+TSLTCPLKAGE +QVSA+QTLPEGIPPV  LK VVQA+D TGVISC EVE+ELAVGEY+L+HL RL AIDS+LVTD+LM+GTTWTPHFSPRFAGA LKQ QG+MGTWLRGHPLHMTL  K+ V G    KG+N+TIPE+FD+REAWP C+E+IG+VKDQSACGSCWAFASSAAFEDR CI HGSKDLLSPTDTLACC+GMACGFSQGC+GGQPAGAW+FFASQGVVSGGLFEEVG+G +C+PYPFLTCAHHVEPTEELPACPAEDFETP+C+H+CSEE +E+ Y  DKRM   GY+VRA VEEIQKEIME+GPVSAAFTVYQDFL YSGEGVY HVTGAPLGGHAVKLIGWG +NGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAGSV
Sbjct:   10 LTVVVGAALFLCSRAVDVPFATCSTTDALGVSKVVMSEWPLHPGHPVTLSVAFSPLQDIVFGTQLTAKVFIQNMQVFEHSVDMCSETSLTCPLKAGEDMQVSASQTLPEGIPPVSGLKVVVQATDDTGVISCIEVELELAVGEYSLDHLSRLSAIDSALVTDILMAGTTWTPHFSPRFAGATLKQAQGLMGTWLRGHPLHMTLKLKEIVMGPAALKGENMTIPEAFDAREAWPECAELIGRVKDQSACGSCWAFASSAAFEDRMCIAHGSKDLLSPTDTLACCTGMACGFSQGCNGGQPAGAWSFFASQGVVSGGLFEEVGTGTTCMPYPFLTCAHHVEPTEELPACPAEDFETPRCRHTCSEELYEETYSADKRMAKRGYSVRANVEEIQKEIMESGPVSAAFTVYQDFLTYSGEGVYQHVTGAPLGGHAVKLIGWGSDNGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAGSV 491          
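
The percentages in an HSP header are the ratio of identical (or positively scoring) columns to the alignment length, gaps included. As a minimal sketch, the fractions can be pulled out of the header line and recomputed; `hsp_percentages` is a hypothetical helper, and the regular expression assumes the `label = n/m (p%)` layout used on this page.

```python
import re

# Matches fragments such as "Identity = 385/482 (79.88%)".
STAT = re.compile(r"(\w+)\s*=\s*(\d+)/(\d+)\s*\(([\d.]+)%\)")

def hsp_percentages(line: str):
    """Return {label: (recomputed_percent, reported_percent)} for one HSP line."""
    out = {}
    for label, num, den, reported in STAT.findall(line):
        recomputed = round(100 * int(num) / int(den), 2)
        out[label] = (recomputed, float(reported))
    return out

line = "Identity = 385/482 (79.88%), Positives = 433/482 (89.83%), Query Frame = 0"
for label, (recomputed, reported) in hsp_percentages(line).items():
    print(label, recomputed, reported)
```

The label is captured generically, so the same function works on any of the HSP header lines in this section regardless of how the second label is spelled.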
BLAST of NO03G03280 vs. NCBI_GenBank
Match: CBN78981.1 (cathepsin B-like proteinase [Ectocarpus siliculosus])

HSP 1 Score: 311.2 bits (796), Expect = 6.400e-81
Identity = 175/450 (38.89%), Positives = 256/450 (56.89%), Query Frame = 0
Query:  156 EHSVDMCSQTS-LTCPLKAGEAVQVSATQTLPEGIPPVKDLKAVVQASDATGVISCTEVEIELAVGEYTLEHLGRLPAIDSSLVTDVLM------------SGTTWTPHFSPRFAGANLKQVQGMM-GTWLRGH-------------PLHMTLPYKDQVFGVTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCIVHGSKD---------------LLSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSGDSCLPYPFLTCAHHVEP-TEELPACPAEDFETPKCKHSCSEEEFE-KDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVE--NGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAGSV 560
            E  +D+C+  + + CPL+AG+  + S   T    + P +  +  V+ S  T VI+  + +       +T   +       ++L+ D L             + ++W+  +S RF G + K  + +  GT +RG               +   +P + ++  V     +  IP +FD+REA+P C+ +IG+V+DQS CGSCWAFAS+ AF DR+CI    K+               +LS  DT ACC G  CG S GC+GGQP  AW +F   GVV+GG + ++G+G +C PY F+ CAHHV+P     PACP  ++ TP+C   CSE  F    Y EDK+M    Y++ A +E IQ+++M+ G V+AAF+V+ DFL YSG GVYTH +G+ +GGHAVK+IGWG +  +G  YWL+ NSWN +WG+ GLFRI+RG +ECG E  I AG V
Sbjct:  113 EFELDLCTDVAGVRCPLQAGD--RFSGVATWNAFVLPERGDENAVE-SLTTTVITAVQPDGPACGQFFTFHDVAGKEDASATLLEDHLSELVESDESSRGPAASSWSRGYSSRFEGFSWKDARRIAGGTVMRGQVGFEELPRRRYTKEIAPAVPGRRRLTPVAQSSSDEDIPANFDAREAFPECASIIGRVRDQSDCGSCWAFASTEAFNDRRCIAGIGKEDAAGAEGEATADQLLVLSAEDTTACCHGFHCGLSMGCNGGQPGSAWKWFTKTGVVTGGDYADIGTGTTCKPYEFMPCAHHVDPGASGYPACPDGEYPTPECLSECSETNFSGGSYGEDKKMAREAYSL-AGIENIQRDMMKYGSVTAAFSVFSDFLTYSG-GVYTHESGSFMGGHAVKMIGWGTDEVSGEDYWLIANSWNPSWGEGGLFRILRGVNECGIEGQIVAGEV 557          
BLAST of NO03G03280 vs. NCBI_GenBank
Match: CEM19676.1 (unnamed protein product [Vitrella brassicaformis CCMP3155])

HSP 1 Score: 295.0 bits (754), Expect = 4.800e-76
Identity = 150/316 (47.47%), Positives = 198/316 (62.66%), Query Frame = 0
Query:  246 TTWTPHFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFGVTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCI-VHGS-KDLLSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSGDSCLPYPFLTCAHHVEPTEELPACPAEDFETPKCKHSCSEEEFEKDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAGSV 560
            TTW     PRF   ++K  + +MGT++    +H+      Q    T   D   +PESFD+RE WP C +VIG V+DQ+ CGSCWAFAS+ AF DR CI  +GS + LLSP DT +CC    C FS GC GGQPA AW +F   GVVSGG + ++G+GD+C PY    C+HHV+     PAC  E   TPKC   CSE  + + + +D+      + +   V+E ++EIMENGP++ AF+VY DFL Y   GVY HV G  LGGHA+K++GWGVE+G  YWLV+NSWN  WGD+G F+I  G  +CG    ++ G V
Sbjct:  114 TTWVAENPPRFRQMSVKDARRLMGTFVGDQYIHL------QEKKPTVPVDRDALPESFDARERWPECKDVIGHVRDQAECGSCWAFASTEAFNDRMCIKTNGSFQTLLSPQDTTSCCDESHC-FSFGCDGGQPALAWQWFTEVGVVSGGDYGDIGTGDTCWPYQLPMCSHHVK--GPYPACNGEQ-STPKCMAKCSETNYTEQFNQDRHKAYEAFTL-FDVDEAKREIMENGPITGAFSVYSDFLTYK-SGVYQHVKGRMLGGHAIKILGWGVEDGVEYWLVLNSWNDTWGDKGYFKIKMG--DCGINDMLSTGHV 415          
BLAST of NO03G03280 vs. NCBI_GenBank
Match: XP_004614813.1 (PREDICTED: cathepsin B [Sorex araneus])

HSP 1 Score: 285.0 bits (728), Expect = 4.900e-73
Identity = 156/329 (47.42%), Positives = 199/329 (60.49%), Query Frame = 0
Query:  231 PAIDSSLVTDVLMSGTTWTPHFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFGVTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCI-VHGSKDL-LSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSGDSCLPYPFLTCAHHVEPTEELPACPAEDFETPKCKHSCSEEEFEKDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAG 558
            PA+   LV  V    TTW       F  A+L  V+ + GT L G      LP K Q+       + + +PE+FD+RE WP C   I +++DQ +CGSCWAF ++ A  DR CI  +G   + +S  D L CC G+ CG  +GC+GG P+GAW F+  QG+VSGGL++   S   C PY    C HHV  +   P C  E   TPKC   C E  +   YKEDK  G + Y+V +  EEI+ EI +NGPV AAF+VY DF AY   GVY HV G  +GGHAV+++GWGVE+GTPYWLV NSWN  WGD G F+I+RGQD CG ES+I AG
Sbjct:   24 PALSDELVNYVNKQNTTW--QAGHNFPNAHLSYVKKLCGTVLGG----PRLPQKVQL------TEPIKLPENFDAREQWPNC-PTIKEIRDQGSCGSCWAFGAAEAISDRTCIHTNGRVSVEVSAEDLLTCC-GLQCG--EGCNGGFPSGAWNFWKKQGLVSGGLYD---SHVGCRPYSIPPCEHHVNGSR--PPCTGEGGGTPKCSKIC-EAGYGSTYKEDKHFGCSSYSVSSDEEEIKAEIYKNGPVEAAFSVYGDFFAYK-SGVYQHVAGEMMGGHAVRILGWGVEDGTPYWLVGNSWNTDWGDNGFFKILRGQDHCGIESEIVAG 329          
BLAST of NO03G03280 vs. NCBI_GenBank
Match: AAR19103.1 (cathepsin B [Uronema marinum])

HSP 1 Score: 284.3 bits (726), Expect = 8.400e-73
Identity = 150/341 (43.99%), Positives = 205/341 (60.12%), Query Frame = 0
Query:  232 AIDSSLVTDVLM-------SGTTWTPHFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFGVTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCIVHGSKD--LLSPTDTLACCSG-MACGFSQGCSGGQPAGAWAFFASQGVVSGGLF--EEVGSGDSCLPYPFLTCAHHVEPTEELPAC-PAEDFETPKCKHSCSEEEFEKDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAGSV 560
            A D  L T  +M       +G+TW   ++ RF G +  Q+Q MMGT     P+HM     D+ +   +   N+++PESFD REA+P C E + +V+DQS CGSCWAF +  A  DR CI  G KD   +S  + L+CC G  ACG   GC+GG  AGAW ++   G+VSG L+  +   S   C PY F  C+HHV+   E  AC     F TPKC   C+ +  +  Y++D   G + Y+V    E+I+ EI + G  +A+F VY DFL YS  GVY + +G+ +GGHA+K++GWGVENGTPYWL  NSWN +WG+ G F+I+RG +ECG ES + AG V
Sbjct:   17 AFDFKLFTSEIMEEVNNYNTGSTWKAGYNKRFEGMSFDQIQAMMGT--IATPVHM---IPDERYTPFETIQNLSLPESFDLREAYPKC-ESLQQVRDQSNCGSCWAFGTVEAISDRICIASGQKDQTRISSENLLSCCRGTFACG--MGCNGGYTAGAWNYYVKTGLVSGNLYTDDNQNSKTECQPYSFPPCSHHVQ--GEYQACTDLPQFNTPKCYTECNSQYTQNSYEQDLHKGVSSYSVPKSEEQIKAEIYQYGSTTASFNVYSDFLTYS-SGVYQNTSGSYMGGHAIKMLGWGVENGTPYWLCANSWNSSWGENGFFKILRGSNECGIESGMVAGFV 346          
BLAST of NO03G03280 vs. NCBI_GenBank
Match: PDM74592.1 (cpr-6 [Pristionchus pacificus])

HSP 1 Score: 282.0 bits (720), Expect = 4.200e-72
Identity = 157/324 (48.46%), Positives = 188/324 (58.02%), Query Frame = 0
Query:  236 SLVTDVLMSGTTWTPHFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFGVTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCIVHGS--KDLLSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSGDSCLPYPFLTCAHHVEPTEELPACPAEDFETPKCKHSCSEEEFEKDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAG 558
            +L+  V    + WT    PRF         G+MG      P+          F  TD      IPESFDSRE WP C E I  V+DQS+CGSCWAF ++ A  DR CI   +  +  +S  D L+CC   +CGF  GC+GG P  AW ++   G+VSGG F    S   C PYPF  C HH   T   P C ++ F TPKC+  C     EK Y EDK  G   Y V+  VE IQKEIM +GPV  AF VY+DFL Y+G GVY H  G   GGHAVK+IGWGV+NG PYWLVVNSWN+ WG++GLFRIIRG DECG ES +  G
Sbjct:  588 ALIKYVNRKQSLWTARRHPRFDSYPDATKWGLMGVEHVRLPVSALKDLSPTRFLATD------IPESFDSREQWPDC-ESIKVVRDQSSCGSCWAFGAAEAMSDRICIASNAEIQVSISADDLLSCCK--SCGF--GCNGGDPLQAWKYWVKDGIVSGGNFT---SHAGCKPYPFPPCEHHSNKTHYDP-CKSDLFPTPKCEKKCVSGYTEKSYNEDKFYGKTAYGVKDDVEAIQKEIMTHGPVEVAFEVYEDFLNYAG-GVYVHEGGKLGGGHAVKMIGWGVDNGIPYWLVVNSWNEDWGEDGLFRIIRGVDECGIESGVVGG 895          
BLAST of NO03G03280 vs. NCBI_GenBank
Match: XP_012692280.1 (PREDICTED: cathepsin B [Clupea harengus])

HSP 1 Score: 281.6 bits (719), Expect = 5.400e-72
Identity = 147/331 (44.41%), Positives = 194/331 (58.61%), Query Frame = 0
Query:  229 RLPAIDSSLVTDVLMSGTTWTPHFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFGVTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCIVHGSKDL--LSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSGDSCLPYPFLTCAHHVEPTEELPACPAEDFETPKCKHSCSEEEFEKDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAG 558
            RLP +   +V  +  + TTW       F   +   VQ + GT L+G  L + + Y  +          +  P +FDSRE WP C   I +++DQ +CGSCWAF ++ A  DR CI   SK    +S  D L CC   +CG   GC+GG P+ AW F+  QG+VSGGL++   S   C PY    C HHV  +   P+C  E  +TP+C  SC E  +   YK DK  G + Y V  + ++I KEI ENGPV  AFTVY+DFL Y G GVY HVTG+ +GGHA+K++GWG ENGTPYWL  NSWN  WG+ G F+I++G D CG ES++ AG
Sbjct:   21 RLPPLSHEMVNYINKANTTWKA--GHNFHNVDYSYVQKLCGTLLKGPKLPIMVQYAGE----------MNFPTNFDSREQWPNC-PTIKEIRDQGSCGSCWAFGAAEAMSDRVCIHSDSKVSVEISSEDLLTCCK--SCG--MGCNGGYPSAAWDFWTKQGLVSGGLYD---SHIGCRPYTIEPCEHHVNGSR--PSCSGEGGDTPRCAKSC-EAGYSPSYKSDKHYGKSSYNVGEEEKQIMKEISENGPVEGAFTVYEDFLLYKG-GVYQHVTGSAVGGHAIKVLGWGEENGTPYWLCANSWNTDWGENGFFKILKGSDHCGIESEMVAG 327          
BLAST of NO03G03280 vs. NCBI_GenBank
Match: XP_007429057.1 (PREDICTED: cathepsin B [Python bivittatus] >XP_007429058.1 PREDICTED: cathepsin B [Python bivittatus])

HSP 1 Score: 280.4 bits (716), Expect = 1.200e-71
Identity = 147/351 (41.88%), Positives = 204/351 (58.12%), Query Frame = 0
Query:  210 CTEVEIELAVGEYTLEHLGRLPAIDSSLVTDVLMSGTTWTPHFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFGVTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCI-VHGSKDL-LSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSGDSCLPYPFLTCAHHVEPTEELPACPAEDFETPKCKHSCSEEEFEKDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAGS 559
            C+ V + L V   +  ++   P +   LV  +    TTW       F  A++  V+ + GT+L G  L     +            ++ +P+SFDSR+ WP C   IG+++DQ +CGSCWAF +  A  DR C+  +G  ++ +S  D L+CC    CG   GC+GG P+GAW ++  +G+VSGGL++   S   C PY    C HH   T   P C  E  +TP+C  SC E  +   Y+EDK  G + Y+V    +EI  EI +NGPV AAFTVY DFL Y   GVY HV+G  +GGHA++++GWGV+ GTPYWLV NSWN  WG+ G FRI+RGQD CG ES++ AG+
Sbjct:    3 CSVVTLGLLVVLTSARNIPHFPPLSHDLVNYINKLNTTWKA--GHNFRDADMSYVKTLCGTFLHGPKLPERFEF----------AADLVLPDSFDSRQQWPNC-PTIGEIRDQGSCGSCWAFGAVEAMSDRICVHTNGKVNVEVSAEDLLSCCQ-FECG--MGCNGGYPSGAWRYWTEKGLVSGGLYD---SHVGCRPYSIPPCEHHTNGTR--PPCTGEGGDTPECVRSC-EAGYFPSYQEDKHYGMSSYSVPGNEKEIMSEIYKNGPVEAAFTVYSDFLMYK-SGVYQHVSGEAVGGHAIRILGWGVDKGTPYWLVANSWNTDWGENGFFRILRGQDHCGIESEVVAGT 330          
BLAST of NO03G03280 vs. NCBI_GenBank
Match: NP_001079570.1 (cathepsin B precursor [Xenopus laevis] >XP_018119719.1 PREDICTED: cathepsin B isoform X2 [Xenopus laevis] >AAH44689.1 MGC53360 protein [Xenopus laevis])

HSP 1 Score: 280.0 bits (715), Expect = 1.600e-71
Identity = 152/334 (45.51%), Positives = 193/334 (57.78%), Query Frame = 0
Query:  226 HLGRLPAIDSSLVTDVLMSGTTWTPHFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFGVTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCI-VHGSKDL-LSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSGDSCLPYPFLTCAHHVEPTEELPACPAEDFETPKCKHSCSEEEFEKDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAG 558
            HL     +   +V  +    TTW       FA A+L  V+ + GT L+G       P   + FG     D + +P+SFDSR AWP C   I +++DQ +CGSCWAF +  A  DR C+  +G  ++ +S  D L+CC G  CG   GC+GG P+GAW F+   G+VSGGL++   S   C PY    C HHV  +   PAC  E+ +TPKC   C EE +   Y  DK  GT  Y V    +EI  EI +NGPV  AF VY DF  Y   GVY H TG  LGGHA+K++GWGVENGTPYWL  NSWN  WGD G F+I+RG+D CG ES+I AG
Sbjct:   19 HLPYFAPLSHDMVNYINKVNTTWKA--GHNFANADLHYVKRLCGTLLKG-------PQLQKRFGF---ADGLELPDSFDSRAAWPNC-PTIREIRDQGSCGSCWAFGAVEAISDRVCVHTNGKVNVEVSAEDLLSCC-GDECG--MGCNGGYPSGAWQFWTETGLVSGGLYD---SHVGCRPYSIPPCEHHVNGSR--PACKGEEGDTPKCVKQC-EEGYSPAYGTDKHFGTTSYGVPTSEKEIMAEIYKNGPVEGAFLVYADFPLYK-SGVYQHETGEELGGHAIKILGWGVENGTPYWLCANSWNTDWGDNGFFKILRGKDHCGIESEIVAG 329          
BLAST of NO03G03280 vs. NCBI_GenBank
Match: XP_004449108.1 (cathepsin B isoform X2 [Dasypus novemcinctus])

HSP 1 Score: 280.0 bits (715), Expect = 1.600e-71
Identity = 149/327 (45.57%), Positives = 197/327 (60.24%), Query Frame = 0
Query:  233 IDSSLVTDVLMSGTTWTPHFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFGVTDKGDNVTIPESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCI-VHGSKDL-LSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSGDSCLPYPFLTCAHHVEPTEELPACPAEDFETPKCKHSCSEEEFEKDYKEDKRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVTGAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGFESDITAG 558
            +   LVT +    TTW       F   ++  V+ + GT+L G      LP + ++       + + +PE+FDSRE WP C   I +++DQ +CGSCWAF +  A  DR CI  +G  ++ +S  D L+CC GMACG   GC+GG PA AW F+  +G+VSGGL+    S   C PY    C HHV  +   P C  E+ +TP+C  +C E  +   YKEDK  G + Y V +  EEI  EI +NGPV  AFTVY+DFLAY   GVY H TG  +GGHA++++GWGV+NGTPYWL  NSWN  WGD G F+I+RG+D CG ES I AG
Sbjct:   26 LSDELVTYINTRNTTWKA--GHNFRNVDMSYVKRLCGTFLDG----PRLPQRVRL------AEEMDLPENFDSREQWPNC-PTIREIRDQGSCGSCWAFGAVEAISDRVCIHTNGKVNVEVSAEDLLSCC-GMACG--DGCNGGFPAAAWNFWTKKGLVSGGLY---NSHVGCRPYSIPPCEHHVNGSR--PPCTGEEGDTPECSKTC-EPGYSPSYKEDKHYGYSSYGVPSSEEEIMAEIYKNGPVEGAFTVYEDFLAYK-SGVYQHETGVMVGGHAIRVLGWGVDNGTPYWLAANSWNTDWGDNGFFKILRGKDHCGIESSIVAG 329          
The following BLAST results are available for this feature:
BLAST of NO03G03280 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name        E-value       Identity (%)   Description
EWM29737.1        5.200e-232    79.88          cathepsin b [Nannochloropsis gaditana]
CBN78981.1        6.400e-81     38.89          cathepsin B-like proteinase [Ectocarpus siliculosu...
CEM19676.1        4.800e-76     47.47          unnamed protein product [Vitrella brassicaformis C...
XP_004614813.1    4.900e-73     47.42          PREDICTED: cathepsin B [Sorex araneus]
AAR19103.1        8.400e-73     43.99          cathepsin B [Uronema marinum]
PDM74592.1        4.200e-72     48.46          cpr-6 [Pristionchus pacificus]
XP_012692280.1    5.400e-72     44.41          PREDICTED: cathepsin B [Clupea harengus]
XP_007429057.1    1.200e-71     41.88          PREDICTED: cathepsin B [Python bivittatus] >XP_007...
NP_001079570.1    1.600e-71     45.51          cathepsin B precursor [Xenopus laevis] >XP_0181197...
XP_004449108.1    1.600e-71     45.57          cathepsin B isoform X2 [Dasypus novemcinctus]

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name    Unique Name    Species                                         Type
nonsL068        nonsL068       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region
ncniR000        ncniR000       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region
ngnoR052        ngnoR052       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name    Unique Name    Species                                         Type
NSK005566       NSK005566      Nannochloropsis salina (N. salina CCMP1776)     gene


The following polypeptide feature(s) derive from this gene:

Feature Name    Unique Name             Species                                         Type
NO03G03280.1    NO03G03280.1-protein    Nannochloropsis oceanica (N. oceanica IMET1)    polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name                 Unique Name    Species                                            Type
jgi.p|Nanoce1779_2|572515    gene_1522      Nannochloropsis oceanica (N. oceanica CCMP1779)    gene
Naga_100022g13               gene866        Nannochloropsis gaditana (N. gaditana B-31)        gene


The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Species                                         Type
NO03G03280.1    NO03G03280.1    Nannochloropsis oceanica (N. oceanica IMET1)    mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO03G03280 ID=NO03G03280|Name=NO03G03280|organism=Nannochloropsis oceanica|type=gene|length=3321bp
AAATCATCAGGCTACGAGCACATGCCCGCTAAAGCCGTGCACACATGTGT
ACCAGGGAGCCACTGGAAAGAGATCCAGGCTGGGGCCGTGTTGAGCCCTA
GCGGGGCTCTTCAGTGCGGTGGTGCGAGAGAGAGCCGAGACGGCAACCAA
ATTTCAAACGAGTATGCAGACCCCCACTCAAGCACCCACACCACCGGCAT
ACTTACACACAACGCGCATCCACGACGACACGACACAGCGAAAATGCATT
TTTTTGGGCGCAGCCTAGCCCTGGCTGTGACCGTGGGCGTGACAGCCGCC
GTGGACGTCCCCTTTTCGTCTTGCTCCTCAAGCGATGCTCTAGgtgagtt
gcgtactgatcgttctttgtttgtccgtgagtgtgtgtgggtgctcgcct
ggactgagtgtaataattatgtgcccatttgaagcatgtgtcttgtcttt
ggccccttggtttatctcaatggcgtgttgagcccggaaattcctgcctt
ccctgttgtcgctgtgttcctcgccgggccaggggctcattggtttatgc
tcatagccaagatattcattataatcttggtgacagatgtaatctatttt
gatgtgttagtttgtgttctacttccctttcgtattgttccagcattttt
cacacgccgctcaccatttacacaccaaattttcaatacaaccccttccc
ccaccctcaaaaagGTGTGACCAAATTGGTGGTGTCTGAATGGCCCATCC
ACCCAGGACATCCTACCACGTTTACGGTCGTTTTCAACCCGACCAAAGAC
ATCGTGTCCGGGACTAAGCTATCCGCCGCTGTCATGATCAAAAACATGCA
Ggtatgagtgtatacgtgtatttgtgtgcgtctgcatccctctctgtctc
tctctgtctctctctgtgtctatatatatgcatagtaaggtaaaatgtaa
gtaaggtaaggtggttcgaattcgcttgcgcgttggtgtcgtaagcagct
gcaacagcagccgcaacaccagcagccgcaacactaaaatttactcaaat
caattcatggtcctcccacgctacaacacagGTCTTCGAACACAGCGTGG
ACATGTGCAGCCAAACCAGCCTGACCTGCCCTCTGAAAGCAGGCGAAGCC
GTCCAAGTATCCGCTACCCAAACACTCCCCGAGGGTATTCCTCCCGTCAA
GGACCTCAAGGCAGTCGTCCAGGCCTCGGATGCCACGGGCGTAATCAGTT
GCACTGAGGTCGAGATTGAGCTGGCGGTGGGTGAGTATACTCTGGAGCAC
CTTGGGCGCCTCCCTGCAATTGATTCGTCATTGGTCACTGACGTGTTGAT
GAGCGGCACGACGTGGACGCCGCATTTCTCCCCCAGATTTGCTGGCGCGA
ACCTGAAGCAAGTGCAGGGAATGATGGGGACGTGGCTGAGGGGGCATCCA
TTGCACATGACTTTACCGTACAAGGACCAGGTTTTTGGTGTGACAGACAA
AGGGGATAATGTGACGATTCCTGAAAGTTTCGACTCGCGAGAGGCGTGGC
CGGCGTGTAGCGAGGTTATCGGCAAGgtatggatgtaggcaagggcaccg
cggcgtacgatctgtcatgtgaagatgaatgtcacctctcctctcctccc
tccttccttctttccgtagGTCAAGGATCAGAGTGCCTGTGGCTCGTGCT
GGGCTTTTGCCAGCTCTGCCGCTTTCGAGGACAGGCAGTGCATCGgtacg
tctctattctattcccccccccccttcctctccaactttctactcaccct
ccctcctttcctccctctctccctccttttcttggcgtagTGCACGGCTC
CAAGGACCTCCTTTCCCCCACCGACACCCTCGCCTGCTGCAGTGGCATGG
CCTGCGGATTTTCTCAAGGGTGCAGCGGCGGACAACCGGCTGGGGCCTGG
GCCTTTTTCGCCTCCCAGGGGGTGGTGTCGGGCGGGCTGTTCGAAGAGGT
CGGCTCGGGAGACTCGTGCTTGCCTTACCCTTTCTTGACGTGTGCGCACC
ACGTGGAACCGACGGAAGAATTGCCTGCGTGCCCCGCGGAAGATTTTGAG
ACACCCAAATGCAAGCATAGTTGCTCGGAGGAAGAGTTCGAGAAGGACTA
TAAGGAGGACAAGCGGATGGGGACGGCCGGGTATGCCGTGAGGGCCAAGG
TAGAGGAGATCCAGAAGGAAATCATGGAAAATGGGCCGGTGTCGGCGGCA
TTTACGGTGTATCAGGACTTTCTGGCCTACTCGGGGGAAGGCGTCTACAC
CCACGTAACGGGCGCACCGTTGGGGGGCCATGCGGTCAAgtaagcgaggg
gaaggggggaggaggaggaggaggaggaggaggaggaggaggaggaggag
gaggaggaggaggaggaggaggatagacttccggacctcattctattccc
tttcgtttttcaagtcattcacttagtttcatattccaaaaaaaaactct
ttctgcgcagGTTGATTGGGTGGGGCGTGGAGAACGGGACACCGTACTGG
CTCGTCGTCAACTCGTGGAACAAGgtacgatgggagagatagggagggat
ggagggatggagggagggagggggggagggaaggttgtcttttcgtcatt
ttttgcgtttgttgttttcttggcgattttttgtccgggtgacatttttg
tcgaattcgcgtgtcgcttacagaaggccactcatgccgactcaaacgtg
gaaacatgcatacaccatgtacattcccagGCATGGGGTGATGAGGGTCT
TTTCCGCATTATCCGGGGACAGGACGAATGCGGATTTGAATCGGATATCA
CGGCAGGCAGTGTCTAATAAAATCGAATCGGTTTCTTTAAAATCACAAGG
AGGGGGGAAAGTGAGTAATCCAAGTGAGTTTTCCTCCCAACGACCAGCCG
AGATTTTGGCTGCTGTTACGGAGTGCTTGCTTTTGAAAGAGAGATGCTTA
GAACGTTGAGAGGGAAAAGATGAAAAGAGCGAGACACAGCAAGTGTTTCG
GTGCCCCTTGAAGAGTAGACGAAAATATCGAAGAAGGAAATAAAAGAGGG
GTGTAGAAGGACAAGGCATCCTGATTATCAGCATCGTCATAGTCATCATC
ATAATCATGATCAGCGACCCCGCGGGAGCGGCGCAGATTCGGATGTAGGC
AGCGAGAGAGTGGAGTACATAAGCGACACGCAAAAAGGAGAAAAATGAGG
GAATTAAAGGAGGAGAGAAAAAACATAAAATCGATTCAGGAGACGAGATC
AAGACGTTGGTTTCCGGAGCT
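
The gene sequence above mixes uppercase and lowercase runs. The record does not say what the case encodes; the lowercase runs start with `gt` and end with `ag`, which is consistent with introns, so the sketch below treats lowercase as intronic and uppercase as exonic or UTR purely as an assumption. `split_by_case` is a hypothetical helper that reports each run with 1-based, inclusive coordinates relative to the gene sequence.

```python
from itertools import groupby

def split_by_case(seq: str):
    """Split a sequence into maximal uppercase/lowercase runs.

    Returns a list of (kind, start, end) tuples, where kind is 'upper' or
    'lower' and start/end are 1-based inclusive positions in seq.
    """
    segments = []
    pos = 0
    for is_upper, run in groupby(seq, key=str.isupper):
        length = sum(1 for _ in run)
        segments.append(("upper" if is_upper else "lower", pos + 1, pos + length))
        pos += length
    return segments

# Toy input, not taken from the record: two uppercase runs around one lowercase run.
for kind, start, end in split_by_case("AAAgtttagGG"):
    print(kind, start, end)
```

To apply it to the record, join the sequence lines (without the `>` header line) into a single string first; the lowercase segments then give candidate intron coordinates under the stated assumption.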

protein sequence of NO03G03280.1

>NO03G03280.1-protein ID=NO03G03280.1-protein|Name=NO03G03280.1|organism=Nannochloropsis oceanica|type=polypeptide|length=560bp
MPAKAVHTCVPGSHWKEIQAGAVLSPSGALQCGGARESRDGNQISNEYAD
PHSSTHTTGILTHNAHPRRHDTAKMHFFGRSLALAVTVGVTAAVDVPFSS
CSSSDALGVTKLVVSEWPIHPGHPTTFTVVFNPTKDIVSGTKLSAAVMIK
NMQVFEHSVDMCSQTSLTCPLKAGEAVQVSATQTLPEGIPPVKDLKAVVQ
ASDATGVISCTEVEIELAVGEYTLEHLGRLPAIDSSLVTDVLMSGTTWTP
HFSPRFAGANLKQVQGMMGTWLRGHPLHMTLPYKDQVFGVTDKGDNVTIP
ESFDSREAWPACSEVIGKVKDQSACGSCWAFASSAAFEDRQCIVHGSKDL
LSPTDTLACCSGMACGFSQGCSGGQPAGAWAFFASQGVVSGGLFEEVGSG
DSCLPYPFLTCAHHVEPTEELPACPAEDFETPKCKHSCSEEEFEKDYKED
KRMGTAGYAVRAKVEEIQKEIMENGPVSAAFTVYQDFLAYSGEGVYTHVT
GAPLGGHAVKLIGWGVENGTPYWLVVNSWNKAWGDEGLFRIIRGQDECGF
ESDITAGSV*
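
The FASTA headers in this section pack metadata into `key=value` pairs separated by `|`. A minimal sketch of reading them follows; `parse_fasta_header` and `residue_count` are hypothetical helpers, and the layout they expect is inferred only from the two headers shown on this page. Note that the `length` field carries a `bp` suffix even for the polypeptide, and the protein sequence ends with a `*` stop symbol, so the sketch keeps `length` as a raw string and counts residues separately.

```python
def parse_fasta_header(header: str):
    """Split a header of the form '>name key=value|key=value|...'.

    Returns (name, fields), where fields maps each key to its raw string value.
    """
    name, _, rest = header.lstrip(">").strip().partition(" ")
    fields = {}
    for item in rest.split("|"):
        if "=" in item:
            key, value = item.split("=", 1)
            fields[key] = value
    return name, fields

def residue_count(sequence_lines):
    """Count residue letters across wrapped lines, ignoring a trailing '*'."""
    seq = "".join(line.strip() for line in sequence_lines)
    return len(seq.rstrip("*"))

header = (">NO03G03280.1-protein ID=NO03G03280.1-protein|Name=NO03G03280.1|"
          "organism=Nannochloropsis oceanica|type=polypeptide|length=560bp")
name, fields = parse_fasta_header(header)
print(name, fields["type"], fields["length"])
```

Comparing `residue_count` on the sequence lines with the header's `length` value shows whether the stop symbol was included in the reported length.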