NO22G02190, NO22G02190 (gene) Nannochloropsis oceanica

Overview
Name: NO22G02190
Unique Name: NO22G02190
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 4111 bp
Alignment location: chr22:688595..692705 +
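
The sequence length and the alignment location agree if the coordinates are read as 1-based and inclusive. That reading is an assumption about this database's convention; the record does not state it. A minimal Python sketch of the check follows (the names `parse_location` and `span_length` are illustrative and do not belong to any database API):

```python
import re

def parse_location(loc: str):
    """Split a location such as 'chr22:688595..692705 +' into its parts."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)\s*([+-])", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand

def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1

seqid, start, end, strand = parse_location("chr22:688595..692705 +")
print(seqid, strand, span_length(start, end))  # prints: chr22 + 4111
```

The computed span of 4111 matches the reported sequence length, which supports the inclusive reading.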


Properties
Property Name    Value
Description      Aspartic protease
Alignments
The following features are aligned
Aligned Feature    Feature Type    Alignment Location
chr22              genome          chr22:688595..692705 +
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                    Date Performed
PRJNA769958                                      2024-08-13
GSE178672                                        2023-12-15
PRJNA933693                                      2023-10-26
PRJNA943449                                      2023-07-19
PXD016054                                        2021-01-14
PXD016699                                        2021-01-08
GSE149904                                        2020-10-10
NoIMET1_WT_HS                                    2020-09-29
PXD010030                                        2020-03-16
GSE139615                                        2020-03-11
PRJNA241382                                      2020-03-11
BLASTP analysis of N. oceanica IMET1 genes       2019-07-11
InterPro analysis for N. oceanica IMET1 genes    2019-07-11
GO annotation for IMET1v2 genes                  2019-07-10
PXD008721                                        2019-04-30
PRJNA182180                                      2019-04-25
Gene prediction for N. oceanica IMET1            2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term          Definition
GO:0006508    proteolysis
Vocabulary: Molecular Function
Term          Definition
GO:0004190    aspartic-type endopeptidase activity
GO:0016787    hydrolase activity
GO:0008233    peptidase activity
Vocabulary: INTERPRO
Term         Definition
IPR021109    Peptidase_aspartic_dom
IPR033121    PEPTIDASE_A1
IPR001461    Aspartic_peptidase
Homology
BLAST of NO22G02190 vs. NCBI_GenBank
Match: EWM22250.1 (aspartic protease [Nannochloropsis gaditana])

HSP 1 Score: 294.3 bits (752), Expect = 6.600e-76
Identity = 155/331 (46.83%), Positives = 207/331 (62.54%), Query Frame = 0
Query:   86 QGEIILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSCTGKSLFRAADSSTFREVGDEFKVRYGSGTVKGFFSEDSMTVAGFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELTVGGVNHDRYEGELAYTPLKNASYWAVALGTVVSLNGAGHRQEYHKATTAIIDSGTSLLVGPEEEVKGIARSMGAV-FNHWEDVWVLDDCLHRDIPSLTFNIGGKEYLIEKEDYIFYDPSSNQCVLGMITFPY-----PIWILGDVFMRRYYSVFDYENERLGLAPA 411
            +G+II+ +  N +YFGEI +GS  Q F VIFDTGSSN W+PS  C  SC  K+ +    S T+ E G  FK+ YGSG V+G+FSED + + G  I +Q FAEVTDASGLG  +  G FDGILG+GF+S++VG V TP  NL+ QGL+ E VF+F+LG + PGELT+GG +   Y+G++ Y PLK+A+YW VAL  V       H        +AIIDSGTSL+ GP++EV  +A+ +GA  F   E    L DC +   P + F I G+ Y +  +DY   D     C+   +         P+WILGD FMRRYY+ F+YE + +GLA A
Sbjct:   53 EGDIIISDVSNAQYFGEISVGSDRQAFQVIFDTGSSNLWIPSEDCLASCASKAKYDHDASDTYVENGAIFKIMYGSGPVQGYFSEDDVELGGLTISQQAFAEVTDASGLGAAFAAGSFDGILGLGFDSISVGKVTTPFHNLIKQGLVAEPVFSFYLGDNAPGELTLGGTDPAHYKGDIHYVPLKSATYWEVALEDV-----QVHGLSLTNVDSAIIDSGTSLITGPKKEVAKLAKLVGATRFILGE---YLIDC-NAQGPDIDFVIDGRTYSLSADDYKIRD--GGLCLFAFMGLDIPRPSGPLWILGDSFMRRYYTTFNYEKQTVGLALA 372          
BLAST of NO22G02190 vs. NCBI_GenBank
Match: CBJ29800.1 (aspartyl protease [Ectocarpus siliculosus])

HSP 1 Score: 292.7 bits (748), Expect = 1.900e-75
Identity = 146/329 (44.38%), Positives = 215/329 (65.35%), Query Frame = 0
Query:   86 QGEIILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSCTGKSLFRAADSSTFREVGDEFKVRYGSGTVKGFFSEDSMTVAGFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELTVGGVNHDRYEGELAYTPLKNASYWAVALGTVVSLNGAGHRQEYHKATTAIIDSGTSLLVGPEEEVKGIARSMGAV-FNHWEDVWVLDDCLHRDIPSLTFNIGGKEYLIEKEDYIFYDPSSNQCVLGMITFPY-----PIWILGDVFMRRYYSVFDYENERLGLA 409
            +G++I+K+YQN +Y+G+++IG+  Q+F VIFDTGS+N WV   KC LSC   S + A+ SST  E G +F++ Y SG V G  S D++T  G +++ Q FAEV DA GLG  + +G+FDGI+G+ F+ ++V GV TP   L+++G +++ VFAF+LG  + GEL +GG + D Y  E+ Y P+    YW + +   V ++G+          +AI+DSGTSLLVGP+E+VK IA  +GA+ F + E    L  C   D+P LTF IGGKEY +E ++Y+    +   C+L ++         P+WILGDVFMR+YY+VFDY N ++GLA
Sbjct:   88 EGKVIVKDYQNAQYYGQVEIGTPPQSFEVIFDTGSANLWVAGSKCGLSCGLHSRYAASKSSTHAEDGRDFEITYASGPVSGSLSADTVTWGGIQLKDQTFAEVQDAKGLGLAFILGKFDGIMGLAFDEISVEGVPTPFGRLVEEGELDDAVFAFYLGNQKEGELIIGGTDPDHYLHEINYVPVTKKGYWQIDMDN-VDVSGS----SVTSVKSAILDSGTSLLVGPKEDVKKIASKVGAISFMNGE---YLMPC-SSDLPPLTFTIGGKEYTLEGDEYVISAGNDKVCILAIMGMDIPEPMGPLWILGDVFMRKYYTVFDYGNAQIGLA 407          
BLAST of NO22G02190 vs. NCBI_GenBank
Match: XP_009034996.1 (hypothetical protein AURANDRAFT_23053 [Aureococcus anophagefferens] >EGB10172.1 hypothetical protein AURANDRAFT_23053 [Aureococcus anophagefferens])

HSP 1 Score: 274.6 bits (701), Expect = 5.400e-70
Identity = 146/332 (43.98%), Positives = 208/332 (62.65%), Query Frame = 0
Query:   88 EIILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSCTGKSLFRAADSSTFREVGDEFKVRYGSGTVKGFFSEDSMTVAGFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELTVGGVNHDRYEGELAYTPLKNASYWAVAL-GTVVSLNGAGHRQEYHKATTAIIDSGTSLLVGPEEEVKGIARSMGAVFNHWEDVWVLDDCLHRDIPSLTFNIGGKEYLIEKEDYIFYDPSSNQCVLGM--ITFPY---PIWILGDVFMRRYYSVFDYENERLGLAPARAT 414
            +II+K+YQN +Y+GEI +G+  Q  +V+FDTGSSN WVP+ K  LS    +++    SST+ + G EFK++YGSG V G +S D++ +  + +   +FAEV D SGLG  Y++G FDGILG+ +  ++V GV TP+  L+  G +++ VFAF LG D  GEL +GGV+  +YEG+ +Y PL   SYW V L G  V  +G         A  AI+DSGTSLL GP  EVK IA  +GA     ++  +  D    DI    F +GGK+Y +  +DY+  D  + QC+ GM  I  P    P+WILGDVFMR+YY  FD + E++G+A A+++
Sbjct:   24 DIIIKDYQNAQYYGEISVGTPPQNVAVVFDTGSSNLWVPNKKPFLS--KHAIYENKKSSTYVKNGTEFKIQYGSGPVSGEYSRDTVAIGDYSVANYLFAEVDDTSGLGIGYRLGHFDGILGLAWGGISVDGVPTPLEALVASGQLDDEVFAFSLGDDADGELVIGGVDDSKYEGDFSYVPLSQKSYWEVTLDGLAVGASG-----NMTTAVKAIVDSGTSLLAGPTAEVKKIAEQIGAKSVLGKEYTIDCDAKADDI---VFTLGGKDYALALKDYVIED--AGQCLFGMMGIDIPAPNGPLWILGDVFMRKYYVKFDIKGEQIGIATAKSS 343          
BLAST of NO22G02190 vs. NCBI_GenBank
Match: CEL99302.1 (unnamed protein product [Vitrella brassicaformis CCMP3155])

HSP 1 Score: 274.6 bits (701), Expect = 5.400e-70
Identity = 148/365 (40.55%), Positives = 216/365 (59.18%), Query Frame = 0
Query:   65 LTPMSMEDLAGGDKAVSAPRIQGEIILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSCTGKSLFRAADSSTFREVGDEFKVRYGSGTVKGFFSEDSMTVAGFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADE--PGELTVGGVNHDRYEGELAYTPLKNASYWAVALGTVV----SLNGAGHRQEYHKATTAIIDSGTSLLVGPEEEVKGIARSMGA---VFNHWEDVWVLDDCLHRDIPSLTFNIGGKEYLIEKEDYIFY--DPSSNQCVLGMITFPY----PIWILGDVFMRRYYSVFDYENERLGLAPARATG 415
            + P+       GD   +    + +I L +Y+N +YFG I +GS  Q F++IFDTGSSN WVPS  C  SC   S +    SS++ + G  F + YGSG V GF S D++ V   ++++  FAEVTDASGLG  Y +G+FDGILG+GF S++V G++ P   ++ +GL++E VFAF+LG      GEL +GG++   Y GE+ Y PL   +YW V L +++    S+N AGH         AI+DSGTS++ GP + V  +A+ +GA     N  E  WV++      +P+++F +GGK + +  +DY     D     C+            P+WILGDVFMR+YYS+FDY N+R+GLA A + G
Sbjct:  110 VAPLPRNRTLNGDLLTAFRAGEADIDLHDYENAQYFGSISLGSNEQEFTMIFDTGSSNVWVPSALCDRSCGSHSKYDHKSSSSYAKDGRNFHIVYGSGPVSGFLSSDALNVGDIQLKEYTFAEVTDASGLGLAYSIGKFDGILGLGFPSISVDGIEPPFVTMVKRGLVKEPVFAFYLGTANGMDGELVLGGIDPKHYTGEIHYVPLAAENYWTVRLESLLVGSDSIN-AGH--------VAIVDSGTSIMAGPSKLVDALAKKVGAHRFFLNPQE--WVVNCKNIPTMPNISFELGGKMFELTPQDYTLKLGDSPFLPCLFAFQGIDLGNGPPLWILGDVFMRKYYSIFDYGNKRIGLAKAASGG 463          
BLAST of NO22G02190 vs. NCBI_GenBank
Match: OLP88390.1 (Lysosomal aspartic protease [Symbiodinium microadriaticum])

HSP 1 Score: 271.2 bits (692), Expect = 6.000e-69
Identity = 158/413 (38.26%), Positives = 235/413 (56.90%), Query Frame = 0
Query:    6 VLLALYAGCAAAVVQIPLKRMEPTENRMKAALTRIKSRVDSAN---KQAMARLRGSGEDPALLTPMSMEDLAGGDKAVSAPRIQGEIILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSCTGKSLFRAADSSTFREVGDEFKVRYGSGTVKGFFSEDSMTVAGFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELTVGGVNHDRYEGELAYTPLKNASYWAVALGTVVSLNGAGHRQEYHKATTAIIDSGTSLLVGPEEEVKGIARSMGAVFNHWEDVWVLDDCLHRDIPSLTFNIGGKEYLIEKEDYIFYDPSSNQCVLGM--ITFPY---PIWILGDVFMRRYYSVFDYENERLGLAPA 411
            V+ A      AAV+++PL++ E             +  +DS N   ++  ARL   G+DP                          +I+ +Y N +YFGEI++G+  Q   V+FDTGSSN WVP+ K  LS    +++  + SST+++ G  F ++YGSG V G FS D + +   +++   FAEV   SGLG  Y +G+FDGILG+G++S++VG VKTPM  L++ G + + +FAF+LG ++PGEL  GGV+   Y G+ ++ PL +A+YW + L  V                +AI+DSGTSLL GP+++V  IA  +GA     ++  V  DC  + +P LTF +GGK+Y + + D I    S +QC+LG+  I  P    P+WILGDVFMR+YY  FD+  +RLG A A
Sbjct:  318 VVAAASTVAVAAVIRVPLQKRE----------LSFEETLDSVNMGVQKWEARLAARGDDP--------------------------VIIDDYMNAQYFGEIEVGTPGQKEMVVFDTGSSNLWVPNHKPFLS--SHNIYDHSKSSTYKKNGTTFAIQYGSGPVSGVFSADDVAIGDLKLKDYTFAEVDKTSGLGIGYILGKFDGILGLGWDSISVGHVKTPMKALVESGKLPKPIFAFYLGNNQPGELLFGGVDPKHYSGDFSFVPLSSATYWQIKLDAVKL-----GSDSVSSVKSAIVDSGTSLLAGPKDDVAKIAAKLGAKSILGKEYVV--DCSAK-LPDLTFTLGGKDYTLSQPDLIL-QQSGSQCILGLTGIDVPAPRGPLWILGDVFMRKYYVQFDWGQQRLGFAKA 683          
BLAST of NO22G02190 vs. NCBI_GenBank
Match: XP_005853900.1 (cathepsin D [Nannochloropsis gaditana CCMP526] >EKU22454.1 cathepsin D [Nannochloropsis gaditana CCMP526])

HSP 1 Score: 266.9 bits (681), Expect = 1.100e-67
Identity = 155/385 (40.26%), Positives = 208/385 (54.03%), Query Frame = 0
Query:   86 QGEIILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSCTGKSLFRAADSSTFREVGDEFKVRYGSGTV------------------------------------------------------KGFFSEDSMTVAGFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELTVGGVNHDRYEGELAYTPLKNASYWAVALGTVVSLNGAGHRQEYHKATTAIIDSGTSLLVGPEEEVKGIARSMGAV-FNHWEDVWVLDDCLHRDIPSLTFNIGGKEYLIEKEDYIFYDPSSNQCVLGMITFPY-----PIWILGDVFMRRYYSVFDYENERLGLAPA 411
            +G+II+ +  N +YFGEI +GS  Q F VIFDTGSSN W+PS  C  SC  K+ +    S T+ E G  FK+ YGSG V                                                      +G+FSED + + G  I +Q FAEVTDASGLG  +  G FDGILG+GF+S++VG V TP  NL+ QGL+ E VF+F+LG + PGELT+GG +   Y+G++ Y PLK+A+YW VAL   V + G           +AIIDSGTSL+ GP++EV  +A+ +GA  F   E    L DC +   P + F I G+ Y +  +DY   D     C+   +         P+WILGD FMRRYY+ F+YE + +GLA A
Sbjct:   53 EGDIIISDVSNAQYFGEISVGSDRQAFQVIFDTGSSNLWIPSEDCLASCASKAKYDHDASDTYVENGAIFKIMYGSGPVQVESPTSLPPLPAFFGLPHASFLSPPFTTFVIIAYFEATVLNWCCPLLSMFSFWQGYFSEDDVELGGLTISQQAFAEVTDASGLGAAFAAGSFDGILGLGFDSISVGKVTTPFHNLIKQGLVAEPVFSFYLGDNAPGELTLGGTDPAHYKGDIHYVPLKSATYWEVALED-VQVQGL----SLTNVDSAIIDSGTSLITGPKKEVAKLAKLVGATRFILGE---YLIDC-NAQGPDIDFVIDGRTYSLSADDYKIRD--GGLCLFAFMGLDIPRPSGPLWILGDSFMRRYYTTFNYEKQSVGLALA 426          
BLAST of NO22G02190 vs. NCBI_GenBank
Match: OEU16065.1 (Asp-domain-containing protein [Fragilariopsis cylindrus CCMP1102])

HSP 1 Score: 264.2 bits (674), Expect = 7.300e-67
Identity = 137/332 (41.27%), Positives = 205/332 (61.75%), Query Frame = 0
Query:   90 ILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALS----CTGKSLFRAADSSTFREVGDEFKVRYGSGTVKGFFSEDSMTVA-GFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELTVGGVNHDRYEGELAYTPLKNASYWAVALGTVVSLNGAGHRQEYHKATTAIIDSGTSLLVGPEEEVKGIARSMGAVFNHWEDVWVLDDCLHRDIPSLTFNIGGKEYLIEKEDYIFYDPSSNQCVLGMITFPY-----PIWILGDVFMRRYYSVFDYENERLGLAPAR 412
            I+K+Y N +YF  ++IG+  Q+F VI+DTGSSN WVP + C        + K  +  + SS++ E G++F++ YGSG+VKGFFS+D +T+A    I+ Q FAEVTDA GLG  Y +G+FDGILG+GF S+++GG  T   N + Q  +++ +FAF+LG + PGELT GG +  ++EGEL Y  L+ A+YW + L  +   +      + +K   AI+DSGTSL+VGP+ E+  +A S+GA  N   + + +D     D+P + F IGG EY I     +    +   C+   +         P+WILGDVFMR YY+VF+  ++ +G A A+
Sbjct:   76 IIKDYGNAQYFAVVEIGTPPQSFEVIYDTGSSNLWVPKVGCTHCGLPFISHKKKYDESKSSSYEEDGEDFEIMYGSGSVKGFFSKDDITLAEDIIIDAQDFAEVTDAGGLGVAYSLGKFDGILGLGFSSISIGGKTTVFENAIKQNKVDQPIFAFYLGDNGPGELTFGGYDSSKFEGELTYVKLEAATYWEITLDEIACGDYKKEGSDDNK-IKAIVDSGTSLIVGPKAEIAALATSIGAKANIMGE-YTIDCAKVPDMPDVVFTIGGIEYSIPGPKTVI--QAQGTCLFAFMGLEIPPPAGPLWILGDVFMREYYTVFNVHDKTIGFAKAK 403          
BLAST of NO22G02190 vs. NCBI_GenBank
Match: GAX11609.1 (cathepsin D [Fistulifera solaris])

HSP 1 Score: 262.7 bits (670), Expect = 2.100e-66
Identity = 159/417 (38.13%), Positives = 232/417 (55.64%), Query Frame = 0
Query:    5 LVLLALYAGCAAAVVQIPLKRMEPTENRMKAALTRIKSRVDSANKQAMARLRGSGEDPALLTPMSMEDLAGGDKAVSAPRIQGEIILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSCTG-----KSLFRAADSSTFREVGDEFKVRYGSGTVKGFFSEDSMTVA-GFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELTVGGVNHDRYEGELAYTPLKNASYWAVALGTVVSLNGAGHRQEY--HKATTAIIDSGTSLLVGPEEEVKGIARSMGAVFNHWEDVWVLDDCLHRD-IPSLTFNIGGKEYLIEKEDYIFYDPSSNQCVLGMITFPY--PIWILGDVFMRRYYSVFDYENERLGLAPA 411
            LV+ A +    AAV +IPL +    E      L   +   ++A +Q    LRGS          S  +                 I+ +Y N +YFG I IG+  Q+F VIFDTGSSN WVP + C   C       K+ +    S+T+     +F++ YGSG+V GFFS DS+T+A    I +Q FAE++DA GLG  + +G+FDGILG+GF S++V    T M N L Q +I++ +F+F+LG + PGELT GG +  ++ G+L Y  L +A+YW +AL +V +  G  H  E       TAI+DSGTSL+ GP++++  IA+++GA  N      +  DC   D IP + F I G+EY++  +  +     +       + FP   P WILGDVFMR+YY+VF+Y +E +G APA
Sbjct:    9 LVISATWTAVTAAVTKIPLHKRPDDELIQAYHLREQQYNYETAQRQ----LRGSSNSDTSFLERSESE-----------------IINDYANAQYFGSISIGTPPQSFKVIFDTGSSNLWVPKVGCT-HCGNPFFGKKAKYNHDSSTTYESDNADFEIMYGSGSVSGFFSVDSVTLADDLVITEQRFAEISDAGGLGLAFALGKFDGILGLGFRSISVDDTPTVMDNALKQNVIDQPIFSFYLGDNGPGELTFGGYDSSKFTGDLQYVKLLSATYWEIALDSVSA--GPYHSSENADQSPITAIVDSGTSLITGPKKDIAAIAQAIGAKPNVMGQYTI--DCNTVDAIPDIVFTIDGREYMVPGDKAVLQAQGTCLFAFMGVDFPKPGPQWILGDVFMRQYYTVFNYLDETVGFAPA 399          
BLAST of NO22G02190 vs. NCBI_GenBank
Match: XP_002292244.1 (predicted protein [Thalassiosira pseudonana CCMP1335] >EED90219.1 predicted protein [Thalassiosira pseudonana CCMP1335])

HSP 1 Score: 262.3 bits (669), Expect = 2.800e-66
Identity = 136/331 (41.09%), Positives = 207/331 (62.54%), Query Frame = 0
Query:   89 IILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSC------TGKSLFRAADSSTFREVGDEFKVRYGSGTVKGFFSEDSMTVA-GFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELTVGGVNHDRYEGELAYTPLKNASYWAVALGTVVSLNGAGHRQEYHKATT-AIIDSGTSLLVGPEEEVKGIARSMGAVFNHWEDVWVLDDCLHRDIPSLTFNIGGKEYLIEKEDYIFYDPSSNQCVLGMITFPYPI---WILGDVFMRRYYSVFDYENERLGLA 409
            +I+K+Y N +Y+GE+ IG+  Q F+V+FDTGSSN WVP + C  +C       GK+ F  + S++++  G +F ++YGSG V+G+FS D++T+A    I  Q FAEV++A GLG  Y MGQFDGILG+GFE L++GG KT   N +DQ ++ + VFAF LG +  GELT+GG +  +++G++ + PL    YW + +  + +         Y   TT  I+DSGTSL+ GP   +  IA S+GA+ N     + +D     ++P L F I G+ + +  +D +    S+  C+  M+    P    WILGDVFMR++Y++FDYEN+++GLA
Sbjct:   14 VIIKDYSNAQYYGEVMIGTPPQKFTVVFDTGSSNLWVPKVNCQ-NCGYWFIHGGKNKFDNSQSTSYKADGSDFHIQYGSGDVQGYFSVDTVTLADDIVITDQKFAEVSNAGGLGVGYIMGQFDGILGLGFEGLSLGGAKTVFKNAIDQKVVAQPVFAFSLGDNADGELTLGGYDDSKFKGDITWIPLSEPKYWQIDIEDITA-------GSYSSGTTNGIVDSGTSLITGPSTSIIKIALSVGAMPNIMGQ-YTIDCAKVPNLPDLEFKINGQVWKVPGKDLVI--ESAGTCLFAMMGMDIPTGPQWILGDVFMRKFYTIFDYENQKVGLA 333          
BLAST of NO22G02190 vs. NCBI_GenBank
Match: GAX28975.1 (cathepsin D [Fistulifera solaris])

HSP 1 Score: 260.4 bits (664), Expect = 1.100e-65
Identity = 157/406 (38.67%), Positives = 230/406 (56.65%), Query Frame = 0
Query:   16 AAVVQIPLKRMEPTENRMKAALTRIKSRVDSANKQAMARLRGSGEDPALLTPMSMEDLAGGDKAVSAPRIQGEIILKNYQNVEYFGEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSCTG-----KSLFRAADSSTFREVGDEFKVRYGSGTVKGFFSEDSMTVA-GFRIEKQIFAEVTDASGLGRTYQMGQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELTVGGVNHDRYEGELAYTPLKNASYWAVALGTVVSLNGAGHRQEY--HKATTAIIDSGTSLLVGPEEEVKGIARSMGAVFNHWEDVWVLDDCLHRD-IPSLTFNIGGKEYLIEKEDYIFYDPSSNQCVLGMITFPY--PIWILGDVFMRRYYSVFDYENERLGLAPA 411
            AAV +IPL +  P E  ++A   R +  +    + A  +LRGS +        S  +                 I+ +Y N +YFG I IG+  Q+F VIFDTGSSN WVP + C   C       KS +    S+T+     +F++ YGSG+V GFFS DS+T+A    + +Q FAE+ DA GLG  + +G+FDGILG+GF S++V    T M N L Q +I++ +F+F+LG + PGELT GG +  ++ G+L Y  L +A+YW +AL +V +  G  H  E       TAI+DSGTSL+ GP++++  IA+++GA  N      V  DC   D IP + F I G++Y++  +  +     +       + FP   P WILGDVFMR+YY+VF+Y +E +G APA
Sbjct:    5 AAVTKIPLHK-RPDEELIQAYHLREQQYI---YETAQRQLRGSSDSDTSFLERSESE-----------------IINDYANAQYFGSISIGTPPQSFKVIFDTGSSNLWVPKVGCT-HCGNPFFGKKSKYNHDSSTTYESDNADFEIMYGSGSVSGFFSVDSVTLADDLVVTEQRFAEIADAGGLGLAFALGKFDGILGLGFRSISVDDTPTVMDNALKQNVIDQPIFSFYLGDNGPGELTFGGYDSSKFTGDLQYVKLLSATYWEIALDSVSA--GPYHSSENADQSPITAIVDSGTSLITGPKKDIAAIAQAIGAKPNVMGQYTV--DCNTVDAIPDIVFTIDGRDYMVPGDKAVLQAQGTCLFAFMGVDFPKPGPQWILGDVFMRQYYTVFNYLDETVGFAPA 384          
The following BLAST results are available for this feature:
BLAST of NO22G02190 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name        E-value      Identity (%)    Description
EWM22250.1        6.600e-76    46.83           aspartic protease [Nannochloropsis gaditana]
CBJ29800.1        1.900e-75    44.38           aspartyl protease [Ectocarpus siliculosus]
XP_009034996.1    5.400e-70    43.98           hypothetical protein AURANDRAFT_23053 [Aureococcus...
CEL99302.1        5.400e-70    40.55           unnamed protein product [Vitrella brassicaformis C...
OLP88390.1        6.000e-69    38.26           Lysosomal aspartic protease [Symbiodinium microadr...
XP_005853900.1    1.100e-67    40.26           cathepsin D [Nannochloropsis gaditana CCMP526] >EK...
OEU16065.1        7.300e-67    41.27           Asp-domain-containing protein [Fragilariopsis cyli...
GAX11609.1        2.100e-66    38.13           cathepsin D [Fistulifera solaris]
XP_002292244.1    2.800e-66    41.09           predicted protein [Thalassiosira pseudonana CCMP13...
GAX28975.1        1.100e-65    38.67           cathepsin D [Fistulifera solaris]
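
The identity percentages in this table are the same ratios given in the HSP header lines: matched positions divided by alignment length, rounded to two decimals. The sketch below recomputes them from a header line as a consistency check. The regular expression and the function name `check_hsp_line` are illustrative assumptions, not part of BLAST or of this database.

```python
import re

# Matches fields of the form "Name = 155/331 (46.83%)" in an HSP header line.
HSP_FIELD = re.compile(r"(\w+) = (\d+)/(\d+) \(([\d.]+)%\)")

def check_hsp_line(line: str) -> dict:
    """Return {field name: (recomputed percent, reported percent)}."""
    results = {}
    for name, matched, aligned, reported in HSP_FIELD.findall(line):
        computed = round(100.0 * int(matched) / int(aligned), 2)
        results[name] = (computed, float(reported))
    return results

# Header values for the top hit (EWM22250.1), copied from this record.
line = "Identity = 155/331 (46.83%), Positives = 207/331 (62.54%), Query Frame = 0"
for name, (computed, reported) in check_hsp_line(line).items():
    print(f"{name}: computed {computed:.2f}%, reported {reported:.2f}%")
```

For the top hit, 155 of 331 aligned positions are identical, which gives the 46.83% shown in the table.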

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name    Unique Name    Species                                         Type
ncniR025        ncniR025       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region
ngnoR104        ngnoR104       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region


The following polypeptide feature(s) derive from this gene:

Feature Name    Unique Name             Species                                         Type
NO22G02190.1    NO22G02190.1-protein    Nannochloropsis oceanica (N. oceanica IMET1)    polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name                 Unique Name    Species                                            Type
jgi.p|Nanoce1779_2|596172    gene_7237      Nannochloropsis oceanica (N. oceanica CCMP1779)    gene
Naga_101670g3                gene9878       Nannochloropsis gaditana (N. gaditana B-31)        gene


The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Species                                         Type
NO22G02190.1    NO22G02190.1    Nannochloropsis oceanica (N. oceanica IMET1)    mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO22G02190 ID=NO22G02190|Name=NO22G02190|organism=Nannochloropsis oceanica|type=gene|length=4111bp
ATGAAGAGGGGACTCGTCCTCCTTGCGCTCTATGCGGGTTGCGCGGCCGC
TGTCGTCCAAATTCCTTTAAAGgtaggcaaattgagggatcgggaggacg
gcgggtgtggaaaggcagggaaagcgggaagcaagggaagaagatggtga
gggatgaacatgatatttgaatgaggagaaggccaccgtgcagggtgacg
gcggaagtgtctatcggaatgagagatatgcggtggaagatgcaaggagg
gaaacagagtgtttgaaagtggtttttaaagaggattcacatgtcataaa
gacctatgtgaagccccgcctctacttacgacgatcctccatccctccct
ctctattccccccccttccgtgttcacgtacagCGGATGGAGCCGACCGA
GAACAGGATGAAGGCGGCATTGACCAGGATCAAGAGgtgagagagggagg
catgggggagaaagggagagagggagggcagagagagcaggaaggaggga
agggagggtgtggaagacgtcggtgttttgaaagaggggcggcaacggag
tgtgtctgcctttgagcacatgcgagtcgcttttggagagaagatcgcgt
cttctcctgcaattcaggaaaagtcccgaatttgaagtctcctttttatc
acccctcactccctccctcccttcctccctccccccctccctcctttccc
ttttctccgtacgactacagCCGCGTCGACAGCGCCAACAAGCAGGCCAT
GGCCCGTCTCCGCGGATCGGGGGAGGACCCCGCCCTCCTCACCCCTATGA
GCATGGAAGACCTCGCAGgtaaaggagggagggagggagggagggaggga
gggagggagggagggagggagggagggagggagggagggagggagggagg
gagggagggagggagggagggagggagggagggagggagggagggaggga
gggagggagggaatgggggaaatggaggagaggtgtggggtcagttgatg
gagggcaattcggcagtcccctcattctcggggtctcagtagattgatac
ctgcgtgttttgttgatggtacattggaatttgcccctctcaggcaaggg
gagcgtgtacgggcctttctatctcgggctttcagcggtcttcctctctt
tttttttctctctctctcacacgcatacatcaacacatccacacatccac
acactcatcagGAGGGGACAAGGCCGTGAGCGCTCCCCGAATCCAGGGAG
AGATCATCCTCAAAAACTACCAGgtacgccctccctctctccctccctcc
ctccctccctccctccctccctccctccctccctccctccctacctcaac
gttacatttcatcactcttcctccctccctccctccctccctccctccct
ccctccctccagAATGTGGAGTACTTTGGCGAAATTCAGATCGGGTCACA
AAATCAAACTTTTTCGgtacgtccctccctccctccctccctccctccct
ccctccctccctccctccctccctccgtctctttccttgccgatgtccga
tcgtccctccttccttgcctccctccccccccctccgtcgtagGTCATCT
TCGACACGGGAAGCAGCAACACATGGGTACCGTCCATGAAGTGCGCCTTG
TCATGCACGGGCAAGAGCCTCTTTCGGGCCGCCGACAGCTCCACCTTCAG
GGAGGTCGGCGACGAGTTCAAGgtagccctccctccctccctccctccct
ccctccctccctccctccctccctccctccctccgtcgccaccctgtcca
gctcctcgagtcctcgaaatgactttcccgcctcccgtcgaaaacttttt
ttctgatggcgctcccctcccttccctccctccctgcctccctcgcccct
tagGTCCGTTACGGCTCGGGTACTGTCAAAGGCTTCTTCTCCGAGGACTC
CATGACCGTTGCGGGCTTTCGCATCGAGAAGCAAATATTCGCCGAAGTCA
CAGACGCCAGTGGCCTAGGGCGgtacgcaagcacttcctccttccctctc
cccctccctcccttgatcttttttctttcctccaattttctctttctctc
cttccattgtgtccgctccgtcgagcttgactaggaagacatccttacct
accccccatccacctccccactccccacctcccttcacgaccgccctcat
ccttccctccctccctcccgccctcccgccctccctcagCACCTACCAGA
TGGGCCAATTCGACGGTATCCTGGGCATGGGATTCGAGTCTCTGGCCGTC
GGGGGCGTCAAGACCCCGATGTTCAATCTCCTTGATCAGgtacgtcctcc
ctcccccccgccctcgccaccgtgctcgccttgccctccctccttctgtc
tccgtcccccctcgccaatttctttgttgttgcatagaagagacgagtgc
ccatttcacctccccttccctccctccctcagGGCCTGATCGAGGAGGGT
GTCTTCGCCTTCTTCCTAGGCGCCGACGAGCCGGGGGAGCTGACGGTGGG
GGGGGTCAATCACGATCGGTATGAGGGTGAGCTCGCCTACACGCCTCTCA
AGAACGCTTCCTATTGGGCAGTCGCCCTGgtacgtctgcccgacctcctt
gccttcctccctcgatcccagccttctctcttgcatgctcatcctctttc
tcttcttttcctgctgtcgcctccctccctccctccctccctccctccct
cctcaccagGGCACGGTGGTATCCTTGAACGGGGCGGGCCACCGCCAGGA
GTACCATAAGGCCACCACCGCCATCATTGACTCAGgtacattttctttcc
ctccctccctccctcccttccccccttccgctcgtgtgctcatccattcc
ttcatcatgactttcgttcgtccattcctccctccttccctccctttctt
cccccgtgccctcccttctcccagGCACTTCCCTTCTGGTCGGTCCCGAG
GAGGAGGTCAAGGGAATCGCAAGGgtaggagagcggaagcgagggaggga
gggagggagggagggaaggagggaggggaaggggggcaccatcacgcgtc
ctattccacctagacctaacgtacaccttcctccctccctccctccctcc
ctccctcaattagTCCATGGGCGCGGTGTTCAACCATTGGGAAGACGTGT
GGGTGCTTGACGACTGCCTGCACAGGGACATTCCTTCGCTCACCTTCAAC
ATCGGAGgtcgggagggagggaggggatgggaagagggatagacggaggg
acaagggaggttctccactcacacgactttcttccctccctccctccctc
cctccctcccccctgcctcagGCAAAGAGTACCTGATTGAGAAGGAGGAC
TACATTTTCTACGACCCGAGCTCGAATCAGTGCGTGCTGGGCATGATCAC
CTTCCCCTACCCCATCTGGATCCTCGGCGACGTGTTCATGCGGCGGTACT
ACTCCGTCTTCGACTACGAGAATGAGgtagacccttccctccctccctcc
ctcccgtcctcttttccctttgccgcctccctttgtgtgtagggttagag
tgtctgtcagtagcgcatctctcctcgcccccgtcctccctccctccctg
cctgcttccctcagcgcctcacccacacacactcattgaccccttcctcc
ctcccttcctccctcccttcctccctccctcctttagCGCCTCGGCCTCG
CCCCTGCCAGAGCAACGGGTTCGGCTCCTAACGCGACGGACGCTCGCGCA
GGCAAGCCCCCCGTGCCGAATGCCATGCCAGGgtaaggagggagggggga
gggcggggggacaggaattgattgggagtgaaagaagaggaaagcgggaa
ggagggaatgagggggagattcactgtcaagcttcaaccttgtcgctata
aacttacacaacgtaccttcccccctttgcgccctccctccctccctccc
tccctccctccctccttcctttcacatttctttctctccttcacttacag
CGGCGCGGGCGGCGACGGCGTCGCCGTTCCTTCGATGGAGAATGCAGATG
GCAAGAATTAA

protein sequence of NO22G02190.1

>NO22G02190.1-protein ID=NO22G02190.1-protein|Name=NO22G02190.1|organism=Nannochloropsis oceanica|type=polypeptide|length=455bp
MKRGLVLLALYAGCAAAVVQIPLKRMEPTENRMKAALTRIKSRVDSANKQ
AMARLRGSGEDPALLTPMSMEDLAGGDKAVSAPRIQGEIILKNYQNVEYF
GEIQIGSQNQTFSVIFDTGSSNTWVPSMKCALSCTGKSLFRAADSSTFRE
VGDEFKVRYGSGTVKGFFSEDSMTVAGFRIEKQIFAEVTDASGLGRTYQM
GQFDGILGMGFESLAVGGVKTPMFNLLDQGLIEEGVFAFFLGADEPGELT
VGGVNHDRYEGELAYTPLKNASYWAVALGTVVSLNGAGHRQEYHKATTAI
IDSGTSLLVGPEEEVKGIARSMGAVFNHWEDVWVLDDCLHRDIPSLTFNI
GGKEYLIEKEDYIFYDPSSNQCVLGMITFPYPIWILGDVFMRRYYSVFDY
ENERLGLAPARATGSAPNATDARAGKPPVPNAMPGGAGGDGVAVPSMENA
DGKN*
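
In the gene sequence above, the uppercase stretches appear to be coding exons and the lowercase stretches introns. This is an inference from the record, not something it states: the first uppercase bases translate to the first residues of the protein sequence. Under that assumption, a minimal Python sketch that joins the uppercase bases and translates them with the standard genetic code (the helper names `coding_sequence` and `translate` are illustrative):

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def coding_sequence(mixed_case_seq: str) -> str:
    # Assumption: uppercase bases are exonic (coding), lowercase are intronic.
    return "".join(ch for ch in mixed_case_seq if ch.isupper())

def translate(cds: str) -> str:
    # Translate complete codons only; a trailing partial codon is ignored.
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First two lines of the gene sequence from this record (exon 1, then the start of intron 1).
fragment = (
    "ATGAAGAGGGGACTCGTCCTCCTTGCGCTCTATGCGGGTTGCGCGGCCGC"
    "TGTCGTCCAAATTCCTTTAAAGgtaggcaaattgagggatcgggaggacg"
)
print(translate(coding_sequence(fragment)))  # prints: MKRGLVLLALYAGCAAAVVQIPLK
```

The result matches the first 24 residues of NO22G02190.1. Applying the same two functions to the full gene sequence would be expected to reproduce the complete protein if the case convention holds throughout, which this sketch does not verify.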