NO20G01650, NO20G01650 (gene) Nannochloropsis oceanica

Overview
NameNO20G01650
Unique NameNO20G01650
Typegene
OrganismNannochloropsis oceanica (N. oceanica IMET1)
Sequence length3824
Alignment locationchr20:500863..504686 +

Link to JBrowse

Properties
Property NameValue
Descriptionubiquitin-specific protease 23
Mutants
Expression

Hover the mouse over a column in the graph to view expression values.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
chr20genomechr20:500863..504686 +
Analyses
This gene is derived from or has results from the following analyses
Analysis NameDate Performed
GSE1786722023-12-15
PRJNA9336932023-10-26
PRJNA9434492023-07-19
GSE1499042020-10-10
NoIMET1_WT_HS2020-09-29
GSE1396152020-03-11
PRJNA2413822020-03-11
BLASTP analysis of N. oceanica IMET1 genes2019-07-11
InterPro analysis for N. oceanica IMET1 genes2019-07-11
GO annotation for IMET1v2 genes2019-07-10
PRJNA1821802019-04-25
Gene prediction for N. oceanica IMET12017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0036459thiol-dependent ubiquitinyl hydrolase activity
GO:0036459thiol-dependent ubiquitinyl hydrolase activity
GO:0016787hydrolase activity
Vocabulary: Biological Process
TermDefinition
GO:0016579protein deubiquitination
GO:0016579protein deubiquitination
GO:0006511ubiquitin-dependent protein catabolic process
Vocabulary: INTERPRO
TermDefinition
IPR038765Papain_like_cys_pep_sf
IPR028889USP_dom
IPR001394Peptidase_C19_UCH
Homology
BLAST of NO20G01650 vs. NCBI_GenBank
Match: EWM22068.1 (ubiquitin carboxyl-terminal hydrolase 36 [Nannochloropsis gaditana])

HSP 1 Score: 583.9 bits (1504), Expect = 1.100e-162
Identity = 418/826 (50.61%), Postives = 478/826 (57.87%), Query Frame = 0
Query:    1 MQHGKGKGGGFGRKGGADHVQSFKRLLQGKGQAKNGINDSSIFP-GGQRRRIEFHKARKPQNGFISLSTPITSXXXXXXXXXXXXXXXXXXXXXAR-------DVGAAAIGLPAPEKGLFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRFMFGSRSSKVGKGISYPEDLRLPVSGPERAAEYRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVKVVKPALPPFLPNFTGKRMAEMESGLKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQRLATPGWSQVLAKMREEDARAEDPTKDFMVGLRGKGGREEDEAGRLLSMVDHSWAQCKDLAMFMQQARDAEGGRKGWKERERGDDKKGEKEKDNSGIQQQQVGGDLKMKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXIPALDMGKIVLGYNFAGKFFRTRFSRRRWQRT 819
            M+H   +GG  G +G  D VQSFKRLLQG G   +G ++   +  G  RRRI+FHKARK +NGF+SLST  T                      AR       D  AAA GLP PEKG+FPESKI  LM+W+ V R GPGLYNLGNTCFLNATLQCLAY+PPLAQYF+                              LGLVRDLI NMHTG G+                                     + REG RSPISPKAIVGNL+ALN+HFRVGRQEDAHEFLRHL+DALQ+ CL+ A+VKSNA  RLAETTFVHRIFGGYLRSQV+CT CG  S+TYD+FLDLSLEIHGKVG LEEALARFTAVETLD+ANRWRC +C+ LVCA+K+L+ HTAPNVCTVQLKRFMFGSRSSK+ K ++YPE LRL +SGPE +A YRLAGVLVHAGASVNMGHY+S VKA NG WYEMDD+ V QVG+ TVL QHAYLLFYVK V P+ PP L                                                                        XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX+R ATPGWS+VL K+REEDA  ED T+    GL G  G             D +WAQ + LA+FM +A+           RE G+                    D+ ++                                             XXXXXXXXXXXXXXXXXXX    +PALD+G +V+ Y++AGKFF TRFSRRRW RT
Sbjct:    1 MKHRGQQGGLKGGRGTGD-VQSFKRLLQGPG--NDGAHNGQAWDIGKPRRRIKFHKARKVENGFVSLSTGTTKGPCPASGCLGGVVAGPLTGKKARERFSEAMDKDAAAPGLPTPEKGIFPESKIHPLMTWIRVERVGPGLYNLGNTCFLNATLQCLAYLPPLAQYFLSKKEEERVPSVLSPGAGQGHRGGGQARGGWLGLVRDLIANMHTGSGA----------------------------GGGDRGDGLARREGGRSPISPKAIVGNLKALNRHFRVGRQEDAHEFLRHLLDALQSGCLQAARVKSNAPGRLAETTFVHRIFGGYLRSQVRCTQCGHCSNTYDNFLDLSLEIHGKVGRLEEALARFTAVETLDRANRWRCPACSQLVCAQKRLSFHTAPNVCTVQLKRFMFGSRSSKLSKNVAYPESLRLSLSGPEGSARYRLAGVLVHAGASVNMGHYYSLVKAANGCWYEMDDAQVRQVGLPTVLHQHAYLLFYVK-VGPSPPPALDR----------------------------------------------------------------------XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRRAATPGWSEVLGKLREEDAGNEDHTE----GLEGGEG------------ADTAWAQPEGLALFMARAK-----------REAGEGGXXXXXXXXXXXXXXXXXEDVGVRPLPAPTNAAPVREEGSAPHTGTGPAHSLAPPHSTPEDHVVVSAEGXXXXXXXXXXXXXXXXXXXTSAFLPALDVGPVVMSYSYAGKFFLTRFSRRRWHRT 697          
BLAST of NO20G01650 vs. NCBI_GenBank
Match: XP_005852728.1 (hypothetical protein NGA_0518320 [Nannochloropsis gaditana CCMP526] >EKU23101.1 hypothetical protein NGA_0518320 [Nannochloropsis gaditana CCMP526])

HSP 1 Score: 572.0 bits (1473), Expect = 4.200e-159
Identity = 309/511 (60.47%), Postives = 350/511 (68.49%), Query Frame = 0
Query:    1 MQHGKGKGGGFGRKGGADHVQSFKRLLQGKGQAKNGINDSSIFP-GGQRRRIEFHKARKPQNGFISLSTPITSXXXXXXXXXXXXXXXXXXXXXAR-------DVGAAAIGLPAPEKGLFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRFMFGSRSSKVGKGISYPEDLRLPVSGPERAAEYRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVKVVKPALPPFL 504
            M+H   +GG  G +G  D VQSFKRLLQG G   +G ++   +  G  RRRI+FHKARK +NGF+SLST  T                      AR       D  AAA GLP PEKG+FPESKI  LM+W+ V R GPGLYNLGNTCFLNATLQCLAY+PPLAQYF+                              LGLVRDLI NMHTG G+                                     + REG RSPISPKAIVGNL+ALN+HFRVGRQEDAHEFLRHL+DALQ+ CL+ A+VKSNA  RLAETTFVHRIFGGYLRSQV+CT CG  S+TYD+FLDLSLEIHGKVG LEEALARFTAVETLD+ANRWRC +C+ LVCA+K+L+ HTAPNVCTVQLKRFMFGSRSSK+ K ++YPE LRL +SGPE +A YRLAGVLVHAGASVNMGHY+S VKA NG WYEMDD+ V QVG+ TVL QHAYLLFYVK V P+ PP L
Sbjct:    1 MKHRGQQGGLKGGRGTGD-VQSFKRLLQGPG--NDGAHNGQAWDIGKPRRRIKFHKARKVENGFVSLSTGTTKGPCPASGCLGGVVAGPLTGKKARERFSEAMDKDAAAPGLPTPEKGIFPESKIHPLMTWIRVERVGPGLYNLGNTCFLNATLQCLAYLPPLAQYFLSKKEEERVPSVLSPGAGQGHRGGGQARGGWLGLVRDLIANMHTGSGA----------------------------GGGDRGDGLARREGGRSPISPKAIVGNLKALNRHFRVGRQEDAHEFLRHLLDALQSGCLQAARVKSNAPGRLAETTFVHRIFGGYLRSQVRCTQCGHCSNTYDNFLDLSLEIHGKVGRLEEALARFTAVETLDRANRWRCPACSQLVCAQKRLSFHTAPNVCTVQLKRFMFGSRSSKLSKNVAYPESLRLSLSGPEGSARYRLAGVLVHAGASVNMGHYYSLVKAANGCWYEMDDAQVRQVGLPTVLHQHAYLLFYVK-VGPSPPPAL 479          
BLAST of NO20G01650 vs. NCBI_GenBank
Match: CBN80155.1 (Ubiquitin-specific protease 16 [Ectocarpus siliculosus])

HSP 1 Score: 316.2 bits (809), Expect = 4.100e-82
Identity = 171/400 (42.75%), Postives = 228/400 (57.00%), Query Frame = 0
Query:  102 IGLPAPEKGLFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRFMFGSRSSKVGKGISYPEDLRLPVSGPERAAEYRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVKVVKPALPP 502
            IGLP P K L+ +SKI++ + W  V R GPGL NLGNTCFLNATLQCL+Y+PPLAQ+ ++                             L  +      +H G+ +                                              ISPK +V NLR + + FR GRQEDAHEFLRHL+D + +  LK   VKS+A +RLAETT ++RIFGGYLRS++KCT CG  SDT+D F+DLS+++   V +++ AL RF A E L   N WRC  C   V A+K L+V   PN   +QLKRF+F  ++SK+   I + + L L VSGPER+A Y L GV+VHAG S++ GHY+++V++  GMW  MDD  VS+V  +TVLR  AY+LFY +  +PA  P
Sbjct:   40 IGLPPPGKELYLDSKIKACLRWKQVHRMGPGLRNLGNTCFLNATLQCLSYLPPLAQHLLKGFYGQGPQGIASRGPRPMGGFREFTKVEILAAMEQHTKQVHQGQFT------------------------------------------GLGAISPKVLVQNLRMIGRQFRQGRQEDAHEFLRHLLDKMVDCYLKRRGVKSSAPNRLAETTPINRIFGGYLRSRLKCTKCGHCSDTFDPFMDLSMDLSRGVRSIDVALRRFVATERLGSGNEWRCGGCKKPVQAEKSLSVFKPPNALVLQLKRFVFTRKASKIKDHIQFADVLNLSVSGPERSALYDLTGVVVHAGGSMSSGHYYAYVRSCAGMWSRMDDCSVSKVKRETVLRDQAYVLFYTR--RPAPKP 395          
BLAST of NO20G01650 vs. NCBI_GenBank
Match: CBN80156.1 (conserved unknown protein [Ectocarpus siliculosus])

HSP 1 Score: 291.2 bits (744), Expect = 1.400e-74
Identity = 163/416 (39.18%), Postives = 224/416 (53.85%), Query Frame = 0
Query:  102 IGLPAPEKGLFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVG---------TLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRFMFGSRSSKVGKGISYPEDLRLPVSGPERAAEYRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVKVVKPALP----PFLP 505
            I LP  E+ L+PES++R  ++W    + G GL N+GNTC+LN+ LQCL+Y+PPLAQ+ +                              LG ++ L+  +H  +                                             +  I P+    NLR + + FR GRQEDAHEFLRHL+D +  + L+   V   A +RLAETT +HR+FGGYLRSQ+KC+ CGF SDT+D F+DL++ +  KV          +L+ AL RFTA ETL   N W+C  CN LV A+K L+V   PN    QLKRF F +   KV   IS+ + L L VSGPER A Y L GV+VH+G +++ GHY+++V++  G W  M+DS V++V + TVLR  AY+LFY +   PA P    P LP
Sbjct:  105 IRLPTKEQELYPESQVRPNLTWRWPCQIGVGLENMGNTCYLNSILQCLSYVPPLAQHLLNGSYSQGSHATCSESPFSFNGSTDFCEDDILGAMQKLVGQIHQTKSG----------------------------------------SAEQQAIRPRTFSDNLRKIGEKFRRGRQEDAHEFLRHLVDKMAGSYLERRGVDPFAPNRLAETTPIHRVFGGYLRSQLKCSECGFCSDTFDPFMDLAMNVE-KVDSSGVAMNERSLQAALRRFTAPETLGAGNEWKCGGCNKLVEAEKNLSVFKPPNALVFQLKRFGFTNGPRKVKDHISFGDKLNLEVSGPERWANYDLTGVVVHSGKTMSSGHYYAYVRSSAGSWARMNDSVVTKVTLDTVLRDKAYVLFYTRRPPPAPPAVQRPILP 479          
BLAST of NO20G01650 vs. NCBI_GenBank
Match: ORY06429.1 (cysteine proteinase [Basidiobolus meristosporus CBS 931.73])

HSP 1 Score: 271.2 bits (692), Expect = 1.500e-68
Identity = 160/398 (40.20%), Postives = 212/398 (53.27%), Query Frame = 0
Query:  103 GLPAPEKGLFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRFMFG--SRSSKVGKGISYPEDLRLPVSGPERAAE-----YRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVK 494
            G   P K +F +   ++     GV   GPGL NLGNTCF+N+ LQCL Y PPLA Y                                L  + D +    +G+ +                                          SR  ISPK I G LR++ KHFR+GRQEDAHEF R+ +DA+Q +CL     K +A  R+ ETT +H+IFGGYL+SQVKC  CG+ S+T+D  LD+SLEI     ++E+A + FT  E L   NR++C  CN LV A+K++T++ +PN+ TVQLKRF +G      K+ K +S+ E L L  S   R  E     Y+L GVLVHAG S + GHY+SFVKA NG WY M+D  V  V + TVL+Q+AY+LFY K
Sbjct:   46 GFRFPTKTIFKKELSQTWGKEFGV---GPGLNNLGNTCFMNSVLQCLTYTPPLASYLFN---------------EGHKKTCKVPDFCALCEMEDHVTQCFSGKNT------------------------------------------SRGSISPKRIAGKLRSIAKHFRLGRQEDAHEFTRYFLDAMQKSCLHGYDPKLDA--RIKETTLIHKIFGGYLQSQVKCLSCGYESNTFDPMLDVSLEIR-NCPSIEKAFSLFTKPEMLTNDNRYKCEKCNRLVDAQKRMTMYDSPNILTVQLKRFSYGFSLHGGKISKPVSFSETLELK-SHMSRTKENSGTSYKLFGVLVHAGGSCHSGHYYSFVKAPNGSWYCMNDCSVEPVSLNTVLKQNAYMLFYAK 379          
BLAST of NO20G01650 vs. NCBI_GenBank
Match: PRP84401.1 (peptidase C19 family protein [Planoprotostelium fungivorum])

HSP 1 Score: 268.9 bits (686), Expect = 7.500e-68
Identity = 154/395 (38.99%), Postives = 214/395 (54.18%), Query Frame = 0
Query:  107 PEKGLFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLK-IAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRFMF-GSRSSKVGKGISYPEDLRL-PVSGPERAAE-----YRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVK 494
            P+  LFP   I+ ++ W  + R GPGL N+GNTCFLN+ +QCL Y PPLA + +                               G     ++  H  R                                         +   +S I+P++IV NLR+L+K F++GRQED+HEFLR++++ +Q + L  +         ++AET+ VHRIFGGYL+SQVKCT C + S+T+D FLDLSLEI   V +LE+AL  FT++E L+ AN+++CS C   V A K+ T+   P+V T+QLKRF F GS   K+ K +SYPE L + P   P   AE     Y+L  VLVH+G S   GHYF++VK+  G+W  M+DS V Q+    VL Q AY+LFYVK
Sbjct:   98 PQYSLFPLEDIKKMIQWTKILRAGPGLDNMGNTCFLNSVIQCLTYTPPLANFLMSRKHSQSCKIN--------------------GFCMFCVLEKHIIR---------------------------------------VFQNTKQSSITPQSIVTNLRSLSKQFKLGRQEDSHEFLRYVLEGMQKSSLHGLQATIGKVDGKIAETSIVHRIFGGYLQSQVKCTVCQYKSNTFDPFLDLSLEIKNCV-SLEKALHAFTSIEVLNGANKYKCSKCMKYVDAHKRFTIRKIPHVLTIQLKRFTFQGSFGGKIQKPVSYPETLDMKPYLDPRCVAESGSCQYKLYAVLVHSGESTRSGHYFAYVKSPAGIWNLMNDSQVQQINASRVLEQKAYILFYVK 432          
BLAST of NO20G01650 vs. NCBI_GenBank
Match: PIA19011.1 (cysteine proteinase, partial [Coemansia reversa NRRL 1564])

HSP 1 Score: 265.0 bits (676), Expect = 1.100e-66
Identity = 159/387 (41.09%), Postives = 208/387 (53.75%), Query Frame = 0
Query:  111 LFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRF-MFGSRSSKVGKGISYPEDLRLPV----SGPERAA-EYRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFY 492
            LFP  ++ +   W  +   GPGL NLGNTCFLN+ LQCL Y PPLA++                                  L R+       G                                        S REG  S ISPKAIVG L+ + KH RVGRQEDAHEFLR L+DA Q + L    +      R+ ETT  H++FGGYL+SQV C  CG+ S+T+D  LD+SL+I G   T+E+AL  FT  ETL  +NR+RC  CN LV A KQ+T++  P + T+QLKRF +FG    K+G+ + +P +L +      + PERA+ +Y L  VLVHAG +   GHY+ FVK+  G+WYE++DS V QV  +TVLRQ AY+LFY
Sbjct:   95 LFPAEQLAA--GWQTMRPIGPGLSNLGNTCFLNSVLQCLTYTPPLAEHM---------------------------------LTREHSAGCRVGES------------------------CMLCRFEAHVVRALSKREG--SSISPKAIVGRLKLVAKHMRVGRQEDAHEFLRLLVDAFQRSLL--TGIDPKIDRRIQETTLTHQVFGGYLQSQVSCGRCGYDSNTFDPLLDISLDIQGG-STIEKALRSFTRPETLTTSNRYRCEKCNKLVDATKQMTIYQLPRILTLQLKRFSVFG--GGKIGRYVEFPLNLNMKSYVSRNSPERASYDYTLYAVLVHAGGTSRSGHYYCFVKSPAGVWYELNDSTVHQVSERTVLRQSAYMLFY 415          
BLAST of NO20G01650 vs. NCBI_GenBank
Match: ORY42766.1 (cysteine proteinase [Rhizoclosmatium globosum])

HSP 1 Score: 263.1 bits (671), Expect = 4.100e-66
Identity = 157/390 (40.26%), Postives = 211/390 (54.10%), Query Frame = 0
Query:  111 LFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRFMFGSRS----SKVGKGISYPE--DLRLPVSGPERAAEYRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVKV 495
            +FPE  +  L+ W      GPGL NLGNTCFLN+TLQCL Y PPLA Y +                                L + ++ ++   RG                                            ++ I+PK+IVG L+++ KHFRVGRQEDAHEFLR+ ID+LQN+CL   +      H+  ETT +H++FGGY +S++ CT C   S T +  LD+SLE+    G++E+ALARFT  E+L   N++RCS C  LV A KQ+T+  AP +  +QLKRF FG  S     K+ K I++PE  D++  +   ++   Y L GVLVHAG S N GHYFS+VKA NG+WY  +DS V QV V+ VL Q AY+LFY  V
Sbjct:   64 VFPEETL--LLDWPKPITAGPGLQNLGNTCFLNSTLQCLTYTPPLALYLLSRSHSQKCKLRRQTTFCTFCE-----------LEKHVMRSLSGMRG--------------------------------------------KNTITPKSIVGRLKSIAKHFRVGRQEDAHEFLRYFIDSLQNSCLVGFE---KLDHKQKETTVIHQVFGGYTQSRILCTVCKEPSCTIEPCLDISLEVK-NCGSVEKALARFTKTESLTGDNKYRCSKCKTLVDATKQMTILEAPKILVLQLKRFEFGGFSMFGGGKISKMITFPEKLDIQPYMFKTKKKVLYELYGVLVHAGGSCNSGHYFSYVKAPNGVWYLKNDSEVRQVSVKQVLDQQAYILFYSAV 392          
BLAST of NO20G01650 vs. NCBI_GenBank
Match: KYQ92242.1 (peptidase C19 family protein [Tieghemostelium lacteum])

HSP 1 Score: 261.2 bits (666), Expect = 1.600e-65
Identity = 156/417 (37.41%), Postives = 206/417 (49.40%), Query Frame = 0
Query:  104 LPAPEKGLFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRFMF-GSRSSKVGKGISYPEDLRLP--VSGPERAAEYRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVKVVKPALPPFLPNFTGKRMAEMESG 518
            LP P+K LF   K+RSLM W  V + G GL NLGNTCF+N+ LQCL Y  PLA +                                  L   +I ++                                              + S   I PK I  N++ +   FR+GRQED+HEF+R +I+ LQ  CL     K +  HR   TT V  IFGGYLRSQVKCT C + S+T+D F+DL ++I+    +L++ LA F   E LD +N+++CS C  LV A KQL +H AP + T+QLKRF F G    K+ K I++   L L   ++     A Y L GVL H G S N GHYF FVK  NG+W+++DD  VSQV +  VL Q AY+LFY K V P         T  +  + E+G
Sbjct:  173 LPVPKKVLFAPEKLRSLMGWKSVSKVGSGLRNLGNTCFMNSVLQCLTYSAPLANFM--------------RSHEHSKNCSSTGFCIFCSLENHIIKSL----------------------------------------------DSSGKVIMPKEIAMNIKKIAPTFRLGRQEDSHEFIRFVIEGLQKVCLS-QYPKGSIPHRDTMTTVVGSIFGGYLRSQVKCTVCNYESNTFDPFMDLCVDIN-HADSLQKGLAHFVKSEILDHSNKYKCSKCKKLVKATKQLKIHIAPPILTIQLKRFSFMGMFGGKINKSINFEPQLNLSPFMTQSTSDAVYDLYGVLTHLGGSTNSGHYFCFVKNSNGVWHKLDDEFVSQVSLDNVLSQKAYILFYSKRVSPQQLSSSSTITNTQNIKNENG 527          
BLAST of NO20G01650 vs. NCBI_GenBank
Match: XP_013899054.1 (hypothetical protein MNEG_7926 [Monoraphidium neglectum] >KIZ00035.1 hypothetical protein MNEG_7926 [Monoraphidium neglectum])

HSP 1 Score: 258.8 bits (660), Expect = 7.800e-65
Identity = 156/392 (39.80%), Postives = 198/392 (50.51%), Query Frame = 0
Query:  109 KGLFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLAYIPPLAQYFIEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLGLVRDLIVNMHTGRGSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSCREGSRSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVKSNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGKVGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQLKRFMFGSRSSKVGKGISYPEDLRL-------PVSGPERAAEYRLAGVLVHAGASVNMGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVK 494
            K  + ES +  ++ W  V R G GL NLGNTCF+N+ LQC+ + PPLAQ F+                                  RD+    + G                                               +P  P      LRA+N+ FR+GRQEDAHE+LR L+DA+  A LK   +K      LA TTFVHR+FGG LRSQ+KC G  + S TYD FLDLSLEI+ +  TL+ AL  FTA E LD  NR+RC   N LV AKK++T+  APNV  V LKRF F  R  K+ K + +  DL L       P  GP+    Y L GVLVH G S++ GHY  +VKAGNG+W+  DD  V+QV  + V  Q AY+LFYV+
Sbjct:   19 KEFYKESSV--VLRWTTVKRAGVGLINLGNTCFMNSVLQCITHTPPLAQLFLSE--------------------------------RDIAPRSNGGTPPGHFDPIAATQQLVRRAF------------------------SGGAPARPALHAKGLRAINRRFRLGRQEDAHEYLRCLVDAMHEAWLKGLGLKQKPSQELATTTFVHRVFGGRLRSQIKCEGVDYESCTYDPFLDLSLEIN-QAATLQRALQHFTAAEVLDGDNRYRCPKNNKLVRAKKRITIEEAPNVLAVHLKRFDFFGRGHKLSKRVEFGTDLDLGPYMSGWPACGPQL---YDLYGVLVHHGHSLHSGHYVCYVKAGNGIWHLCDDHRVAQVSQRAVEGQQAYILFYVR 348          
The following BLAST results are available for this feature:
BLAST of NO20G01650 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match NameE-valueIdentityDescription
EWM22068.11.100e-16250.61ubiquitin carboxyl-terminal hydrolase 36 [Nannochl... [more]
XP_005852728.14.200e-15960.47hypothetical protein NGA_0518320 [Nannochloropsis ... [more]
CBN80155.14.100e-8242.75Ubiquitin-specific protease 16 [Ectocarpus silicul... [more]
CBN80156.11.400e-7439.18conserved unknown protein [Ectocarpus siliculosus][more]
ORY06429.11.500e-6840.20cysteine proteinase [Basidiobolus meristosporus CB... [more]
PRP84401.17.500e-6838.99peptidase C19 family protein [Planoprotostelium fu... [more]
PIA19011.11.100e-6641.09cysteine proteinase, partial [Coemansia reversa NR... [more]
ORY42766.14.100e-6640.26cysteine proteinase [Rhizoclosmatium globosum][more]
KYQ92242.11.600e-6537.41peptidase C19 family protein [Tieghemostelium lact... [more]
XP_013899054.17.800e-6539.80hypothetical protein MNEG_7926 [Monoraphidium negl... [more]

Pages

back to top
Relationships

This gene is member of the following syntenic_region feature(s):

Feature NameUnique NameSpeciesType
nonsL023nonsL023Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
nonsL021nonsL021Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ncniR020ncniR020Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR067ngnoR067Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region


This gene is orthologous to the following gene feature(s):

Feature NameUnique NameSpeciesType
NSK002701NSK002701Nannochloropsis salina (N. salina CCMP1776)gene


The following polypeptide feature(s) derives from this gene:

Feature NameUnique NameSpeciesType
NO20G01650.1NO20G01650.1-proteinNannochloropsis oceanica (N. oceanica IMET1)polypeptide


The following gene feature(s) are orthologous to this gene:

Feature NameUnique NameSpeciesType
jgi.p|Nanoce1779_2|91491gene_6382Nannochloropsis oceanica (N. oceanica CCMP1779)gene
Ubp16gene8365Nannochloropsis gaditana (N. gaditana B-31)gene


The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameSpeciesType
NO20G01650.1NO20G01650.1Nannochloropsis oceanica (N. oceanica IMET1)mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO20G01650 ID=NO20G01650|Name=NO20G01650|organism=Nannochloropsis oceanica|type=gene|length=3824bp
TACAGACAGGAAATTGAACGGCTCTTAAAAAACGCCAGCAAGATCCCATG
GATAACCTCCGGTAAGTCGCCAGGATAACCATCGATTAGCAGCCAGGCCC
GCGCTTTTCTTCGTAAAACATGCAGCACGGGAAAGGCAAAGGTGGAGGCT
TCGGAAGAAAGGGCGGCGCGGACCACGTGCAAAGTTTTAAGCGGCTGCTG
CAGGGTAAGGGGCAAGCAAAAAACGGAATCAACGACAGTAGCATCTTCCC
TGGTGGTCAGCGACGGCGCATCGAATTCCATAAGGCCCGCAAGCCCCAGA
ATGGCTTCATCTCCCTCTCCACCCCGATCACCTCCACCAACAGCAGCAGC
AGCAGCAATAACGGCCTCGCCGCCGGCTCGAGCTCCAAGAAATCTAATGC
CCGCGACGTAGGAGCGGCTGCCATAGGTTTACCAGCACCCGAAAAGGGCC
TCTTTCCCGAGTCAAAAATCCGCTCATTGATGTCTTGGGTCGGCGTGGGG
AGGACAGGGCCAGGGCTGTACAATCTGGGCAACACCTGCTTCCTCAATGC
GACCTTGCAATGCCTCGCATACATCCCCCCATTGGCTCAATATTTCATTG
AAGGAGGAGGGTCATCATTGTCATTGTCCTCGTCCATGCAGCGTATTCCA
GGTGGAGGTGGTGGAGGAGGAGGAGGAGGAGGAGGGTGGCTAGGTCTCGT
GCGTGACTTGATCGTGAACATGCACACAGGCCGAGGAAGCGGCAGCGATA
TAGGAGGCGGACGAGCAGGAGGAGAGAGCAGCCATCGCACGCACCACCAC
AGTCATAATCATTACCACTATCATCACCAGAAGCAGCAGCAACAGCAGGT
CTCCTGCCGGGAAGGCTCCCGGAGCCCCATCTCCCCCAAGGCGATCGTCG
GCAACCTCAGAGCTCTGAATAAACACTTTCGGGTAGGCCGTCAAGAAGAC
GCGCATGAATTTCTCCGGCACCTTATCGATGCCCTGCAGAATGCGTGCTT
GAAAATCGCCAAGGTCAAGAGCAATGCTGGCCACCGGCTGGCGGAAACGA
CGTTTGTGCACCGGATTTTCGGCGGGTATCTCCGATCGCAGGTCAAGTGT
ACGGGTTGTGGGTTTGCTAGTGACACGTACGACAGCTTTTTGGATTTGTC
GTTGGAGATTCATGGGAAGGTGGGGACCTTGGAGGAGGCCTTGGCCCGGT
TCACGGCCGTGGAGACCCTGGACAAGGCGAACCGGTGGAGGTGCTCGAGT
TGTAACCATCTGGTGTGTGCCAAGAAGCAGCTGACAGTGCACACAGCGCC
AAATGTGTGTACTGTCCAATTGAAGCGGTTCATGTTTGGCTCCCGAAGCT
CCAAGGTAGGTAAGGGGATCAGCTACCCCGAGGACTTGCGCCTGCCTGTC
TCGGGGCCCGAGCGCGCGGCCGAGTACCGACTGGCCGGCGTGTTGGTGCA
CGCGGGGGCGAGTGTGAATATGGGTCACTATTTCTCCTTTGTTAAGGCGG
GGAATGGCATGTGGTACGAGATGGATGACTCGCACGTGTCGCAGGTAGGG
GTGCAGACCGTGTTGAGGCAGCACGCCTACCTTCTGTTTTACGTCAAGGT
CGTCAAACCCGCCCTCCCTCCCTTCCTCCCTAATTTTACGGGAAAGAGAA
TGGCGGAGATGGAATCGGGATTAAAAAAGGAAGAGAAGAAGAAGGAGAAG
AAGAAAAAGAAGGAGGAGGAGGACGACGTAGGGAAGGTGATGACTAAGAA
GGAGAGGAAGGAGGAGAAGAAGGAGAAGAAGAAGGTGGCAGACAAGTCCA
AGAGAAGAGCCGAGGCCAAGGAAGAGGAACAGAAGCAAGCCGCCGCCACC
ACTGCTACAGCAGCAGCAGCAGGGACCACCACCAGCACTACTACTGCCAT
TAGCAGTATTTGTAGCAGCAACAGCAGCAACAGTATTGTGGAGCAAAGGC
TGGCCACCCCAGGGTGGAGCCAAGTGTTGGCGAAGATGAGAGAAGAAGAC
GCCCGAGCAGAAGACCCTACCAAGGATTTTATGGTCGGTTTGAGGGGGAA
GGGGGGAAGGGAGGAAGACGAAGCGGGAAGACTGCTGAGCATGGTAGACC
ATTCGTGGGCGCAATGTAAGGACTTGGCGATGTTCATGCAACAGGCACGG
GATGCGGAGGGAGGGAGGAAGGGGTGGAAGGAACGAGAGAGGGGCGATGA
TAAAAAAGGGGAGAAGGAGAAAGATAACTCAGGAATACAGCAGCAACAGG
TTGGTGGTGATCTGAAGATGAAAGAAGAGAGAGAAGAAGAGGAGGAGGAG
AACGCGGCCGAGAAAGGAGAGTCTTCCTCGTCTTCCTCCTCGTCTTCCTC
CTCCTCTTCTTCTCCTTCTGTGGGCAACACCACCGATGGCGATGATGCCG
ACGACCCCTACGATTCCTCCGCTTCAATCTCCTCCCCTTCCTCCGTCTCC
TCCTCCTCATCAAGCACCTCCTCTTTCATTCCTGCCCTCGATATGGGTAA
AATTGTCCTGGGTTATAATTTCGCAGGCAAGTTTTTCCGGACCCGCTTCT
CACGTCGACGATGGCAACGGACTTTTTCTCCCTGGGCACGGCATCAGCGG
GAGGAGAAGGCTGCCCTTTCAGGGGAGAGGTGGGGAGAGGAGGGGGAGGA
GGAGGAGGAAGGGGGTTCAGAGGTGGAGGAGGGAGGAAGGAAGAGTGACA
AGAAGCTGAAGGAAAGTAAGGCCAAGCAAGGGATAGCAGCGGAGAGTGCT
GTTGTCAAGATGAAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGAGGA
GGAGGAGGAGGAGGAGGAGGAGGAGCAAGAATGCAAAGACGAAGAGACGG
AGATTCATATTGATGGAAAAGAGGAGCTGAAGACGGCAGATCTTATTGAA
ATAGGTCGTAAATCACACATGCCGAGAAGCAGCAGCGGCAGCAGCAGCAG
CACGGGCGCGATGCCCTCATCTTCTTCGTCCTCCATTTCTTCTTCTTTCG
ATTCCACCACTGCAACTGCGGTCGCAACCTCCAGGGCCAAAAAAGAGAAA
AGGACAGAAAACCCTGGACAAAGAGTGCTGCGACGGGATGAGGAAGCAAT
CGTGATGGTGCCACGATCTTTTCGAGGAGGAGGAGAGAAAAGGGAGGAAA
GCAGGAAGAGGCAATTCGATCCCCGGGCGCTGCGAGCGATGAAGGATGCT
TCTTTGAGTGATCTAGgtacgtagtggggcagggagaggagaaattcagg
gcgtcagctatgtattcatagaccgaatggatgagaataatctattttct
tcacttggccagcgtgagatactcgaaggacatgcgcatcgatttatgcc
taatagacacaaactaatttttaacactctcacttgcgacactcccatac
aacacacacagTTGGACAGTGGGACAACATGGACGAAGAGGAAGCAGCAG
TCACCAAGGAGATGAATGAGGCACGAGACAAGGTCATCGCACAAACCAAG
CGAGAGGAGAGAGTCCAGCGAGGTAAGAGGCAGCTTAGTGAGTATGCAGC
GGCGGTCGTGGCCGGGAAGAAGAAGAAAGTCAAGGGACAAAAGGAGGGCG
GGGAGGTATACGAGATTTTTGAGCATAGTGGCGGTGGTGATGGCGGAAAA
AATAAAAGTAACCCGTTCGAGACGCGACAGATGGAGCTGCAGGAGCGAAA
GAGGGGACCGGGGTCAGCTCCGGTGGGTGGAAACACAAGGGGAAGAGGGA
CACGAAGGGACAGGCCCTTGCAGTCGTAGTAGACAGAAGAAAGATGAAGA
ACCGCTAGAAGGGGGAAAAGTTAA
back to top

protein sequence of NO20G01650.1

>NO20G01650.1-protein ID=NO20G01650.1-protein|Name=NO20G01650.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1155bp
MQHGKGKGGGFGRKGGADHVQSFKRLLQGKGQAKNGINDSSIFPGGQRRR
IEFHKARKPQNGFISLSTPITSTNSSSSSNNGLAAGSSSKKSNARDVGAA
AIGLPAPEKGLFPESKIRSLMSWVGVGRTGPGLYNLGNTCFLNATLQCLA
YIPPLAQYFIEGGGSSLSLSSSMQRIPGGGGGGGGGGGGWLGLVRDLIVN
MHTGRGSGSDIGGGRAGGESSHRTHHHSHNHYHYHHQKQQQQQVSCREGS
RSPISPKAIVGNLRALNKHFRVGRQEDAHEFLRHLIDALQNACLKIAKVK
SNAGHRLAETTFVHRIFGGYLRSQVKCTGCGFASDTYDSFLDLSLEIHGK
VGTLEEALARFTAVETLDKANRWRCSSCNHLVCAKKQLTVHTAPNVCTVQ
LKRFMFGSRSSKVGKGISYPEDLRLPVSGPERAAEYRLAGVLVHAGASVN
MGHYFSFVKAGNGMWYEMDDSHVSQVGVQTVLRQHAYLLFYVKVVKPALP
PFLPNFTGKRMAEMESGLKKEEKKKEKKKKKEEEDDVGKVMTKKERKEEK
KEKKKVADKSKRRAEAKEEEQKQAAATTATAAAAGTTTSTTTAISSICSS
NSSNSIVEQRLATPGWSQVLAKMREEDARAEDPTKDFMVGLRGKGGREED
EAGRLLSMVDHSWAQCKDLAMFMQQARDAEGGRKGWKERERGDDKKGEKE
KDNSGIQQQQVGGDLKMKEEREEEEEENAAEKGESSSSSSSSSSSSSSPS
VGNTTDGDDADDPYDSSASISSPSSVSSSSSSTSSFIPALDMGKIVLGYN
FAGKFFRTRFSRRRWQRTFSPWARHQREEKAALSGERWGEEGEEEEEGGS
EVEEGGRKSDKKLKESKAKQGIAAESAVVKMKEEEEEEEEEEEEEEEEEE
EEQECKDEETEIHIDGKEELKTADLIEIGRKSHMPRSSSGSSSSTGAMPS
SSSSSISSSFDSTTATAVATSRAKKEKRTENPGQRVLRRDEEAIVMVPRS
FRGGGEKREESRKRQFDPRALRAMKDASLSDLVGQWDNMDEEEAAVTKEM
NEARDKVIAQTKREERVQRGKRQLSEYAAAVVAGKKKKVKGQKEGGEVYE
IFEHSGGGDGGKNKSNPFETRQMELQERKRGPGSAPVGGNTRGRGTRRDR
PLQS*
back to top
Synonyms
Publications