NO04G04550, NO04G04550 (gene) Nannochloropsis oceanica

Overview
NameNO04G04550
Unique NameNO04G04550
Typegene
OrganismNannochloropsis oceanica (N. oceanica IMET1)
Sequence length1566
Alignment locationchr4:1241403..1242968 +

Link to JBrowse

Properties
Property NameValue
DescriptionCysteine proteinase
Mutants
Expression

Hover the mouse over a column in the graph to view expression values.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
chr4genomechr4:1241403..1242968 +
Analyses
This gene is derived from or has results from the following analyses
Analysis NameDate Performed
GSE1786722023-12-15
PRJNA9336932023-10-26
PRJNA9434492023-07-19
GSE1499042020-10-10
NoIMET1_WT_HS2020-09-29
GSE1396152020-03-11
PRJNA2413822020-03-11
BLASTP analysis of N. oceanica IMET1 genes2019-07-11
InterPro analysis for N. oceanica IMET1 genes2019-07-11
GO annotation for IMET1v2 genes2019-07-10
PRJNA1821802019-04-25
Gene prediction for N. oceanica IMET12017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Biological Process
TermDefinition
GO:0006508proteolysis
GO:0019538protein metabolic process
Vocabulary: Molecular Function
TermDefinition
GO:0008234cysteine-type peptidase activity
GO:0016787hydrolase activity
Vocabulary: INTERPRO
TermDefinition
IPR038765Papain_like_cys_pep_sf
IPR013128Peptidase_C1A
IPR000668Peptidase_C1A_C
IPR013201Prot_inhib_I29
Homology
BLAST of NO04G04550 vs. NCBI_GenBank
Match: EWM28848.1 (cysteine proteinase [Nannochloropsis gaditana])

HSP 1 Score: 289.3 bits (739), Expect = 1.100e-74
Identity = 145/194 (74.74%), Postives = 161/194 (82.99%), Query Frame = 0
Query:   37 KHRRGWVLHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQ-KRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGG 230
            +HRRGWVL GHE A EEWSRFKRYFRKVY S +EE +RF VF SSLA+A++RNA NVAAGGEHVFGVTRFSDET EEF+RRYKGRK HG+GV  G  +RRPLA+TSA M REG+E       G V+  RP  VDWV AGATTS+ NQGQCGSCWAFSATSQIESAFI+ G+APWRLSVQQVTSCT+ GFGCGGG
Sbjct:   41 RHRRGWVLLGHETAAEEWSRFKRYFRKVYHSPSEEEHRFHVFASSLAEARVRNAQNVAAGGEHVFGVTRFSDETPEEFNRRYKGRKGHGRGVVSGAEVRRPLAYTSALMGREGQE------GGSVEVSRPPQVDWVAAGATTSIGNQGQCGSCWAFSATSQIESAFILAGHAPWRLSVQQVTSCTSGGFGCGGG 228          
BLAST of NO04G04550 vs. NCBI_GenBank
Match: AAC37213.1 (cysteine proteinase [Trypanosoma cruzi])

HSP 1 Score: 117.5 bits (293), Expect = 5.800e-23
Identity = 76/190 (40.00%), Postives = 95/190 (50.00%), Query Frame = 0
Query:   44 LHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGGIRSK 234
            LH  E    +++ FK+   +VY S+AEE++R SVFR++L  A+L  A N  A     FGVT FSD T+EEF  RY    +H                         EE         V   PA  DW E GA T+V NQG CGSCWAF+A   IE  + + GN   RLS Q + SC     GCGGG+ SK
Sbjct:   28 LHAEETLASQFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHA----TFGVTPFSDLTREEFRSRYHNGAAHFAA---------------------AEERARVPVDVEVVGAPAAKDWREEGAVTAVKNQGICGSCWAFAAIGNIEGQWFLAGNPLTRLSEQMLVSCDNTNSGCGGGLSSK 192          
BLAST of NO04G04550 vs. NCBI_GenBank
Match: PBJ69679.1 (cysteine peptidase [Trypanosoma cruzi cruzi] >PBJ69681.1 cysteine peptidase [Trypanosoma cruzi cruzi])

HSP 1 Score: 117.5 bits (293), Expect = 5.800e-23
Identity = 77/190 (40.53%), Postives = 100/190 (52.63%), Query Frame = 0
Query:   44 LHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGGIRSK 234
            LH  E    +++ FK+   +VY S+AEE++R SVFR++L  A+L  A N  A     FGVT FSD T+EEF  RY    +H    ++            AR+  + E          V   PA  DW E GA T+V NQG CGSCWAF+A   IE  + + GN   RLS Q + SC     GCGGG+ SK
Sbjct:   28 LHAEETLASQFAEFKQKHGRVYGSAAEEAFRLSVFRANLFLARLHAAANPHA----TFGVTPFSDLTREEFRSRYHNGAAHFAAAQE-----------RARVPVDVE----------VVGAPAAKDWREEGAVTAVKNQGICGSCWAFAAIGNIEGQWFLAGNPLTRLSEQMLVSCDNTNSGCGGGLSSK 192          
BLAST of NO04G04550 vs. NCBI_GenBank
Match: CAA38238.1 (unnamed protein product [Trypanosoma brucei])

HSP 1 Score: 116.7 bits (291), Expect = 9.900e-23
Identity = 74/187 (39.57%), Postives = 98/187 (52.41%), Query Frame = 0
Query:   44 LHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGGI 231
            LH  E  E  ++ FK+ + KVY  + EE++RF  F  ++ QAK++ A N  A     FGVT FSD T+EEF  RY+   S+    +K   +R+ +  T+ R                    PA VDW E GA T V +QGQCGSCWAFS    IE  + + GN    LS Q + SC    FGCGGG+
Sbjct:   31 LHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYA----TFGVTPFSDMTREEFRARYRNGASYFAAAQK--RVRKTVNVTTGR-------------------APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGL 192          
BLAST of NO04G04550 vs. NCBI_GenBank
Match: CAC67416.1 (cysteine protease, partial [Trypanosoma brucei rhodesiense])

HSP 1 Score: 116.3 bits (290), Expect = 1.300e-22
Identity = 74/187 (39.57%), Postives = 98/187 (52.41%), Query Frame = 0
Query:   44 LHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGGI 231
            LH  E  E  ++ FK+ + KVY  + EE++RF  F  ++ QAK++ A N  A     FGVT FSD T+EEF  RY+   S+    +K   +R+ +  T+ R                    PA VDW E GA T V +QGQCGSCWAFS    IE  + + GN    LS Q + SC    FGCGGG+
Sbjct:   31 LHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYA----TFGVTPFSDMTREEFRARYRNGASYFAAAQK--RLRKTVNVTTGR-------------------APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGL 192          
BLAST of NO04G04550 vs. NCBI_GenBank
Match: XP_845224.1 (cysteine peptidase precursor [Trypanosoma brucei brucei TREU927] >AAX80357.1 cysteine peptidase precursor [Trypanosoma brucei] >AAZ11665.1 cysteine peptidase precursor [Trypanosoma brucei brucei TREU927])

HSP 1 Score: 116.3 bits (290), Expect = 1.300e-22
Identity = 74/187 (39.57%), Postives = 98/187 (52.41%), Query Frame = 0
Query:   44 LHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGGI 231
            LH  E  E  ++ FK+ + KVY  + EE++RF  F  ++ QAK++ A N  A     FGVT FSD T+EEF  RY+   S+    +K   +R+ +  T+ R                    PA VDW E GA T V +QGQCGSCWAFS    IE  + + GN    LS Q + SC    FGCGGG+
Sbjct:   31 LHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYA----TFGVTPFSDMTREEFRARYRNGASYFAAAQK--RLRKTVNVTTGR-------------------APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGL 192          
BLAST of NO04G04550 vs. NCBI_GenBank
Match: XP_011773878.1 (cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972] >CBH11593.1 cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972])

HSP 1 Score: 116.3 bits (290), Expect = 1.300e-22
Identity = 74/187 (39.57%), Postives = 98/187 (52.41%), Query Frame = 0
Query:   44 LHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGGI 231
            LH  E  E  ++ FK+ + KVY  + EE++RF  F  ++ QAK++ A N  A     FGVT FSD T+EEF  RY+   S+    +K   +R+ +  T+ R                    PA VDW E GA T V +QGQCGSCWAFS    IE  + + GN    LS Q + SC    FGCGGG+
Sbjct:   31 LHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYA----TFGVTPFSDMTREEFRARYRNGASYFAAAQK--RLRKTVNVTTGR-------------------APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGL 192          
BLAST of NO04G04550 vs. NCBI_GenBank
Match: XP_011773879.1 (cysteine peptidase precursor, (fragment), partial [Trypanosoma brucei gambiense DAL972] >CBH11594.1 cysteine peptidase precursor, (fragment), partial [Trypanosoma brucei gambiense DAL972])

HSP 1 Score: 116.3 bits (290), Expect = 1.300e-22
Identity = 74/187 (39.57%), Postives = 98/187 (52.41%), Query Frame = 0
Query:   44 LHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGGI 231
            LH  E  E  ++ FK+ + KVY  + EE++RF  F  ++ QAK++ A N  A     FGVT FSD T+EEF  RY+   S+    +K   +R+ +  T+ R                    PA VDW E GA T V +QGQCGSCWAFS    IE  + + GN    LS Q + SC    FGCGGG+
Sbjct:   31 LHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYA----TFGVTPFSDMTREEFRARYRNGASYFAAAQK--RLRKTVNVTTGR-------------------APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGL 192          
BLAST of NO04G04550 vs. NCBI_GenBank
Match: XP_011773880.1 (cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972] >XP_011773883.1 cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972] >CBH11595.1 cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972] >CBH11598.1 cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972])

HSP 1 Score: 116.3 bits (290), Expect = 1.300e-22
Identity = 74/187 (39.57%), Postives = 98/187 (52.41%), Query Frame = 0
Query:   44 LHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGGI 231
            LH  E  E  ++ FK+ + KVY  + EE++RF  F  ++ QAK++ A N  A     FGVT FSD T+EEF  RY+   S+    +K   +R+ +  T+ R                    PA VDW E GA T V +QGQCGSCWAFS    IE  + + GN    LS Q + SC    FGCGGG+
Sbjct:   31 LHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYA----TFGVTPFSDMTREEFRARYRNGASYFAAAQK--RLRKTVNVTTGR-------------------APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGL 192          
BLAST of NO04G04550 vs. NCBI_GenBank
Match: XP_011773882.1 (cysteine peptidase precursor, (fragment), partial [Trypanosoma brucei gambiense DAL972] >CBH11597.1 cysteine peptidase precursor, (fragment), partial [Trypanosoma brucei gambiense DAL972])

HSP 1 Score: 116.3 bits (290), Expect = 1.300e-22
Identity = 74/187 (39.57%), Postives = 98/187 (52.41%), Query Frame = 0
Query:   44 LHGHEVAEEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHVFGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGEEXXXXXXXGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESAFIMDGNAPWRLSVQQVTSCTANGFGCGGGI 231
            LH  E  E  ++ FK+ + KVY  + EE++RF  F  ++ QAK++ A N  A     FGVT FSD T+EEF  RY+   S+    +K   +R+ +  T+ R                    PA VDW E GA T V +QGQCGSCWAFS    IE  + + GN    LS Q + SC    FGCGGG+
Sbjct:   31 LHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEENMEQAKIQAAANPYA----TFGVTPFSDMTREEFRARYRNGASYFAAAQK--RLRKTVNVTTGR-------------------APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTIDFGCGGGL 192          
The following BLAST results are available for this feature:
BLAST of NO04G04550 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match NameE-valueIdentityDescription
EWM28848.11.100e-7474.74cysteine proteinase [Nannochloropsis gaditana][more]
AAC37213.15.800e-2340.00cysteine proteinase [Trypanosoma cruzi][more]
PBJ69679.15.800e-2340.53cysteine peptidase [Trypanosoma cruzi cruzi] >PBJ6... [more]
CAA38238.19.900e-2339.57unnamed protein product [Trypanosoma brucei][more]
CAC67416.11.300e-2239.57cysteine protease, partial [Trypanosoma brucei rho... [more]
XP_845224.11.300e-2239.57cysteine peptidase precursor [Trypanosoma brucei b... [more]
XP_011773878.11.300e-2239.57cysteine peptidase precursor [Trypanosoma brucei g... [more]
XP_011773879.11.300e-2239.57cysteine peptidase precursor, (fragment), partial ... [more]
XP_011773880.11.300e-2239.57cysteine peptidase precursor [Trypanosoma brucei g... [more]
XP_011773882.11.300e-2239.57cysteine peptidase precursor, (fragment), partial ... [more]

Pages

back to top
Relationships

This gene is member of the following syntenic_region feature(s):

Feature NameUnique NameSpeciesType
nonsL082nonsL082Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
nonsL080nonsL080Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
nonsL078nonsL078Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ncniR058ncniR058Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR110ngnoR110Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR108ngnoR108Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region


This gene is orthologous to the following gene feature(s):

Feature NameUnique NameSpeciesType
NSK009332NSK009332Nannochloropsis salina (N. salina CCMP1776)gene


The following polypeptide feature(s) derives from this gene:

Feature NameUnique NameSpeciesType
NO04G04550.1NO04G04550.1-proteinNannochloropsis oceanica (N. oceanica IMET1)polypeptide


The following gene feature(s) are orthologous to this gene:

Feature NameUnique NameSpeciesType
jgi.p|Nanoce1779_2|538435gene_2242Nannochloropsis oceanica (N. oceanica CCMP1779)gene
Naga_100402g1gene1761Nannochloropsis gaditana (N. gaditana B-31)gene


The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameSpeciesType
NO04G04550.1NO04G04550.1Nannochloropsis oceanica (N. oceanica IMET1)mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO04G04550 ID=NO04G04550|Name=NO04G04550|organism=Nannochloropsis oceanica|type=gene|length=1566bp
ATGCTGCTTCTCCGATCAGCCATCCTCCTCAGCTGCCTTTTCGCCGCTGC
GAGCGCTGCCATCACGACCGCCACCACCGAGGAGCAGCGGCTACAGCAGC
ACAAGCACAAGCATCGTCGTGGGTGGGTCCTGCACGGTCACGAGGTGGCG
GAAGAGGAATGGTCTCGATTCAAGAGgtacgcacgcggaaaggggttgct
tcgacgttaaccttgataatacccgtgcgtcgtgtcccctcgtgtctttc
gtttaccatgagcgtggaaattggcttcaggcctgcatgtgtgcaagccc
cctatttcctttctgtctccttttgtattccctttccctcccttagcctc
ccctcagaccgtctcatctcttcctgttccgcttgcccatgcccgtctta
actatccttgggtcttccacaaagtcagataaatccctattaaaaagcgt
gcaggcagttcttcttcgttcttccctctttcatgcccgtgttctacaaa
ataactccaccacgcaccacttgtaaaactcacctccctcatcctcccac
ccacccacccacccacccaccctttctccctccttccctccctccctccc
tccctcccaaccatcttaagGTACTTCCGAAAAGTCTACTTCTCCTCCGC
GGAGGAATCATACCGCTTCTCTGTCTTCCGATCCTCCCTAGCACAAGCCA
AGCTGCGCAATGCCCATAACGTGGCAGCGGGCGGCGAACACGTGTTTGGT
GTCACCCGTTTCTCCGACGAAACGCAGGAGGAGTTCGACCGGCGGTACAA
GGGGAGAAAGAGCCATGGGAAGGGAGTGAGGAAGGGGGTGGCGATTCGCA
GGCCCTTGGCGTGGACTTCGGCGAGGATGGAGAGGGAGGGCGAGGAGGAG
GGAGGGAAGGAGGGAGGGGGGCCCGTGCAGAAGCGGCCGGCAATGGTGGA
CTGGGTTGAGGCAGGTGCGACGACGAGCGTGGCGAATCAAGGACAGTGCG
GGgtaagtgtatgcctctctgctttccctccttcttccctcatccttctt
aggtttaaattagcttgtgacgtgggtgtaggggctgggcgttttttgtt
ttttatttttaaaacttttgttaattgcgttatgattggccgccgtgctt
ctttcactctcatcgacttgccctccctctctccatccctccttcccact
ctcctagTCCTGCTGGGCGTTCTCGGCCACGTCACAGATTGAGTCTGCCT
TCATCATGGACGGGAACGCCCCGTGGCGTTTGTCGGTGCAGCAGGTCACC
TCCTGCACGGCCAATGGCTTTGGGTGTGGGGGGGGGATACGATCGAAGCC
TACGAGCAGgtgcgtaggtcaggaagacggagggagggagagaaggggag
gaagggggaggtccatgcatatatgcgcgtgttagggggggaaagcgagt
aaggggaggaagagagggagggagggagggagggagggttggtttgtttt
ttcccttctaacattttcgtccactccccctccctccctccctccttcct
ttcctccagCTGCTAG
back to top

protein sequence of NO04G04550.1

>NO04G04550.1-protein ID=NO04G04550.1-protein|Name=NO04G04550.1|organism=Nannochloropsis oceanica|type=polypeptide|length=239bp
MLLLRSAILLSCLFAAASAAITTATTEEQRLQQHKHKHRRGWVLHGHEVA
EEEWSRFKRYFRKVYFSSAEESYRFSVFRSSLAQAKLRNAHNVAAGGEHV
FGVTRFSDETQEEFDRRYKGRKSHGKGVRKGVAIRRPLAWTSARMEREGE
EEGGKEGGGPVQKRPAMVDWVEAGATTSVANQGQCGSCWAFSATSQIESA
FIMDGNAPWRLSVQQVTSCTANGFGCGGGIRSKPTSSC*
back to top
Synonyms
Publications