NO20G02120, NO20G02120 (gene) Nannochloropsis oceanica

Overview
Name: NO20G02120
Unique Name: NO20G02120
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 7783
Alignment location: chr20:628771..636553 -
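
The sequence length and the alignment location above agree if the coordinates are read as 1-based and inclusive (636553 - 628771 + 1 = 7783). A minimal sketch of that check follows. The function names `parse_location` and `span_length` are illustrative and not part of the database, and the `seqid:start..end strand` layout is assumed from the record itself.

```python
import re

def parse_location(loc: str):
    """Split a 'seqid:start..end strand' string into its parts.

    The layout is assumed from the record above; a missing strand is
    returned as '.'.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])?", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4) or "."

def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr20:628771..636553 -")
print(seqid, strand, span_length(start, end))  # chr20 - 7783
```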


Properties
Property Name  Value
Description    AAA ATPase domain-containing protein
Mutants
Expression

Alignments
The following features are aligned
Aligned Feature  Feature Type  Alignment Location
chr20            genome        chr20:628771..636553 -
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                  Date Performed
GSE178672                                      2023-12-15
PRJNA933693                                    2023-10-26
PRJNA943449                                    2023-07-19
PXD016054                                      2021-01-14
PXD016699                                      2021-01-08
GSE149904                                      2020-10-10
NoIMET1_WT_HS                                  2020-09-29
PXD010030                                      2020-03-16
GSE139615                                      2020-03-11
PRJNA241382                                    2020-03-11
BLASTP analysis of N. oceanica IMET1 genes     2019-07-11
InterPro analysis for N. oceanica IMET1 genes  2019-07-11
GO annotation for IMET1v2 genes                2019-07-10
PXD008721                                      2019-04-30
PRJNA182180                                    2019-04-25
Gene prediction for N. oceanica IMET1          2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term        Definition
GO:0019538  protein metabolic process
Vocabulary: Molecular Function
Term        Definition
GO:0005524  ATP binding
GO:0000166  nucleotide binding
Vocabulary: INTERPRO
Term        Definition
IPR036628   Clp_N_dom_sf
IPR027417   P-loop_NTPase
IPR019489   Clp_ATPase_C
IPR004176   Clp_N
IPR003959   ATPase_AAA_core
IPR003593   AAA+_ATPase
IPR001270   ClpA/B
Homology
BLAST of NO20G02120 vs. NCBI_GenBank
Match: OAE25653.1 (hypothetical protein AXG93_4368s1120 [Marchantia polymorpha subsp. ruderalis] >PTQ41594.1 hypothetical protein MARPO_0033s0015 [Marchantia polymorpha])

HSP 1 Score: 961.4 bits (2484), Expect = 2.000e-276
Identity = 513/885 (57.97%), Positives = 660/885 (74.58%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSIGTGIVRAA-GLENLDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRA--GKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISAQ 890
            T KT E L+  +++AI   + Q TPLH+  AL+ + D + +  V AA G   + S+ ++  QAL + P Q+PAP  +  +SA+   + ++    KK+GD+ L+ DQI+  ++D + ++   ++AG+ A KVKQ +EK+R   GKKVD+A  + N++ALKKYG DLV  A   KLDPV+GRDEEIRRVI+VLSRRTKNNPVLIGEPGVGKTA+VEGLAQR+V GDVP +L   R+IALDMGALIAGA YRGEFE+RLK+VLKEV+DA G ++LFIDEIHLVLGAGR +G+MDAANLLKPMLARG+LRCIGATTL EYR +VEKD AFERRFQ V V EPSVP TISILRGLKE+YE HHG+RI D +LV+AA+L+ RYI  R LPDKAIDLVDEAC+ +RVQLDS+PE +D LER  +QLEVE  AL +E+D  SKARL EV++E+  + ++L+PL+++Y+ EKG+V+E +RL+ K  E+ R I  A R  D+A+VADL+Y AL E+E+ +  + A+I        ++ M+ E V    IAE+VSRWTGIPV++L  +++ +LL LAD LH RVVGQ+EAV+AVA+AVLRSRAGL RP+QPTGSFLFLGPTGVGK+ELAKALA++LFDDE  +VRIDMSEY EQHSVARLIGAPPGY+G+D+GGQLTEAVRR+PYSV+LFDE+EKAH SVFN LLQ+LDDGRLTD QGR VNF NT+IILTSNLGAEHL        T  ++                                + +V+  VRRHFRPE LNRLD++V+FSPLS +QLR + ++QM  ++ RL +R + + + +  LD VL +AY P YGARP+RR+LEK + TQ+S +LI   +D + T+ I  +
Sbjct:    7 THKTNEALAAGQEIAINAGHAQYTPLHLAVALLQDPDGLFSQAVSAARGDGAVSSVERAFNQALKKIPSQSPAPDEVPANSALVKCIRKAQSLQKKRGDSHLSIDQIILAVLDDSQISDCLQEAGVQAAKVKQELEKVRGGEGKKVDNATGDTNFQALKKYGRDLVEQA--GKLDPVIGRDEEIRRVIRVLSRRTKNNPVLIGEPGVGKTAVVEGLAQRVVRGDVPSNLLDVRLIALDMGALIAGAKYRGEFEERLKAVLKEVEDADGKVILFIDEIHLVLGAGRTEGSMDAANLLKPMLARGQLRCIGATTLDEYRSYVEKDAAFERRFQQVFVPEPSVPDTISILRGLKEKYEGHHGVRILDKSLVVAAQLSSRYITGRHLPDKAIDLVDEACANVRVQLDSQPEEVDALERRRIQLEVELHALEKEKDKSSKARLVEVKQELDDLNEKLRPLKMKYQREKGRVDESRRLKQKREEILRSIQDAERRMDLARVADLKYGALQEIEESIAHLDADI-------GDNNMLTETVAPDQIAEVVSRWTGIPVSRLGQNDKARLLGLADRLHLRVVGQDEAVQAVAEAVLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDENQLVRIDMSEYMEQHSVARLIGAPPGYVGYDKGGQLTEAVRRRPYSVILFDEIEKAHPSVFNTLLQLLDDGRLTDGQGRTVNFNNTVIILTSNLGAEHLLAGLSGEQTMTVA--------------------------------KDQVLQEVRRHFRPELLNRLDEMVVFSPLSHEQLRKVCRIQMKDVAMRLAERGVALAVTDAALDLVLTEAYNPIYGARPLRRWLEKKVVTQLSIMLINDEIDENSTVYIDCK 850          
BLAST of NO20G02120 vs. NCBI_GenBank
Match: CCI44512.1 (unnamed protein product [Albugo candida])

HSP 1 Score: 959.1 bits (2478), Expect = 9.800e-276
Identity = 518/912 (56.80%), Positives = 668/912 (73.25%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSIGTGIVRAAGLENLDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRAGKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQK-AALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISAQKPEGGTWADSELAFNVTPRSAAGASMEVD 919
            T+KTQE L  AK LA +  + Q+TP+H++ AL  + D +   +    G  N   + Q   + L   P Q+PAP  +  DS M  +L  + ++ K+  D  LA D ++  L  H    +  +  G    KVK+AI K+R G+ V SA AE+ Y+AL KYG +LV  AE  K+DPV+GRDEEIRRVI++L RRTKNNPVLIGEPGVGKTA+VEGLAQRIV GDVPESL   ++ +LDMGALIAGA YRGEFE+RLK+VLKEVKD+ G I+LFIDE+HL+LGAG+  GAMDAANLLKPMLARGELRCIGATTL EYRQHVEKD AFERRFQ V+V EPSV  T+SILRGLKERYESHHG++ITD+ALV AAKLADRYI  RF+PDKAID++DEAC+ +RVQLDS+PE ID+LER  LQL+VEATAL +E+D  SK RL +V+ E+ TI D+L PL L+++AEK +VNE +RL++K+ +LQ K+  A R++D+A VADL+Y A+P+++KR+ +  AEI  +  D +++ +V EVV ++ I +IVSRWTGIPV++LT+S   +LL L + +H RVVGQEEAV AV +AV+RSRAGLSR EQPTGSFLFLGPTGVGK+ELAKALA ELFD++KHMVRIDMSEY E+HSVARLIGAPPGY+GH+EGGQLTE+VRRKPY+VVL DE+EKAH  V N+LLQ+LDDGRLTDS GR V+F N ++I+TSN+GAEHL    ++D S  +                       +  S V +   R  V+  +R   RPE LNRLDDIV+FSPL R QLR I+ LQ  +++ RLK+ +I + +    LD +L +AY P+YGARP++RY+EK + T +S+L++AG L     +E+  +        D +L F+V+      A+ME+D
Sbjct:    7 TDKTQEYLQAAKSLAEDAGHAQLTPVHLVQALFDDSDGLAKRLADRVG-ANTPGILQETRRQLKLIPIQSPAPDQVSVDSGMTKMLKYADKRRKEMKDTHLAVDHLILALFTHTQCGTIFKSNGFDEKKVKEAINKVRGGRSVTSASAEEMYDALCKYGQNLVSLAETGKIDPVIGRDEEIRRVIRILCRRTKNNPVLIGEPGVGKTAVVEGLAQRIVIGDVPESLNC-QLFSLDMGALIAGAKYRGEFEERLKAVLKEVKDSEGKIILFIDEMHLILGAGQTSGAMDAANLLKPMLARGELRCIGATTLDEYRQHVEKDKAFERRFQQVMVKEPSVTDTVSILRGLKERYESHHGVQITDSALVTAAKLADRYITERFMPDKAIDIIDEACASVRVQLDSQPEAIDELERRQLQLQVEATALTKEKDEVSKQRLKKVQTELNTINDQLHPLMLQHQAEKERVNEVRRLKDKLQQLQLKVQKAERNQDLATVADLKYYAIPDIQKRIAQ--AEINKRNEDESQTKLVSEVVRDEQICQIVSRWTGIPVSRLTSSMSDRLLHLEERIHSRVVGQEEAVNAVCEAVVRSRAGLSRREQPTGSFLFLGPTGVGKTELAKALAFELFDNDKHMVRIDMSEYMEEHSVARLIGAPPGYVGHEEGGQLTESVRRKPYNVVLLDEIEKAHPKVLNILLQLLDDGRLTDSHGRTVDFTNVVVIMTSNIGAEHLMALGSIDLSPRH--------------SKKARLDSADDVSPVFARQ-RELVLQQLRATIRPELLNRLDDIVVFSPLGRAQLRRIVSLQFESVAKRLKENHISMRVGVGALDVILKEAYDPQYGARPLKRYIEKHVVTGLSKLILAGKLPAKSHVEVIEK--------DGKLDFDVS------AAMELD 885          
BLAST of NO20G02120 vs. NCBI_GenBank
Match: CCA14066.1 (heat shock protein 101 putative [Albugo laibachii Nc14])

HSP 1 Score: 956.8 bits (2472), Expect = 4.900e-275
Identity = 519/914 (56.78%), Positives = 664/914 (72.65%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSIGTGIVRAAGLENLDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRAGKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQK-AALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGK--VMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISAQKPEGGTWADSELAFNVTPRSAAGASMEVD 919
            T+KTQE L  AK LA +  + Q+TP+H++ AL  + D +   +       N   + Q  A+ L   P QTPAP  +  DS M  +L  + ++ K+  D  LA D ++  L  H   A+  +  G    KVK+AIEK+R G+ V S  AE  Y+AL KYG +LV  AE  K+DPV+GRDEEIRRVI++L RRTKNNPVLIGEPGVGKTA+VEGLAQRIV GDVPESL   ++ +LDMGALIAGA YRGEFE+RLK+VLKEVKD+ G I+LFIDE+HL+LGAG+  GAMDAANLLKPMLARGELRCIGATTL EYRQHVEKD AFERRFQ V+V EP+V  T+SILRGLKERYESHHG++ITD+ALV AAKLADRYI  RF+PDKAID++DEAC+ +RVQLDS+PE ID+LER  LQL+VEATAL  E+D  SK RL +V+ E+ TI D+L PL ++++AEK +VNE +RL++K+ +LQ KI  A R++D+A VADL+Y A+P+++KR+ +  A+I  K  D     +V EVV ++ I +IVSRWTGIPV++LT+S   +LL L + +H RVVGQEEAV AV +AV+RSRAGLSR EQPTGSFLFLGPTGVGK+ELAKALA ELFD++KHMVRIDMSEY E+HSVARLIGAPPGY+GH+EGGQLTE++RRKPY+VVL DE+EKAH  V N+LLQ+LDDGRLTDS GR V+F N ++I+TSN+GAEHL    ++D S  +                         G  V    VR +  V+  +R   RPE LNRLDDIV+FSPL R QLR I+ LQ  +++ RLK+ +I + ++   LD +L +AY P+YGARP++RY+EK + T +S+L++ G L     +E+        T  D +L F+V+P      +ME+D
Sbjct:    7 TDKTQEYLQAAKSLAEDAGHAQLTPIHLVQALFDDADGLAKRLADRVD-ANKTGILQETARQLKLIPSQTPAPDQVSVDSGMTKVLKYADKRRKEMKDTHLAVDHLILALFTHTQCATVFKSNGFDERKVKEAIEKVRGGRPVTSTSAEDMYDALTKYGQNLVSLAESGKIDPVIGRDEEIRRVIRILCRRTKNNPVLIGEPGVGKTAVVEGLAQRIVFGDVPESLNC-QLFSLDMGALIAGAKYRGEFEERLKAVLKEVKDSDGRIILFIDEMHLILGAGQTSGAMDAANLLKPMLARGELRCIGATTLDEYRQHVEKDKAFERRFQQVMVKEPTVTDTVSILRGLKERYESHHGVQITDSALVTAAKLADRYITERFMPDKAIDIIDEACASVRVQLDSQPEAIDELERRQLQLQVEATALANEKDEVSKERLKKVQAELNTISDQLHPLIVQHQAEKERVNEVRRLKDKLQQLQLKIQKAERNQDLATVADLKYYAIPDIQKRIAQ--AKINKKNEDENHPKLVSEVVRDEQICQIVSRWTGIPVSRLTSSTSDRLLHLEERIHNRVVGQEEAVNAVCEAVVRSRAGLSRREQPTGSFLFLGPTGVGKTELAKALAFELFDNDKHMVRIDMSEYMEEHSVARLIGAPPGYVGHEEGGQLTESIRRKPYNVVLLDEIEKAHPKVLNILLQLLDDGRLTDSHGRTVDFTNVVVIMTSNIGAEHLMALGSIDVSPRH----------------SKKARIGSEGDEVTPAFVRQRELVLQQLRATIRPELLNRLDDIVVFSPLGRAQLRKIVSLQFESVAKRLKESHISMRVSVSALDVILEEAYDPQYGARPLKRYIEKHVVTGLSKLILMGRLPAKSHVEV--------TEKDGKLDFDVSP------AMELD 886          
BLAST of NO20G02120 vs. NCBI_GenBank
Match: XP_022018608.1 (chaperone protein ClpB1 [Helianthus annuus] >OTF90694.1 putative heat shock protein [Helianthus annuus])

HSP 1 Score: 956.1 bits (2470), Expect = 8.300e-275
Identity = 514/884 (58.14%), Positives = 655/884 (74.10%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSI-GTGIVRAAGLENLDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRA--GKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISA 889
            T KT E L+   +LA+   + Q TPLH+  AL+++ + I    I  A G E  +S  +   QAL + P Q+PAP  +   S++  ++ ++    K +GD+ LA DQ++  L++ + ++   ++AG+ A +VK  +EKLR   GKKV+SA  + N++ALK YG DLV  A   KLDPV+GRDEEIRRVI++LSRRTKNNPVLIGEPGVGKTA+VEGLAQRIV GDVP +L   R++ALDMGALIAGA YRGEFE+RLK+VLKEV++A G ++LFIDEIHLVLGAGR +G+MDAANL KPMLARG+LRCIGATTL+EYR++VEKD AFERRFQ VLV EPSVP TISILRGLKERYE HHG+RI D ALV+AA+L+ RYI  RFLPDKAIDLVDEAC+ +RVQLDS+PE ID LER  +QLEVE  AL +E+D  SKARL EV+KE+  ++D+LQPL ++Y+ EK +V+E +RL+ K  EL   +  A R  D+A+ ADL+Y A+ EVE  +  +        G+A E+ M+ E V    IAE+VSRWTGIPV +L  +E+++L+ LAD LH+RVVGQ++AV AVA+AVLRSRAGL R +QPTGSFLFLGPTGVGK+ELAKALA++LFDDEK M+RIDMSEY EQHSVARLIGAPPGY+GH+EGGQLTEAVRR+PYSVVLFDEVEKAH SVFN LLQ+LDDGRLTD QGR V+F NT+II+TSNLGAE+L K                                 G +T++  + R  VM  VRRHF+PE LNRLD+IV+F PLS  QLR + +LQ+  ++ RL DR + + + E  LD +L ++Y P YGARPIRR+LE+ + T++S++LI   +D + T+ I A
Sbjct:    7 THKTNEALASGHELAMNAGHAQFTPLHIAVALISDANGIFRQAISNAGGEEAANSAERVFNQALKKLPSQSPAPDEVPASSSLIKVIRRAQSLQKSRGDSHLAVDQLILGLLEDSQISDLIKEAGVGASRVKSEVEKLRGKEGKKVESASGDTNFQALKTYGRDLVEQA--GKLDPVIGRDEEIRRVIRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPNNLSDVRLVALDMGALIAGAKYRGEFEERLKAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTLEEYRKYVEKDAAFERRFQQVLVAEPSVPDTISILRGLKERYEGHHGVRILDRALVVAAQLSSRYITARFLPDKAIDLVDEACANVRVQLDSQPEEIDNLERKRMQLEVELHALEKEKDKASKARLVEVKKELDDLRDKLQPLMMKYKKEKERVDEIRRLKQKREELLVALQEAERRYDLARAADLKYGAVQEVETAIARI-------EGNADENVMLTETVGPDQIAEVVSRWTGIPVTRLGTNEKERLIGLADRLHQRVVGQDQAVSAVAEAVLRSRAGLGRAQQPTGSFLFLGPTGVGKTELAKALAEQLFDDEKLMIRIDMSEYMEQHSVARLIGAPPGYVGHEEGGQLTEAVRRRPYSVVLFDEVEKAHQSVFNTLLQMLDDGRLTDGQGRTVDFSNTVIIMTSNLGAEYLLKG------------------------------LSGKTTMV--NAREMVMQEVRRHFKPELLNRLDEIVVFDPLSHDQLRKVARLQLKDVAVRLADRGVALGVTEAALDVILNESYDPVYGARPIRRWLERRVVTELSKMLIREEIDENSTVYIDA 849          
BLAST of NO20G02120 vs. NCBI_GenBank
Match: PON70323.1 (ClpA/B family [Trema orientalis])

HSP 1 Score: 956.1 bits (2470), Expect = 8.300e-275
Identity = 523/923 (56.66%), Positives = 663/923 (71.83%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSI-GTGIVRAAGLENL-DSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRA--GKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISAQKPEGGTWADSELAFNVTPRSAAGASMEVDNGGSANMA 927
            T KT EV++ A +LA+   + Q TPLH+  AL+ +   I    I  AAG E+   S+ +   QAL + P Q+P P  I   +++   + ++    K +GD+ LA DQ++  L++ + +    ++AG+   +VK  +EKLR   GKKV+SA  +  ++ALK YG DLV  A   KLDPV+GRDEEIRRV+++LSRRTKNNPVLIGEPGVGKTA+VEGLAQRIV+GDVP +L   R+IALDMGAL+AGA YRGEFE+RLK+VLKEV++A G ++LFIDEIHLVLGAGR +G+MDAANL KPMLARG+LRCIGATTL+EYR++VEKD AFERRFQ V V EPSVP TI+ILRGLKERYE HHG+RI D ALV+AA+L+ RYI  R LPDKAIDLVDEAC+ +RVQLDS+PE ID LER  +QLE+E  AL +E+D  SKARL EV KE+  ++D+LQPL ++Y  EK +++E +RL+ K  EL   +  A R  D+A+ ADLRY A+ EVE  + ++        G   E+ M+ E V  + IAE+VSRWTGIPV +L   ++ +L+ LAD LH+RVVGQ++AVEAVA+AVLRSRAGL RP+QPTGSFLFLGPTGVGK+ELAKALA++LFDDE  +VRIDMSEY EQHSV+RLIGAPPGYIGHDEGGQLTEAVRR+PYSVVLFDEVEKAHVSVFN LLQVLDDGRLTD QGR V+F+NT+II+TSNLGAEHL        T  ++                                R +VM  VRRHFRPE LNRLD+IV+F PL+ +QLR + +LQM  +++RL +R I + + +  LD+VLA++Y P YGARPIRR+LEK + T++SR+L+   +D + T+ I A  P G     SEL + V             NGG  N A
Sbjct:    7 THKTNEVIAAAHELAMSAGHAQFTPLHLAVALINDPSGIFSQAIANAAGNEDAPKSVERVFNQALKKIPSQSPPPEDIPASTSLIKSIRRAQAAQKSRGDSHLAVDQLILGLLEDSQIGDLLKEAGVATSRVKSEVEKLRGKEGKKVESASGDTTFQALKTYGRDLVEQA--GKLDPVIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVSGDVPSNLAEVRLIALDMGALVAGAKYRGEFEERLKAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTLEEYRKYVEKDAAFERRFQQVYVAEPSVPDTINILRGLKERYEGHHGVRIQDRALVVAAQLSSRYITGRHLPDKAIDLVDEACANVRVQLDSQPEEIDNLERKRMQLEIELHALEKEKDKASKARLVEVRKELDDLRDKLQPLMMKYRKEKERIDEIRRLKQKREELHIALQEAERRYDLARAADLRYGAIQEVESAIAQL-------EGSTDENLMLTETVGPEHIAEVVSRWTGIPVTRLGQDDKTRLVGLADRLHQRVVGQDQAVEAVAEAVLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDENLLVRIDMSEYMEQHSVSRLIGAPPGYIGHDEGGQLTEAVRRRPYSVVLFDEVEKAHVSVFNTLLQVLDDGRLTDGQGRTVDFRNTVIIMTSNLGAEHLLSGLTGKCTMQVA--------------------------------RDRVMQEVRRHFRPELLNRLDEIVVFDPLNHEQLRKVARLQMKDVASRLAERGIALAVTDAALDYVLAESYEPVYGARPIRRWLEKRVVTELSRMLVKEEIDENSTVYIDA-GPNG-----SELVYRVE-----------KNGGMVNAA 871          
BLAST of NO20G02120 vs. NCBI_GenBank
Match: XP_007210381.1 (chaperone protein ClpB1 [Prunus persica] >ONI08844.1 hypothetical protein PRUPE_5G203700 [Prunus persica])

HSP 1 Score: 953.4 bits (2463), Expect = 5.400e-274
Identity = 510/884 (57.69%), Positives = 654/884 (73.98%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSI-GTGIVRAAG-LENLDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRA-GKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISA 889
            T KT E LS A +LA +  + Q TPLH+ SAL+++ D +    I  A+G  E   ++ +   QAL + P Q+P P  I   + +  ++ ++    K KGD  LA DQ++  L++ + +    ++AG+   +VK  +EKLR  GKKVD+A  +  ++ALK YG DLV +AE  KLDPV+GRDEEIRRV+++LSRRTKNNPVLIGEPGVGKTA+VEGLAQRI+ GDVP +L   R+IALDMGAL+AGA YRGEFE+RLK+VLKEV++A G ++LFIDEIHLVLGAGR +G+MDAANLLKPMLARG+LRCIGATTL+EYR++VEKD AFERRFQ V V EPSVP TISILRGLKERYE HHG+RI D ALV+AA+L+ RYI  R LPDKAIDLVDEAC+ +RVQLDS+PE ID LER  +QLEVE  AL +E+D  SKARL EV KE+  ++D+LQPL ++Y  EKG+++E +RL+ K  EL   ++ A R  D+A+VADLRY A+ +VE  + ++        G   E+ ++ E V    IAE+VSRWTGIPV +L  +E+ +L+ LA+ LH+RVVGQ +AV+AVA+AVLRSRAGL RP+QPTGSFLFLGPTGVGK+ELAKALA++LFDDE  +VRIDMSEY EQHSV+RLIGAPPGY+GH+EGGQLTEAVRR+PYSV+LFDEVEKAH +VFN LLQVLDDGRLTD QGR V+F+NT+II+TSNLGAEHL    + + T                                 +D R +VM  V+RHFRPE LNRLD+IV+F PLSR QLR + +LQM  ++ RL +R I + + +  LD++L ++Y P YGARPIRR+LEK + T++SR+L+   +D + T+ I A
Sbjct:    7 TRKTNESLSGAHELATDAGHAQFTPLHLASALISDPDGVFRQAIANASGNAEAPRAVERVFNQALKKLPSQSPPPEEIPASTTLIKVIRRAQAAQKAKGDTHLAVDQLIIGLLEDSQIGDLLKEAGIAPARVKSEVEKLRGEGKKVDNAHGDTTFQALKTYGRDLVEEAE--KLDPVIGRDEEIRRVVRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRILRGDVPSNLADVRLIALDMGALVAGAKYRGEFEERLKAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLLKPMLARGQLRCIGATTLEEYRKYVEKDAAFERRFQQVYVAEPSVPDTISILRGLKERYEGHHGVRILDRALVVAAQLSSRYITGRQLPDKAIDLVDEACANVRVQLDSQPEEIDNLERKRMQLEVELHALEKEKDKASKARLVEVRKELDDLRDKLQPLMMKYRKEKGRIDELRRLKQKREELLIALAEAERRYDLARVADLRYGAIQDVESSIAKL-------EGSTDENLILTETVGPDQIAEVVSRWTGIPVTRLGQNEKDRLIGLAERLHKRVVGQNQAVDAVAEAVLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDENLIVRIDMSEYMEQHSVSRLIGAPPGYVGHEEGGQLTEAVRRRPYSVLLFDEVEKAHTAVFNTLLQVLDDGRLTDGQGRTVDFRNTVIIMTSNLGAEHLLSGLMGNCT--------------------------------MQDARDRVMQEVKRHFRPELLNRLDEIVVFDPLSRDQLRKVARLQMKDVAVRLAERGIALAVTDAALDYILDESYDPVYGARPIRRWLEKRVVTELSRMLVREEIDENSTVYIDA 849          
BLAST of NO20G02120 vs. NCBI_GenBank
Match: XP_006841133.1 (chaperone protein ClpB1 [Amborella trichopoda] >ERN02808.1 hypothetical protein AMTR_s00086p00119290 [Amborella trichopoda])

HSP 1 Score: 953.4 bits (2463), Expect = 5.400e-274
Identity = 515/889 (57.93%), Positives = 650/889 (73.12%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSIGTGIVRAA------GLENLDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRA--GKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISA 889
            T KT E L+ A +LA+   + Q+TPLH+  AL++E      GIVR A      G E  +S  + + QA+ + P Q PAP  +   S++   + ++    K KGD  LA DQ++  L++ + +    ++AG+   +VK  +EKLR   GKKV+SA  + N++ALK YG DLV +A   KLDPV+GRDEEIRRVI++LSRRTKNNPVLIGEPGVGKTA+VEGLAQRIV GDVP +L   RV+ALDMGAL+AGA YRGEFE+RLKSVLKEV++A G ++LFIDEIHLVLGAGR +G+MDAANL KPMLARG+LRCIGATTL+EYR++VEKD AFERRFQ V V EPSV  TISILRGLKERYE+HHG+RI D ALV+AA+L+ RYI  R+LPDKAIDLVDEAC+ +RVQLDS+PE ID+LER  +QLEVE  AL +E+D  SKARL EV KE+  ++D+LQPL ++Y  EK +V+E +RL+ +  EL   +  A R  D+A+VAD+RY AL E++  + ++    +       E+ M+ E V    IAE+VSRWTGIPV +L  +E +KL+ LAD LH+RVVGQ+EAV AVA+AVLRSRAGL RP+QPTGSFLFLGPTGVGK+ELAKALA++LFDD K ++RIDMSEY EQHSVARLIGAPPGY+GH+EGGQLTEAVRR+PYSV+LFDEVEKAH+SVFN LLQVLDDGRLTD QGR V+F NT+II+TSNLGAEHL    L   T                                 +  R +VM  VRRHF+PE LNRLD+IVIF PL+  QL  + +LQM  ++ RL +R I + + +  L+ VLA+AY   YGARPIRR+LEK + TQ+S++L+ G +D + T+ I A
Sbjct:    7 THKTNEALAGAHELAVNSGHAQLTPLHLALALISE----AGGIVRQAISNAGGGEEAANSFERVLKQAMRKIPSQEPAPDEVPASSSLIKAVRRAQSSQKSKGDTHLAVDQLILGLLEDSQIGDILKEAGVSPGRVKAEVEKLRGKEGKKVESASGDTNFQALKTYGRDLVEEA--GKLDPVIGRDEEIRRVIRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLSDVRVVALDMGALVAGAKYRGEFEERLKSVLKEVEEADGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTLEEYRKYVEKDAAFERRFQQVYVAEPSVEDTISILRGLKERYENHHGVRIQDRALVVAAQLSSRYITGRYLPDKAIDLVDEACANVRVQLDSQPEEIDKLERKRIQLEVELHALEKEKDKASKARLVEVRKELDDLRDKLQPLMMKYRKEKERVDEIRRLKQRREELLFALQEAERRMDLARVADIRYGALQEIDAAIAKLEESTD-------ENPMLTETVGPDQIAEVVSRWTGIPVTRLRQNETEKLIGLADRLHQRVVGQDEAVNAVAEAVLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDAKLLIRIDMSEYMEQHSVARLIGAPPGYVGHEEGGQLTEAVRRRPYSVILFDEVEKAHISVFNALLQVLDDGRLTDGQGRTVDFCNTVIIMTSNLGAEHLLAGLLGQET--------------------------------MQTARERVMQEVRRHFKPELLNRLDEIVIFQPLTHDQLLKVARLQMTDVAARLAERGIAVAVTDAALEVVLAEAYDALYGARPIRRWLEKKVVTQLSKMLVKGEIDENSTVYIDA 850          
BLAST of NO20G02120 vs. NCBI_GenBank
Match: XP_009788495.1 (PREDICTED: chaperone protein ClpB1 [Nicotiana sylvestris] >XP_016437499.1 PREDICTED: chaperone protein ClpB1 [Nicotiana tabacum])

HSP 1 Score: 952.6 bits (2461), Expect = 9.200e-274
Identity = 515/885 (58.19%), Positives = 653/885 (73.79%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSI-GTGIVRAAGL-ENLDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRA--GKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISA 889
            T KT E L++A +LAI   + Q TPLH+  AL+++ + I    IV AAG  E  +S+ +   QA+ + P QTPAP  I   +++  +L ++    K +GD  LA DQ++  L++ + +    ++AG+   +VK  +EKLR   GKKV+SA  + N++ALK YG DLV  A   KLDPV+GRDEEIRRVI++LSRRTKNNPVLIGEPGVGKTA+VEGLAQRIV GDVP +L   R+IALDMGALIAGA YRGEFE+RLK+VLKEV++A G ++LFIDEIHLVLGAGR +G+MDAANL KPMLARG+LRCIGATTL+EYR++VEKD AFERRFQ V V EPSVP TISILRGLKE+YE HHG++I D ALV+AA+L+ RYI  R LPDKAIDLVDEAC+ +RVQLDS+PE ID LER  +QLEVE  AL +E+D  SKARL EV KE+  ++D+LQPL +RY+ EK +V+E +RL+ K  EL   +  A R  D+A+ ADLRY A+ EVE  +  + +  +       ES M+ E V    IAE+VSRWTGIPV++L  +E++KL+ LA+ LH+RVVGQ++AV AVA+AVLRSRAGL RP+QPTGSFLFLGPTGVGK+ELAKALA++LFDD+K MVRIDMSEY EQHSVARLIGAPPGY+GH+EGGQLTEAVRR+PYSVVLFDEVEKAH +VFN LLQVLDDGRLTD QGR V+F NT+II+TSNLGAE+L    +   T                                 E  R  VM  VR+HF+PE LNRLD+IV+F PLS +QLR + + Q+  +++RL +R I + + E  LD +LA++Y P YGARPIRR+LEK + T++S++L+   +D + T+ I A
Sbjct:    7 THKTSEALAEAHELAISAGHAQFTPLHMAVALISDHNGIFRQAIVNAAGSEETANSVERVFKQAMKKIPSQTPAPDEIPPSTSLIKVLRRAQSLQKSRGDTHLAVDQLILGLLEDSQIGDLLKEAGVSTARVKSEVEKLRGKEGKKVESASGDTNFQALKTYGRDLVEQA--GKLDPVIGRDEEIRRVIRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLADVRLIALDMGALIAGAKYRGEFEERLKAVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTLEEYRKYVEKDAAFERRFQQVYVAEPSVPDTISILRGLKEKYEGHHGVKIQDRALVVAAQLSARYITGRHLPDKAIDLVDEACANVRVQLDSQPEEIDNLERKRIQLEVELHALEKEKDKASKARLIEVRKELDDLRDKLQPLTMRYKKEKERVDELRRLKQKRDELTYALQEAERRYDLARAADLRYGAIQEVEAAIANLESSTD-------ESTMLTETVGPDQIAEVVSRWTGIPVSRLGQNEKEKLIGLANRLHQRVVGQDDAVRAVAEAVLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDDKLMVRIDMSEYMEQHSVARLIGAPPGYVGHEEGGQLTEAVRRRPYSVVLFDEVEKAHPTVFNTLLQVLDDGRLTDGQGRTVDFTNTVIIMTSNLGAEYLLSGLMGKCT--------------------------------MEKARDLVMQEVRKHFKPELLNRLDEIVVFDPLSHEQLRQVCRHQLKDVASRLAERGIALGVTEAALDVILAQSYDPVYGARPIRRWLEKRVVTELSKMLVKEEIDENSTVYIDA 850          
BLAST of NO20G02120 vs. NCBI_GenBank
Match: XP_010537384.1 (PREDICTED: chaperone protein ClpB1 [Tarenaya hassleriana])

HSP 1 Score: 952.6 bits (2461), Expect = 9.200e-274
Identity = 509/884 (57.58%), Positives = 656/884 (74.21%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSIGTGIVRAAGLEN-LDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRA--GKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISA 889
            T KT E ++ A +LA+  ++ Q TPLH+ +AL+++   I    V +AG EN   S  + I QAL + P Q+P P  I   +++  ++ ++    K +GD+ LA DQ++  L++ + +    ++AG+ A +VK  +EKLR   GKKV+SA  + N++ALK YG DLV  A   KLDPV+GRDEEIRRVI++LSRRTKNNPVLIGEPGVGKTA+VEGLAQRIV GD+P +L   R+IALDMGAL+AGA YRGEFE+RLKSVLKEV++A G ++LFIDEIHLVLGAGR +G+MDAANL KPMLARG+LRCIGATTL+EYR++VEKD AFERRFQ V V EP+VP TISILRGLKE+YE HHG+RI D ALV+AA+L+ RYI  R LPDKAIDLVDEAC+ +RVQLDS+PE ID L+R  +QLE+E  AL RE+D  SKARL EV KE+  ++D+LQPL ++Y  EK +++E +RL+ K  EL   +  A R  D+A+ ADLRY A+ EVE  + ++    E       E+ M+ E V  + IAE+VSRWTGIPV +L  +E+++L+ LAD LH+RVVGQ++AV AVA+A+LRSRAGL RP+QPTGSFLFLGPTGVGK+ELAKALA++LFDDE  +VRIDMSEY EQHSV+RLIGAPPGY+GH+EGGQLTEAVRR+PYSV+LFDEVEKAHV+VFN LLQVLDDGRLTD QGR V+F+NT+II+TSNLGAEHL    L   T  +S                             +  R +VM  VR+HFRPE LNRLD++V+F PLS +QLR + +LQM  ++ RL +R + + + +  LD VLA++Y P YGARPIRR+LEK + T++SR+L+   +D + T+ I A
Sbjct:    7 THKTNEAIATAHELAMNAAHAQFTPLHLAAALISDSAGIFPQAVSSAGGENAAQSAERVIKQALKKLPSQSPPPDDIPASTSLIKVIRRAQAAQKSRGDSHLAVDQLILGLLEDSQIGDLLKEAGVAASRVKSEVEKLRGKEGKKVESASGDTNFQALKTYGRDLVEQA--GKLDPVIGRDEEIRRVIRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVKGDIPNNLSDVRLIALDMGALVAGAKYRGEFEERLKSVLKEVEEADGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTLEEYRKYVEKDAAFERRFQQVYVAEPNVPDTISILRGLKEKYEGHHGVRIQDRALVVAAQLSARYITGRHLPDKAIDLVDEACANVRVQLDSQPEEIDNLQRKRIQLEIELHALEREKDKASKARLVEVRKELDDLRDKLQPLTMKYRKEKERIDEIRRLKQKREELIFALQEAERRYDLARAADLRYGAIQEVESAIAQLEPSSE-------ENLMLTETVGPEHIAEVVSRWTGIPVTRLGQNEKERLIGLADRLHQRVVGQDQAVTAVAEAILRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDENLLVRIDMSEYMEQHSVSRLIGAPPGYVGHEEGGQLTEAVRRRPYSVILFDEVEKAHVAVFNTLLQVLDDGRLTDGQGRTVDFRNTVIIMTSNLGAEHL----LSGLTGKVS----------------------------MQVARDRVMQEVRKHFRPELLNRLDELVVFDPLSHEQLRKVARLQMKDVAVRLAERGVALAVTDAALDVVLAESYDPVYGARPIRRWLEKRVVTELSRMLVREEIDENSTVYIDA 849          
BLAST of NO20G02120 vs. NCBI_GenBank
Match: XP_010663020.1 (PREDICTED: heat shock protein 101 isoform X1 [Vitis vinifera])

HSP 1 Score: 952.2 bits (2460), Expect = 1.200e-273
Identity = 513/885 (57.97%), Positives = 649/885 (73.33%), Query Frame = 0
Query:    8 TEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSI-GTGIVRAAGLEN-LDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQAKKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRA--GKKVDSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTKNNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGASYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLLKPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISILRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACSMIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMATIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVADLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRWTGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSRPEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSVARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQVLDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSXXXXXXXXXXXXXXXXXXXXXEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDIVIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVPEYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISA 889
            T KT E L+ A +LA+   + Q+TPLHV  AL+ + + I    I+ A G E   +S+ +   +AL + P Q+P P  I   + +  ++ ++    K +GD  LA DQ++  L++ + +    ++AG+   +VK  +EKLR   GKKV+SA  +  ++ALK YG DLV  A   KLDPV+GRDEEIRRVI++LSRRTKNNPVLIGEPGVGKTA+VEGLAQRIV GDVP +L   R+IALDMGAL+AGA YRGEFE+RLKSVLKEV++A G ++LFIDEIHLVLGAGR +G+MDAANL KPMLARG+LRCIGATTL+EYR++VEKD AFERRFQ V V EPSVP TISILRGLKERYE HHG+RI D ALV+AA+L+ RYI  R LPDKAIDLVDEAC+ +RVQLDS+PE ID LER  +QLEVE  AL +E+D  SKARL EV +E+  ++D+LQPL ++Y+ EK +++E +RL+ K  EL   +  A R  D+A+ ADLRY A+ EVE       A I    G   E+ M+ E V  + IAE+VSRWTGIPV +L  +++++L+ LA+ LH+RVVGQ++AV AVA+AVLRSRAGL RP+QPTGSFLFLGPTGVGK+ELAKALA++LFDDE  +VRIDMSEY EQHSV+RLIGAPPGY+GHDEGGQLTEAVRR+PYSVVLFDEVEKAH++VFN LLQVLDDGRLTD QGR V+F NT+II+TSNLGAEHL    +   T                                 +D R +VM  VRRHFRPE LNRLD+IV+F PLS  QLR + +LQM  +++RL +R I + + +  LD VLA++Y P YGARPIRR+LEK + T++S++LI   +D + T+ I A
Sbjct:    7 THKTNETLAGAHELAMNSGHAQLTPLHVAVALITDPNGILRQAIIGAGGNEEAANSVERVFNKALKKLPSQSPPPDEIPVSTTLIKVVRRAQSSQKSRGDTHLAVDQLILGLLEDSQIGDLLKEAGVSTSRVKSEVEKLRGKEGKKVESASGDTTFQALKTYGRDLVEQA--GKLDPVIGRDEEIRRVIRILSRRTKNNPVLIGEPGVGKTAVVEGLAQRIVRGDVPSNLAEVRLIALDMGALVAGAKYRGEFEERLKSVLKEVEEAEGKVILFIDEIHLVLGAGRTEGSMDAANLFKPMLARGQLRCIGATTLEEYRKYVEKDAAFERRFQQVYVAEPSVPDTISILRGLKERYEGHHGVRIQDRALVVAAQLSSRYITGRHLPDKAIDLVDEACANVRVQLDSQPEEIDNLERKRMQLEVELHALEKEKDKASKARLVEVRRELDDLRDKLQPLMMKYKKEKERIDELRRLKQKREELLFALQEAERRYDLARAADLRYGAIQEVE-------AAIANLEGTTDENMMLTETVGPEQIAEVVSRWTGIPVTRLGQNDKERLIGLAERLHQRVVGQDQAVSAVAEAVLRSRAGLGRPQQPTGSFLFLGPTGVGKTELAKALAEQLFDDENLLVRIDMSEYMEQHSVSRLIGAPPGYVGHDEGGQLTEAVRRRPYSVVLFDEVEKAHIAVFNTLLQVLDDGRLTDGQGRTVDFTNTVIIMTSNLGAEHLLSGLVGKCT--------------------------------MQDARDRVMQEVRRHFRPELLNRLDEIVVFDPLSHDQLRKVARLQMKDVASRLAERGIALAVTDAALDVVLAESYDPVYGARPIRRWLEKKVVTELSKMLIREEIDENSTVYIDA 850          
The following BLAST results are available for this feature:
BLAST of NO20G02120 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name      E-value     Identity (%)  Description
OAE25653.1      2.000e-276  57.97         hypothetical protein AXG93_4368s1120 [Marchantia p...
CCI44512.1      9.800e-276  56.80         unnamed protein product [Albugo candida]
CCA14066.1      4.900e-275  56.78         heat shock protein 101 putative [Albugo laibachii ...
XP_022018608.1  8.300e-275  58.14         chaperone protein ClpB1 [Helianthus annuus] >OTF90...
PON70323.1      8.300e-275  56.66         ClpA/B family [Trema orientalis]
XP_007210381.1  5.400e-274  57.69         chaperone protein ClpB1 [Prunus persica] >ONI08844...
XP_006841133.1  5.400e-274  57.93         chaperone protein ClpB1 [Amborella trichopoda] >ER...
XP_009788495.1  9.200e-274  58.19         PREDICTED: chaperone protein ClpB1 [Nicotiana sylv...
XP_010537384.1  9.200e-274  57.58         PREDICTED: chaperone protein ClpB1 [Tarenaya hassl...
XP_010663020.1  1.200e-273  57.97         PREDICTED: heat shock protein 101 isoform X1 [Viti...

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name  Unique Name  Species                                       Type
nonsL021      nonsL021     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ncniR020      ncniR020     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ngnoR067      ngnoR067     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name  Unique Name  Species                                      Type
NSK002757     NSK002757    Nannochloropsis salina (N. salina CCMP1776)  gene


The following polypeptide feature(s) derive from this gene:

Feature Name  Unique Name           Species                                       Type
NO20G02120.2  NO20G02120.2-protein  Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO20G02120.1  NO20G02120.1-protein  Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name               Unique Name  Species                                          Type
jgi.p|Nanoce1779_2|595017  gene_6435    Nannochloropsis oceanica (N. oceanica CCMP1779)  gene
Naga_100881g1              gene8411     Nannochloropsis gaditana (N. gaditana B-31)      gene


The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Species                                       Type
NO20G02120.2  NO20G02120.2  Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO20G02120.1  NO20G02120.1  Nannochloropsis oceanica (N. oceanica IMET1)  mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO20G02120 ID=NO20G02120|Name=NO20G02120|organism=Nannochloropsis oceanica|type=gene|length=7783bp
CAGTGAACATACGCCTTAGAATAGGCAAGACAGACAAGCCAAGAGCACAC
CACATCAACCACCTGCCTCTCCGTTACGCAAGCCCCCCCTTCCAAGCAGC
AGCATGTCGATGTCTATCACGCCCACCGAGAAAACTCAGGAGGTCCTCAG
TCAGGCCAAGGATCTAGCGATAGAGATGAGTAACACGCAGgtacgcagtt
ttggaatgggaaaaagggaagggtgattggagaggagggaagatagttgc
gagggcgaaggagaaaaatggacctgccagcgatccccgaggcatatgta
gtactcatggattcctagcggtggattggacgagaaccctaaaaacgaga
gggctttttcaggagtgcagctgggggccgcagatgtgtaggagcgatgc
ttcgtcgccggcatcatttcgatgaggcgatcccatccatgctgtacttg
gccgctccactggaatccttgcgatcgagctgacttcccctccgtccgtc
cctccatccctccttccctccctcccttcaccctcagGTGACGCCTCTAC
ACGTGATGAGCGCCCTGGTTGCTGAGGATGACAGTATAGGCACGGGCATC
GTCCGCGCGGCTGgtaggtggctcgattgagagcgggagggaaggatggg
ggcgaaggagggtgggcgagagggtgaaatgaaacagtgcaccaatattt
cattccatctagacacactcaaggttatcaaatcaagatgatatttgtgc
attcgcactgttgtttttggattccttcgtttctcaccctcctccctttc
ctccctccctttctcgataaagGCCTCGAAAACCTCGACTCTCTCCGTCA
GTCCATCGCACAAGCCCTCAACCGTTGCCCCCGGCAGACCCCTGCCCCTT
CCTCCATCCATTTCGATTCGGCCATGGAAAGCATCTTGGTCCAGTCGGCG
CAACAGGCGAAAAAAAAGGGCGACGCGTTTTTAGCCGCGGATCAGATTCT
GGCAGTATTGATAGACCATGCCAGCGTGGCATCCGCATGTAGGCAGGCTG
GGCTGATGGCTGATAAGgtgagtgggtgaatataaaggaaacaaatcaca
ggaatagacgaaagagatggtcacattcgaaccaaccacacacgctgatg
ctctacttccctccttccctccctccctccctccttccctccctcagGTG
AAGCAAGCGATCGAGAAGCTCCGCGCTGGAAAGAAGGTGGATTCCGCGAA
Ggcacgttatatgtatatatatatatgtatggctcgtcttcccgttctct
actctgcttcctgtcttttcttctcttcttctccccctgtggccctgtct
ccccttctgcctccttcccttctcaccctcctccctccgctcgctcttct
cccatcactccctcagGCGGAGCAGAATTACGAAGCTCTGAAGAAATACG
GCCATGATTTGGTGGGAGACGCCGAAGAGAGTAAGCTCGATCCAGTCGTG
GGTCGGGACGAGGAGATCCGACGCGTGATTCAGgtagagagagggagggg
agaaggagggaagaaaggaagggagggaggatgagcggtcatcattgata
ttcaagcatcaccttttcttattttttccccccctccatcctgccctcag
GTCCTCTCGCGCCGGACCAAGAACAATCCGGTGCTGATTGGGGAGCCCGG
GGTGGGTAAGACGGCCATAGTGGAAGGACTAGCGCAACGCATTGTGGCTG
GGGACGTGCCTGAAAGCCTCAAGAGCCGGCGgtaagggagggagggaaag
aggaagggaggaagggaaggagatacacgaagcgtctccgcggcttttgt
cttacagtccacatcctccctctcttccctccccacccttccctccctcc
ctccctccctcccctctagGGTCATTGCATTGGACATGGGCGCTTTGATC
GCCGGTGCCTCCTACCGAGGTGAATTCGAGGATCGACTCAAAAGTGTGCT
CAAGGAGgtatgatgtccctccctccctccctccctccttccctccctct
ttcaataatcgactccttgttgaggtcatcagttgctcatcaccatcgtc
atctccttcctcatcatcatcttcctcttcttcttctttctcttgttgtc
ggtattttttgtcagGTGAAGGATGCGCATGGGAATATCGTTCTCTTCAT
TGgtacgtacctaccccttccccccctccctccctccctcccgcccgccc
gcccgcccgcccgcccgcccaccctcctgttccccacttcaactcctccc
aacattaccatctcttacctctctcccctcctcccctcctaccctccttc
cctccatccctccctttcagACGAGATCCACCTGGTGCTGGGTGCGGGCC
GTGGGGACGGTGCAATGGACGCAGCGAACTTGCTCAAGCCCATGCTGGCA
CGTGGAGAACTTCGATGCATTGGGGCCACGACACTGAAAGAATACCGgta
cggagggaggggggagagagagagggagggagggggaggtgagttagtgg
gggggggcgatgtctttgcacccttcgcttcctccttcctctttccctcc
ctccctccttcgcaggcaacacgtcgaggaagacccggccttggctccct
tttccatgttcatacgctcacttcccccctccttccctccctccctccct
ctctccttcacagGCAACACGTCGAGAAGGATCCGGCGTTTGAACGCCGC
TTCCAGCCCGTTCTAGTGGGGGAGCCGAGCGTCCCTGCCACCATCTCCAT
CCTTCGAGGGCTCAAGGAACGATACGAGTCTCACCACGGCATTCGTATCA
CCGATGCCGCTCTTGTCCTCGCTGCCAAGCTGGCGGATCGGTACATTCAG
gtaggcctatcctctcttccctcttttatatgccttgatgcccgacagcg
cgttctgttttgttgttgttgttgttgttgttattattattaatgttgtg
gtggtggtgctggtggtggtgctggtggtggtgctggtgctggtggtggt
gctggtggtggtgctggtgctggtgctggtgctggtgctggtggtggtgc
tggtggtgctggtgctggtggtggttttgctccacttgctaactctcttt
ctcgccctcccttccttcctccctccctcccttcctcccttcctccctcc
ctccctcccttagAACCGATTTTTGCCGGACAAGGCGATTGACCTCGTAG
ATGAGGCTTGCTCCATGATCCGCGTCCAGCTGGATTCCCGTCCCGAGgta
accactggaagaggaagggagggagggaagggatgatagggagggtaggg
gtgaggggggccaggggggaaggttcaattcatcttctacgtgtgtgcga
gattcctagtcatccaggacgattctcctcacttcctctccctccctttc
tccctccttccctcctcccttccttttttcgtgtctcaccttccctgaca
gCGCATCGACCAGTTGGAGCGGGCGTTGCTACAGTTGGAGGTCGAGGCTA
CGGCCCTTCGACGCGAGGAGGATGTCCAGTCCAAGGCCCGTCTCACTGAA
gtacgtcttccttccctcgctcgctccctccgtccttcttccctccctcc
ctgctaccaatcgcagtcacgttctccttcgattattgattcttgtgcgc
ttcttccccccctcccttcctccctccttcccttccgcccttcacctctc
cctcccttcctccctccttccctcccgccctaccctgcagGTGGAGAAGG
AAATGGCAACGATCAAGGACGAGTTACAGCCCCTGCAGCTCCGCTACGAA
GCCGAGAAAGGGCAGGTCAACGAGCAACAGCGCCTGCAGAACAAGgtatg
tcgatgtcaatcttccctccccccctccctccctccctccctccctccct
cgcttcctacacatcaactccatctgtccgactctctcctcctccctctc
tctttccctccctttcttatcctcagGTGCTCGAGCTCCAACGCAAAATT
TCCGTGGCACTGCGCGATCGAGACATGGCTCAGgtatgcacatcttctct
ctctatctccttgtcttgtctccctttttcttccgcttttcctcgtctca
ccgcccctccttcctcctcccctccctccctagGTGGCGGACCTGAGGTA
CATCGCATTGCCCGAAGTGGAAAAGCGTCTGGTAGAAGTATCGGCCGAGA
TTGAAGCCAAACGAGGGGATGCTGCTGAGTCAGgtgagggagggagggta
ggagggattgggggagggagggagggagggagggagggagggagggtacc
acgacggcggatttgattttacttcctccctccctccctatcttcctccc
ttcctcccttcattcagGCATGGTTCGAGAGGTAGTAGACGAGCAAAGCA
TCGCCGAGATCGTCAGTCGTTGGACAGGCATTCCTGTGAACAAACTCACC
GCTTCTGgtacgtgcgcttgtcctccctcccacccgccctccctccctcc
ctccctccctcctcggtagatgagttccaaaagccatttttgatgtcgcc
tctcatccccctccctctctcccgtattctcttcctctccccccatccac
cctccccgacctccccctcttcgcttagAACGCCAGAAACTCCTCTCTCT
CGCCGACGTGCTGCACGAGCGTGTGGTTGGGCAGGAGGAAGCGGTGGAGG
CCGTGGCGCAAGCCGTCCTTCGTTCGCGCGCGGGCCTTTCTCGCCCGGAG
CAGCCCACGGGGAGCTTCCTTTTTTTGGgtacgagaaacatgacaggagg
acatgacatcgagaattcagaggaggagggagggacggaaagtaggcggt
aactgatcaagagtggttatacactcagaccgagactcatccctttccgc
cttctccctccttttccctatgcatagGCCCGACTGGAGTGGGAAAGAGC
GAGCTGGCTAAGGCGCTCGCGCAGGAGCTGTTTGATGACGAAAAGCACAT
Ggtatgtgaaactcgaggggggcatgaaaggagcctggccaggcttgagt
gagagactgtccaaatcgaatgcatggaaaagccgaagatgactcgtgcg
agtacgtggcgccactcggtacttatgcaaagcgttcccgcagcttagtg
aattaagcgatgccttattcatccttcctctcgctccctcccttcctctt
ttcagGTGCGCATTGACATGAGCGAGTACGGCGAACAGCACAGCGgtaag
aagatcgaaagggaatgactgcgggagctatgccatctactcaattgtca
acttaccgttcttcgaaaatccctcctctcattgcagTTGCTCGTCTGAT
CGGTGCTCCCCCTGGGTATATTGgtaagtttgtgcgtttagataggtgtg
tatttctttatatttgggatgaccaagaaactaccatgagcttgtctgct
tgtcgcaacatttactgatcgcgcttttctatccttccctccctttttgt
ttctctaccttgtcttatatagGCCACGATGAAGGGGGGCAACTCACCGA
GGCCGTCCGTCGCAAGCCATACAGCGTCGTATTATTTGATGAAGTCGAGA
AAGCCCACGTGgtacgtgtggtaaaaatatttgaaccctgctcagatatc
tgtgaatagactagcgtcctagatgcaatgttgtctgcctttgccctcca
agtcagagcttacctattgccttcgagtgctaacttgttgcctcttttcc
tgactttttggttaaacaccacagAGCGTTTTTAATGTCTTGCTACAAGT
CCTGGACGACGGTCGACTGACTGgtacgtggtgttgtatattgctcactt
gaaaacggaaaaaagaggtcgtgatatgttgtgattgctctctctttgat
tgctatagcacgcgtcggcgagcttgtttgaagcttgacacatgtttctg
agacttgaacattttggtctacacttgagctcgacttgtgctcaattttt
ccgtatccgtacacctttttcgcgcgcagATTCCCAAGGACGCATCGTGA
ACTTTAAGAACACCATCATCATCCTCACAAGTAACCTCGGCGCCGAGCAC
TTGCAGAAGGCGGCTCTCGACAGCTCTACCAgtaagtgcctttattccgc
ttctgaacttttttggtataaattgatatttttaattgatattcggtctc
atgtcgtttctgaccttgctttaagatatgtgacctacaagctctcactt
ttttgttccattaaactctccatgaagGTAACTTGAGCGACGATGAGGAC
TCGGCGAGCAACGGCAACGGCAACGGGCGTAGCAACAAGAAGAGTAAGAA
GGAAGGCGGCAGTACCGTCATTTCTGAGGACGTCCGAGGCAAGGTCATGG
CTGTCGTCCGCCGTCACTTTCGCCCCGAGTTTTTGAACCGgtacgtttac
ctaacattttacacgaatcatatgtttgtgtgtatgaggatcgttccttt
ggaagcatgaggtggagatcgagattatatgcgaggttgacctaccatgc
atgccgtgaatcgtatcgggtttcattttatttcctttggcgagatcgac
aaacttgctttctcttacttatgtgcccctcctcataccctgaaatatag
GCTCGATGACATTGTCATTTTTAGCCCCTTGTCTCGTAAGCAGCTTCGCT
CCATCATGCAACTGCAGATGGCCGCCATTTCCAACCGCCTCAAGGACCGC
AATATCGAGATCCACCTAGCTGAAGATGGCTTGGACCATGTCCTTGCTAA
GGCCTACGTTCCCGAGgtatgctccgatattacgtcgagcttttgttcgg
atttgaaaaatccatgtgtgccgcatatctacctgacatattctgttttt
tcttgattttaaaagctgctttttccagccaactcaaatttgcaatgcag
aggcctgcttccatctttaccaacaataaaatttatttcaaattcttgaa
cctgatttcctcattcccatcacgtgacacacatcacatagTACGGCGCC
CGTCCTATTCGCCGGTATCTGGAGAAAACCATCACTACCCAAGTTTCGCG
CTTGCTTATCGCTGGGACCCTCGACAGGGACCAGACTCTGGAAATAAGTG
CTCAGAAGCCTGAAGGCGGGACGTGGGCGGACTCCGAACTGGCCTTTAAT
GTGACGCCGCGCTCGGCGGCAGGGGCGTCGATGGAGGTGGACAATGGAGG
CAGCGCCAACATGGCCCCACCTATGTTTCGGCCGAGTCCTGGGCGAGGCA
AATGACTGAGAGAGGAGAGTGAAGGGATGAAGGCAGAAAAAAAGGGAAAG
AGAGGGGCGAAGTAGTGTGTTTGATTGCAGATAATTGATTTCTCGGGGGC
CATAAACTCTAGCTTTGGAATTTAAGGAATAAAAGACAAGTATGAGCGTA
CCTTTATGCGTGCGGAGGCAGTTCAGAAGAGATCACTTAGACTTATAGAT
GATTCTGGATAGGGAATGATGGATTGTATGTAGGAAAAATTATATATTGA
CAGGGACGAAGGCAAAAGACGAAGAGCATGATTTATACGAAACATGGCGC
GCTGGAATATGAGATAAGTGTATGTGTCAAAGAAAGCAAAATACGAAAAA
GAAGAAGAGGGCACAGAGACGAGCGAACCGAGGCAAAAGAAGGAAAAGAT
ATAAAAGAAGTAAGATGAGAAGAACACATTTGG
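
The gene sequence above mixes uppercase and lowercase letters. A common convention is to show exons in uppercase and introns in lowercase, but this page does not state that, so treat it as an assumption. The sketch below only splits a sequence into runs of the same case and reports their coordinates; the function name `split_by_case` is hypothetical and not part of any database tool.

```python
from itertools import groupby


def split_by_case(seq):
    """Return (label, start, end) for each run of same-case letters.

    Coordinates are 0-based and end-exclusive. Whether "upper" means exon
    and "lower" means intron is an assumption, not stated on the page.
    """
    segments = []
    pos = 0
    for is_upper, run in groupby(seq, key=str.isupper):
        length = sum(1 for _ in run)
        segments.append(("upper" if is_upper else "lower", pos, pos + length))
        pos += length
    return segments


# Fourth line of the gene sequence above, copied verbatim.
fragment = "TCAGGCCAAGGATCTAGCGATAGAGATGAGTAACACGCAGgtacgcagtt"
segments = split_by_case(fragment)
for label, start, end in segments:
    print(label, start, end)
```

To apply this to the whole record, join the sequence lines (without the `>` header) into one string before calling the function.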

protein sequence of NO20G02120.2

>NO20G02120.2-protein ID=NO20G02120.2-protein|Name=NO20G02120.2|organism=Nannochloropsis oceanica|type=polypeptide|length=881bp
MSMSITPTEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSIGTGI
VRAAGLENLDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQA
KKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRAGKKV
DSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTK
NNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGA
SYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLL
KPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISI
LRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACS
MIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMA
TIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVA
DLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRW
TGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSR
PEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSV
ARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQV
LDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSDDEDS
ASNGNGNGRSNKKSKKEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDI
VIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVP
EVCSDITSSFCSDLKNPCVPHIYLTYSVFS*

protein sequence of NO20G02120.1

>NO20G02120.1-protein ID=NO20G02120.1-protein|Name=NO20G02120.1|organism=Nannochloropsis oceanica|type=polypeptide|length=939bp
MSMSITPTEKTQEVLSQAKDLAIEMSNTQVTPLHVMSALVAEDDSIGTGI
VRAAGLENLDSLRQSIAQALNRCPRQTPAPSSIHFDSAMESILVQSAQQA
KKKGDAFLAADQILAVLIDHASVASACRQAGLMADKVKQAIEKLRAGKKV
DSAKAEQNYEALKKYGHDLVGDAEESKLDPVVGRDEEIRRVIQVLSRRTK
NNPVLIGEPGVGKTAIVEGLAQRIVAGDVPESLKSRRVIALDMGALIAGA
SYRGEFEDRLKSVLKEVKDAHGNIVLFIDEIHLVLGAGRGDGAMDAANLL
KPMLARGELRCIGATTLKEYRQHVEKDPAFERRFQPVLVGEPSVPATISI
LRGLKERYESHHGIRITDAALVLAAKLADRYIQNRFLPDKAIDLVDEACS
MIRVQLDSRPERIDQLERALLQLEVEATALRREEDVQSKARLTEVEKEMA
TIKDELQPLQLRYEAEKGQVNEQQRLQNKVLELQRKISVALRDRDMAQVA
DLRYIALPEVEKRLVEVSAEIEAKRGDAAESGMVREVVDEQSIAEIVSRW
TGIPVNKLTASERQKLLSLADVLHERVVGQEEAVEAVAQAVLRSRAGLSR
PEQPTGSFLFLGPTGVGKSELAKALAQELFDDEKHMVRIDMSEYGEQHSV
ARLIGAPPGYIGHDEGGQLTEAVRRKPYSVVLFDEVEKAHVSVFNVLLQV
LDDGRLTDSQGRIVNFKNTIIILTSNLGAEHLQKAALDSSTSNLSDDEDS
ASNGNGNGRSNKKSKKEGGSTVISEDVRGKVMAVVRRHFRPEFLNRLDDI
VIFSPLSRKQLRSIMQLQMAAISNRLKDRNIEIHLAEDGLDHVLAKAYVP
EYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISAQKPEGGTWADSE
LAFNVTPRSAAGASMEVDNGGSANMAPPMFRPSPGRGK*
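
The two protein records above appear identical over their first 17 lines (850 residues) and differ only in their C-terminal tails. As a minimal sketch, the helper below finds the first position at which two sequences differ; `first_divergence` is a hypothetical name, and the two inputs are the opening residues of each isoform's final stretch, copied from the records above.

```python
def first_divergence(a, b):
    """Return the 0-based index of the first differing position.

    Returns None if the sequences are identical. If one sequence is a
    prefix of the other, returns the length of the shorter one.
    """
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    return None if len(a) == len(b) else min(len(a), len(b))


# Residues following the shared first 850 positions (stop symbol omitted).
tail_2 = "EVCSDITSSFCSDLKNPCVPHIYLTYSVFS"  # NO20G02120.2
tail_1 = "EYGARPIRRYLEKTITTQVSRLLIAGTLDRDQTLEISAQKPEGGTWADSE"  # NO20G02120.1

offset = 850  # assumed length of the shared prefix, counted from the records
idx = first_divergence(tail_2, tail_1)
print("first differing residue (1-based):", offset + idx + 1)
```

Running the same function on the full sequences, with line breaks and the trailing `*` removed, avoids relying on the assumed offset.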