NO02G03040, NO02G03040 (gene) Nannochloropsis oceanica

Overview
Name: NO02G03040
Unique Name: NO02G03040
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 3040
Alignment location: chr2:814375..817414 -
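
The sequence length and the alignment location above agree if the coordinates are read as inclusive, so that the span is end - start + 1. The following is a minimal Python sketch of that check. The helper names `parse_location` and `span_length` are hypothetical and written only for the `chrom:start..end strand` format shown on this page; inclusive coordinates are an assumption that the stated length of 3040 supports.

```python
import re

def parse_location(location):
    """Parse 'chrom:start..end strand' into its parts (hypothetical helper)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])?", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end, strand = match.groups()
    return chrom, int(start), int(end), strand

def span_length(start, end):
    """Length of an interval whose start and end are both included."""
    return end - start + 1

chrom, start, end, strand = parse_location("chr2:814375..817414 -")
length = span_length(start, end)
print(chrom, strand, length)
```

The trailing `-` marks the minus strand of chr2; it does not change the length calculation.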

Properties
Property Name    Value
Description      Arsenical pump ATPase, ArsA/Get3
Mutants
Expression

Alignments
The following features are aligned
Aligned Feature    Feature Type    Alignment Location
chr2               genome          chr2:814375..817414 -
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                    Date Performed
PRJNA769958                                      2024-08-13
GSE178672                                        2023-12-15
PRJNA933693                                      2023-10-26
PRJNA943449                                      2023-07-19
PXD016054                                        2021-01-14
PXD016699                                        2021-01-08
GSE149904                                        2020-10-10
NoIMET1_WT_HS                                    2020-09-29
PXD010030                                        2020-03-16
GSE139615                                        2020-03-11
PRJNA241382                                      2020-03-11
BLASTP analysis of N. oceanica IMET1 genes       2019-07-11
InterPro analysis for N. oceanica IMET1 genes    2019-07-11
GO annotation for IMET1v2 genes                  2019-07-10
PRJNA182180                                      2019-04-25
Gene prediction for N. oceanica IMET1            2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0005524    ATP binding
GO:0016887    ATPase activity
GO:0016787    hydrolase activity
GO:0000166    nucleotide binding
Vocabulary: INTERPRO
Term         Definition
IPR027417    P-loop_NTPase
IPR016300    ATPase_ArsA/GET3
IPR025723    Anion-transp_ATPase-like_dom
Homology
BLAST of NO02G03040 vs. NCBI_GenBank
Match: EWM30283.1 (Arsenical pump ATPase, ArsA/Get3 [Nannochloropsis gaditana])

HSP 1 Score: 1100.1 bits (2844), Expect = 0.000e+0
Identity = 596/812 (73.40%), Positives = 670/812 (82.51%), Query Frame = 0
Query:   10 LAAPSRFGKMLLFLAS--GLVLPSYVEAFLVPAARLTAMSSSLAGKTCLNEQDLV---TVAVKRGALNMAASTTTIPATXXXXXXALPTLAELGEMRKPGLMQRFIFFGGKGGVGKTSTSTAIALHLADQGLRTLVISTDPAHSLGDLLDQKVSGGDPVQVLGCDNLWAMEVDTTRALNRFRALFKELDVGALATQFGVSEEILGGLGLEDFVAILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLSTLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKLVQDEDQTQHMARMSKGQAHCLQRLERGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPGSPFGDLFQTGGPIATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTEVSGLCGPGRLFALEVDTGEAVEEFRSVLQGLGGGMKGGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLKGANSLMSMFGG--XXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEALKEQDVAVRRLVVNQVLAEHVKDGYWNRLRAGQQVALTDVGKAAAAAGVHLTEVPYFDTEMRTVYALRVLADVLTKPK 815
            LAA +R     L+L S   +++P++V+ F+ PAARLT       G     EQ L     + +++   +           XXXXXX            K GL QRFIFFGGKGGVGKTSTSTAIA+HLAD+GL+TL+ISTDPAHSLGDLL+QKVSGGDPV+V GCDNLWAMEVDTTRAL+RFRALFKELDV ALA QFGVSEEIL GLGLEDFVAILNNPPPGIDELIALA+VVRLSK  G +S+ G    G+TFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLR+SKVLS+LS LFGGGS ES+ERLRATDE  AKVE+AK QM+ELR+LFRDQDATEFCIVTIATQLAVAESKRL+ AL++E IAVRHLVVNKLVQDEDQ+QHMAR+ +GQA CLQR+ERG+VA HGL++VQVPYLDVEV+G+FGLKFMAD A+ T PGS FGDLF++GGPIATRFVLFGGKGGVGKTSSS ALAVKLAD G TTA+VSTDPAHSLGDAL+MDLS G+VTEV+GL GPGRLFALEVDT EAVEEF+ VLQGLG G KG  +     G+M QL++GEFADV ESAPPGTDELVALARVLKLLKEGTPGEG++FDRI+IDTAPTGHTLRLLSFPEFLEGF+ERV++IR+RL+GA+SLM MFGG                          EG PRDRLREFQLKMIELDDLLHDPAR+EFVAVTIPTEMA+AETERLVEALKEQDVA+RR+V+NQVL E V  GYW RLRAGQQVAL DV +A AA GV +TEVPYFDTE++TVYALRVLADVLT P+
Sbjct:   10 LAAQARIWSKKLWLLSLLAVIVPNHVKGFVRPAARLT------RGINVHFEQHLQFSRPLRMRQERASRMXXXXXXXXXXXXXXXXXXXXXXXXXXXKTGLTQRFIFFGGKGGVGKTSTSTAIAVHLADKGLKTLIISTDPAHSLGDLLEQKVSGGDPVRVEGCDNLWAMEVDTTRALDRFRALFKELDVSALAAQFGVSEEILAGLGLEDFVAILNNPPPGIDELIALADVVRLSKGKGSTSKEGAAVNGLTFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRLSKVLSSLSSLFGGGSVESQERLRATDELLAKVERAKLQMMELRELFRDQDATEFCIVTIATQLAVAESKRLLAALRKEKIAVRHLVVNKLVQDEDQSQHMARIGRGQALCLQRVERGLVADHGLSTVQVPYLDVEVKGIFGLKFMADAAFETSPGSAFGDLFESGGPIATRFVLFGGKGGVGKTSSSAALAVKLADSGFTTAIVSTDPAHSLGDALEMDLSSGRVTEVAGLYGPGRLFALEVDTEEAVEEFKQVLQGLGKGTKGAMKGN---GIMDQLQVGEFADVFESAPPGTDELVALARVLKLLKEGTPGEGKRFDRIIIDTAPTGHTLRLLSFPEFLEGFLERVISIRERLRGASSLMDMFGGLAVGGKDKGFAEGDEGIGGSGLEAEDEGLPRDRLREFQLKMIELDDLLHDPARAEFVAVTIPTEMALAETERLVEALKEQDVAIRRVVINQVLGEEVSQGYWARLRAGQQVALADVERAVAAGGVQVTEVPYFDTEIKTVYALRVLADVLTGPE 812          
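
The Identity and Positives percentages in an HSP header are the reported counts divided by the alignment length, which is the denominator printed on the same line. A short Python sketch reproduces the figures for this first hit; the function name `percent` is a hypothetical helper, not part of any BLAST tool.

```python
def percent(count, aligned_length):
    """Share of alignment columns, as a percentage rounded to two decimals."""
    return round(100.0 * count / aligned_length, 2)

# Counts taken from the HSP header above: 596 identical and
# 670 positive-scoring columns out of 812 aligned columns.
identity = percent(596, 812)
positives = percent(670, 812)
print(f"Identity {identity:.2f}%, Positives {positives:.2f}%")
```

The same calculation applies to the other HSP headers on this page, for example 328/749 for the Vitrella brassicaformis hit.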
BLAST of NO02G03040 vs. NCBI_GenBank
Match: CEM11669.1 (unnamed protein product [Vitrella brassicaformis CCMP3155])

HSP 1 Score: 548.5 bits (1412), Expect = 3.500e-152
Identity = 328/749 (43.79%), Positives = 448/749 (59.81%), Query Frame = 0
Query:  108 RFIFFGGKGGVGKTSTSTAIALHLADQGLRTLVISTDPAHSLGDLLDQKVSG--GDPVQVLGCDNLWAMEVDTTRALNRFRALFKELDVGALATQFGVSEEILGGLGLEDFVAILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLSTLSGLFG--GGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKL--------VQDEDQTQHMA-------RMSKGQAHCLQRLERGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPGSPFGDLFQTGGPIATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTEV-----------SGLCGPGRLFALEVDTGEAVEEFRSVLQ----GLGGGMKGGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLK--GANSLMSMFGGXXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEALKEQDVAVRRLVVNQVLAEHV--KDG--------YWNRLRAGQQVALTDVGKAAAAAGVHLTEVPYFDTEMRTVYALRVLADVL 811
            R I FGGKGGVGKT+T+ A A+  AD+GLRTL+ISTDPAHSLGD L Q++     +   V G DNLWA+EVD  +A+   +A  +  D   L+ + G+   +L G+G+ D V +  NPPPGIDEL+A+A+V++L++  G     GG++GG  FDR+VIDTAPTGHTLRLL+ P FLDGFLGK++KL+ ++  +  +L  +     G  E  E L   D A  ++E  K +++ LR L R+ + TEFC+VTI T++A+AESKRL+ +LK+E IAV+HL+VNKL        V +  +T+ +A       ++   Q   LQRL   +  +  L    VPY DVEV G  GL++  + A+       + DLF+       R ++ GGKGGVGK+SSS ALA+K+A++G  T VVSTDPAHSLGDAL ++L GG++  +                 G+LFALE+DT   V EFR  L          +K     +   G  G   L +  D+ ++APPGTDELVAL++V  +L   T  +GR FDR++IDTAPTGHTLRLL+FPEFL+ F +R+  IRDR +  G  SL +M  G                            RDRL+EFQ KM ELDDL HDPA++EF  VTIPTE+AVAETERLV++LK + + VRRL+VNQV +      DG        Y  R   GQ   + ++ + A   GV +  VPYFD ++  +  LR +   L
Sbjct:   76 RLILFGGKGGVGKTTTAAATAVRFADEGLRTLIISTDPAHSLGDALGQQLHAPPDEMTAVEGVDNLWALEVDAGQAVIEMKAALETFDAIELSRKLGIGTTVLEGIGIGDMVKLFENPPPGIDELVAIAKVLQLTRGKG-----GGSSGGSRFDRLVIDTAPTGHTLRLLAAPQFLDGFLGKVIKLKNQLDALTRSLKKMLASVSGGTEGAEPLLDQDAALQRIEALKDRLLGLRGLLRNSETTEFCVVTIPTEMAIAESKRLLASLKKEGIAVKHLIVNKLMTAGSAEQVGEIQETRRVATLASFVEQLQSDQVKSLQRLHT-LAQEANLRLSFVPYFDVEVTGPLGLRYFGEEAFGRANAPMWSDLFEDP---KRRCIIMGGKGGVGKSSSSAALAIKMAEKGFKTLVVSTDPAHSLGDALKVNLGGGQLVRIDSEGTDLRLLLESATSRGQLFALEIDTEGTVAEFREALSTSLATTRASLKSSRNLRALTGTDGVGSLFDIEDLFDTAPPGTDELVALSKVFGILNRPT-DDGRPFDRLIIDTAPTGHTLRLLAFPEFLDSFFQRLREIRDRFQSSGFGSLGAMLAGEMEDMDLTRRNDSSSDG--------ASGRDRLKEFQDKMAELDDLFHDPAQAEFCIVTIPTELAVAETERLVKSLKHEGMLVRRLIVNQVFSVDADEDDGQKFASLSEYAQRFIEGQSSKVDEINQLARERGVGVVYVPYFDRQIDAIQGLRRVGHAL 806          
BLAST of NO02G03040 vs. NCBI_GenBank
Match: XP_005791198.1 (hypothetical protein EMIHUDRAFT_224177 [Emiliania huxleyi CCMP1516] >EOD38769.1 hypothetical protein EMIHUDRAFT_224177 [Emiliania huxleyi CCMP1516])

HSP 1 Score: 531.2 bits (1367), Expect = 5.700e-147
Identity = 319/719 (44.37%), Positives = 439/719 (61.06%), Query Frame = 0
Query:  100 MRKPGLMQRFIFFGGKGGVGKTSTSTAIALHLADQGLRTLVISTDPAHSLGDLLDQKVSGGDPVQVLGCDNLWAMEVDTTRALNRFRAL---FKELDVGALATQFGVSEEILGGLGLEDFVAILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLSTLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKLVQDEDQTQHMARMSKGQAHCLQRLERGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPGSPFGDLFQTGGPIATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTEVSGLCGPGRLFALEVDTGEAVEEFRSVLQGLGGGMKGGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLKGANSLMSMFGGXXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEALKEQDVAVRRLVVNQVLAEHVKDGYWNRLRAGQQVALTDVGKAAAAAGVHLTEVPYFDTEMRTVYALRVLADVLTKPKP 816
            M + G  QR I  GGKGGVGKTS+S+AIA+ LAD GL TL++STDPAHSL D L Q VSGG PV V GC+NL AMEV+T  A++RFRA    F+  D+G      G++EE+LG LGL++F  IL+N PPG+DEL+ALAE + + +  G   +  G A    F+RVV DTAPTGHTLRLL+FP+FLD  LGK+V L+ R+   +  L G+ GG     K      + A A++++ + ++  LR+L  D+D T+F +V I ++LAVAE  RL+ AL  + + V HL+VN                  Q   L+RL+ G      LA  ++P+ D+E+RGVF L+++A  A+R      + DL         RFVL GGKGGVGKT++S +LAV+ A +G +T +VSTDPAHSLGDAL+ DLS G+V  V G+ G   L+A EV   +AV EF+ ++ G+    +GG    Q  G      L +FAD+ ++ PPG DEL+AL++++ L +      G  FDR+VIDTAPTGHTLRLL+FP+FL+ FI R++ +R R  GA    +M GG                              +L+EFQ +M  L  LLHDP  +EF  VTI T +++ E ERL+  L+ + +AVRR VVN+++A  V DGY  RL  GQ+  L ++   AA   V +T+VPYFDTE+R+VY LR + + L    P
Sbjct:    1 MLQSGAAQRLILVGGKGGVGKTSSSSAIAVRLADTGLSTLIVSTDPAHSLSDALMQDVSGGSPVGVAGCENLQAMEVETADAVDRFRAAVSGFRAADLGL----GGLAEEVLGQLGLDEFADILDNVPPGLDELLALAETLAVVR--GAEPDSDGAASLTGFERVVFDTAPTGHTLRLLAFPEFLDSLLGKVVALKARLLAAIGLLKGVLGGNDPTDK-----IEAAVARLQRWRDRVASLRELLTDEDVTDFVVVGIPSRLAVAECARLLSALADQGVPVSHLIVN------------------QGRELKRLD-GSSPLGELALSRLPFFDLEMRGVFPLQYVASQAFRGTNADAWEDLL---ADQKDRFVLVGGKGGVGKTTTSASLAVQFATDGHSTLLVSTDPAHSLGDALETDLSSGEVVRVEGVAG-ASLYACEVKVDDAVAEFKRLVGGVSSD-EGGAASAQGLG------LSDFADIFDAVPPGVDELIALSKIVALAQR--DAYGIHFDRVVIDTAPTGHTLRLLTFPDFLDRFITRLLVLRSRFDGA---ANMLGGAQQLL------------------------GKLKEFQAQMQALQALLHDPETTEFCIVTIATALSLNEAERLLLELRREGIAVRRGVVNRLIATDVADGYVARLANGQRQCLAELDDLAARCAVDVTQVPYFDTELRSVYGLRAMGNALFDAPP 649          
BLAST of NO02G03040 vs. NCBI_GenBank
Match: OSX81440.1 (hypothetical protein BU14_0021s0045 [Porphyra umbilicalis])

HSP 1 Score: 500.0 bits (1286), Expect = 1.400e-137
Identity = 322/765 (42.09%), Positives = 440/765 (57.52%), Query Frame = 0
Query:  107 QRFIFFGGKGGVGKTSTSTAIALHLADQGLRTLVISTDPAHSLGDLLDQKVSGGDPVQVLGCDNLWAMEVDTTRALNRFRALFKEL-----DVGALATQFG----VSEEILGG-----------LGLEDFVAILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLR-------------VSKVLSTLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKLVQDEDQTQHMARMSKGQAHCLQRLERGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEP-GSPFGDLFQ--------------------------TGGPIATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTEVSGLCGPGRLFALEVDTGEAVEEFRSVLQGLGGGMKGGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLKGA-NSLMSMFGGXXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEALKEQDVAVRRLVVNQVLAEHVKDGYWNRLRAGQQVALTDVGKAAAAAGVHLTEVPYFDTEMRTVYALRVLADVL 811
            +R+IF GGKGGVGKTST+ A+A+  AD+GLRTLV+STDPAHSLGD L   ++ G    V+    L A+E DT  A+  F+A+ + +     + G  + + G    V++E  GG           LGL +F  +L+  PPG DE +AL +V+RL +              + +DRV+IDTAPTGHTLRLLSFPDFLD FL K V L+ R             +S +L+T++G  GG +   K+  RAT  A  +V   ++QM EL DLFRD   +EF +V+I T+L+VAES+RL+ AL  E I VR+LV N++V  + +  ++ R++ GQ   +QR+ R   A   L   QVP  D EVRGV+GL+ ++ +AY  +  G  +GDLF                           +G   A RFV  GGKGGVGKTS + A+ VKLAD G+ T V+STDPAHSLGDAL MDLSGGK   V        L+A+E+DT  AV +FRSV+Q L  G  GG        L  +L +GEFAD+L++ PPG DELVAL +V+ L++ G       FDR+VIDTAPTGHTLRLL+FPEF++ F+ RV+ ++ RL GA N +  +FGG                               +  F+  M  L  ++ D   ++F  VTIPT +AVAE+ERLV AL+   VAV  ++VN VL  +  + +  RL  GQ   L  +  A    GV +T+VPYFD E+R  + LR + +V+
Sbjct:  107 RRYIFVGGKGGVGKTSTAAALAVRCADEGLRTLVLSTDPAHSLGDALAVDLTSGKVTPVV--PGLDALESDTADAVAEFKAILQTVKGGTSEDGEASAKDGADGKVTKEGGGGNSAFLGKLGKQLGLSEFGEVLDTIPPGADEFVALTKVLRLVEAPDAK---------VHYDRVIIDTAPTGHTLRLLSFPDFLDAFLAKAVALRSRLDGASSLLSATGNLSSILNTVAGGSGGSTPSKKDVQRATAVAAERVAAYREQMAELSDLFRDPARSEFVVVSIPTELSVAESRRLVDALWSEGIWVRNLVANQIVPADREASYVKRLTTGQEVQIQRI-RDSEALGSLHLTQVPRFDTEVRGVYGLRALSTVAYPDDQLGDRWGDLFDATVSSGSNPEEEPAAPSVLDDAEADASGLGAAARFVFVGGKGGVGKTSCAAAMGVKLADAGIRTLVLSTDPAHSLGDALTMDLSGGKPLAVDAT--NNLLYAMEIDTAAAVSQFRSVIQSLATGTGGG----VGGDLARKLGVGEFADILDNTPPGVDELVALVQVIDLVRTG------GFDRVVIDTAPTGHTLRLLAFPEFIDAFLGRVLRLKARLDGAINKVKGLFGGGKKGEEAADASLTSASA-------------AVDRFRRNMTALRAVIQDQEATQFAVVTIPTALAVAESERLVAALRTDHVAVANVIVNLVLPANAAEPFVRRLVKGQTGCLEKLRTAVKEKGVAVTQVPYFDVEVRGEFGLRAMGEVM 834          
BLAST of NO02G03040 vs. NCBI_GenBank
Match: XP_005717752.1 (unnamed protein product [Chondrus crispus] >CDF37881.1 unnamed protein product [Chondrus crispus])

HSP 1 Score: 462.2 bits (1188), Expect = 3.300e-126
Identity = 302/781 (38.67%), Positives = 438/781 (56.08%), Query Frame = 0
Query:   35 AFLVPAARLTAMSSSLAGKTCLNEQDLVTVAVKRGALNMAASTTTIPATXXXXXXALPTLAELGEMRKPGLMQRFIFFGGKGGVGKTSTSTAIALHLADQGLRTLVISTDPAHSLGDLLDQKVSGGDPVQVLGCDNLWAMEVDTTRALNRFRALFKELDVGALATQFGVSEEILGGLGLEDFVAILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLSTLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKLVQDEDQT---QHMARMSKGQAHCLQRLERGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPGSPFGDLFQTGGPIA----TRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTEVSGLCGPGRLFALEVDTGEAVEEFRSVLQGLGGGMKGGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLKGA-NSLMSMFGGXXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEALKEQDVAVRRLVVNQVLAEHVKDGYWNRLRAGQQVALTDVGKAAAAAGVHLTEVPYFDTEMRTVYALRVLA 808
            AFL P   + + SS++  ++ L+ +     A KR  L     T  +            + A L  +      +RFIFFGGKGGVGKTS++ A+A+  AD GL TLVISTDPAHSLGD L   +S G   +V     L+A+E DT  A+ +FR L                  +   LGL++F  +L   PPG DELIAL  V+ L ++             I FDRVVIDTAPTGHTLR L+FPDFLD FL + + L+ R    L++  GL G                  +V   + +M+EL DLFRD + TEF +V+IAT+LAVAESKRL+  L  E I VRH+VVN+++ + ++    ++++++ KGQA  +      +  ++GLA   VP  D EVRG++GL+ M +IA++      +G LF     ++    ++FV  GGKGGVGKTS S AL  KLA EG  T V+STDPAHSL DAL ++L GG   E+      G LFA+E+DT  A+  F+++ +                          FA +L++ PPG DELVAL +V++L+K G       FDR+V+DTAPTGHTLRLLSFP+FL+ F+ +VM ++ RL  A ++L ++ G                              ++LRE    M+EL +L+ D  R++F  VT+PT +A+AE+ERLV +L++  + V+ +VVNQV+A+   + +  R+  GQ+  + ++ +A    G+ LT+VP+FD E+R V+ LR ++
Sbjct:   14 AFLPPVV-VPSRSSTVTPRSLLSHRKSSHFARKR--LRAKPRTVAVADVGNVAASKTASSANLDSLVAQRDQRRFIFFGGKGGVGKTSSAAAVAVECADAGLTTLVISTDPAHSLGDALRFDLSDGKMHRVDPEMGLYAIESDTREAVEQFRELV---------------TSVADKLGLQEFSEVLETIPPGADELIALVSVLDLVEQENSD---------IKFDRVVIDTAPTGHTLRFLAFPDFLDKFLTQALALRGR----LNSAKGLIGN-----------------RVAVYRDKMIELSDLFRDPERTEFVVVSIATELAVAESKRLIEKLWDEGIWVRHVVVNQILPEGNKVSVDKYLSQVRKGQAREISFATEQIADEYGLAVTIVPRFDTEVRGIYGLEAMGNIAFKENRRKSYGRLFDEDARVSEGAESQFVFVGGKGGVGKTSISAALGTKLAVEGFKTLVLSTDPAHSLADALQVELKGGAPVEIE--MPEGELFAMEIDTEAAIASFQALAKDF------------------------FASLLDNTPPGIDELVALTQVMELVKFG------DFDRVVVDTAPTGHTLRLLSFPDFLDKFLGKVMRLKKRLDSAMDTLRNVLGRKDSADAVDRAAQGV---------------EKLRE---NMVELRELVKDEERTQFAIVTVPTGLAMAESERLVRSLRKDGILVKNVVVNQVIADAAAEKFVQRILQGQERCVEELRRAGEDKGIGLTKVPFFDAEVRGVHGLRAMS 696          
BLAST of NO02G03040 vs. NCBI_GenBank
Match: XP_002178015.1 (predicted protein [Phaeodactylum tricornutum CCAP 1055/1] >EEC50829.1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1])

HSP 1 Score: 442.2 bits (1136), Expect = 3.500e-120
Identity = 287/761 (37.71%), Positives = 433/761 (56.90%), Query Frame = 0
Query:   90 ALPTLAELGEMRKPGLMQRFIFFGGKGGVGKTSTSTAIALHLA---DQGLRTLVISTDPAHSLGDLLDQ--KVSGGDPVQV---LGCDNLWAMEVDTTRALNRFRALFKELDVGALATQFGVSEEILGGLGLEDFVAILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLSTLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKL-------VQDEDQTQHMARMSKGQAHCLQRLERGVV------------AKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPGSPFGDLFQTGGPIATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTEVSGLCGP---GRLFALEVDTGEAVEEFRSVLQGLGGGMKGGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLKGANSLMSMFGGXXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEAL----KEQDVAVRRLVVNQVLAEHVKDG--YWNRLRAGQQVALTDVGKAAAA--AGVHLTEVPYFDTEMRTVYALRVLADVLTK 813
            +L  L E    R PG +   +F GGKGGVGKT+ S+A+A+ LA   ++ L+ L++STDPAHSLGD LD+  + + G PV +   L    L A EVD + AL  FR      D+  LA   GVS ++L   GL +F  +LNNPPPG+DEL+AL+ V+     + G            +D V++DTAPTGHTLRLL+ P FLDG LGKL+K++L++S + STL   F  G+ E+++R ++ D+A  ++E+ +++M  LR+  +D  +T F +VT+ T+L VAESKRL   L  + +++  +VVN+        V  E   Q+  R   GQ   + +LE  V             +   +   +VP+ DVE+ GV  L ++A   + TE  S F  L         R V+ GGKGGVGKT++S+ALAV +A +G   A++STDPAHS+GDA+++DLSGGK+ +V  +  P   G L  LE+D   A+ +F+ V+  L     GG+     AGL   L   +  +V ++ P GTDE+VALA+++ L+K+G       FDRIV+DTAPTGHTLR+LS P FL   I+R++ I +++    ++  + G                             +  L  FQL+M +L++L  D A++EF+ VT+PTE+AV E+ RL+  L     +  +  R +V NQVL +   D   + + +   Q +++ D+  A ++  A   +T++ Y DTE R V+ L+VLAD L +
Sbjct:   80 SLNKLVEDISSRSPGQLPSTVFVGGKGGVGKTTVSSALAVSLASAIEKDLKVLIVSTDPAHSLGDALDEDLRKNNGRPVAMTDSLTGGRLDACEVDASAALEDFRENIAAFDIDRLADALGVSVDLLESFGLREFSGLLNNPPPGLDELVALSNVLDSESVAKG------------YDVVIVDTAPTGHTLRLLALPKFLDGLLGKLIKIRLQLSGLASTLQTFF--GNDEAQKRAKSIDDAVNRLEQFRRKMSNLRERLQDSQSTRFVVVTVPTKLGVAESKRLAAELNYQGVSITDIVVNQCVGGIDDDVDSEALQQYYDRRKDGQKKWIAKLEEAVQDVSCSEEYKANGSSAPIGITRVPFFDVELVGVPALGYLAAQCF-TENLS-FAHLMNVDSSNEPRVVICGGKGGVGKTTTSSALAVSMASKGHKVALISTDPAHSIGDAIEIDLSGGKLVDVPLIGIPTTDGSLSVLEIDPSTAINQFKGVVDQL----IGGDDNPSDAGLRNTLR--DLQEVFDTLPAGTDEVVALAKIVNLVKKG------GFDRIVLDTAPTGHTLRMLSTPGFLAELIDRLLIIAEKVNSNTAIKMLIGS--------------SARSEDISNAAATAKSTLLSFQLQMYDLENLFADAAQTEFLIVTVPTELAVRESMRLLNDLTFESPDMPIKCRNIVANQVLGDDGNDAKTFLDHVGQTQAISVKDLEDAVSSYPAPPLITKIKYLDTEPRGVFGLKVLADELLR 798          
BLAST of NO02G03040 vs. NCBI_GenBank
Match: GAX20039.1 (hypothetical protein FisN_1Lh480 [Fistulifera solaris])

HSP 1 Score: 436.8 bits (1122), Expect = 1.500e-118
Identity = 286/743 (38.49%), Positives = 427/743 (57.47%), Query Frame = 0
Query:  108 RFIFFGGKGGVGKTSTSTAIALHLA---DQGLRTLVISTDPAHSLGDLLDQ--KVSGGDPVQV---LGCDNLWAMEVDTTRALNRFRALFKELDVGALATQFGVSEEILGGLGLEDFVAILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLSTLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKLV---QDEDQT---QHMARMSKGQAHCLQRLE------------RGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPGSPFGDLFQTGGP-IATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTE---VSGLCGPGRLFALEVDTGEAVEEFRSVLQGLGGGMKGGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLKGANSLMSMFGGXXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEAL----KEQDVAVRRLVVNQVLAEHVKD--GYWNRLRAGQQVALTDVGKAAAAAG--VHLTEVPYFDTEMRTVYALRVLADVLTK 813
            R IF GGKGGVGKTS S+A+A+ LA    + L+ L++STDPAHSLGD LD+  + S G P+Q+   L    L+A EVD   AL+ FR      D+  LA    +S +IL  LGL +F  +LNNPPPG+DEL+AL+ V+     +G             FD +V+DTAPTGHTLRLL+ P FLDG LGKL+ L+++++ + STL    G   AE  +R +  D+A  ++EK + +M  LR+  +D+  T F +VTI ++L V ESKRL+  L  + ++V  +VVN+ +   + ED T   ++  R   GQ   + +L             R   + + +   +VP+ DVE+ GV  L ++ +  Y   PG  F  L +      + + V+ GGKGGVGKT++S++LAV +A +G   A++STDPAHSLGDA+DM+L+GG++ +   +    G G L  LE+D   ++ EF+ ++  L G     +  +  AG    L   E  +V  + P GTDE+VALA+++KL+K G       +DRIV+DTAPTGHTLR+LS P F+   IER++AI +++  +NSL+ MF G                            +  L  FQ +M +L+DL  D  ++EF+ VTI +E+A  E+ RL+  L     +  + VR +VVNQVL E+ KD   + + +  GQ+V++ ++    ++ G    +T+  Y DTE R V+ LR+LAD L K
Sbjct:   83 RTIFVGGKGGVGKTSVSSALAVSLASDIQKDLKVLIVSTDPAHSLGDALDEDLRKSHGKPLQMTDPLTGGRLFACEVDAAAALDEFRENLAAFDIDQLADALNISPDILESLGLREFSGLLNNPPPGLDELVALSNVLDTDSMAG------------DFDVIVVDTAPTGHTLRLLALPKFLDGLLGKLINLRMKLAGLTSTLQAFLGNSQAE--QRAKTIDDAVNRLEKFRTKMGILREKLQDRSKTNFIVVTIPSKLGVQESKRLVSELGSQQVSVTDIVVNQCIGGQETEDTTPLEKYYERRRAGQERWMNKLSETIKEVSESNAYRANGSPNPICLTKVPFFDVELVGVPALAYLGNQCYAENPG--FAHLMEAHNERNSPKVVICGGKGGVGKTTTSSSLAVTMAAKGHRVALISTDPAHSLGDAIDMNLAGGRLVDCPLIGVPPGDGSLSVLEIDPAASLSEFKGLVDQLVG---VDDASEADAGFRNTLR--EIQEVFNTLPAGTDEVVALAKIIKLVKNG------DYDRIVLDTAPTGHTLRMLSTPGFIAELIERLLAIAEKV-NSNSLVKMFIG-------------GSSRSEQIANAAATAKSALLSFQFQMYDLEDLFADAEQTEFLIVTIASELAARESIRLLNDLTFEAPDMPIKVRNVVVNQVLEENDKDLKNFVSHVSHGQKVSIDNLESFLSSMGNPPRVTKCEYMDTEPRGVFGLRMLADQLLK 784          
BLAST of NO02G03040 vs. NCBI_GenBank
Match: GAX14284.1 (hypothetical protein FisN_1Hh480 [Fistulifera solaris])

HSP 1 Score: 436.0 bits (1120), Expect = 2.500e-118
Identity = 285/743 (38.36%), Positives = 428/743 (57.60%), Query Frame = 0
Query:  108 RFIFFGGKGGVGKTSTSTAIALHLA---DQGLRTLVISTDPAHSLGDLLDQ--KVSGGDPVQV---LGCDNLWAMEVDTTRALNRFRALFKELDVGALATQFGVSEEILGGLGLEDFVAILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLSTLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKLV---QDEDQT---QHMARMSKGQAHCLQRLE------------RGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPGSPFGDLFQTGGP-IATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTE---VSGLCGPGRLFALEVDTGEAVEEFRSVLQGLGGGMKGGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLKGANSLMSMFGGXXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEAL----KEQDVAVRRLVVNQVLAEHVKD--GYWNRLRAGQQVALTDVGKAAAAAG--VHLTEVPYFDTEMRTVYALRVLADVLTK 813
            R IF GGKGGVGKTS S+A+A+ LA    + L+ L++STDPAHSLGD LD+  + S G P+Q+   L    L+A EVD   AL+ FR      D+  LA    +S +IL  LGL +F  +LNNPPPG+DEL+AL+ V+     +G             FD +V+DTAPTGHTLRLL+ P FLDG LGKL+ L+++++ + STL    G   AE  +R +A D+A  ++EK + +M  LR+  +D+  T F +V I ++L V ESKRL+  L  + ++V  +VVN+ +   + ED T   ++  R   GQ   + +L             R   + + +   +VP+ DVE+ GV  L ++ +  Y   PG  F  L +      + + V+ GGKGGVGKT++S++LAV +A +G   A++STDPAHSLGDA+DM+L+GG++ +   +    G G L  LE+D   ++ EF+ ++  L G     +  +  AGL   L   E  +V  + P GTDE+VALA+++KL+K G       +DRIV+DTAPTGHTLR+LS P F+   IER++AI +++  +NSL+ MF G                            +  L  FQ +M +L+DL  D  ++EF+ VTI +E+A  E+ RL+  L     +  + VR +VVNQVL ++ KD   + + +  GQ++++ ++    ++ G    +T+  Y DTE R V+ LR+LAD L K
Sbjct:   83 RTIFVGGKGGVGKTSVSSALAVSLASDIQKDLKVLIVSTDPAHSLGDALDEDLRKSHGKPLQMTDPLTGGRLFACEVDAAAALDEFRENLAAFDIDQLAEALNISPDILESLGLREFSGLLNNPPPGLDELVALSNVLDNDSMAG------------DFDVIVVDTAPTGHTLRLLALPKFLDGLLGKLINLRMKLAGLTSTLQAFLGNSQAE--QRAKAIDDAVNRLEKFRTKMGILREKLQDRSKTNFIVVAIPSKLGVQESKRLVSELGSQQVSVTDIVVNQCIGYQETEDTTPLEKYYERRRSGQERWINKLSATIKEVSESNAYRANGSPNPICLTKVPFFDVELVGVPALAYLGNQCYAENPG--FSHLMEAHNERDSPKVVICGGKGGVGKTTTSSSLAVTMAAKGHRVALISTDPAHSLGDAIDMNLAGGRLVDCPLIGVPPGDGSLSVLEIDPAASLSEFKGLVDQLVG---VDDASEADAGLRNTLR--EIQEVFNTLPAGTDEVVALAKIIKLVKNG------DYDRIVLDTAPTGHTLRMLSTPGFIAELIERLLAIAEKV-NSNSLVKMFIG-------------GSSRSEQIANAAATAKSALLSFQFQMYDLEDLFADAEQTEFLIVTIASELAARESIRLLNDLTFEAPDMPIKVRNVVVNQVLEDNDKDLKNFVSHVSHGQKISIDNLESFLSSMGNPPRVTKCEYMDTEPRGVFGLRMLADQLLK 784          
BLAST of NO02G03040 vs. NCBI_GenBank
Match: PSC69048.1 (arsenical pump-driving ATPase [Micractinium conductrix])

HSP 1 Score: 427.2 bits (1097), Expect = 1.200e-115
Identity = 290/750 (38.67%), Positives = 414/750 (55.20%), Query Frame = 0
Query:   69 GALNMAASTTTIPATXXXXXXALPTLAELGEMRKPGLMQRFIFFGGKGGVGKTSTSTAIALHLADQGLRTLVISTDPAHSLGDLLDQKVSGGDPVQVLGCD-NLWAMEVDTTRALNRFRALFKELDVGALATQFGVSEEILGGLGLEDFV---------AILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLSTLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKLVQDEDQTQHMARMSKGQAHCLQRLERGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPGSPFGDLFQTGGPIATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTEVSGLCGPGRLFALEVDTGEAVEEFRSVLQGLGGGMK-----GGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLKGANSLMSMFGGXXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEALKEQDVAVRRLVVNQVLAEHVKDGYWNRLRAGQQVALTDVGKAAAAAGVHLTEVPYFDTEMRTVYAL 804
            G    AAS    P          P      E+   G  +++I   GKGGVGKTS + ++A+  A +G  TLV+STDPAHSLGD L Q ++GG PV + G D  LW ME+D  +     +A FK    G    +   +++ +GG GL   V          +L++PPPG DE +A+++V++  K    +           F R+V DTAPTGHTLRLL+ PDF++  L K+V+L+ ++S    T+ GLFG GS++        DEA  K+E+ +  +  ++ LFRDQ ATEF I TI T L V ES RL+ AL++E+I  + +VVN+L+     ++++    K Q   L  ++ G    H L +++ PYLD+EVRGV  L +      RT  G+   +L    G    ++ + GGKGGVGKTS S +LAV LA+ G TT VVSTDPAHSL D+LD D+SGGK  EV G    G +F +E+D   A +E R  L G   G K     G       A  +  L LGE   +L++ PPG DE +A+A+V++ LK+    E  +F RI+ DTAPTGHTLRLL+ P+FL+  + +++ +R +L      ++ F G                        +    ++L  F+  M E  ++  +PA +EFV VTIPT MA AE+ RL +ALKE+ V +R LVVNQVL  +++D Y    RA QQ +L  + +      + L E P FD E+R V AL
Sbjct:   43 GTATAAASGRRRPLVYAAAAVEAPAATPFEEL-SAGTERKYIMVSGKGGVGKTSLAASLAVRFAQEGHTTLVVSTDPAHSLGDSLAQDLAGGVPVLIEGTDLPLWGMEIDPEQE----KAKFKAWSAGQGKQE---AKDFMGGFGLGSVVEQLADLKLGELLDSPPPGFDEAVAISKVLQFVKGEEYA----------RFSRIVFDTAPTGHTLRLLTVPDFVEASLAKIVRLRKKLSGASQTVRGLFGAGSSQ--------DEAVDKLEQLQDSIRLVKALFRDQQATEFIIATIPTVLGVNESGRLIRALRKESIPCKRIVVNQLIGPNMGSKYLEMKVKDQEKALAMID-GDPGLHDLRTLRAPYLDLEVRGVPALGYFG----RTLWGNFAPELAAGKG---RKYFMLGGKGGVGKTSCSASLAVTLAEAGHTTLVVSTDPAHSLSDSLDQDVSGGKPIEVEGT--NGGVFGMEIDLDMARQELRE-LSGADEGRKLDDVLGSVGLSGVADQLKDLRLGE---LLDTPPPGIDEAIAIAKVIQFLKD---PEYARFTRIIFDTAPTGHTLRLLALPDFLDTSVGKILRLRQKLTSIKDSVTGFFG---------------------SKEKDASAEKLDSFKDYMAEARNVFRNPATTEFVIVTIPTAMAAAESIRLAKALKEEQVPIRTLVVNQVLQSNLQDKYLQTRRADQQRSLQRLKEDPELGKLQLIEAPLFDLEVRGVPAL 728          
BLAST of NO02G03040 vs. NCBI_GenBank
Match: PRW56240.1 (ATPase ASNA1-like protein 2-like [Chlorella sorokiniana])

HSP 1 Score: 421.8 bits (1083), Expect = 4.900e-114
Identity = 277/713 (38.85%), Positives = 399/713 (55.96%), Query Frame = 0
Query:  104 GLMQRFIFFGGKGGVGKTSTSTAIALHLADQGLRTLVISTDPAHSLGDLLDQKVSGGDPVQVLGCD-NLWAMEVDTTRALNRFRALFKELDVGALATQFGVSEEILGGLGLEDFV---------AILNNPPPGIDELIALAEVVRLSKESGGSSELGGTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLSTLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFCIVTIATQLAVAESKRLMMALKQENIAVRHLVVNKLVQDEDQTQHMARMSKGQAHCLQRLERGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPGSPFGDLFQTGGPIATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVSTDPAHSLGDALDMDLSGGKVTEVSGLCGPGRLFALEVDTGEAVEEFRSVLQGLGGGMKGGN--RRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLKLLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRLKGA-NSLMSMFGGXXXXXXXXXXXXXXXXXXXXXXXXEGPPRDRLREFQLKMIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEALKEQDVAVRRLVVNQVLAEHVKDGYWNRLRAGQQVALTDVGKAAAAAGVHLTEVPYFDTEMRTVYAL 804
            G  +++I   GKGGVGKTS + ++A+  A  G  TLV+STDPAHSLGD L Q +SGG P+ V G D  LW ME+DT R   RF+A        +       +++ +GG GL   V          +L++PPPG DE +A+++V++  K    +           F R+V DTAPTGHTLRLL+ PDF++  L K+ +L+ ++S    T+ GLFG   ++        DEA  K+E+ +  +  ++ LFRDQ ATEF I TI T L V ES RL+ AL++E I  R +VVN+++     T+++    K Q   L  ++      HGL  ++ PYLD+EVRGV  L +     +    G    DL    G    ++ L GGKGGVGKTS S++LAV LA  G TT VVSTDPAHSL D+LD D+SGG+  EV G    G+++ +E+D   A +E R  L G   G K  +       AG+  QL+     ++L++ PPG DE +A+A+V++ LK+    E  +F RIV DTAPTGHTLRLL+ P+FL+  + +++ +R ++    +S+   F G                        +    ++L  F+  M E  ++  +P+ +EFV VTIPT MA AE+ RL +AL+++ V +R LVVNQ+L   ++D Y    RA QQ AL  +        + L E P FD E+R V AL
Sbjct:   77 GTDRKYIMVSGKGGVGKTSLAASLAVRFAQAGHTTLVVSTDPAHSLGDSLAQDLSGGVPILVEGTDLPLWGMEIDTEREKERFKA-------WSAGKGKEEAKDFMGGFGLGSVVEQLADLKLGELLDSPPPGFDEAVAISKVLQFVKGEEYA----------RFTRIVFDTAPTGHTLRLLTVPDFVEASLAKITRLRRKLSSASQTIRGLFGADGSQ--------DEAVDKLEQLQDSIRLVKALFRDQQATEFIIATIPTVLGVNESGRLIRALRKERIPCRRIVVNQIIGAGMGTKYLQMKEKDQERALAMIDED-PGLHGLRHLRAPYLDMEVRGVPALSYFGQQVW----GGVVDDLAAGQG---RKYFLLGGKGGVGKTSCSSSLAVALASAGHTTLVVSTDPAHSLSDSLDQDVSGGRPVEVQGT--NGQVYGMEIDLELARQELRE-LSGQDEGRKLDDILGSVGLAGVADQLKDLRLGELLDTPPPGVDEAIAIAKVIQFLKD---PEYARFTRIVFDTAPTGHTLRLLALPDFLDTSVGKILRLRQKIMSVKDSVTGFFSG---------------------SSEKDASSEKLDAFKDYMAEARNVFRNPSTTEFVIVTIPTAMAAAESIRLAKALQKEQVPIRTLVVNQLLPPGLQDKYLQTRRADQQRALQRLRADPELGQLQLIEAPLFDLEVRGVPAL 729          
The following BLAST results are available for this feature:
BLAST of NO02G03040 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name        E-value       Identity    Description
EWM30283.1        0.000e+0      73.40       Arsenical pump ATPase, ArsA/Get3 [Nannochloropsis ...
CEM11669.1        3.500e-152    43.79       unnamed protein product [Vitrella brassicaformis C...
XP_005791198.1    5.700e-147    44.37       hypothetical protein EMIHUDRAFT_224177 [Emiliania ...
OSX81440.1        1.400e-137    42.09       hypothetical protein BU14_0021s0045 [Porphyra umbi...
XP_005717752.1    3.300e-126    38.67       unnamed protein product [Chondrus crispus] >CDF378...
XP_002178015.1    3.500e-120    37.71       predicted protein [Phaeodactylum tricornutum CCAP ...
GAX20039.1        1.500e-118    38.49       hypothetical protein FisN_1Lh480 [Fistulifera sola...
GAX14284.1        2.500e-118    38.36       hypothetical protein FisN_1Hh480 [Fistulifera sola...
PSC69048.1        1.200e-115    38.67       arsenical pump-driving ATPase [Micractinium conduc...
PRW56240.1        4.900e-114    38.85       ATPase ASNA1-like protein 2-like [Chlorella soroki...
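
The hit table can be filtered programmatically once the E-values are parsed as numbers. The sketch below is a minimal Python example that uses the accessions, E-values, and identities listed above. The function `select_hits` and the thresholds (E-value at most 1e-100, identity at least 40%) are illustrative assumptions, not cutoffs used by the database.

```python
# (accession, E-value as printed, percent identity) copied from the table above.
hits = [
    ("EWM30283.1", "0.000e+0", 73.40),
    ("CEM11669.1", "3.500e-152", 43.79),
    ("XP_005791198.1", "5.700e-147", 44.37),
    ("OSX81440.1", "1.400e-137", 42.09),
    ("XP_005717752.1", "3.300e-126", 38.67),
    ("XP_002178015.1", "3.500e-120", 37.71),
    ("GAX20039.1", "1.500e-118", 38.49),
    ("GAX14284.1", "2.500e-118", 38.36),
    ("PSC69048.1", "1.200e-115", 38.67),
    ("PRW56240.1", "4.900e-114", 38.85),
]

def select_hits(hits, max_evalue, min_identity):
    """Return accessions that pass both thresholds (hypothetical helper)."""
    selected = []
    for accession, evalue_text, identity in hits:
        # float() accepts the printed notation, including "0.000e+0".
        if float(evalue_text) <= max_evalue and identity >= min_identity:
            selected.append(accession)
    return selected

strong = select_hits(hits, max_evalue=1e-100, min_identity=40.0)
print(strong)
```

With these example thresholds every hit passes the E-value cutoff, so the identity cutoff alone decides which accessions are kept.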

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name    Unique Name    Species                                         Type
nonsL016        nonsL016       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region
ncniR032        ncniR032       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region
ngnoR000        ngnoR000       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name    Unique Name    Species                                       Type
NSK006952       NSK006952      Nannochloropsis salina (N. salina CCMP1776)   gene


The following polypeptide feature(s) derive from this gene:

Feature Name    Unique Name             Species                                         Type
NO02G03040.1    NO02G03040.1-protein    Nannochloropsis oceanica (N. oceanica IMET1)    polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name                 Unique Name    Species                                            Type
jgi.p|Nanoce1779_2|574544    gene_893       Nannochloropsis oceanica (N. oceanica CCMP1779)    gene
Naga_100003g182              gene304        Nannochloropsis gaditana (N. gaditana B-31)        gene


The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Species                                         Type
NO02G03040.1    NO02G03040.1    Nannochloropsis oceanica (N. oceanica IMET1)    mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO02G03040 ID=NO02G03040|Name=NO02G03040|organism=Nannochloropsis oceanica|type=gene|length=3040bp
TGTTCTATTGAGTATTGGTGGGGAGGTTGCTTTTCATGGAAGGAATAGGC
ATCACACGACAAGTCAAGGTATTTCAATGTCTTATTCCAGTTTATTGCTG
ATTGTGGTCGGCAAAATAAGGTAGCAAGCGGAAATGGCCATTTAAATGTG
TCTTATCGAACCTTAATAATGAACGTCGTTTTACCAAACGCGATAATCCC
AATTTTGGGGAGAGGGGGGAGGGAGCTGCATGTGTCGGAGATGTTACATT
AGGATGGAGTCCTTGATTTGTATCTTACATAGGCATCCACGCGCCCATAC
CATAGCCAATACAGCCAAGACATGCGGGGAAGAAGAGAAGGGAAGGGCCT
TGCGGCTCCGTCACGTTTTGGCAAAATGCTCCTCTTCCTCGCCTCGGGCC
TCGTGTTGCCATCGTATGTGGAAGCCTTCCTCGTCCCGGCCGCCCGGCTC
ACGGCCATGAGCAGCAGCCTCGCAGGAAAGACCTGTCTAAACGAGCAAGA
TTTAGTGACGgtacgtgtgcctcgaatgttcgtagtggagcctcaggcat
tcctattcagagaaaacttgtctgtgaaaaccttgtatgtgctgaatgat
tcccctcctccaacctcctcagGTGGCAGTGAAAAGAGGAGCACTCAACA
TGGCAGCCAGCACCACAACAATCCCAGCTACCTCCTCCTCCACGACAACG
GCCTTGCCTACCCTGGCAGAGCTTGGCGAAATGCGCAAGCCCGGGCTAAT
GCAGCGCTTCATTTTCTTTGGAGGGAAGGGTGGGGTGGGCAAGACGTCGA
CCTCCACGGCCATTGCATTGCACTTGGCCGATCAAGGCCTACGCACGCTT
GTCATCAGCACGGATCCAGCACATTCTCTGGGAGATTTGCTTGACCAGAA
GGTGTCGGGCGGTGACCCGGTGCAGGTGCTGGGGTGTGATAATTTGTGGG
CCATGGAGGTGGACACAACGCGGGCACTCAACCGCTTCCGGGCGCTCTTC
AAAGAGCTGGACGTGGGGGCTTTGGCGACACAATTCGGGGTTTCGGAGGA
AATATTGGGTGGACTGGGCCTAGAAGATTTTGTTGCCATTCTCAATAACC
CTCCTCCTGGCATCGATGAGCTCATTGCCCTGGCGGAGGTGGTGCGTCTG
TCAAAGGAAAGCGGCGGGAGCAGTGAGTTGGGAGGGACCGCCGGCGGGAT
CACATTTGATCGTGTGGTCATTGACACGGCACCCACAGGACATACTCTGC
GGTTGCTGAGTTTTCCTGATTTTTTGGATGGCTTTCTGGGTAAGCTGGTG
AAGCTCCAGCTACGTGTGTCCAAGGTGCTCTCGACGCTCTCAGGCCTGTT
TGGGGGAGGGTCGGCTGAATCGAAGGAGCGGCTGCGGGCGACCGACGAGG
CGTTTGCGAAGGTGGAGAAGGCAAAGCAGCAGATGGTGGAGTTGCGGGAT
TTGTTTCGAGACCAAGACGCAACCGAGTTTTGTATAGTCACGATCGCTAC
GCAGCTGGCTGTGGCGGAGTCGAAGCGGCTGATGATGGCGTTGAAGCAGG
AAAATATTGCCGTTAGGCACCTGGTCGTGAATAAATTGGTGCAAGACGAA
GACCAAACGCAGCATATGGCGAGGATGAGTAAGGGCCAGGCCCATTGTTT
GCAGCGGTTGGAGCGCGGGGTGGTAGCAAAGCACGGGCTGGCTTCTGTTC
AGGTGCCTTACTTAGACGTGGAGGTCCGTGGGGTTTTTGGTCTTAAATTC
ATGGCGGATATTGCTTACAGAACAGAACCGGGATCGCCGTTTGGGGATCT
GTTTCAAACAGGCGGGCCGATTGCGACAAGGTTTGTCTTGTTTGGTGGGA
AGGGGGGGGTCGGGAAGACGTCGTCCTCGACGGCTTTGGCTGTGAAGCTG
GCAGATGAGGGCCTCACGACCGCGGTGGTGAGCACGGACCCGGCACATTC
CTTGGGGGATGCGTTGGACATGGATTTATCGGGTGGGAAGGTGACGGAGG
TAAGCGGGTTATGTGGGCCGGGGCGACTGTTTGCCTTGGAGGTGGACACG
GGTGAGGCGGTGGAGGAGTTTAGGAGCGTGCTGCAAGGGTTGGGAGGGGG
AATGAAGGGGGGGAATAGAAGGAAGCAGAAGGCGGGATTGATGGGCCAGT
TGGAGTTGGGGGAGTTTGCGGATGTGCTCGAGTCAGCGCCCCCGGGGACA
GATGAGTTGGTGGCGTTGGCACGAGTCTTGAAGCTATTGAAGGAGGGGAC
GCCGGGGGAGGGGAGAAAATTTGATAGGATCGTTATTGACACAGCACCCA
CCGGGCATACTTTGCGACTGTTGAGCTTTCCCGAGTTTTTAGAGGGCTTT
ATTGAGAGGGTGATGGCGATCAGGGATCGTCTGAAGGGAGCGAATTCGTT
GATGAGTATGTTTGGGGGGTTAGGGGGAGGGGCAGCAGAAAGCGGGAACG
ACGGCGAGGAGGAAGAAGAGGAGAAGGAGGCGGTGGAGGACGAGGGGCCT
CCGAGGGATCGATTGAGGGAGTTTCAGTTGAAGATGATTGAGTTGGATGA
TTTGCTTCATGACCCAGCAAGATCCGAGTTTGTGGCTGTGACTATACCGA
CAGAGATGGCCGTGGCGGAGACGGAGCGATTGGTAGAAGCCCTGAAGGAG
CAAGATGTTGCCGTGCGTCGCTTAGTTGTGAATCAAGTCCTGGCCGAGCA
TGTGAAGGACGGGTATTGGAATCGGCTTCGCGCCGGTCAGCAGGTGGCGT
TGACAGATGTGGGGAAGGCAGCGGCGGCGGCGGGGGTGCACTTGACGGAG
GTGCCGTATTTCGACACGGAGATGCGGACAGTGTACGCGCTGAGGGTGTT
GGCTGACGTGCTGACTAAGCCCAAGCCGTAAGGTAGAAGGGAAAGAGTGA
ATAGAGGTGTGAGCAGGAAGCTGCCCCGGGTGATATCTCATGGAAGGATT
TAGTAGGAGACGGTAAGAAAGAAGACTCTCATGCACTGATGTAAAAAATA
CACAATTTAAAAACAAAATACTCACAAAAAGATACATCCA
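
The gene sequence contains a single lowercase stretch. Reading it as an intron is an assumption based on a common FASTA convention, but it fits the data: the stretch begins with gt and ends with ag, and removing it joins ...GTGACG to GTGGCAGTG..., which reads in frame as the residues LVTVAV found in the protein sequence of NO02G03040.1. A minimal Python sketch for locating such runs follows; the helper name `lowercase_runs` is hypothetical, and the snippet is the three sequence lines that contain the lowercase stretch.

```python
import re

def lowercase_runs(sequence):
    """Return (start, end, text) per lowercase run; start is 0-based, end is exclusive."""
    return [(m.start(), m.end(), m.group()) for m in re.finditer(r"[a-z]+", sequence)]

# Three consecutive lines of the gene sequence above, joined without line breaks.
snippet = (
    "TTTAGTGACGgtacgtgtgcctcgaatgttcgtagtggagcctcaggcat"
    "tcctattcagagaaaacttgtctgtgaaaaccttgtatgtgctgaatgat"
    "tcccctcctccaacctcctcagGTGGCAGTGAAAAGAGGAGCACTCAACA"
)

runs = lowercase_runs(snippet)
start, end, intron = runs[0]
# Splice the run out to obtain the exon-only sequence of the snippet.
spliced = snippet[:start] + snippet[end:]
print(len(intron), intron[:2], intron[-2:])
```

The offsets returned here are relative to the snippet, not to the full 3040 bp gene sequence.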

protein sequence of NO02G03040.1

>NO02G03040.1-protein ID=NO02G03040.1-protein|Name=NO02G03040.1|organism=Nannochloropsis oceanica|type=polypeptide|length=816bp
MRGRREGKGLAAPSRFGKMLLFLASGLVLPSYVEAFLVPAARLTAMSSSL
AGKTCLNEQDLVTVAVKRGALNMAASTTTIPATSSSTTTALPTLAELGEM
RKPGLMQRFIFFGGKGGVGKTSTSTAIALHLADQGLRTLVISTDPAHSLG
DLLDQKVSGGDPVQVLGCDNLWAMEVDTTRALNRFRALFKELDVGALATQ
FGVSEEILGGLGLEDFVAILNNPPPGIDELIALAEVVRLSKESGGSSELG
GTAGGITFDRVVIDTAPTGHTLRLLSFPDFLDGFLGKLVKLQLRVSKVLS
TLSGLFGGGSAESKERLRATDEAFAKVEKAKQQMVELRDLFRDQDATEFC
IVTIATQLAVAESKRLMMALKQENIAVRHLVVNKLVQDEDQTQHMARMSK
GQAHCLQRLERGVVAKHGLASVQVPYLDVEVRGVFGLKFMADIAYRTEPG
SPFGDLFQTGGPIATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVS
TDPAHSLGDALDMDLSGGKVTEVSGLCGPGRLFALEVDTGEAVEEFRSVL
QGLGGGMKGGNRRKQKAGLMGQLELGEFADVLESAPPGTDELVALARVLK
LLKEGTPGEGRKFDRIVIDTAPTGHTLRLLSFPEFLEGFIERVMAIRDRL
KGANSLMSMFGGLGGGAAESGNDGEEEEEEKEAVEDEGPPRDRLREFQLK
MIELDDLLHDPARSEFVAVTIPTEMAVAETERLVEALKEQDVAVRRLVVN
QVLAEHVKDGYWNRLRAGQQVALTDVGKAAAAAGVHLTEVPYFDTEMRTV
YALRVLADVLTKPKP*
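
The P-loop_NTPase and ATP binding annotations can be spot-checked against the protein sequence by searching for a Walker A (P-loop) motif. The sketch below assumes the commonly cited consensus [AG]-x(4)-G-K-[ST], which is not stated on this page, and uses a hypothetical helper `find_walker_a`. The two test strings are the third and tenth lines of the protein sequence above.

```python
import re

# Assumed Walker A consensus: A or G, any four residues, then G, K, and S or T.
WALKER_A = re.compile(r"[AG].{4}GK[ST]")

def find_walker_a(protein):
    """Return (0-based start, matched text) for each non-overlapping motif match."""
    return [(m.start(), m.group()) for m in WALKER_A.finditer(protein)]

first_line = "RKPGLMQRFIFFGGKGGVGKTSTSTAIALHLADQGLRTLVISTDPAHSLG"
second_line = "SPFGDLFQTGGPIATRFVLFGGKGGVGKTSSSTALAVKLADEGLTTAVVS"

first_hits = find_walker_a(first_line)
second_hits = find_walker_a(second_line)
print(first_hits, second_hits)
```

Both lines contain the same GKGGVGKT stretch, which is consistent with the BLAST alignments above, where each hit also shows two GGKGGVGKT regions. The start offsets are relative to each 50-residue line, not to the full protein.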
Synonyms
Publications