NO03G01590, NO03G01590 (gene) Nannochloropsis oceanica

Overview
Name: NO03G01590
Unique Name: NO03G01590
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 22251
Alignment location: chr3:487795..510045 -
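
The sequence length and the alignment location in the overview can be cross-checked against each other. The sketch below assumes the coordinates are 1-based and inclusive (the usual convention for this kind of location string, not something the page states); `parse_location` and `feature_length` are hypothetical helper names used only for illustration.

```python
import re

# Pattern for location strings of the form "chr3:487795..510045 -".
# Assumption: start and end are 1-based and inclusive.
LOCATION_RE = re.compile(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])")


def parse_location(loc):
    """Split a location string into (seqid, start, end, strand)."""
    m = LOCATION_RE.fullmatch(loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)


def feature_length(start, end):
    """Length of a feature with 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr3:487795..510045 -")
print(seqid, strand, feature_length(start, end))
```

Under the inclusive-coordinate assumption the computed length matches the reported sequence length of 22251, and the trailing `-` indicates the gene lies on the reverse strand of chr3.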

Properties
Property Name    Value
Description      Phospholipid-transporting ATPase
Alignments
The following features are aligned
Aligned Feature    Feature Type    Alignment Location
chr3               genome          chr3:487795..510045 -
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                   Date Performed
GSE178672                                       2023-12-15
PRJNA933693                                     2023-10-26
PRJNA943449                                     2023-07-19
GSE149904                                       2020-10-10
NoIMET1_WT_HS                                   2020-09-29
GSE139615                                       2020-03-11
PRJNA241382                                     2020-03-11
BLASTP analysis of N. oceanica IMET1 genes      2019-07-11
InterPro analysis for N. oceanica IMET1 genes   2019-07-11
GO annotation for IMET1v2 genes                 2019-07-10
PRJNA182180                                     2019-04-25
Gene prediction for N. oceanica IMET1           2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0000166    nucleotide binding
GO:0005524    ATP binding
GO:0000287    magnesium ion binding
GO:0004012    phospholipid-translocating ATPase activity
Vocabulary: Cellular Component
Term          Definition
GO:0016021    integral component of membrane
GO:0016020    membrane
Vocabulary: Biological Process
Term          Definition
GO:0015914    phospholipid transport
GO:0006810    transport
Vocabulary: INTERPRO
Term         Definition
IPR036412    HAD-like_sf
IPR008250    ATPase_P-typ_transduc_dom_A
IPR023299    ATPase_P-typ_cyto_domN
IPR023298    ATPase_P-typ_TM_dom
IPR006539    P-type_ATPase_IV
IPR032630    P_typ_ATPase_c
IPR032631    P-type_ATPase_N
Homology
BLAST of NO03G01590 vs. NCBI_GenBank
Match: EWM29580.1 (putative phospholipid-transporting atpase ia isoform 1 [Nannochloropsis gaditana])

HSP 1 Score: 1755.3 bits (4545), Expect = 0.000e+0
Identity = 1008/1620 (62.22%), Positives = 1164/1620 (71.85%), Query Frame = 0
Query:    1 MSQHYGAAA-----SGEPLIPPSPRPPRNDNELATVREAQRQQ-CVRSPILDMVLRFITPRSPDDHHPSXXXXXXXXXXXXXXXTCIRRFHVLSPTYPMHVRTTKDGKKRRFRYPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNG---EGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFVTAADLQGWEAAVECEAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDYLGGPSAFLRMNCQATAKEASELGMWVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLE--GGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQG-HRFSLPVD---PFALRGKALQELAELSNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLPLSYQAESPDEEALVNAVARELNWSFLGRTPTSALVLNPQNQRLTYQVLAVLPFTSTRKRMSVIVRTP-DNKIVLLTKGADNVIFERATAYLGTTRQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAVEGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGR--REEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRGEGVAAGMLSEPLVEHDVDEDDAMPY------------SHMSKNPLKDVQSDHLALIIDGPALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFMLAMPPICFGLFDRDLSADTIMGNATPDA----RQSERMQHASSTTTIDHPATILAISRTDYNAAATTASTA-SFRWAYMSGRDNLDLNLGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITFTGMLVGMLYKAATNTXXXXXXXXXXXXGSALLYVIFLAIYGGLPIVAGGFYGVPGRMVRHPSFWLIGVSLVPTVSVVVDYIFIYLRLSFFPSPVDFAMEYDRGYVIPEAAPAPGARREERARTVEEEEAARG----EHAPFKGSEAAQAEEEAMERRRSGWYPGKLLVSLPLLRRLNSRISQKDKEEMGMMEAEGGLMPSSYDYTSSSFDLGPGAGHNGGP-GEGKVLARRHSRG 1581
            MSQ  G  A     S +P   PS + P + NELA VR + RQQ C+ S ++ +V R+  PRSPD++ P                   R FHVL+  YP H R TK GK +R +YPDNSVSTAKYN+FTF+PRALFEQFRRLANIYFLVVTVLMLIGTYSD Y+SPLTP+TTL PL VVL +TMGKEAFED KRHTADQ TNNR ARV+RL        G    G +EEV W++IGVGR+V+V D+EEIPAD++LLTSS+  GNC +ETSNIDGETNLKIKEAART E+G G AF  A +LQ W AA+ CEAPN+RIH++TGTL L  K      ++RRV V+QANLLLRGSRLRNT+WALGL VYTG  TKIVMNSR APSKLS IEVTTNRLLYLILG+Q+LLV+VTL AYLVWTD  +SQLHYLCM+YL  PS+FLRMNCQ +  +AS+LGMW+TFLLLYNNFVPISLYVTVEMVNYIQA+YIDQDLSMYD SSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGG ++GN+E    + EE                     S  SG +   ++     +LP +    F  RG+ L+ELA           ++ I KT  S   AA    YFAECLAVCHTV                                             P  K     YQAESPDEEALV A A EL W F GR+ T ALV  P  +RLTYQVLA LPFTSTRKRMSVIVR P + K+VLL KGAD+V+FERA+ +LG  R+ LDAHLS FA+DGLRTLVLAR+E++E++F++WL E++KA+TAVE R E +A++ E +EREL V+GATAIEDKLQ+GVP+TIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLH++M LIKV+D K +G  +  R E E  AALR+QLRKLV HFE L+ED TLV GL  SSR   G    K   LWR  + R   +  R++R    T L         + GAG    G GV   ML EPLVE D +ED    Y            S  S NPL+DVQ+DHLALI+DGP+L R+FGDWEMERLLLRVA LCKSVVACRVSPAQKRMLIRLVKKGVKHP PITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAI+QFRF+E LLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSL EDYVYTSYNFMLAMPPICFGLFDRD+S +TI G A  DA     +S+       +   ++  T L  + +  +   T+  T  SFRW YMSGR+NLDLN+GQMALWL QAILDS+LIFGFSFGAM+ P QVLS+ G VDDLY LGLITFTGML+GML KAATN XXXXXXXXXXXXGSALL+ +FLA+YG LP+  G F+GVP +   HPSFWLIGV LVPTV V+VDY FIYLRLSFFPSPVD A+EYDRGY       +          +  EEE        +   F  S+ ++++    +R + GWYPGKLLVSLPLLRRLN+R+SQK+K+EMG+ +AEGGL+PSSYDYTSSSFDLGPGAG    P G+  +L +  S G
Sbjct:    1 MSQGDGGVAGPVTPSFKPPSQPSIQSPPHSNELAMVRGSPRQQWCLLSSVVALV-RWFKPRSPDEYRPCPPTASSETTDEARIAP-TRHFHVLTADYPEHTRETKSGKPKRVKYPDNSVSTAKYNLFTFIPRALFEQFRRLANIYFLVVTVLMLIGTYSDFYESPLTPYTTLIPLCVVLTITMGKEAFEDLKRHTADQKTNNRIARVVRLERPGCRGAGGDENGGIEEVRWRDIGVGRVVQVRDREEIPADLVLLTSSDAGGNCYVETSNIDGETNLKIKEAARTAEDGGGPAFWKAEELQAWGAAMVCEAPNARIHSYTGTLTLLSKGPG---ERRRVAVSQANLLLRGSRLRNTQWALGLAVYTGAQTKIVMNSRTAPSKLSAIEVTTNRLLYLILGLQVLLVTVTLAAYLVWTDGRQSQLHYLCMNYLDAPSSFLRMNCQPSDTDASDLGMWITFLLLYNNFVPISLYVTVEMVNYIQAFYIDQDLSMYDSSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGEIFGNMENTSNKVEERCSGKETGQSIRAEEERASEGSCKSGSKSPSSKATSHANLPTEVSSRFVGRGQPLKELA-----------DQAIQKTKDSTGNAAQ---YFAECLAVCHTV---------------------VVDAASPPPSHGDRQDTQAAEAGVPNKKGNGGRYQAESPDEEALVEAAAVELGWRFDGRSSTEALVEAPLGRRLTYQVLATLPFTSTRKRMSVIVRRPGEGKVVLLMKGADSVVFERASNFLGAAREVLDAHLSEFASDGLRTLVLARRELEEEEFKAWLVEYEKAATAVERRDELMAQVAEGVERELTVIGATAIEDKLQDGVPETIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHSEMVLIKVVDHKGEGSEKTPRTEAESTAALRQQLRKLVTHFECLIEDDTLVDGLTGSSRGGHGNVGVKP--LWRMWQKRRVERSRRRQRPSFYTKL--------FNSGAGVEEAGTGV-NDMLLEPLVE-DGEEDGGEEYHVGGWEENFVRHSVTSSNPLQDVQTDHLALIVDGPSLGRIFGDWEMERLLLRVARLCKSVVACRVSPAQKRMLIRLVKKGVKHPKPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAISQFRFIEPLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLFEDYVYTSYNFMLAMPPICFGLFDRDISVETIKGGAREDAGRGPDKSDDNISEEPSLNDENLHTTLGFATSQKSFPTTSVRTGNSFRWVYMSGRENLDLNMGQMALWLVQAILDSVLIFGFSFGAMNTPHQVLSASGGVDDLYTLGLITFTGMLLGMLGKAATNXXXXXXXXXXXXXGSALLFCLFLAVYGALPVTGGAFFGVPRQAANHPSFWLIGVLLVPTVCVIVDYTFIYLRLSFFPSPVDIAVEYDRGYFRTSRYKSQNGEHLTAIFSPPEEEEGEDVTGTDKVSFVNSDVSRSKSHLEKRWQRGWYPGKLLVSLPLLRRLNARVSQKEKDEMGLSQAEGGLIPSSYDYTSSSFDLGPGAGSVTCPTGKIGILIQSESHG 1568          
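
The summary lines that precede each alignment (bit score, raw score, E-value, identity and positives) can be pulled out programmatically. The following is a minimal sketch, assuming the two-line summary layout shown for the hit above; `parse_hsp` is a hypothetical helper, not part of any BLAST tool, and the percentages are recomputed from the reported fractions rather than read from the parentheses.

```python
import re

# "HSP 1 Score: 1755.3 bits (4545), Expect = 0.000e+0"
SCORE_RE = re.compile(
    r"Score:\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),"
    r"\s*Expect\s*=\s*(?P<evalue>\S+)"
)
# "Identity = 1008/1620 (62.22%), Positives = 1164/1620 (71.85%), ..."
# "Posi?tives" also accepts the misspelled variant "Postives".
IDENT_RE = re.compile(
    r"Identity\s*=\s*(?P<ident>\d+)/(?P<len>\d+).*?"
    r"Posi?tives\s*=\s*(?P<pos>\d+)/(?P<plen>\d+)"
)


def parse_hsp(score_line, identity_line):
    """Return the HSP summary statistics as a dictionary."""
    s = SCORE_RE.search(score_line)
    i = IDENT_RE.search(identity_line)
    if s is None or i is None:
        raise ValueError("unrecognised HSP summary lines")
    aligned = int(i["len"])
    return {
        "bit_score": float(s["bits"]),
        "raw_score": int(s["raw"]),
        "evalue": float(s["evalue"]),
        "aligned_length": aligned,
        # Percentages recomputed from the fractions as a consistency check.
        "identity_pct": round(100 * int(i["ident"]) / aligned, 2),
        "positive_pct": round(100 * int(i["pos"]) / int(i["plen"]), 2),
    }


hsp = parse_hsp(
    "HSP 1 Score: 1755.3 bits (4545), Expect = 0.000e+0",
    "Identity = 1008/1620 (62.22%), Positives = 1164/1620 (71.85%), Query Frame = 0",
)
print(hsp["identity_pct"], hsp["positive_pct"], hsp["evalue"])
```

Recomputing the percentages from the counts gives a quick way to confirm that the reported values in each hit are internally consistent.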
BLAST of NO03G01590 vs. NCBI_GenBank
Match: EWM22623.1 (phospholipid-transporting atpase [Nannochloropsis gaditana])

HSP 1 Score: 1052.4 bits (2720), Expect = 1.400e-303
Identity = 626/1378 (45.43%), Positives = 820/1378 (59.51%), Query Frame = 0
Query:   82 RRFHVLSPTYPMHVRTTKDGKKRRFRYPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFV-TAADLQGWEAAVECEAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDYLGGPSAFLRMNCQATAKEASELGMWVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAELSNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLPLSYQAESPDEEALVNAVARELNWSFLGRTPTSALVL-----NPQNQRLTYQVLAVLPFTSTRKRMSVIVRTPDNKIVLLTKGADNVIFERATAYLGTTRQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAVEGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRGEGVAAGMLSEPLVEHDVDEDDAMPYSHMSKNP-LKDVQSDHLALIIDGPALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFMLAMPPICFGLFDRDLSADTIMGNATPDARQSERMQHASSTTTIDHPATILAISRTDYNAAATTASTASFRWAYMSGRDNLDLNLGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITFTGMLVGMLYKAATNTXXXXXXXXXXXXGSALLYVIFLAIYGGLPIVAGGFYGVPGRMVRHPSFWLIGVSLVPTVSVVVDYIFIYLRLSFFPSPVDFAMEYDR 1453
            R FHV S     H     DG      Y +NSVST+KY + +F P++LFEQFRRLAN+YFLV+ +L+++GTY+ L+ +PLTP+TTLFPL+VVLAVTMGKE FED KRH AD+ TN     +L L          GE + +  + I VGR+V+VLDK+ +PADMILLTSSE  G+C +ETSNIDGETNLKI++AA+T  +G G  +    ADL GW+  VECE PNS IH+F+G L      R++++  R   V+++NLLLRGS +RNTKW LGLVVYTG DTKIV NSR APSKLST+E T N +LYLIL  Q++L + ++  ++VW      +L Y+C++     +AF   NC  TA E S LGMW TF  L+NNFVPISLYVT+EMVNY QA++ID D+ MYD  +D PA+ARTSNMNGDL S++YVFSDKTGTLTRN+MEFRRCSV G V+GN+E                                           + P D   + G+ L +LA+                  A    + + A  F   LAVCHTV                                                     +YQAESPDEEALV+A A +L + F GR P    +          + LT+Q+L  + FTSTRKRMSVIV+TPD K++LLTKGADN++  RA  +  T    +DAHL +F+ DGLRTL+LA +E+ E +F +W   +QKA+ +++ R+E +  + ++IER+LVVVG TAIEDKLQEGVPDTIA LL+ GIKVWVLTGDK+ETAINI YSCRLL ++MTL+K+   KD G+         A +R+ L+KL+   + LVE +  +G                                GR+  E+ +                 R G G+  ++    E  V+   DED  + +   ++ P  K++ SD +AL++DGPALA V GD E E + LR++++C+SVVACRVSPAQKR+++RLVK GVK P PITL+IGDGANDVAMIQEAQ+GVGISG+EG+QAVNSADFAIAQFR+L  LLL HGR+ YRR SKVILYSFYKN+VLTF+LF Y + TGFSGQSL  D VY+++NF+ AMP IC G FD D+                              P  +LA                 ++W YMSGRD++DLN+  M  W  QA+LDS+LIF FS  A  +  ++   +GDV  LY+ G   ++ ML+ +  K AT T            GS LLY+IF+  Y  +P  +  FY V   M+R    WL+ V L   VSV +D+  I ++L+F P+PVD A+E  R
Sbjct:   17 RVFHVAS-AEAGHTGHEHDG----HAYCNNSVSTSKYTVLSFFPKSLFEQFRRLANVYFLVIIILLMLGTYTPLFDAPLTPYTTLFPLLVVLAVTMGKEGFEDVKRHIADRETNTAPVEMLSL-------EKPGEFDSMQRQEIRVGRVVRVLDKQMVPADMILLTSSEAEGSCYVETSNIDGETNLKIRQAAKTAADGVGSMWQDDPADLHGWQGTVECELPNSHIHSFSGVL------RHEKEGNRETPVDESNLLLRGSSVRNTKWVLGLVVYTGRDTKIVQNSREAPSKLSTVEHTVNNMLYLILTAQVVLATASVVCFVVWNKVRRFKLDYICIEAASSENAFYAENC-GTAIEPSNLGMWFTFFTLFNNFVPISLYVTMEMVNYCQAFFIDNDIKMYDSEADMPAMARTSNMNGDLASIQYVFSDKTGTLTRNVMEFRRCSVAGTVFGNMEVSE----------------------------------------NAPPDKSVVEGQPLSDLAK-----------------QAISQGSGSAAYSFMLVLAVCHTVVMESLDDGG-------------------------------------------TAYQAESPDEEALVSAAA-DLGFRFTGRGPGEVRLKVGGDDKAGGEELTFQLLCTIAFTSTRKRMSVIVKTPDGKVLLLTKGADNIVGGRAKEFHSTDSDAVDAHLRLFSEDGLRTLMLAVRELPESEFDAWFQGYQKAAASIQNRTEAIGAVADEIERDLVVVGTTAIEDKLQEGVPDTIADLLDGGIKVWVLTGDKMETAINIGYSCRLLSSRMTLLKL---KDTGD--------PATIRRHLKKLLNALDWLVERERKLGDTL-----------------------------GRRIMERMTQ--------------CARRGPGDFRSSWQAGEEDVKILEDEDLKVCFRPQNEAPHFKELTSDTVALVVDGPALAHVLGDPEYEAMFLRLSSICRSVVACRVSPAQKRLVVRLVKAGVK-PMPITLSIGDGANDVAMIQEAQIGVGISGKEGQQAVNSADFAIAQFRYLRRLLLIHGRYDYRRMSKVILYSFYKNMVLTFILFYYLFFTGFSGQSLFNDLVYSAFNFLCAMPIICVGFFDIDI-----------------------------FPQHVLA-----------------WKWVYMSGRDHMDLNIRLMVQWFVQALLDSVLIFCFSLFAARSAHEIWGWDGDVAGLYLFGTTVYSVMLLAVSLKVATITYTWTRVSWFFFIGSLLLYLIFIFSYSAMP-ASTTFYNVAAHMMRMAPHWLL-VLLGSVVSVALDHFIISVKLAFSPTPVDVAVEKSR 1171          
BLAST of NO03G01590 vs. NCBI_GenBank
Match: CBN79051.1 (conserved unknown protein [Ectocarpus siliculosus])

HSP 1 Score: 946.0 bits (2444), Expect = 1.400e-271
Identity = 596/1475 (40.41%), Positives = 824/1475 (55.86%), Query Frame = 0
Query:  101 GKKRRFRYPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGE-NGCGRAFVTAADLQGWEAAVECEAPNSRIHTFTGTLQL----------APKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDYLGGPSAFLRMNCQATAKEASELGMWVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAELSNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLPLSYQAESPDEEALVNAVARELNWSFLGRTPTSALVLNPQNQRLTYQVLAVLPFTSTRKRMSVIVRTPDNKIVLLTKGADNVIFERATAYLGTTRQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAVEGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRG--EGVAAGMLSEPLVEHDVDEDDAMPYSHMSKNPLKDVQSDHLALIIDGPALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFMLAMPPICFGLFDRDLSADTIMGNATPDARQSERMQHASSTTTIDHPATILAISRTDYNAAATTASTASFRWAYMSGRDNLDLNLGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITFTGMLVGMLYKAATNTXXXXXXXXXXXXGSALLYVIFLAIYGGLPIVAGGFYGVPGRMVRHPSFWLIGVSLVPTVSVVVDYIFIYLRLSFFPSPVDFAMEYDRGYVIPEAAPAPGARREERARTVEEEEAARGEHAPFKGSEAAQAEEEAMERRRSGWYPGKLLVSLPLLRRLNSRISQKDKEEMGMMEAEGGL-MPSSYDYTSSSFDLGPGA 1562
            G + +    DNSV T+KYN+ TF+PR+LFEQFRR+AN+YFLV++VLM++G Y+DL++SPL PF+T+ PLI+VL+VTM K+  ED KRH +D   NN  A       T +  +  G    V WK++ VG IVK+ DKEEIPAD++LL+SSEP G   IET+NIDGETNLKI+ +A T      G  + +A +L G    +E EAPN+RIH FTGTL L                     R V V+Q+NLLLRG+RLRNTKWA+G+V YTG ++KI  N+R+ PSK S ++  TN+++++I     ++ +++L  YLV+   N+ +L+YLC D    P    R NC+ ++  +S +G W TFL+LYNNFVPISLYVT+EMVN+IQA +ID+D+ MYD + DTPA AR+SNM  DLG V YVFSDKTGTLT+N+M+F+RCSVGGV+YG L+                                                    + K L    +L++A +   + E+ S    ++  +A +   FA CLA+ HTV                                               PK      QAESPDEEALV+   + L  +F+ R+P    +      RL+Y ++  +PF STRKRMSV+VR PD   VL  KGADN+I +R+  Y+G+ ++ + +HL VF+ DGLRTL+LA+KE+ ++ F  W ++++KAS A   R+E++AE+ +++E +L VVGATAIEDKLQ+ VP TIA L +AG+K+WVLTGDK+ETAINI YSCRLL  +MTLIK+          +E+E    ++  QLR L+ HF +LVED  LV   W   +                     +S  G  RR +R       +AP+ +      GG    E   A  L  PL+E                 PL ++ +D LAL++DGP+LA V G+ E ER+LL + ++CKSV+ACRVSPAQKR+++RLVK+GV  PTP+TL+IGDGANDV MIQEAQ+GVGISG+EGRQAVN++DFAIAQFRFL+ L+L HG W YRR  KVILYSFYKN VLTF LF + + TGFSGQSL E  VY+ +NF  AMP +  G+FD+D      +GN T  A +  ++                                      Y  GR  +DLNL  M  W+ QAILDS+ +F     A      V +  G  D LY+ G   + G+++ M+ K    T            GS  L+  F+++Y  L   A  FY V  +++   +FWLI +  VP V+  +D +   L  +F P+    A E+DRG+         G +R +     E +  AR E            +E    +R    +  K+      LR LN  ++  +   MG+      +   SS+ +   + D GPGA
Sbjct:   25 GGQEQSAMADNSVVTSKYNVITFVPRSLFEQFRRIANVYFLVISVLMMLGWYTDLFESPLAPFSTIIPLILVLSVTMVKDGAEDLKRHRSDNRVNNTEA-------TAMDIHTRGGFVPVAWKDVKVGMIVKIADKEEIPADVVLLSSSEPGGVAYIETANIDGETNLKIRTSAPTRPGQPPGPLWSSAEELHGVRMELEYEAPNARIHFFTGTLTLHGGAGXXXXXXXXXXXXXXXSRDVPVDQSNLLLRGARLRNTKWAIGVVAYTGRESKIAQNARSVPSKQSNLDKVTNKIMFVIFTCMAVVTTLSLVGYLVFEAENDDKLYYLCYDSDNSPVPLFRDNCE-SSDSSSSVGQWFTFLILYNNFVPISLYVTLEMVNFIQAAFIDEDILMYDETQDTPAQARSSNMGADLGQVEYVFSDKTGTLTQNLMKFKRCSVGGVIYGELD---------------------------------------------------QKSKDLMTPQQLTHAVDAPPLSELASNIAGAEKGSAPLD--FALCLALNHTV------------------------------------------VLEEDPKTGQKQMQAESPDEEALVDG-GKTLGVNFVDRSPGKVELDVTGKGRLSYNLILTIPFDSTRKRMSVVVRAPDGSYVLYCKGADNIIMDRSRGYMGSDKETVASHLGVFSNDGLRTLLLAKKEMSQEFFDEWYEKYRKASIATGDRAEQIAEVAKEVEADLDVVGATAIEDKLQDEVPATIADLGKAGVKLWVLTGDKMETAINIGYSCRLLEPEMTLIKL----------KEKEGDPQSVVNQLRALMTHFNRLVEDDGLVKRFWGHVK---------------------QSPLGLLRRSRRXXXXXXLSAPSSMGDRNRTGGVATMEEDGAATLPTPLLEQP-----------QGAPPLSELTADSLALVLDGPSLAHVLGNPEAERMLLTLGSMCKSVIACRVSPAQKRLIVRLVKRGVV-PTPVTLSIGDGANDVGMIQEAQIGVGISGKEGRQAVNNSDFAIAQFRFLKRLMLVHGHWDYRRVCKVILYSFYKNFVLTFCLFYFCFYTGFSGQSLFESLVYSGFNFFTAMPILLIGIFDKD------VGNQT--ATECHKL--------------------------------------YAVGRAGMDLNLRTMTKWVCQAILDSLTVFFLPLAAYRDATTVWAERGYGDGLYVFGTTVYAGLIMAMMMKVFNMTNTWNYQSWFFWWGSIALFFSFISLYSLLVSYAYDFYYVAMQLMSRSAFWLI-IIQVPCVTWSLDTLIKMLEHNFRPTVGHHAREFDRGF-----TSETGLQRLD-----EIKARARAEELSPTPGGYTDVQEVLGPKRDDANHKWKM--GPETLRALNEGVNPGELASMGITAGSDAVPNRSSFAFDHVTADFGPGA 1293          
BLAST of NO03G01590 vs. NCBI_GenBank
Match: XP_009040504.1 (hypothetical protein AURANDRAFT_55026 [Aureococcus anophagefferens] >EGB04767.1 hypothetical protein AURANDRAFT_55026 [Aureococcus anophagefferens])

HSP 1 Score: 774.6 bits (1999), Expect = 5.800e-220
Identity = 553/1474 (37.52%), Positives = 740/1474 (50.20%), Query Frame = 0
Query:  108 YPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFVTAADLQGWEAAVECEAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDY-LGGPSAFLRMNCQATAKEASELGMWVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAELSNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLPLSYQAESPDEEALVNAVARELNWSFLGRT-----PTSALVLNPQNQRLTYQVLAVLPFTSTRKRMSVIVRTPDNKIVLLTKGADNVIFERATAYLGTT-----RQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAV-EGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRGE-GVAAGMLSEPLVEHDVDEDDAMPYSHMSKNPLKDVQSDHLALIIDG-PALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFMLAMPPICFGLFDRDLSADTIMGNATPDARQSERMQHASSTTTIDHPATILAISRTDYNAAATTASTASFRWAYMSGRDNLDLNLGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITFTGMLVGMLYKAATNTXXXXXXXXXXXXGSALLYVIFLAIYGGLPIVAGGF-----YGVPGRMVRHPSFWLIGVSLVPTVSVVVDYIFIYLRLSFFPSPVDFAMEYDRGYVIPEAAPAPGARREERARTVEEEEAARGEHAPFKGSEAAQAEEEAMERRRSGWYPGKLLVSLPLLRRLNSRISQKDKEEMGMMEAEGGLMPSSYDYTSSSFDLGPGAG 1563
            Y DNS++T KYN  TFLPR+LFEQFRR AN YFL++++LM+IGTY+DL+ SPLT ++T+ PL ++LA+TM KE  ED KRH +D++ NN  AR+L   ++P T    G +E V WK I  G+IV V D+EEIPAD++LL SSE    C +ETSNIDGETNLKIK  A    N     F      +G    +E EAP  ++H+F GTL+ A  +         + ++ +  LLRGS LRNTK A+G+V YTG DT++V NSR  PSKLS +E   N ++  ILG  + + ++++ AY +W ++N+  L Y+C  Y   G  A    NC + + + S   MW TF +LYNNF+PISLYVT+EM+NY QA Y+D DL MYD +SDTPALARTSNMN DLG + +VFSDKTGTLT+NIM+F+RC+VGG VYG                                                 VDP   R +AL++L        G  VE                   FA  +AVCHTV                                                      YQAESPDEEALV   A +L  +F  RT      T A     +   L+Y VLA +PF STRKRMS IVR P+ K+ ++TKGADN++F  A A  G       R+ LDA L  FA DGLRTLVLA+++V + ++++W + +  A TA+   R E+L      IE++L +VGATAIEDKLQ+GVP TIA L +A IK+WVLTGDK+ETAINI YS RLL   M L+K+                                                                                           P E   GA  G  G+ GVAA +             +A+              SDHLALII+G  AL  + GD ++E   LR+A+ C++VVACRVSPAQKR+L+ LV++   +P PITLAIGDGANDV MIQEA +GVGISG+EGRQAVN+ADFAIAQFRFL+ LL  HGR +YRR SKVI+YSF+KNIVLTFVLF +     +SG S  E +VY+ +NF L + P+  G FD D++      +AT D                                         +   Y +G   +DLN+  MA    +AI  S+ I+  +      P  +    G   D+++LG   F GM++ M+ +A                   L+   F+       +   GF     YGV       P FWL+   LVP V   +  + + + L FFPS  D   E D G+V  E           R R      A     + F  +++A++                  V+   LR +++ I ++  +++G+ E     + SSY Y  +S  +G GAG
Sbjct:   43 YSDNSITTHKYNALTFLPRSLFEQFRRTANQYFLLISLLMIIGTYTDLFYSPLTAWSTIGPLSLILAITMTKEGIEDLKRHKSDEHVNNSEARILS--NSPET--PPGTVETVAWKAIAPGQIVLVKDREEIPADLVLLWSSE-GAQCYVETSNIDGETNLKIKRPATDSAN--APLFPHPDKSKGVGMTLEFEAPCGKVHSFEGTLKHAGGE---------IALDASQFLLRGSTLRNTKLAIGVVAYTGKDTRLVRNSRDVPSKLSELERVVNNMVLFILGAMVCITTISVIAYCLWNESNKKDLWYMCYRYKQDGVPALFDENC-SNSDDYSNGSMWFTFFILYNNFIPISLYVTIEMINYCQAAYVDGDLEMYDEASDTPALARTSNMNADLGMIAHVFSDKTGTLTQNIMKFKRCAVGGGVYG----------------------------------------------GETVDP-PRRIEALKQL-----VITGDGVER-----------------DFAAIMAVCHTVVPEVREDG-------------------------------------------TTGYQAESPDEEALVEG-ACDLGLAFASRTVDVVDVTLASPSGTKGTSLSYTVLATIPFDSTRKRMSAIVRLPNGKVRVMTKGADNIVFGLADAAAGYARVPGGREALDADLEKFARDGLRTLVLAQRDVSDREYKAWAEAWHAAETALGSARKEKLVAAAALIEKDLAIVGATAIEDKLQDGVPSTIAELAKAEIKLWVLTGDKMETAINIGYSARLLTPDMYLVKL-------------------------------------------------------------------------------------------PVE---GADAGPLGDYGVAAQL-------------EALE------------ASDHLALIIEGATALEAILGDDDLENRFLRLASCCRAVVACRVSPAQKRILVGLVRRKT-NPAPITLAIGDGANDVGMIQEANIGVGISGKEGRQAVNNADFAIAQFRFLKPLLFHHGRKNYRRMSKVIIYSFFKNIVLTFVLFYFQADCAWSGTSFYESWVYSGFNFFLGLIPLAMGFFDHDVA------DATVD----------------------------------------KYPRLYAAGLHRMDLNVTNMAYGTLEAIAASLAIYYLTREVYWRPMSIWQDHGKAMDVWVLGTAVFVGMVMAMMARACLLVDSWNEVQLVFVVLQHLMLFTFIIFMAQAYVAWYGFLDYDYYGVAYHAYALPVFWLVSCVLVPAVVSALQILVLGVHLDFFPSINDIGKELDHGHVDGEHLHHHPQHAFIRLRAPASVHALVA--SLFGTTDSARS----------------TFVTRESLRDVHATIGKEQSKKLGIHED----VASSYAYDVASETMGAGAG 1198          
BLAST of NO03G01590 vs. NCBI_GenBank
Match: XP_009035813.1 (hypothetical protein AURANDRAFT_53224 [Aureococcus anophagefferens] >EGB09777.1 hypothetical protein AURANDRAFT_53224 [Aureococcus anophagefferens])

HSP 1 Score: 766.5 bits (1978), Expect = 1.600e-217
Identity = 525/1390 (37.77%), Positives = 702/1390 (50.50%), Query Frame = 0
Query:  101 GKKRRFRYPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFVTAADLQGWEAAVECEAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDY-LGGPSAFLRMNCQATAKEASELGMWVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAELSNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLPLSYQAESPDEEALVNAVARELNWSFLGRT-----PTSALVLNPQNQRLTYQVLAVLPFTSTRKRMSVIVRTPDNKIVLLTKGADNVIFERATAYLGTT-----RQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAV-EGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRGE-GVAAGMLSEPLVEHDVDEDDAMPYSHMSKNPLKDVQSDHLALIIDG-PALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFMLAMPPICFGLFDRDLSADTIMGNATPDARQSERMQHASSTTTIDHPATILAISRTDYNAAATTASTASFRWAYMSGRDNLDLNLGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITFTGMLVGMLYKAATNTXXXXXXXXXXXXGSALLYVIFLAIYGGLPIVAGGF-----YGVPGRMVRHPSFWLIGVSLVPTVSVVVDYIFIYLRLSFFPSPVDFAMEYDRGYVIPEAAPAPGARREERA 1472
            G+  +  Y DNS+ T KYN  T +P++LFEQFRR AN YFL++ +LM+IGTY+DL+ SPL P++T+ PL ++LA+TM KE  ED KRH +D++ NN  AR+L   ++P T    G +E V WK I  G+IV V D+EEIPAD++LL SSE    C +ETSNIDGETNLKIK  A    N     F    + +G    +E EAP +++H+F GTL+ A  +         + ++ +  LLRGS LRNTK A+G+V YTG DT++V NSR  PSKL+ +E   N ++  +LG  + + ++++ AY +W ++N+  L Y+C  Y   G  A    NC   +   S+  MW TF ++YNNF+P+SLYVT+EM+N  QA+Y+D+DL MYD +SDTPALARTSNMN DLG + +VFSDKTGTLT+NIM F+ C+VGG VYG                                                       R +AL++L        G  VE                   FA  +AVCHTV                                                      YQAESPDEEALV   A +L  +F  RT        A     +   L+Y VLA +PF STRKRMS IVR P+ K+ ++TKGADN++F  A A  G       R+ L+A L  FA DGLRTLVLA+++V + ++++W + +  A TA+   R E+L      IE++L +VGATAIEDKLQ+GVP TIA L +A IK+WVLTGDK+ETAINI YS RLL   M L+K+                                                                                           P E   GA  G  G+ GVAA +             +A+  +          Q DH ALII+G  AL  + GD ++E   LR+A+ C++VVACRVSPAQKR+L+ LV++   +P PITLAIGDGANDV MIQEA +GVGISG+EGRQAVN+ADFAIAQFRFL+ LL  HGR +YRR SKVI+YSF+KN+VLTFVLF +     +SG S  E +VY+ +NF L + P+  GLFD D++      +AT D                                         +   Y +G   +DLN+  MA    +A+  S+ I+         P  V    G   D+++LG   F GM++ M+ +A                   L+   F+       +   GF     YGV       P FWL+   LVPTV   +  + + + L FFPS  D   E D G V  E A     RR E A
Sbjct:   36 GELNKDTYCDNSIMTHKYNALTLIPKSLFEQFRRTANQYFLLIGLLMIIGTYTDLFYSPLLPWSTITPLSLILAITMTKEGIEDLKRHKSDEHVNNSEARILS--NSPET--PPGTVETVAWKAIAPGQIVLVKDREEIPADLVLLWSSE-GAQCYVETSNIDGETNLKIKRPATDSAN--APLFPHPDESKGVGMTLEFEAPCAKVHSFEGTLKHAGGE---------IALDASQFLLRGSTLRNTKLAVGVVAYTGKDTRLVRNSRDVPSKLAELERVVNNMVLFLLGAMVCITTISVVAYCLWNESNKKDLWYMCYSYKQDGVPALFDENC-GNSDGHSDGFMWFTFFIIYNNFIPLSLYVTIEMINLCQAFYVDRDLEMYDEASDTPALARTSNMNADLGMIAHVFSDKTGTLTQNIMTFKGCAVGGGVYGG----------------------------------------------------ETRIEALKQL-----VIAGDGVER-----------------DFAAIMAVCHTVVPEVREDG-------------------------------------------TTGYQAESPDEEALVEG-ACDLGLAFASRTVDVVDVALASTSGTEGASLSYTVLATIPFDSTRKRMSAIVRLPNGKVRVMTKGADNIVFGLADAAAGYAKVPGGREALNADLEKFACDGLRTLVLAQRDVSDREYEAWAEAWHAAETALGSARKEKLVAAAALIEKDLAIVGATAIEDKLQDGVPSTIAELAKAEIKLWVLTGDKMETAINIGYSARLLTPDMYLVKL-------------------------------------------------------------------------------------------PVE---GADAGPLGDYGVAAQL-------------EALEAAAS--------QGDHPALIIEGATALEAILGDDDLENRFLRLASRCRAVVACRVSPAQKRILVGLVRRKT-NPAPITLAIGDGANDVGMIQEANIGVGISGKEGRQAVNNADFAIAQFRFLKPLLFHHGRKNYRRMSKVIIYSFFKNMVLTFVLFCFQADCAWSGTSFYESWVYSGFNFFLGLIPLAIGLFDHDVA------DATVD----------------------------------------KYPRLYAAGLHRMDLNVTNMAYGTLEAVAASVAIYYLPREVYWRPMSVWQDHGKAMDVWVLGTAVFVGMVMAMMARACLLVDSWNKVQLGFVVLQHLMLFTFIIFMAQAYVAWYGFYDYDYYGVAYHAYALPVFWLVSCVLVPTVVSALQLLVLGVHLDFFPSINDIGKELDHGLVDGEHATVTARRRSEPA 1128          
BLAST of NO03G01590 vs. NCBI_GenBank
Match: XP_009175434.1 (hypothetical protein T265_10713 [Opisthorchis viverrini] >KER20810.1 hypothetical protein T265_10713 [Opisthorchis viverrini])

HSP 1 Score: 664.5 bits (1713), Expect = 8.400e-187
Identity = 486/1361 (35.71%), Positives = 661/1361 (48.57%), Query Frame = 0
Query:  111 NSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFVTAADLQGWEAAVECEAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDYLGGPSAFLRMNCQATAKEASELGM---WVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAELSNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLPLSYQAESPDEEALVNAVARELNWSFLGRTPTSALVLNPQNQRLTYQVLAVLPFTSTRKRMSVIVRTPDNKIVLLTKGADNVIFERATAYLGTTRQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAVEGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRGEGVAAGMLSEPLVEHDVDEDDAMPYSHMSKNPLKDV------QSDHLALIIDGPALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFML-AMPPICFGLFDRDLSADTIMGNATPDARQSERMQHASSTTTIDHPATILAISRTDYNAAATTASTASFRWAYMSGRDNLDLNLGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITFTGMLVGMLYKAATNTXXXXXXXXXXXXGSALLYVIFLAIYG----GLPIVAGGFYGVPGRMVRHPSFWLIGVSLVPTVSVVVDYIFIYLRLSFFPSPVDFAMEYDRGYVIP 1458
            N +STAKY+++TF+P+ L+EQFRR ANI+FL + +L  I        SP   FTTL PL+++L V+  KE  ED KRH AD  TN     VLR          EGE  E  WK++ VG +VKV + +EIPAD++LL SSEP   C IETSN+DGETNLK+++      +      +TA  L  +   VECE PN ++  F G L+             R  +    LL+RG+ L+NTKW  GL VYTG ++K+++NS + P K ST+E  TN  +  + G+ + L   T  A LVWT  NE  + YL                     +AS L +    +T L+LYN  +PISL V +E+V +IQA YI+ DL MYDP +DTPA+ARTSN+N +LG VRY+FSDKTGTLTRN+MEF+RCS+GGV+YGN                                                               E SNA   ++   ++ +  A+D     +A +F   LA+CHTV                                              +   LPL+YQA SPDE ALV A AR L + F  RTP S + +    + L Y+VL VL FTS RKRM V+VR P  +I++L KGAD VIFER           L+ HL +FA  GLRTL +A  EV  +    W  E+  ASTA++ R ERL ++ E IE+ L ++GATAIEDKLQEGVP+TIA+L++AGI VWVLTGDK ETAINI YSCRLL   + L+ V             E  D   R +LR+LV  F   +  +  V  +     SY                  CR                                         L  P+++      +++  +      L ++        + +ALI+DG  L       E  +  + VA  C+SV+ CRVSP QK  L+RLV+  VK    +TLAIGDGANDV MIQ A VGVGISG EGRQA  ++D+AIAQFRFL  LLL HG W+Y R +K+ILYSFYKN+ L  + F +  L+GFSGQ + E +    YN +  A PP+  GLFDR  S    +    P+  +                                T ++ASF             NL     W+  ++  S ++F     A S+    L S G    L +LG   +T ++V +  KA                GS   + +FL +Y      LP+ A    G+   +     FW+ G+ L+P+  +  D  +   + SF  S  +  M+ ++ +V P
Sbjct:   30 NEISTAKYSVWTFIPKFLYEQFRRYANIFFLAIALLQQIPGV-----SPTGRFTTLVPLLIILTVSAIKEMIEDLKRHYADDATNKSKTLVLR----------EGEWVETMWKDLMVGDLVKVCNNQEIPADLVLLASSEPQAMCYIETSNLDGETNLKLRQGLPQTAD-----LLTAGSLGAYRGWVECELPNRKLEEFVGVLRAF--------DGVRYPLKPNQLLIRGASLKNTKWVFGLAVYTGKESKVMLNSTSRPLKQSTVERQTNTYILFLFGVLLFLTLFTFFANLVWTRWNEPTMWYL----------------DGKVTDASALRIVLDLITCLILYNTVIPISLPVMLEVVRFIQALYINWDLDMYDPDTDTPAMARTSNLNEELGQVRYLFSDKTGTLTRNVMEFKRCSIGGVMYGN-------------------------------------------------------------DTEDSNAMNDRA---LLKRLKAND----PLAKHFFTVLALCHTVVPDAHL----------------------------------------EDPELPLTYQASSPDEAALVKA-ARALGFVFTTRTP-SGVSIRVDGKELHYEVLQVLEFTSFRKRMGVVVRDPRGRILVLVKGADTVIFERLAKNCQYQEATLE-HLEIFARTGLRTLCIASAEVSSEFHAGWSKEYYAASTAIDRREERLEQVAEAIEKNLHLLGATAIEDKLQEGVPETIANLIQAGISVWVLTGDKQETAINIGYSCRLLSPVLDLLTV-----------NTESLDET-RTKLRELVELFGPNLRSENDVALIVDGHVSYS-----------------CR-----------------------------------------LLSPVLDLLTVNTESLDETRTKLRELVELFGPNLRSENDVALIVDGHTLEFAL-SCECRKDFVEVALSCRSVICCRVSPWQKAELVRLVRTSVK--DAVTLAIGDGANDVGMIQAAHVGVGISGMEGRQAACASDYAIAQFRFLNKLLLVHGAWNYNRLTKLILYSFYKNVCLYLIQFWFAILSGFSGQIIFERWTIGLYNVLFSAAPPMALGLFDRSCSVRNCL--LYPELYRD-------------------------------TQASASF-------------NLKVFLCWILNSVFHSAILFWIPLAAFSS--NTLYSSGHSASLLVLGNSVYTYVVVTVCLKAGLEHTAWTWLSHLAIWGSVATWFLFLVVYSHFYPTLPL-ASDMVGMDSAVYGCWVFWM-GLILIPSFCLTRDVAWKMAKRSFAGSLREQVMQMEQMHVDP 1112          
BLAST of NO03G01590 vs. NCBI_GenBank
Match: XP_012749694.1 (hypothetical protein SAMD00019534_104020 [Acytostelium subglobosum LB1] >GAM27227.1 hypothetical protein SAMD00019534_104020 [Acytostelium subglobosum LB1])

HSP 1 Score: 650.2 bits (1676), Expect = 1.600e-182
Identity = 473/1367 (34.60%), Positives = 668/1367 (48.87%), Query Frame = 0
Query:  102 KKRRFRYPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFVTAADLQGWEAAVECEAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDYLGGPSAFLRMNCQATAKEASELGMWVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAELSNACEGKSVE-------EMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLPLSYQAESPDEEALVNAVARELNWSFLGRTPTSALVLNPQNQRLTYQVLAVLPFTSTRKRMSVIVRTPDNKIVLLTKGADNVIFERATAYLGTTRQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAVEGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRGEGVAAGMLSEPLVEHDVDEDDAMPYSHMSKNPLKDVQSDHLALIIDGPALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFMLA-MPPICFGLFDRDLSADTIMGNATPDARQSERMQHASSTTTIDHPATILAISRTDYNAAATTASTASFRWAYMSGRDNLDLNLGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITFTGMLVGMLYKAATNTXXXXXXXXXXXXGSALLYVIFLAIYGGLPI------VAGGFYGVPGRMVRHPSFWLIGVSLVPTVSVVVDYIFIYLRLSFFPSPVDFAMEYDRGY 1455
            + R ++Y  N+V+T+KY + TFLP+ LFEQF RLAN YFL+++ + +I        SP   FTTL PL+VVLA+T  KEA+EDF+RH  D   NN    VLR         G   ++E  WKN+ VG IVKVL+++ IPAD+++L SSEP   C +ET+N+DGETNLK+K+    G N   + + +  +L    + +ECE PN+R+++F G++ L  K      +Q         +L+RGS LRNTKW  G+V+Y+G DTK++ NS   PSK S +E  TN  + +I  +Q+LL      A  VW + +    + L  D             Q+  K  +    ++TFL+L+NN +PISLYV++E V   QA++I+ D  MY   +DTPALARTSN+N DLG + Y+FSDKTGTLT+N MEF++CS+GGV YG+   G  E ++                 G+    G         +  L  DP      +  EL   SN CE   +        ++ +   + D   + +   F   LAVCHTV                                             P+ +   ++YQA SPDE ALV A A+E+ ++F  R+  S  +++P  Q L +Q+L +L F S RKRMSVIVR PD +++L TKGAD  IFER  A   T       HL  +A++GLRTL +A +E++   ++SW  ++  AS  + GR   L  + E IER+L+++GATAIED+LQ GVP++IA L EAGIK+WVLTGDK ETAINI Y+CRLL   M L+ V +   +   R            +L++LV  +                                                                     R G                                +P KD     LALIIDG  L  V  + EM  ++L+++  CKSV+ACRVSP+QK  ++ LV+  +     +TLAIGDGANDV MIQ A VG+GISG EG QA   +D+AIAQFRFL  LLL HGR+SYRR SK+I Y FYKNI L    F +T   G+SGQ+  E Y  T YN +    P I FG+ D+D+S  +IM                      DHP                          Y SG  +   N+     W+   +  S +++    G     R V  + G   DL  +G+IT+  +++ +  K A  T            GS +L+ I+L  +G          V    Y +   + + P F+L  V +VP V +  DY + ++     P       E D  +
Sbjct:  108 RNREYKYCGNTVTTSKYTLITFLPKNLFEQFCRLANFYFLIISAIQIIPGI-----SPTGRFTTLGPLLVVLAITAIKEAYEDFRRHRQDDRVNNCHTEVLR---------GSTFVDE-RWKNLKVGDIVKVLNRQYIPADLVVLASSEPQSTCYVETANLDGETNLKLKQ----GLNETAQ-YNSLDNLATINSNIECEHPNNRLYSFIGSMYLDGKGHPLSARQ---------VLMRGSLLRNTKWIYGVVIYSGRDTKLMRNSSDTPSKRSGVEKKTNVFILIIFILQMLLCLGAAIANGVWNNRHVDDWYLLWSD-------------QSPVKNGAM--SFLTFLILFNNIIPISLYVSMEFVKVFQAFFINNDQQMYHADNDTPALARTSNLNEDLGQIDYIFSDKTGTLTQNKMEFKKCSIGGVSYGS---GMTEATM-----------------GAMMREGAMISDQPAQQQQLNNDPTT---NSSNELLGASN-CESPPLSASSFRDAKLNANLNSEDLNMSKLIKEFFSVLAVCHTV--------------------------------------------VPEEENGVITYQASSPDESALVTA-AKEVGFNFCRRSLKSVTIIDPNGQELEFQILNILEFNSVRKRMSVIVRHPDGRLLLYTKGADTAIFER-LAPNQTFADSTINHLQEYASEGLRTLCVAYREIEPAVYESWSSDYYTASNTIIGREAALDRMAEAIERKLILLGATAIEDRLQVGVPESIASLREAGIKLWVLTGDKQETAINIGYACRLLTNNMELLVVNESSIENTDR------------ELKRLVEEY---------------------------------------------------------------------RNG--------------------------------HPTKD-----LALIIDGSTLVYVLENKEMALMMLKISERCKSVIACRVSPSQKADIVGLVRDNL---DAVTLAIGDGANDVNMIQRAHVGIGISGEEGLQAARCSDYAIAQFRFLTRLLLVHGRYSYRRISKLIAYCFYKNITLYITQFWFTIFNGWSGQTYYERYTLTLYNILWTFFPIIVFGILDKDVSEQSIM----------------------DHP------------------------HLYSSGPRHHHFNIKVFWGWICNGVFHSFVLYALPMGIYH--RAVPFASGFTIDLISVGIITYACVVITVNCKLALETRFWTWINHLATWGSIVLFFIWLMAFGKFEDINQSLGVGVDIYDIIFNVGKAPLFYLTLV-IVPVVCLYRDYTWKFVNRYALPQAYHIVQELDSSH 1190          
BLAST of NO03G01590 vs. NCBI_GenBank
Match: XP_022807594.1 (phospholipid-transporting ATPase IA-like [Stylophora pistillata])

HSP 1 Score: 634.4 bits (1635), Expect = 9.300e-178
Identity = 466/1354 (34.42%), Positives = 661/1354 (48.82%), Query Frame = 0
Query:  102 KKRRFRYPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFVTAADLQGWEAAVECEAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDYLGGPSAFLRMNCQATAKEASELGM-WVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAELSNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLP-----LSYQAESPDEEALVNAVARELNWSFLGRTPTSALVLNPQNQRLTYQVLAVLPFTSTRKRMSVIVRTPDNKIVLLTKGADNVIFER---ATAYLGTTRQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAVEGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRGEGVAAGMLSEPLVEHDVDEDDAMPYSHMSKNPLKDVQSDHLALIIDGPALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFML-AMPPICFGLFDRDLSADTIMGNATPDARQSERMQHASSTTTIDHPATILAISRTDYNAAATTASTASFRWAYMSGRDNLDLNLGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITFTGMLVGMLYKAATNTXXXXXXXXXXXXGSALLYVIFLAIYGGLPIVAGGFYGVP-----GRMVRHPSFWLIGVSLVPTVSVVVDYIFIYLRLSFF 1441
            K +  ++ +N +STAKYN  TFLP+ L EQF R +N++FL + +L  I        SP   +TT  PL+ VL+ +  KE  ED+KRH AD   NNR  +VLR          +  ++ + W  + VG IVKV++ +  PAD+IL++SSEP G C IETSN+DGETNLKI++A             +  +++  +  VECE PN+R++ F G + L        Q Q+ V V    +LLRG+ LRNT+W  GLVVYTG+D+K++ NS AAP K S ++ TTN  +  + G+ ++L   +   + +WTD +     YL   Y G              K+A  LGM ++TF++L+NN +PISL VT+E+V +IQA +I+ D+ MY   +DTPA+ARTSN+N +LG V+Y+FSDKTGTLTRN+MEF++CS+GG+ Y                                  SGGR             D F        + A L N  E                  A+V   F   LAVCHTV                                               P+R P     + YQA SPDE ALV   A++L +SF  RTPTS +++N   Q   Y+VL VL F STRKRMSVIVRTP+ KI L  KGAD VIFER      Y+ +T +    HL  FA +GLRTL +A  E++ ++FQ W D + KAST++E R + + +  E IE+ L ++GATAIEDKLQEGVP++IA L +A IK+WVLTGDK ETAINI Y+CRLL  +M L+                                   +  ++TL G               + WL                                E  R  GR G  +                            K+P      D L LII G  L     D E++   L +A  CK+V+ CRVSP QK  ++RLVK+ VK    ITLAIGDGANDV MIQ A VGVGISG EG QA +++D+AIAQFR+L  LL  HG WSY+R +K+ILYSFYKN+ L  +   +    GFSGQ L + +    YN +  ++PP+  GLFDR +++++++                                               +   Y   ++    N     +W+  ++  S+L+  F         +   S+G +   + LG + +T +++ +  KA                GS   + IFL IY  +P +A   Y  P      RM+     + I + ++P +++++D+++   R +F+
Sbjct:   54 KPQTQQFCNNKISTAKYNFLTFLPKFLLEQFSRYSNVFFLFIALLQQIDDV-----SPTGRYTTAVPLLFVLSCSAVKEIIEDYKRHQADDQVNNRRVKVLR----------DNTMQSLLWTEVQVGDIVKVVNGQFFPADLILVSSSEPMGMCYIETSNLDGETNLKIRQALPLTAK-----MTSLLEIRCMQGRVECEGPNNRLYDFVGNITL--------QTQKSVPVGPEQILLRGANLRNTQWIFGLVVYTGHDSKLMQNSTAAPIKRSNVDHTTNIQILFLFGLLLVLALCSTIGFKIWTDNHRDTDWYL--GYSG--------------KKAQNLGMSFLTFIILFNNLIPISLTVTLEVVKFIQAIFINLDIDMYYDKTDTPAMARTSNLNEELGQVKYIFSDKTGTLTRNVMEFKKCSIGGISY----------------------------------SGGR-------------DTF-------MDPALLDNLRE--------------HHPTASVIREFLTLLAVCHTVV----------------------------------------------PEREPGNPDKIVYQAASPDEGALVKG-AKKLGFSFNVRTPTS-VIINAMGQEEVYEVLNVLEFNSTRKRMSVIVRTPEGKIKLYCKGADTVIFERMQEKQMYMDSTVE----HLEDFAKEGLRTLCIAMAELEPEEFQRWSDIYYKASTSLENREKNVDDAAELIEKNLFLLGATAIEDKLQEGVPESIAALADADIKIWVLTGDKQETAINIGYACRLLTPEMKLL-----------------------------------ICGEETLDG--------------TREWL-------------------------------NEHIRLVGRSGSSK----------------------------KSP--STIRDDLGLIITGKTLLHGLSD-ELKLSFLELALGCKAVICCRVSPLQKAQVVRLVKQHVK--DAITLAIGDGANDVGMIQAAHVGVGISGVEGLQAASASDYAIAQFRYLNKLLFVHGAWSYQRLAKLILYSFYKNVCLYVIELWFALDNGFSGQILFDKWCIGIYNVVFTSVPPLAIGLFDRTVTSESML----------------------------------------------KYPKLYKESQNAEIYNTKVFWMWIAASVYHSLLL--FYLPCFMLRHEAPFSDGVIVGEWFLGNVVYTLVVITVCIKAGMELDTWNWLCHVAIWGSIASWFIFLLIY-CIPDIA--LYIAPHMIGQDRMLYSCIVFWISLFIIPMITLLLDFLYKIFRRTFY 1079          
BLAST of NO03G01590 vs. NCBI_GenBank
Match: XP_023774990.1 (phospholipid-transporting ATPase IB isoform X1 [Cyanistes caeruleus])

HSP 1 Score: 632.9 bits (1631), Expect = 2.700e-177
Identity = 460/1339 (34.35%), Positives = 655/1339 (48.92%), Query Frame = 0
Query:   96 RTTKDGKKRRFRYPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFVTAADLQGWEAAVECEAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDYLGGPSAFLRMNCQATAKEASELGMWVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAELSNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLPLSYQAESPDEEALVNAVARELNWSFLGRTPTSALVLNPQNQRLTYQVLAVLPFTSTRKRMSVIVRTPDNKIVLLTKGADNVIFERATAYLGTTRQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAVEGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDC-KRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRGEGVAAGMLSEPLVEHDVDEDDAMPYSHMSKNPLKDVQSDHLALIIDGPALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFML-AMPPICFGLFDRDLSADTIMGNATPDARQSERMQHASSTTTIDHPATILAISRTDYNAAATTASTASFRWAYMSGRDNLDLNLGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITFTGMLVGMLYKAATNTXXXXXXXXXXXXGSALLYVIFLAIYGGL-PI--VAGGFYGVPGRMVRHPSFWLIGVSLVPTVSVVVD 1430
            RT    + ++ ++ DN VSTAKY++ TFLPR L+EQ R+ AN +FL + +L  I        SP   +TTL PL+ +L V   KE  ED+KRH AD   N +   VLR           G  + + WK + VG IVKV + + +PADMI++++SEP   C IET+N+DGETNLKI++           +  +  +L      +ECE PN  ++ FTG L+L          Q  V V    +LLRG++LRNT+W LG+VVYTG+DTK++ NS  AP K S +E  TN  + ++  + +++  V+    L+W   +   + YL      G +  L +N              +TF++LYNN +PISL VT+E+V + QA +I+ D+ MY P +DTPA+ARTSN+N +LG V+Y+FSDKTGTLT NIM F++CS+ GV YG+      E S                                        + F+    +  E  E  +    +++E        +D   A     F   LAVCHTV                                             P+ +   + YQA SPDE ALV   A++L + F GRTP S ++++   +  T+++L VL F+S RKRMSVIVRTP  ++ L  KGADNVIFER +       Q L  HL  FA +GLRTL +A  ++ E  ++ WL+ + ++ST ++ R+++L E  E IE++L+++GATAIED+LQ GVP+TIA L++A IK+W+LTGDK ETA+NI YSCRL+   M+LI V       E   +  + D                       +G     S S +G   C + WL                   KR+T  S T     L    G+                 E+D+                        ALIIDG  L      +E+ +  L +A  CK+V+ CRVSP QK  ++ +VK   KH   ITLAIGDGANDV MIQ A VGVGISG EG QA N +D+AIAQF +LE LLL HG WSY R +K ILY FYKN+VL  +   + ++ GFSGQ L E +    YN +  A+PP   G+F+R  + D+++           R       T                   A   +T  F W +                    A++ SI++F F    +      + + G   D   +G I +T ++V +  KA   T            GS LL+++F  +Y  + P   +A    G  G ++R   FW  G+ LVPT  +V D
Sbjct:   16 RTIYLNQPQQSKFRDNWVSTAKYSVVTFLPRFLYEQIRKAANAFFLFIALLQQIPDV-----SPTGRYTTLVPLLFILTVAGIKEIIEDYKRHKADSAVNKKKTVVLR----------SGMWQNIMWKEVAVGDIVKVTNGQHLPADMIIISTSEPQAMCYIETANLDGETNLKIRQGLSQ-----TASLQSREELMKVSGRIECEGPNRHLYDFTGNLRL--------DGQSPVPVGPDQILLRGAQLRNTQWVLGIVVYTGHDTKLMQNSTKAPLKRSNVEKVTNMQILVLFCILLVMALVSSVGALLWNRTHGEVVWYL------GSNKMLSVNFGYNL---------LTFIILYNNLIPISLLVTLEVVKFTQALFINWDIDMYYPETDTPAMARTSNLNEELGQVKYLFSDKTGTLTCNIMNFKKCSIAGVTYGHFPELERERS---------------------------------------SEDFSQLPPSTSESCEFDDPRLLQNIE--------NDHPTAVHIQEFLTLLAVCHTV--------------------------------------------VPERQGNTIIYQASSPDEGALVKG-AKKLGYVFTGRTPHS-VIIDALGKEKTFEILNVLEFSSNRKRMSVIVRTPAGQLRLYCKGADNVIFERLSKDSQYMEQTL-CHLEYFATEGLRTLCIAYADLSEKSYREWLNVYNESSTVLKDRTQKLEECYEIIEKDLLLLGATAIEDRLQAGVPETIATLIKAEIKIWILTGDKQETALNIGYSCRLISQSMSLILV------NEDSLDNWQLD----------------------WMGDAKPVSWSTRGVTGCLQMWL-------------------KRATRASLTQHCTSLGESLGK-----------------ENDI------------------------ALIIDGHTLKYAL-SFEVRQSFLDLALSCKAVICCRVSPLQKSEIVDMVK---KHVNAITLAIGDGANDVGMIQTAHVGVGISGNEGMQATNCSDYAIAQFSYLEKLLLVHGAWSYNRVTKCILYCFYKNVVLYIIELWFAFVNGFSGQILFERWCIGLYNVIFTALPPFTLGIFERSCTQDSML-----------RFPQLYKIT-----------------QNADGFNTRVF-WGH-----------------CINALIHSIILFWFPLKVLE--HDAVFTNGQGVDYLFVGNIVYTYVVVTVCLKAGLETTAWTRFSHLAVWGSMLLWLVFFGVYSAIWPTFPIAPDMLGQAGMVLRCGYFW-FGLFLVPTACLVKD 1076          
BLAST of NO03G01590 vs. NCBI_GenBank
Match: XP_020607975.1 (phospholipid-transporting ATPase IB-like isoform X1 [Orbicella faveolata])

HSP 1 Score: 630.2 bits (1624), Expect = 1.800e-176
Identity = 435/1147 (37.93%), Positives = 590/1147 (51.44%), Query Frame = 0
Query:  102 KKRRFRYPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIGTYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAARVLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSEPSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFVTAADLQGWEAAVECEAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWALGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTLGAYLVWTDANESQLHYLCMDYLGGPSAFLRMNCQATAKEASELGM-WVTFLLLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNGDLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVXXXXXXXXXXXXXXXXXGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAELSNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPKPKRLP-----LSYQAESPDEEALVNAVARELNWSFLGRTPTSALVLNPQNQRLTYQVLAVLPFTSTRKRMSVIVRTPDNKIVLLTKGADNVIFER---ATAYLGTTRQELDAHLSVFAADGLRTLVLARKEVDEDDFQSWLDEFQKASTAVEGRSERLAEIGEKIERELVVVGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLHAKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVGGLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAPAELHRGAGRGGRGEGVAAGMLSEPLVEHDVDEDDAMPYSHMSKNPLKDVQSDHLALIIDGPALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLIRLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADFAIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGFSGQSLLEDYVYTSYNFML-AMPPICFGLFDRDLSADTIM 1239
            K +  ++ +N +STAKYN  TFLP+ L EQF R +N++FL + +L  I        SP   +TT  PL+ VL+ +  KE  ED+KRH AD   NNR  +VLR          +  ++ + W  + VG IVKV++ +  PAD+ILL+SSEP G C IETSN+DGETNLKI++A             +  +++  +  VECE PN+R++ F G + L        Q Q+ + V    +LLRG+ LRNT+W  GLVVYTG+D+K++ NS AAP K S ++ TTN  +  + G+ ++L   +   + +WTD ++    YL   Y G              K A  LGM ++TF++LYNN +PISL VT+E+V +IQA +I+ D+ MY   +DTPA+ARTSN+N +LG VRY+FSDKTGTLTRN+MEF++ S+GG+ Y                                          QG  F   +DP           A L N  E                  A+V   F   LAVCHTV                                               P+R P     + YQA SPDE ALV   A++L +SF  RTPTS +++N   Q   Y+VL VL F STRKRMSV+VRTP+ KI L  KGAD VIFER      Y+ +T +    HL  FA +GLRTL +A  E++ ++FQ W D + KAST++E R + + +  E IE+ L ++GATAIEDKLQEGVP++IA L +A IK+WVLTGDK ETAINI Y+CRLL  +M L                                              L  S  S  G R+   WL                                E  R  GR G                              SK     ++ D L LII G  L+    D E++   L +A  CK+V+ CRVSP QK  +++LVK+ VK    ITLAIGDGANDV MIQ A VGVGISG EG QA +++D+AIAQFR+L  LL  HG WSY+R +K+ILYSFYKN+ L  +   +    GFSGQ L + +    YN +  ++PP+  GLFDR +++++++
Sbjct:   57 KPQTQQFCNNKISTAKYNFLTFLPKFLLEQFSRYSNVFFLFIALLQQIDDV-----SPTGRYTTAVPLLFVLSCSAVKEIIEDYKRHQADDQVNNRRVKVLR----------DNTIQSLLWTEVQVGDIVKVVNGQFFPADLILLSSSEPMGMCYIETSNLDGETNLKIRQALPLTAK-----MTSLIEVRCMQGRVECEGPNNRLYDFVGNITL--------QTQKSLPVGPEQVLLRGAHLRNTQWIFGLVVYTGHDSKLMQNSTAAPIKRSNVDHTTNIQILFLFGLLLVLALCSTIGFKIWTDNHQETDWYL--GYSG--------------KRAQNLGMSFLTFIILYNNLIPISLTVTLEVVKFIQAIFINLDIDMYYDETDTPAMARTSNLNEELGQVRYIFSDKTGTLTRNVMEFKKVSIGGISY----------------------------------------SGQGDTF---MDP-----------ALLDNLRE--------------HHPTASVIREFLTLLAVCHTVV----------------------------------------------PERDPSNPDKIVYQAASPDEGALVKG-AKKLGFSFNVRTPTS-VIINAMGQEEVYEVLNVLEFNSTRKRMSVVVRTPEGKIKLYCKGADTVIFERLQDKQMYMDSTVE----HLEDFAKEGLRTLCIAMTELEPEEFQRWSDIYYKASTSLENREKNVDDAAELIEKNLFLLGATAIEDKLQEGVPESIAALADADIKIWVLTGDKQETAINIGYACRLLTPEMKL----------------------------------------------LICSEESLDGTRE---WL-------------------------------NEHLRIIGRAG-----------------------------SSKKKPPSIRDD-LGLIITGKTLSHGLTD-ELKLSFLDMALSCKAVICCRVSPLQKAQVVKLVKQHVK--DAITLAIGDGANDVGMIQAAHVGVGISGVEGLQAASASDYAIAQFRYLNKLLFVHGAWSYQRLAKLILYSFYKNVCLYVIELWFALDNGFSGQILFDKWCIGIYNVVFTSVPPLAIGLFDRTVTSESML 926          
The following BLAST results are available for this feature:
BLAST of NO03G01590 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name        E-value       Identity    Description
EWM29580.1        0.000e+0      62.22       putative phospholipid-transporting atpase ia isofo...
EWM22623.1        1.400e-303    45.43       phospholipid-transporting atpase [Nannochloropsis ...
CBN79051.1        1.400e-271    40.41       conserved unknown protein [Ectocarpus siliculosus]
XP_009040504.1    5.800e-220    37.52       hypothetical protein AURANDRAFT_55026 [Aureococcus...
XP_009035813.1    1.600e-217    37.77       hypothetical protein AURANDRAFT_53224 [Aureococcus...
XP_009175434.1    8.400e-187    35.71       hypothetical protein T265_10713 [Opisthorchis vive...
XP_012749694.1    1.600e-182    34.60       hypothetical protein SAMD00019534_104020 [Acytoste...
XP_022807594.1    9.300e-178    34.42       phospholipid-transporting ATPase IA-like [Stylopho...
XP_023774990.1    2.700e-177    34.35       phospholipid-transporting ATPase IB isoform X1 [Cy...
XP_020607975.1    1.800e-176    37.93       phospholipid-transporting ATPase IB-like isoform X...
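
The percentages in the per-HSP statistics lines above are the identity and positive counts divided by the alignment length. The following is a minimal sketch, not part of the database's own tooling, of how such a line could be parsed and its percentages rechecked. The function name `parse_hsp_stats` and the returned dictionary keys are hypothetical, chosen only for illustration.

```python
import re

# Pattern for a per-HSP statistics line as printed in this record.
# "Posi?tives" accepts both "Positives" and the "Postives" spelling.
HSP_STATS = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_hsp_stats(line):
    """Parse one HSP statistics line and recompute its percentages.

    Hypothetical helper: the key names below are assumptions,
    not fields defined by the database.
    """
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identities": ident,
        "positives": pos,
        "alignment_length": ident_len,
        "identity_pct_reported": float(ident_pct),
        "identity_pct_computed": round(100 * ident / ident_len, 2),
        "positive_pct_reported": float(pos_pct),
        "positive_pct_computed": round(100 * pos / pos_len, 2),
    }


# Example using the statistics reported for XP_022807594.1.
stats = parse_hsp_stats(
    "Identity = 466/1354 (34.42%), Positives = 661/1354 (48.82%), Query Frame = 0"
)
print(stats["identity_pct_computed"], stats["positive_pct_computed"])
```

Applied to the three HSP lines shown for XP_022807594.1, XP_023774990.1, and XP_020607975.1, the recomputed values should agree with the reported percentages and with the Identity column of the summary table.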

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name    Unique Name    Species                                         Type
nonsL070        nonsL070       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region
ncniR000        ncniR000       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region
ngnoR053        ngnoR053       Nannochloropsis oceanica (N. oceanica IMET1)    syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name    Unique Name    Species                                       Type
NSK005743       NSK005743      Nannochloropsis salina (N. salina CCMP1776)   gene


The following polypeptide feature(s) derive from this gene:

Feature Name    Unique Name             Species                                         Type
NO03G01590.1    NO03G01590.1-protein    Nannochloropsis oceanica (N. oceanica IMET1)    polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name                 Unique Name    Species                                            Type
jgi.p|Nanoce1779_2|626698    gene_1340      Nannochloropsis oceanica (N. oceanica CCMP1779)    gene
Naga_100006g104              gene1025       Nannochloropsis gaditana (N. gaditana B-31)        gene


The following mRNA feature(s) are a part of this gene:

Feature Name    Unique Name     Species                                         Type
NO03G01590.1    NO03G01590.1    Nannochloropsis oceanica (N. oceanica IMET1)    mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO03G01590 ID=NO03G01590|Name=NO03G01590|organism=Nannochloropsis oceanica|type=gene|length=22251bp
AACGGCAGGTGGCCTGTCGGCAGAGCCAAACGATGTGGTAGCTCCACAAA
CACTTTGGAGCCAGCGTCCATCCTGCACCCAGGACGACGTCAAAGCAATA
TGAGCCAACATTACGGCGCCGCCGCCTCAGGGGAGCCCTTGATCCCACCA
TCACCGCGACCGCCACGTAACGACAACGAGCTTGCCACGGTGCGGGAAGC
TCAGCGGCAACAATGTGTGCGATCCCCCATCCTCGACATGGTGCTTCGCT
TCATCACACCGCGGAGCCCGGACGACCATCATCCCTCCCCTTCCCCTTTC
TCGTCCTCCTCCACAAGCAGCAATAACAATGGCACCTGTATTCGACGGTT
CCATGTCCTAAGTCCAACCTATCCTATGCACGTCCGCACGACCAAAGACG
GAAAGAAGAGGCGGTTCCGATACCCAGACAACAGCGTTAGCACGGCCAAG
TACAACATCTTCACTTTTCTCCCTCGCGCACTCTTTGAGCAATTCCGACG
CCTGGCCAACATTTACTTCTTGGTTGTGACAGTCCTAATGCTCATTGGGA
CCTACTCTGATTTGTATCAATCACCCCTGACACCATTCACGACGCTCTTT
CCTTTGATTGTAGTGCTGGCTGTGACCATGGGCAAGGAAGCATTCGAAGA
CTTTAAACGGCACACGGCCGACCAAAACACCAACAACCGCGCCGCCCGGG
TCTTGCGCCTAGTTTCTACCCCTGTCACCAACAATGGAGAGGGAGAGTTG
GAGGAGGTGTTCTGGAAAAACATAGGGGTTGGCCGAATTGTCAAGGTGCT
GGACAAGGAGGAGATACCGGCGGACATGATCCTTCTCACTTCTTCCGAGC
CAAGTGGGAACTGCTGTATTGAAACAAGCAACATCGATGGAGAAACTAAT
TTGAAAATCAAGGAGGCTGCGCGTACAGGGGAGAATGGGTGCGGGCGAGC
ATTTGTCACGGCAGCAGACCTACAAGGGTGGGAGGCAGCAGTCGAGTGCG
AAGCCCCAAACTCCCGCATTCACACTTTTACTGGCACCTTGCAACTGGCT
CCAAAAGATCGGAACCAAGAGCAACAGCAGCGAAGGGTGGGGGTGAACCA
AGCCAATCTATTGCTGCGAGGATCTCGACTGCGGAACACTAAGTGGGCGC
TGGGACTAGTGGTCTACACCGGGTATGATACAAAAATAGTTATGAATTCC
CGCGCAGCCCCCAGCAAGCTTTCCACCATCGAAGTGACGACCAACCGGCT
GTTATACCTCATTTTGGGGATGCAGATTTTGTTGGTTTCCGTGACGTTGG
GTGCTTACTTGGTATGGACGGACGCGAATGAGAGTCAGCTGCACTACCTG
TGTATGGATTATTTGGGCGGACCTAGTGCCTTCCTGAGGATGAATTGCCA
AGCCACTGCCAAGGAGGCGTCAGAGCTGGGCATGTGGGTGACTTTTTTGC
TGCTTTACAATAATTTTGgtgcgtttttaagagggaaggagggagggagg
gtgtggctagcacatttatgcccctgggaccgtgacttttatcccacttt
ctccattcgatcgcatccctccttccatacaccctccctccatccctctc
tcttatccatttatactccatttccctcaacgtggccgtcggaaaggtga
cttcattgatcaagactagactcacattattactccctccctacattcct
ccctccctccctccctctttccctccctcctccccccctagTCCCCATTT
CCCTCTACGTGACCGTCGAAATGGTCAATTATATTCAAGCCTACTACATC
GATCAAGATCTGTCTATGTACGACCCATCATCAGACACTCCTGCCTTGGC
TAGGACGAGTAATATGAATGGGGACTTGGGCTCAGTCCGGTATGTGTTCA
GTGACAAAACAGGCACACTCACAAGGAATATCATGGAGTTCAGAAGGTGT
TCGGTTGGGGGTGTGGTCTATGGGAATTTGGAAGGGGGGAGGGAGGAAGA
GAGCGTAGAAAGGGGGGAGGAAAGGCAGCAAATCCAAGGGGAAGAGGAGC
GTACGAGTGGGAGCAGTGCGGGGAGTGGAGGCAGGAGAGAGGGAGCTCAA
GGACATCGCTTTTCTTTGCCAGTTGATCCATTCGCTTTGAGAGGGAAGGC
GTTGCAGGAACTTGCAGAATTGTCAAATGCTTGTGAGGGTAAGAGTGTGG
AAGAGATGATTTCCAAGACTCCTGCTTCTGATTCTGCTGCTGCTACTGTT
GCAGTTTATTTCGCGGAATGTCTGGCGGTTTGTCATACGGTAGTAGTGGA
GAAGCCGGCTCCCTCATCTCCACCACCACCACCACCACCGCCGTCATCAT
CCTTCTCGTCGAGCAGCAGCCCCTCCCGCGGAGGAGGCGACAGAAACAAC
ACCACAGCAACAACTACAACCAAAACACCTAAACCGAAACGCCTCCCCCT
CTCCTACCAAGCTGAATCTCCCGACGAAGAAGCCCTCGTCAACGCCGTCG
CCCGCGAACTCAATTGGTCCTTCCTCGGGCGTACCCCCACTTCCGCCCTG
GTCCTCAACCCCCAAAACCAACGACTCACCTACCAAGTTCTGGCCGTCCT
TCCTTTCACTTCGACGCGAAAACGTATGTCCGTCATCGTGCGCACGCCCG
ATAACAAAATCGTTTTGTTGACCAAGGGCGCGGATAACGTAATTTTTGAA
CGGGCCACCGCGTACTTGGGCACAACAAGACAAGAACTCGATGCCCACCT
TTCTGTCTTTGCCGCTGATGGACTACGAACACTCGTCCTTGCACGTAAAG
AGGTAGATGAAGATGACTTTCAATCATGGCTTGATGAGTTTCAGAAGGCT
TCTACAGCAGTAGAGGGGAGAAGTGAACGCTTGGCGGAAATAGGAGAAAA
GATTGAGAGGGAATTAGTGGTGGTAGGAGCGACGGCGATTGAGGATAAAT
TACAGGAGGGAGTGCCAGACACGATTGCGCATTTGCTGGAAGCCGGGATT
AAAGTTTGGGTGTTGACGGGAGATAAAGTGGAGACGGCGATTAATATCGC
GTATTCGTGTCGGTTGTTGCATGCGAAAATGACATTAATCAAAGTGATTG
ACCCCAAGGATGACGGGGAGGGCAGGAGGGAGGAAGAGGAGTGTGATGCG
GCCTTGAGGAAGCAGTTGAGGAAATTGGTGGCGCATTTTGAGCAATTGGT
TGAGGACAAGACGCTTGTGGGCGGGTTGTGGGCATCCTCTCGCTCTTATC
AAGGAGAAAGAGATTGTAAGAGATGGTTGCTATGGCGATGGTGTAGAGGG
AGGTGTCGGAGTCAGGGAGGGAGGAAGAGGAGGGAGAAGAGAAGCACCGC
ATTAAGTTATACGACGGCGCCTGCAGAACTTCATAGAGGTGCTGGAAGAG
GAGGAAGGGGCGAAGGAGTGGCGGCTGGGATGCTCTCAGAGCCTCTGGTA
GAGCATGACGTTGATGAGGACGATGCAATGCCTTATTCTCACATGAGCAA
GAATCCTTTGAAAGACGTGCAGTCTGACCACCTGGCTCTCATCATCGACG
GGCCAGCTCTTGCGCGTGTGTTTGGAGATTGGGAAATGGAGCGCTTGTTG
CTTCGCGTTGCCACACTTTGTAAATCTGTGGTGGCCTGCCGCGTCAGCCC
TGCGCAGAAACGGATGCTGATTCGATTGGTCAAGAAGGGAGTGAAGCACC
CGACGCCCATCACGCTAGCGATTGGGGATGGTGCGAATGACgtaagttgt
tggtttgtccaggggcatgggcatctaagaattacccttgccagagctac
ctatattctcaataatcactaccactgaccccttcctcctttcttccccc
tcccccccctctctcctttccttttctcgttttaagGTGGCCATGATTCA
AGAGGCCCAAGTTGGGGTGGGAATAAGTGGGCGAGAAGGCCGGCAAGgta
ccggggtagggggggaaaggagggagggcgagcgccgatcaatagtatgc
acctagcatgatgaaggcaaaaatagtcttcattccactcactaatccct
tttccctttctttctcccttttccctccctccagCCGTTAATTCAGCCGA
CTTCGCTATCGCTCAATTCCGTTTTCTGGAAACACTCCTTCTCAAGCACG
GGCGGTGGTCCTATCGGCGCACATCCAAAGTGATCCTTTACTCTTTTTAC
AAGAATATTGTGCTGACTTTTGTCCTCTTCGCCTATACCTGGTTAACTGG
CTTTTCAGGGCAGTCGCTGTTAGAGGATTATGTGTACACGTCATACAACT
TCATGTTGGCCATGCCGCCGATTTGgtgagtgtctttcaactacttgtgt
tgtggattgcttattgttgctcacatttaatgtgactctttccactttcc
attcttttcttgcctgcgtaggcttttttggtcttcaatcctttcctgtt
gttgagctaaatttatctctttcttccacccgtacccttcccttgcagTT
TTGGGCTCTTCGACCGAGACCTCTCCGCCGACACAATCATGGGCAATGCC
ACGCCTGACGCACGACAGTCAGAGAGAATGCAGCACGCCTCTAGCACCAC
CACCATCGATCACCCCGCTACCATTTTAGCTATTTCTCGAACTGACTACA
ACGCTGCTGCTACCACTGCTTCTACTGCAAGCTTCCGCTGGGCCTACATG
AGCGGTCGTGACAACCTGGACCTGAATCTCGGTCAAATGGCTTTGTGGCT
CTTCCAAGCCATCCTAGACTCCATCCTCATCTTCGGTTTTTCCTTTGGAG
CAATGTCAGCTCCTCGTCAAGTGCTATCCTCGGAGGGGGACGTGGATGAC
TTGTACATGCTAGGTTTGATTACATTCACGGGGATGCTGGTCGGCATGCT
CTATAAAGCAGCCACGAATACTTATACCTGGACGTGGGTCAATTTTTTCT
TCTTCTTTGGGAGTGCCTTGCTCTATGTGATTTTTTTGGCGATCTACGGA
GGCTTGCCGATCGTGGCGGGCGGGTTCTATGGCGTCCCTGGGCGAATGGT
CCGTCATCCTTCCTTTTGGTTGATAGGCGTATCGCTGGTGCCTACCGTAT
CAGTGGTTGTTGATTACATTTTTATTTATCTGCGCCTATCTTTCTTTCCG
TCCCCAGTGGATTTTGCCATGGAGTACGATCGGGGATATGTTATACCGGA
AGCAGCACCAGCACCAGGAGCCAGAAGAGAAGAAAGAGCAAGAACAGTAG
AGGAAGAAGAAGCAGCGCGCGGGGAACATGCACCGTTCAAGGGCTCAGAG
GCAGCGCAGGCAGAGGAAGAAGCGATGGAGCGACGGCGGAGTGGGTGGTA
CCCAGGCAAGCTACTTGTAAGCTTACCGCTGTTGCGGCGTTTGAATTCGC
GGATTTCGCAGAAGGACAAGGAGGAGATGGGTATGATGGAGGCAGAGGGG
GGATTAATGCCGTCAAGTTATGATTATACGAGCTCGAGCTTTGATTTGGG
ACCGGGGGCCGGACACAACGGAGGACCTGGTGAAGGCAAGGTCTTAGCGC
GGCGGCACAGTCGGGGTTAGACGCACTATCGGGGTTAGACTCGCGATTGA
AGAGGTGAAGGGGAGACAGTTTGCGTGGCAGTCAAATGGCAGGTAGTAGA
GGCATTCACCCAAGAAATTGTGCATATGTGACTGCATGTGAAGAACCGCA
ATGCGACGGAGATCATTAAAGTTTGTAACGCACACAATTCAAGGAAGTGT
CAAAAATAGAAAGTTATAATAAAAAAATACAAAAAGTCGAGTGTTTGTCA
TATAATGAGGCTTACATTTCGTCTTCTTTTATTGATTGAAACGTCAGCAA
CTATTTTGATTTTTTCCTCTGAGATTGTATTGTCGTCTTCATCGTGTGCA
ATAACGGGCTTCTTCAAGTTTATATTTGACTTACCATTCTCTCTGGTTCT
ATGTATCACACATATACACACGCGATGTAAAGTCTTTCGTGTAATCATTT
TCCAGCATGGTCATAGGCGCCGATAATGTAGTTCTATCTACTCCATCACG
CTACTTGTGCTCATAATACATGCACATTTTTTTCTAACTTTTCTGACATT
TTTACGTACTGTTTGGGCTGTTGGGGTGATTATGAAAATGTAGTTACTTG
TCCTTTTGTATCATCTATCTATCTTTCTGCGTTTGGAAATAATTCATTTT
CTACTTTCCAAGGTTGGCCTCAAAGGCTACAAGAGTCAGGGCGTCTCTAC
AAACAGACACACACTTCTCTCTTCTCTTGTCTCTCCATCCTCTGCCGCCC
TCATCTATATCTTCTACCTTCAGCGTTGCCTATGGCCTCCCCCTTTCCTT
CCTATCATTTCTTGTCTAGCGCTCCAAAAAGTGATTTGTATGGAAAGGGA
AATAAGACTTGATGCCCTCCCGACAGAATGGACATAGCCGACAACTCTCT
CGGCACGTCGAGCACGTAGTATGTCCACATGGTGCTAAGATCTCATTCAC
ATTCCGCTGGAAACAGATATGGCATTGAAAAGCTTGCTTGATTGATTCCA
GCTCCATACTCAGCCCATGTGCCTCCTCCCGCACTGATTGCGTCTCATGG
TATGCTGCCTCCAAATCATGCTGCAGACGAATGTTACGGGCCAGCACATC
CTTTGTATCTTCGCCCAGCGACAGATTCATGCGCGACAGAATATCATCGA
CGGCAGCAAGGAGCTGCGTCGATCCCACAGGAGCGCGACAGGTGGTAACG
ACCATATTTTCATCCGCGACATCTTCCTCTATCTTCATATTCTGCACTAG
CGGTGGAACGGCATAGACTGACACCTTCTCGTGCTCCCGATCTCGACTGC
TGAGAGGTTCCCGTTGTGTCTCGCTATCATTACTTGACGAAATAGATGGG
CCCCTATTGGTGGCCGAGAAGGAGTAGATCTCCGTTGACAGCATTTCCTG
AACTTTGTTCGGCGCTACACGAACTAACAATCGCCGCACCACTCCCCTAC
TGCTGTCGCTGCCGTCCCCATTGCCTCTCTCTATATTCCCCGAAACCACC
ACCCCATACCGAAGCAGGTCGTTACCAACACCAGCAACCCCATCAGCAGT
CGTCTCTTCTCTCGCCTGCCAAGCGACAATTTCCCCCTGCATGTGCACAC
GCATGGGTCGAAATTCCACTCGATGCTGATCCACATCGTGCAGCAGCTGC
CCAGGCATGCCTCGACGCAGATGTTGCGCACGGCTCTCATCCCGGGCATT
TAGCCGAAGAAGCTGCATAACCTGTTTAATATTCGATGGGGGATGTGTTG
CCAGCAAGGCCGACATGGGAGCTACCAACCATTCTTCCCCTGGTCCTCCC
AAAGAAAACATCTGTGCGACAGCCATGGCCAGTACGTAGGCAGGCGTCAC
GGAGGGGGGCAACAAAGTGACGGCGAGTAACACGGTGCAATTCGCACTGT
CGACGTAAAACATCGATCCTTCCGGCTGCAAAGTGATGTCTTCCTCCTCC
TCACCCTCTTCGTGGCTGCGGAGGAATCGAGAACGGAGAGTTGTGACAAA
CTCTACCTTGAATACATTAAGTGTCTTGTGCAAAGCGCTCGCAATATCAT
CGACCAGCATCGAGCTCCTTCCACCTCTACTACCAGCTGTCGGCAACAGC
CGTGCTAAAATGAAAGTTGCTGATCCGCCCGTCGCCTCCCGCTGCTGCTG
TGTCAGCAAAAGACAAGCCAGGGCATGAGCAAACTCGTTCGAGGTCAACG
TCGCAGCCAATGAGGCCTTGTCGCTCACCAGAGAGGAAGCCGCCAACACC
GGCGCAAAACCCTCTTCGAGAGCCTCCCGCACGACTGCCGAAATGCGAGG
GATACCAAAATTCTCACACGTCTCCAGACTAATGCGCGGATGGACCACCC
TTAACAAATGAGCATGCACCCGGCCCCGCAGCCAAGACGCATCGTCATAC
AAACACTTATCAAGCGACACCAATATCGACCGGTCATCCGGCAAATAAAG
GCGCCAACCACGGAGCACGTCATCCACAGCCCTTCCAACCTCTGTTCCTC
CCTTTCCTCCCCCCTTCCCTTGCTGCATTGACAAGGTCTCGGCTAACAGC
ACCACAACTTTCAACACCGCCTCCAGCTCATTCGGATTCAACGAGCTATC
CCCACACTCTCCCTGTAGCTCTGCAAGAAATAATGCATAATCCTGCGGGG
ACGGGCTAGCCCGCGTCCCAAGAGCAGTAAACAAAGTATCGTACGCGCCA
AATGACCGCGGGACTTCAAACATAAATGGGGCCAGATCCTGAGTCAGGCG
GAAATACAAACGTGCAGCTTTCACCAGTCTGTTACCTACCGGCACACAAG
GCAGGTCCTTGAGCGCCAACTTGCTTCGTGGTGGGATGATATCCCAGTGC
TCCTCCAGGAACTTAAATATGCTCTGAAATACCACCACAGGGGAGTCGCC
ATACACCCACCGGTCCAACAACGACCCTTCTCGGGAATCGGCTACTGGCA
CTACTGCAAACGTACCATCAGCGTGTAGCGTAGTGGTTGTTTTCATTCCT
CCTTCGCACAGATTTATCAAATGCTGCACCACCGGCTCGCAAGCCGGGGG
TGAGGACAAGCCCAAAGTTGACCAGCAAAGCTGCGGTGGTACCAGGTGTG
GCAATAAAACAGGTATCACCGACCATACAAGATTTCGATCCCTCGCTGGA
GCAGCTTCTCCAAAACTCACTAGTTGATACTGCCGATTGGGCGCCATGAT
CACATCAGCTTCTCCTGCCACCCCTCCACCTGTATCACAGCCACCAGCCG
TCGCCTCCCCTGAACCCGACCCCGGGCCCGTCATGTGCAATCCTGGTCTT
TCCACAGGGACACATCGGATTGGCGCCAACTTGCGTTGTATAGCTTTATC
CCCACTCAACTCCCCATCCGCTGAAACAAAATACCTCATCAACTTCGCCG
CCTTACACGCTGCCGCAAACGGCAACGGCACCAAAGGTGCTAACGCCTCA
ATCCTCTTCGCGCACTCCAACAACGTGCCCCCATCACACTTGGCTTTCAA
CCCCAGGTCTCCCAAAATCGATAACCATGCCGGACTTCCAAATTCCCCTG
GAGGGAACACAGACTGGTCCTCCTCAAAAACCTCCCTCAAAATGTCACTT
CGTGGATCCAGGAAATCAGACGCCGGCGCCATCTCCCCATTTGTTTTCTC
GATAAAATTCGCCTCCTTCAATGCAATCACCAACGGCGTATCCGAGACAA
GCCCATCAGAGGTCCACCTTCTTTTGATCAACTCCAAGACTCGCCCACGC
AATCCCGACTCCATCGACGCCAGTTCTGGAATCAGAAACCGAGACAAGAT
CTCCGCTTTTCTCAGGGTGGCCACCCCTAAGTCCTTGTACAAATCGTCCA
GGGCCGGCTTCCGTTTAAGGACTTTCGACTGGCTTGTCATATTATTGCTC
CTGCTGCTACTGCGACTGGATCTCGTGCGTTGCGTCAGGCCTCCTACACC
TGTCGCCTGTGCAGACGACGACAAGGAGGAGGAAGAGGAGGAAGAGGATA
CTTCGAGGATAGAGAGAAGCTCGTCCAAAAAGTGGTCCTCGCCCGCTCGT
CGATCATCAAGCATGACATACGGCCCACCACCAGCTAGAAGGGCAATACG
CGTCCCCGCCAATGTCTCAAAAAGAGGTAATTGCCGCAATTTCTCCATCT
CCGACGCCGTCAAGGCTAGAGGCACATTCCCTCCCTGGCTGGAGCTCGTA
AGCCGCTGTAGTAGATGATCTTGCTCCACACCTGAGAGGTTCTCTGTCAG
TTTCAGTCCCTCTTTGAACGTATGTAAACATGAGATGACTGCACGCGTGA
CTTCACGATTATCCGTCGTCAACAAGTCCGCCATTACTTTCTTGGGGAAG
AAAGCCATTTCCAAAATAGGGGCACGGGCACGGCTCAAAAGTCGAAGCAG
CTCTAGGCGTTTTTGGTGTTCAGGCCCCGTCATCACCGCCACAGCTGTTG
CTGCTGCTACTGCTGTTTGAAGCTCAAAACCAGCATCGCCAGCAACTGAC
GTGGCTGGTCCGACCTCTGCATCCTCCACGTCGTCGTCATTGCCGTGGAC
GACCTGAGTGACAGGCACCATGGCTTCCAGTCCATCTGTATCGTAATCAA
CCAGCAACCCTATCGGTGTCTCCCTTTCTAGGTCTCCTCCCCCTTCTCGT
TCTCTTTCTCCTCCCTCCCCTTCAAGGGCGACAATGTCCTGCGAGGGCAA
AGAATGTGTCACTGATGCAGCAGTAGGTGTAGGTGTCCTCGAAATGGACG
ATGAGCGTCGCTGCCAACCGTTACGATCAGTGGTGGTGACAACCATGCCT
TCTTCTACCGCCTCTTTGGTTGCACGCGAAAAAGCCTCCAGCTCCTTGAC
ATCCAAGCATATATCTCGCGGGTGCAAATGAATGTGGCCCCGGTGACAAC
TCCTCACGCCACTCACATCCGAAGGAACGCGTACCACTTTCCATTCGTCC
TCCATCTCTCCCACTGTCATCGCAGAAGCTGTAGCCGCTGCAGCTGCTTG
ATCCTCTAAACCCTCCTGCAAACGATTATCATAAAAATCAGGTAAAAGCC
GTAGAACATGAACAGCGAGCGAACCAGTCACCAACTCCACACCCCCTCGT
CGGGGAGAAGATCCAGCTGCGGTGCCATTACAAATAGGAATTAACGGCCA
AGCAGCCAGCAATTCCTCCGCTCGACTAAAGGGCATCTCCTGCCAAAAAC
GGTGAAGCCACTGAGGGGTCAGCGGGTATGGGGCGCGGGCCCAATCCACT
TGCACACGGACAAGACTCTTCCATTCTGAAGGCAAGACGGTGTGGACGTA
CGAAGCGAGGATCTCCGGGCTAAATTTGACCAGGCCCAGAACAGAGAGAA
AGTCAGGCTCGTCAACTAATTTACCCATCGCTTGCACCGCCTTACTGGAA
ATGAACAACTCGCGCATGTTGGGAAAAAGAGATTTCTGGGCATCATCTGC
CACTATGTACGCGACATCGGCGCGATTCGAAGGGTTGATTCGCCCTACGG
ATCCGTCCTCCAACGGCAGCAAGGGCAGGCTCGCAAGTTCTCGCCAGCGC
TCACGGCAATTTCGAATGCCTTCCTCTCCCACTCGTCCTGCATGACAGTC
ACTCAAGCAATACGTAAGGAGGTCCAGTCCAAGTTCAGGTGATGACACGA
TTTCACGCGAGACGAAGGAGGCGGAGGCTGCGCGAAGTTGTCGTCGGAGA
AGGGCGGGGGTCACTTGTTTGACTGGATAATTGCGCTCTTGCAGCTCACA
GGCCACAGCAAAGGGGACATCGAAGAGGGGGAATAGGCGCCGGACAAGGG
CTTCAAGCTTTGGATTCCCCTGCTCATTGTCGAAAAAGTACGCCTCTCCG
ATCTTCACAAAGTGCTTGCCAGGTGCTACACGCCCCACCCCTCCTCCTTC
GGACTGATTCCCCAACAAAAAGTAATTACCATTGGCCAAGCGATCAAATA
ACGCTGAAGCCCGCACCCGCCCCTGCAGGCGCTCAGAAACCTTTCCCAAA
AAGGGCCAATACTCGTACAAACCCCTTCCATCCCCCTCCGCCCGCAGCAC
TATCTCTTTCAACGACGACATCAACGATGGCATCACCTCGTCGTTCAACG
TGCTTAACAACGCCTCATTCCACTTCACACACCGTGACAATTCATCTAAC
CCCTCCCCTCTCCAGACCATCGCCCCCGTAAACCCATACCCACCAATCAT
CCGCCCCTGGGCTACATGCTCGTTCAAGACTGGCCGAAGGGCATCAAACA
AGAAAAATGGCGCATTCACATGGCACGGAAGACCGAGATCCACCCCAGTA
TCCAGCAGAGAGTAGGCCCGTCCAGGGCGCGAAGCCGACCAGCGCATCTC
TCCTCCAAGCGGAGTCCCTTGTGGACAAACCTCCATCCGCGCGGCCGCCG
AAATGAGGGGAAGCAACGGTTTACCAAAGCAATTTCTCAACTGCTCCGAC
AAGGCTGCTTCTCGTGAGGCACCGTGGGCCAGGGCATCAAGGAGAAGCCA
CTGTTCCTGACGTGAAGGAAGATCTCGAGCCTCGCACGACACCTTCACCA
TGTGAATGCTCATAGGCGCGGCGTTCCCACCTCCTTTCCACAAGTGAGCT
AGCTTGAACTTGCCCCATTCCTTGTTCTCCAGAAGCGCACGACGTTCTCT
GTGCAAGCAAGCTAAGTTCGGGTCAGTGACCACGGCTTTCATCAGAGGCT
GAGGAGCGGAAGCGTCCTTTGCCCAAAAGGCAGCAATAACCTTCTCCAGG
CTCAAGCTAAACAAGAGCGATGTGGGCACTGAGTCCAACATACGTGGCAG
AGCCAAAGACACCTTGTCGAGGCCGGGAACATGGATAGACAAGGCCGACT
CCCGCCGTCGCAAGGGGCAACGCATCACGGTGCCCTTGAAATGGAAAGCG
GATCCGCCCTTACTGCCCTCCCCATTGCTGCTGCTACTGCTCTCCACGCT
ATCACTCCATGTAGGCAGGCTCATGAACGGCGAAAACTGGTCCGGGAAAC
GCAGGTGGAGATTCTGTCCCGCCATAGTGTAACGTCGGGCCAAGGGCGTT
TCCACCAGTCCGTCTCCCCGCGCAAGCGAAGAAGCAGCCGACAACCCAAT
ATCATCTTCCTTCTTCTCCTCTCTTGCCTCTTTCTCTTCTTGCTCCTGCC
GCTGAACGCGCTCCACGTTCGCTTCTGACACGAGATGAGATCCACAGGGG
TCAAAAAGCAAAAACTGGTCCCCACTCAATACCTCCAAAAGATCCGTCAA
GTGAAACAAGGATAAAAGCCCCAACCCCGCGACTGGCCACCCATAAGAGC
CTCTCGCCTTATCTTCCTCTTGCTGCGGTTGTTGCTGCTGTTGTTGCTGC
CGCTGTTGGTGCAAGGGCGACTTGCGGCCACTCACGCTCATGCTACGGAA
ACGAGCCAATCCTCCTCCACCACCACCACCACCACCACCACCACCACGAG
CATCCTCATTCAATGCAGGCGGAGGGTAGGGACGGAGGATGGGTCCGTCC
TTGAGCCGCGAGGGAGAGGTAAGACGAATGATGTCATCAAAACCAATCAC
CACGTCCTTCAAATACACAACCAACGCCGGGCCTTGCGCCTCGGCCAAGA
GCGGGTGCAGCAAGCTCTCCTGACCATACACGCGCTCGTCTACCACTATT
TCCACCCCCGGGCATCCCACATCATCGGCCAACTCCAACAGGTCAACGAC
TGCCTGCAAAACCAAATCCCCCGTCACATCACCTTCTCCTCCCTCCTCTT
CCTTCTGCTGTTGCATCTGCTCCGCCCTTCGGGCCGCCGGCGCGAGGAAC
GAACGGATCTCCTGCGCGTGAGCACAAGGAATAGAGGCCACAGCGCACTG
ATTTGATAATAAAGCCTCACGTAAACTACGGGCACCCAAGCCACGAGCTA
CTTCATTTTTGACCCCCGCATGAACAAACTTCAGGTTGATCTGACCTTGC
TGCATAGTATTAAGCAACCAAGGGGCATCCCCAAAGCAAAGCTCATCCAT
GCGATGCATCACCCCATCCTCCCCTGGTGCATATACCACCATACTCCCCT
CATGCAATGCCTGCTGCCGATCTTCAAATGGCGTTCTATGCAGAGCTCGT
AAAAGCCCCAATGCTTGGGTCAGATCCGCAGGCGATAAAGACGTTCGTGG
CTCCGCTGAACGATACATCGCCTCGAGCGCCTTGGCCAAATCCGAAGGAG
AGAAAGACTCCCGCACTCCCAACAAACGCAAAAGTGGTGCATACGAGGTG
ACTAGCTCTGAAGGAACCTGAACAAGATATGGTCGAGTGTGAAGGGGCGC
GGAAAAGGCAACACGCGTAGTAGGCAGGAATTCGTCCGATACCCAGATCC
ACGCGCCGCCTGCTAGAATGTCAGGAACTGCTTCTTGTCCCGGACGCAGG
CGCGGCGTTGGACTTTGCCGTCCTGTTGTAGGTGCCATGGACATCGTTGA
TACGGTTGGTGTTGTTCTCGGCTCGTCATCCTCCTCCATCAGCACGAGCT
GTTCTTGCAAAGCAGAGTAAATTTCCGGGACGCATTTGTTCAAAATTTGT
CGAAAATGTCCTCCTCGTCCACCTTGGAACGCCGCCGACAACCCCAAAAT
TTGATGCGCAAGAGTGATCGGGTCTATTGCATCTTCCCAACCCAAAACCT
GTCGCAACAAGGGGCTGCTTACATCTGTTTCGGCGATACCCTTCCGCAGA
GAGCAAAGCCACATGTCGGCTTGAGGCCGCGTTTGACGAGGGGGCAAGAC
AAGAGGCAAAGCAGCGCGGGAAGAAGATGAAGAGGAAGGCCAAGGAAGTG
AAGGATGCAATGGTGCCTGTCGTACAGGAATCCAGGCAGGGGCTCGTAGT
GCTTTCACGAATGCTTTAGGCAAGGCCAGGCGAGGCCGCCAGCGTGAAGC
ATTTAGAGTAATGAAGCCCGAATCGTCTGACTCACTCCCATTACTCTCGT
CGTCGCCTGTCTCTTGTTGACCCTTCTCTTCTCCACCCAGGTACAGCTCT
CCCTCCGCACTCATAGTATCCTCAGCAAGTTCCTCCATATGGTGGTCCAA
AAACCGGAGCAAGTCCTTTCCCCGTCTCACGGCACGTTCCCCCTCCCCTG
CCCCCGCCAATGTCCCCACGGAATTCGCGCTCATTATCACGCCCTGCAAC
GACAATGTAGCCTTCAACCCCAATCCTCTCAAACGCGAAAGCAGCTCAGA
AGTACACAGCGCAGGGGCTGGGAAGGCATCTTGTCCCAACAGATTCTCCA
AACCGCCTTGCAGTGGCTCATACAAATCACACGCTCGCCGCAATATCGCG
ACTGGTCTAGCACCATTAGGCACAAAGGCCGTCAATTGCAGCTCCTCCAC
AAAGCCCTTTCGGTCCGTGACCTCATCCTGCAATAACCGAGGCAAATCTG
AGAGCATATTAACGACTGCTTTATCTCTTACCGACGGCTCCAGCGACTTC
AGTGCGCCCAAAACGTACTTCCGATAAAATATATCCAGAGGGATATGCCG
TAATCCCAAAAACTCTAACAACGCCAAGTCGTCCGGTCCATTGCTTTTTA
TAAAAGTCGGGCAACCTGTATCAAGCAAGCGTGGGTCCACACCATCGGGA
GGAAGCCAATGTTCAGTGGAGTTCAAAGCACAAAATGCCACCATGGGGGG
GGGTTACTACTGCTCCTTGTCTTATCGTTCAAGTGCTGCTGGCTTTCGTA
CGACTCGTAAATGGGCAGGGCTCTCAAGATTTGAACCCCAGTCTCCGTTG
TCATCGATTTCATCTCTACCCCACCCGGCGACGGTGACGACGAAACCGTC
TCCTTACATTGCAGCATTTCAATCTGCCGACGGTCGCCCAACAAAGCCCG
CAACGCCCGTCGACCCGACGGGCTCACGCTGCCAAACAACATGCCCACCT
GAGCGATCAGGTCTTGGTTCTCGCAGCGTTTTGCCAGCGTCTGTAATAGC
CCTAGAATATTCAATCCCTTGACCAATACTTGGCTTTCGGGCGCAGCAAA
AATAGACTCCACAGTAGCAGGAATCACATTTCGATTCACCACTCGGATGC
CGAGATTAACCAAGGCCTGTTCCAACTTTTCAACACCCTTGGGGGAGGAA
GGAAGCAGGGTTTGTGTCATGTGTAGAAGAGGGAGGGAGGAGTCGAGGCG
GACGAGCATGGGCTTTAGATCAGGTCCTTGGAGGACGGGGAGCACAGGCC
ATTTGCCACACACGCGGGCGACGTCCAACACCGTTGCAGTCGCTGCAGTG
ATGGTGCTAACGCTGGTCAGGCTAAAGTTTTCTGCCTGCATGACACTCCT
GGGTATGAGCCGGCGGAGAGGAGAAGATGACGAAATCAGAGAAGAAGAGG
AGCGTTGGTTCGATCCTCCCTTGTTGTTGCCCAACAACCCACGGGAGCTG
GTGAGATATGCCCACAACCCGTAAATCCATTCGCTGTCAGGCACCACCCG
CCCACCCACGACATCCCCGGGGCGCCAAGGCACTTCCCGTTTATCCTCGA
AAGGCGGTGGCAACAGCTCGAAGAGGAGGCTGTCCAGGTATTCTGGCGCC
AGGCGGACAACATTGGTAGGGCGCATCAAAGCAGCATTGTCGAAAAGAGC
TGCTACCTTGGGTAGCAACCGCTCCCGCTCGACCACAAGCACATGCTGTC
CATGCACAAACAATGGTCGCTCCTCGTCATCCGCTATGAAGTATGTGGAA
TGTAACACCGGACCATCCGTGACGGAAGCCCTAACCGCAGCTGACGACAA
CAATACAAAATTGCCCAGGGAACCATCAGCTAGGGGAATGAGAGGTAAGC
CATCAAGAGGCGCCAAGTCCTCCTTCTCTAACAAATCGCTCGTGCAATAC
ACCAACAAGAATTGCGCATCCTCTTGTGAAGCAGGCCAAGGCCCCATGAA
TGATTGACGGCCTCCTATGCTTGTAGCACTACCCATGCCTCGGGCTTCCC
GTACCAGCTGCCGGACAAGCTGTGGGCTCACTTCGTTATAACAGCACTCC
TCTGCCACCAGCACGCGCTTCAGCACCATTGGCACGGAGACCACGCAGCA
GCCCCGCATCAAGAGCACGTCCCGCAAGCGTGCTGCCACACGAGCCGCCT
CGTCCTCCTCCTTGTCCTTGTCCGGCATGCTCATCAACATGGCGGTTTGA
GATGGAGCCCACGCCCCTCCTCCCACCTCCGACCACAGCAAGGGCATTGG
CGCCGCTAATTGGTAAAATCGGGAGACCAAAACCTGGAAATGCGTGGGTA
CGGCCTCAGGGAGAGGAAATAAGTTGTAGTAATACGTCAAGGGCAGGCAG
TGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTGCTGTTGCTTCCCCCC
CGCACCGACAGAAGCAGCAACAGGACTCAGCAACTGACCCATATGCTGCA
CAAGCTGGATATACAGTGGTGCGACTACATCCTGAAGCAATGCTTGATTC
CAATCCGACCGCAACTTCCCCTCTCCCGTCATTTCCCCCCCAGCATTAGC
CCAGATATCGCGACGGTTGCTTGATAACTCAAAATTTGCATTCACAAAAC
ACGGCATCCCCGAATCCACTGGTAAGGGGAGAAAACAGAACACCTGCCCC
CGCTTCGGTACGTTCTCCACCAAGCATCGCGGGCCAAGTGCATCTACGTT
TGCCGCTTTTCTCTTACCCTCCAGTCCATCCTCAATACGAAGGGGGCCCT
GACGCGTATACACATGAGCAGCGACTCCACCAAATGGAATGAGCTTCATA
TGCCGATTGCTCCTATCCACGGCCATCGCTAATGCTGCCCCTCCCCCCAG
ACCGGCCAAGACCAAAAATTCGTCCAAACGCAGCATCCTCGAAGAGAGGA
CACGACCCATAAAATCAATCTTCCCATCCTCATACCACACCTCCATCAGC
TGCTTCACCCGAGGCAAATGCGCAGTGGGAGTCGAGGCCAGCTTGGCATA
GAATGCTTCCTTACTCAAGGGAGCGCGCAGAGGGCCCGTCATAAAAGTCG
AAACCTCCGTCCAGGCCATTCGCGCCTCCATTCGCCGGCTGGCACGGTAG
AGCAGCTTCGGTTCCGTCATTTTATCCTCTACACCCTCGTGCCCTCCCTC
ACCGTTATCACGCGCAACATACACTTCAATAGTATCCACACTTCGAAGGA
AAAGGAGAAACCGGGCGATGGTGGTCTGGAACGAGGCCAGCAGCTCCTTG
ACCGTCGCAGAATCATATCTACTCTTACTAATCTCCGAGTCCCTCGCCTG
CTCGGCCGTCCGCAGCGGAAGCCGAAAAAGAGTTCCCAGGAACGTCTCCT
GCAACGTACAGCCAAACAAGGTATAAGGCTCAAACTGATCAGGAAAGTGC
GCGAGAAGATCAGTGCCGACAAACCGAATCTTAATACCAGGTTGCTGTCG
CGTAGCTCCAGGCAGAAACTTGACATGTGGATCAAAAATGACCAGATGCT
CCGCCGAGACAAAAGAAGGCATGTCCGTACAGTGATAAACAGCATTCATT
CCCAAACCAAAGCGGCCTGTAGAGTTCTGTTTGTCCATCTTGGAGCCTTG
ACCAATGCGGGCAATGTTCTGAAAATCCTGTGGGGAGAACATGGCATCAT
TGTAAAAATAAAGCGCTGGTCCCTGCCACCCCGCCATTTTCTGCCCCAAA
AGCGACGACGTGCCGTACTGCCGCTCACTATACAAGATCCGCACAGTAGA
TGCTCCCGCATCATCTGCATTCTGTAGCAGCTCGCTAATCTCTGCTCTCT
CTGGATATAGCTCCAGAATGTGCCGAAGGCGACGAGTCAAGGCTTCCGAC
TGCCCGAACGCCTCTGCCTCCCCTGCAGGCAATCGAAAGTCCATCGTGTC
CGCATGGCTTTCCATCAACAAGCGTCGCACCGAGCGAATGCCAAGCTTCT
CTCCCGTGACCGAAGAGATCTTAGGGTGTGCAAATCGAATGTCAGACCGG
CGTTGTGTCTTCTTCGAGAGCCACGGTGCATCATCATAAACTAATTCTGG
CGCCCGCGCCAGCTTGCCCTGCTCGTCAGGCGCAAACAATTCCCAATCTG
AGACCTGCATGACTTCATCACTCAAGCGCTGCACCAATGCGATTGCAAGC
TCCAATTGGAAAGGCCTCAAAGCTACAGGCTCTGTCACTGCCGCCATTGT
CCCACCGTCCTCGTGACCCGCCGCAAATTTGCTACCGCTGCTGCTGTCCT
TGGCACCCGTTTCCATCGCCATGTTGTGCAATACCTGCACAAAGTCACTC
GGGCCAAAAGAAGGCCGGACACCCACCATTTTGAGGAGCGGGGCAAAGCA
GGCTAGGTCGGGTGGCACGCTGTAGAGATAAGGGGTGGCATTAACTTGGG
ACGCGAAGGCCACCTGCTCTGCTTCCACAAAACGGTCGCCTACAAATACC
CATGGTGCCCCATGCAGATACCTGCGGACGGATTGCTCCTCCTCTCCAGT
CGAGTTTTGTAGCGTCAAGCTCAATTTCTGGTAGATGACGGGAATGACCG
AAGAGACCACCTGCTGCATAGCCTGTACAGCATCGTTCTCCATCTTCCCG
TCATCCCTCCCCTCCTCTTCCTTCTTCTCATCCTTTGATTTCTGGCTTTG
TTTGGCGGCACCAGCCACAGCAGTTACAGTCGCGGCATAAGCATGGAAGC
TTTCCGCGATCGCTACTAGCTGCCGCGCCACCATCGATGGAGGGAGGGGG
TTTTTCCAACCAAACACATACTTGACCTCTTCGGCCACTGGCGTCTCAGA
GGGAATAATCCCGAACAAGGAAGAGCCATACCACATGTCTTCTGCCAGCC
GGGCGTGCGACGGCGCAGCCAGGATTCTTGTGCTGCTACCCCTGCTGCTG
CTGGCAGTAGTGCGCCAGGGCAGGAAGGGAAGAGGTTTTGTGGCCAGCAC
AGGTACCCAAGGTATGGCCTTCAGCTCTACCACCATCACTCTACGTTGCC
GCTCGTCCTCATCCAGCTGATCCTGCCGAGCCTTCTCCCGTGCGGTCCGT
CCCTCACCAAACAAACTCAGTGCCCGGGTAAATAGCCCCGCCGATAATGC
CTTCTTCCTCCGTACTACCTCCGCCTCTTGCTCCCGCACCGGAACAAAAA
ATCGCTCCGCATTCCGATTCAAAAATCTGAACAAAGTCTGTGCTCTTATC
AATCCCTCAGATGCCGTCAGTGGCTTTTCAGCCAATTCGTGAGTGGAACG
TGCGATATGGACCACGTCCGAGTAGAGGAGGGTAGAGCGAAGCCCAAGCT
GACGGAGCAAAGGCAAAATTTCGGGAGTGGCAAAATCTTGTCGAGGAAAA
AAATCAGGACCTACCAATAGGGCTAGCTCTTCTACTTCTGGGTCATACAT
GTCTTGACAACGGGCAACGATCCCTGCAACATTGGGGAGGAATGCAAGCC
GCTGAAGGGCATGGAGGAAAAGGGGGTCTACTTGAGTAAGAGAGGGGAGG
TCTGAGAGTACTAGGTCGAGCATGGCACGGGCACGGAGGGTGCTATCTGA
AGTGGCGAAGGAAGGAAAGAAATGTTGCTTCAGGAAGGCGACTTTGGAGA
CAGTCTGAACACCCAATTTCTTGGCCAAAGCTGGCTCCAAGGGGTGGGTG
GTGCTTCGACTGCTGCTGATGCTGCCGGCATTGGTGTCGCTGCTGGTGGT
GGTGGTGGTTCTCTCAACGTCATCGTCGTGCATGCGTAGGAAGCTTGTCG
TCAAAAGGCCCTCCAAAAGTATAATCTTGCAAGAAGGGATGCCTTCAAGC
AGATACAGAGGCTTAGCCTGCAGGGGCATGTAACACCTCTTTCCAGCCGG
GGTGCCATATACAGGAAAGATGGGCAAGGAGGAAATCATGAGCAGATCGT
CTTTGGTGAGCTCGGCCAGGGGCTCGGATACCAGCAAAGCTCGCAACGCA
TCGAAAGAGTCGTGTGCCTGTGGAGCAAGGGCTTTGGTCAGGACCTGCTT
AAAGTCCATCTGTGGTTGCTTCTGTTGGTGTTGGGCGAACAGAAGACGCA
AGCATGCTAGCACGCCCGCCCGTCGGAACGGCTGGACATAAGCCCTCAAG
GCCGCCGCTAGTTTGGCTTCCCCGATGGCAGCGACCAGCTCTTGGTGCAC
TGTCAAGACGCCCATCGTCCGAAGACTACTTAGAAGGGCAGAAGAGACCA
TTGTAACAGCTGTAGAATGAGAAGGGATGAGGAGGAGGGGCAGCTTGGTG
GAGAGACGGGCGACACACGCTTCGTTGCAGGGGAGCAAGGGTAGGCCTTC
AGCTAATGGGGTGAGGTCGGGCATGTGAGCGAGTAGGTACCGCCACAGGC
GCAAGAACCACACCAAGGTGGGTGGGGCCAGCGAGGTCATCGTAGAGGAG
GATCTTGCAGCAGCCTGCGCATTCCAAGCCACCCGCGCCACGTCTCGCCA
GGCGTGGGGTAGGACGTACCGCAGGACCACCGGTAGGTACGAGGCAGAGA
AAGTACGAACGTTCGAAGCTTTTTGGACCCTAGGTGCTTGCAGAATAGCG
CGGGCCTTGTCCGGCAGCTCGTCAAAACGCAACAGTCTAGAGGCCACGGG
CGCAAAGAGCGCTACTTCCTCACACATACCCTCTCCTTCTCCTTCTCCTT
CTCCCCACCCTCCTCCACTATTATTCCTACTACCATCGGTCGTGCTCGTC
GTAAGGCCCAACACCAACTCACCCAGCTCAAACGGCTCCCCTGCCTTCTC
CACCACTTCTCGTTGTTCGGTCAAATAAGCCATTGCTTTTTCCATGTCCC
CCCCACTCAATCTACACGCCCGCAACGCTAGCCCTACCGGAAACCCCATG
GTCAGGAGCTGTGTGACTACCTGCTGCTGGTCCATCTGGTGGTGTCCATC
GCTCAGGCCTTGAAACAATACGATGGTACGGGGTTCGGCCTTGGGATCGG
CCGTGAGGAGGAAGGGCAGTCCGTCCAATTCGGTAAATGCTTGGTCAGAG
AGGTCGGAAAAACAATAACGAAGAAGGAGAAGGACTGCTTCGCGTCTTTG
CAGCGCTGGGTGTGTTTTGGTGCTGCTAATTGCAGATATAGAGGAAGGAA
AAGTGCGGAAATGCGCGCGGACAAAGGCAGGGGTGGCGAGACGAGGCACG
ACTTTTCGGGATTCCAGCACGTCTTCAAGGCCTGGATGGGAGGTGTGAAT
CACAGGCAAACCCTCATCTAGCAGAACTCGTTCTAAATTCAAAGACGCGC
CGCCTGCAACAGCATCAGCAAGTGTTCCTCCTTTCTTCTCCTTCTCTCGT
CGTCCCTCCACCGCAGGAATCAAAACAGCATCCTGCACCGAGACCCACGA
CCCTCCTCCTCCCGCTTTGCCCAACAGCGTGTGCAGCAAGGGCAAGGGCA
GCACCGCAGCAAAAAATGAATCCATCAAGGCAGTCCACGCCGGACCATTT
CCTTTCAAAGGCCAGAGAGCCTCATAGACTGTCCCAGGCCCACAACGTTT
GGCGGCCAGGGAAAGGAGTTGCGTGTAGCAAGGCACGGCCACTTCGCACA
GGAGAGAGTGGTTCCACTCAGCGCGCATTTGGCCATCGCCTGACATATCG
TCCCCCCACCACAAATCCCTCCGGTTACTAGAAAGTTCGAAAAACCCATT
CACATGTACAGGTAGGCCCGTGAAGACAGGCAAGGGTAAAAAACAATAGG
CCAACCCTTGCACTGACGGCACCTCGATGCTGCTGCTGCTGCTGCTGCTG
C

protein sequence of NO03G01590.1

>NO03G01590.1-protein ID=NO03G01590.1-protein|Name=NO03G01590.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1581bp
MSQHYGAAASGEPLIPPSPRPPRNDNELATVREAQRQQCVRSPILDMVLR
FITPRSPDDHHPSPSPFSSSSTSSNNNGTCIRRFHVLSPTYPMHVRTTKD
GKKRRFRYPDNSVSTAKYNIFTFLPRALFEQFRRLANIYFLVVTVLMLIG
TYSDLYQSPLTPFTTLFPLIVVLAVTMGKEAFEDFKRHTADQNTNNRAAR
VLRLVSTPVTNNGEGELEEVFWKNIGVGRIVKVLDKEEIPADMILLTSSE
PSGNCCIETSNIDGETNLKIKEAARTGENGCGRAFVTAADLQGWEAAVEC
EAPNSRIHTFTGTLQLAPKDRNQEQQQRRVGVNQANLLLRGSRLRNTKWA
LGLVVYTGYDTKIVMNSRAAPSKLSTIEVTTNRLLYLILGMQILLVSVTL
GAYLVWTDANESQLHYLCMDYLGGPSAFLRMNCQATAKEASELGMWVTFL
LLYNNFVPISLYVTVEMVNYIQAYYIDQDLSMYDPSSDTPALARTSNMNG
DLGSVRYVFSDKTGTLTRNIMEFRRCSVGGVVYGNLEGGREEESVERGEE
RQQIQGEEERTSGSSAGSGGRREGAQGHRFSLPVDPFALRGKALQELAEL
SNACEGKSVEEMISKTPASDSAAATVAVYFAECLAVCHTVVVEKPAPSSP
PPPPPPPSSSFSSSSSPSRGGGDRNNTTATTTTKTPKPKRLPLSYQAESP
DEEALVNAVARELNWSFLGRTPTSALVLNPQNQRLTYQVLAVLPFTSTRK
RMSVIVRTPDNKIVLLTKGADNVIFERATAYLGTTRQELDAHLSVFAADG
LRTLVLARKEVDEDDFQSWLDEFQKASTAVEGRSERLAEIGEKIERELVV
VGATAIEDKLQEGVPDTIAHLLEAGIKVWVLTGDKVETAINIAYSCRLLH
AKMTLIKVIDPKDDGEGRREEEECDAALRKQLRKLVAHFEQLVEDKTLVG
GLWASSRSYQGERDCKRWLLWRWCRGRCRSQGGRKRREKRSTALSYTTAP
AELHRGAGRGGRGEGVAAGMLSEPLVEHDVDEDDAMPYSHMSKNPLKDVQ
SDHLALIIDGPALARVFGDWEMERLLLRVATLCKSVVACRVSPAQKRMLI
RLVKKGVKHPTPITLAIGDGANDVAMIQEAQVGVGISGREGRQAVNSADF
AIAQFRFLETLLLKHGRWSYRRTSKVILYSFYKNIVLTFVLFAYTWLTGF
SGQSLLEDYVYTSYNFMLAMPPICFGLFDRDLSADTIMGNATPDARQSER
MQHASSTTTIDHPATILAISRTDYNAAATTASTASFRWAYMSGRDNLDLN
LGQMALWLFQAILDSILIFGFSFGAMSAPRQVLSSEGDVDDLYMLGLITF
TGMLVGMLYKAATNTYTWTWVNFFFFFGSALLYVIFLAIYGGLPIVAGGF
YGVPGRMVRHPSFWLIGVSLVPTVSVVVDYIFIYLRLSFFPSPVDFAMEY
DRGYVIPEAAPAPGARREERARTVEEEEAARGEHAPFKGSEAAQAEEEAM
ERRRSGWYPGKLLVSLPLLRRLNSRISQKDKEEMGMMEAEGGLMPSSYDY
TSSSFDLGPGAGHNGGPGEGKVLARRHSRG*
Synonyms
Publications