NO04G03100, NO04G03100 (gene) Nannochloropsis oceanica

Overview
NameNO04G03100
Unique NameNO04G03100
Typegene
OrganismNannochloropsis oceanica (N. oceanica IMET1)
Sequence length6612
Alignment locationchr4:844936..851547 -

Link to JBrowse

Properties
Property NameValue
DescriptionClathrin heavy chain
Mutants
Expression

Hover the mouse over a column in the graph to view expression values.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
chr4genomechr4:844936..851547 -
Analyses
This gene is derived from or has results from the following analyses
Analysis NameDate Performed
GSE1786722023-12-15
PRJNA9336932023-10-26
PRJNA9434492023-07-19
GSE1499042020-10-10
NoIMET1_WT_HS2020-09-29
GSE1396152020-03-11
PRJNA2413822020-03-11
BLASTP analysis of N. oceanica IMET1 genes2019-07-11
InterPro analysis for N. oceanica IMET1 genes2019-07-11
GO annotation for IMET1v2 genes2019-07-10
PRJNA1821802019-04-25
Gene prediction for N. oceanica IMET12017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0005515protein binding
GO:0005198structural molecule activity
GO:0032051clathrin light chain binding
GO:0005198structural molecule activity
Vocabulary: Cellular Component
TermDefinition
GO:0030130clathrin coat of trans-Golgi network vesicle
GO:0030132clathrin coat of coated pit
GO:0071439clathrin complex
GO:0030132clathrin coat of coated pit
GO:0030130clathrin coat of trans-Golgi network vesicle
Vocabulary: Biological Process
TermDefinition
GO:0006886intracellular protein transport
GO:0016192vesicle-mediated transport
GO:0048268clathrin coat assembly
GO:0016192vesicle-mediated transport
GO:0006886intracellular protein transport
Vocabulary: INTERPRO
TermDefinition
IPR036322WD40_repeat_dom_sf
IPR013320ConA-like_dom
IPR016025Clathrin_H-chain_link/propller
Homology
BLAST of NO04G03100 vs. NCBI_GenBank
Match: EWM28710.1 (clathrin heavy chain [Nannochloropsis gaditana])

HSP 1 Score: 1321.6 bits (3419), Expect = 0.000e+0
Identity = 902/2243 (40.21%), Postives = 1222/2243 (54.48%), Query Frame = 0
Query:    9 GQPLIATSRPDRTTVFVVDKSVAPQVIREVACG-GTCGGSSSCSSWAGEVVCGGPVAQVAVSTTNAGHSLLGALLESGVLEIWDIAAPEGGLYTLREVVHLPAAASGEVN--------SSXXXXXXXXXXXXXXXXXXXXALSDCLF----AGSRGLP----------------------------SAMDASRKKVAVVALTCHESQPLLCVGFQDGVLHLYDVGGVAAEKGRAGVEDNVISATARSYVPDGSSKQENRDFGDDNNDNDDGQEQASGIATTGKAAMSVLLGRSVRN-LLLPVAALKFGIHMQGALSCICMTSEFGIILAGSTRGEVAVWSTNTLLRETAQGENGSDDALNLSGAVLPLQSCEVMAMPSPIERIELLKCSKPLILVSVRAANGDSDVTRVALLAFLPKSLVVVHMGPPVRTRHLAFQSRNSCIILSGEGNKVNKLLVSKLADVLACPAFPLPSQLAAIGPPQLDCGAFLADSWGCSQPSSSPSSPTHAPFIYSIQSRMLLSLHQDDTN-------SFSASRKSGLSSVCGSLTLKALVVARPLHHLQNKRSSDRDKDIQAEAVLLLPTYGKDFSVLLDRLSLSHSVPIFSGLNEDNDQILLLPHRLIVSPDGSLIVVLLRILRPKVAGSDNVVDACAPLAYVVVRQSSTAQDGGSVETNNGNQVALVSLDFVHAGVTIDAAFLSPRILALLQ---SYTGKCGAVVKTTL-LQANGDPSAATTAWVMSQTVEDGDVQTEDGAQEYIPTRLFRAGGSGFPYEGEGSLRVLCVRQQRQAG----GNGST----ILTCSRIGGGFP--PSFVGSAVLRAQEVIMEAINQTSNGQGIEAGSVIAVLTSQRLLLFSPDLTLLAGVELTMAPCGLAWLGPTPLIVFADGRVLYLSLDSGATE-----LRPVLSLDQEQAGADVILVAALPDRLVYAANNGASGSLRIFTRALLPLEPLLVGFLAWPRLVKLPVISFAPSALYSPSPPPPSAVEAVTSSPFLTVALKTLLELYGPLAKQRGARYALGEGPSVDAGATAWACAALSRAGFGGWAAALAGVAGTVQLGKGRESKTALLAGQSEGPKAFQPRPWIPPEVKAGLAAQNSQWQDAMCEAMGDNIVKGQDVALHSDITLPPRFRHRSRVLALLGQRAWQAGQTEEAFRLFDLAGEDENAMNLLILHLLGGGPVSDGCAQLLRDLCKAEPHLLACLREGEAFMTKEVGPEKKRKAAVASLVCARLSLQTFPFDYQRRQYLLPRLSGAILQLPQWPQAGXXXXXXXXXXXXXXXXXXXXXXXRGGPPRAPLVQALLLDRVEEWLGRVAPEGLREEISRLDERERDATKREEEWVKGVGEGR-EEDNVVLYLRFNE---PTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVKSTCPIDQGDDVKVRMGMDAFWGTSNI-PYGSLSSSLLPRGLRAVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGKKAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRIKSDGGRSIRNEGSDIQEAAPTSPPP-LPVKAARKRENFAP---------LPNMADGVMFPVRRRGAGREDTSSITLLDGLRGLSGEGACSLIASLPAPTANDTAVRRWQSRANSLARSADGLVIQSSPPPTHPSVDIDTGWAGFKEDPVSRPTSFVVTTTAPL--AAAEHAPPTLGLQMVSTPPELKHLLLLLPSELDVNPANVLRDEIAGYFSQCGQYLCLREMTRFTNLESGIP--KVAIVVLAVTTSADQLARRDRYPFQAKSAIL-XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXRIKQPQAGVLQVYDLMQRRGVARQPVQTRVLFWRWISRGVIALVTPRAVFLWTVGVKSGPSGVPTKTFDRRDLTLLGPNANVRDYHHAAGGEWGVLTTVDGDGARLAVQFHDIRSGKVIVESGTELISANVGKCDGGVDQGNRRSQTYFVMLRRCPELTVDLCV--EEG-KGNEQGMCNPRLVRLARLPLPSAPASLSDSWRAWVLFPPGRTDIIVLLFSAGGLLYTCCSATHRIEARGSVFPEEQAGAVLDVSWDVVSGDMLVLEAERLAVFRVSNL 2161
            G   +A +RPDRT+VF+VD+ VAP+V+REVA   G    SS   S   E+ C GP+AQ+A+S+T  GH+ L A L+SG++EIWD+ A  G LY+LR+ V L      E +        S                      LS  +F        GLP                            S++D++R+K  V+ LTCHES+PLLCV FQDGVLHLYD+  +A E G           T  +    G  +Q+     DD  D+D    Q + +++ G+ A S    R+ R  LLLPVAALK      GALS +  +    +++AGSTRGEVAVWS   L R  AQ     D +L+L+GA LPLQ+C++  +P+ I  I LL    P +   ++  +   ++ ++A+L  LP S+V+  +  P + R LAFQ R+S ++L  E +   ++    LA+ L+  +  +PS+LAA+G PQ     F     G S    SP + +  P+I+ IQSR+  ++ Q D+        + + +RK G     GS   KA++  + L  +Q +R+ D   +    A  LLPTYG +F  LL  L +S S+ +  GL   +DQ LL+PH L+VSPD S I+VL  IL+P +     V+   +PLAYVVVR S +       E    + +   S  F  AG+T DA FLS R +ALLQ       + G+V    + LQAN +       W + +  +  D    DG ++   TRL      G    GE S+ +LC  Q+R       GN +      LT     G      +   S  L   E ++E   +TS GQ IE+ ++ AV TSQRL+L S DL +LA V+    PCGL WLG TP++ F DGRVLYLS+D  A       LR + SL+Q  AG D  L+ AL DRLVYAA  G +  +R+ TR L PLEPLL+G LA P  V L   S A S          S   ++  S  +   ++ +L  +GPLA  +  RYA  EGP    GAT+W CAALSRAG+  WA+ LAGV     +    + +   + G +      + RPW+   +  GLAAQ+SQW+ A+ EA G  I++ QD    +   LPPRFR  S +LA +GQRAW AGQ E A  LFDLAGEDE    LL+LH L     SD    LL +LC      L  +R   +  T   G E K+ A +A +V   LS Q FP D QRRQ LL  L  +   +  W  +                          G  R    + LLL+ VEEWLGR APE L++ ++   E + DA +  EEWV+GVGEGR EEDN +LYLRFN+        D S A+   L   GDLSQYGH++ +A GQ   ++ ++STCPIDQGD VKV+MGMDAFW +  +  +    +    RG  A++ARGSALDVGPYHGPQ+ PGR RLT+E+WIQR  SSS+ CS+ PE+L  R+T+  +   +W  G+SA+ AL+FWTE  Q PLST  G V+ G WTHVAF+LE+ +S  KQA V  FVGGK     Q  ++FP L +S LRRTVLE+GPNL GH+MTE+R+WACARSA+ LYD RESYLQLAE++KKL F+I+SD  +S  N   D   A  + P   LP   A +     P         L N ++G     R RG  RE  +S T    L G +  G  +L+ +LP P+ + T  +R +  + + AR      + +  P      D  +G  G   +P S P +   +       +   H P         +P  L+  L+LLP ELDVNPANV+RD++AGYF+  G++LC RE    TN E+  P  KVAIVVL +  S    +RRDRYPF+A+SAIL                                        K  QAGVLQVYDL+ R+ VARQPVQT + FWRW+ RG IALVT R+VF W + ++ G S  PTK FDR DL+  G ++ VR+Y H+    WGVLTTVD +  RLA+Q H+  +GK +  +  +L+ ANV  CD GV  G+  S+   V+LR  P L VDLC   EEG + N Q +C  RLVR+    LPS P+ ++ + RAWV+ P G T ++ +LF+A G LYT    TH I+ARGSVFP+ ++ +V DV  D  SGD+L+L+ ERLAVFR+SNL
Sbjct:    6 GSSWVAAARPDRTSVFLVDRKVAPRVVREVASEVGNSSHSSGTCSLVSELECSGPIAQLALSSTITGHTFLAAFLDSGLMEIWDVGAAHGSLYSLRDTVVLCGPPGEETSAGRRVMSTSGIVGGRKSLRSCVDYVASSSTGLSSAIFFTVMRNHGGLPSGSNTPNHLRFVKLTPSVDEVRLGSQRLSSLDSTRRKSHVLHLTCHESRPLLCVAFQDGVLHLYDISPLAREDG-----------TQVTPHCSGEGEQQRGSDYDDEGDSDVETGQENDLSSLGRGASSFFSIRAARKLLLLPVAALKCPGDFHGALSSVSFSLAPELVVAGSTRGEVAVWSMQLLQRREAQ---KGDRSLSLTGAALPLQTCKLACLPARIGWISLLAEKGPFLFACLQKEDHSYEM-QIAVLLLLPTSIVMWDVIAPHKARFLAFQRRSSNLVLLEEHHDGGRVFFGGLAEQLSQSSQMVPSRLAAMGQPQCMDQTFSGKLEGTSTLKVSPVTYSSVPYIFWIQSRLTFAVRQSDSKADNLGKMTVTTNRKKGPLPSTGSFVFKAILATQTLESMQRERNEDLCDNDPIGATFLLPTYGDEFVGLLSSLGVSDSMSLHFGL---DDQTLLMPHSLLVSPDASTIIVLSSILKPGL-NDKRVMGVTSPLAYVVVR-SGSRPSREEEENPKADTIKRGSFCFEDAGLTFDAVFLSSRKIALLQPTFCENARKGSVTTVVVQLQANRELDDPKKVWKVMRCTKAEDFVGHDGFKKRAITRLLTPCTFGASETGEASVHILCAAQKRATSLTQDGNYTNEELLKLTFYEAAGKHTDHANATQSLCLFEDERVIEVHRRTSEGQCIES-AIFAVATSQRLMLLSADLAILACVKTYSKPCGLVWLGHTPVVSFVDGRVLYLSIDPHADSISRCGLRHLCSLEQMHAGEDSFLITALSDRLVYAAKCGEAREIRVLTRPLFPLEPLLLGLLALPHRV-LSTSSTACST---------SRARSIDQSTMIVGQIREVLTCFGPLAVPQD-RYAREEGPGATTGATSWTCAALSRAGYNYWASELAGVRSLSDI--LCDGEPFPIVGNAHEENVRKSRPWLSFAMTIGLAAQSSQWRQALVEATG-GIIEKQDEVQQAGAMLPPRFRSGSHILARVGQRAWHAGQAEVALNLFDLAGEDEKMAELLLLHDLDRIQFSDDHTGLLDNLCDLN-SALQLMRRLVSSQTYTSGREIKQNADIADVVGINLSSQIFPLDKQRRQSLLTGLQTSHQNIVSWQSSSTFSNIANTALTPTPQSVEGSTAYSCGLARPIPARYLLLNSVEEWLGRAAPETLQDRVAGGSELDEDAPRGREEWVRGVGEGRGEEDNAILYLRFNDLNASPIPGDESSAL---LAQIGDLSQYGHTVTVAQGQ--PITCLESTCPIDQGDGVKVQMGMDAFWNSVRVSEWIEEDTPSARRGFSALIARGSALDVGPYHGPQQSPGRSRLTMEMWIQRPISSST-CST-PEILAVRRTLSSKPSCVWAFGVSADDALMFWTERCQTPLSTASGAVLGGKWTHVAFSLEIDASNSKQALVNLFVGGKDVWPAQLRVEFPSLKDSDLRRTVLEIGPNLDGHKMTEVRMWACARSAEALYDNRESYLQLAERKKKLTFKIRSD--KSKDNRLGDSATATSSHPASLLPFAVADRNLGLPPYTSSTEPACLLNSSNGRRV-TRLRGPEREANASRT-SGNLDGSACGGMENLLNALPPPSCDGTGRQRSRRGSPNSARQ-----VPTPFPARSGKGDEASGGLGEASEPFSLPYNKAASEEFDCLRSTTAHRP--------ISPRPLERHLILLPEELDVNPANVMRDDMAGYFACGGRHLCFREEFSSTN-ENDQPRQKVAIVVLEIAASRGGPSRRDRYPFEARSAILVGGLSSWSRETEPLPSALIACYTLANTSQSNASAAAAGSNKTSQAGVLQVYDLVLRKSVARQPVQTTLHFWRWLDRGCIALVTQRSVFTWVLNLEGGSSRAPTKLFDRLDLSSSGIDSKVREYCHSVETGWGVLTTVDSEQCRLAIQLHEFSTGKALFLTDADLLGANV--CDCGV-AGHGGSKVCLVLLRSAPALCVDLCTICEEGDRDNTQDVCLDRLVRIF---LPSTPSRVAPAGRAWVVGPAGPTGVLSILFAAKGHLYTLDLITHEIQARGSVFPDGRS-SVSDVDLDATSGDLLILDHERLAVFRISNL 2180          
BLAST of NO04G03100 vs. NCBI_GenBank
Match: OAO12544.1 (Dek1-calpain-like protein [Blastocystis sp. ATCC 50177/Nand II])

HSP 1 Score: 124.0 bits (310), Expect = 5.600e-24
Identity = 204/831 (24.55%), Postives = 326/831 (39.23%), Query Frame = 0
Query:  794 QGIEAGS--VIAVLTSQRLLLFSPDLTLLAGVELTMAPCGLA-------WLGPTPLIVFADGRVLYLSLDSGATELRPVLSLDQEQAGADVILVAALPDRLVYAANNGASGSLRIFTRALLPLEPLLVGFLAWPR-LVKLP---VISFAPSALYSPSPPPPSAVEAVTSSPFLTVALKTLLELYGPLAKQRGARYALGEGPSVDAGATAWACAALSRAGFGGWAAALAGVAGTVQLGKGRESKTALLAGQSEGPKAFQPRPWIPPEVKAGLAAQNSQWQDAMCEAMGD-----NIVKGQDVALHSDITLP-PRFRHRSRVLALLGQRAWQAGQTEEAFRLFDLAGEDENAMNLLILHLLGGGPVSDGCAQLLRDLCKAEPHLLACLREGEAFMTKEVGPEKKRKAAVASLVCARLSLQTFPFDYQRRQYLLPRLSGAILQLPQWPQAGXXXXXXXXXXXXXXXXXXXXXXXRGGPPRAPLVQALLLDRVEEWLGRVAPEGLREEISRLDERERDATKREEEWVKGVGEGRE-EDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVKST-CPIDQGDDVKVRMGMDAFWG---TSNIPYGSLSSSLLPRGLRAVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQ-APLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGK--KAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRIKSDG 1598
            Q +E G   ++A+ T+QR+L+ S D  ++A     +   G A       WLG T +   +D R+ Y+  D      RP+ SLD      + +L++ LPDR++YA  +      ++ ++AL+PLEPL++G L  P   VK     ++       Y P+   P+                      GP             GP  +AG T      L + G+   A A                    +   S   K F   P I PE+K+ LA    +  DA+ E + D       +K  +        LP PR     R+ AL    A  +G    A +  DL G+D + M L+ L    GG V+   A+LL+        L   +++    ++ +    KKR     + +     + T     ++   LLP  +  + ++P+ P+                            P + P ++   +  V  W+G         +I  + E E +A +        +  G E ED+VVLYLR  E     + ++A  V          Y  S    N      +FV+ T  P+D+GD+ KV+       G         G+L+S         +    + L VG +H     P R   TVE+W++R   +          LITR T  ++   +W  G+     +   T GK     S VPG      W H+A T++  +   K   V+ FV GK  K     S  + P     Q+      +G      R+TE+R WA ARS DD+ D+ ++YL LAE + K+   +  +G
Sbjct:  687 QQLEEGERPLMAIRTNQRVLMVSEDYEVIAARPSVVLIGGTARLTTSMQWLGYTIMYSCSDNRLYYMLADG---RERPIGSLDIH--ATEPMLMSVLPDRVMYACKHRDHRLTQLLSKALVPLEPLVLGLLHAPEGYVKNREELMMKIIKQYAYKPAEEDPN----------------------GPRP----------TGPGYNAGVTGEMIRELKKKGYLSVAYA--------------------IVRSSSSNKEFPDYPQIAPEIKSDLAITLHRLDDALQELLSDAPQVMEYLKSAEEGKPFTALLPHPRSTLAERLRALASIAA-ISGDYNTARKCLDLCGDDWDLMGLMNL----GGDVT---AELLQQQATRADGLRPDIKQAAEILSGKTFKSKKRGDDSNAAMNLPQRIPTLLVGGEKA--LLP--AWKLKRMPRIPEIPKPEPL---------------------PSKFPYLE---MSNVAAWVG-------ARQIVEIPEAEAEAQEDPRFGGAALMVGNEGEDSVVLYLRMEE----REDAVAGVV----------YDSSNQANNMTVNGAAFVEPTDAPVDKGDNDKVKTLRQLVLGWDKEGKEAPGTLTSK--------ITLEKNTLRVGYFH---PDPQRRFFTVEMWVKRDPRNGYM------PLITRGTEQDR---MWSFGLENGYVVFMTTTGKMFTDASLVPGNT----WKHIAATIDCTNK--KTTAVSLFVDGKMVKEAKIASPEREPLEENDQIL-----IGGTGTFARITEVRYWASARSEDDIRDFSKAYLDLAEVKGKMKGLVIHEG 1372          
BLAST of NO04G03100 vs. NCBI_GenBank
Match: OQR92194.1 (hypothetical protein ACHHYP_03966 [Achlya hypogyna])

HSP 1 Score: 114.4 bits (285), Expect = 4.400e-21
Identity = 144/567 (25.40%), Postives = 226/567 (39.86%), Query Frame = 0
Query: 1049 AGLAAQNSQWQDAMCEAMGDNIVKGQDVALH--------SDITLPPRFRHRSRVLALLGQRAWQAGQTEEAFRLFDLAGEDENAMNLLILHLLGGGPVSDGCAQLLRDLCKAEPHLLACLREGEAFMTKEVGPEKKRKAAVASLVCARLSLQTFPFDYQRRQYLLPRLSGAILQLPQWPQAGXXXXXXXXXXXXXXXXXXXXXXXRGGPPRAPLVQALLLDRVEEWLGRVAPEGLREEISRLDERERDATKREEEWVKGVGEGR--------------EEDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVKSTCPIDQGDDVKVRMGMDAFWGTSNIPYGSLSSSLLPRGLRAVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGKKAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRI 1594
            + L     +W+DA+   + D      D ALH        S+  LPPR  + +  L+ LG      GQ E A R FDLAG D   + L+ +  L     S     +L  +  A P L A L   +A  T          A     +  RL  +T  F  +RR  LL +L+G    +PQ                            +   P  PL          +WLG   P    +E  +      D T        G G+ R              +ED+V+ Y RF +   L   + A +  L    D S+  + L +    S ++  + ST P+D+G+D K+   + A+      P  + +      G    V +G  +D G  +   E P R  LTVE W++  + +     +Q +VL  R         LW L + A G L      +   L+   G  + G W HVAF +++ +    +A V   V G   ++ +  IK    +       ++ VGP L G  +TE+R+WA +RS + + D +E+YL +AE +K++   I
Sbjct:  790 SALLLATHRWKDAVMCLVSD------DPALHEYAQHPQGSEAQLPPRLSNVATALSELGSTLQALGQLELAGRCFDLAGNDRALLTLVGVQAL-KTMNSSVAHDVLHSVKGANPQLFAALVAADAAQT---------PARAKQDLFRRLCTETLVF--ERRSRLLGQLAG----MPQ------VRLSPVKAAVDTAPTAWKFFTWKRLEPEEPL----------DWLGTPTPHYSAQEFVK-KALHIDTTS------DGFGDTRPTPAAATSIGPFLDDEDSVMAYWRFEDAAAL--GARATDATLL---DTSKRENHLVV----SPAIELMPSTAPVDRGEDAKL---LPAY--ALRFPAAAPADG-ADWGATCAVRKGGTMDFG--YAFDEDPYRRHLTVESWVKFYADAP---PAQAQVLFARTP-------LWQLSVDASGRLALQLHDR--TLACDGGLQLTGGWQHVAFVVDVIAE--DKAAVRVTVDGASVLAKEIAIK----AVKDATAAIMNVGPRLLGFEITEIRVWATSRSVEQINDMKENYLGIAETKKRIKVAI 1276          
BLAST of NO04G03100 vs. NCBI_GenBank
Match: XP_014527066.1 (hypothetical protein JH06_3884 [Blastocystis sp. subtype 4] >KNB43623.1 hypothetical protein JH06_3884 [Blastocystis sp. subtype 4])

HSP 1 Score: 110.9 bits (276), Expect = 4.900e-20
Identity = 207/838 (24.70%), Postives = 319/838 (38.07%), Query Frame = 0
Query:  778 RAQEVIME-AINQTSNGQGIEAGSVIAVLTSQRLLLFSPDLTLLAGVELTMAPCGLA-------WLGPTPLIVFADGRVLYLSLDSGATELRPVLSLDQEQAGADVILVAALPDRLVYAANNGASGSLRIFTRALLPLEPLLVGFLAWPRLVKLPVISFAPSALYSPSPPPPSAVEAVTSSPFLTVALKTLLELYGPLAKQRGARYALGEGPSVDAGATAWACAALSRAGFGGWAAALAGVAGTVQLGKGRESKTALLAGQSEGPKAFQPRPWIPPEVKAGLAAQNSQWQDAMCEAMGD---------NIVKGQDVALHSDITLP-PRFRHRSRVLALLGQRAWQAGQTEEAFRLFDLAGEDENAMNLLILHLLGGGPVSDGCAQLLRDLCKAEPHLLACLREGEAFMTKEVGPEKKRKAAVASLVCARLSLQTFPFDYQRRQYLLPRLSGAILQLPQWPQAGXXXXXXXXXXXXXXXXXXXXXXXRGGPPRAPLVQALL-LDRVEEWLGRVAPEGLREEISRLDERERDATKREEEWVKGVGEGRE-EDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVK-STCPIDQGDDVKVRMGMDAFWG---TSNIPYGSLSSSLLPRGLRAVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGK--KAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKL 1590
            RA EV+ E A  Q   G+      + A+ T+QR+L+   +  ++A     +   G A       WLG T +   +D R+ Y+  DS     RP+ SLD      + IL++ LPDR++YA  +      ++ +RAL+PLEPL++G                   LY+P     +  E           +  +++ YG    +         GP  +AG T        + G+   A A+            R + T          K F   P I PEVK+ LA    +  DA+ E + D           +K  +        LP PR     R+ AL    A  +G    A +  DL G+D   MNL+    +GG   S+    LL+        L   +++    ++ +    KKR     S+    L         QR   LL  + G    LP W                              P   P V  LL +  V  W+G         +I  + E E +A +        +  G E ED+ VLYLR  +     D S           D S Y +++ + NG     +F++ S  PID+GD  KV++      G         G+L+S         +    + L V  +H     P R   TVE+W++R   +          LITR T   +S  +W  G+     +   T GK   + T    V    W H+   ++  +   K  +V  FV GK  K     +  + P     Q+      +G      R+TE+R WA  RS DD+ D+ ++YL LAE + K+
Sbjct:  696 RAGEVVCEVAWQQLEEGE----KPLCAIRTNQRILMVDLEYHIIAQRPAVVMIGGTARLTTSMQWLGYTIVYSCSDNRLYYMMADS---RERPIGSLDIH--ATEPILMSVLPDRVLYACKHRDHRMTQLLSRALVPLEPLVMGL------------------LYAPEGYVKNREE----------IMMKIIKQYGYKPAEEDPNGPRPTGPGYNAGVTGELIREFRKKGYLSIAYAIV-----------RSNNT---------NKEFPDYPQIAPEVKSDLAMSLHRLDDALQELLSDAPQLMVQIEEYLKSAEEGKPFTALLPHPRSTLAERLRALASIAA-ISGDFNTARKCLDLCGDDWELMNLM---NIGGETTSN----LLQQQASRADGLRPDIKQAAEILSGKTLKSKKR--GNESITAMNLP--------QRIPTLL--VGGEQASLPSWKLKRIPRIPEIPK-----------------PEPLPSVYPLLEMSNVAAWVG-------ARQIVEVQEDEGEAQEDPRFGGAAMMVGNEGEDSCVLYLRMED---REDGSQ------NTVFDDSNYSNNMTI-NG----AAFIEDSDVPIDKGDGDKVKVLRQLVLGWDKDGKEALGTLTSK--------ITMERNTLRVAYFH---PDPQRRYFTVEMWVKRDGRNGYM------PLITRGT---ESDRMWSFGLENGYVVFETTTGK---MFTDSSLVPADTWKHIGAVVDCTNK--KMTSVRLFVDGKMVKENKIVNPEREPLDENDQIM-----IGGKGTFARITEVRYWASERSDDDIRDFSKAYLDLAETKGKM 1388          
BLAST of NO04G03100 vs. NCBI_GenBank
Match: XP_008875189.1 (hypothetical protein H310_10550 [Aphanomyces invadans] >ETV96397.1 hypothetical protein H310_10550 [Aphanomyces invadans])

HSP 1 Score: 106.3 bits (264), Expect = 1.200e-18
Identity = 89/291 (30.58%), Postives = 137/291 (47.08%), Query Frame = 0
Query: 1305 EEDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVKSTCPIDQGDDVKVRMGMDAFWGTSNIPYGSLSSSLLPRGLRAVVAR--GSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGKKAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRI 1594
            EED VV Y RF E    A  +   +       D S+  + + L N     L  V ST P+D+G++ K++             Y     S    G   VV +   S LD+G  +   + P R  LTVE+WI+   +++   +      + R+        LW L + + GAL F   G+   + T    V E  W HVA  +++ S    +AT+   VGG  A++    ++ P  + S    T L VGPNL G  +TE+R+WA  RSA  L+D +E+YL +AE +K++  +I
Sbjct: 1035 EEDGVVAYWRFEEGAAHATVTSGTQFV-----DTSKRENHITLQN-----LDLVVSTAPVDRGEEAKLQP-----------EYALRFPSPSAGGCGHVVVKKGASTLDIGVSY--DDDPYRRSLTVEMWIKPVETAAGAFTG----TLMRRETPSPGDALWELAVDS-GALAFTLLGQ--TVRTDKSAVKEATWQHVAAVVDVSSD--DKATIRLAVGGAVAVTND--VRLPKAARSGDVST-LVVGPNLGGVEVTEIRIWATPRSAQQLHDMKENYLAMAESKKRIKMKI 1290          
BLAST of NO04G03100 vs. NCBI_GenBank
Match: XP_009828429.1 (hypothetical protein H257_05301 [Aphanomyces astaci] >ETV81692.1 hypothetical protein H257_05301 [Aphanomyces astaci])

HSP 1 Score: 104.4 bits (259), Expect = 4.600e-18
Identity = 89/289 (30.80%), Postives = 141/289 (48.79%), Query Frame = 0
Query: 1305 EEDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVKSTCPIDQGDDVKVRMGMDAFWGTSNIPYGSLSSSLLPRGLRAVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGKKAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRI 1594
            EED VV Y RF +    A  +  ++       D S+  + L +     + L  V ST P+D+G++ K+       + + N  +G         G   V    S+LDVG  +   E P R  LTVE+W++ ++     C+     L+ R+T    ++ LW  G+   GAL+F   G+    S VP +  E  W HVA  +++ S    +A+V   VGG   ++ +  I     S   +  TV+ VGP L G  MTE+R+WA  RSA  L D +++YL +AE +K++  +I
Sbjct: 1043 EEDGVVAYWRFEDGANNASVADGIQFV-----DTSKRENHLTV-----QHLDLVLSTAPVDRGEEAKLPPEYALRFLSPNTMHGC--------GTVEVKKGSSSLDVGVAY--DEDPYRRCLTVEMWVKPAADWGG-CTG---TLMRRET---PAVVLWEFGLDG-GALVFTLLGQTVKSSPVPFSA-EDTWQHVAAVVDITSE--VRASVRLAVGGALVVTKEVTIT-SSTSTGDIMSTVV-VGPQLTGMDMTEIRIWATPRSAQQLRDMKDTYLTMAESKKRIKMKI 1298          
BLAST of NO04G03100 vs. NCBI_GenBank
Match: XP_009828430.1 (hypothetical protein, variant 1 [Aphanomyces astaci] >ETV81693.1 hypothetical protein, variant 1 [Aphanomyces astaci])

HSP 1 Score: 104.4 bits (259), Expect = 4.600e-18
Identity = 89/289 (30.80%), Postives = 141/289 (48.79%), Query Frame = 0
Query: 1305 EEDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVKSTCPIDQGDDVKVRMGMDAFWGTSNIPYGSLSSSLLPRGLRAVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGKKAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRI 1594
            EED VV Y RF +    A  +  ++       D S+  + L +     + L  V ST P+D+G++ K+       + + N  +G         G   V    S+LDVG  +   E P R  LTVE+W++ ++     C+     L+ R+T    ++ LW  G+   GAL+F   G+    S VP +  E  W HVA  +++ S    +A+V   VGG   ++ +  I     S   +  TV+ VGP L G  MTE+R+WA  RSA  L D +++YL +AE +K++  +I
Sbjct: 1043 EEDGVVAYWRFEDGANNASVADGIQFV-----DTSKRENHLTV-----QHLDLVLSTAPVDRGEEAKLPPEYALRFLSPNTMHGC--------GTVEVKKGSSSLDVGVAY--DEDPYRRCLTVEMWVKPAADWGG-CTG---TLMRRET---PAVVLWEFGLDG-GALVFTLLGQTVKSSPVPFSA-EDTWQHVAAVVDITSE--VRASVRLAVGGALVVTKEVTIT-SSTSTGDIMSTVV-VGPQLTGMDMTEIRIWATPRSAQQLRDMKDTYLTMAESKKRIKMKI 1298          
BLAST of NO04G03100 vs. NCBI_GenBank
Match: XP_009828433.1 (hypothetical protein, variant 4 [Aphanomyces astaci] >ETV81696.1 hypothetical protein, variant 4 [Aphanomyces astaci])

HSP 1 Score: 104.4 bits (259), Expect = 4.600e-18
Identity = 89/289 (30.80%), Postives = 141/289 (48.79%), Query Frame = 0
Query: 1305 EEDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVKSTCPIDQGDDVKVRMGMDAFWGTSNIPYGSLSSSLLPRGLRAVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGKKAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRI 1594
            EED VV Y RF +    A  +  ++       D S+  + L +     + L  V ST P+D+G++ K+       + + N  +G         G   V    S+LDVG  +   E P R  LTVE+W++ ++     C+     L+ R+T    ++ LW  G+   GAL+F   G+    S VP +  E  W HVA  +++ S    +A+V   VGG   ++ +  I     S   +  TV+ VGP L G  MTE+R+WA  RSA  L D +++YL +AE +K++  +I
Sbjct:  759 EEDGVVAYWRFEDGANNASVADGIQFV-----DTSKRENHLTV-----QHLDLVLSTAPVDRGEEAKLPPEYALRFLSPNTMHGC--------GTVEVKKGSSSLDVGVAY--DEDPYRRCLTVEMWVKPAADWGG-CTG---TLMRRET---PAVVLWEFGLDG-GALVFTLLGQTVKSSPVPFSA-EDTWQHVAAVVDITSE--VRASVRLAVGGALVVTKEVTIT-SSTSTGDIMSTVV-VGPQLTGMDMTEIRIWATPRSAQQLRDMKDTYLTMAESKKRIKMKI 1014          
BLAST of NO04G03100 vs. NCBI_GenBank
Match: XP_009828434.1 (hypothetical protein, variant 5 [Aphanomyces astaci] >ETV81697.1 hypothetical protein, variant 5 [Aphanomyces astaci])

HSP 1 Score: 104.4 bits (259), Expect = 4.600e-18
Identity = 89/289 (30.80%), Postives = 141/289 (48.79%), Query Frame = 0
Query: 1305 EEDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVKSTCPIDQGDDVKVRMGMDAFWGTSNIPYGSLSSSLLPRGLRAVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGKKAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRI 1594
            EED VV Y RF +    A  +  ++       D S+  + L +     + L  V ST P+D+G++ K+       + + N  +G         G   V    S+LDVG  +   E P R  LTVE+W++ ++     C+     L+ R+T    ++ LW  G+   GAL+F   G+    S VP +  E  W HVA  +++ S    +A+V   VGG   ++ +  I     S   +  TV+ VGP L G  MTE+R+WA  RSA  L D +++YL +AE +K++  +I
Sbjct:  759 EEDGVVAYWRFEDGANNASVADGIQFV-----DTSKRENHLTV-----QHLDLVLSTAPVDRGEEAKLPPEYALRFLSPNTMHGC--------GTVEVKKGSSSLDVGVAY--DEDPYRRCLTVEMWVKPAADWGG-CTG---TLMRRET---PAVVLWEFGLDG-GALVFTLLGQTVKSSPVPFSA-EDTWQHVAAVVDITSE--VRASVRLAVGGALVVTKEVTIT-SSTSTGDIMSTVV-VGPQLTGMDMTEIRIWATPRSAQQLRDMKDTYLTMAESKKRIKMKI 1014          
BLAST of NO04G03100 vs. NCBI_GenBank
Match: XP_008615876.1 (hypothetical protein SDRG_11610 [Saprolegnia diclina VS20] >EQC30550.1 hypothetical protein SDRG_11610 [Saprolegnia diclina VS20])

HSP 1 Score: 103.2 bits (256), Expect = 1.000e-17
Identity = 137/568 (24.12%), Postives = 224/568 (39.44%), Query Frame = 0
Query: 1049 AGLAAQNSQWQDAMCEAMGDNIVKGQDVALH---SDITLPPRFRHRSRVLALLGQRAWQAGQTEEAFRLFDLAGEDENAMNLLILHLLG-GGPVSDGCAQLLRDLCKAEPHLLACLREGEAFMTKEVGPEKKRKAAVASLVCARLSLQTFPFDYQRRQYLLPRLSGAILQLPQWPQAGXXXXXXXXXXXXXXXXXXXXXXXRGGPPRAPLVQA------LLLDRVE-----EWLGRVAPEGLREEISR----LDERERDATKREEEWVKGVGEG---REEDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANGQSESLSFVKSTCPIDQGDDVKVRMG-MDAFWGTSNIPYGSLSSSLLPRGLRAVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLITRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVAFTLEMKSSGGKQATVAFFVGGKKAISTQSLIKFPFLSESQLRRTVLEVGPNLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRI 1594
            AG+     +W+DA    + D+     + AL+   ++  LPP+  H +  L  +       GQ E A +  D+AG D   ++L+     G     +D    +LR L  A P ++A +   +A         K  K  V  L+C      T     +RR  LL  LSG  L+  + P                             P + PL+ A          R+E     +WLG   P    +E  +    +D  +     R          G    +ED+V  Y RF +   L  S       L    D S+  + L +    S ++    ST P+D+G++ K+    M  F   + +      +S         V +G ++D G      E P R  LT+E W+ R  +      S   VL  R          W + + + G L      +  P  T    ++ G W H+A  L++ S    +A+V   + G   ++ +  +K    S+S    + L++GP L G  MTE+RLWA +RS + + D +E+YL +AE +K++   I
Sbjct:  767 AGIFLAAHRWRDAATYLVSDDPAL-YEYALNPQGAEAQLPPKLSHVAAALTRMAATLQSMGQFELAAQCLDMAGNDAALLSLVCAISFGLKCGSADVVEAILRGLKTAHPPVVAAVTAYDAAQI-----PKHVKMDVFRLLC------TEHLVLERRSRLLASLSG--LRRVRLP-----------------------------PIKIPLLAAPDGWKYFTWKRLEPEDPVDWLGSSTPHFSSQEFVKKSLQIDVSDGFGEMRSTPMASAPSFGPFLDDEDSVTAYWRFEDAATLGPSRTTDATLL----DTSKRENHLVV----SPAIVLDISTAPVDKGEESKLLPAYMLHFPAVAPLDGSGWHAS-------CAVRKGGSMDFGTSF--DEDPYRRHLTLECWV-RYYAEGPIAGSSTSVLAARSP-------FWSISMESSGRLCVQIHDRTLP--TDGTLLLNGSWQHLALVLDIVSD--DKASVRVLLDGASVLAKEIAVKMTSQSDSS---SALQLGPRLVGFDMTEIRLWATSRSVEQINDMKENYLGIAETKKRIKVAI 1259          
The following BLAST results are available for this feature:
BLAST of NO04G03100 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match NameE-valueIdentityDescription
EWM28710.10.000e+040.21clathrin heavy chain [Nannochloropsis gaditana][more]
OAO12544.15.600e-2424.55Dek1-calpain-like protein [Blastocystis sp. ATCC 5... [more]
OQR92194.14.400e-2125.40hypothetical protein ACHHYP_03966 [Achlya hypogyna... [more]
XP_014527066.14.900e-2024.70hypothetical protein JH06_3884 [Blastocystis sp. s... [more]
XP_008875189.11.200e-1830.58hypothetical protein H310_10550 [Aphanomyces invad... [more]
XP_009828429.14.600e-1830.80hypothetical protein H257_05301 [Aphanomyces astac... [more]
XP_009828430.14.600e-1830.80hypothetical protein, variant 1 [Aphanomyces astac... [more]
XP_009828433.14.600e-1830.80hypothetical protein, variant 4 [Aphanomyces astac... [more]
XP_009828434.14.600e-1830.80hypothetical protein, variant 5 [Aphanomyces astac... [more]
XP_008615876.11.000e-1724.12hypothetical protein SDRG_11610 [Saprolegnia dicli... [more]

Pages

back to top
Relationships

This gene is member of the following syntenic_region feature(s):

Feature NameUnique NameSpeciesType
nonsL076nonsL076Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ncniR058ncniR058Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR106ngnoR106Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region


This gene is orthologous to the following gene feature(s):

Feature NameUnique NameSpeciesType
NSK009449NSK009449Nannochloropsis salina (N. salina CCMP1776)gene


The following polypeptide feature(s) derives from this gene:

Feature NameUnique NameSpeciesType
NO04G03100.1NO04G03100.1-proteinNannochloropsis oceanica (N. oceanica IMET1)polypeptide


The following gene feature(s) are orthologous to this gene:

Feature NameUnique NameSpeciesType
jgi.p|Nanoce1779_2|293321gene_2083Nannochloropsis oceanica (N. oceanica CCMP1779)gene
Naga_100002g85gene1847Nannochloropsis gaditana (N. gaditana B-31)gene


The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameSpeciesType
NO04G03100.1NO04G03100.1Nannochloropsis oceanica (N. oceanica IMET1)mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO04G03100 ID=NO04G03100|Name=NO04G03100|organism=Nannochloropsis oceanica|type=gene|length=6612bp
ATGACGACGCCAAAGAAGAAAAGAGGACAGCCGCTAATCGCAACGTCGCG
GCCAGACCGCACAACTGTCTTCGTAGTGGACAAGAGTGTTGCACCCCAAG
TGATTCGCGAAGTCGCCTGTGGTGGAACTTGCGGTGGTAGCAGCAGCTGT
AGCAGTTGGGCCGGCGAGGTGGTCTGTGGGGGTCCCGTGGCTCAAGTGGC
AGTGAGCACAACCAACGCAGGGCATAGCTTACTTGGAGCACTTTTAGAGT
CGGGCGTTTTAGAAATCTGGGATATAGCGGCACCGGAGGGCGGGTTGTAC
ACCCTGCGGGAAGTGGTGCACTTGCCCGCAGCGGCCAGCGGGGAAGTGAA
CAGCAGCGGCAGCAGTACCAGCAGCAGTGGCGAAGGTGACAAGAATGCCA
GTAATAGCAATTCGGACGCCCTATCGGACTGTCTCTTCGCGGGCTCGAGA
GGTCTACCTTCAGgtttgtttttttttgtgacgcgggggattgggggcaa
gtccggtggcagcagcgagctttcacgtctgcatgtcgtcagaattcacc
cctcaactggtcctccgaccgttggcacacaacggctcacagCTATGGAT
GCATCGCGGAAAAAGGTCGCTGTAGTTGCCTTAACGTGCCATGAGTCCCA
ACCTCTGCTCTGCGTGGGCTTCCAAGACGGGGTGCTGCACCTTTATGACG
TTGGCGGAGTGGCCGCGGAGAAAGGCAGGGCCGGAGTCGAAGATAACGTA
ATTAGTGCTACCGCGCGAAGCTATGTCCCGGACGGCAGTAGCAAGCAGGA
GAACAGAGATTTTGGAGATGATAACAACGACAATGACGACGGCCAGGAGC
AAGCGAGTGGCATAGCAACGACAGGGAAGGCAGCTATGTCAGTGCTCTTG
GGCAGATCAGTCCGAAACCTGCTCCTGCCTGTGGCGGCTCTCAAATTTGG
GATTCACATGCAAGGTGCTCTCTCGTGCATCTGTATGACCTCTGAATTCG
GAATAATTTTGGCCGGTAGCACGCGCGGAGAGGTCGCGGTCTGGTCAACC
AACACGCTCCTTCGAGAGACTGCGCAGGGAGAGAATGGTAGTGACGATGC
GCTGAACTTGTCTGGTGCTGTGTTACCCCTACAGAGCTGCGAAGTCATGG
CCATGCCTAGCCCTATCGAGCGCATAGAACTTCTCAAGTGTAGCAAGCCG
TTAATTTTGGTCAGCGTGCGGGCAGCCAATGGTGACAGTGACGTGACACG
CGTGGCGTTGCTGGCGTTCCTGCCTAAGTCTCTGGTCGTCGTGCACATGG
GGCCGCCCGTGCGGACTCGGCATTTAGCTTTTCAAAGCCGCAACTCTTGT
ATCATCTTGTCGGGAGAGGGCAATAAAGTGAACAAGCTGCTCGTGAGCAA
ACTTGCTGATGTGTTGGCTTGTCCTGCGTTTCCCTTGCCGTCACAGTTGG
CTGCGATAGGGCCACCTCAGCTTGACTGTGGGGCGTTTTTAGCGGACTCG
TGGGGATGTTCACAGCCGTCTTCTTCGCCATCATCGCCTACACATGCTCC
TTTCATCTACTCAATACAGTCACGTATGCTTTTATCTCTCCACCAGGACG
ACACGAATAGCTTCTCAGCTAGTAGGAAAAGCGGGCTGTCCTCAGTTTGC
GGGTCATTGACGCTCAAGGCCCTGGTGGTTGCTCGACCACTGCACCACCT
CCAAAACAAACGCTCAAGCGATCGAGACAAGGACATCCAAGCGGAAGCCG
TGCTTCTACTTCCGACGTATGGGAAAGATTTTTCCGTTCTTTTGGACAGG
CTGAGCTTGTCGCATTCAGTGCCGATTTTTTCCGGGCTGAACGAAGACAA
CGACCAGATATTACTGCTGCCCCATCGCCTCATCGTCAGCCCGGACGGAT
CACTGATTGTGGTTCTGCTCAGGATACTGAGACCCAAGGTGGCTGGAAGC
GACAATGTAGTCGATGCTTGCGCCCCGCTGGCTTATGTGGTTGTGCGTCA
AAGCAGTACGGCGCAGGATGGGGGATCGGTAGAAACAAACAACGGCAATC
AAGTGGCTCTAGTCAGCCTCGACTTTGTGCACGCAGGCGTGACAATAGAT
GCAGCCTTCTTGTCGCCACGGATCCTCGCACTTCTACAATCTTATACTGG
CAAATGCGGTGCTGTTGTCAAGACGACCCTCCTTCAGGCGAATGGAGATC
CCTCCGCAGCCACCACTGCATGGGTTATGTCACAAACTGTCGAAGATGGC
GACGTGCAAACAGAGGATGGAGCACAAGAATACATTCCTACGCGGCTATT
TAGAGCTGGTGGGAGCGGATTTCCATATGAAGGGGAAGGGTCACTTAGGG
TCTTGTGTGTCAGGCAGCAGAGACAGGCGGGGGGGAATGGAAGTACCATC
TTGACTTGCTCTCGGATAGGTGGAGGGTTTCCGCCAAGCTTCGTCGGGTC
GGCTGTCCTCCGTGCGCAAGAGGTCATAATGGAAGCCATCAACCAGACGA
GTAATGGGCAGGGCATAGAGGCTGGCTCTGTGATAGCTGTTTTGACGTCT
CAACGTCTGTTACTCTTCTCTCCAGACTTGACTTTATTGGCCGGCGTGGA
ATTGACGATGGCGCCGTGCGGATTGGCATGGTTGGGTCCGACCCCGCTGA
TTGTCTTTGCAGATGGTCGTGTCTTGTACCTATCTCTTGATTCTGGCGCC
ACCGAGCTGCGGCCGGTCTTATCGCTGGACCAAGAACAGGCTGGGGCAGA
CGTGATACTTGTTGCAGCTTTGCCTGATCGCTTGGTGTATGCGGCAAACA
ATGGCGCATCTGGCTCTTTGCGCATCTTCACGCGGGCATTACTGCCTCTC
GAACCCCTGCTGGTGGGATTCCTGGCTTGGCCTCGCCTTGTCAAATTACC
TGTTATCTCCTTTGCACCATCCGCACTGTACTCTCCCTCCCCACCGCCAC
CATCGGCAGTAGAAGCAGTAACTTCCTCACCATTTCTCACGGTAGCTTTG
AAAACACTTCTGGAGTTGTACGGCCCTCTGGCAAAGCAGCGTGGAGCACG
GTATGCACTTGGAGAAGGGCCAAGTGTAGATGCAGGCGCCACCGCATGGG
CGTGTGCGGCGTTGTCTCGCGCAGGCTTTGGTGGATGGGCGGCAGCACTT
GCCGGTGTTGCAGGGACTGTTCAACTTGGGAAGGGACGAGAGTCCAAAAC
AGCGCTCCTTGCTGGGCAGTCGGAGGGTCCGAAGGCCTTCCAACCGCGGC
CCTGGATTCCGCCAGAAGTCAAGGCTGGGTTGGCGGCGCAAAATTCGCAA
TGGCAGGATGCAATGTGCGAGGCAATGGGGGACAACATCGTGAAGGGACA
AGATGTCGCGCTTCACTCGGACATCACATTGCCCCCACGCTTCCGCCATC
GCAGTCGGGTGTTGGCGCTGTTGGGGCAACGCGCATGGCAGGCGGGGCAG
ACAGAGGAGGCTTTCCGGCTTTTCGACTTGGCAGGCGAAGATGAAAACGC
CATGAATCTACTGATACTGCACCTCTTGGGTGGAGGCCCTGTGTCAGATG
GCTGTGCACAGCTTCTTCGCGATCTCTGTAAAGCAGAGCCACATTTGCTG
GCGTGTCTTCGGGAAGGGGAAGCGTTTATGACGAAGGAAGTTGGACCGGA
AAAGAAAAGGAAGGCTGCGGTGGCTAGTTTAGTATGTGCAAGATTGAGCT
TACAGACGTTTCCCTTTGACTATCAGCGGCGTCAATATTTGCTTCCTCGT
CTATCAGGCGCTATTTTACAGCTCCCGCAATGGCCGCAAGCCGGCCTTGC
GCCTTCTACCACTGCTTTTGTCGCCTTCCCGGCAACGCCGTCTTCTTCCT
CCTCCTCCGTCAGTCGAGGCGGTCCACCGCGTGCACCTCTAGTACAAGCC
CTGCTGCTTGATCGTGTGGAAGAATGGCTGGGACGAGTGGCACCCGAGGG
GTTGAGAGAAGAAATATCTAGGTTGGACGAACGTGAACGAGATGCAACTA
AGAGGGAAGAGGAATGGGTCAAGGGCGTAGGGGAGGGACGGGAGGAGGAC
AATGTAGTCTTGTATTTGCGATTTAACGAGCCTACTTTTTTGGCAGATTC
TTCTATGGCCGTAGAAGTTCCTCTCCGTATGGCTGGTGACTTGTCACAGT
ACGGTCATTCCTTGGACCTGGCCAATGGGCAATCCGAGTCGCTATCCTTC
GTGAAGTCGACTTGTCCGATTGACCAGGGGGACGATGTCAAAGTGCGCAT
GGGTATGGACGCCTTCTGGGGAACATCAAACATACCATATGGAAGCTTAT
CATCCTCTTTGTTACCCCGTGGCTTGAGAGCCGTCGTGGCCCGGGGTTCT
GCCTTAGATGTGGGACCTTACCACGGACCACAGGAGCAACCCGGCAGGTG
TCGCCTCACGGTGGAACTTTGGATTCAGCGGTCGTCGTCCTCCTCCTCTT
ACTGTTCGTCGCAGCCCGAAGTCTTGATCACTCGCAAAACTATAGATGAG
CAGTCCATGTATCTGTGGGGTCTAGGCATATCCGCAGAAGGTGCGTTGAT
CTTTTGGACCGAAGGGAAACAGGCGCCACTGAGCACGGTGCCGGGTACCG
TAATGGAAGGAATTTGGACACACGTTGCCTTTACGCTCGAGATGAAAAGC
TCAGGGGGCAAACAAGCAACAGTAGCCTTTTTCGTGGGGGGCAAGAAGGC
TATCTCGACTCAGTCGCTGATCAAATTTCCCTTTCTGAGCGAGAGCCAGC
TTCGCCGCACGGTCCTTGAGGTGGGTCCCAACCTCCGTGGCCACCGGATG
ACCGAACTTAGGCTGTGGGCCTGCGCTCGCAGCGCGGACGATCTTTATGA
CTATCGGGAGAGCTACCTTCAACTAGCGGAGAAGAGGAAGAAGCTGGCCT
TTAGGATCAAAAGTGATGGCGGTCGAAGCATCAGAAACGAGGGCAGCGAC
ATACAAGAGGCAGCTCCGACATCACCACCACCATTACCGGTAAAAGCAGC
ACGGAAACGCGAAAATTTTGCCCCCTTACCCAATATGGCTGATGGCGTGA
TGTTTCCGGTGCGACGGAGAGGAGCTGGGCGTGAGGATACTTCCTCCATA
ACGCTTCTAGATGGACTACGTGGTCTTAGTGGAGAAGGGGCATGTAGCCT
TATAGCCTCCCTTCCAGCCCCCACTGCCAACGATACGGCTGTTCGGCGAT
GGCAAAGCCGGGCGAATTCACTGGCTAGATCTGCGGATGGACTTGTCATA
CAGTCCTCACCTCCACCAACCCATCCCTCGGTGGATATCGACACAGGCTG
GGCAGGTTTTAAGGAGGATCCGGTCTCTAGACCCACTTCCTTCGTAGTTA
CCACTACCGCACCTCTTGCTGCCGCCGAGCATGCGCCTCCAACCCTGGGG
CTTCAAATGGTGTCAACCCCGCCTGAGCTGAAGCATCTCTTGCTGCTTCT
ACCTTCAGAGCTGGACGTCAATCCAGCCAACGTCCTTCGGGATGAGATTG
CCGGCTATTTCTCCCAGTGCGGACAGTATTTGTGTTTGCGGGAAATGACG
AGGTTTACGAATTTGGAAAGTGGGATCCCTAAAGTCGCTATTGTGGTGCT
CGCAGTGACCACGTCAGCGGATCAATTAGCTCGCCGCGACCGCTATCCTT
TTCAGGCCAAGAGCGCTATCTTGGTGGGAGGCTTGTCGTCGTCCTCTTCC
TCGTCGTCCCTGCCATTTTTGCTTTCCTCTCCTGTGATTGCTTGCTATAC
GGTCGCAAAGACAACTACAACGACGACTGGAGGGAGAATAAAGCAGCCAC
AGGCAGGTGTGCTACAAGTTTACGACTTGATGCAGCGGAGGGGCGTGGCT
CGACAACCTGTGCAGACGCGGGTACTCTTCTGGCGCTGGATTTCCCGGGG
TGTGATCGCTTTGGTGACGCCCCGAGCTGTCTTTCTTTGGACTGTGGGCG
TAAAAAGTGGGCCCTCCGGTGTCCCGACCAAGACATTTGACAGGCGTGAT
CTCACCCTTCTGGGTCCCAACGCCAACGTTCGCGACTACCACCACGCGGC
GGGCGGGGAGTGGGGTGTACTGACTACCGTGGATGGTGACGGAGCGCGAT
TGGCCGTTCAATTTCATGACATCAGGAGCGGCAAAGTCATCGTTGAGTCT
GGCACAGAATTGATCAGTGCCAATGTGGGAAAGTGCGACGGGGGAGTCGA
TCAAGGGAATAGAAGGAGCCAAACGTACTTTGTCATGCTCCGTCGCTGCC
CGGAGCTCACCGTGGATTTATGCGTTGAGGAAGGAAAAGGAAACGAGCAG
GGAATGTGTAATCCCCGCTTGGTCAGGCTGGCCCGTTTGCCACTGCCGTC
GGCGCCCGCTTCCTTGTCTGACTCGTGGCGTGCCTGGGTGCTTTTTCCAC
CGGGCCGCACTGACATCATCGTGCTTTTGTTCTCGGCGGGCGGACTCCTC
TATACCTGCTGCTCTGCTACGCATCGAATAGAGGCTCGGGGCTCCGTCTT
TCCTGAAGAGCAAGCAGGCGCGGTCCTGGATGTATCCTGGGACGTGGTGA
GCGGAGACATGTTGGTGTTGGAGGCGGAACGGTTGGCTGTTTTTAGAGTT
AGCAACTTGTGA
back to top

protein sequence of NO04G03100.1

>NO04G03100.1-protein ID=NO04G03100.1-protein|Name=NO04G03100.1|organism=Nannochloropsis oceanica|type=polypeptide|length=2161bp
MTTPKKKRGQPLIATSRPDRTTVFVVDKSVAPQVIREVACGGTCGGSSSC
SSWAGEVVCGGPVAQVAVSTTNAGHSLLGALLESGVLEIWDIAAPEGGLY
TLREVVHLPAAASGEVNSSGSSTSSSGEGDKNASNSNSDALSDCLFAGSR
GLPSAMDASRKKVAVVALTCHESQPLLCVGFQDGVLHLYDVGGVAAEKGR
AGVEDNVISATARSYVPDGSSKQENRDFGDDNNDNDDGQEQASGIATTGK
AAMSVLLGRSVRNLLLPVAALKFGIHMQGALSCICMTSEFGIILAGSTRG
EVAVWSTNTLLRETAQGENGSDDALNLSGAVLPLQSCEVMAMPSPIERIE
LLKCSKPLILVSVRAANGDSDVTRVALLAFLPKSLVVVHMGPPVRTRHLA
FQSRNSCIILSGEGNKVNKLLVSKLADVLACPAFPLPSQLAAIGPPQLDC
GAFLADSWGCSQPSSSPSSPTHAPFIYSIQSRMLLSLHQDDTNSFSASRK
SGLSSVCGSLTLKALVVARPLHHLQNKRSSDRDKDIQAEAVLLLPTYGKD
FSVLLDRLSLSHSVPIFSGLNEDNDQILLLPHRLIVSPDGSLIVVLLRIL
RPKVAGSDNVVDACAPLAYVVVRQSSTAQDGGSVETNNGNQVALVSLDFV
HAGVTIDAAFLSPRILALLQSYTGKCGAVVKTTLLQANGDPSAATTAWVM
SQTVEDGDVQTEDGAQEYIPTRLFRAGGSGFPYEGEGSLRVLCVRQQRQA
GGNGSTILTCSRIGGGFPPSFVGSAVLRAQEVIMEAINQTSNGQGIEAGS
VIAVLTSQRLLLFSPDLTLLAGVELTMAPCGLAWLGPTPLIVFADGRVLY
LSLDSGATELRPVLSLDQEQAGADVILVAALPDRLVYAANNGASGSLRIF
TRALLPLEPLLVGFLAWPRLVKLPVISFAPSALYSPSPPPPSAVEAVTSS
PFLTVALKTLLELYGPLAKQRGARYALGEGPSVDAGATAWACAALSRAGF
GGWAAALAGVAGTVQLGKGRESKTALLAGQSEGPKAFQPRPWIPPEVKAG
LAAQNSQWQDAMCEAMGDNIVKGQDVALHSDITLPPRFRHRSRVLALLGQ
RAWQAGQTEEAFRLFDLAGEDENAMNLLILHLLGGGPVSDGCAQLLRDLC
KAEPHLLACLREGEAFMTKEVGPEKKRKAAVASLVCARLSLQTFPFDYQR
RQYLLPRLSGAILQLPQWPQAGLAPSTTAFVAFPATPSSSSSSVSRGGPP
RAPLVQALLLDRVEEWLGRVAPEGLREEISRLDERERDATKREEEWVKGV
GEGREEDNVVLYLRFNEPTFLADSSMAVEVPLRMAGDLSQYGHSLDLANG
QSESLSFVKSTCPIDQGDDVKVRMGMDAFWGTSNIPYGSLSSSLLPRGLR
AVVARGSALDVGPYHGPQEQPGRCRLTVELWIQRSSSSSSYCSSQPEVLI
TRKTIDEQSMYLWGLGISAEGALIFWTEGKQAPLSTVPGTVMEGIWTHVA
FTLEMKSSGGKQATVAFFVGGKKAISTQSLIKFPFLSESQLRRTVLEVGP
NLRGHRMTELRLWACARSADDLYDYRESYLQLAEKRKKLAFRIKSDGGRS
IRNEGSDIQEAAPTSPPPLPVKAARKRENFAPLPNMADGVMFPVRRRGAG
REDTSSITLLDGLRGLSGEGACSLIASLPAPTANDTAVRRWQSRANSLAR
SADGLVIQSSPPPTHPSVDIDTGWAGFKEDPVSRPTSFVVTTTAPLAAAE
HAPPTLGLQMVSTPPELKHLLLLLPSELDVNPANVLRDEIAGYFSQCGQY
LCLREMTRFTNLESGIPKVAIVVLAVTTSADQLARRDRYPFQAKSAILVG
GLSSSSSSSSLPFLLSSPVIACYTVAKTTTTTTGGRIKQPQAGVLQVYDL
MQRRGVARQPVQTRVLFWRWISRGVIALVTPRAVFLWTVGVKSGPSGVPT
KTFDRRDLTLLGPNANVRDYHHAAGGEWGVLTTVDGDGARLAVQFHDIRS
GKVIVESGTELISANVGKCDGGVDQGNRRSQTYFVMLRRCPELTVDLCVE
EGKGNEQGMCNPRLVRLARLPLPSAPASLSDSWRAWVLFPPGRTDIIVLL
FSAGGLLYTCCSATHRIEARGSVFPEEQAGAVLDVSWDVVSGDMLVLEAE
RLAVFRVSNL*
back to top
Synonyms
Publications