EWM30348.1, cds346 (CDS) Nannochloropsis gaditana

Overview
Name                 EWM30348.1
Unique Name          cds346
Type                 CDS
Organism             Nannochloropsis gaditana (N. gaditana B-31)
Alignment location   CM002455.1:850872..853865 +

Properties
Property Name        Value
Protein id           EWM30348.1
Product              alpha-galactosidase
Orig transcript id   gnl|cribi|Naga_100003g150.1865.mrna
Gbkey                CDS
Expression
No biomaterial libraries express this feature.
Alignments
The following features are aligned
Aligned Feature   Feature Type   Alignment Location
CM002455.1        supercontig    CM002455.1:850872..853865 +
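
The alignment location above can be split into its parts programmatically. The following is a minimal Python sketch, not part of the original record: the names `Location`, `parse_location` and `span_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which this page does not state. Under that assumption the span is 2994 nt (998 codons), which would be consistent with the 997-residue protein shown in the BLAST results plus a stop codon, if the CDS is a single uninterrupted interval.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'SEQID:START..END STRAND' location."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse a location string in the form displayed on this page."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)\s*([+-])", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("CM002455.1:850872..853865 +")
codons = span_length(loc) // 3
print(loc.seqid, loc.strand, span_length(loc), codons)
```

If the coordinates turned out to be 0-based or half-open, only `span_length` would need to change.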
Analyses
This CDS is derived from or has results from the following analyses
Analysis Name                            Date Performed
GO annotation for N. gaditana B31        2020-04-08
BLAST analysis for N. gaditana B-31      2020-04-07
InterPro analysis for N. gaditana B-31   2020-04-06
Gene prediction for N. gaditana B-31     2014-02-18
Annotated Terms
The following terms have been associated with this CDS:
Vocabulary: Molecular Function
Term         Definition
GO:0016787   hydrolase activity
GO:0003824   catalytic activity
GO:0004553   hydrolase activity, hydrolyzing O-glycosyl compounds
Vocabulary: Biological Process
Term         Definition
GO:0009056   catabolic process
GO:0005975   carbohydrate metabolic process
Vocabulary: INTERPRO
Term         Definition
IPR017853    Glycoside_hydrolase_SF
IPR013785    Aldolase_TIM
IPR000111    Glyco_hydro_GHD
Homology
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|585113007|gb|EWM30348.1| (alpha-galactosidase [Nannochloropsis gaditana])

HSP 1 Score: 2066.58 bits (5353), Expect = 0.000e+0
Identity = 997/997 (100.00%), Positives = 997/997 (100.00%), Query Frame = 0
Query:    1 MEGVKERKEEKYILRNEHAALTVWPAGIPLDHPDHCHHQHHHLNQYEPAPGPRFSFISNENTSLGIFDTSTIEFRGKLRGKAWKTTMAEAVDVQVIKSNLGCARFPGLDDFGPEALTLEARLPEIGQVCVDFALHTIGPSMVLRLRCPALACSHDVRLDEFVLFDGDCDVSQGPRRDGKSLARIWWEYLSRRVRGLGPRRVFVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGAGPAEFDRSNNKELYSEMFAHVSDKSRGTGLLLGFLTQHEQFGAFSFDECFVEVGAHCRCDGQLLKAGELMQSDWCLVELQTTLPEEPFATFIARAGRVNTALLEEKYKIIACRPDTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKGARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFPLPDCKWNLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDLLEREMPEILRLYVDQDSFSEETSMCHAANTFFSHHVPQDPWVVVGFCNWDDAKKHSMHPYKSILRPLLVDVGPVIILHIFEFWTSKYFQVRLNLDLNPDALMSTADFFDGMDGGEIQPHSARLYAMRVDPRQIPSALTSTSQGAESARGSTMRRAPLYLGSDVHFSCGWELRKCIWGEESDYLDKAKVTRVSVHVVIDAGKVVDGTSHHIWLDLPGSEDSIKLDGAPEGSERVNFSVKGIYEGLEESTAPSLPTLFSGKHRSEGALATAAPRSPSIRRTVWKIRFPEGHSQRISWTLNYISTFDQDDQIRKDVQKK 997
            MEGVKERKEEKYILRNEHAALTVWPAGIPLDHPDHCHHQHHHLNQYEPAPGPRFSFISNENTSLGIFDTSTIEFRGKLRGKAWKTTMAEAVDVQVIKSNLGCARFPGLDDFGPEALTLEARLPEIGQVCVDFALHTIGPSMVLRLRCPALACSHDVRLDEFVLFDGDCDVSQGPRRDGKSLARIWWEYLSRRVRGLGPRRVFVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGAGPAEFDRSNNKELYSEMFAHVSDKSRGTGLLLGFLTQHEQFGAFSFDECFVEVGAHCRCDGQLLKAGELMQSDWCLVELQTTLPEEPFATFIARAGRVNTALLEEKYKIIACRPDTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKGARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFPLPDCKWNLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDLLEREMPEILRLYVDQDSFSEETSMCHAANTFFSHHVPQDPWVVVGFCNWDDAKKHSMHPYKSILRPLLVDVGPVIILHIFEFWTSKYFQVRLNLDLNPDALMSTADFFDGMDGGEIQPHSARLYAMRVDPRQIPSALTSTSQGAESARGSTMRRAPLYLGSDVHFSCGWELRKCIWGEESDYLDKAKVTRVSVHVVIDAGKVVDGTSHHIWLDLPGSEDSIKLDGAPEGSERVNFSVKGIYEGLEESTAPSLPTLFSGKHRSEGALATAAPRSPSIRRTVWKIRFPEGHSQRISWTLNYISTFDQDDQIRKDVQKK
Sbjct:    1 MEGVKERKEEKYILRNEHAALTVWPAGIPLDHPDHCHHQHHHLNQYEPAPGPRFSFISNENTSLGIFDTSTIEFRGKLRGKAWKTTMAEAVDVQVIKSNLGCARFPGLDDFGPEALTLEARLPEIGQVCVDFALHTIGPSMVLRLRCPALACSHDVRLDEFVLFDGDCDVSQGPRRDGKSLARIWWEYLSRRVRGLGPRRVFVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGAGPAEFDRSNNKELYSEMFAHVSDKSRGTGLLLGFLTQHEQFGAFSFDECFVEVGAHCRCDGQLLKAGELMQSDWCLVELQTTLPEEPFATFIARAGRVNTALLEEKYKIIACRPDTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKGARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFPLPDCKWNLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDLLEREMPEILRLYVDQDSFSEETSMCHAANTFFSHHVPQDPWVVVGFCNWDDAKKHSMHPYKSILRPLLVDVGPVIILHIFEFWTSKYFQVRLNLDLNPDALMSTADFFDGMDGGEIQPHSARLYAMRVDPRQIPSALTSTSQGAESARGSTMRRAPLYLGSDVHFSCGWELRKCIWGEESDYLDKAKVTRVSVHVVIDAGKVVDGTSHHIWLDLPGSEDSIKLDGAPEGSERVNFSVKGIYEGLEESTAPSLPTLFSGKHRSEGALATAAPRSPSIRRTVWKIRFPEGHSQRISWTLNYISTFDQDDQIRKDVQKK 997          
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|397617232|gb|EJK64341.1| (hypothetical protein THAOC_14938 [Thalassiosira oceanica])

HSP 1 Score: 376.326 bits (965), Expect = 8.253e-110
Identity = 252/782 (32.23%), Positives = 370/782 (47.31%), Query Frame = 0
Query:  198 PRRVFVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGAG--PAEFDRSNNKELY-------------------------------SEMFAHVS----DKSR------------GTGLLLGFLTQHEQFGAFSFDECFVEVGAHCRCDGQLLKAGELMQSDWCLVELQTT--LPEEPFATFIARAGRVNTALLEEKYKIIACRPDTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKF-PNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKG-ARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFPLPDCKWN---LPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDL--LEREMPEILRLYVDQDSFSEE--------TSMCHAAN------------TFFSHHVPQD-------------------PWVVVGFCNWDDAK-----------KHSMHPYKSILRP----LLVDVGPVI------ILHIFEFWTSKYFQVRLNLDLNPDALMSTADFFDGMDGGEIQPHSARLYAMRVDPRQIPSALTSTSQGAESARGSTMRRAPLYLGSDVHFSCGWELRKCIWGEES 861
            P  V+VNG+ +WSF G V +G       +P  LSAAF+ G     A+  ++N    Y                               S+MFA +S    DKS+            G  L++GFL Q EQ+G    D+     G +  C  Q + A + + +DW   ++  +    EE    ++   G  N A           RP  K +  GWCSWYH+Y  IS D+L  N  +++  R  +   +  IDDGY  +WGDW  + P KF  +  MR + DA+RS+G++PG+WLAP A DK S +A  HPDWIIR   G      ANSAN GK+FYGLD T P V+  + +T+       W FD LKLDFLY + L+G  + DP+++ A+ + L LR IR A G  T+I+GCG P+G+ +G+ +  RVS D GP ++P FPLP   W+   LP  + M+RNT+ R  +   WW NDPDC+LL E+T  T  EV+  A++ AL+GG  L+SDD++ VS +R+ +A+ + P+ G  A+ LDL      MP ILRL+  + +            T+  H+++            TF   + P D                    W +V   NW D              HS+  + +   P       D    +        H+F FW S+Y  +     L+   +M+T           ++PH+  +  + + P                     MR  P Y+GSD+HFSCG+E++   W +ES
Sbjct:  324 PTNVYVNGYQSWSFCGSVLRGEPQPKSAMPNFLSAAFNRGGMVLSADGSKANMNSDYWNDGVRVDSSDLSDHEDDSDDELTETAAHYKSDMFACISSNGQDKSQCEDNRIQLDEEGGPALVVGFLAQREQYGVAIMDKQLRRFGLYA-CH-QGVVARKPISTDWAFCQIVDSHCYDEEAMVYYVHAVGDHNDA-----------RPLEKGLTTGWCSWYHYYADISHDSLAKNAHILSKSRSSIGFNVCLIDDGYMTSWGDWTSLKPGKFVKDGGMRVLADAIRSKGMKPGVWLAPFACDKGSDLARQHPDWIIRNDAGR----IANSANCGKFFYGLDATNPAVRKHVYDTIRRAVRD-WGFDVLKLDFLYASCLEGNGKYDPSMSRAEAMHLGLRTIRAAAGCETFIIGCGCPIGSAVGFADGMRVSCDTGPTFVPEFPLP--HWDNGTLPALKGMLRNTMTRAVVGHRWWHNDPDCLLLGETTSLTDDEVVSAASIIALTGGMLLLSDDMEKVSEKRLSVAKRVFPLTGVTAVPLDLHSTANVMPSILRLWCSERTAKTAAESKDVDFTNNSHSSSPEKVLREQASRLTFEVGYSPGDIVDPYSRERSCFSVAPGLGSWTLVSISNWLDETATVSVSFSALVSHSIEDFTATGAPRSASRTTDTSSGLGECGDNGFHVFSFWKSEYVWIPHGTFLDGSPMMTT-----------LRPHATEI--LHIKPVDY------------------MR--PQYIGSDLHFSCGFEVKTFEWTDES 1052          
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|1210511636|dbj|GAX29450.1| (hypothetical protein FisN_16Hh109 [Fistulifera solaris])

HSP 1 Score: 370.933 bits (951), Expect = 1.214e-108
Identity = 241/732 (32.92%), Positives = 351/732 (47.95%), Query Frame = 0
Query:  195 GLGPRRVFVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGAGPAEFDRS--NNKELY-----------SEMFAHVS-------------DKSRGTGLLLGFLTQHEQFGAFSFDECFVEVGAHCRCDGQLLKAGELMQSDWCLVELQT--TLPEEPFATFIARAGRVNTALLEEKYKIIACRPDTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKGA-RSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFPLPDCKW----NLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDLLERE--MPEILRLYVDQ-----DSFSEETSM-----CHAANTFFSHHVPQDP-----------------------WVVVGFCNWDDAKKHSMHPYKSILRPLLVDVGPVIILHIFEFWTSKYFQVRLNLDLNPDALMSTADFFDGMDGGEIQPHSARLYAMRVDPRQIPSALTSTSQGAESARGSTMR----RAPLYLGSDVHFSCGWELRK 854
            G  P   +VNG+ +W+F+G + +GA      +P  +S AF+ G  P   D +   +   Y           S+ F  ++             D++ G   ++G+L+Q +QFG  + D     +  H  C GQ+L     + +DW  V+L T  +  EEP   F+  A   N A           RP   N+  GWCSWY FY+ ISE  L  N   +  +R  +P  +  +DDGY  AWGDW    P +FP+ S+  + D +R   +RPGLWLAP A DKHS +  +HP+WIIR  GGL     ANS+N GK+FYGLD T P V+  + + +       W +  LK+DFLY A L+G+ R D +++ AQ + +A++ IREA G ST+++GCG P+ + IG V+A RVSAD GPAW P FP P   W     LP  + M+RN+I R P+   WW NDPDC++L   T  T  EV   A++ A++ G  L+SDDL  VS +RM I   + P+ G  A+ LDL   +  +P +LRL+        DSF    S+      +A  TFF+      P                       W +V   NW D+             P++  + P  +L +          +  N + + D+  ST     G  G       +  Y    D R+ P    S    A       ++      P Y+G ++HFSCG E+R 
Sbjct:  242 GYFPTHTYVNGFQSWTFSGSIPRGAPQPQSAMPDRVSRAFNAGGAPPPTDATILTSSSSYVPPNDFVPTYISDFFTCITSDGETTEPLYPPLDETGGPACVMGWLSQRQQFGIITADCNLERLQMHASCQGQILLPHRRIVTDWAYVQLTTPHSYDEEPMVYFLHAAAAYNEA-----------RP-MANLLTGWCSWYVFYQNISETILRENFVTLKEMRTHVPTNVAVVDDGYMTAWGDWDSCKPGQFPS-SLGVVADDIRKNQMRPGLWLAPFAADKHSVLTKEHPEWIIRNNGGL----PANSSNCGKFFYGLDATNPAVRRHVHDAIERAVK-QWGYSVLKIDFLYAACLEGSGRYDLSMSRAQAMHVAMQTIREAAGSSTFLIGCGCPIASGIGIVDAMRVSADTGPAWYPQFPFP---WWDHGTLPSLKAMIRNSITRAPLGHRWWHNDPDCLMLGNHTSLTDVEVASAASIVAMTCGMLLLSDDLPKVSLKRMNILSKIFPLTGIPAVVLDLHSTKDGIPRLLRLWATDKFDVLDSFRSSMSLDEEFDHNAEATFFARQASFYPDQESVALNERKRTCIHVTKGLGTWTIVSISNWADS-------------PVVSCIPPAALLPLGG-------TLEDNEEASTDSFSSTKASEPGRHGYHTLAFWSCKYNWIPDHRKNPEQTISRRLNAHETEIYHIKPVTPENPQYIGGNLHFSCGKEVRS 932          
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|1210514799|dbj|GAX26278.1| (hypothetical protein FisN_16Lh109 [Fistulifera solaris])

HSP 1 Score: 362.459 bits (929), Expect = 2.621e-105
Identity = 234/733 (31.92%), Positives = 348/733 (47.48%), Query Frame = 0
Query:  195 GLGPRRVFVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGAGPAEFDR----------------------------SNNKELYSEMFAHVSDKSRGTGLLLGFLTQHEQFGAFSFDECFVEVGAHCRCDGQLLKAGELMQSDWCLVELQT--TLPEEPFATFIARAGRVNTALLEEKYKIIACRPDTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKGA-RSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFPLPDCKW----NLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDLLERE--MPEILRLYVDQ-----DSFSEETSM-----CHAANTFFSHHVPQDP-----------------------WVVVGFCNWDDAKKHSMHPYKSILRPLLVDVGPVIILHIFEFWTSKYFQVRLNLDLNPDALMSTADFFDGMDGGEIQPHSARLYAMRVDPRQIPSALTSTSQGAESARGSTMR----RAPLYLGSDVHFSCGWELR 853
            G  P   +VNG+ +W+F+G + +GA      +P  +S AF+ G  P   D                             S++ E   +M+  + D++ G   ++G+L+Q +QFG  + D     +  H  C GQ+L     + +DW  V+L T  +  EEP   F+      N A           RP   N+  GWCSWY FY+ ISE  L  N   +  +R  +P  +  +DDGY  AWGDW    P +FP+ S+  + D +R   +RPGLWLAP A DKHS +   HP+WIIR   GL     ANS+N GK+FYGLD T P V+  + + +       W +  LK+DFLY A L+G+ + D +++ AQ + +A++ IR+A G +T+++GCG P+ + IG V+A RVSAD GPAW P FP P   W     LP  + M+RN+I R P+   WW NDPDC++L   T  T  EV   A++ A++ G  L+SDDL  VS +RM I   + P+ G  A+ LDL   +  +P +LRL+        DSF    S+      +A  T+F+     +P                       W +V   NW D+             P++  + P  +L +          +  N + + D+  S      G  G       +  Y    D R+ P    S    A       ++      P Y+G ++HFSCG E+R
Sbjct:  273 GYFPTHTYVNGFQSWTFSGSIPRGAPQPQSAMPDRVSRAFNAGGAPPPTDATLLASSPRPYVPNDDFVPTYTSDFFTCISSDGETTEQMYPPL-DETGGPACVMGWLSQRQQFGIITADSNLERLQMHASCQGQILLPNGRIVTDWAYVQLTTPHSYDEEPMVYFLHAVAAYNEA-----------RP-MANLLTGWCSWYVFYQNISETILRENFVTLKEMRTHVPTNVAVVDDGYMTAWGDWDSCKPGQFPS-SLGVVADDIRKNQMRPGLWLAPFAADKHSLLTKGHPEWIIRNNIGL----PANSSNCGKFFYGLDATNPAVRRHVHDAIERAVK-QWGYSVLKIDFLYAACLEGSGKYDLSMSRAQAMHVAMQTIRKAAGSTTFLIGCGCPVASGIGIVDAMRVSADTGPAWYPQFPFP---WWDHGTLPSLKAMIRNSITRAPLGHRWWHNDPDCLMLGNHTSLTDIEVASAASIVAMTCGMLLLSDDLPKVSPKRMNILSKIFPLTGIPAVVLDLHSTKDGIPRLLRLWATDKFDMLDSFRSSMSLDEEFDHNAEATYFARQASFNPDRESVALNERNRTCIHVTKGLGTWTIVSISNWTDS-------------PVVSCIPPAALLPLGG-------SLEDNEEASTDSFSSAKANEPGRHGYHTLAFWSCKYNWIPDHRKNPEQTISRRLNAHETEIYHIKPVTPEKPQYIGGNLHFSCGKEVR 963          
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|973085490|gb|KUK46333.1| (Alpha-galactosidase-like protein [Anaerolinea thermophila])

HSP 1 Score: 329.717 bits (844), Expect = 1.419e-95
Identity = 233/738 (31.57%), Positives = 346/738 (46.88%), Query Frame = 0
Query:  202 FVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGA--GPAEFDRSNNKELYSEMFAHVSDKSRGTGLLLGFLTQHEQFGAFSF------DECFVEVGAHCRCDGQLLKAGELMQSDW-CL--VELQTTLPEEPFATFIARAGRVNTALLEEKYKIIACRPDTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGG--LSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKGARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFP----LPDCKWNLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDLLEREMPEILRLYVDQDSFSEETSMCHAANTFFSHHVPQDPWVVVGFCNWDDAKKHSMHPYKSILRPLLVDVGPVIILHIFEFWTSKYFQVRLNLDLNPDALMSTADFFDGMDGGEIQPHSARLYAMRVDPRQIPSALTSTSQGAESARGSTMRRAPLYLGSDVHFSCGWELRKCIWGEESDYLDKAKVTRVSVHVVIDAGKVVDGTSHHIWLDLPGSEDSIKLDGAPEGSERVNFSVKGIY 922
            + NGW +WS TG  +QG +         L   F +     P      +      +MF  + + ++  GLL GF++Q   FG+                G H R D      GE +++DW CL  V+L    P E +   +A+A ++ + +               +VPVGWCSWYHFY+ I+++++ +NL  +  L+ ++PL L QIDDG++   GDW    P  FP   M  +   +    L PGLWLAP  V   +++  +HP+W++R + G   S G+  N+     + Y LD+T PE   +  + + T     W F+YLKLDFLY AALKG   DPT   AQVL+  L  +R A GP+T +L CG PLG+ +G  +A R+ AD    W P FP    L   + N+P  RN ++N + R P+H  WW+NDPDC+L+R  T+    EV  +AT   ++GGS LVSDDL S+   R+RIAQV+LPVI + A   DL E   P ++RL ++                      P   W ++   NW D      HP      P   ++    I    EFW+ +  +            M T+  F+  +   + PH  R+ A+R    Q   AL                  P YLGSD+H S G E+         ++    ++    + + I+ G+  DG   HI + LP       LD   + S  +    +GIY
Sbjct:  142 YSNGWTSWSNTGTFRQGDKQHT-----TLIGRFQNPQIINPDTLRHKDGDHFSGDMFGLLCENTQKIGLLAGFISQEMHFGSLETALSPTPSLSMWANGDHARLD-----PGESIRTDWACLSFVDLNENQPLEIYLNTVAKANKIRSEI---------------SVPVGWCSWYHFYQNITQEDIEANLTSVLALKDRVPLPLLQIDDGFETYPGDWFDFVP-GFPEGVM-PLATQISKSDLIPGLWLAPFIVHPKAKLVKEHPEWLLRDENGKLASAGFVWNT-----FTYALDLTHPEALDYACDVIRTAVK-EWGFEYLKLDFLYAAALKGQYQDPTKTRAQVLRNGLEALRHAAGPNTVMLACGCPLGSALGLFDAMRIGADVSGYWEPHFPPVSKLLTKEVNMPSARNALQNILTRAPLHRQWWINDPDCLLVRPDTKLNLPEVQSLATAIGMTGGSLLVSDDLPSLPEDRLRIAQVLLPVIDQQAQICDLFESHTPSLIRLDLEN---------------------PLGLWHLLAVFNWKD------HPADLEFSPQKFNLPLDPIYWCREFWSGEIGK------------MGTSSPFNFEN---VPPHGVRVIAVR----QCQPAL------------------PTYLGSDIHLSQGLEI--------GEWRSNERI----ISLRIEVGRSADG---HINIYLPWEASEAWLD---QRSCSIQSKGQGIY 764          
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|1010955800|ref|WP_061915027.1| (glycosyl hydrolase [Bellilinea caldifistulae] >gi|937444703|dbj|GAP10252.1| alpha-galactosidase [Bellilinea caldifistulae])

HSP 1 Score: 329.331 bits (843), Expect = 2.002e-95
Identity = 197/586 (33.62%), Positives = 300/586 (51.19%), Query Frame = 0
Query:  202 FVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGAGPAEFDRSNNKELYSEMFAHVSDKSRGTGLLLGFLTQHEQFGAFS---FDECFVEVGAHCRCDGQLLKAGELMQSDWCLVELQTTLPEEPFATFIARAGRVNTALLEEKYKIIACRPDTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWF-YGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKGARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVF-----PLPDCKWNLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDLLEREMPEILRLYVDQDSFSEETSMCHAANTFFSHHVPQDPWVVVGFCNWDDAKKHSMHPYKSILRPLLVDVGPVIILHIFEFWTSKYFQVRLNLDLN 778
            + NGW +WS+T      A      L         +   P   D        S+ F  ++D  +   +L GFL+Q E FG+     +D+  + + A+   D  LL++   + +DW ++ +     +     ++    +      E + +I       +N P GWCSWYHFY  ++  N+  NL  +  L+P LPL+L QIDDG++   GDW       FP + + ++   +   G  PGLWLAP  V   SQ+  +HPDW++R +     G   N+  V   F +GLD+T+P+   +  + + T +H  W+F YLKLDFLY AAL G   +PTL  AQVL+  ++ +REA G  T++LGCG+PLG+ +G V+A R+ AD    WLP F     P  +   ++P  RN + N + R P+H  WW+NDPDC+L+R  T  T +EV  +AT  AL+GGS L+SDDL ++   R+R+AQV+LPVIG+ A  LDLL  +MP +LRL ++  +F                      W V+   NW+D  +    P+   L+   +        H+  FW  + F  R    +N
Sbjct:  147 YSNGWQSWSYTAAYPANASLRRSWLGPFQKPMVINAGTP---DLKQTGYFTSDFFGILTDTVQNQSILAGFLSQREHFGSLEAVLYDQPVLRLWANG--DHTLLESQREIHTDWAVLLIADAEQQHILNPYLQAVAQ------EHQVRI------PQNSPSGWCSWYHFYTNVTAQNVRDNLRTLIQLKPSLPLELIQIDDGFESQVGDWFSFKE-TFP-QGVAELSQEIAQAGFTPGLWLAPFIVHPKSQLEREHPDWLLRDR----RGRPVNAGFVWNAFAHGLDLTVPDALEYACQVVRTAAH-EWNFPYLKLDFLYAAALPGVYRNPTLTRAQVLRRGMQALREAAGQETFLLGCGAPLGSVLGLVDAMRIGADVSGDWLPAFYNIRFPFKNEP-HMPSARNSINNILTRAPLHRRWWINDPDCLLVRPDTHLTQSEVESLATAIALTGGSLLLSDDLPALPQERIRLAQVLLPVIGQTARVLDLLSSQMPGLLRLDLE-GAFGN--------------------WHVLACFNWEDTPQ----PWHFELQTFQLKEQDY---HLHSFWDDQTFTCRAGETIN 679          
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|219130254|ref|XP_002185284.1| (predicted protein [Phaeodactylum tricornutum CCAP 1055/1] >gi|217403199|gb|EEC43153.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1])

HSP 1 Score: 330.102 bits (845), Expect = 9.621e-94
Identity = 241/775 (31.10%), Positives = 340/775 (43.87%), Query Frame = 0
Query:  198 PRRVFVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGAGPA----------------EFDRSNNKELY-----------SEMFAHVS----------------------DKSRGTGLLLGFLTQHEQFGAFSFDECFVEVGAHCRCDGQLL---KAGEL------MQSDWCLVELQT--TLPEEPFATFIARAGRVNTALLEEKYKIIACRP-DTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKG-ARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFPLPDCKW----NLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDLLERE--MPEILRLYVDQ-----DSFSEETSMC----HAANTFF-----SHHVPQD-------------------PWVVVGFCNWDDA-------------------KKHSMHPYKSILRPLLVDVGPVIILHIFEFWTSKYFQVRLNLDLNPDALMSTADFFDGMDGGEIQPHSARLYAMRVDPRQIPSALTSTSQGAESARGSTMRRAPLYLGSDVHFSCGWEL 852
            P  ++++G+ +WSF G + +G       +P  LS AF++G  P                    R N+ + Y           S+ F  V+                      D++ G GL+LG+L+Q EQ+G    D        H    GQ++    +G        +++DW   +L    +  EEP   ++  A   N A           RP    ++  GWCSWYHFYE I+                               AWGDW  + P  FP        D V   G+R GLWLAP A DKHS++   HPDWIIR   G+     ANS+N GK+FYGLD T P V+ ++ E +    H +W FD LK+DFLY A L+G  + D +L+ AQ + LA++ IR+A GP+ +++GCG P+G+ IG+V+  RVSAD GP W P  PLP   W     LPC R+MVRN+++R P+   WW NDPDC+LL EST+ T  EV   A+V A++ G  L+SDDL  VS  R  I   + P+ G  A+ LDL      +P +LRL+        DSF E   +     +A  T+F     S+H  +D                    W VV   NW D                    ++    P   +  P  VD       H F FW+SKY              +    + D   G E +    +L A   +   I +     +Q               Y+GSD+HFSCG E+
Sbjct:  234 PTHIYIHGYQSWSFAGSIVKGQDQPQSAMPDFLSRAFNYGGSPPPVSDDVLTYVPPLSHNHIHRDNDGDAYTGPQSWKTHYQSDFFTCVTSDGTIPSFWTSRREKQFPFQALDETGGPGLVLGWLSQREQYGVIMADVDLRRYAMHVSGHGQIIWGRGSGSTTTNTIALETDWAYAQLIAPHSYDEEPMVHYLEAAAGYNQA-----------RPLRNGSLLTGWCSWYHFYENITA------------------------------AWGDWDSVKPGAFPQGMAAVARDIVAQGGMRAGLWLAPYAADKHSRLVKTHPDWIIRNDSGI----PANSSNCGKFFYGLDATNPAVRTYVYECIRRAVH-SWGFDVLKIDFLYAACLEGNGKHDLSLSRAQTMDLAMQAIRDAAGPNVFLIGCGCPVGSGIGYVDGMRVSADTGPTWYPALPLP---WWDHGTLPCLRSMVRNSMSRAPLGHRWWHNDPDCLLLGESTRLTDEEVASAASVVAMTCGMMLLSDDLTKVSVARTNILTKIFPMTGVTAVVLDLHSASDGLPSLLRLWCTDKYDLLDSFRERMVVSAQDHNAEATYFARQSSSYHPDKDQQHPIERQRSCIHVTKGLGTWTVVSVSNWSDRTAVVNLPPPALLPPPMTGWEQGDEEPESFLQTPEEVDC-EQHGCHAFGFWSSKY------------TWLPNQKYNDNGQGPE-RILRRKLVAHETEIYHIKAVTPDAAQ---------------YVGSDLHFSCGHEV 930          
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|223996607|ref|XP_002287977.1| (hypothetical protein THAPSDRAFT_261491, partial [Thalassiosira pseudonana CCMP1335] >gi|220977093|gb|EED95420.1| hypothetical protein THAPSDRAFT_261491, partial [Thalassiosira pseudonana CCMP1335])

HSP 1 Score: 308.145 bits (788), Expect = 5.485e-93
Identity = 155/328 (47.26%), Positives = 208/328 (63.41%), Query Frame = 0
Query:  351 RPDTKNVPVGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKS-MRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKG-ARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFPLPDCKWN---LPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIG 673
            RP  K +  GWCSWYH+Y  I  D+L  N  ++   +  +   +  IDDGY  AWGDW  + P KF  +  MR + DA+RS+G++PG+WLAP A DK SQ+A DHPDWII+   G      ANSAN GK+FYGLD T P V+  +  T+       W F+ LKLDFLY + L G  + D T++ A+ + L LR IR A     +I+GCG P+G+ IG+V+  RVS D GP W+P FPLP   W+   LP  R M+RNTI+R P+   WW NDPDC+LL EST+ T  EV+  A+V A++GG FL+SDD+Q VS  R+ +A+ + P+ G
Sbjct:   12 RPMEKGLTCGWCSWYHYYSDIDHDSLSKNAHILEQKQKTIGFNVCLIDDGYMTAWGDWTSLKPGKFLKEGGMRVLADAIRSKGMKPGVWLAPFACDKSSQLAKDHPDWIIKNDCGRY----ANSANCGKFFYGLDATNPAVRKHVYNTILRAVE-EWGFEVLKLDFLYASCLAGNGKYDNTMSRAEAMYLGLRTIR-AAAKDAFIIGCGCPIGSAIGFVDGMRVSCDTGPTWVPEFPLP--HWDNGTLPALRGMLRNTISRAPLGHRWWHNDPDCILLGESTKLTDNEVVSAASVVAMTGGMFLLSDDMQKVSDARLNVAKRIFPLTG 331          
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|1084578741|gb|OGO34814.1| (hypothetical protein A2W35_12825 [Chloroflexi bacterium RBG_16_57_11])

HSP 1 Score: 317.39 bits (812), Expect = 3.312e-91
Identity = 248/746 (33.24%), Positives = 332/746 (44.50%), Query Frame = 0
Query:  127 QVCVDFALHTIGPSMVLRLRCPALACSHDVRLDEFVLFDGDCDVSQGPRRDGKSLARIWWEYLSRRVRG--LGPRRVFVNGWNAWSFTGVVQQGARPSAPGLPGILSAAFHHGAGPAEFDRSNNKELYSEMFAHVSDKSRGTGLLLGFLTQHEQFGAFS--FDECFVEVGAHCRCDGQLLKAGELMQSDW-CLVELQTTLPEEPFATFIARAGRVNTALLEEKYKIIACRPDTKNVPVGWCSWYHFYE-----AISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFY-GLDVTLPEVQAFIRETLTTVSHGAWSFDYLKLDFLYVAALKGARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVF----PLPDCKWNLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPVIGRAAIALDLLEREMPEILRLYVDQDSFSEETSMCHAANTFFSHHVPQDPWVVVGFCNWDDAKKHSMHPYKSILRPLLVDVGPVIILHIFEFWTSKYFQVRLNLDLNPDALMSTADFFDGMDGGEIQPHSARLYAMRVDPRQIPSALTSTSQGAESARGSTMRRAPLYLGSDVHFSCGWELRKCIW 857
            +  +DFAL    P++  RLR         V LD   L D  CD   G                  ++RG  L     F NGW  WS+ G      R     L G + A     AG     RS   +  S+MF  + D+S    LL GFL+Q + FG+         + +      DG  L  G  +++DW CL  L    P+ P   ++   GR   A L +  +          +P GWCSWY F       A++E ++  NL  +  L+  LPL + QIDDG++   GDWL  +P  FP+  +  +   +R  GL PGLWLAP  V   S++AA+HP W++R +         N+  +   F   LD+T PE   + R  +    H  W F YLKLDFLY AAL G   DPTL  AQ+L+L L  +R+A G   ++LGC  PLG  +G V+A RVSAD    W+P +         + NLP  RN   N++ R P++  WW+NDPDC+L R  T  T +EV  +ATV AL+GGS  VSD L ++   R+RI + +LP IGR     D LE   P  LRL +   +                      PW ++   NW+DA      P    L P    + P       EFW     Q RL        L  T   F         PHSA L AMR   RQ P                     P YLGSD+H S G E+    W
Sbjct:   51 RCSLDFALPDNDPALFWRLRIENTG-PQPVSLDRLTLLD--CDPISG---------------TQPQIRGIHLQDAAFFSNGWQTWSYAGAYAPRDRFHRTRL-GPIRAPTDVNAGTPMPGRSG--QFSSDMFGVLGDRSSRRALLAGFLSQEQHFGSLEARLQPGGLALSLWANGDGARLDPGVKIKTDWACLYFLDIDDPD-PLGPYLEAVGR--QAGLADSSRTA--------IPTGWCSWYQFSSETYTGALTEGDIRDNLGALTLLKDDLPLSVVQIDDGFEAQIGDWLAFNP-GFPH-GLAPLAAEIRQAGLTPGLWLAPFIVHPRSRLAAEHPGWLLRGR----FNCLVNAGLLWDSFTTALDLTHPEALDYARRVIQAAVH-EWGFSYLKLDFLYAAALPGKHKDPTLTRAQILRLGLSTLRQAAGKGAFLLGCACPLGPAVGLVDAMRVSADTARRWVPAYRGIETFIASEPNLPSARNACHNSLTRAPLNRRWWINDPDCLLARPGTHLTLSEVQTVATVIALTGGSLFVSDHLPALPEERLRIFRALLPPIGRRPRLPDWLESPTPRRLRLDLQGAA---------------------GPWHLLALFNWEDA------PKDLTLLPGDFSLDPQAAYWAREFWRG---QTRL--------LNETGWAFPA-----CPPHSAILLAMR---RQSPD-------------------QPQYLGSDLHISQGLEVDSWQW 692          
BLAST of EWM30348.1 vs. NCBI_GenBank
Match: gi|1072231292|gb|OEU16024.1| (glycoside hydrolase, partial [Fragilariopsis cylindrus CCMP1102])

HSP 1 Score: 298.901 bits (764), Expect = 1.444e-89
Identity = 145/320 (45.31%), Positives = 207/320 (64.69%), Query Frame = 0
Query:  359 VGWCSWYHFYEAISEDNLLSNLEMMANLRPQLPLQLFQIDDGYQRAWGDWLKIDPLKFPNKSMRDMVDAVRSRGLRPGLWLAPMAVDKHSQIAADHPDWIIRQQGGLSHGWAANSANVGKWFYGLDVTLPEVQAFIRETLTTVSHGA--WSFDYLKLDFLYVAALKG-ARSDPTLNTAQVLQLALRLIREAVGPSTYILGCGSPLGATIGWVNANRVSADAGPAWLPVFPLPDCKW----NLPCGRNMVRNTINRLPMHGVWWVNDPDCMLLRESTQFTPAEVIGIATVKALSGGSFLVSDDLQSVSHRRMRIAQVMLPV 671
             GWCSWYH+YE I+E+NL  N   +A ++ Q+P  +  +DDGY  AWGDW  + P KF   +M  +   + S  +RPGLW+AP   DKHS+I  +HP+WIIR + G      ANS+N GK+FYGLD T P+V+  +    T+V      W F  LK+DFLY A+L+G  + D +++ A+ + LAL+ IREA GP+ +++GCG P+G  IG+++  RVSAD GP+W P FPLP   W     LP  R M+RN+++R PM   WW NDPDC+LL EST+ T  EV+  A++ A++ G  L+SDDL  VS  R+++   ++P+
Sbjct:   24 TGWCSWYHYYENITEENLRRNFSRLATMKKQVPTNMAMVDDGYMTAWGDWDSLKPKKF--TTMDVVASDIASSHMRPGLWMAPFTADKHSKIIKNHPEWIIRNEKGH----PANSSNCGKFFYGLDATNPQVRDHV---FTSVRRAVRDWGFRVLKIDFLYAASLEGNGKYDMSMSRAEAMHLALQTIREAAGPNVFLIGCGCPMGTGIGYIDGMRVSADTGPSWYPEFPLP---WFDNGTLPSLRGMIRNSMSRAPMGHRWWQNDPDCLLLGESTKLTHEEVVSAASIIAMTCGMLLISDDLSKVSQDRLQVLNRIVPM 331          
The following BLAST results are available for this feature:
BLAST of EWM30348.1 vs. NCBI_GenBank
Analysis Date: 2020-04-07 (BLAST analysis for N. gaditana B-31)
Total hits: 10
Match Name                          E-value      Identity (%)   Description
gi|585113007|gb|EWM30348.1|         0.000e+0     100.00         alpha-galactosidase [Nannochloropsis gaditana]
gi|397617232|gb|EJK64341.1|         8.253e-110   32.23          hypothetical protein THAOC_14938 [Thalassiosira oc...
gi|1210511636|dbj|GAX29450.1|       1.214e-108   32.92          hypothetical protein FisN_16Hh109 [Fistulifera sol...
gi|1210514799|dbj|GAX26278.1|       2.621e-105   31.92          hypothetical protein FisN_16Lh109 [Fistulifera sol...
gi|973085490|gb|KUK46333.1|         1.419e-95    31.57          Alpha-galactosidase-like protein [Anaerolinea ther...
gi|1010955800|ref|WP_061915027.1|   2.002e-95    33.62          glycosyl hydrolase [Bellilinea caldifistulae] >gi|...
gi|219130254|ref|XP_002185284.1|    9.621e-94    31.10          predicted protein [Phaeodactylum tricornutum CCAP ...
gi|223996607|ref|XP_002287977.1|    5.485e-93    47.26          hypothetical protein THAPSDRAFT_261491, partial [T...
gi|1084578741|gb|OGO34814.1|        3.312e-91    33.24          hypothetical protein A2W35_12825 [Chloroflexi bact...
gi|1072231292|gb|OEU16024.1|        1.444e-89    45.31          glycoside hydrolase, partial [Fragilariopsis cylin...
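
The Identity percentages in this summary can be recomputed from the match counts given in each HSP header. The following is a minimal Python sketch under stated assumptions: the function name `hsp_stats` and the regular expression are hypothetical, written only against the line layout seen in this record, and the pattern accepts the "Positives" label with or without the missing "i".

```python
import re

# Pattern for HSP statistics lines as they appear in this record, e.g.
# "Identity = 252/782 (32.23%), Positives = 370/782 (47.31%), Query Frame = 0".
# "Posi?tives" tolerates the misspelled label "Postives" as well.
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def hsp_stats(line: str) -> dict:
    """Recompute identity and positives percentages from the raw counts."""
    match = HSP_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    ident, ident_len, ident_pct, pos, pos_len, pos_pct = match.groups()
    return {
        "identity_pct": 100 * int(ident) / int(ident_len),
        "positives_pct": 100 * int(pos) / int(pos_len),
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


line = "Identity = 252/782 (32.23%), Positives = 370/782 (47.31%), Query Frame = 0"
stats = hsp_stats(line)
print(stats)
```

Comparing the recomputed and reported values with a small tolerance, rather than for exact equality, avoids depending on how the report rounded its percentages.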
Relationships

This CDS is a part of the following mRNA feature(s):

Feature Name   Unique Name   Species                                       Type
rna346         rna346        Nannochloropsis gaditana (N. gaditana B-31)   mRNA

