NO01G03970, NO01G03970 (gene) Nannochloropsis oceanica

Overview
NameNO01G03970
Unique NameNO01G03970
Typegene
OrganismNannochloropsis oceanica (N. oceanica IMET1)
Sequence length5426
Alignment locationchr1:1057411..1062836 +

Link to JBrowse

Properties
Property NameValue
DescriptionAlpha-l-rhamnosidase
Mutants
Expression

Hover the mouse over a column in the graph to view expression values.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
chr1genomechr1:1057411..1062836 +
Analyses
This gene is derived from or has results from the following analyses
Analysis NameDate Performed
GSE1786722023-12-15
PRJNA9336932023-10-26
PRJNA9434492023-07-19
PXD0160542021-01-14
GSE1499042020-10-10
NoIMET1_WT_HS2020-09-29
GSE1396152020-03-11
PRJNA2413822020-03-11
BLASTP analysis of N. oceanica IMET1 genes2019-07-11
InterPro analysis for N. oceanica IMET1 genes2019-07-11
GO annotation for IMET1v2 genes2019-07-10
PRJNA1821802019-04-25
Gene prediction for N. oceanica IMET12017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0003824catalytic activity
GO:0003824catalytic activity
Vocabulary: INTERPRO
TermDefinition
IPR0089286-hairpin_glycosidase-like
IPR008902Bac_rhamnosid
IPR035396Bac_rhamnosid6H
IPR013737Bac_rhamnosid_N
Homology
BLAST of NO01G03970 vs. NCBI_GenBank
Match: WP_066792603.1 (alpha-L-rhamnosidase [Caldivirga sp. MU80])

HSP 1 Score: 453.0 bits (1164), Expect = 2.600e-123
Identity = 315/926 (34.02%), Postives = 454/926 (49.03%), Query Frame = 0
Query:   28 ITSQRVEYMPCPAMGIDVANPRFAWVVEPSDISVRGAYQEAFRIIITA----ADAGREVIWDSGRVNNSESRHIRFGSVGELPTAALASATAYDWTVTAWVATDVLPVSPSISSSTTTRKKAETTGAVQSQPSPAARFVTGMLNASKDWAGADWLVAANSSTQNLCRAPFIVPNSEKVERALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLIKGGERGGGGNVLGISLGNGWYSTNGGAEPTDVPR----------TFIAKLFVNGKEVLSTSSKQQEMWLCA-VGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGAAGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRMEELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKGSVTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGYE--PRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMS-VPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDAAGQIGDTAPFSIGGRPADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAGVVNLYKSYGDWCPVPGQ----PMCSPHLTGSFSFLENLQQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFY-HGGTYD------------NGVQTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTVRTLQGVVAMEWEQGAGTL 919
            ++  RVE+   P +GID   PRF+WV+E  +   RG +Q A+RII+++    A  G   +WDSGRV  S  + +++G         L+S T Y W V AW                        +  V+   S    F T +LN  ++W  A W+         L R+ F +    +V  A   + GLGY++  +NG RV D  L+  W++Y + V YSVYDVT L++      GGN +G+ LG G YS    +  T +P             I     NG  V  T+    E W C   GPI+YD +YNG  +D RL    VGWD+P F          +D   + W P +V   P      +  +P ++    L       P+ GV+V DFGQN+ G V L +       V +RH+EV        V   G I+V+N+RGA ATD Y+                R G VE     V EP FTYHGFRY EI GY   P + DV+A+++H+D+ P+G ++ SD ++N I +   W  RANI++ V TDCPQRDER GWLGDA LS + A +NF+MV  Y KF+RD+ D+Q   G I D  P      PADP+WG+A   + + LY   GDV IL E Y A+ K+ +   L ++    +  +  YG+W P PG+      C P +  ++    +   L+ +A  +GK ED   +         AF+ AF    G Y              G QTC ALPL+   VP +    + + LV+++      H   GI G KY+ EVL + G  D+A + + Q  YPS+G+M+    E ATT+WE W     G  MNS NH M G  V  WFY+ LGG+          G+  ++  P + +   +   + ++ T++G+V++EW +  G L
Sbjct:    7 VSDARVEFTVNP-LGIDEQRPRFSWVLEHEE---RGQFQTAYRIIVSSSLENAVKGIGDVWDSGRV-ESRDQVVKYGG------PPLSSFTRYYWRVKAW-----------------------DSRGVEGDWSSIQWFETALLNLGEEWT-AKWIGGG-----QLLRSTFKIDG--EVLEARAYVTGLGYYELRINGERVGDRVLDPPWSEYDKTVYYSVYDVTNLVR-----NGGNAVGLILGRGRYSPVSPSR-TQIPNLKYYDEPKAGAMIRIKLRNGSIVTITT---DESWRCLDKGPIIYDDIYNGYRYDARLEP--VGWDEPGF----------ND---SNWAPCIVVKPPSARLRSTATVPGVKVKGTLKPREYYNPRPGVYVFDFGQNMTGWVRLRVRGLSGMEVKVRHSEV--------VNPDGSINVENIRGAEATDTYVL---------------RGGGVE-----VLEPRFTYHGFRYAEITGYSGVPSIDDVEAVIVHSDLEPVGSLSCSDRMVNDIHRITWWSLRANILNGVVTDCPQRDERMGWLGDAWLSSDSAAYNFNMVKYYEKFIRDMVDSQKDDGSIPDVVPPYWNLYPADPAWGTALIYIPWLLYVHYGDVDILAEAYDAMKKWWN--FLWSKAKDGLLYFSKYGEWVP-PGRIHSIEYCPPEILSTWILHRDALTLAQIARVLGKGEDEGYFKGKAEEIREAFNRAFLTERGYYSRYTAPDGSVKVLGGSQTCNALPLYLDMVPGNRVNDIVKALVNNVEVEWNRHLVVGIFGAKYVPEVLVKHGYVDLAYKAITQETYPSWGFMVK---EGATTLWERWELITGG-GMNSLNHHMLGS-VDAWFYRNLGGI-----IPLEPGFSRIMIKPIMPS--GIRHCSASLYTVRGLVSVEWSRSDGEL 823          
BLAST of NO01G03970 vs. NCBI_GenBank
Match: OGD23077.1 (hypothetical protein A2W03_00645, partial [Candidatus Aminicenantes bacterium RBG_16_63_16])

HSP 1 Score: 441.8 bits (1135), Expect = 6.000e-120
Identity = 268/689 (38.90%), Postives = 366/689 (53.12%), Query Frame = 0
Query:  232 WTDYGRRVPYSVYDVTGLIKGGERGGGGNVLGISLGNGWYSTNGGAEPTDVPRTFIAKLFVNGKEVLSTSSKQQEMWLCAVGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGAAGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRMEELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKGSVTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGY--EPRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMSVPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDAAGQIGDTAPFSIGGRPADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAGVVNLYKSYGDWCPVPGQPMCSPHLTGSFSFLENLQQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFYHGGT--YDNGVQTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTVRTLQGVVAMEWEQGAG 917
            WT Y +R  Y  YDVTG ++      G N +G+ LG GW+ +        +   F   + + G E L   S     W    GPIV DSVYNGE +D RL     GWD   + ++G             W PA   +  P G L S  MP I+  + +V ++++ P+ GV+V D GQN AG   L +      +V +R AE+L         + GMI+ +NLRGARA D+YI                 +GE EE    V+EP FTYHGFRYVEI G+   P    ++  V+HT V   G    S  +LN +Q+ I WGQ +N+ S+PTDC QRDER GW+GDAQ + EEA++NFDM A Y  F+RDI D QD  G I DT P   G RPADP+WG+A+P + +++Y+  GD  ILEE Y  + KYV+  L   E  G+V  Y  Y DW  V   P     +  SF +  +++ L+DMA  +G+E+D K+YG        AFH  ++   T  Y NG QT   L L+ G  P   +  V   L  ++V     H TTGIIG KY++E+L+  G +D+A  +  Q  YPS+GYM+ +    ATT+WELW   + GP+MNS NH MFG  VG W YK L G+   P   E  G+  +  AP +     L     +V+T +G V+  W +  G
Sbjct:    3 WTTYDKRALYVTYDVTGYLR-----QGANAVGVMLGQGWFKS--------LALLFQLNIELEGGERLEVVS--DGSWKVKPGPIVSDSVYNGETYDARL--ETPGWDRAGYDESG-------------WTPAQPVTG-PKGVLSSQMMPAIQVTDTIVPLNMSSPKAGVYVFDMGQNFAGWAELRVRGPRGAAVKMRFAELLY--------DTGMINQENLRGARAEDIYIL----------------KGEGEE----VYEPRFTYHGFRYVEISGFPGAPSADTLRGRVVHTAVEQTGSFACSKPVLNGLQRIIVWGQTSNLHSIPTDCCQRDERMGWMGDAQGTAEEAIYNFDMAAFYTNFLRDIRDVQDEKGTITDTVPHVWGSRPADPAWGTAYPLICWYVYQYYGDKRILEEHYEGVKKYVE-FLRTREENGLVK-YFYYADWVSVDKTP---GSIVSSFYYYYDVRVLADMAKILGREQDAKLYGELVEKIKLAFHKEYFDPKTKNYANGTQTANTLALFLGLAPEADRGAVWGNLFDNIVYKNYSHLTTGIIGTKYIMELLTTHGNSDLAYDIASQTTYPSWGYMIEH---GATTLWELWQL-REGPSMNSHNHPMFGS-VGSWLYKALAGISLAP---ESTGFEKIRIAPQMV--RDLRHAAGSVQTYRGPVSSSWSREDG 617          
BLAST of NO01G03970 vs. NCBI_GenBank
Match: OGU26950.1 (alpha-L-rhamnosidase [Ignavibacteria bacterium GWA2_54_16])

HSP 1 Score: 440.3 bits (1131), Expect = 1.700e-119
Identity = 329/1030 (31.94%), Postives = 490/1030 (47.57%), Query Frame = 0
Query:   32 RVEYMPCPAMGIDVANPRFAWVVEPSDISVRGAYQEAFRIIITAAD----AGREVIWDSGRVNNSESRHIRFGSVGELPTAALASATAYDWTVTAWVATDVLPVSPSISSSTTTRKKAETTGAVQSQPSPAARFVTGMLNASKDWAGADWLVA---------------------ANSSTQNLCRAPFIVPNSEKVERALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLIKGGERGGGGNVLGISLGNGWYSTNGGAEPTDVPRTFIAKLFVNGKEVLSTSSKQQEMWLCAVGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGAAGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRMEELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKGSVTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGYE--PRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMSVPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDAAGQIGDTAPFSIGG-RPADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAG-VVNLYKSYGDWCPVPGQ--PMCSP-HLTGSFSFLENLQQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFYHGGTYD---------NGVQTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTVRTLQGVVAMEWEQGAGTLPTTAXXXXXXXXXXXXXXXXXXXXXSSSRHLQASIQVPLGSKSRVILDGRALGDVNPMLSVTVDGVDLKQAMKMSGQGLEVLQASDDFVELLLASGGWAIGV 1021
            R EY+  P +GID   PRF+W+V       R   Q A+RII+++++     G   IWD+G+VN+ E+ +I F        + L S     W V  W  TD                       + S  S  ARF  G+L  + DW G  W+                        NS T  L R  F +   +++ RA   + GLGY++  LNG++V D+ L+   T+Y +   YS YDVT  +  G+       +G+ LGNG Y  + G +   +    + +      + ++TSS     W  + GP+  + +Y GE +D RL   + GWD+P+F               ++W  A V +  P   + +  M PIR ++ L     +    G  V DFGQN AG V LS+       V LRHAE+L         + G +++   + A +TDVY                   GE  E     +EP FTYHGFRY+EI      P +  V   V+H+DV  +G  + S  L+N+I + I WGQR+N+MS+PTDC QRDER+GWLGDA L+ EE++FNF M + Y KF+RDI   Q   G + DT P  +G   PADP+W +A+ ++ + +Y+  GD  IL  ++ ++ KYV    L     G +V     YGDWCP PG   P  +P  LT ++ FL +   L  +A A+G+EED +   A   +   AF+  F   G Y+         +  QT  ALPL+   VP + +  V E+L+  +V+ Q YH  TGI+G +Y+L+VLS LG+TDVA ++  QR YP +GYM+    E ATT+WE W     G  MNS NH+M G  V  WFY+ L G+     +     ++ + F PP+     L+      R+++G V++ W +                               S+     +I VP+G+   V +  +    V  +   TV     +Q  K   QG E++   + +V L + SG +A  V
Sbjct:   10 RCEYLIDP-VGIDEPRPRFSWIVPQGG---RTRSQRAYRIIVSSSELRALQGEGDIWDTGKVNSDETSNIPFNG------SVLLSRQECFWRVCWWEQTD-----------------------ISSPWSGIARFEMGLLQET-DWKG-QWISRQGVKEFRSKGSTLLGEPLGDYVNSFTLYL-RKEFRL--KKRIARARAYVCGLGYYELRLNGSKVGDSVLDPAQTEYRKVALYSTYDVTSHLASGD----ACTIGVLLGNGRYIRSYGYDAPKLRMQLVVECEDGTVDSVATSSD----WKVSYGPLQENGLYFGERYDARL--EMPGWDEPSFDD-------------SSWESAQVVAGVP---VAAQMMEPIRVVQILSPRKWSMLSSGEAVYDFGQNFAGWVRLSVSGPAGTEVKLRHAELLN--------DDGSLNISPNQNAESTDVYTL----------------RGEGAE----AYEPRFTYHGFRYLEITADPALPSIISVLGCVVHSDVAEVGQFSCSHELINKIHRNILWGQRSNLMSIPTDCSQRDERQGWLGDAHLAAEESMFNFGMASFYTKFLRDIHHAQREDGSLPDTVPAYLGRLYPADPAWSAAYITIAWLMYQFYGDTRILGRYFDSMKKYV--FFLRDHSDGLIVKTLGKYGDWCP-PGSIAPKRTPVELTSTWYFLHDTVLLGRIAAAIGQEEDQRKLEALAINIRSAFNRTFLRDGEYEVNRFAPVDRSPGQTSNALPLYLDMVPPEGKAKVVERLLHGVVNEQDYHLDTGILGTRYLLDVLSDLGRTDVAFRVAVQRTYPGWGYMVE---EGATTLWERWEK-IGGGGMNSHNHIMLGS-VDAWFYRVLAGL-----STLAPAWKHIRFKPPVV--QGLDSAQAEFRSVRGRVSVAWRR-------------------------------SAEEFLMAIVVPVGAVGTVYVPVQREDHVVILDGTTVWSATQRQGSK--AQGCELVGCEEKYVLLNVGSGEYAFRV 899          
BLAST of NO01G03970 vs. NCBI_GenBank
Match: WP_012185241.1 (alpha-L-rhamnosidase [Caldivirga maquilingensis] >ABW01021.1 alpha-L-rhamnosidase [Caldivirga maquilingensis IC-167])

HSP 1 Score: 439.9 bits (1130), Expect = 2.300e-119
Identity = 318/920 (34.57%), Postives = 454/920 (49.35%), Query Frame = 0
Query:   28 ITSQRVEYMPCPAMGIDVANPRFAWVVEPSDISVRGAYQEAFRIIITA----ADAGREVIWDSGRVNNSESRHIRFGSVGELPTAALASATAYDWTVTAWVATDVLPVSPSISSSTTTRKKAETTGAVQSQPSPAARFVTGMLNASKDWAGADWLVAANSSTQNLCRAPFIVPNSEKVERALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLIKGGERGGGGNVLGISLGNGWY---STN----GGAEPTDVPR-TFIAKLFVNGKEVLSTSSKQQEMWLCAV-GPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGAAGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRME-ELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKGSVTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGYE--PRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMS-VPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDAAGQIGDTAPFSIGGRPADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAGVVNLYKSYGDWCPVPGQ----PMCSPHLTGSFSFLENLQQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFY-HGGTYD------------NGVQTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTVRTLQGVVAMEWEQ 914
            I   RVE+   P +GID + PRF+W++E  +   RG YQ A+R+I+++    A  G   +WDSG+V NS  + I++          L+S T Y W V AW                        +  V+   S    F T +L   ++W+G  W+         L R  F V  S  V  A   + GLGY++  +NG RV D  L+  W++Y + V YSVYDVT L+K GE     NV+G+ LG G Y   S N     G +  D P+ + + ++ ++   V++ ++   E W C V GPI+YD +YNG  +D RL     GWD            AG D   + WV   V   PP G L S    P  +++  L       P+ GV+V DFGQNI G V L +       V +RH+EV+           G ++V+N+RGA ATD YI       +GR         +VE     V EP FTYHGFRY E+ GY   P + DV+A+++ TD    G I  S  ++N I +   W  RAN+++ + TDCPQRDER GWLGDA LS + A+FNF+MV  Y KF+RDI D+Q   G I DT P      PADP+WG+A   + + LY   GDV ILEE Y A+ K+     L + V   V  +  YG+W P PG+      C P +  ++    +   L+ +A  +G+ ED   +         AF+  F    G Y              G QTC ALPL+   VP +    + + L  ++      H   GI G KY+ EVL + G  D+A + + Q  YP +GYM+    E ATT+WE W     G  MNS NH MFG  +  WFY+ L G+     T E  G+  ++  P + +   L   + ++ T++G+ ++EW +
Sbjct:    7 IIDARVEFTVNP-LGIDESKPRFSWILEHEE---RGQYQSAYRVIVSSSLENAVKGIGDVWDSGKV-NSRDQVIKYNG------PPLSSFTKYYWRVKAW-----------------------DSNGVEGDWSDVQWFETAVLK-PEEWSG-KWIGGG-----QLLRRSFRVEGS--VIEAKAYVTGLGYYELRINGERVGDRVLDPPWSEYDKTVYYSVYDVTNLVKSGE-----NVIGLILGRGRYGPVSPNRAQIPGLKYYDEPKASAMIRIRLSDGSVITINT--DESWKCLVKGPILYDDIYNGYRYDARLEP--YGWD-----------KAGFDD--SNWVQCSV-VKPPGGRLRSTAAVPGTKVKGTLKPREYYNPRPGVYVFDFGQNITGWVRLRVRGSSGVEVKVRHSEVIN--------SDGSLNVENIRGAEATDTYIL------SGR---------DVE-----VLEPRFTYHGFRYAEVTGYPGVPSIDDVEAVIVQTDFESTGSIATSSKIINDIHRITWWSLRANLLNGIQTDCPQRDERMGWLGDAWLSSDSAVFNFNMVKYYEKFIRDIIDSQRDDGSIPDTVPPYWNTYPADPAWGTALIYIPWLLYVHYGDVEILEEAYEAMKKWWS--FLNSRVKDNVLYFSKYGEWVP-PGRVFSAEYCPPEILSTWILYRDTLTLAQIAKVLGRGEDASFFTKRAEEIRDAFNRVFLTERGYYSKYTAPDGSVRMLGGSQTCNALPLYLDMVPGNRVNDIVKALAHNIEADWDRHLVVGIFGAKYVPEVLVKYGYVDLAYRAVTQESYPGWGYMIK---EGATTLWERWEK-LTGAGMNSHNHHMFGS-IDAWFYRDLAGL----MTLE-PGFSRIMIKPNIPS--ELRYCSASLYTVRGLTSVEWSR 817          
BLAST of NO01G03970 vs. NCBI_GenBank
Match: WP_012548615.1 (alpha-L-rhamnosidase [Dictyoglomus thermophilum] >ACI19983.1 alpha-rhamnosidase [Dictyoglomus thermophilum H-6-12])

HSP 1 Score: 434.9 bits (1117), Expect = 7.300e-118
Identity = 300/922 (32.54%), Postives = 455/922 (49.35%), Query Frame = 0
Query:   41 MGIDVANPRFAWVVEPSDISVRGAYQEAFRIIITAA----DAGREVIWDSGRVNNSESRHIRFGSVGELPTAALASATAYDWTVTAWVATDVLPVSPSISSSTTTRKKAETTGAVQSQPSPAARFVTGMLNASKDWA--GADWLVAANSSTQNLCRAPFIVPNSEKVERALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLIKGGERGGGGNVLGISLGNGWYSTNGGAEPTDVPRTFIAKLFVNGKEVLSTSSKQQEMWLCAVGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGAAGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRMEELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKG-SVTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGYE--PRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMSVPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDAAGQIGDTAPFSIGGRPADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAGVVNLYKSYGDWC-PVPGQPM-CSPHLTGSFSFLENLQQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFYHGGTYDN--------------------------------------GVQTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTVRTLQGVVAMEWEQ 914
            +G+D  NP F+W +   +   +   Q A+++I++++    +     +WD+G+V +SE        V +     L     Y W V  W + D       +++  T          + ++ +  A+++T   +  + ++  GA + +    +   + R  F +  S+K++RA + +AGLG ++  +NG R+ D  L+ G TDY +RV Y+VYDV+  I+ G+     N +G+ LGNG Y    G    D P+  I ++ V  ++         E W    GPI  +S+Y+GE++DGR    + GW+ P F               +TW  A++ + PP G+L S   PPIR  + +  I +  P+ G +V DFGQN  G + + +     G  + +RHAE+          E G ++    R A ATDVYI                 +GE  EE    +EP FTYHGFRYVEI GY   P L D++  V+HT V   G+   S+ L+N+I   I WGQ +N+MS+PTDCPQRDER GW+GDAQLS EEA+FNFDM+  Y K++ DI D Q   G + D  P      P DP+W +A+ ++ ++LY+  GD  +LEE Y    KYV+ +   A    +V+ YK YGDWC P   +P   S  LT +F F  ++  LS +A  +GKE D K Y         AF+  F     Y +                                        QT   LPL+   VP D  + V + L+ D++    YH  TGI+  +Y+ +VL+  G  +VA +++ Q+ YPSFGYM+    E ATT+WE W        MNS NH+MFG  V  WFY+ + GVR G       G+  +IF P       L+     + T++G V + W++
Sbjct:   21 LGVDKKNPIFSWKLRHLE---KNEKQTAYQVIVSSSLETINDNIGDVWDTGKVLSSE-------QVIKYEGKELEPCKVYFWKVRWWDSKDQESPFSVVNTFET---------GLMNEENWKAKWITKKEHKYEVYSPDGAPFGLNYTIAYAPMFRKSFSI--SKKIKRARVYIAGLGLYELYINGERIGDRVLDPGQTDYKKRVLYTVYDVSKNIRDGK-----NAIGVILGNGRYVKEYG---YDFPK-LIIQVLVEYEDDSIEWIVSDESWKTTYGPITLNSLYHGEIYDGR--KEIKGWNLPDFDD-------------STWENAIL-AEPPGGKLYSEIYPPIRITKTIKPIKMWSPEPGTYVYDFGQNYTGWIKIKVRTNESGKEIRIRHAELTY--------EDGTLNYSTNRTALATDVYI----------------TKGEGYEE----YEPRFTYHGFRYVEILGYPGVPTLEDIEGKVVHTAVESNGEFICSNELINKIHHNIIWGQLSNLMSIPTDCPQRDERMGWMGDAQLSAEEAIFNFDMIGFYRKYLNDIRDAQKENGSLSDVIPPYWSIYPGDPAWSTAYITIAWYLYQYYGDKYVLEEHYEGFKKYVEFLKKLAP-DYIVSFYK-YGDWCQPGTVRPKDNSGELTSTFYFYHDVITLSKIAKLLGKEADYKYYSELADKIKSAFNKKFLKEKAYASIPTELSEENVKALLEKYPEDIKDFLRQQFTILSSLGMFTSQTLNTLPLYLNLVPEDKVQDVLKTLLEDIIIRHDYHLDTGIVATRYIFDVLTSYGYDEVAYKIVNQKTYPSFGYMIE---EGATTLWERWEK-LTSTGMNSHNHIMFGS-VDAWFYRVIAGVRVGE-----PGWNKIIFEPHPVG--DLKYAKARLNTIKGEVEINWQK 854          
BLAST of NO01G03970 vs. NCBI_GenBank
Match: WP_013782175.1 (alpha-L-rhamnosidase [Mahella australiensis] >AEE97752.1 alpha-L-rhamnosidase [Mahella australiensis 50-1 BON])

HSP 1 Score: 431.8 bits (1109), Expect = 6.200e-117
Identity = 311/912 (34.10%), Postives = 443/912 (48.57%), Query Frame = 0
Query:   26 YTITSQRVEYMPCPAMGIDVANPRFAWVVEPSDISVRGAYQEAFRII----ITAADAGREVIWDSGRVNNSESRHIRFGSVGELPTAALASATAYDWTVTAWVATDVLPVSPSISSSTTTRKKAETTGAVQSQPSPAARFVTGMLNASKDWAGADWLVAANSST---QNLCRAPFIVPNSEKVERALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLIKGGERGGGGNVLGISLGNGWYSTNGGAEPTDVPRTFIAKLFVNGKEVLSTSSKQQEMWLCAVGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGA--AGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRMEELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKGSVTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGY--EPRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMSVPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDAAGQIGDTAPFSIGGRPADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAGVVNLYKSYGDWCPVPGQ--PMCSP-HLTGSFSFLENLQQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFYH--GGT--------YDNGVQTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTVRTLQGVVAMEWEQ 914
            Y     R +YM    +GID   P F+W ++          Q A+R+I    I A +     IWDSG V +  S  + +G       A L   T Y W    W   +   +SP    +T      ET   +    +  A+++           G D + A         +L R  F      K+ RA   ++G+GY++  +NG +V D  LE G TDY + V YS YD+T  I       G N +G+ LGNG Y  + G    D PR  IA++ +  ++         E W     PI  + +Y GE +D R+   + GWD P     G++ A  AG++  AA           P G+LIS  MPPI+  +   A+ LT P+ GV++ DFGQN  G V L         V L+ +E+L         + G ++ +  R A ATD YI                 +GE  EE    +EP FTYHGFR+VE+ GY   P L  ++   +HT V P G    S+ L+N I K I +GQ +NIMS+PTDCPQRDER GW+GDAQL  EEA +NFDM A + K++ DI D Q   G + D  P      PADP+WG+A+ S+ + LY+  GD  + EE Y  + ++VD +    +  G+VN Y  YG+WC  PG   P   P  +T +F +  +   LS++A  +G+ ED   Y         AF+  F++  GGT        Y +  QTC  L L     P   ++ V  KL+  +V    YH  TGIIG KY+L+ L   G  DVA +M+ Q+DYPSFGYM+    E  TTIWE W        MNS NH+MF G V  WFYK L G+       + +G++ +   P +     L   + TV+++ G ++  W++
Sbjct:    6 YAPMDLRCDYM-ADFLGIDNTKPIFSWGLKHDQ---PNQSQTAYRLIVADNIEAINNDEGNIWDSGLVPSDNSTCVVYGG------APLKPYTQYFWK-ACWQDKN-RQISPYSQIAT-----FET--GLMGDANWQAKWIGDKSRQQVVLPGGDGMNAGKGFVLYMGSLFRKEF--KPKGKIARARAYISGIGYYELRINGQKVGDRVLEPGQTDYKKTVLYSTYDITPYI-----NDGANAIGVILGNGRYVKDYG---YDFPR-LIAQMHIYYQDGSMDVITTDETWKTHASPIRENGIYYGETYDARM--EIEGWDMP-----GLNDADWAGAEKVAA-----------PGGKLISQAMPPIKITKTFPAVKLTNPKPGVYIYDFGQNFTGWVRLKAEGPAGTQVKLKFSELLY--------DDGTLNTNVNRNAEATDTYIL----------------KGEGIEE----YEPRFTYHGFRFVEMTGYPGTPSLDTLEGRFIHTAVEPKGSFECSNQLINNIHKNIIYGQLSNIMSIPTDCPQRDERMGWMGDAQLVAEEAGYNFDMAAFWKKYLNDIKDCQKEDGSLSDVVPAYWPIYPADPAWGTAYISIAWELYKFYGDTTVFEEHYDNMKRWVDFLTAHTDEIGLVNYY-HYGEWC-APGSVPPKNMPWEITSAFYYYHDALVLSEIAKLLGENEDADNYARLAAEIKSAFNKKFFNEKGGTFFANESNNYSSTSQTCNVLGLQYNLAPEGTEQDVINKLIKLIVVDADYHFDTGIIGTKYILDTLDEYGYKDVAYKMMTQQDYPSFGYMIR---EGGTTIWERWEK-LTNKGMNSHNHIMF-GTVDAWFYKALAGI-----LPDSSGFKKITIKPVVPE--GLNYASATVKSIYGAISSRWQR 827          
BLAST of NO01G03970 vs. NCBI_GenBank
Match: WP_008193444.1 (MULTISPECIES: alpha-L-rhamnosidase [Thermotoga] >EJX26467.1 alpha-L-rhamnosidase [Thermotoga sp. EMP] >AIY87265.1 alpha-L-rhamnosidase [Thermotoga sp. 2812B])

HSP 1 Score: 431.8 bits (1109), Expect = 6.200e-117
Identity = 320/974 (32.85%), Postives = 466/974 (47.84%), Query Frame = 0
Query:   32 RVEYMPCPAMGIDVANPRFAWVVEPSDISVRGAYQEAFRIIITAADAGR----EVIWDSGRVNNSESRHIRFGSVGELPTAALASATAYDWTVTAWVATDVLPVSPSISSSTTTRKKAETTGAVQSQPSPAARFVTGMLNASKDWAGADWLVAAN----------SSTQNLCRAPFIV------PNSEKVERALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLIKGGERGGGGNVLGISLGNGWYSTNGGAEPTDVPRTFIAKLFVNGKEVLSTSSKQQEMWLCAVGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGAAGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRMEELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKGSVTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGYE--PRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMSVPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDAAGQIGDTAPFSIGGRPADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAGVVNLYKSYGDWCPVPG--QPMCSP-HLTGSFSFLENLQQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFYH-----------GGTYDNGVQTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQK-YHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTVRTLQGVVAMEWEQGAGTLPTTAXXXXXXXXXXXXXXXXXXXXXSSSRHLQASIQVPLGSKSRVILDGR 969
            R EYM  P + IDV  PRF+W++       R  +Q A+RII++++          IWDSG+V + E+ ++ +G         L S T Y W V  W                T  ++AE         S  A F T  L   +DW  A W+              +S Q+ C+  +          ++ V+RA   + GLG ++  +NG +V D  L+ G TDY +   YS YD+T  +         N +G+ +GNG +  + G     +   F+ +     +E L T     E W  + GP++ + +Y GE +D RL   + GWD+  F               ++W     ++ P   +L S  MPPI+  E L    +  P+ GV+V DFGQN  G V L +       + +R+AE+        V   G ++   LR A +TDVYI                 +G  EE    ++EP FTYHGFRYVEI GY   P L  V+   +HT V  IGD   S+ L+NQI K I WGQ +N+MS+PTDCPQRDER GWLGDAQL+ EEA+ NFDM A Y K++ DI  +Q   G I D  P      PADP+WG+A+ ++ +++Y    D  +LEE Y ++ KYV+ +   ++   + NL K YGDWCP PG   P  +P  LT +F F  +   LS +A  + K+ D + Y     +   AF+  F             G    +  QT   LP+++  VP + +  V + L  +LV+ +   H  TGIIG +Y+LEVLS  G+ D+A +++ Q  YP +GYM+    E ATT+WE W     G  MNS NH+MFG  +  WFYKY+ G+R         G++     PP+ A   L+  +  +  +QG + + WE+  G                           S   + QA + VPL SK  V   GR
Sbjct:    8 RCEYMRNP-INIDVEKPRFSWILISDQ---RDQFQRAYRIIVSSSYEKALNWIGDIWDSGKVLSGENINVEYGG------KELESFTRYYWRVKCW----------------TDNEEAE---------SDIAFFETAALE-EEDWK-AKWITKKEFISFISNENPASGQDKCKQYYAAYFRKEFEITKNVKRARAYICGLGIYELRINGRKVGDNVLDPGQTDYSKIALYSTYDITDYL------NEKNAIGVIVGNGRHIESYGYGKPRLIAQFLIEYEDGTREFLVT----DENWRVSHGPLMENGIYYGERYDARL--EMPGWDEYGFDD-------------SSWEEVEATNRP---KLKSQMMPPIKITETLKPKKMWSPKPGVYVFDFGQNFTGWVRLKVRGPRGTEIKIRYAEL--------VDNDGTLNTSTLRSAESTDVYIL----------------KGAGEE----IYEPRFTYHGFRYVEITGYPGVPTLESVEGRFVHTAVEKIGDFVCSNDLVNQIHKNIIWGQLSNLMSIPTDCPQRDERMGWLGDAQLTAEEAILNFDMAAFYTKYLMDIRLSQREDGSIPDVVPPYWKLYPADPAWGTAYITIAWYMYLYYNDKRVLEEHYDSMKKYVEFLKDNSDNYIIRNLGK-YGDWCP-PGSIHPKGTPLELTSTFYFYHDALLLSKIAKVLNKKIDVEKYKRLSENIKNAFNGEFLKFEKGKHIYSGCGCNKKSTNQTLCVLPIYSNIVPEEYKLEVFKSL-EELVEVRSDGHLDTGIIGTRYLLEVLSDNGRFDLAYKIVTQETYPGWGYMIK---EGATTLWERWEKLCNG-GMNSHNHIMFGS-IDAWFYKYVAGIR-----ILEPGWKKFQVKPPINA--DLQFASARLIAIQGEIFVSWERLNGKFRMVV---------------------SIPANTQALVYVPLMSKKEVFESGR 852          
BLAST of NO01G03970 vs. NCBI_GenBank
Match: OHE61220.1 (hypothetical protein A2Y36_07205 [Treponema sp. GWA1_62_8] >OHE65812.1 hypothetical protein A2001_11755 [Treponema sp. GWC1_61_84] >OHE71429.1 hypothetical protein A2Z99_21490 [Treponema sp. GWB1_62_6])

HSP 1 Score: 425.2 bits (1092), Expect = 5.800e-115
Identity = 307/901 (34.07%), Postives = 431/901 (47.84%), Query Frame = 0
Query:   43 IDVANPRFAWVVEPSDISVRGAYQEAFRIIITA----ADAGREVIWDSGRVNNSESRHIRFGSVGELPTAALASATAYDWTVTAWVAT---------DVLPVSPSISSSTTTRKKAETTGAVQSQPSPAARFVTGMLNASKDWAGADWLVAANSSTQNLCRAPFIVPNSEKVERALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLIKGGERGGGGNVLGISLGNGWYSTNGGAEPTDVPRTFIAKL--FVNGKEVLSTSSKQQEMWLCAVGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGAAGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRMEELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKGSVTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGYE--PRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMSVPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDAAGQIGDTAPFSIGGRPADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAGVVNLYKSYGDWCPVPG--QPMCSP-HLTGSFSFLENLQQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFYHG-GTY--------DNGV-QTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTVRTLQGVVAMEWEQ 914
            ++   PR +W+ E     +RG  Q A+RII +A     + GR  +WDSG+V ++E+  + +G         L   +AY W V  W               P S + S  T    +A    +  S   PA    + +L   KD    +  V +        R  F +   E   RA   + GLGY++A  NG R+ D  L+ G TDY R+  YS Y+  GL++ G      N LGI LGNG Y    G      PR F+  L  F +G   L  +      W C+ GPI+++ +Y+GE  D RL     GW  P F  +G             W   VV   P   +L S  +PPIR    L A  L  P  GV+V DFGQN +G V L L       VTL+ AE+L           G +++   R +R+ D YI                  G+ EE     FEP FTYHGFRY E+ G+   P L   + +V+HTDV   G    +D L+N+I + I WGQ +N+MS PTDCPQR ER GWLGDAQL+ EEA  NFDM   Y+K++ DI   Q + G + D  P      PADP+WG+A+ +L   +YE+ GD  IL+  Y  + +YVD +  +AE   ++     YGDWCP PG   P  +P   T ++    +  + S +A A+G++ D + Y +      RAF+ AF  G G Y        D  V QT   LPL+   VP D ++     L+  +      H  TGI+G +Y+ EVL   G  + A +++ Q  YP +GYM+    E AT++WE W        MNS NH+MFG  V  WFY+ LGG+           ++L+  AP   A   L     +  T++G VA+ W +
Sbjct:   18 LETRRPRLSWIPES---RLRGEAQTAYRIICSADKGTVERGRGDLWDSGKVRSAENHLVAWGG------PDLDECSAYFWAVRTWGEVRGAAAEGKETESPWSEAASMETAVLDQASWKASWISMEDPAFEETSVLL--VKDTINNNVHVPSKLYQAIYLRHEFDL--KEAPVRARAYVCGLGYYEAYANGTRLGDHRLDPGQTDYSRKALYSTYNAEGLLRKGR-----NALGIILGNGRYLDAYG---FGEPRAFLQLLCEFSDGSRTLIVTGPS---WTCSHGPILHNGIYSGETCDARLEQE--GWAGPGFTGSG-------------WKNVVVVEGP---KLESQTLPPIRATVTLPARDLANPAPGVYVFDFGQNFSGVVRLRLRGPRGTPVTLQFAELL--------GPDGRLNLGTNRESRSRDRYIL----------------SGKGEE----TFEPRFTYHGFRYAEVTGFPGVPGLDSAEGVVIHTDVRTAGTFVCADPLVNKIHRNILWGQLSNLMSAPTDCPQRGERMGWLGDAQLASEEACCNFDMAGFYVKYLEDIRLAQRSDGSLSDVVPPYWPLYPADPAWGAAYVTLALTMYEQYGDPDILDRHYEGMKRYVDFLESSAE-DHILKSLGRYGDWCP-PGTIYPKKTPMEFTSTWYLYFDTLRFSRIAAALGRKTDAEEYASRAEEVARAFNDAFLLGEGRYTTFRMSPIDRSVGQTTQTLPLFLDLVPPDQRERAVIHLMKAVATDADCHVDTGIVGARYLFEVLRDAGHAETAWKVITQTSYPGWGYMVA---EGATSLWERWEK-LGSMGMNSHNHIMFGS-VDAWFYRTLGGIIPLEPL-----WQLISIAP--YAVGALTHAAVSQETIRGTVAVSWSR 834          
BLAST of NO01G03970 vs. NCBI_GenBank
Match: GAK50072.1 (alpha-rhamnosidase [Candidatus Moduliflexus flocculans])

HSP 1 Score: 424.1 bits (1089), Expect = 1.300e-114
Identity = 305/939 (32.48%), Postives = 449/939 (47.82%), Query Frame = 0
Query:   28 ITSQRVEYMPCPAMGIDVANPRFAWVVEPSDISVRGAYQEAFRIIIT----AADAGREVIWDSGRVNNSESRHIRFGSVGELPTAALASATAYDWTVTAWVATDVLPVSPSISSSTTTRKKAETTGAVQSQPSPAARFVTGMLNASKDWAGADWLVAANSSTQNLCRAPFIVPNSEKVERALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLIKGGERGGGGNVLGISLGNGWYSTNGGAEPTDVPRT------------FIAKLFVNGKEVLSTSSKQQEMWLCAVGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGAAGSDGFAATWVPAVVSSNPPTGELIS-PKMPPIRRMEELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKGSVTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGY--EPRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMSVPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDAAGQIGDTAPFSIGGRPADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAGVVNLYKSYGDWCPV-PGQPMCSPH-LTGSFSFLENLQQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFYHGGTYDN--------------------------------GVQTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTVRTLQGVVAMEWEQ 914
            +T+ R EY   P +GID   PR +W+ E  +   RG  Q A++I++       DA R  +WDSG+V +  S    +   GE    AL S T Y W V  W   D                         S  S  A F T   +A+ DW GA W ++A  +   L R  F V   + V +A L + G+GY++A LNG ++ D  L+ GWTDY + + Y+ +DVT L++       GN LGI LGNG +S +      +V RT             +A+L +   +  +        W    GPI    +Y+GE +D RL     GWD              SD   A W PA ++ + P G+L+S    PP++  + L   +LT    GV++ DFGQN +G V L +       +T+R+AE+L        P+  + +V N R A AT+ YI                 +GE +E    VFEP FTYHGFRYVE+ G+   P L  ++  V+H+ +   G    S  LLNQI + I WG R+N MS+PTDCPQRDER GWL DA L+ E A++NFDM   Y K++RDI D Q   G + D  P      PADP+WG+A   + + +Y+  GD  +LEE YP + +Y+    L +     V  +  +GDWCP      + +P+ L   + +  +   +S +A  +GK  +   Y         AF+  F HG  Y                                    QT   L L+   VP +L+  V   LV D++     H  TGIIG +Y+ +VLS  G  ++A ++  Q  YPS+GYM+    E ATT+WE W        MNS+NH+M G  +  WFY+YL G+++ P+     G++ ++  P +     L  V+ ++ T +G++A+ W +
Sbjct:    3 VTNLRCEYTENP-LGIDAREPRLSWLFEDPE---RGQKQTAYQILVARRKELLDAERGDLWDSGKVASPLSAQAAY--AGE----ALQSCTRYYWAVRVWDRDD-----------------------QASAYSSPAFFETAFFDAN-DWQGA-W-ISAGETAGPLLRKTFNV--DKPVSKARLYICGVGYYEARLNGQKIGDHVLDPGWTDYAKTLLYTTFDVTHLLR-----RDGNALGILLGNGRFSPS----DEEVKRTPQILKKYAPAPVVLAQLHIEFSDNTTMRILSDATWKTTSGPIQSSDIYDGERYDARLEKS--GWD-------------FSDYNDAGWQPAQIAKH-PGGQLVSQATFPPVKISQTLPPQTLTIVSPGVYIYDFGQNFSGWVKLRVAGARGTKITIRYAELL-------YPDGTLNTVPN-RTASATETYIL----------------KGEGQE----VFEPRFTYHGFRYVEVSGFPGTPSLHALEGQVVHSALETAGSFLCSHPLLNQIHQNILWGLRSNFMSIPTDCPQRDERMGWLADAHLAAEAAIYNFDMAGFYAKWLRDIRDAQLDNGSVPDVVPMYWPIFPADPAWGTACLVIPWMVYQYYGDRRVLEENYPVMQRYL--AFLNSLAHDDVLDFGRWGDWCPPWHVNSVDTPYELVSQWHYYHDTALMSQIAAILGKPAEADEYRKKAERIKTAFNRKFLHGSQYGGTPDRWYQRLIPKVATLDEAQVIEQHLADTFAVRSQTGPVLALYLNLVPEELKAAVVHGLVQDIIVMHGTHVNTGIIGTRYLFDVLSEHGHAELAYKLATQTTYPSWGYMIK---EGATTLWERWEY-LTDLGMNSQNHIMLGS-IDAWFYRYLAGIQRDPS---APGWQHILIRPHVLG--DLTFVSASLNTPKGLIAVSWRK 838          
BLAST of NO01G03970 vs. NCBI_GenBank
Match: OEU17453.1 (Bac_rhamnosid-domain-containing protein [Fragilariopsis cylindrus CCMP1102])

HSP 1 Score: 421.8 bits (1083), Expect = 6.400e-114
Identity = 263/694 (37.90%), Postives = 378/694 (54.47%), Query Frame = 0
Query:  204 ALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLIKGGERGGGG----------NVLGISLGNGWYSTNGGAEP-------TDVPRTFIAKLFVNGKEVLSTSSKQQEMW-LCAVGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGAAGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRMEELVAISLTEPQQGVFVVDFGQNIAGRVALSLPRRGKGS-VTLRHAEVLQHAGIAAVPEPGMISVDNLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGFRYVEIRGYEPRL--SDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRANIMSVPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQDA-AGQIGDTAPFSIGGR-PADPSWGSAFPSLVYFLYEETGDVGILEEFYPALAKYVDSVLLAAEVAGVVNLYKSYGDWCPVPGQPMCSPHLTGSFSFLENLQQLSDMAFAMGKEED-----GKVYGAYHRSFTRAFHAAFYH--GGTYDNGVQTCLALPLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGKTDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVMFGGPVGGWFYKYLGGVR 868
            A L ++G+GY    LNG +V D +L+ GWT++ +R  Y+ YDVT +++  +    G          NV+ + LGNGW+S   G  P       T+ P   I +L +NG  VL +     E W   A  PI+Y+S+YNGE++D R+A  + GW    +   G S +  +D  A+    A++SS     +L  P      R    + +S         V+DFGQN AG V L      +G+ VTLRHAE++ H       +   I V+N+R A+  D YI                 +GE        + PTFTYHGFRYVE+ G    L  SDV A+ +HTDV     I F+D LLN+IQ  + WG ++N MSV TDC QRDERKGW+GDA L+ E A+ ++DM A Y  ++  + D Q+   G + +  P   GGR    P+W +A+P +++ +    GD  +L   + +L +Y D +  +    GV      +GDW P P  P   PHL G+F+FL +++   D+ F   +  D      ++     +  T  +H AFY+   G Y +G+QT  AL L+ G VPADLQ  +   LV D+  T   H+T+GIIG+KY +EVLS+L + DVA+ +  Q  YPS+GYM+ +  EPATT+WELW+SD AGP MNSRNH MFG  +  WFYKY+ G++
Sbjct:  263 ATLFVSGIGYNHVYLNGVKVGDHQLDPGWTNFTKRTWYTSYDVTHMLQFDDNVENGVLSSDAVANNNVIAVMLGNGWWSC--GPPPGTKQSYCTNDPPQLILQLHINGHPVLIS----DETWRASADSPIIYNSIYNGEIYDARIAESIEGWTSLHYDDEGWSQSKIADTVAS---KAILSS-----QLFEPIRHISTRSPISIVVSGKADNNLTQVLDFGQNQAGIVRLKRFFCPRGTQVTLRHAELIMHPPYGYY-DNSTIYVENMRTAKPNDYYI------------CTGNPQGE-------SYTPTFTYHGFRYVEVTGLNHALDPSDVAAVEMHTDVKQTSLIKFADPLLNKIQHMVMWGLKSNFMSVQTDCNQRDERKGWMGDAALTAEAAVLSYDMGAFYTHWLSQMVDNQNPDDGSMPNIVP--PGGRIEGAPNWQTAYPMILWVMITYYGDRELLFYHHDSLVRYFDFLESSYSRTGVKKFRTGFGDWVPPPPHPKSDPHLMGAFAFLGDMKLGIDI-FKYSRHPDAGAQLNRLKDLLEKVVTE-YHDAFYNETSGVYMSGLQTEQALSLYLGVVPADLQTSILSSLVDDIEVTNIGHTTSGIIGIKYAMEVLSKLDRGDVALDLALQTTYPSWGYMVNSQYEPATTVWELWDSDTAGPGMNSRNHHMFGS-ITSWFYKYVAGIQ 917          
The following BLAST results are available for this feature:
BLAST of NO01G03970 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match NameE-valueIdentityDescription
WP_066792603.12.600e-12334.02alpha-L-rhamnosidase [Caldivirga sp. MU80][more]
OGD23077.16.000e-12038.90hypothetical protein A2W03_00645, partial [Candida... [more]
OGU26950.11.700e-11931.94alpha-L-rhamnosidase [Ignavibacteria bacterium GWA... [more]
WP_012185241.12.300e-11934.57alpha-L-rhamnosidase [Caldivirga maquilingensis] >... [more]
WP_012548615.17.300e-11832.54alpha-L-rhamnosidase [Dictyoglomus thermophilum] >... [more]
WP_013782175.16.200e-11734.10alpha-L-rhamnosidase [Mahella australiensis] >AEE9... [more]
WP_008193444.16.200e-11732.85MULTISPECIES: alpha-L-rhamnosidase [Thermotoga] >E... [more]
OHE61220.15.800e-11534.07hypothetical protein A2Y36_07205 [Treponema sp. GW... [more]
GAK50072.11.300e-11432.48alpha-rhamnosidase [Candidatus Moduliflexus floccu... [more]
OEU17453.16.400e-11437.90Bac_rhamnosid-domain-containing protein [Fragilari... [more]

Pages

back to top
Relationships

This gene is member of the following syntenic_region feature(s):

Feature NameUnique NameSpeciesType
nonsL005nonsL005Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ncniR047ncniR047Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR091ngnoR091Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region


This gene is orthologous to the following gene feature(s):

Feature NameUnique NameSpeciesType
NSK007896NSK007896Nannochloropsis salina (N. salina CCMP1776)gene


The following polypeptide feature(s) derives from this gene:

Feature NameUnique NameSpeciesType
NO01G03970.1NO01G03970.1-proteinNannochloropsis oceanica (N. oceanica IMET1)polypeptide


The following gene feature(s) are orthologous to this gene:

Feature NameUnique NameSpeciesType
jgi.p|Nanoce1779_2|577244gene_408Nannochloropsis oceanica (N. oceanica CCMP1779)gene
Naga_100136g2gene1299Nannochloropsis gaditana (N. gaditana B-31)gene


The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameSpeciesType
NO01G03970.1NO01G03970.1Nannochloropsis oceanica (N. oceanica IMET1)mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO01G03970 ID=NO01G03970|Name=NO01G03970|organism=Nannochloropsis oceanica|type=gene|length=5426bp
TGTTGCTTCTCCGTGCAGTATCAAGTGCCTTTTTGGTTACCTGAGCATGC
CACACAAAGCAGCACATGAGCAAGGCGATAGATGCAGATGTACATGTGGT
CTTTCTCACGCTTCCTAAAATATCCTTTCTGTGACAGGACGCAATCTGCC
CTGTTGTGGCGCCTTGCAGCATACGGCTTGCTGGTACCTCCCGCAGGGTG
ACACATCGATTTTCCCTGCCACCACCACATCTCTCCCCGGAGCAATTAGA
ACACACACCAAGAGGTTTATTGAGATTGCTGCAGACTATGTGCTTTCATT
GATTGTTCGACACACATACAAAACCATTACACTCATCCAGCCACAGCAAA
TATCAACCGCCACGCCCTCACACGACCCCGTTCCTCGCACACCATTATGC
GCACAGATCCATGCAGCTTCTAGCTGTTCCCCTCTCGGCGATGGTGGCAA
CTATTGCTTGCTTCTCTCTGGCTGATTCTCTGGgtgcgtatggacggaag
gaggggaaatggagggaaatcattcaccatgattgccctcttcggtcttc
tcagggctcgcgaccacgccaagcttttgcccaaagtctccatcgtcatt
cccttgcagccggatagcgacagacacgacttctcccaccagatcatctc
cccttaattctctcattcattacgatccacaacccaacgacagAGTACAC
CATTACCAGCCAGCGAGTCGAGTATATGCCCTGTCCCGCCATGGGGATCG
ACGTGGCCAACCCACGGTTTGCGTGGGTGGTGGAGCCTTCTGATATATCC
GTCCGTGGTGCCTACCAAGAGGCTTTTCGCATCATTATCACGGCTGCGGA
TGCTGGCAGAGAAGTGATTTGGGATTCGGGGCGTGTCAACAATTCGGAGA
GCCGGCACATAAGATTTGGCTCGGTGGGAGAGCTTCCCACAGCCGCTTTG
GCTAGTGCGACTGCTTATGATTGGACGGTGACTGCGTGGGTGGCGACGGA
TGTGTTGCCTGTGTCTCCCTCCATCTCCTCTTCCACCACGACGAGGAAGA
AGGCGGAGACGACGGGAGCAGTGCAAAGCCAACCTTCGCCGGCAGCTCGT
TTTGTGACGGGCATGCTAAATGCTTCCAAGGACTGGGCGGGGGCCGACTG
GCTAGTGGCTGCCAATAGCTCCACACAAAATCTGTGTCGTGCACCATTTA
TCGTCCCTAATAGCGAGAAGGTCGAGCGGGCGTTGTTGGTGATGGCGGGG
TTGGGGTATTTCCAAGCGTCTTTGAATGGAGCGAGGGTCTCGGATGCTGA
ATTAGAAAGCGGATGGACGGATTACGGACGACGGGTGCCTTATTCGGTCT
ATGATGTCACAGGCTTGATAAAAGGAGGGGAAAGGGGTGGGGGTGGGAAT
GTGTTGGGTATCTCTTTGGGCAATGGCTGGTATAGCACGAATGGAGGGGC
GGAACCCACCGACGTCCCCCGGACGTTTATTGCCAAGCTTTTTGTGAATG
GGAAGGAAGTCCTCAGCACCTCTTCGAAGCAGCAGGAAATGTGGTTGTGT
GCCGTCGGCCCCATTGTATACGACTCTGTGTATAACGGGGAAGTTTTCGA
CGGTCGGCTCGCTGCTCGCCTTGTCGGTTGGGATGATCCGGCCTTCTTCA
AAACCGGCATTTCTGGTGCTGCTGGTAGTGATGGATTTGCTGCCACGTGG
GTACCTGCAGTGGTTTCAAGCAACCCACCTACTGGTGAATTGATTTCTCC
AAAGATGCCTCCCATTCGTCGGATGGAAGAATTGGTGGCTATTTCTCTCA
CAGAACCACAGCAAGGGGTGTTTGTCGTGGATTTTGGCCAGAATATCGCC
GGGCGGGTAGCACTGTCTTTGCCTAGAAGAGGGAAGGGGAGCGTGACTCT
AAGGCATGCCGAGGTGTTGCAACATGCAGGCATTGCGGCCGTGCCCGAAC
CTGGAATGATCTCGGTGGATAATTTGAGAGGCGCACGGGCAACAGATGTG
TATATTTTTGATGAGGATGAAAGGGAAAATGGAAGGAAAGAAGCGGCGAG
GAGAAGGGAGGGAGAGGTGGAAGAAGAGGAGAGGGTGGTGTTTGAACCAA
CATTTACATATCATGGGTTTCGGTATGTGGAGATACGGGGATATGAGCCG
CGGCTGAGTGATGTGAAAGCGATTGTTTTGCACACAGACGTCGTGCCCAT
TGGTGATATTACCTTCTCGGACGCGTTGCTGAATCAGATCCAAAAGGCGA
TCCATTGGGGCCAACGGGCTAATATCATGTCAGTCCCCACCGACTGTCCT
CAGCGGGATGAACGCAAGGGATGGTTAGGAGATGCCCAGCTATCCGGGGA
AGAGGCCTTGTTTAATTTTGATATGGTTGCAACCTACTTAAAATTTGTCC
GGGACATTACTGATACCCAGGATGCGGCAGGCCAGATTGGCGATACTGCT
CCTTTTTCTATCGGGGGACGCCCGGCGGATCCTTCGTGGGGTTCAGCTTT
CCCGAGCTTGGTTTACTTTTTGTATGAAGAGACAGGAGATGTGGGTATTT
TGGAGGAGTTTTATCCTGCGTTGGCGAAGTATGTAGATTCGGTGCTGCTG
GCGGCGGAGGTTGCGGGGGTGGTGAACCTATACAAATCCTACGGCGATTG
GTGTCCTGTACCGGGGCAGCCCATGTGCTCCCCGCACTTGACGGGCTCGT
TTTCCTTCCTTGAGAATCTGCAACAGCTATCCGATATGGCCTTTGCGATG
GGGAAGGAGGAGGACGGGAAAGTGTATGGGGCATATCATCGTTCTTTCAC
ACGCGCATTCCATGCCGCCTTTTACCATGGCGGGACCTATGATAATGGGG
TTCAGACGTGCCTTGCCCTCCCTTTGTGGGCGGGCGCCGTCCCGGCTGAT
TTGCAAAAGGGGGTTGAGGAAAAGCTCGTAAGCGATTTGGTGGACACACA
GAAGTACCACTCTACCACTGGCATCATCGGGATGAAATACATGTTGGAAG
TATTATCGCGCCTGGGAAAGACAGACGTAGCTGTCCAAATGCTGCAACAG
AGGGATTATCCTTCTTTTGGCTATATGCTCACGAACCCCCTCGAGCCGGC
GACGACGATTTGGGAGCTGTGGAACTCGGACCAGGCGGGCCCTGCGATGA
ATTCGCGCAATCACGTCATGTTTGGCGGGCCGGTCGGGGGATGGTTTTAT
AAGTACTTGGGAGGGGTGAGAAAGGGGCCAGCAACGAGGGAGGGGGCTGG
ATACAGGCTGGTAATATTTGCCCCGCCCTTGGCGGCTTGCATGCCGTTGG
AGAAGGTGACGACGACGGTGCGGACGTTGCAGGGAGTGGTGGCGATGGAG
TGGGAGCAAGGAGCAGGGACCCTTCCCACTACTGCTGCTACTGCTGCTGC
TGCTGTCGCTACATCTTCACTCTCGCTGGCGAGCCGCAACCTCTCCAACT
CTTCCAGCCGGCATCTTCAAGCATCGATTCAGGTGCCCTTGGGAAGCAAA
AGTAGGGTTATTTTGGACGGACGGGCCCTTGGAGATGTCAATCCGATGTT
GAGCGTTACTGTAGATGGAGTTGATTTGAAACAAGCAATGAAGATGTCAG
GGCAAGGACTGGAGGTGCTGCAGGCGAGTGATGATTTTGTGGAATTACTG
TTGGCATCGGGGGGGTGGGCGATTGGGGTGAGGTTTGACGTGAGGGAGGA
CGAAGCGTTGAAGTGTATAGCGCCGTCGGAGGTAAAGATGGTGAGTACGG
CAATGGGCGTGGGGGTGGGAGAGAGAGGAATGGCGGACTTTGCAGTGGTT
ACTGAAGGGGTGATTGCTAGTGCATAAGGTTGGGTGAAAAGAAGAGTGTT
TGTGTGTGCGTATGAGTGCGAGTGTGGTTTGTTTGAGCCGGCGAGAGCAT
TCCTCTAACCTCCTCTGCAGGTTTCTTCACACTACAATGAATAAGAAAGA
GACAAGGAAAGTGTCAGAGGAATCACATGTATGTGTAGGTGCAATTAAAG
GAGATCATTTGATATACTTCCGCGAGCGACAACGAAGGTAGACAGGTGTG
GATATTTTTGGAAAGATGCAGAAAAAGATAAAGAAGAACAAAAGTAGCAA
GGACTATGAAAAGGAGGGAAGTGTGCGCCTTCTCCACACTATCCTGGTGA
ACAAAAAATGCTAGCTTCATCCTATACCGACGGTCACAGAGCAACCCCAA
AAAATAAAGTACAAGAAAATAAAAAGAAAGAGTAAAACCACCCATCGTGT
TGTGCGTGTGTGTGGGTGTGACGCAGACTTACCCAAGCTGTTGGACTGAA
AAACCTCGCACCCACACAATACTTTCGTTCCGTGCAGTATCCTTCAAAAT
TATTTGAAATATTTCAATGCTAAAGCTTCAATCTGTTCCCCCCCGCCGTC
CGCGCCAGCTCCTTCGCCGTCTCTTGCCCCTTCTCCGTCACCTTCCTCTT
CGCATACGCCCCTCCATGCGCCGACACCTCCTCCTGAAGGCCCGCAAAGA
ACTTCCCGGAAGTACTGTATGCGCCTTGTTGACTCGCTGCTTCCAACGTG
CCTTGCACCACCTTCCCACTCGCCTTGATCTCTGACCGCATTTTCTCTTT
CGCGTATTTGTTCCCCAGACCGGGGTTGATGCGATTCACCAGTTTCTCCT
CCGTTGCTGCCTGCTTTCTGGCCTTGCGCCGTGCTGCCTTCTTATCCCTC
CTAATCCTGTTCCGATCCGCTTGGTCCAATTCCGCCGCTTCTCTCAACAC
CCCCTCCCTTCCACGCTTCTTCCCATGCACCTCCTCCGGTGCCTGGGTGT
CGGCGTGTCCGACCGCCAAAGGCAACACCTCTTCCATCGCGATAGCAGGC
ACGTTCGCCTGCACCGTCATCTCAACCTGTGCGGCCTTCGGGACGAAATT
GAAGTTAGACAGCGCGTCCAATTTCCCACACACCAATCGAAACAATGCCG
CCATCTCTATCTTTTCCCGAGTTTCCTTCGTCTCTTTCTCCCCTCCTTCC
CCCCCCCCCTGGCCCATGGCTAACCGCATGTACTCCTTCTCATAAATCGC
TCCAAGGCCCTCCGTCGACTTCTCCTGAGACACCTCCGGCAGCTCCTCCT
CTCCCTCCCTCCCCCTCCTCCTCCCGTTCATCATCACCTCCTCCCCTAAT
GTCTTGGGGACGACGTCATCCCACCTCTCTTCCATCACCCTCTGCTTGAT
CATCTGTTCAATGGACTGGCCCTTTTCAACCGTCATTAACACCTCGGGTC
GTCGTGCCAGCTCCACGTCCACCACTGCTTCCAAGAGCGAGTTCTCCGGC
CTATCTCCCGCCTTGACCTCTCCCCGCAATTCCCACGGGCGTTCCGACAA
CAACTCATGCTCCAACTCGGCAATCTCCTTGCGTTTTGTTCGCTGCTGTC
GCTGGAAGCTTGTAAGGGTTTTAGGG
back to top

protein sequence of NO01G03970.1

>NO01G03970.1-protein ID=NO01G03970.1-protein|Name=NO01G03970.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1069bp
MQLLAVPLSAMVATIACFSLADSLEYTITSQRVEYMPCPAMGIDVANPRF
AWVVEPSDISVRGAYQEAFRIIITAADAGREVIWDSGRVNNSESRHIRFG
SVGELPTAALASATAYDWTVTAWVATDVLPVSPSISSSTTTRKKAETTGA
VQSQPSPAARFVTGMLNASKDWAGADWLVAANSSTQNLCRAPFIVPNSEK
VERALLVMAGLGYFQASLNGARVSDAELESGWTDYGRRVPYSVYDVTGLI
KGGERGGGGNVLGISLGNGWYSTNGGAEPTDVPRTFIAKLFVNGKEVLST
SSKQQEMWLCAVGPIVYDSVYNGEVFDGRLAARLVGWDDPAFFKTGISGA
AGSDGFAATWVPAVVSSNPPTGELISPKMPPIRRMEELVAISLTEPQQGV
FVVDFGQNIAGRVALSLPRRGKGSVTLRHAEVLQHAGIAAVPEPGMISVD
NLRGARATDVYIFDEDERENGRKEAARRREGEVEEEERVVFEPTFTYHGF
RYVEIRGYEPRLSDVKAIVLHTDVVPIGDITFSDALLNQIQKAIHWGQRA
NIMSVPTDCPQRDERKGWLGDAQLSGEEALFNFDMVATYLKFVRDITDTQ
DAAGQIGDTAPFSIGGRPADPSWGSAFPSLVYFLYEETGDVGILEEFYPA
LAKYVDSVLLAAEVAGVVNLYKSYGDWCPVPGQPMCSPHLTGSFSFLENL
QQLSDMAFAMGKEEDGKVYGAYHRSFTRAFHAAFYHGGTYDNGVQTCLAL
PLWAGAVPADLQKGVEEKLVSDLVDTQKYHSTTGIIGMKYMLEVLSRLGK
TDVAVQMLQQRDYPSFGYMLTNPLEPATTIWELWNSDQAGPAMNSRNHVM
FGGPVGGWFYKYLGGVRKGPATREGAGYRLVIFAPPLAACMPLEKVTTTV
RTLQGVVAMEWEQGAGTLPTTAATAAAAVATSSLSLASRNLSNSSSRHLQ
ASIQVPLGSKSRVILDGRALGDVNPMLSVTVDGVDLKQAMKMSGQGLEVL
QASDDFVELLLASGGWAIGVRFDVREDEALKCIAPSEVKMVSTAMGVGVG
ERGMADFAVVTEGVIASA*
back to top
Synonyms
Publications