NO03G05220, NO03G05220 (gene) Nannochloropsis oceanica

Overview
NameNO03G05220
Unique NameNO03G05220
Typegene
OrganismNannochloropsis oceanica (N. oceanica IMET1)
Sequence length4889
Alignment locationchr3:1488809..1493697 -

Link to JBrowse

Properties
Property NameValue
DescriptionPab-dependent poly-specific ribonuclease subunit 2
Mutants
Expression

Hover the mouse over a column in the graph to view expression values.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
chr3genomechr3:1488809..1493697 -
Analyses
This gene is derived from or has results from the following analyses
Analysis NameDate Performed
PRJNA7699582024-08-13
GSE1786722023-12-15
PRJNA9336932023-10-26
PRJNA9434492023-07-19
GSE1499042020-10-10
NoIMET1_WT_HS2020-09-29
GSE1396152020-03-11
PRJNA2413822020-03-11
BLASTP analysis of N. oceanica IMET1 genes2019-07-11
InterPro analysis for N. oceanica IMET1 genes2019-07-11
PRJNA1821802019-04-25
Gene prediction for N. oceanica IMET12017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: INTERPRO
TermDefinition
IPR038765Papain_like_cys_pep_sf
IPR012337RNaseH-like_dom
IPR011047Quinonprotein_ADH-like_supfam
IPR028881PAN2_dom
IPR013520Exonuclease_RNaseT/DNA_pol3
Homology
BLAST of NO03G05220 vs. NCBI_GenBank
Match: EWM29971.1 (pab-dependent poly -specific ribonuclease subunit 2 [Nannochloropsis gaditana])

HSP 1 Score: 721.1 bits (1860), Expect = 7.500e-204
Identity = 380/632 (60.13%), Postives = 458/632 (72.47%), Query Frame = 0
Query:   36 HEDEELDGPTEGHVPGSWAELGRVPSDGFPITALAFDPIEELLWVGTGNGRLSALAQPTMHKHVSVMAHP--CPVRQIRCFGEGVISISGKEVRMHNRNCLAICGAPSKAMLGEEEGEMEGGKKQASPASLRVGGXXXXXXXXXXXXXXXXXXXXHKKAGKEEFTCGALNVPRSHH---MAAFGGEYLTVGGNQGHVWQLDLVTGLTVVRMIDVTSPSACMESNRAVITGGTDGRLRFLDGMMRSRQGVERELEAYTGPVTSLCTWDDLVVATGTQGRSLNPYDRSGLAPTRFLPDPLIKVFDLRMLRQTLPLSFAPALGSPSLLSFLPGKDKDRLLVASGSTGQFLTCDPFNVTAAETAFFHLQDYQIGAPSPMSCLSTSLSGELLSFGTPSXXXXXXXXXXXXXXXXPTLPLDDILAFPPPAPLSIPPSDPNPASHYVFRPTYLAAPPHPLLELSSSFATTPAVASSLPIRHPPQRVINPALFTKMSVQDFVGYVPNTFYHYPFQSLLFGKARRDAYLVVDPRRKEEEGEGGREDGRGEGEEDWGDSFMGEGGEAGGLEEALAVPERYRKVVIDFSKRGLDGFDFGRYNRTSLVGLENLLPNSYTNAVLQMMFYVPEVKELVLGKQYARW 663
            +ED+++    +GH  G++ E  RVPSDG+PITALAFDPIEELLWVGTGNGRLSALAQP M KHVSV AHP   PVRQIRCFGEGV+SISG++V+MHN+NCLA+C AP+ AMLG +E   EG  +  +  S   GG                      + G+EEFTCGALNVP  HH   MAA+ GE++TVGG+  HVW+LDL TGL VVR ++V SPS CMESNRA++ GG DGRLRFLDG +RS Q VER+LEA+TGPVTS+CTW+D+V ATGTQGRSLNPYDRSG APTR LPDPLIK+FDLRMLRQ+LPLSFAPAL +PSLL+ LP   + RL+V + +TGQFL CDPFNVTAA+TAFF + ++  G PSPM+C++ S SGE+L+ GTP                 P +PL+ + +                ASH++FRPT+LA PPHPLL  +SSFATTPAV SSLP+R PP RVINP L+TKMSVQDF+GYVPN +Y YP  SLL+G A ++AY VVDPRRKE+  E GR +  G   ED G    G G E  G +   A P RYR+V IDFS RGLDGF+FGRYNRT LVGLEN+LPN+YTNA+LQMMF+VPE+K  V   QY R+
Sbjct:   20 YEDDQV-STIQGH--GTYMETHRVPSDGYPITALAFDPIEELLWVGTGNGRLSALAQPGMQKHVSVRAHPLLSPVRQIRCFGEGVVSISGRQVQMHNKNCLALCAAPASAMLGVDERGQEGDIESVN--SRPKGGGHGRPALGKKEEDENVTGTV--RPGQEEFTCGALNVPLGHHHHPMAAYVGEHVTVGGSGTHVWELDLATGLRVVRTLEVASPSTCMESNRAMVIGGADGRLRFLDGSLRSGQAVERDLEAFTGPVTSMCTWNDMVAATGTQGRSLNPYDRSGRAPTRLLPDPLIKLFDLRMLRQSLPLSFAPALVAPSLLTLLPHTAQARLVVGAATTGQFLLCDPFNVTAADTAFFQVPNHHTGGPSPMACVAASPSGEMLALGTPDGLVISYTNSPEAVVNRPPVPLEILPSTXXXXXXXXXXXXXXXASHFLFRPTHLAFPPHPLLPPASSFATTPAVLSSLPVRVPPARVINPELYTKMSVQDFLGYVPNRYYQYPAHSLLYGSAGKEAYRVVDPRRKEDGAEAGRREDPG---EDVGGKEGGAGREDTGED---ARPPRYRRVQIDFSVRGLDGFEFGRYNRTPLVGLENVLPNTYTNAILQMMFFVPEMKAAVCRHQYLRF 638          
BLAST of NO03G05220 vs. NCBI_GenBank
Match: XP_005852481.1 (PAB-dependent poly(A)-specific ribonuclease subunit 2 [Nannochloropsis gaditana CCMP526] >EKU23351.1 PAB-dependent poly(A)-specific ribonuclease subunit 2 [Nannochloropsis gaditana CCMP526])

HSP 1 Score: 451.1 bits (1159), Expect = 1.400e-122
Identity = 238/348 (68.39%), Postives = 263/348 (75.57%), Query Frame = 0
Query: 1214 LEGAKEGEKEGGR---EEWQWYILNDFLVAPTILEDALGFLPTWKEPCILLYRDRETSPAAHAAWLEQLRKAGARMGGGMEGE--EREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSFTPLPPSQMPGRGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSILDAREDIVMMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLIDRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAAFLLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKTGWKLQQ 1557
            +EG + G  E GR    EWQWYI+NDFLV PTILEDALGFLP WKEPCILLYRDR ++ AAHAAWLEQLR AGAR+ GG + E  E +                                    XXXXX            GRGD++GIDAEFVQLEME AS+ E G RVV KEGRQALARLS+LD R D VM+DDYVLPSEPVMDYLTRFSG+ REDLDPSL RHHLV+ARTAYLKLR LIDRGV+LVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAA+LL E+IQGETHDSIEDARTA+RL++KWEE ++EGK+  VLN +YEYGR TGWKLQQ
Sbjct:    1 MEGVEGGGGEDGRGSDPEWQWYIVNDFLVEPTILEDALGFLPAWKEPCILLYRDRTSTAAAHAAWLEQLRAAGARVAGGGKREVGEEKGENDSSADASGAGGLAVGTLPPSIPVSIFASPSRSTXXXXXXXXXXXXXXXXXGRGDLVGIDAEFVQLEMETASVLETGARVVFKEGRQALARLSLLDVRSDQVMVDDYVLPSEPVMDYLTRFSGIVREDLDPSLSRHHLVTARTAYLKLRCLIDRGVILVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAAYLLGESIQGETHDSIEDARTAVRLYQKWEEAKKEGKLAQVLNGVYEYGRTTGWKLQQ 348          
BLAST of NO03G05220 vs. NCBI_GenBank
Match: EWM29970.1 (Exonuclease [Nannochloropsis gaditana])

HSP 1 Score: 448.4 bits (1152), Expect = 9.400e-122
Identity = 237/348 (68.10%), Postives = 262/348 (75.29%), Query Frame = 0
Query: 1214 LEGAKEGEKEGGR---EEWQWYILNDFLVAPTILEDALGFLPTWKEPCILLYRDRETSPAAHAAWLEQLRKAGARMGGGMEGE--EREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSFTPLPPSQMPGRGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSILDAREDIVMMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLIDRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAAFLLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKTGWKLQQ 1557
            +EG + G  E GR    EWQWYI+NDFLV PTILEDALGFLP WKEPCILLYRDR ++ AAHAAWLEQLR AGAR+ GG + E  E +                                    XXXXX            GRGD++GIDAEFVQLEME AS+ E G RVV KEGRQALARLS+LD R D VM+DDYVLPSEPVMDYLTRFSG+  EDLDPSL RHHLV+ARTAYLKLR LIDRGV+LVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAA+LL E+IQGETHDSIEDARTA+RL++KWEE ++EGK+  VLN +YEYGR TGWKLQQ
Sbjct:    1 MEGVEGGGGEDGRGSDPEWQWYIVNDFLVEPTILEDALGFLPAWKEPCILLYRDRTSTAAAHAAWLEQLRAAGARVAGGGKREVGEEKGENDSSADASGAGGLAVGTLPPSIPVSIFASPSRSTXXXXXXXXXXXXXXXXXGRGDLVGIDAEFVQLEMETASVLETGARVVFKEGRQALARLSLLDVRSDQVMVDDYVLPSEPVMDYLTRFSGIVPEDLDPSLSRHHLVTARTAYLKLRCLIDRGVILVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAAYLLGESIQGETHDSIEDARTAVRLYQKWEEAKKEGKLAQVLNGVYEYGRTTGWKLQQ 348          
BLAST of NO03G05220 vs. NCBI_GenBank
Match: OQR98814.1 (hypothetical protein ACHHYP_07866 [Achlya hypogyna])

HSP 1 Score: 362.5 bits (929), Expect = 6.800e-96
Identity = 389/1542 (25.23%), Postives = 558/1542 (36.19%), Query Frame = 0
Query:   53 WAELGRVPSDGF--PITALAFDPIEELLWVGTGNGRLSALAQPTMHKHVSVMAHPCPVRQIRCFGEGVISISGKEVRMHNRNCLAICGAPSKAMLGEEEGEMEGGKKQASPASLRVGGXXXXXXXXXXXXXXXXXXXXHKKAGKEEFTCGALNVPRSHHMAAFGGEYLTVGGNQGHVWQLDLVTGL--------TVVRMIDVTSPS-------ACMESNRAVITGGTDGRLRFLDGMMRSRQGVERELEAYTGPVTSLCTWDDLVVATGTQGRSLNPYDRSGLAPTRFLPDPLIKVFDLRMLRQTLPLSFAPALGSPSLLSFLPGKDKDRLLVASGSTGQFLTCDPFNVTAAETAFFHLQD---YQIGAPSPM----SCLSTSLSGELLSFGTPSXXXXXXXXXXXXXXXXPTLPLDD---ILAFP---PPAPLSIPPSDPNPASHYVFRPTYLA--APPHPLLE-LSSSFATTPAVASSLPIRHPPQRVINPALFTKMSVQDFVGYVPNTFYHYPFQSLLFGKARRDAYLVVDPRRKEEEGEGGREDGRGEGEEDWGDSFMGEGGEAGGLEEALAVPERYRKVVIDFSKRGLDG--FDFGRYNRTSLVGLENLLPNSYTNAVLQMMFYVPEVKELVLGKQYARWCWGNEKSVVCEMGFLFYMLQMAMAEAMGVGTGTVGREXXXXXXXXXXXXXXXXXXXXXXXXXXXPPAAPKPCQSSNFLRVYRHVPEVVALGLLEASAAAPVQQRAEGAHRFLLSQIHGEDSAVAPLSASFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNGLEKSVIDTLYGYTACTTNTFLHPPNLPSKVTSTRSFILELVYPRTLSRGAGGSPTPSPTLKPAQPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXILGVLTSINAXXXXXXXXXPPLPTFSQILHSSLCRETRSRGWCEASKAYEPLKQQRVASTLPSLLALHGGVTVGTGKEVARQIWHQLNPLGGPWLPERIRVRVGGRRKGGKEGGVSVMERVXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTVRMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGEAGEEGHLLLHVKVPVASEWKGAGEKLEGAKEGEKEGGREEWQWYILNDFLVAPTILEDALGFLPTWKEPCILLYRDRETSPAAHAAWLEQLRKAGARMGGGMEGEEREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSFTPLPPSQMPGRGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSILDAREDIVMMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLIDRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAAFLLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKTGWKLQQDQE 1560
            W+E+ R+   G     TA+AFD  +EL+W G  NGRL+A   PT+ K+ SV+++   ++ I    +G+++IS       +R C+                            ++ V G                         +   TCG L  P   H A+   ++L +G + G V   DL   L        T V  +D    S       A  E    +  G   G+L   D  +RS +       A+ G + S+  +   ++  G   RS+NPYD++  AP +  PDPL+K+FDLR +     + F PA  SPS + F P         A+     +L              F ++D   YQ     P+    + L+ + SGELL  G                   P   LDD    LAFP      PL++ P +P PAS Y+FRPT  A  AP  PL   L  +  +T     +L +   P +V++P    ++  +  +G+  +        S +FG  R  A+  VDPR  +++        +   E D                    VP RY+   I  SK G+DG  FDF ++N T  VGLEN LP SY NA+LQ+++++P V+    G      C      VVCE+GFLF+M+  A A+A                                       P   K CQ++N L   R VP  VALGL +   AA +  RA+    FL+  +     A+A L                                                                                                   +VT        L YP   S  A  +                                                                 +F  +L +SL                    Q+  A T    L L  GV    G++  +++W      G  W+P + RVR+          G  V+E                                V                                                                                  +G   HL+ H+  P                  E      E  W++ NDF V PT+  DA+ F   WK P +L+YR R T                   GG    E   A                                         S  PL  + +P RGD + ID EFV +EME A+++ DGTR+V+KE RQALAR+S++    ++V +DDY+L SEPV+DYLTRFSGL  +DL+P++ RHH+V  + AY+KLRYL+DRG + VGHGL KDFRI+N++VPP QVIDTV+L+     RKI+LRFL  +L K +IQ ETHDSIEDAR AL L  K+ E+  + + E  L  +Y  GR++ WK+   +E
Sbjct:   33 WSEIVRLAPGGSNQGCTAVAFDQCQELVWTGHANGRLTAHLLPTLEKYSSVVSNTGAIKHIIPCYDGIVAISKSMAVFRSRGCV-------------------------HNDTVAVDG------------------------SRGSLTCGHLR-PYDGHKAS---DHLLLGNSVGCVAAYDLQGQLSHPLHKTPTPVWSMDFYHGSKIAITALASHEDAPMICAGSATGQLDLFDAGLRSHRVECSIPNAHAGNIVSMDMYGHYIITCGVSARSINPYDKN--APVKIYPDPLVKLFDLRTMGLVQSMPF-PATTSPSFVQFQPHAPGTHFYAANPDGDLYL--------------FDIEDPGQYQYSPVGPLARNVAALAVAPSGELLVTG---CNDGTVVLFDSSQAANPRALLDDPLEALAFPTAYQAPPLALSPLEPAPASRYLFRPTINAYGAPIPPLSSWLPPNMDSTLKHEVTLVVAPKPIKVLDPTFAGRVQQKGTIGFTQHG--GVLKNSFVFGHGRTAAFTTVDPRFVDKKITHRVAQAKSFDESD-----------------PCHVPPRYKYKEIRMSKHGMDGFMFDFAKHNATPFVGLENTLPFSYMNALLQLLYFLPSVR----GHALQHLC-DAPVCVVCELGFLFHMMNEAAAKA---------------------------------------PRHAKSCQATNLLTTLRQVPAAVALGLFD--TAASIVPRADAFFSFLIDTL-----ALADL--------------------------------------------------------------------------------------------------ERVT--------LQYPEVASGAARDA-----------------------------------------------------------------SFGDVLQASL--------------------QELPADT----LCLQTGVGSWLGQDDIKELWATEKATGASWVPTQFRVRL--------VDGAVVVEEPSGDDWAPSPDDFVLAGVAAAVVRDLGKSHLV----------------------------------------------------------------------------------SGPNAHLIAHILGP------------------ETPNKAPEENWFLFNDFSVTPTVALDAVAFHVPWKYPSVLVYRRRAT-----------------LQGGSTALEPAVAIPSSVFHAPAVGVG---------------------------SAEPLDVATLPQRGDRVAIDTEFVIVEMEEATLQTDGTRIVTKESRQALARVSLIHGETNVVFVDDYILSSEPVVDYLTRFSGLVADDLNPAVSRHHVVPLKAAYMKLRYLVDRGCLFVGHGLGKDFRIVNLFVPPDQVIDTVELYQQPNMRKIALRFLVVYLFKAHIQLETHDSIEDARAALMLHNKYRELMTKNEFERTLMEIYAAGRQSRWKIADLEE 1084          
BLAST of NO03G05220 vs. NCBI_GenBank
Match: XP_008607298.1 (hypothetical protein SDRG_03442 [Saprolegnia diclina VS20] >EQC39237.1 hypothetical protein SDRG_03442 [Saprolegnia diclina VS20])

HSP 1 Score: 355.1 bits (910), Expect = 1.100e-93
Identity = 376/1540 (24.42%), Postives = 548/1540 (35.58%), Query Frame = 0
Query:   53 WAELGRVPSDG--FPITALAFDPIEELLWVGTGNGRLSALAQPTMHKHVSVMAHPCPVRQIRCFGEGVISISGKEVRMHNRNCLAICGAPSKAMLGEEEGEMEGGKKQASPASLRVGGXXXXXXXXXXXXXXXXXXXXHKKAGKEEFTCGAL-----NVPRSHHMAAFGGEYLTVGGNQGH-----------VWQLDLVTGLTVVRMIDVTSPSACMESNRAVITGGTDGRLRFLDGMMRSRQGVERELEAYTGPVTSLCTWDDLVVATGTQGRSLNPYDRSGLAPTRFLPDPLIKVFDLRMLRQTLPLSFAPALGSPSLLSFLPGKDKDRLLVASGSTGQFLTCDPFNVTAAETAFFHLQDYQIGAPSPM----SCLSTSLSGELLSFGTPS----XXXXXXXXXXXXXXXXPTLPLDDILAFPPPAPLSIPPSDPNPASHYVFRPT---YLAAPPHPLLELSSSFATTPAVASSLPIRHPPQRVINPALFTKMSVQDFVGYVPNTFYHYPFQSLLFGKARRDAYLVVDPRRKEEEGEGGREDGRGEGEEDWGDSFMGEGGEAGGLEEALAVPERYRKVVIDFSKRGLDG--FDFGRYNRTSLVGLENLLPNSYTNAVLQMMFYVPEVK-ELVLGKQYARWCWGNEKSVVCEMGFLFYMLQMAMAEAMGVGTGTVGREXXXXXXXXXXXXXXXXXXXXXXXXXXXPPAAPKPCQSSNFLRVYRHVPEVVALGLLEASAAAPVQQRAEGAHRFLLSQIHGEDSAVAPLSASFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNGLEKSVIDTLYGYTACTTNTFLHPPNLPSKVTSTRSFILELVYPRTLSRGAGGSPTPSPTLKPAQPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXILGVLTSINAXXXXXXXXXPPLPTFSQILHSSLCRETRSRGWCEASKAYEPLKQQRVASTLPS-LLALHGGVTVGTGKEVARQIWHQLNPLGGPWLPERIRVRVGGRRKGGKEGGVSVMERVXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTVRMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGEAGEEGHLLLHVKVPVASEWKGAGEKLEGAKEGEKEGGREEWQWYILNDFLVAPTILEDALGFLPTWKEPCILLYRDRETSPAAHAAWLEQLRKAGARMGGGMEGEEREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSFTPLPPSQMPGRGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSILDAREDIVMMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLIDRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAAFLLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKTGWKLQQDQE 1560
            W E+ R+   G    + A+AFDP +E+LW G  NGRL+A   PT+ K+ SV+++   ++ +    +G+++IS       +R C+                                                      H +      TCG L     N P  H +       +     QG            VW +D   G  +      +   A M     +  G   G+L   D  +RS +       A+ G + S+      ++  G   RS+NPYD++  AP +  PDPLIK+FD+R +     + F P   SPS + F P          S  +  +L    F++T   +       YQ     P+    S ++ + SGELL+ G P                     P LPL    +F  P PL++ P +P PAS Y+FRPT   Y  A P     L     +T     +L +   P +V++P    ++  +D +G+  +     P  S ++G+ R  A   VDPR  +++        +   E D                    VP RY+   I  SK G+DG  FDF ++N T  VGLEN LP SY NA+LQ++++VPE++   +L    A  C      V CE+GFLF+M+  A A+A                                       P   K CQ +N L   R VP    LGL + +                        +++ P + +F                                                                  GL   ++DTL                LPS    T ++                 PT    L+ A                                                       TF+ ++  SL                         S LP+  L L  GV         +++W      G  W+P + RV V        +G V V E                                V                                                                                  +G   HL+ H+  P  +  + +                +E  W + NDF V PT+  DA+ F   WK P +L+YR R T  A                                                                    SFTPL  + +P +GD + ID EFV +EME A+++ DGTRVV+KE RQALAR+S++    + V +DDYVLPSEPV+DYLTRFSGL  +DL+PS+ RHH+V  + AY+KLRYL+DRG + VGHGL KDFRI+N++VPP Q+IDTV+L+     RKI+LRFL  +L K +IQ ETHDSIEDAR AL L  K+ ++  + + E  L  +Y  GR++ WK+   +E
Sbjct:   30 WTEIARLAPAGAAHAVGAVAFDPCQEVLWTGHANGRLTAHLLPTLEKYSSVISNAGAIKTLIPCYDGIVAISKSMAVFRSRGCV----------------------------------------------HNDTVAVDHSRGA---LTCGHLRPYDSNKPNDHLLLGNSLGCVAAYDLQGQLSHPLHKTPMPVWSMDFYHGSKIAITALASHDDAPM-----MCAGSATGQLDLFDSSLRSHRVECSIPNAHAGNIVSMDMHGHYILTCGVSARSINPYDKN--APVKVYPDPLIKLFDVRTMGLVQSMPF-PGAASPSYVQFQPHAQGSHYYAGSRDSDLYL----FDITEPGS-------YQYTPLGPLGANASAMTFASSGELLALGAPDGGVVLYDSSQASNPKALLDDPLLPLAFPASFQAP-PLALSPLEPAPASRYLFRPTINDYGEAIPPLSSWLPPHMDSTLKHEVTLVVAPKPTKVLDPNFAARIQQKDTIGFTQHG--GVPKNSFVYGQGRSAAVATVDPRFVDKKMTHRTASAKSFDESD-----------------PCHVPPRYKYKEIKMSKHGMDGFMFDFAKHNGTRFVGLENSLPFSYINALLQLLYFVPELRTHALLHLCDAPVC------VTCELGFLFHMMNEASAKA---------------------------------------PRHAKSCQPTNLLTTLRQVPAATTLGLFDTT------------------------TSILPRADAFF-----------------------------------------------------------------GL---LVDTL---------------GLPSVQRITLTY-----------------PTDPAALRDA-------------------------------------------------------TFADVVSESL-------------------------SGLPAPTLCLQTGVADWLETPDIKELWATEKASGASWVPMQFRVAV-------TDGIVVVHEPTTDAEWTPADDDYVLVGVAAGVVRDVRRSHLV----------------------------------------------------------------------------------SGHNAHLIAHILDPDVTSSRSSD---------------DEHNWLLFNDFSVTPTVGLDAVAFHVPWKFPSVLVYRQRATLRA-------------------------------------------IATPEPTVDIPTSVFHAPSVTPSSSSFTPLELATLPQKGDRVAIDTEFVIVEMEEATLQTDGTRVVTKESRQALARVSLIHGETNTVFVDDYVLPSEPVVDYLTRFSGLVADDLNPSVSRHHVVPLKAAYMKLRYLVDRGCLFVGHGLGKDFRIVNLFVPPEQIIDTVELYQQPNMRKIALRFLIVYLFKAHIQLETHDSIEDARAALMLHNKYRDLMSKHEFERTLMEIYAAGRQSRWKIADLEE 1085          
BLAST of NO03G05220 vs. NCBI_GenBank
Match: XP_012193937.1 (hypothetical protein SPRG_00453 [Saprolegnia parasitica CBS 223.65] >KDO35609.1 hypothetical protein SPRG_00453 [Saprolegnia parasitica CBS 223.65])

HSP 1 Score: 353.2 bits (905), Expect = 4.100e-93
Identity = 372/1539 (24.17%), Postives = 543/1539 (35.28%), Query Frame = 0
Query:   53 WAELGRVPSDG--FPITALAFDPIEELLWVGTGNGRLSALAQPTMHKHVSVMAHPCPVRQIRCFGEGVISISGKEVRMHNRNCLAICGAPSKAMLGEEEGEMEGGKKQASPASLRVGGXXXXXXXXXXXXXXXXXXXXHKKAGKEEFTCGAL-----NVPRSHHMAAFGGEYLTVGGNQGH-----------VWQLDLVTGLTVVRMIDVTSPSACMESNRAVITGGTDGRLRFLDGMMRSRQGVERELEAYTGPVTSLCTWDDLVVATGTQGRSLNPYDRSGLAPTRFLPDPLIKVFDLRMLRQTLPLSFAPALGSPSLLSFLPGKDKDRLLVASGSTGQFLTCDPFNVTAAETAFFHLQDYQIGAPSPM----SCLSTSLSGELLSFGTPS----XXXXXXXXXXXXXXXXPTLPLDDILAFPPPAPLSIPPSDPNPASHYVFRPT---YLAAPPHPLLELSSSFATTPAVASSLPIRHPPQRVINPALFTKMSVQDFVGYVPNTFYHYPFQSLLFGKARRDAYLVVDPRRKEEEGEGGREDGRGEGEEDWGDSFMGEGGEAGGLEEALAVPERYRKVVIDFSKRGLDG--FDFGRYNRTSLVGLENLLPNSYTNAVLQMMFYVPEVK-ELVLGKQYARWCWGNEKSVVCEMGFLFYMLQMAMAEAMGVGTGTVGREXXXXXXXXXXXXXXXXXXXXXXXXXXXPPAAPKPCQSSNFLRVYRHVPEVVALGLLEASAAAPVQQRAEGAHRFLLSQIHGEDSAVAPLSASFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNGLEKSVIDTLYGYTACTTNTFLHPPNLPSKVTSTRSFILELVYPRTLSRGAGGSPTPSPTLKPAQPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXILGVLTSINAXXXXXXXXXPPLPTFSQILHSSLCRETRSRGWCEASKAYEPLKQQRVASTLPSLLALHGGVTVGTGKEVARQIWHQLNPLGGPWLPERIRVRVGGRRKGGKEGGVSVMERVXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTVRMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGEAGEEGHLLLHVKVPVASEWKGAGEKLEGAKEGEKEGGREEWQWYILNDFLVAPTILEDALGFLPTWKEPCILLYRDRETSPAAHAAWLEQLRKAGARMGGGMEGEEREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSFTPLPPSQMPGRGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSILDAREDIVMMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLIDRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAAFLLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKTGWKLQQDQE 1560
            W E+ R+   G    + A+AFDP +E+LW G  NGRL+A   PT+ K+ SV+++   ++ +    +G+++IS       +R C+                                                      H +      TCG L     N P  H +       +     QG            VW +D   G  +       +  A  E    +  G   G+L   D  +RS +       A+ G + S+      ++  G   RS+NPYD++  AP +  PDPLIK+FD+R +     + F P   SPS + F P          S     +L    F++T   +       YQ     P+    S ++ + SGELL+ G P                     P LPL    +F  P PL++ P +P PAS Y+FRPT   Y  A P     L     +T     +L +   P +V++P    ++  +D +G+  +     P  S ++G+ R  A   VDPR  +++        +   E D                    VP RY+   I  SK G+DG  FDF ++N T  VGLEN LP SY NA+LQ++++VPE++   +L    A  C      V CE+GFLF+M+  A A+A                                       P   K CQ +N L   R VP    LGL + +                        +++ P + +F                                                                  GL   +IDTL G  +    T  +P +  +   +T S ++              S  P+PT                                                                    LC +T    W E                 P +                +++W      G  W+P + RV V        +G V+V E                                V                                                                                  +G   HL+ H+  P                 G      ++  W + NDF V PT+  DA+ F   WK P +L+YR R T  A                                                                    SF PL  + +P +GD + ID EFV +EME A+++ DGTRVV+KE RQALAR+S++    + V +DDYVLPSEPV+DYLTRFSGL  +DL+PS+ RHH+V  + AY+KLRYL+DRG + VGHGL KDFRI+N++VPP Q+IDTV+L+     RKI+LRFL  +L K +IQ ETHDSIEDAR AL L  K+ ++    + E  L  +Y  GR++ WK+   +E
Sbjct:   30 WTEIARLAPAGAAHAVGAVAFDPCQEVLWTGHANGRLTAHLLPTLEKYSSVISNAGAIKTLIPCYDGIVAISKSMAVFRSRGCV----------------------------------------------HNDTVAVDHSRGA---LTCGHLRPYDSNKPNDHLLLGNSAGCVAAYDLQGQLSHPLHKTPMPVWSMDFYHGSKI-----AITALASHEDAPMMCAGSATGQLDLFDSSLRSHRVECSIPNAHAGNIVSMDMHGHYILTCGVSARSINPYDKN--APVKVYPDPLIKLFDVRTMGLVQSMPF-PGAASPSYVHFQPHAQGSHYYAGSRDGDLYL----FDITEPAS-------YQYTPLGPLGTNASAMAFASSGELLALGAPDGGVVLFDSSQASNPKALLDDPLLPLAFPASFQAP-PLALSPLEPAPASRYLFRPTINDYGEAIPPLSSWLPPHMDSTLKHEVTLVVAPKPTKVLDPTFAARIQQKDTIGFTQHG--GVPKNSFVYGQGRVAAVATVDPRFVDKKVTHRTASAKSFDESD-----------------PCHVPPRYKYKEIKMSKHGMDGFMFDFAKHNATRFVGLENSLPFSYINALLQLLYFVPELRAHALLHLCDAPVC------VTCELGFLFHMMNEASAKA---------------------------------------PRHAKSCQPTNLLTTLRQVPAATTLGLFDTT------------------------TSILPRADAFF-----------------------------------------------------------------GL---LIDTL-GLPSVQRVTLTYPTDPAALCDATFSDVV----------SGSLSVRPAPT--------------------------------------------------------------------LCLQTGVADWLET----------------PDI----------------KELWATEKASGASWVPMQFRVTV-------TDGIVAVHEPTPDQEWTPADDDYVLVGVAAAVVRDVRRSHLV----------------------------------------------------------------------------------SGHNAHLIAHILDP---------------DVGSSRSRDDDHNWLLFNDFSVTPTVGLDAVAFHVPWKFPSVLIYRQRATLRA-------------------------------------------IATPEPTVDIPHSVFHAPSVAQSSSSFEPLDVATLPQKGDRVAIDTEFVIVEMEEATLQTDGTRVVTKESRQALARVSLIHGETNTVFVDDYVLPSEPVVDYLTRFSGLVADDLNPSVSRHHVVPLKAAYMKLRYLVDRGCLFVGHGLGKDFRIVNLFVPPEQIIDTVELYQQPNMRKIALRFLIVYLFKAHIQLETHDSIEDARAALMLHNKYRDLMATQEFERTLMEIYAAGRQSRWKIADLEE 1085          
BLAST of NO03G05220 vs. NCBI_GenBank
Match: ORX88696.1 (cysteine proteinase [Basidiobolus meristosporus CBS 931.73])

HSP 1 Score: 348.6 bits (893), Expect = 1.000e-91
Identity = 371/1520 (24.41%), Postives = 556/1520 (36.58%), Query Frame = 0
Query:   53 WAELGRV----PSDGFPITALAFDPIEELLWVGTGNGRLSALAQPTMHKHVSVMAHPCPVRQIRCFGEGVISISGKEVRMHNRNCLAICGAPSKAMLGEEEGEMEGGKKQASPASLRVGGXXXXXXXXXXXXXXXXXXXXHKKAGKEEFTCGALNVPRSHHMAAFGGEYLTVGGNQGHVWQLDLVTGLTVVRMIDVTSPSACMESNRAVITGGTDGRLRFLDGMMRSRQGVERELEAYTGPVTSLCTWDDLVVATGTQGRSLNPYDRSGLAPTRFLPDPLIKVFDLRMLRQTLPLSFAPALGSPSLLSFLPGKDKDRLLVASGSTGQFLTCDPFNVTAAETAFFHLQDYQIGAPSPMSCLSTSLSGELLSFGTPSXXXXXXXXXXXXXXXXPTLPLDDILAFPPPAPLSIPPSDPNPASHYVFRPTYLAAPPHPLLELSSSFATTPAVASSLPIRH-----PPQRVINPALFTKMSVQDFVGYVPNTFYHYPFQSLLFGKARRDAYLVVDPR-RKEEEGEGGREDGRGEGEEDWGDSFMGEGGEAGGLE-EALAVPERYRKVVIDFSKRGLDGFDFGRYNRTSLVGLENLLPNSYTNAVLQMMFYVPEVKELVLGKQYARWCWGNEKSVVCEMGFLFYMLQMAMAEAMGVGTGTVGREXXXXXXXXXXXXXXXXXXXXXXXXXXXPPAAPKPCQSSNFLRVYRHVPEVVALGLLE---ASAAAPVQQRAEGAHRFLLSQIHGEDSAVAPLSASFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNGLEKSVIDTLYGYTACTTNTFLHPPNLPSKVTSTRSFILELVYPRTLSRGAGGSPTPSPTLKPAQPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXILGVLTSINAXXXXXXXXXPPLPTFSQILHSSLCRETRSRGWCEASKAYEPLKQQRVASTLPSLLALHGGVTVGTGKEVARQIWHQLNPLG-GPWLPERIRVRVGGRRKGGKEGGVSVMERVXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTVRMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGEAGEEGHLLLHVKVPVASEWKGAGEKLEGAKEGEKEGGREEWQWYILNDFLVAPTILEDALGFLPTWKEPCILLYRDRETSPAAHAAWLEQLRKAGARMGGGMEGEEREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSFTPLPPSQMPGRGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSIL---DAREDIVMMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLIDRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQ-RKISLRFLAAFLLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKTGWK 1554
            W E   +    P   FPIT  AFDP  EL+W+G  NGR+ +    T+ K+ S   H  PVRQI      VIS+    +RM NR      G P   + G+   +       A P S                                                      L V GNQ     ++L  G  ++  ID  S    M+ +R +  G T G +   D         E  ++A+TG ++ L    +L +  G   R+              + DP++KV+D+R ++   P++F      P+++ F P K    + VAS S GQF  CD  N  +A+  F     +Q+   S ++CL TS+SGE + FG  S                     D+ +A     PL +P  DP      V  P  +     PL  +   + T P + S  P         P   I P +   M   DFVGY PN       ++ +  +  RD      P+ R E+E E      R + E            E   L+   LAV + YR+V I +S+ G++ FDFG YN+T   GLE  + NSY N++LQ++F++P +K   + K + +     E  ++CE+GFLF ML+                                              A  + CQ++NFLR +  +P+ VALGL E    +A        +  +RF+L Q+H E +A+                                                                        +    S+I  ++G    ++N  L    +     +T  F ++LVYP+        S T S   KP+                                                       +  +IL SS+ R+T+++ WC + + Y+   Q++    LP++ +    V   T  +    +W      G   WLP  I++ + G             + V                              V                                                                                    E  HL+ H+KV                 + E E G+ E  WY  NDF+V     E+   F   WK P +L +   + S     + L  +                                                           S+ PL   +MP  G + GIDAEFV +  E   I  DG+R + +  R ALAR+S+L     +E I  +DDY+  SEPV+DYLT +SG++  DLDPSL +H +V  + AY KLR L+D G + VGHGL KDFRIIN+ VPP+QVIDTVD++ ++ + RKISL+FLA +LL ++IQ + HDSIEDARTAL +++K+  ++ +G  E VL  +Y  G K  WK
Sbjct:    4 WYEFATICDPNPVSAFPITTCAFDPHAELIWIGDENGRVCSYDIETLEKYTSFKGHTGPVRQITVTDRAVISLGPNSIRMTNRR-----GVPQSIVRGDHVHDFHAMVNSALPQS-----------------------------------------------------ELFVAGNQSRTLVVNLDRG-AIINQIDTDSGIFVMKRSRLICCGSTSGEVTLRDSRTFK---AEHRVQAHTGTMSDLDVSGNLFITCGFSYRA-----------GSLISDPIVKVYDIRTMQPLPPIAFPM---GPTMIKFHP-KIPTSVFVASQS-GQFQLCDVSN-QSADICF-----HQVNVNSYINCLDTSVSGEFIGFGDASNIVHVWSDR------------DEPIASNYSRPLELP--DP------VEYPDVIVDEESPLSSVGMPYYTEP-LLSVWPAHFTFEVGKPAPSIEPEVLQNMKTIDFVGYAPNP--RTRLRNQIPPRPGRDRKSSDTPKFRSEQEREKHLRRLRKKSE------LASPKEEDISLQNNPLAVFKLYRRVEIKYSRFGVEDFDFGYYNKTHFSGLETHITNSYCNSLLQVLFFIPTLKN--IAKSHVKSNCPKEFCLLCELGFLFQMLE---------------------------------------------DAKGRNCQATNFLRAFSTIPQAVALGLFEPDQPTAETSYSTLIQNFNRFILEQLHQESNAMG-----------------------------------------------------------LNICVRKDISGNDPTLPSIIQQVFGLKTVSSNKCLCGAQID---RTTYPFAIDLVYPK-------NSTTSSSPPKPS-------------------------------------------------------SLVEILQSSISRQTQAKAWCNSCQRYQSSTQKKSLQELPNIFS----VNCSTLTQSNLDLWRSSAVAGTNSWLPLAIQMELKG-------------DEVHITNPSPQPPVHDDENNERASNVATYELYAVVSQIQFEK----------------------------------------------------------------------------EIAHLVAHIKV----------------GKSEMEDGKSE--WYHFNDFMVKNIPEEEVTNFKHPWKTPSVLYFVRTDLSSKVDVSALPDV--------------------------------------VDNTILFKDYSISRLRHKLPRSYKPLTVDEMPNPGFLCGIDAEFVAMTKEETEIWSDGSRRLIRPSRLALARVSVLRGEGPKEMIPFIDDYIATSEPVVDYLTEYSGISAGDLDPSLSKHTVVPLKAAYKKLRLLLDMGSIFVGHGLNKDFRIINIIVPPSQVIDTVDIFHIKNRHRKISLKFLAWYLLNQDIQTDMHDSIEDARTALAIYKKYLLLKEQGLFEQVLEDIYNVGMKYNWK 1090          
BLAST of NO03G05220 vs. NCBI_GenBank
Match: PKK68457.1 (cysteine proteinase [Rhizophagus irregularis])

HSP 1 Score: 338.2 bits (866), Expect = 1.400e-88
Identity = 365/1509 (24.19%), Postives = 560/1509 (37.11%), Query Frame = 0
Query:   65 PITALAFDPIEELLWVGTGNGRLSALAQPTMHKHVSVMAHPCPVRQIRCFGEGVISISGKEVRMHNRNCLAICGAPSKAMLGEEEGEMEGGKKQASPASLRVGGXXXXXXXXXXXXXXXXXXXXHKKAGKEEFTCGA-LNVPRSHHMAAFGGEYLTVGGNQGHVWQLDLVTGLTVVRMIDVTSPSACMESNRAVITGGTDGRLRFLDGMMRSRQGVERELEAYTGPVTSLCTWDDLVVATGTQGRSLNPYDRSGLAPTRFLPDPLIKVFDLRMLRQTLPLSFAPALGSPSLLSFLPGKDKDRLLVASGSTGQFLTCDPFNVTAAETAFFHLQDYQIGAPSPMSCLSTSLSGELLSFGTPSXXXXXXXXXXXXXXXXPTLPLDDILAFPPPAPLSIPPSDPNPASHYVFRPTYLAAPPHPLLELSSSFATTPAVASSLPIRHPPQRVINPALFTKMSVQDFVGYVPNTFYHYPFQSLLFGKARRDAYLVVDPRRKEEEG----EGGREDGRGEGEEDWGDSFMGEGGEAGGLEEALAVPERYRKVVIDFSKRGLDGFDFGRYNRTSLVGLENLLPNSYTNAVLQMMFYVPEVKELVLGKQYARWCWGNEKSVVCEMGFLFYMLQMAMAEAMGVGTGTVGREXXXXXXXXXXXXXXXXXXXXXXXXXXXPPAAPKPCQSSNFLRVYRHVPEVVALGLLEASAA---APVQQRAEGAHRFLLSQIHGEDSAVAPLSASFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNGLEKSVIDTLYGYTA-------CTTNTFLHPPNLPSKVTSTRSFILELVYPRTLSRGAGGSPTPSPTLKPAQPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXILGVLTSINAXXXXXXXXXPPLPTFSQILHSSLCRETRSRGWCEASKAYEPLKQQRVASTLPSLLALHGGVTVGTGKEVARQIWHQLNPLGGPWLPERIRVRVGGRRKGGKEGGVSVMERVXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTVRMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGEAGEEGHLLLHVKVPVASEWKGAGEKLEGAKEGEKEGGREEWQWYILNDFLVAPTILEDALGFLPTWKEPCILLYRDRETSPAAHAAWL-EQLRKAGARMGGGMEGEEREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSFTPLPPSQMPGRGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSIL---DAREDIVMMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLIDRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEG-QRKISLRFLAAFLLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKTGWK 1554
            PI+A+AFDP +ELLW G   GR+++     +H++ S  AH  PVRQI    +GVIS+    V+M NR  L       K  L  E+                                              +  C     +P S  +AA         G Q ++  +++  G+ VV+ ++  S    M  +R +  G   G +  +D        VE  ++A+TG ++ + T  +L++  G+  R  N            + DPL+KV+D+R +R  +P+ F      P  L   P K    + +AS S GQF  CD  N     T+  H   YQ+   S ++ +  S SGE+L+FG  +                 + P++      PP   +I  S+ +  S  +  P Y      PLL +  S            + +PP + I+P +   M + DFVGY PN            G  +R+  +V   R+K ++G       +E     G+   G S + +  EA     +  +P+ Y++V I +S+ G+D FDF  YN++   GLE  + NSY N++LQ++F+ P ++  ++ + +       E  + CE+GFLF ML+ A                                               + CQ+SNFLR +  +P+ +ALGL E        P     +  +RF+L Q+H E ++                                                                              S+I  L+G          C T T             T SF+++L +    S+   G    S T                                                            F  IL +S+ RE + R WC   + Y P   +++  +LP +L+    +  G G  V  +IW + +     WLPER+ + +        +  +  ++ +                            +  +                                                                                   E  HL+  +KVP +        +LE   +           WY+ NDF V     ++   F   WK P +L Y   + S     + L  ++ K+                                                        S   L P ++P  G ++ IDAEFV L  E   IR DGT+ V +  R +LAR+S+L    A+E    +DDY+  SEPV+DYLT +SG+   DLDP   +H LV  + AY KLR L+D G VLVGHGLKKDFRIIN+ VPP QVIDTVD++ ++  QRKISLRFLA +LL +NIQ +THDSIEDA TAL +++K+ + + EGK E VL  +Y  G K  WK
Sbjct:   20 PISAIAFDPYQELLWTGNEKGRVTSHFGSGLHRYTSFRAHLNPVRQILVSDKGVISLCSDSVKMTNRRGLI------KWTLSNED--------------------------------------------TTDLHCMTYTTMPNSEILAA---------GKQHNMLVINVARGI-VVKKVESESDIVVMRKSRLICCGANSGEVTLMDPRTFK---VEHRVQAHTGTISDIDTIGNLLLTCGSSARHGN-----------LIIDPLVKVYDIRTMRPLVPMPFPT---GPCFLKMHP-KLSTTVFIASRS-GQFHVCDIGN-----TSDIHF--YQVNTSSYVNAIDLSASGEMLAFGDAASFVHLWEDRKEAKINAYSNPIELPTIQTPP---NITTSERSSLS-LIGMPYY----KEPLLSVWPS-------NMRFEVGNPPPK-IDPDILNNMKMIDFVGYSPNP-----------GNMKRN-QVVRYSRKKHKDGTPKFRSEKERELQSGKSSRGPSSLFD-NEAELDATSTKMPKYYKRVEIQYSRFGVDDFDFEFYNKSRYAGLETHIANSYCNSLLQVLFFTPVLR--LITRSHIGTACTKENCLCCELGFLFRMLENARG---------------------------------------------RNCQASNFLRAFSTIPQALALGLFEPDEPDENTPYSMLIQNFNRFILEQLHQECNS-------------------------------------------------------------NNNPRLLKSLPLEQTSHSMIQQLFGMQVASISKCQCETQT----------DRLTTSFVVDLQFS---SKSHKGKERESKT----------------------------------------------------------KAFVDILRTSIQREIQQRAWCNNCQQYVPTIAKKIPKSLPPVLS----INCGAGTSVPLEIW-RTHDGQNAWLPERVSMDI-------DDDDILTVKELSSDAIVDINTSGSSKHANYELMAIISQVRVEK-----------------------------------------------------------------------------------EIPHLVAFIKVPKS--------ELESTSKS---------PWYLFNDFSVKNITEQEVFNFQGVWKMPVVLYYSRVDISDLMDTSDLPSEIDKS---------------------------------------ILFKDISISKHHSTNKKSVDLLTPEELPQPGTLVAIDAEFVALSQEETEIRSDGTKSVIRPSRLSLARVSVLRGEGAKEGFPFIDDYIAASEPVVDYLTEYSGIKAGDLDPGSSKHTLVPLKIAYKKLRLLLDLGCVLVGHGLKKDFRIINILVPPEQVIDTVDIFHIKNRQRKISLRFLAWYLLNQNIQTDTHDSIEDAHTALLIYKKYLQFKSEGKFEKVLEDIYSEGHKHNWK 1083          
BLAST of NO03G05220 vs. NCBI_GenBank
Match: GBC18663.1 (PAB-dependent poly(A)-specific ribonuclease subunit 2 [Rhizophagus irregularis DAOM 181602])

HSP 1 Score: 335.5 bits (859), Expect = 8.900e-88
Identity = 364/1508 (24.14%), Postives = 559/1508 (37.07%), Query Frame = 0
Query:   66 ITALAFDPIEELLWVGTGNGRLSALAQPTMHKHVSVMAHPCPVRQIRCFGEGVISISGKEVRMHNRNCLAICGAPSKAMLGEEEGEMEGGKKQASPASLRVGGXXXXXXXXXXXXXXXXXXXXHKKAGKEEFTCGA-LNVPRSHHMAAFGGEYLTVGGNQGHVWQLDLVTGLTVVRMIDVTSPSACMESNRAVITGGTDGRLRFLDGMMRSRQGVERELEAYTGPVTSLCTWDDLVVATGTQGRSLNPYDRSGLAPTRFLPDPLIKVFDLRMLRQTLPLSFAPALGSPSLLSFLPGKDKDRLLVASGSTGQFLTCDPFNVTAAETAFFHLQDYQIGAPSPMSCLSTSLSGELLSFGTPSXXXXXXXXXXXXXXXXPTLPLDDILAFPPPAPLSIPPSDPNPASHYVFRPTYLAAPPHPLLELSSSFATTPAVASSLPIRHPPQRVINPALFTKMSVQDFVGYVPNTFYHYPFQSLLFGKARRDAYLVVDPRRKEEEG----EGGREDGRGEGEEDWGDSFMGEGGEAGGLEEALAVPERYRKVVIDFSKRGLDGFDFGRYNRTSLVGLENLLPNSYTNAVLQMMFYVPEVKELVLGKQYARWCWGNEKSVVCEMGFLFYMLQMAMAEAMGVGTGTVGREXXXXXXXXXXXXXXXXXXXXXXXXXXXPPAAPKPCQSSNFLRVYRHVPEVVALGLLEASAA---APVQQRAEGAHRFLLSQIHGEDSAVAPLSASFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNGLEKSVIDTLYGYTA-------CTTNTFLHPPNLPSKVTSTRSFILELVYPRTLSRGAGGSPTPSPTLKPAQPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXILGVLTSINAXXXXXXXXXPPLPTFSQILHSSLCRETRSRGWCEASKAYEPLKQQRVASTLPSLLALHGGVTVGTGKEVARQIWHQLNPLGGPWLPERIRVRVGGRRKGGKEGGVSVMERVXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTVRMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGEAGEEGHLLLHVKVPVASEWKGAGEKLEGAKEGEKEGGREEWQWYILNDFLVAPTILEDALGFLPTWKEPCILLYRDRETSPAAHAAWL-EQLRKAGARMGGGMEGEEREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSFTPLPPSQMPGRGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSIL---DAREDIVMMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLIDRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEG-QRKISLRFLAAFLLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKTGWK 1554
            I+A+AFDP +ELLW G   GR+++     +H++ S  AH  PVRQI    +GVIS+    V+M NR  L       K  L  E+                                              +  C     +P S  +AA         G Q ++  +++  G+ VV+ ++  S    M  +R +  G   G +  +D        VE  ++A+TG ++ + T  +L++  G+  R  N            + DPL+KV+D+R +R  +P+ F      P  L   P K    + +AS S GQF  CD  N     T+  H   YQ+   S ++ +  S SGE+L+FG  +                 + P++      PP   +I  S+ +  S  +  P Y      PLL +  S            + +PP + I+P +   M + DFVGY PN            G  +R+  +V   R+K ++G       +E     G+   G S + +  EA     +  +P+ Y++V I +S+ G+D FDF  YN++   GLE  + NSY N++LQ++F+ P ++  ++ + +       E  + CE+GFLF ML+ A                                               + CQ+SNFLR +  +P+ +ALGL E        P     +  +RF+L Q+H E ++                                                                              S+I  L+G          C T T             T SF+++L +    S+   G    S T                                                            F  IL +S+ RE + R WC   + Y P   +++  +LP +L+    +  G G  V  +IW + +     WLPER+ + +        +  +  ++ +                            +  +                                                                                   E  HL+  +KVP +        +LE   +           WY+ NDF V     ++   F   WK P +L Y   + S     + L  ++ K+                                                        S   L P ++P  G ++ IDAEFV L  E   IR DGT+ V +  R +LAR+S+L    A+E    +DDY+  SEPV+DYLT +SG+   DLDP   +H LV  + AY KLR L+D G VLVGHGLKKDFRIIN+ VPP QVIDTVD++ ++  QRKISLRFLA +LL +NIQ +THDSIEDA TAL +++K+ + + EGK E VL  +Y  G K  WK
Sbjct:   24 ISAIAFDPYQELLWTGNEKGRVTSHFGSGLHRYTSFRAHLNPVRQILVSDKGVISLCSDSVKMTNRRGLI------KWTLSNED--------------------------------------------TTDLHCMTYTTMPNSEILAA---------GKQHNMLVINVARGI-VVKKVESESDIVVMRKSRLICCGANSGEVTLMDPRTFK---VEHRVQAHTGTISDIDTIGNLLLTCGSSARHGN-----------LIIDPLVKVYDIRTMRPLVPMPFPT---GPCFLKMHP-KLSTTVFIASRS-GQFHVCDIGN-----TSDIHF--YQVNTSSYVNAIDLSASGEMLAFGDAASFVHLWEDRKEAKINAYSNPIELPTIQTPP---NITTSERSSLS-LIGMPYY----KEPLLSVWPS-------NMRFEVGNPPPK-IDPDILNNMKMIDFVGYSPNP-----------GNMKRN-QVVRYSRKKHKDGTPKFRSEKERELQSGKSSRGPSSLFD-NEAELDATSTKMPKYYKRVEIQYSRFGVDDFDFEFYNKSRYAGLETHIANSYCNSLLQVLFFTPVLR--LITRSHIGTACTKENCLCCELGFLFRMLENARG---------------------------------------------RNCQASNFLRAFSTIPQALALGLFEPDEPDENTPYSMLIQNFNRFILEQLHQECNS-------------------------------------------------------------NNNPRLLKSLPLEQTSHSMIQQLFGMQVASISKCQCETQT----------DRLTTSFVVDLQFS---SKSHKGKERESKT----------------------------------------------------------KAFVDILRTSIQREIQQRAWCNNCQQYVPTIAKKIPKSLPPVLS----INCGAGTSVPLEIW-RTHDGQNAWLPERVSMDI-------DDDDILTVKELSSDAIVDINTSGSSKHANYELMAIISQVRVEK-----------------------------------------------------------------------------------EIPHLVAFIKVPKS--------ELESTSKS---------PWYLFNDFSVKNITEQEVFNFQGVWKMPVVLYYSRVDISDLMDTSDLPSEIDKS---------------------------------------ILFKDISISKHHSTNKKSVDLLTPEELPQPGTLVAIDAEFVALSQEETEIRSDGTKSVIRPSRLSLARVSVLRGEGAKEGFPFIDDYIAASEPVVDYLTEYSGIKAGDLDPGSSKHTLVPLKIAYKKLRLLLDLGCVLVGHGLKKDFRIINILVPPEQVIDTVDIFHIKNRQRKISLRFLAWYLLNQNIQTDTHDSIEDAHTALLIYKKYLQFKSEGKFEKVLEDIYSEGHKHNWK 1086          
BLAST of NO03G05220 vs. NCBI_GenBank
Match: PKC08154.1 (cysteine proteinase [Rhizophagus irregularis])

HSP 1 Score: 334.7 bits (857), Expect = 1.500e-87
Identity = 364/1509 (24.12%), Postives = 559/1509 (37.04%), Query Frame = 0
Query:   65 PITALAFDPIEELLWVGTGNGRLSALAQPTMHKHVSVMAHPCPVRQIRCFGEGVISISGKEVRMHNRNCLAICGAPSKAMLGEEEGEMEGGKKQASPASLRVGGXXXXXXXXXXXXXXXXXXXXHKKAGKEEFTCGA-LNVPRSHHMAAFGGEYLTVGGNQGHVWQLDLVTGLTVVRMIDVTSPSACMESNRAVITGGTDGRLRFLDGMMRSRQGVERELEAYTGPVTSLCTWDDLVVATGTQGRSLNPYDRSGLAPTRFLPDPLIKVFDLRMLRQTLPLSFAPALGSPSLLSFLPGKDKDRLLVASGSTGQFLTCDPFNVTAAETAFFHLQDYQIGAPSPMSCLSTSLSGELLSFGTPSXXXXXXXXXXXXXXXXPTLPLDDILAFPPPAPLSIPPSDPNPASHYVFRPTYLAAPPHPLLELSSSFATTPAVASSLPIRHPPQRVINPALFTKMSVQDFVGYVPNTFYHYPFQSLLFGKARRDAYLVVDPRRKEEEG----EGGREDGRGEGEEDWGDSFMGEGGEAGGLEEALAVPERYRKVVIDFSKRGLDGFDFGRYNRTSLVGLENLLPNSYTNAVLQMMFYVPEVKELVLGKQYARWCWGNEKSVVCEMGFLFYMLQMAMAEAMGVGTGTVGREXXXXXXXXXXXXXXXXXXXXXXXXXXXPPAAPKPCQSSNFLRVYRHVPEVVALGLLEASAA---APVQQRAEGAHRFLLSQIHGEDSAVAPLSASFSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNGLEKSVIDTLYGYTA-------CTTNTFLHPPNLPSKVTSTRSFILELVYPRTLSRGAGGSPTPSPTLKPAQPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXILGVLTSINAXXXXXXXXXPPLPTFSQILHSSLCRETRSRGWCEASKAYEPLKQQRVASTLPSLLALHGGVTVGTGKEVARQIWHQLNPLGGPWLPERIRVRVGGRRKGGKEGGVSVMERVXXXXXXXXXXXXXXXXXXXXXXXXXXXXKTVRMXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGEAGEEGHLLLHVKVPVASEWKGAGEKLEGAKEGEKEGGREEWQWYILNDFLVAPTILEDALGFLPTWKEPCILLYRDRETSPAAHAAWL-EQLRKAGARMGGGMEGEEREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLSFTPLPPSQMPGRGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSIL---DAREDIVMMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLIDRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEG-QRKISLRFLAAFLLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKTGWK 1554
            PI+A+AFDP +ELLW G   GR+++     +H++ S  AH  PVRQI    +GVIS+    V+M NR  L       K  L  E+                                              +  C     +P S  +AA         G Q ++  +++  G+ VV+ ++  S    M  +R +  G   G +  +D        VE  ++A+TG ++ + T  +L++  G+  R  N            + DPL+KV+D+R +R  +P+ F      P  L   P K    + +AS S GQF  CD  N     T+  H   YQ+   S ++ +  S SGE+L+FG  +                 + P++      PP   +I  S+ +  S  +  P Y      PLL +  S            + +PP + I+P +   M + DFVGY PN            G  +R+  +V   R+K ++G       +E     G+   G S + +  EA     +  +P+ Y++V I +S+ G+D FDF  YN++   GLE  + NSY N++LQ++F+ P ++  ++ + +       E  + CE+GFLF ML+ A                                               + CQ+SNFLR +  +P+ +ALGL E        P     +  +RF+L Q+H E ++                                                                              S+I  L+G          C T T             T SF+++L +    S+   G    S T                                                            F  IL +S+ RE + R WC   + Y P   +++  +LP +L+    +  G G  V  +IW + +     WLPER+ + +        +  +  ++ +                            +  +                                                                                   E  HL+  +KVP +        +LE   +           WY+ NDF V     ++   F   WK P +L Y   + S     + L  ++ K+                                                        S   L P ++P  G ++ IDAEFV L  E   IR DGT+ V +  R +LAR+S+L    A+E    +DDY+  SEPV+DYLT +SG+   DLDP   +H LV  + AY KLR L+D G VLVGHGLKKD RIIN+ VPP QVIDTVD++ ++  QRKISLRFLA +LL +NIQ +THDSIEDA TAL +++K+ + + EGK E VL  +Y  G K  WK
Sbjct:   20 PISAIAFDPYQELLWTGNEKGRVTSHFGSGLHRYTSFRAHLNPVRQILVSDKGVISLCSDSVKMTNRRGLI------KWTLSNED--------------------------------------------TTDLHCMTYTTMPNSEILAA---------GKQHNMLVINVARGI-VVKKVESESDIVVMRKSRLICCGANSGEVTLMDPRTFK---VEHRVQAHTGTISDIDTIGNLLLTCGSSARHGN-----------LIIDPLVKVYDIRTMRPLVPMPFPT---GPCFLKMHP-KLSTTVFIASRS-GQFHVCDIGN-----TSDIHF--YQVNTSSYVNAIDLSASGEMLAFGDAASFVHLWEDRKEAKINAYSNPIELPTIQTPP---NITTSERSSLS-LIGMPYY----KEPLLSVWPS-------NMRFEVGNPPPK-IDPDILNNMKMIDFVGYSPNP-----------GNMKRN-QVVRYSRKKHKDGTPKFRSEKERELQSGKSSRGPSSLFD-NEAELDATSTKMPKYYKRVEIQYSRFGVDDFDFEFYNKSRYAGLETHIANSYCNSLLQVLFFTPVLR--LITRSHIGTACTKENCLCCELGFLFRMLENARG---------------------------------------------RNCQASNFLRAFSTIPQALALGLFEPDEPDENTPYSMLIQNFNRFILEQLHQECNS-------------------------------------------------------------NNNPRLLKSLPLEQTSHSMIQQLFGMQVASISKCQCETQT----------DRLTTSFVVDLQFS---SKSHKGKERESKT----------------------------------------------------------KAFVDILRTSIQREIQQRAWCNNCQQYVPTIAKKIPKSLPPVLS----INCGAGTSVPLEIW-RTHDGQNAWLPERVSMDI-------DDDDILTVKELSSDAIVDINTSGSSKHANYELMAIISQVRVEK-----------------------------------------------------------------------------------EIPHLVAFIKVPKS--------ELESTSKS---------PWYLFNDFSVKNITEQEVFNFQGVWKMPVVLYYSRVDISDLMDTSDLPSEIDKS---------------------------------------ILFKDISISKHHSTNKKSVDLLTPEELPQPGTLVAIDAEFVALSQEETEIRSDGTKSVIRPSRLSLARVSVLRGEGAKEGFPFIDDYIAASEPVVDYLTEYSGIKAGDLDPGSSKHTLVPLKIAYKKLRLLLDLGCVLVGHGLKKDNRIINILVPPEQVIDTVDIFHIKNRQRKISLRFLAWYLLNQNIQTDTHDSIEDAHTALLIYKKYLQFKSEGKFEKVLEDIYSEGHKHNWK 1083          
The following BLAST results are available for this feature:
BLAST of NO03G05220 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match NameE-valueIdentityDescription
EWM29971.17.500e-20460.13pab-dependent poly -specific ribonuclease subunit ... [more]
XP_005852481.11.400e-12268.39PAB-dependent poly(A)-specific ribonuclease subuni... [more]
EWM29970.19.400e-12268.10Exonuclease [Nannochloropsis gaditana][more]
OQR98814.16.800e-9625.23hypothetical protein ACHHYP_07866 [Achlya hypogyna... [more]
XP_008607298.11.100e-9324.42hypothetical protein SDRG_03442 [Saprolegnia dicli... [more]
XP_012193937.14.100e-9324.17hypothetical protein SPRG_00453 [Saprolegnia paras... [more]
ORX88696.11.000e-9124.41cysteine proteinase [Basidiobolus meristosporus CB... [more]
PKK68457.11.400e-8824.19cysteine proteinase [Rhizophagus irregularis][more]
GBC18663.18.900e-8824.14PAB-dependent poly(A)-specific ribonuclease subuni... [more]
PKC08154.11.500e-8724.12cysteine proteinase [Rhizophagus irregularis][more]

Pages

back to top
Relationships

This gene is member of the following syntenic_region feature(s):

Feature NameUnique NameSpeciesType
nonsL069nonsL069Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ncniR000ncniR000Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR054ngnoR054Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region


This gene is orthologous to the following gene feature(s):

Feature NameUnique NameSpeciesType
NSK005351NSK005351Nannochloropsis salina (N. salina CCMP1776)gene


The following polypeptide feature(s) derives from this gene:

Feature NameUnique NameSpeciesType
NO03G05220.1NO03G05220.1-proteinNannochloropsis oceanica (N. oceanica IMET1)polypeptide


The following gene feature(s) are orthologous to this gene:

Feature NameUnique NameSpeciesType
jgi.p|Nanoce1779_2|573268gene_1734Nannochloropsis oceanica (N. oceanica CCMP1779)gene
Naga_100200g3gene678Nannochloropsis gaditana (N. gaditana B-31)gene


The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameSpeciesType
NO03G05220.1NO03G05220.1Nannochloropsis oceanica (N. oceanica IMET1)mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO03G05220 ID=NO03G05220|Name=NO03G05220|organism=Nannochloropsis oceanica|type=gene|length=4889bp
CAGCACAGCTATTGAGCTCGACCTCCTCACTCAGCCATGAACGGCCTGAG
TGCTCCCTTCATCCCGCCAGGGCAGCAGCAGCAGCAGCAGCAGGAGGAGG
AGGAGGAGGAGGAGGACCCCCCATTCTCCCGGTCGGGCTACCACGAGGAC
GAGGAGCTCGACGGCCCTACGGAGGGCCACGTTCCAGGGAGCTGGGCTGA
GCTCGGTCGTGTGCCTAGTGATGGCTTCCCCATCACGGCTCTGGCGTTCG
ACCCTATCGAGGAGCTCCTGTGGGTCGGCACGGGCAACGGCCGCCTCTCT
GCCCTGGCCCAGCCGACCATGCACAAGCATGTATCTGTCATGGCGCATCC
ATGCCCCGTGCGGCAGATCCGTTGTTTTGGGGAGGGTGTGATTTCCATCT
CGGGGAAAGAAGTGAGGATGCACAATCGGAATTGTCTGGCCATTTGTGGG
GCTCCCTCCAAGGCCATGCTGGGGGAGGAGGAGGGAGAGATGGAGGGTGG
GAAGAAACAGGCGAGCCCCGCAAGCCTCCGTGTGGGTGGCCGCGCTGCAG
GGGGGGCTGACAGCGACGATAGCGACGACGGCAGCCGCAATGGCAGCGCC
CACAAGAAAGCCGGCAAAGAAGAGTTTACGTGCGGGGCCTTGAACGTGCC
TAGGAGCCACCACATGGCAGCTTTTGGAGGGGAGTATCTGACGGTGGGCG
GAAACCAAGGACATGTCTGGCAGCTTGATTTAGTAACAGGTCTTACCGTG
GTAAGAATGATAGACGTCACGAGCCCCTCTGCCTGCATGGAGAGTAATCG
CGCCGTTATCACCGGAGGCACCGATGGACGACTGCGGTTCTTAGACGGGA
TGATGCGATCCCGCCAGGGGGTCGAGAGGGAGTTGGAGGCATACACTGGA
CCAGTGACGAGTCTATGTACATGGGACGACTTGGTGGTGGCTACAGGGAC
ACAAGGTCGCTCCCTCAACCCCTACGACCGTTCCGGGCTGGCCCCCACTC
GCTTTTTGCCCGATCCGCTCATTAAGGTATTTGATTTACGGATGCTCCGT
CAAACCCTCCCCCTCTCCTTCGCACCCGCCCTCGGCTCGCCCTCCCTCCT
TTCCTTCCTCCCCGGTAAGGACAAGGACCGCCTTCTCGTCGCCTCCGGCA
GCACAGGTCAGTTCCTCACTTGCGATCCGTTCAATGTCACGGCCGCCGAG
ACCGCATTCTTCCACCTCCAAGACTACCAAATCGGGGCCCCCTCTCCCAT
GTCCTGCCTCTCTACTTCCCTCTCCGGCGAGCTCCTTTCGTTTGGAACCC
CAAGTGGCCTTGTGGTCACCTATGCCACCGACTCTGGCGCGGTCTGCAAC
TACCCCACCCTCCCTCTCGACGACATCCTTGCCTTTCCACCCCCGGCCCC
GCTCTCCATCCCGCCCTCAGATCCCAACCCTGCTTCGCACTACGTCTTCC
GGCCTACCTACCTGGCCGCCCCCCCCCATCCCCTCCTCGAGCTCTCCTCC
TCCTTCGCCACGACCCCCGCAGTCGCCTCCTCCCTCCCGATTCGACACCC
CCCCCAGAGGGTCATCAACCCTGCTTTGTTCACCAAAATGAGCGTGCAGG
ACTTTGTGGGATACGTCCCCAATACCTTCTACCACTATCCTTTTCAATCC
CTCTTGTTTGGCAAGGCCCGGAGGGATGCATACCTAGTCGTGGATCCACG
GAGGAAAGAGGAAGAGGGGGAAGGAGGGAGGGAGGACGGCAGGGGTGAAG
GGGAGGAGGATTGGGGGGATTCATTCATGGGGGAGGGGGGAGAGGCGGGA
GGATTGGAGGAGGCATTGGCGGTGCCGGAGAGGTATAGAAAAGTAGTGAT
TGATTTCTCGAAGAGGGGACTGGACGGATTTGACTTCGGGAGGTACAACA
GGACTAGTTTGGTGGGTCTGGAAAATCTGTTGCCAAACTCTTACACAAAT
GCGGTGTTGCAGATGATGTTTTACGTACCGGAGGTGAAGGAGCTGGTGTT
GGGGAAGCAATACGCGCGGTGGTGCTGGGGGAATGAGAAGAGCGTGGTGT
GTGAGATGGGTTTTTTATTCTACATGCTGCAGATGGCCATGGCGGAGGCC
ATGGGAGTAGGGACGGGGACGGTCGGGAGGGAGGAAGAGAAGGAGGAAGG
GAAGGAGGGGAAGTCTGTGCATCAACAGCCACAGCAACGACAAGCTCGTC
TTGCCTCCCTCATCCCTCCCGCGGCACCAAAGCCTTGCCAAAGCAGCAAT
TTCTTGAGGGTCTACCGCCACGTCCCCGAGGTGGTTGCCCTCGGGCTTCT
AGAAGCTTCCGCGGCAGCGCCCGTCCAGCAACGAGCCGAAGGTGCGCATC
GGTTTTTGCTCTCGCAGATCCACGGGGAGGACAGTGCAGTGGCTCCCCTT
TCTGCTTCCTTTTCTGCTTCATCTTCTGCTCCCCGTCGAGGTGGTCGGGG
CGGGGCCAAAAAAGCAGCAGCAGCAGCAGCAGCAGCAGCAGCCGCAGCAG
CAGCCGCTGTCGCCGCTGCTGAGAAGCACGGAGCAGCAAAAGCAGAGGTT
GAAGAGAACGAGGACGAGCTCTCTGGCAGCGAAGGAGGAGGGGGGCAGGA
AGAAGGAAATGGGCTAGAGAAGAGCGTGATTGATACCTTGTACGGCTACA
CGGCCTGCACGACAAACACCTTCCTTCACCCTCCCAACCTCCCCTCCAAA
GTCACCTCCACTCGCTCCTTCATCCTCGAGCTCGTCTACCCACGAACCCT
CTCACGCGGCGCCGGTGGCTCTCCTACGCCATCCCCCACCCTCAAACCCG
CCCAACCCTCTTCCTCCTTCCCTCCCTCCTCCTCCTCCTCCTCCTCCTTC
TCCCTGCCCACTGACACTGCCTCTCTCGCCAAGAGCAAAAAGGAGAAAAT
TCTCGGCGTCCTGACCTCCATCAATGCCGCCGCCACCTCCACCACCACTG
GCCCTCCCCCCCTTCCCACCTTCTCTCAAATTCTCCATTCCTCCCTGTGC
CGCGAGACCCGTTCCCGCGGCTGGTGCGAGGCAAGCAAGGCCTACGAACC
GTTGAAGCAGCAGCGGGTGGCCTCCACCCTTCCCTCCCTCCTTGCCCTGC
ATGGAGGTGTAACGGTCGGCACAGGCAAAGAGGTTGCACGGCAGATTTGG
CACCAACTGAATCCTTTGGGGGGCCCATGGTTGCCGGAGAGAATAAGGGT
GAGGGTGGGGGGGAGGAGGAAGGGCGGGAAGGAGGGGGGCGTGTCGGTGA
TGGAGAGGGTGGTGGATCCGGGTGGAGGGAGAGAGGACGAGGGAGGGAAA
GAAGGGGGGGAGGAGGAAGAGTTTTGGATCGGTTCGGAGGACGGGAAGAC
GGTGAGGATGAGGGAGGAATTGGAAAATGTGGAGGAAGAGCAGGAGGGTG
GGGAGAGGAAGGAGGGGGAGGAGGAGGGAGGACGGGTGGAAAAGGAGTAC
GAGTTGATGTCGGTGGTGTCCTTTGTGAGTGCAGGCGCGAGTGGGGGAGG
GAGGACGAGGGGACCAGGGGGCATGCAACGGCGAGGATTGAAAAAAGGAG
GAGGAGGAGGAGGAGGAAGAGGAGGAGGAGGAGGAGGAATGAATGGAGAA
GCAGGAGAGGAAGGCCACCTCCTCCTCCACGTCAAAGTGCCCGTGGCCAG
CGAGTGGAAAGGGGCAGGCGAAAAATTGGAGGGTGCGAAGGAGGGCGAAA
AGGAGGGCGGGAGGGAGGAGTGGCAATGGTACATCTTGAATGACTTTTTG
GTCGCTCCGACGATCTTAGAAGACGCATTGGGCTTTTTGCCTACGTGGAA
GGAACCGTGTATTCTCCTTTATAGGGACCGAGAAACATCGCCGGCAGCAC
ATGCAGCATGGCTTGAGCAGCTGCGGAAGGCAGGAGCAAGGATGGGTGGT
GGCATGGAGGGAGAGGAAAGAGAAGCAGGTGGTGTTGCTACAGCAGCAGC
AACAGCAAAAGCAGCAGCAACACCTGCAGCAGTAATGCCCTCTATCCCGG
TCTCGGTCTTTTCTTCCCCTTCCATTTCCACTGCTCCTCCGCGCTCTCTG
TCCTTTACCCCCCTTCCTCCCTCACAGATGCCGGGGAGAGGGGACGTGAT
CGGTATCGACGCGGAGTTCGTCCAACTGGAGATGGAGATGGCATCTATTC
GTGAGGATGGCACTCGCGTCGTTTCGAAAGAGGGCCGACAAGCGCTGGCT
CGTCTGTCGATATTAGACGCACGGGAGGACATCGTAATGATGGATGATTA
CGTCTTGCCCTCTGAGCCGGTCATGGATTACCTAACGCGATTTTCGGGTT
TGACGAGGGAGGACCTGGACCCATCTCTCTGTCGGCATCACCTTGTCTCG
GCCCGGACTGCGTATTTGAAACTGCGATATTTGATCGATCGGGGCGTGGT
CTTGGTCGGGCACGGCCTGAAGAAAGACTTCCGGATTATCAATGTCTATG
TACCACCGGCCCAGGTGATTGATACGGTGGACCTGTGGTGCTTGGAGGGC
CAGAGGAAGATCAGTCTAAGATTCCTGGCTGCCTTCTTGTTGAAGGAAAA
CATTCAGGGGGAGACGCACGACTCGATTGAGGATGCGCGGACGGCCTTGA
GATTATGGAGGAAGTGGGAGGAGGTGCGGAGGGAGGGGAAGGTGGAGCTG
GTTTTGAACGCGTTGTATGAGTACGGCCGGAAGACGGGATGGAAGCTGCA
GCAGGATCAAGAGGAGGAGCAGGAGCAGCAGCTGTGATGGGGAGGGAGCG
AGGCAGGAGGGAAGGAGGGAGGCATGTAAAGCATTTGAGAAATGCCTAGC
TTTAATGAATGCTTACCCTTACCCACGCACCCGCATAAGCAGTATGGCAT
GTATCAAAGGAGAAAATAAATCAATGAAAATAGTGAAAC
back to top

protein sequence of NO03G05220.1

>NO03G05220.1-protein ID=NO03G05220.1-protein|Name=NO03G05220.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1567bp
MNGLSAPFIPPGQQQQQQQEEEEEEEDPPFSRSGYHEDEELDGPTEGHVP
GSWAELGRVPSDGFPITALAFDPIEELLWVGTGNGRLSALAQPTMHKHVS
VMAHPCPVRQIRCFGEGVISISGKEVRMHNRNCLAICGAPSKAMLGEEEG
EMEGGKKQASPASLRVGGRAAGGADSDDSDDGSRNGSAHKKAGKEEFTCG
ALNVPRSHHMAAFGGEYLTVGGNQGHVWQLDLVTGLTVVRMIDVTSPSAC
MESNRAVITGGTDGRLRFLDGMMRSRQGVERELEAYTGPVTSLCTWDDLV
VATGTQGRSLNPYDRSGLAPTRFLPDPLIKVFDLRMLRQTLPLSFAPALG
SPSLLSFLPGKDKDRLLVASGSTGQFLTCDPFNVTAAETAFFHLQDYQIG
APSPMSCLSTSLSGELLSFGTPSGLVVTYATDSGAVCNYPTLPLDDILAF
PPPAPLSIPPSDPNPASHYVFRPTYLAAPPHPLLELSSSFATTPAVASSL
PIRHPPQRVINPALFTKMSVQDFVGYVPNTFYHYPFQSLLFGKARRDAYL
VVDPRRKEEEGEGGREDGRGEGEEDWGDSFMGEGGEAGGLEEALAVPERY
RKVVIDFSKRGLDGFDFGRYNRTSLVGLENLLPNSYTNAVLQMMFYVPEV
KELVLGKQYARWCWGNEKSVVCEMGFLFYMLQMAMAEAMGVGTGTVGREE
EKEEGKEGKSVHQQPQQRQARLASLIPPAAPKPCQSSNFLRVYRHVPEVV
ALGLLEASAAAPVQQRAEGAHRFLLSQIHGEDSAVAPLSASFSASSSAPR
RGGRGGAKKAAAAAAAAAAAAAAAVAAAEKHGAAKAEVEENEDELSGSEG
GGGQEEGNGLEKSVIDTLYGYTACTTNTFLHPPNLPSKVTSTRSFILELV
YPRTLSRGAGGSPTPSPTLKPAQPSSSFPPSSSSSSSFSLPTDTASLAKS
KKEKILGVLTSINAAATSTTTGPPPLPTFSQILHSSLCRETRSRGWCEAS
KAYEPLKQQRVASTLPSLLALHGGVTVGTGKEVARQIWHQLNPLGGPWLP
ERIRVRVGGRRKGGKEGGVSVMERVVDPGGGREDEGGKEGGEEEEFWIGS
EDGKTVRMREELENVEEEQEGGERKEGEEEGGRVEKEYELMSVVSFVSAG
ASGGGRTRGPGGMQRRGLKKGGGGGGGRGGGGGGMNGEAGEEGHLLLHVK
VPVASEWKGAGEKLEGAKEGEKEGGREEWQWYILNDFLVAPTILEDALGF
LPTWKEPCILLYRDRETSPAAHAAWLEQLRKAGARMGGGMEGEEREAGGV
ATAAATAKAAATPAAVMPSIPVSVFSSPSISTAPPRSLSFTPLPPSQMPG
RGDVIGIDAEFVQLEMEMASIREDGTRVVSKEGRQALARLSILDAREDIV
MMDDYVLPSEPVMDYLTRFSGLTREDLDPSLCRHHLVSARTAYLKLRYLI
DRGVVLVGHGLKKDFRIINVYVPPAQVIDTVDLWCLEGQRKISLRFLAAF
LLKENIQGETHDSIEDARTALRLWRKWEEVRREGKVELVLNALYEYGRKT
GWKLQQDQEEEQEQQL*
back to top
Synonyms
Publications