NO12G02110, NO12G02110 (gene) Nannochloropsis oceanica

Overview
Name: NO12G02110
Unique Name: NO12G02110
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 13146
Alignment location: chr12:595817..608962 +
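
The sequence length and the alignment location listed in the overview can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the usual convention for genome browser locations, but an assumption here); `parse_location` and `span_length` are hypothetical helper names, not part of any database API.

```python
import re


def parse_location(loc: str):
    """Parse a location such as 'chr12:595817..608962 +'.

    Returns (seqid, start, end, strand); strand is '.' when absent.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])?", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location: {loc!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand or "."


def span_length(start: int, end: int) -> int:
    # Assumes 1-based, inclusive coordinates: both endpoints are counted.
    return end - start + 1


seqid, start, end, strand = parse_location("chr12:595817..608962 +")
print(seqid, strand, span_length(start, end))
```

Under that assumption the computed span equals the reported sequence length of 13146.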

Properties
Property Name    Value
Description      Protease
Alignments
The following features are aligned
Aligned Feature    Feature Type    Alignment Location
chr12              genome          chr12:595817..608962 +
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                    Date Performed
GSE178672                                        2023-12-15
PRJNA933693                                      2023-10-26
PRJNA943449                                      2023-07-19
GSE149904                                        2020-10-10
NoIMET1_WT_HS                                    2020-09-29
GSE139615                                        2020-03-11
PRJNA241382                                      2020-03-11
BLASTP analysis of N. oceanica IMET1 genes       2019-07-11
InterPro analysis for N. oceanica IMET1 genes    2019-07-11
GO annotation for IMET1v2 genes                  2019-07-10
PRJNA182180                                      2019-04-25
Gene prediction for N. oceanica IMET1            2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term          Definition
GO:0004252    serine-type endopeptidase activity
GO:0005515    protein binding
GO:0016787    hydrolase activity
Vocabulary: Biological Process
Term          Definition
GO:0006508    proteolysis
GO:0019538    protein metabolic process
Vocabulary: INTERPRO
Term         Definition
IPR001940    Peptidase_S1C
IPR036034    PDZ_sf
IPR009003    Trypsin-like_Pept_dom
IPR025926    PDZ-like_dom
Homology
BLAST of NO12G02110 vs. NCBI_GenBank
Match: GAX94281.1 (pro-apoptotic serine protease nma111-like protein [Pythium insidiosum])

HSP 1 Score: 633.3 bits (1632), Expect = 1.400e-177
Identity = 390/1035 (37.68%), Positives = 557/1035 (53.82%), Query Frame = 0
Query:   42 WQQCLDHAIPASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARV---------GREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGG-YNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCL----PAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDH-APGANGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAYSHEGEIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRIP--TELGALVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAHLPVSFLGYKPAQ-GAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPTTEI 1059
            W   L+  I A ++IR+  V++F+  G+S+S ASGFVV +   +ILTNRHVV  G P V D +  NKEE+ L  +Y DP+HDF F  FD  ++K+ +L EI L PEGA+            +     NDAGEK+ ILPGI+A+LDRD+P YG   YND+NTF             GSPVL  DG A+ALN                RV RAL  +                             S   +PRGTL TIF+H AFDE+ RLGLT++ +  VR AFP ETGML+V+Q ++ GPA+G L+ GD+L++  G   TTFL +E+ LD+HVG  V ++ QRGG    VD+ ++DLH++ P     I   ++H ++   A++  L  G GV++A  G+MF +A +T   II +V G+PTPT+    +V A LP+G+RT + Y  + DR R+R+A I +D++WF   + V  R D D    W  QPC+A++   L    PAV P  +SF                          G+    K   SLVMV+F +PY+VDG+S + Y G GIVID  +G VLVD++T P  +GD L+T    +E+PAK VFVHP+HN +V+QYD   L A +      +  A FAE   L VGE   + GLSS +  VT +  +TK + L L D +PP + + N  V H D       G+F+                                 V AL L ++Y      K+++R +   +V   VR   R G  P T++ LP +     LSK ++G+GLS+   ++L    Q  E +  +L V+R    T+    +++GD+LL++D   +V+  ++E A                         AA ++ + + VLR  +E+ + V+TT LS +GTDRV++WCGL++Q  H  V+ LGY P + G VY SRW +GSP+HKYGLRA+ +I  +NG AT  L+ FL  V       S+RIK + L++K K +TLK  Y YWPT E+
Sbjct:   64 WMTSLERCIRAIVSIRLNSVRAFDGNGASFSVASGFVVDMARGIILTNRHVVTPG-PVVADAIFLNKEEVDLVPIYRDPVHDFGFFRFDPAKVKFLELHEIPLRPEGAKTHTFAVSPQSASKSASSANDAGEKLSILPGILAKLDRDAPNYGSSTYNDFNTFXXXXXXXXXXXXXGSPVLNIDGCAIALNA--------------DRVVRALRYV----------------------------QSGEAVPRGTLQTIFRHAAFDEVRRLGLTSDTEVLVRQAFPQETGMLIVDQVIQQGPADGKLQTGDVLIKFAGRYETTFLGIEEFLDAHVGGTVTVEFQRGGETLTVDINVQDLHSITPDRYLEIGGGIVHSLSYQQARNASLPVG-GVYMAQAGHMFLKAHLTQPSIITAVDGKPTPTLEDFMRVMASLPNGYRTVLRYFMVRDRHRLRTAFIMMDRQWFPIQLCV--RNDTDG--LWYPQPCEANAVAALPKTVPAVAPAPLSF------------------------PGGSEAGKKMLLSLVMVTFDIPYMVDGISSSSYHGVGIVIDADKGFVLVDQNTVPIALGDVLLTIAASVEIPAKVVFVHPVHNFSVVQYDPKDLGAASH-----LESAVFAER-SLDVGEPCDYIGLSSNWTVVTMKSFVTKMDRLVLRDFQPPRYKAMNVEVLHFDRITKSVGGVFM----------------------------NDDGHVSALWLSFSYQDGSGRKEVFRGLPVHVVQPIVRQF-REGKIPATVNTLPAQLLTYSLSKARSGLGLSDAWIQKLQ---QQYEDKRQILGVKRCAAGTDCATKLESGDLLLAIDCSVMVRDFDVEEA-------------------------AADKDQVAITVLRNQEELTLNVQTTHLSAMGTDRVVIWCGLVVQAPHYAVACLGYIPEEGGGVYCSRWCYGSPAHKYGLRATIWIVEVNGEATKSLDDFLRVVRQLKNGDSVRIKTISLNTKPKVFTLKTDYHYWPTVEL 963          
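
Each HSP summary reports identities and positives as a count over the alignment length together with a percentage. The sketch below shows one way to pull those numbers out of a summary line and recompute the percentages; `parse_hsp_stats` is a hypothetical helper written for this page's text layout, not a function from any BLAST library, and the pattern deliberately tolerates the "Postives" spelling that some page templates emit.

```python
import re

# Matches e.g. "Identity = 390/1035 (37.68%), Positives = 557/1035 (53.82%)".
# "Posi?tives" also accepts the misspelling "Postives".
HSP_STATS = re.compile(
    r"Identity\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\),\s*"
    r"Posi?tives\s*=\s*(\d+)/(\d+)\s*\([\d.]+%\)"
)


def parse_hsp_stats(line: str) -> dict:
    """Extract counts from an HSP summary line and recompute percentages."""
    m = HSP_STATS.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    identities, length, positives, length2 = map(int, m.groups())
    if length != length2:
        raise ValueError("identity and positive counts use different lengths")
    return {
        "identities": identities,
        "positives": positives,
        "length": length,
        "identity_pct": round(100 * identities / length, 2),
        "positive_pct": round(100 * positives / length, 2),
    }


stats = parse_hsp_stats(
    "Identity = 390/1035 (37.68%), Positives = 557/1035 (53.82%), Query Frame = 0"
)
print(stats)
```

Recomputing the percentages from the counts, rather than trusting the printed values, gives a cheap consistency check when many hits are parsed in bulk.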
BLAST of NO12G02110 vs. NCBI_GenBank
Match: CCI41296.1 (unnamed protein product [Albugo candida])

HSP 1 Score: 629.0 bits (1621), Expect = 2.700e-176
Identity = 375/1037 (36.16%), Positives = 571/1037 (55.06%), Query Frame = 0
Query:   29 QPL--ALQQNPQQPTWQQCLDHAIPASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARVGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGGYNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDH-APGANGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAYSHEGEIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRIP--TELGALVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAHLPVSFLGYKP-AQGAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPTTEIV 1060
            QPL    +++ +   W   L   I A ++IR+L V++F+  G+S+S A+GF+V LK  ++LTNRHVV  G P + D +  +KEE+ LK +Y DP+HDF F  FD ++I + KL EI L PE A+VG EIR++GNDAGEK                                   TSGGSSGSPVL  DG A+ALN G AK AASSFYLPL RV RAL L+                             S   + RGT+  I KH AFDE+ RLGL   ++  VR  FP ETGML+V+Q ++NGPA+G L+ GDI++++ G   T+FL +E+ LD+HV + V ++ QRG       ++++DLHA+ PH    +   ++H ++   A++  L  G GV++A TG++F RA +   CII ++ G+ TPT+    QV A LPDG RT++ Y  + DR R+R+A I + + WF   +   +  DD N L W    C+++  +  P +  T  SFT     VP+             G   G++++ K   SLVMV+F +PY++DG+S + Y G G+VID +QG +LVD++T P  +GD +VT    +E+ AK VFVHP+HN +++QYD   L      G + +  AEFAE  PL+VG++  F GLSS +  VT +  ++K + L L D +PP + ++N  V H D       G+F+                                 V AL L ++Y      ++++R +S +I+   +  ++     P ++ +LP++     LSK ++G+GL +T  +++       E +  +L V+R    T+  + +++GD++L+++ + VV+ +++E+A   WQ                      + + ++L + R+HKE+ I V+ T LS  GTDR+LLWCGL++Q  H  V+ LGY P   G  Y+SRW +GSP+HKYGLRA+ +I  +N   T  L+  L  VT       +RIK + +++K K +TLK  Y YWPT EI+
Sbjct:   53 QPLISCSKESDESQRWLHSLGKCIRAIVSIRLLSVRAFDGNGASFSVATGFIVDLKRGIVLTNRHVVTPG-PVIADAIFLSKEEVDLKPIYRDPVHDFGFYQFDPSKINFLKLHEIPLHPERAKVGVEIRVVGNDAGEKF----------------------------------TSGGSSGSPVLDIDGNAIALNAGGAKKAASSFYLPLDRVVRALKLL----------------------------QSGQHVTRGTIQMILKHAAFDEVRRLGLPPPIEAQVRLIFPKETGMLIVDQVIQNGPADGKLQTGDIIIQLDGKYVTSFLEIEEYLDTHVSDSVSVQFQRGDSLNSTLLQVQDLHAITPHQYLEVGGGIVHSLSYQQARNASLPVG-GVYMAQTGHIFMRAHLMQPCIITALDGKSTPTLKEFVQVIASLPDGKRTTLRYFMIRDRHRIRTAFITMSRLWFPVQL---STRDDRNGL-WIPDLCESTGALSAPQI--TLPSFT----PVPL-------------GFPGGSSSAKKLLLSLVMVAFDLPYMIDGISSSSYHGIGLVIDSTQGYILVDQNTVPIALGDVMVTVAASVEIQAKVVFVHPIHNYSIVQYDPKEL------GTIELKSAEFAEL-PLNVGDTTEFIGLSSNWTVVTSKSVVSKVDRLVLRDFQPPRYKASNVEVIHFDRITKSVGGVFM----------------------------DSDGNVNALWLSFSYQDNTGRREVFRGLSVEILEPVLAHIRHTKSVPSSVRVLPLQLLTYPLSKARSGLGLPDTWIQQME---GIYEDKRQVLGVKRCAAGTDASSKLQSGDLILAINEKTVVRDIDVEKA-TTWQ----------------------EYDSVSLTIFRDHKELKINVQLTELSATGTDRILLWCGLVIQPPHYAVASLGYIPEIGGGAYISRWCYGSPAHKYGLRATIWIVEVNDTPTGTLDALLQVVTQLKNGDPVRIKTVAINTKPKVFTLKSDYHYWPTVEII 941          
BLAST of NO12G02110 vs. NCBI_GenBank
Match: PRP82470.1 (pro-apoptotic serine protease [Planoprotostelium fungivorum])

HSP 1 Score: 588.6 bits (1516), Expect = 4.000e-164
Identity = 375/1076 (34.85%), Positives = 554/1076 (51.49%), Query Frame = 0
Query:   13 PATPSLHQQFSEQQLEQPLALQQNPQQPTWQQCLDHAIPASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARVGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGG-YNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAYSHE---GEIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRI--PTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLGTDRV---------------LLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPTTEIVFNYELHEW 1068
            P +PS+    S  QL++   +  N +  TWQ+ LD  + A + IR   V+ F+   +S+S A+GF+V     LILTNRHVV  G P   + +  + EE+ L  +Y DPIHDF F  F+   +K+  L+EIELDPEGA+VG EIR++GNDAGEK+ IL G IAR+DR +PFYG   YND+NTFY  AASSTSGGSSGSPVL   GKA+ALN G  + AASSFYLPLHRVKR L                     L+   + P        PRG L T+F+H  +DE+ RLGL    + + R    +ETGMLVV+  +   P+  +LEPGD+++ + G L T F+ LED++DSHVG+ V ++I+RGG      + I DLH L P T   +   VLH ++   A+++ L  G GV++AS GYM  R G+    II +VG   TP + +  +      +  R  + Y +++DR + R ++I +D+ W    M+      DD    WT   C  + E   PA +P      T    + +                     +N+   S+VMVSF +PY++DG S   Y+G+G+++D  +GL+LVD++T P  +GD LV+F   +E+PA+  ++HP+HN  +LQYD   L     SG     ++      PL  G  +   GL+    P  Q+ T++K E L + + RPP F + N  V H + A    G  ++  +                             + AL   Y+ S +    E  +I+R +   ++ E +  LKR   P   +  L ++   + LSK +  +GL E   + +  K         +L +RRI   T+    +KTGD+L++V+G+ V    E+E                        TRE    E ++L +LR  +E+ + V+TT L  +GTD+V               L W G +LQ  H  VS LG+      V+ SRW +GSP+H++GLRA  +I  +NG  TPDL +F+  V N   +A +R+K + +  K    T+K    YWPT  I  +    +W
Sbjct:  532 PNSPSI----SSIQLDEAPPMGLNVESETWQRTLDRVVSAVVAIRFCTVRHFDTERASFSVATGFIVDKAKGLILTNRHVVRPG-PVTAEAIFLDHEEIKLYPVYRDPIHDFGFFRFNPADVKFMDLKEIELDPEGAKVGIEIRVVGNDAGEKLSILSGTIARMDRPAPFYGDDEYNDFNTFYYQAASSTSGGSSGSPVLNLTGKAIALNAGGRRKAASSFYLPLHRVKRVLQY-------------------LQNDNLKP--------PRGDLQTVFRHTPYDEVHRLGLRGVTESTFRKENANETGMLVVDSVIPESPSYNVLEPGDVIVSLQGELLTQFIRLEDIMDSHVGQTVTIEIERGGVPMTHKLPIVDLHQLEPSTFLEVGGGVLHELSYQQARNHRLPVG-GVYVASDGYMLGRGGLHKGTIIRAVGQTETPNLESFAKAICSYSNDSRVPIQYFTVSDRHQSRLSVIYIDRCWHGMQMWTK----DDALGLWTASDCLPAPE---PAFVPKPTPVKTLKSPMEL---------------------ANEIVKSMVMVSFHIPYVIDGGSSDNYLGSGLILDAERGLILVDQNTVPLVLGDLLVSFASTVEIPARIRYIHPIHNFGILQYDPKLL---VNSGF----QSANISMEPLEAGADVFLVGLTRTDQPFCQKTTVSKIEELFIGEARPPRFRAINEDVIHLEKATSCVGGVLVDEN---------------------------RKIRALWASYSSSEKKNPSESFEIFRGLPLYLIQEIIEPLKRDEIP--RIRSLEVELWPIALSKAR-DLGLGEKWIQDIEDKY----VRRHILCIRRIAAKTDASTKLKTGDVLIAVNGDVVTNFREVENL----------------------TRE---RESVSLTLLRNQQEIVMDVKTTLLEGIGTDKVESVYVSSSFTYASQILSWSGAILQNTHRAVSQLGF--TYEGVFCSRWFYGSPAHQFGLRAVHWIVEVNGKRTPDLESFIKVVRNAGDEAFLRLKLMGMQDKVSVLTIKTDLHYWPTVLISLDKSSGQW 1478          
BLAST of NO12G02110 vs. NCBI_GenBank
Match: PRP82453.1 (pro-apoptotic serine protease [Planoprotostelium fungivorum])

HSP 1 Score: 588.6 bits (1516), Expect = 4.000e-164
Identity = 375/1076 (34.85%), Positives = 554/1076 (51.49%), Query Frame = 0
Query:   13 PATPSLHQQFSEQQLEQPLALQQNPQQPTWQQCLDHAIPASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARVGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGG-YNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAYSHE---GEIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRI--PTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLGTDRV---------------LLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPTTEIVFNYELHEW 1068
            P +PS+    S  QL++   +  N +  TWQ+ LD  + A + IR   V+ F+   +S+S A+GF+V     LILTNRHVV  G P   + +  + EE+ L  +Y DPIHDF F  F+   +K+  L+EIELDPEGA+VG EIR++GNDAGEK+ IL G IAR+DR +PFYG   YND+NTFY  AASSTSGGSSGSPVL   GKA+ALN G  + AASSFYLPLHRVKR L                     L+   + P        PRG L T+F+H  +DE+ RLGL    + + R    +ETGMLVV+  +   P+  +LEPGD+++ + G L T F+ LED++DSHVG+ V ++I+RGG      + I DLH L P T   +   VLH ++   A+++ L  G GV++AS GYM  R G+    II +VG   TP + +  +      +  R  + Y +++DR + R ++I +D+ W    M+      DD    WT   C  + E   PA +P      T    + +                     +N+   S+VMVSF +PY++DG S   Y+G+G+++D  +GL+LVD++T P  +GD LV+F   +E+PA+  ++HP+HN  +LQYD   L     SG     ++      PL  G  +   GL+    P  Q+ T++K E L + + RPP F + N  V H + A    G  ++  +                             + AL   Y+ S +    E  +I+R +   ++ E +  LKR   P   +  L ++   + LSK +  +GL E   + +  K         +L +RRI   T+    +KTGD+L++V+G+ V    E+E                        TRE    E ++L +LR  +E+ + V+TT L  +GTD+V               L W G +LQ  H  VS LG+      V+ SRW +GSP+H++GLRA  +I  +NG  TPDL +F+  V N   +A +R+K + +  K    T+K    YWPT  I  +    +W
Sbjct:  532 PNSPSI----SSIQLDEAPPMGLNVESETWQRTLDRVVSAVVAIRFCTVRHFDTERASFSVATGFIVDKAKGLILTNRHVVRPG-PVTAEAIFLDHEEIKLYPVYRDPIHDFGFFRFNPADVKFMDLKEIELDPEGAKVGIEIRVVGNDAGEKLSILSGTIARMDRPAPFYGDDEYNDFNTFYYQAASSTSGGSSGSPVLNLTGKAIALNAGGRRKAASSFYLPLHRVKRVLQY-------------------LQNDNLKP--------PRGDLQTVFRHTPYDEVHRLGLRGVTESTFRKENANETGMLVVDSVIPESPSYNVLEPGDVIVSLQGELLTQFIRLEDIMDSHVGQTVTIEIERGGVPMTHKLPIVDLHQLEPSTFLEVGGGVLHELSYQQARNHRLPVG-GVYVASDGYMLGRGGLHKGTIIRAVGQTETPNLESFAKAICSYSNDSRVPIQYFTVSDRHQSRLSVIYIDRCWHGMQMWTK----DDALGLWTASDCLPAPE---PAFVPKPTPVKTLKSPMEL---------------------ANEIVKSMVMVSFHIPYVIDGGSSDNYLGSGLILDAERGLILVDQNTVPLVLGDLLVSFASTVEIPARIRYIHPIHNFGILQYDPKLL---VNSGF----QSANISMEPLEAGADVFLVGLTRTDQPFCQKTTVSKIEELFIGEARPPRFRAINEDVIHLEKATSCVGGVLVDEN---------------------------RKIRALWASYSSSEKKNPSESFEIFRGLPLYLIQEIIEPLKRDEIP--RIRSLEVELWPIALSKAR-DLGLGEKWIQDIEDKY----VRRHILCIRRIAAKTDASTKLKTGDVLIAVNGDVVTNFREVENL----------------------TRE---RESVSLTLLRNQQEIVMDVKTTLLEGIGTDKVESVYVSSSFTYASQILSWSGAILQNTHRAVSQLGF--TYEGVFCSRWFYGSPAHQFGLRAVHWIVEVNGKRTPDLESFIKVVRNAGDEAFLRLKLMGMQDKVSVLTIKTDLHYWPTVLISLDKSSGQW 1478          
BLAST of NO12G02110 vs. NCBI_GenBank
Match: OUU83132.1 (hypothetical protein CBC32_16725 [Proteobacteria bacterium TMED72])

HSP 1 Score: 549.7 bits (1415), Expect = 2.100e-152
Identity = 364/1021 (35.65%), Positives = 528/1021 (51.71%), Query Frame = 0
Query:   42 WQQCLDHAIPASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARVGREIRIMGNDAGEKIQILPGIIARLDRDSPFYG-GGYNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKAS-SEVCLPAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAYSHEGEIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRIPTE--LGALVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPTTEI 1059
            WQ+ +D  +P  +  RV   ++F++    Y  A+GFVV  +  LILTNRHVV  G P   + VL + EE+ ++ +Y  P+HDF F  FD   +++ +L E+ L P+ AR+GREIR++GNDAGEK+ ILPG IARLDR +P YG   +ND+NTFYI AAS TSGGSSGSPV+  +GK VALN G +  AASSF+LPL RV+RAL+         +  +G E                   + RGTL T+F ++ +DE+ RLG+  E + + R+AFP  TGMLVV Q +  GPA G+LEPGD+++++ G     F+ LE VLD  VGE+V + I+RGG  R V++E+ DLHA+ P          L+P++ H A+++ +    GV+LAS GY+FSRAGI    +I  VGG   P + + E   A   DG +  + + S  +       +++ D+ WF   M       DD+T  W   PC AS S    P   P +        + PV               T G         S+V VS+ VPY +DG+ G  + GAG+V+D   GLV+VDR T P  +GD  + FGG +EVPA+  ++HP HN AV+QYD S L      G   V  A F++   L  G+ + F G++     V++   ++++E + L+   PP F   N  V   D      G  ++                                V AL   ++    G     +  + +  +   V  LK G         L  +F  + LS+ + G GLS+ Q RRL K   A      +L+VR++ TE      +  GD+LLSV+G+ V +  E+E                          +A+ EE + L VLR  ++  + VRT P    GT R LLW G +LQ+A   V    Y  +   VYV+R+ FGSP+++YGL A+  I  ++G  TPDL+ F+  +       S+R+K +DLD +    TLK   +YWPT E+
Sbjct:   21 WQETIDEVVPGVVAXRVNSPRAFDSEVPGYLQATGFVVDAEQGLILTNRHVVRSG-PVRAEAVLLDHEEVPVEAVYRXPVHDFGFYRFDPADVEFMELPELALAPQNARLGREIRVIGNDAGEKLSILPGTIARLDRRAPDYGPSTWNDFNTFYIQAASGTSGGSSGSPVVDIEGKVVALNAGGSLAAASSFFLPLERVERALE---------KLQKGEE-------------------VERGTLQTLFVYEPYDEVRRLGVRRETEAASRSAFPDSTGMLVVGQVVPGGPAAGLLEPGDVVVKLNGDRLGDFIGLESVLDDSVGERVTMDIERGGEMRRVELEVEDLHAITPDRYLEFGGGALNPLSYHQARNHSIPV-EGVYLASPGYVFSRAGIPRGVVITEVGGVKVPDLESFETEMARYADGEKVPLQFFSPVNPRTPSVRVVRADRTWFPMQMC----RRDDSTGRW---PCVASPSPRTAPPAQPASTDMVIKGAEDPVK--------------TIG--------PSIVQVSYDVPYRLDGVHGDRFTGAGLVVDTESGLVVVDRETVPIALGDLSIVFGGSVEVPAEVAYLHPEHNFAVIQYDPSLL------GETGVRAARFSDV-ELEPGDDVWFVGITGGQRIVSRETRVSRREPVSLAQTFPPRFRDRNIEVVSLDDVMATVGGVLVDEDG---------------------------RVQALWASFSAGDGGRRDSFFGGIPARSLRPVVDRLKAG--EAVAWRSLGAEFIPLALSQAR-GRGLSDEQARRLEK---ADPEGRRVLMVRQVDTEPQSSQSLYPGDLLLSVNGKTVTRHHEIE--------------------------QASMEEEVRLEVLRNGEKRDLVVRTEPRDGTGTGRALLWAGTLLQDAP-DVLAREYGISPTGVYVARYRFGSPANRYGLVATTRIVEVDGHPTPDLDAFIEAMGTRSDRDSVRLKTIDLDGRTSVTTLKLDLEYWPTAEV 915          
BLAST of NO12G02110 vs. NCBI_GenBank
Match: GBG32291.1 (Pro-apoptotic serine protease nma111 [Aurantiochytrium sp. FCC1311])

HSP 1 Score: 525.4 bits (1352), Expect = 4.200e-145
Identity = 369/1036 (35.62%), Positives = 521/1036 (50.29%), Query Frame = 0
Query:   52 ASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARVGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGG-GYNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPA--EGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSLEAE-----AQSGLVTVAE-----AEFAEAPP-LHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAY-SHEGEIKQIYRSVSSDIVTETVRLLKRGGPPPRT-LDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRIPTELGA--LVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPTTEIVFNYELHEWRL 1070
            A + +RV  V++F+  G   S A+GFVV  +  LILTNRHVV  G P   D    +KEE+ +K ++ DP+HD+    FD  +IK+ ++ EI L PE A VG E+R++GNDAGEK+ IL G +ARLDR +P YG   YND N                 PV+ SDG AVA+N G    AASSFYLPL RV RAL  +               +GE               +PR TL  IFKH+AF E  RLGL+ E++  +R  FP  TG+LV +Q +  GP    G+L PGDILLE+ G     FLPLE VLD    E V L ++RG     V++E  DLHAL P +   IS AV+HP++   A +  +EAG+ V+++S+G+M  RA +  + IIL+VG   TP IA  E+V A  PD    SV Y  +T R  +    I VD+ WF   ++ +A+  D     W F+       +  P++ P     T                             S+  +  LVMV   VP+L DG+  +   G G V+D   GLV+VDR+T P T+ +  +TF G  E+ A+ + VHPLHN +++QYD S L+ E     + S      E     AEFA +P  L  G++  F GLS     V Q   +T  E L  S   PP F + N  V H D      G  +L                                V A    ++Y S   + K+I+  V +D V ETV  LK  G   +T L  L +      +SK +AGMGLS  + R+ +    A E+   +L VR       +  ++++GDILL ++  P++    +++AV      +++N+  DA   V  T            VLR+ +E   QV  + LS +GT R++L+ G+ +QE H PV F G+ P   +VY SRW +GSP+HKY L+A+ FI  IN   TP L+  +  V+       +RI+ + L  K + +TLK  Y YWPT ++V + E   W L
Sbjct:   53 AVVVLRVNYVRAFDGEGRGCSSATGFVVDKEKGLILTNRHVVSCG-PVRADATFLSKEEVEIKAIFRDPVHDYGVFQFDPKEIKFQEVVEIPLSPERAEVGLEVRLLGNDAGEKLSILSGTLARLDRVAPHYGSKEYNDHNXXXXXXXXXXXXXXXXXPVIDSDGHAVAINAGGKTKAASSFYLPLDRVVRALSYL--------------RKGE--------------TVPRRTLQVIFKHQAFTEALRLGLSRELEAEIRKEFPKATGLLVADQVVPGGPGARAGVL-PGDILLELNGKHIHEFLPLEAVLDD--SESVKLTLRRGDETLTVELETVDLHALTPSSFLEISGAVIHPLSYMQAINNAIEAGS-VYISSSGFMLGRANVPWNAIILAVGDEETPDIATFERVIAKYPDLSTVSVRYAQVTHRHHILVRSITVDRTWF---VWRHAQRCDKEG-TWHFR------NLASPSIAPAPKQHT--------------------------IHVSDDKRAGLVMVLCDVPFLADGIPSSNMHGTGTVVDAENGLVVVDRNTVPNTLCNVSITFCGTAEISARVLLVHPLHNFSIIQYDPSLLQVEFSASNSDSNDEATREPEPFRAEFALSPQRLAAGDATTFAGLSHQMTTVEQSCKVTNVERLMFSLTSPPRFTAPNQLVIHVDKLVSNVGGVLL---------------------------DDENRVQAFWWSFSYQSSSYKDKEIFVGVHADFVYETVNALKANGCRAQTSLPTLGLDLVVTPISKARAGMGLSSERVRQFA---SAAEAHNQVLTVRSALARSHSFNVLRSGDILLDINKTPMITFEAVQQAVFE----ASANNESDAPAEVAVT------------VLRDLREQTFQVPLSTLSFIGTSRLVLFAGMFIQEPHTPVFFRGFDPGT-SVYCSRWFYGSPAHKYDLKATHFILGINDHDTPTLDDLVRVVSELKDGEFVRIRTISLKEKKRVFTLKCDYHYWPTVDLVRDAETSNWTL 972          
BLAST of NO12G02110 vs. NCBI_GenBank
Match: OUV26549.1 (hypothetical protein CBC48_15340 [bacterium TMED88])

HSP 1 Score: 520.4 bits (1339), Expect = 1.300e-143
Identity = 348/1021 (34.08%), Positives = 531/1021 (52.01%), Query Frame = 0
Query:   42 WQQCLDHAIPASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARVGREIRIMGNDAGEKIQILPGIIARLDRDSPFYG-GGYNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSL-EAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAYSHEGEIKQIYRSVSSDIVTETVRLLKRGGPPP-RTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRI----PTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPT 1056
            WQ+ +D   PA + ++V   ++F+   +  + A+GFVV  +  LILTNRHVV  G P     V  + EE+ ++ LY DP+HDF F  FD   +++  + E+ L PE A VG ++R++GNDAGEK+ IL G IARLDR +P YG  G+ND+NTFY+ AAS TSGGSSGSPV+   G+ VALN G  + AASSF+LPL RV+RAL                    EL+G R P          RGTL T+F ++ +DEL RLGL+   +E++R + P  TG++VV + +  GPA+G+LEPGDI+L + G     FLP+E  LD+HVG+ ++L I+RGG+   V++++ DLHA+ P   F     VLH ++   A+++ +    GV++AS GY  SRAG++   I+  V G PTPT+ A E   A   DG +  + Y  L        A+I+VD++WF+    +   E DD T  W   PC+++     PA  P     TT     P               +TR          SLV+V + +PY +DG+ G ++ GAG+++D  +GLV+VDR T P  +GD  V   G + +PA+ V++HP HN+AV++YD + + E   +S  + V E        L +GE +   GLS+    V++   + ++E + L    PP F  +N  +   + AP   G  VLA                               V A    ++      ++  +  + S  ++  V  L++G     R+LD   ++F  + L + ++  GL   Q  RL K          +L VRR+    P E    ++ GD++LSVDG PV                              E +EA+  E ++L +LR+ + + + + T PL   GT R L+W G +LQ+    +S     P +G VY++R+ +GSP+ +YGL  +  I +++G  TPDL+ F + V      AS+R+K   L+ +    TL+    YWP+
Sbjct:   47 WQETIDRVAPAIVVLKVSAPRAFDGGQAGDAVATGFVVDAERGLILTNRHVVMPG-PVAAKAVFLDNEEVDIRALYRDPVHDFGFYQFDPADVQFMPVAELPLAPERAEVGLDVRVIGNDAGEKMSILGGTIARLDRAAPVYGRRGFNDFNTFYLQAASGTSGGSSGSPVIDRQGQVVALNAGGRRLAASSFFLPLDRVQRAL-------------------LELQGGRSP---------VRGTLETVFAYRPYDELRRLGLSVSTEEAIRRSRPEGTGLIVVSEIVPGGPADGLLEPGDIVLRIAGEAVDGFLPIERQLDAHVGDDLVLDIERGGQPLSVELKVADLHAVTPSAYFEFGGGVLHDLSYQQARNHGVPI-QGVYVASPGYTLSRAGLSAGAIVTHVKGVPTPTLEAFEAEIAGFADGEKVPIGYWLLDAPRAPGVAVIRVDRRWFS----MQHCERDDQTGTW---PCRSAPPA--PASEPPA-PVTTQVSSAP-------------EKVTRALA------PSLVLVEYDIPYRIDGVHGDQFRGAGLIVDADRGLVVVDRETVPVALGDLEVVVAGSVRIPAEVVYLHPEHNLAVIRYDPALIGETPLRSARLRVDE--------LELGEGVFVVGLSNANRIVSRETRLERREAVVLPRTYPPRFQESNLELLSVEDAPVTIG-GVLADG--------------------------KGRVRAFWGSFSEGFGQSLEAFFAGIPSREISAIVDPLRKGDAVGWRSLD---VEFYPVGLEEARS-RGLEAVQAARLEKH---DPDRRQVLAVRRVTAGGPAE--GKLQPGDLVLSVDGRPV--------------------------SRFHEIQEASSAERVSLEILRDGRVLELSLPTAPLKGEGTTRALVWAGTLLQDPPRVLSRQEQLPQEG-VYIARYWYGSPADRYGLPVAARILAVDGKPTPDLDAFTAAVRGKPSGASVRLKLEALNGQLSVATLELDLDYWPS 937          
BLAST of NO12G02110 vs. NCBI_GenBank
Match: PLX55351.1 (hypothetical protein C0629_13080 [Chromatiales bacterium])

HSP 1 Score: 516.2 bits (1328), Expect = 2.500e-142
Identity = 345/1024 (33.69%), Positives = 526/1024 (51.37%), Query Frame = 0
Query:   42 WQQCLDHAIPASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARVGREIRIMGNDAGEKIQILPGIIARLDRDSPFYG-GGYNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTL-CWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSL-EAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAYSHEG-EIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRIP-TELGALVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAH--LPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPTTEI 1059
            W++ L+      ++I+V  V++F+   +  + A+GFVV  +  LILTNRHVV  G P     V  N EE+ LK +Y DP+HDF    FD + +++ +  E  L P+ A+VGREIR++GNDAGE++ IL G IA+LDR +P YG G YND+NTFYI AAS TSGGSSGSPV+ +DG AVALN G +  AASSF+LPL RV+R +D+I              G+G               L+ RGTL+TIF H ++ EL RLGLT +++   RAA P +TGMLVV+Q +   PA+  LEPGDIL+ V G L T F  LE +LD  VG+ V +++QRGG      +++ DLHA+ P     IS AVLH V+   A+  ++    GV +A+ GYMFS +GI    +I  + GRP P +         L DG + ++ Y +  D    +   + +D++W+      + R DD  TL  W  +P         PA  P +  F                       +T G + + K   SLV+V+F +PY++ G+S   Y G G+V+D  +GL++ DR+T P  +GD  +TF G +EVP +  ++HPLHN+AV+ Y+   + +   +S + +   AE         G+ +   GL     P  Q+  +   + +     R   F  TN       +APG N   VL  S                             V+A    +A+     +++Q+   ++ D+V E V  ++ G    R L  L  +   + L+  +  +GL   + + L K   + +   +L VVR +  +    +++ GD+LL++DGE V    E+ER V                          +++ +++ + R  +E+   +RT  L+  G DR++ W G +LQ  H  LP    G  P    VYV+ + +GSP+ +Y L A   I  I+G+ TPDL+TF++ V N     S+RIK +  + + +  TLK  ++YWP  E+
Sbjct:   32 WRRTLEEISTGVVSIKVDGVRAFDTEWNQTTQATGFVVDRERGLILTNRHVVTSG-PVTAQAVFLNNEEVDLKPVYRDPVHDFGLYRFDPSALRFIEPYEFPLRPDRAQVGREIRVVGNDAGEQLSILAGTIAKLDRSAPNYGRGKYNDFNTFYIQAASGTSGGSSGSPVVDADGNAVALNAGGSAQAASSFFLPLERVRRVVDII-------------RGDG---------------LVTRGTLMTIFAHTSYGELRRLGLTEDIEARARAAQPDQTGMLVVQQVVPGSPAQNKLEPGDILIAVNGGLITRFAALEALLDDAVGDNVTVEVQRGGERFGFTLDVIDLHAITPAAFLEISDAVLHTVSYQQARHLNMPV-MGVFVANPGYMFSASGIPRGAVIAQLNGRPVPDLDTFVGKIRELADGQQATIRYFTFDDPQTTKLRSVNIDRRWYPAR---HCRRDD--TLGYWPCEPLPEVGSAAPPA--PASTEF-----------------------VTNGDSRARKIAPSLVLVNFDMPYIISGVSERHYHGTGLVVDAERGLIVTDRNTVPVAMGDVKITFAGTVEVPGRVEYIHPLHNLAVISYNPELVGDTPVRSAVFSPQVAE--------EGDEIWVAGLKGNSNPFIQKSQVAAVDAVGFPLSRTLRFRDTNLETIAVVNAPG-NVDGVLLDS--------------------------KGRVMATWSSFAFEGANKKLEQVTFGIAGDLVEEMVGFVREG----RDLHSLETELRLLPLATAR-DLGLPAERIKGLEK--HSPQRRQALQVVRTVAGSPAAGVLRPGDLLLAIDGELVNTYREVERRV--------------------------QQDEVSVTLWRNGEELTETLRTQTLTGHGVDRIVYWAGAVLQTPHRALPAQ-RGILPE--GVYVAYFAYGSPASRYSLWAGRRIIEIDGLPTPDLDTFVAAVANKSDRESVRIKTVTWNDQVEVLTLKTDHRYWPAYEL 924          
BLAST of NO12G02110 vs. NCBI_GenBank
Match: KPK61541.1 (hypothetical protein AMJ59_00520 [Gammaproteobacteria bacterium SG8_31])

HSP 1 Score: 513.5 bits (1321), Expect = 1.600e-141
Identity = 341/1026 (33.24%), Positives = 526/1026 (51.27%), Query Frame = 0
Query:   38 QQPTWQQCLDHAIPASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARVGREIRIMGNDAGEKIQILPGIIARLDRDSPFYG-GGYNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEA-GTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGA-NGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAYSHEGEIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRIP-TELGALVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAHLPVSF-LGYKPAQGAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPTTEI 1059
            ++  W++ L+      ++I V   +SF+   +  + A+GFVV  +  LILTNRHVV  G P + + V  N EE+ +  +Y DP+HDF F  +D + +++ +  E+ L P+GA++GREIR++GNDAGE++ IL G +ARLDR++P YG G YND+NTFY  AAS TSGGSSGSPV+  +G+ VALN GA   AASSF+LPL RV+RALD+I                      R+  P      + RGTL+T F H  +DEL RLGL+ E++  VR  FP  TGMLVV+Q +   PA G LEPGDIL+ V G +   F+PLED+LD+H+G K+ L++QRGG  + V V   DLH + P+       AV++ ++   A+  HL +  TG+++AS GY+F+R+ I    +I  V G P P +    +    L DG + +V +H+  +    +   +++D++WF           +D    W   PC+    V    V P     TT   D P                       ++   SLV+V+F +PY V G+S   Y G G +ID ++GLV+VDR+T P  +GD  +TF G LEVP +  ++HPLHN+AV+ YD + +   A + +  VA        P+  G+ L   GL        Q   +   + ++    R   F  TN       +AP   +G+   A                               VV+L   +AY    E+ Q+ + V +D+V E +  +++G P    +  L  +F ++ LS  + G+GL +   +RL +         ++ V+R +  T     +K GD+LLSVDGE +    E+ER                           ++   + LV+ R+  E  + + T  L     DR+L+W G +L   H  ++   G +P    VYV+ + +GSP+ +YGL A   I  ++G+ TPDL+ F++ V+      ++R+K ++ +   +  TLK   +YWP  E+
Sbjct:   28 EEDRWRETLERISSGVVSITVDGTRSFDTNWNQSTQATGFVVDAERGLILTNRHVVTPG-PVIAEAVFLNHEEIPVFPVYRDPVHDFGFYRYDPSSLRFIRPAELSLFPKGAQLGREIRVVGNDAGEQLSILAGTLARLDREAPDYGQGNYNDFNTFYFQAASGTSGGSSGSPVVDIEGRVVALNAGANTQAASSFFLPLDRVQRALDMI----------------------RVGQP------VTRGTLMTEFVHTPYDELRRLGLSQEIEAEVRRRFPQATGMLVVKQVIPGSPAAGSLEPGDILVRVDGDILNGFVPLEDLLDNHIGRKISLQVQRGGSLKDVGVTPIDLHGVTPNEYIEFGDAVVNQLSYQQAR--HLNSPPTGIYVASPGYVFARSAIPRSAVISEVNGVPVPKLDDFREELEGLQDGEQFTVRFHTFDEPRGSKLRTVRMDRRWFP----AQVCRRNDALGVW---PCEPLPPV---GVAPPPSPATTRFIDYP-------------------DARRSRLAPSLVVVNFDMPYTVAGVSDRHYFGTGAIIDAARGLVVVDRNTVPVALGDVRITFAGSLEVPGRVEWIHPLHNLAVVAYDPALI---ADTPVREVA----LNLEPVSPGQRLWVVGLKGDHTLAVQSTEVASVDPVQFPLSRTLRFRDTNLETISLVNAPSEFDGVLADADG----------------------------RVVSLWSSFAYHAGQELNQVNKGVPADLVAEVLDQVRQGTP----VRSLETEFGRLPLSSAR-GLGLPDVWVQRLEQ--DDPRRRQAMQVIRTVAGTPADVALKPGDLLLSVDGEVITSFREVER--------------------------RSQRPEVELVIWRDGAEQVLTMETVALDGRDLDRLLVWAGALLHSPHRAMAAQRGIEPT--GVYVAFFNYGSPATRYGLFAGRRIVEVDGVPTPDLDAFIAAVSGLQDREAVRLKTINWNDGVEVITLKLDNRYWPAYEL 923          
BLAST of NO12G02110 vs. NCBI_GenBank
Match: XP_006677811.1 (hypothetical protein BATDEDRAFT_19279 [Batrachochytrium dendrobatidis JAM81] >EGF81247.1 hypothetical protein BATDEDRAFT_19279 [Batrachochytrium dendrobatidis JAM81] >OAJ38220.1 hypothetical protein BDEG_22170 [Batrachochytrium dendrobatidis JEL423])

HSP 1 Score: 510.0 bits (1312), Expect = 1.800e-140
Identity = 339/1062 (31.92%), Positives = 533/1062 (50.19%), Query Frame = 0
Query:   40 PTWQQCLDHAIPASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTVIDGVLH-NKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGARVGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGG-YNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMYVNAREDDDNTLCW---TFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVXXXXXXXXXXXXXGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAXXXXXXXXXXXXXXXXXXXXXXXXXXXMVVALRLCYAYSHEGEIKQIYRSVSSDIVTETVRLLKRGG-----------------PPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQATESEASLLVVRRIP--TELGALVKTGDILLSVDGEPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTL---GTDRVLLWCGLMLQEAHLPV-SFLGYKPAQGAVYVSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCYQYWPTTEIVFNYEL-HEWRLHHY 1073
            P W++ L+  IPA ++IR++ V++F+      S A+GF+V  K  +IL+NRHVV  G P + D +L+ +KEE+ L  +Y DP+HDF F  FD + +KY K++EI L+P   RVG +IR++GND+GE++ IL G +ARLDR +P YG G +NDWNTFY  AAS TSGGSSGSPV+  DG A+ALN G A  +A+SF+LPL RV R L LI                             +   IPRGT+ TIF++ A+DE+ RLGL  +++  VR +FP  TGML V Q +  GPA+ +LE GD+LL +   L T+F+P+E++ DS+VG+ + + +QRG   + V++ ++DLH++ P+    +S  +LH ++  MA+SY +  G GV +A  GYM   +G++  CII S+  +PTPT+ A  +V   L D  R S  +H L+D  + +++II  D++W    + V     DD T  W   T  PC  S E  L     T ++     G   +                           SLV V+F +P+ +DG+    + G G+V+D   GL+LVDR+T P +IGD L+TF   + +P K +++H + N  ++ YD +SL  +     VT++  E ++A  +H+        LS  + P+ ++  +T      +++G PPT+ + N      ++     G+                                   V      Y         + Y  +S  +V   +  L+                    P  +TL+ + + +TQ+  +++   MGL++   +R+     +  S  +++V+RR+   T    LV  GDI+LSV+G+P  Q                            +      +  + LV+LR+ KE+ +    TPLST+   GT+RV+ W G + Q  H  V   L + P+   V  S    GSPS  Y L    ++  +NGI TP+L+ FL  +     D  +R+  +      K   L+P   Y+   EIV +  +   WRL  +
Sbjct:   41 PRWEETLNKIIPAIVSIRMICVRNFDTESQRTSQATGFIVDKKQGIILSNRHVVQPG-PILADMILNQSKEEVRLTPIYRDPVHDFGFFKFDVSAVKYMKIQEIPLEPALVRVGLDIRVVGNDSGERLSILSGTMARLDRKAPNYGAGRFNDWNTFYYQAASMTSGGSSGSPVIDVDGNAIALNAGGATQSATSFFLPLDRVVRVLKLI----------------------------QAGHSIPRGTIQTIFQYTAYDEIKRLGLDTDIETLVRQSFPENTGMLTVRQVIPKGPADKLLEAGDVLLRINDELITSFVPMEEIWDSNVGQSIKVLVQRGPDIKEVNITVQDLHSITPNRYLEVSGGILHELSYQMARSYIVPTG-GVFVAGAGYMLGLSGVSKRCIIESLNNKPTPTLDAFIEVMGSLKDNERVSFRFHQLSDINKSKTSIILADRRWHDFKVAVR----DDTTGLWNYTTLPPC--SGEAVLEFHSATHLTLDDSLGPAKIVI------------------------PSLVHVAFYLPFKIDGVVAQIHTGVGVVLDAKCGLILVDRNTIPTSIGDILLTFSNSIIIPGKIIYLHQIFNFGIVSYD-TSLLGDTFVRSVTISPKELSQADSVHL------VCLSKSYQPIVRKTVVTNIRQFFVNEGVPPTYRAMNVEGIELENPVSQGGVLTTEDG----------------------------QVQGFYAAYTKHGSKSSGEFYMGLSMSVVIPVLDALRTPDNIASMSGSNGVRHNLILPSMKTLE-VEMTYTQVAHARI---MGLTDAWVKRIE---SSHLSRRNVIVIRRVTSGTTASDLVNAGDIILSVNGKPATQ--------------------------FSDITSHYDQSHLELVLLRDGKEMHV---NTPLSTMLVSGTERVVGWAGAIFQMPHKAVYQQLNHVPS--GVLCSVVYDGSPSQLYALHPLTWVTEVNGIPTPNLDEFLVQIMKVKKDTFVRLSTVSSTRFVKVIALRPNDHYFGMWEIVRDTSVASTWRLSSF 969          
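The percentages in the HSP summary follow directly from the counts given next to them: matches divided by the alignment length. A minimal sketch of that arithmetic, using the values reported for XP_006677811.1 (the function name `percent` is illustrative, not part of any BLAST tool):

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 decimals."""
    return round(100.0 * matches / aligned_length, 2)

# Counts copied from the HSP summary for XP_006677811.1.
identity = percent(339, 1062)   # identical residues / alignment length
positives = percent(533, 1062)  # identical or conservatively substituted residues

print(f"Identity = {identity}%, Positives = {positives}%")
```

Note that the denominator is the alignment length including gaps (1062), not the length of either protein.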
The following BLAST results are available for this feature:
BLAST of NO12G02110 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name        E-value      Identity (%)  Description
GAX94281.1        1.400e-177   37.68         pro-apoptotic serine protease nma111-like protein ...
CCI41296.1        2.700e-176   36.16         unnamed protein product [Albugo candida]
PRP82470.1        4.000e-164   34.85         pro-apoptotic serine protease [Planoprotostelium f...
PRP82453.1        4.000e-164   34.85         pro-apoptotic serine protease [Planoprotostelium f...
OUU83132.1        2.100e-152   35.65         hypothetical protein CBC32_16725 [Proteobacteria b...
GBG32291.1        4.200e-145   35.62         Pro-apoptotic serine protease nma111 [Aurantiochyt...
OUV26549.1        1.300e-143   34.08         hypothetical protein CBC48_15340 [bacterium TMED88...
PLX55351.1        2.500e-142   33.69         hypothetical protein C0629_13080 [Chromatiales bac...
KPK61541.1        1.600e-141   33.24         hypothetical protein AMJ59_00520 [Gammaproteobacte...
XP_006677811.1    1.800e-140   31.92         hypothetical protein BATDEDRAFT_19279 [Batrachochy...

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name  Unique Name  Species                                       Type
nonsL126      nonsL126     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
nonsL125      nonsL125     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
nonsL124      nonsL124     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ncniR015      ncniR015     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ncniR014      ncniR014     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ngnoR008      ngnoR008     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ngnoR007      ngnoR007     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ngnoR006      ngnoR006     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name  Unique Name  Species                                      Type
NSK010136     NSK010136    Nannochloropsis salina (N. salina CCMP1776)  gene


The following gene feature(s) are orthologous to this gene:

Feature Name               Unique Name  Species                                          Type
Naga_100021g30             gene4691     Nannochloropsis gaditana (N. gaditana B-31)      gene
jgi.p|Nanoce1779_2|592304  gene_5373    Nannochloropsis oceanica (N. oceanica CCMP1779)  gene


The following polypeptide feature(s) derive from this gene:

Feature Name   Unique Name            Species                                       Type
NO12G02110.1   NO12G02110.1-protein   Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO12G02110.2   NO12G02110.2-protein   Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO12G02110.3   NO12G02110.3-protein   Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO12G02110.4   NO12G02110.4-protein   Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO12G02110.5   NO12G02110.5-protein   Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO12G02110.6   NO12G02110.6-protein   Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO12G02110.7   NO12G02110.7-protein   Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO12G02110.8   NO12G02110.8-protein   Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO12G02110.9   NO12G02110.9-protein   Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide
NO12G02110.10  NO12G02110.10-protein  Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide


The following mRNA feature(s) are a part of this gene:

Feature Name   Unique Name    Species                                       Type
NO12G02110.1   NO12G02110.1   Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO12G02110.2   NO12G02110.2   Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO12G02110.3   NO12G02110.3   Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO12G02110.4   NO12G02110.4   Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO12G02110.5   NO12G02110.5   Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO12G02110.6   NO12G02110.6   Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO12G02110.7   NO12G02110.7   Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO12G02110.8   NO12G02110.8   Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO12G02110.9   NO12G02110.9   Nannochloropsis oceanica (N. oceanica IMET1)  mRNA
NO12G02110.10  NO12G02110.10  Nannochloropsis oceanica (N. oceanica IMET1)  mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO12G02110 ID=NO12G02110|Name=NO12G02110|organism=Nannochloropsis oceanica|type=gene|length=13146bp
CGTCACGATCATCCAGATGATCATCAGCATGATGCTTCTTCGTGGCCCAT
GGAAGCCACTTAGGAGCCCCCCTGCTCCCTCTATTACCCAATCTGCCAGT
CAGTCCCCCACTTAGTGGTCGACGTATCTCCACGTCAGCAACGTAGATGC
CAAATTTTTGTGGAGTCTTGTTTAATTTACTTCACTTCCCACAAGTCCAT
TCTTCCACGGACCACAGTTTACCCCAGTTCCTCGTCTCCTCAAGAACAGT
GCCTCATAATCCCGATCGATAAAAGTGGAAGAGTTCTATTTTTTCATCTA
CAGGGGTGCTGGAATTGGAGAACATGAGCGGCTGCAGGGGGTTCTTGTCT
CGGTGGTACACAACGAAGAGATTGCCGATGGAGCAAAGCGGTGAGAAATA
GAGGTAGGTGACCCCCTGTTGAGGTAGCCCCTGTCCGTTTGTAGATGGAT
TGTGTGAGTGCGTGGACGAGATTGCAATCTTGTGCTGTGTGTTATATTGG
GCGGGCTATGCGAAAGCAAGGCAAGCGATTGGCAGCTTTTTTCTTACCCG
CCCTGTCAATGCCCTTCCTTTGTGTATTTTAAATGGCTCGTACAAATCCT
GTACACGACTCTCCAGTCCTTCCTCCCAAGAACCACAGCATCTCAGCTTC
TGCCTTGTGCAAGTGTCCCTTTACACCCACACCCACGTTCGTTCGCTTTG
CTCATCCTCCCACTCTCGCCACCGCCAAGTCCAAAATGACCACCACCACT
ACCGCCTTCTACGACAGGAGCCTGGCGCTGGCGCTGCAGGAGTAGGTGGC
TTCCTTTCAATATCCACTTTCGCTTTTGCGCCTCGGTGGCCTGGCACGCT
GGCACTTGATCTCGTCCTATTTGGACATATTCGACAGACGATCGGTCCTT
GATTTGAATGCGGTGCAGGAGTGCCCACCATCTCGGATGGGGAGGCGGAT
GAAGAGCCGTAGTACATTTTTTTTTTAATGCCGCTACTCTTTCTCTGTTT
CTTACGCGTATGCTCCAGTTCACTGTCGTTGATCGAAGCGAAAGCGCATT
TCTTGGATTGAAAGAGCGCTCGCAATGCTTTGCAAATGTGCCTCATGCCT
CTCTGGTCCCTTGTCACTGACAGAACTGCCCGTTTTATTGATGTTAGTAC
ATTTTTGTTGGGAAAGAGCCATTCCTCCAAGAGACAATCGTCGTGACTCA
CGGCCATTAGCCTCGTCTGTGTAAATTCTGAGAGTGGTCGCGCATGAAAG
TCATAGCGACGTTACTGGAGAACGCTTTTTAGAATGGTGGCTTACGCTGA
TATTTCAAGCACTCTGAATATTGGCACTGGACCAGTCTTCGAGAGCTTGC
ACGACGCTTGCAAGAGAGATGTCTCCTGGTGTAAATTAAGTTTCATACGA
TGAAGCGGCCCGAAAAAGCGTTAACGGAAGAACCATTTACGAATAATCGA
GCACCGGATTTAATCTCGGTGATCGTATGGTCAGTCTCGATGATGATTCA
TGTCTTGGCTCCGGACATCTTACGTAGGCGCTGCAGAAGGAAAATTCTCA
ACATCAAAAAGCTGAAAATTAAAATTAAAAGTTGTCAATTAAGGAGCGAC
AGGCGTCGAGCGACTGGGTCAAATATGGCGGGCCGATGGTCTTTCCAAGT
AAGAGGAGCGAAACATAAGAATAGATTCCCCAAGCTTTTACCTGGCTACA
TTTTGCGAAAGCTTTTTGACCAAAGATCACAAGCACTATCCCTATATCAA
TGTTACAACCCCCCTCAAACCCCCCTCCGCGCTCAAAATTCACAAGAAAA
CCTTGCACACATTTATAAATGGATAGGCTGAGGATTGCTCATTGAATTAG
TCCACATGGCACACTTCATGTCCTCAGGAAATGATGGCTAGTAATTTCTT
GCTGTGCCATAAAAAAAAGCACGCCGTGTATGAGGGAGGGCCTTGAAGTT
GTGCAGAATTTTCTGAACCTTTTAAACCAACGTGAAACGTTGCCATGATC
CCTCTCAAACCTTGTTTAAAGATGACCAAGTCATATCGAATGCGGCGTTC
CGCCCTTGATTAAATCGAATTCCATGCCTCTGAACACAATTCTTCCAGAG
CTTGACTTCATCTTGTCTTCAAATTTAGGATTTCATGCTGTCATTTGCAG
CCAGACTGACCCGTCCATTTTGATCGCGAGGTTTCTCTTCATTCGCACAC
ACTGGATTCTTCAAGCACCTTGTTTTGCCCTTTTCCTTTTCGTCCTACAA
CTGGCCATGGACAGATGACTCAATGTTCAGATTACACTTCGTGGTTGGGA
ATAAAAGCAAGGCATTGAAATATTCGGAGAGACTCCGAAAAAAGCCTGAA
TCAGGCGTCAAGGATATGTGTGAGCGACAAAAAGATGGATGGTTCGATGA
TCTGGCCACCCCGACTGCAACCGAGAGAACTTTACCTTACATGCGTCTAA
CATTCCTATATTGCTTTAGCCTCCAAGTCAATCATGCTCATGAACACGCT
CCATTCTGCCACAAACCGCTTGGTGGCAGGCGGCAGGCGGCGCAATAGCC
CTGTACCCCCACCTTGAATTGCTCCCGCAATAAATCCCGTTATTAAAAGT
TTGTAAGTACATAGGTAGGGTGAATAAATGTCTTTTCCATCAAAATATCG
CTGTTGTTGCAGTAGCTGCTCCATCAGCATATTTTTCTGGCGCGCCACAG
ACGAAATTGTCTGCAAAATTTTGACTGCCCGCAACACTACTTGGACACCC
CACTCAACAACAGGCCCGGCGTCATCAGATTCCGAAAATATGACTGCGAC
ACCCTCCTTGAGCACGTCGGTGCGCAATTGATCCTCAACGGTGGACTCAG
TGGTACAATGTTTGACGCCCTCGGCCCGAGCCAAGGCATTATGCATTTGG
ACTTGCAGTAACGGTAATGCCTCGTGCTGTATCAGTCCGCCTGGCATGCC
CAGCCAAGACAGTTCCCCCGTGAACAAGAAGACGAGCACCACCAAGGGTC
CAGAACGAGTGGGCTCAACAGTCAGATAGTCCAGCAACGCCAAAATCCAT
TCCCGCCGCGCTGCTGCCAAGGACATGCCTCGGGACGCCCACAAAGGCCG
GACGACCCACGGAATCAAAGAGCAGCAAAAATCTGCCGgtaaggcttctt
gaaaggcgatgcggggccccatcaattgaagccgaagtgtattgagttga
accaccggcaacattcctttttccaccagcaagacagacaatattatccc
ccccatctcgcttacgcttcccgacagACGTGTAGGTATGAGTGCTTGCT
CCAAGTGCTCCTTAACTCGATCCGGCATGGCGTGGATCCGACCGACGGCT
GCGAGGACCCGGGCGAGTGCATCTAAAAACTTGAAAGGTGGAGAATGGCT
TGAGCCTGCGCCATCCTCTCGTTCCGACAAATGGATGTTGCTAGCAGTGT
CTAGTATAAGATAAAGAGAGGAAAAGCGACGCAAAAAAGGGAGCACGTCG
GCAGTCACCAAAAACCACACCGCCTCAGCAACGGTAGAGCTCGCGCCGTT
GTCCTCGCAGCCCAGGGCCAGGCCAGTCAAGAGTTTGACAGCTCTTACTA
AGGCAGCTGTTCCTGTCTCCTCCTGGTCAAGACACTTCCTCCAAAGCACG
CTTAAAAGATGCGCCTGCAATGTTGTTGGAGTCAAGCGGACCAGGTCGTC
CAGTACATTCAGTACTACATCGACCACTGCCGATGATGAATTCACCGCAA
AATCGTACAACCAGGTGACGTAGGTCGCATCGCGCCGCGCCATAATCACT
GCAAAAGCGACACAAGCCCTGATCACCGCCAAATCGTCGTTTGAAATCTC
GGCGTGCTCGACAACTTCTAAAGGTTGTTGTTGCTGCTGTCGTCGCTTGA
GCCGGGCCTGACTCATTTTTACAAAGATTTGCAGGATGAGCGCCATTTGC
TTATGTTCGAATTTGAGAACCCGGCACGACATCAACGCACCCAACGTCGA
TACGACAAAACCATTTGAATCCGCAGGATCCACGTCGCCTGCACCGGCTG
CCAAGCGAAGCAATTCGGCTAGAACGAGGCGCGACAAAGTCCCCTCACAA
GGTAAGCCGAAGGTGCTGGCGTCCAGAGGACCTCCCCCTAATCCCCGGCC
GCGTCCTGAGTGCACACCACCTGGAACCCTTTCACTCCCTTTGTCCTCCT
TCTCATCCCTTAAAGCCACCAACAAGCCCAATACGCGCGCTGCACCACAG
CGTTGTCGCTGACTGGGCGATTGTAAAATGCCGCAACGAAGTGCCTCCAC
AAGATCGATTGTTAAGCCCGCTGTAGCTCGGTCAGCGCTTAGCCCTGCCC
CGACTTCCTTGCCCATGATGTCCAAAAGAGGCCCGCCGCTCGCCAGGATT
GTATACATGCTCAAACAAGCTGCCAGTGTACGCGTTTGTTGTTGCGGCTT
TTCTGATGCTGCTCCTGTTCCTGTGCTCTGCAGCGTTTTTACAAGCGACA
CCAGCCGACGCTTGAGCCCATCCATTAAGCATACCGGGAGGATCACAATG
TCTTGGATGCTGACAGCTAAATGTGCGACGCCCAGGACAAGGGCACCATT
ATCGTCGTCATTCTTGTTCACTTCTTCCCCTGATTTGTCGATGGGTATCA
CGTTCCCCACCTGACTCGCTAATTTTTCCAGCAACGCGAGAATTTCCCTC
ACATCCAGCAAATCCGCCTTGAGCGCCAATGCTGCTACCACTCCTAAGGG
GAAATTGACAGTCGTAATTCCTGCCTGAGCTGCAGCCCGTAAAATGTGGT
AGAGTTGCAAAAGTGACTGCTTTAGCCCGACATGTGCCAAATTGGGAGCC
ATGATGGCCAGCGCCCACGATGCACCCACTACGGCAAGCTCTTTATTGTT
GATGCTGTCTATCGCCGAGGAGCATGAGGACAAGGACGAGGAAGACACTA
CCTCGCCCACATCGATCGCTAGCCAGGACACCGTGGCGGATCTCGTGCTT
GAACTTGCGCCATTCTCGCTCCCACGTTCTTGCATCAGCCCCTCCAAGGA
AAAGAATCCTGTGGGCCCCCGGAGCATGCTCGCCAACGCATTCGAATGCA
ATGCTTGCTCTTGCAACGTGACTAAACAGTTCCGGATTACAATGACCAGG
AGTTCAAATGTAGTAGCGTCGGGAGAGTGAATTTTCCGCACCCAATCTGT
CAGCACACCTGCCCCCACGAGCGCACCCCACTGGCACCATCCCCAGTCCG
TGCGGGCAGTAAAAGCCGTCTGCGTGTGCTCTACAAACACCTCTCGAACC
AATGTTACTACGCACTCAGCATCAGCAGGCCCTAAACATCGACCCGCAAG
CCCCAAAGAAACGAGGACTGTGGGAATGTCCAAGGAAATGCCCGTGGCTG
CGTCCGACATGAGGGCTGCTTGGAGCGTAGCCACCACCTCGGCGGTCTTG
TAAGAAAGTTGAGGCGGCAGGGCGTTGACCAACGCTGCCAATGCCAGGAA
GCAATTATTCACGTCGCGCCCTTGCTTTCGATGCTGATCCATTGTCGATT
GGATAATAGCTTCCACTTCTTCCGCGGCCTCCAAAGTACCCGATAATGAC
TCCGGGCGGCTGGCAGCGCGATCCTGAATCAAAGCAGGAAGAAAGCGCAA
GAAGCTATGAGGTGCGAGCAGACGCTGTACAACGCAACGCCCACCCCCAT
CATCATCACTCAACACGTCTACAAAGACATCCACCATGTTACTCGGCCCC
ATTGTTGTCGACGTTGAGGATGTCAGCCGCCATTTCCTGCCCAAGGCATG
CAAGGTGGCCCCTGCCAGTCCGGGAACGTCCCCATGGTGCGTTTGGTACA
AGCTCAAAATCTCTTCTGCTGTAGGAAGAGACTTAAGTGTTTTACGAGAG
GGTCTCGCACCAATTGTAGCAGCAAGGGCTGTTGCTGCGGCGTTGACGGT
GCCTTGACTGGCATCAGCCCCACTATCTCCAGTGCGTAGTAGCGTATGGC
GCCACGTACCCGAATCCTCCGTCTCGAACAAAAGCGCCGATCTAAGCAAA
TTTATAAGCCCCCCCCGGGCTCTCACATCGCCGTCCTCCTGCAGCGCACG
AAAGATACGGCGCCGCAGACCCAAAGCGACCTCTTCCTCCTCGCTCGCAA
GAAGTGCGGGCACGTACGCTGCAAGCGCCAAATAAGCCTGTCGTCGAACG
ACTGCAGAGGGCTGCTGCGCGAGCAGCCAAAGATTATCTGTCAAATTTGC
CAGGTGATGTTTACGTATGTGCGGCCGTCGTTCCTCATCCTCGTTGCTAC
CATACTTCATGTAAGCGCCTCCAGAGTTCTCTCCACCTTCCTTTTCGCTC
TCGATGTCTATTTGGGGGCTGCAGTCAAAGATGTAGGCTCCTGCGCCAAA
GAAATCTGCCAGATTTGCTAGGACCAGTGGATGCTCTTTGAATTGGACGC
GGCCCTCCTTCTGCAAAATACGCGTCGCCGCCATATAGTTGAGGCAGTCA
GCTGAGCAGAGATATGTTATACAGTCTACTGCTGTGGCCACCACGCCAGG
CAGATGGGCGTCCATCAAGTACGTCTGTAGCAACCCAATAAACTCCGAAC
CAGCCTCCGGGTCCTCGCGCACTACATCAGAAATAGCCAGCACGCGCGCC
AGGCGGATCTCACCCTCCTCGCCACTCAAATTCCCGGAGGCAACGGCTGA
TGAGGTCACATCGGCTTCTGCCGCCAGCACTAATACTCGTAACCCAGAGA
ATGTTCGAGTGTTGCTCCGATAGAGAGCTGCAGATAAGGACAAAACAGCC
GTATTCAGATTCACCATGCAGATGCCTGCGCGATCTTGTAGTTGGCTTAC
CCCTTTAGAAACATGAGCGGCTTTGCTAGCCTCAACAACAGCTGGCGTGG
ATGCGAGACGCTTGAGAAAAGCTTGCACCAACTGCGCTCCCACACCATGC
GACCCCAAGGCCGGCAGGAACCGCAACACCCGAGATTGGAGGCGTACTAC
GGGCATCATTGTTTTAACTAACGCAGCCCCTGAAATTAAATTAATAGTCT
TCCCGCATCGGTCCAGGCGGTAAAGCACGGCGGGAATCAGTCGCGCCCCC
ACAAAACCCTTCTTGTCCTGAGCCACAAGGGCATCCAACAGACCTTGCGC
ATCCTCGGCGATGGTCACCTCATCTGATTCTTGCACCGTGTCCTCATTAA
AAAACAAACCCATGACCAAAAACAGAGCCCATTCCCGCTGCCACATTGTC
ACGGGATCGTCATCAGGCGCTTCCTTAAGGCTGTTGACCAACGAGGCCAC
TGACTGCTGGGAGAAACTCGTGGGGGGTTGCATCAGAAGCGTTCGTAAGC
TTCGTCCTGGCCCTGAATCAGACAATGGCACAGAAGAGGGTGCCAGCTGT
AGGCGCCGCTCGACGACTGCGATAAGTGGAGCCGCCGCATCACTTACACT
GATCAAGCTTATGAGTGGGTATAGACCCCAGCGCAGGGCATGTGACGTAA
AACATGAGAGATCTCCACCCTCCCTATCATTGTTACCATTCACGATGGCC
TGAAGAACGGCTAGCAATGTATTCCGTTCTCTACCACAGACGCCTGCCTT
GCCCATGGCCACCAGGACGTAGGAAAGGAGGGGGAAGTCAGGGGCCAACA
ATATCGCATCGTGCTCGGCCAGACGCTCCAACATGACGATGAGGGGCAAA
AGCATGGACATAGACAACACAGCCGTCGATCCTTGGACGACCACATGCAG
GGCGAGCTTGACAAGATGATGGGCGAGCCGACGGCCAACGATACGAACAT
ACCTCGGAGCCTCTAGGTGGCACACGAGGTGAAAGACAATCTTAGTAAAC
TGCACCACCAGCTGAGCATCTTTAGCTGCGGACGATCCGCTCAATCCCCC
GCTTCCGATAGTCGACGACGAAGCAGGACTCAGCCTCTTATTTCCTCTTG
AAAACGCCAAAGCAGGCCAGGGTAAGAAATTTAAGAGCTCCAGGTAGCAT
GAGGCTAAGGCCACCAAAGCACTTTGTGGAATGTAGTCAAGGGGCATAGT
ACCCAATGCTTCCACAAAAACGGCCTTGCTTTCACATGATTGCGCCTCCA
GAAGACCGGCAACCAAGAGAGGGGCTTGTAATGCCACTAGTGACACCTCG
GGATGTGCCGGTAACTTTTGAGCTTCGAAATAGCTGGCCTCGGCCAGAAC
CAAGGGCGTTATACGAACCTGGTCGAAGCCTAGAGCCAGGACTAGAGGGG
TCTGTTCACACCAAAGCAAAGCAGCGTCCATATTTCCGCCGCCATACTCT
TGGGAAACACCATCCAGCATCATCTGGGTTAAGGTGCGTATGTACAAGGC
CAGCAAGCTTTGCACCTGCTGCTGTTGATATTCATGAGGAAGAAAGGACG
CACAGAGCAGAGTCTCCTCCAAGACAGCCAAGATTCGCTTAATGATGACT
CCCACTTCTGCCTTTCCACGCCGCACCAATTTCTCCAGCCCATGACACAC
ACTGCTTGCGACAGCTAAGGTGGGAAGGGTCAAGGAGGAGGATGTGCCCC
GTCCGAACTCTGCGTCATCGTTATCATCGTCGTCGCCGCCGCGGGTACGC
ACGGAGCCTTGACGATCTGGTATGCTTATTACATGTTTACTGCTGCCGTC
GCAGCTGCTGCTGCCCGTGGCCACGAGCGGGAAGAGGACGTCGATGTTGC
CTTCCTCAAGCAACCCCTTGATGGCAACCTCCTGCTGCCAGGGGTCTTCG
AGTTTGACCCGGATTAGCAACTTCTGCTGCCGTCGCTGCCGCTGTGCTGA
TTCCTTAGCGCTGTTCATGTCAGCTGGAAAGTCTAGCTGTGTTTGTCGTG
TGGTATGATGAAATTGGTGGTGTTCAGCGAGAAGGTTGAGGGTTTGGAGG
GAGGGTAGGAAACACCCACGAATGCCCAAGCAAAGTGTTCGCTAAAAAGG
AGCATATCGCATCGCAGCTCTCAATCGCCATCGCAAGCTTCGCATCGGTT
TTGGTGGTTGGTGTTCCGTGTCCCGCCCACTGCAGTCGTGTCGAGGCGAG
ATGGTGACAGGACATGCAGGTGTCCGTGTAGGTTGATGCGAGGgtaagta
taatttacacttggtccagcaatttaaaaaattggccgccgaagcttgga
aatgtgttgtattctaaatctgacacctttctcgtatctctcccaccctc
taccacggcagGTCCAGCGGCCCAGCTTCGTAACCATCCATGGAGGGCCC
CATCATCGAGGCCGTGCCCCCTCCGCCTGCTACCCCCTCGCTCCATCAAC
AATTTTCCGAGCAGCAGCTGGAGCAGCCCCTGGCGCTGCAGCAGAATCCG
CAGCAGCCCACTTGGCAGCAATGCCTTGATCACGCCATCCCAGCATgtat
gcctcttctctgtctcctttggagaagcatcaatgcgtcactcacccctg
ttctattttctttacctcctccctctatctctgacagCCATCAATATCCG
CGTCCTCCAGGTGAAGAGCTTCGAGGCCACGGGATCCAGCTACTCGTATG
CCAGCGGATTCGTTGTCTCGCTCAAGCACCGTCTGATTTTGACCAATCGC
CATGTGGTGTGTGGAGGAGCTCCAACAGTCATCGACGGCGTTCTACACAA
CAAGGAGGAGTTGGCGCTGAAGGTCCTGTACATTGATCCCATCCACGATT
TCGCCTTTCTCTCCTTTGACGCCACCCAAATCAAGTACCACAAACTGGAG
GAGATTGAGCTTGATCCGGAGGGAGCACGAGTTGGTCGCGAGATCCGGAT
CATGGgtgcgtcgggatggtgtggcggcagtcaagatatgtgagtatttg
tctgacctcttttctcatacatttcccttctcctttcaatacagGCAATG
ATGCAGGTGAGAAAATCCAGATTTTGCCGGGCATTATTGCACGTCTTGAC
CGTGACTCCCCTTTTTACGGGGGCGGCTACAACGACTGGAACACGTTTTA
TATCAGTGCTGCCTCGTCCACCTCTGGGGGGTCATCGGGTTCGCCGGTCT
TGGCGTCGGATGGCAAGGCGGTTGCATTGAATTGTGGGGCCGCCAAGACA
GCGGCCTCATCCTTCTACTTGCCGCTTCACCGGGTCAAACGGGCCTTGGA
TCTCATCATTGACTTCAAGGAGCGTGTCCGCAGCGACGAGGGGGGTGAAG
GTGAGGGAGAACTGCGCGGGCCTCGCATCCCTCCCCCTTGCTCCTCCTCG
ACACTCATCCCTCGCGGGACACTGCTCACCATTTTCAAGCACAAGGCGTT
TGATGAGCTTTGTCGCTTGGGATTGACAGCCGAGGTTCAGGAGAGCGTGC
GCGCGGCGTTCCCCTCCGAGACTGGTATGCTGGTGGTTGAGCAAGGCCTG
AAGAATGGGCCGGCTGAGGGGATTCTGGAGCCTGGTGATATTTTGCTGGA
AGTTGGTGGGGTGCTTTGTACGACCTTTTTGCCGCTAGAAGATGTCCTAG
ATTCGCATGTGGGCGAGAAAGTAATTTTGAAGATACAGCGGGGTGGGAGG
GACCGGGTTGTGGATGTTGAGATTCGGGACTTGCATGCCTTGATGCCCCA
CACACTCTTTACCATCAGCTCTGCTGTTCTCCACCCGGTCACTGTACATA
TGGCCAAGTCCTATCACCTGGAGGCGGGAACGGGCGTGCATCTTGCGAGC
ACGGGTTACATGTTTAGTCGCGCAGGCATCACTATTCACTGCATCATATT
GAGTGTGGGTGGGAGGCCGACTCCGACGATTGCGGCGTTGGAGCAGGTCT
TCGCCATCCTTCCCGACGGCCACCGCACTAGTGTGAGCTATCACAGCCTG
ACGGATCGATTTCGTGTACGCTCAGCCATAATCCAGGTAGACAAAAAGTG
GTTCGCGACTGGTATGTACGTGAATGCCCGGGAAGACGACGACAATACCC
TGTGCTGGACCTTCCAACCCTGCAAGGCCAGTAGCGAGGTGTGTCTTCCT
GCTGTCCTGCCCACGACCATGAGCTTCACGACGCCAAGCGGGGACGTGCC
CGTCCCAGTCCCTCCAGCTTCTACGACTTCTTCAACATCCACCGGCATAA
CACGCGGCACCACCACCAGCAACAAGCCCAAGGACTCACTCGTCATGGTT
TCCTTTGCCGTGCCTTATCTGGTCGATGGCATGAGCGGAGCCGAGTATGT
AGGAGCGGGAATTGTGATTGATAAGTCGCAGGGCCTAGTCCTGGTGGATC
GAAGCACGGCCCCTGCTACTATTGGGGATGGTCTCGTCACTTTTGGAGGC
GTGCTGGAGGTTCCGGCCAAATGTGTCTTTGTGCATCCACTTCACAATAT
TGCGGTATTGCAGTACGACGCGTCCTCCTTGGAGGCGGAGGCGCAGAGCG
GGTTGGTGACTGTGGCTGAGGCTGAGTTTGCGGAGGCGCCGCCATTGCAT
GTCGGAGAGTCGTTGATTTTCCATGGGCTTTCCTCACCTTTCATTCCTGT
GACGCAACGGGTGACCATCACCAAGAAAGAGACGCTGAAGCTTTCGGATG
GACGCCCTCCCACCTTTGTCTCGACCAATGCCAGCGTCTTTCATTGCGAC
CATGCCCCTGGCGCGAATGGTATGTTTGTCCTTGCACCTTCAGCGGGTGG
TGACGTAAGCAGCAGCAATGGCGGTGGCAGCAGCAGCAGCAGCAGCAGCA
GCGACAACCATCATTCTGGCAAAAACATGGTCGTGGCGCTGCGTCTGTGC
TATGCGTACTCGCACGAGGGGGAAATCAAGCAGATCTACCGAAGCGTGTC
ATCGGATATTGTCACGGAGACAGTACGACTGCTGAAGCGTGGAGGGCCGC
CGCCACGTACGCTTGATTTGCTTCCAATCAAGTTTACGCAAATGCAGTTG
TCTAAGGTTAAGGCAGGGATGGGCTTGTCCGAGACACAATTTCGGCGCTT
GAGCAAAAAGATCCAGGCGACGGAGAGCGAGGCGTCGTTGTTGGTAGTTC
GCCGTATACCGACGGAGCTCGGGGCGCTCGTCAAGACAGGAGATATACTG
CTTAGTGTGGATGGAGAGCCCGTAGTGCAGCCTGTGGAGCTCGAGCGTGC
GGTAGTGGCGTGGCAGGGTGGCAGCACCAGCAATAGTGGCAAGGATGCTA
GCGGGACTGTGGAAGAGACCAGAGAGGCAGCCAAAGAGGAAGGGATTACG
TTGGTGGTCCTACGCGAGCACAAAGAGGTGGCGATTCAGGTGCGCACGAC
ACCGCTCTCCACCCTTGGGACCGATCGTGTCTTACTGTGGTGTGGGCTCA
TGCTGCAAGAGGCACATTTGCCTGTCTCTTTCCTCGGCTACAAGCCAGCG
CAAGGCGCCGTCTACGTTTCTCGCTGGATGTTTGGTAGTCCGTCGCATAA
ATATGGCTTACGGGCATCCTTTTTCATCGAGTCCATCAACGGGATTGCTA
CTCCTGATCTGAACACCTTTTTGAGCACCGTGACCAACACCCTTCCTGAT
GCATCCATCCGAATCAAGGGGTTGGATTTAGATAGCAAGGGAAAGAGCTA
CACGCTGAAGCCTTGTTATCAATACTGGCCAACTACCGAGATTGTCTTCA
ACTATGAGCTGCATGAATGGCGACTGCACCATTACCCCGCAAGACTCAAG
GCCGTATTGCATGCGGCGTAAGTGACAAGCGTTGTGAGAAATGCGACAAG
GGGGTGAAGACTAATTTTGTATTCTGGTTGAGAATAGAAGTATTAAGCAC
AGTTGCGCGATTTTTCTGCTGAGTATACAGTTGGAAGGAGAGAGACAGTT
GCTTAACTTATGCGTGGTTGCTGGCCGTAATTTTTGTGATAGCAAAAACA
ATAACAATGGGATGACAACGGGAAGAAAGACGTCCATCCACTCCGAAAAA
TGAAAAGACGAAAGGATGGGCAAGAAGATATCGCAATAGGCTGCAATGCG
ACATTCTCCGACGCCTCAGAATAATTGAAAAGATAGAAATTAGGCGGTGT
AGAATTCTTGTGGCAAAAAGATTGCACCTTGAAAGAATGTGAAGAAATAT
GAATAAAAAATGAGAGCGGATGTAGCGAGTGCAGTTGTCCAAAATACAAA
TAAATAAGAAAGAGTACAAAAATGGAGGAAAAAAAACACCTCGGGG
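
The gene sequence above mixes uppercase and lowercase runs. The page does not say what the case means; a common convention, assumed here, is that lowercase marks intronic (or otherwise non-exonic) stretches inside the gene model. Under that assumption, a short sketch for listing the same-case runs with their coordinates might look like this (the helper `case_segments` is hypothetical, and the excerpt is the tail of one sequence line above, where the first lowercase run begins):

```python
import re

def case_segments(seq: str):
    """Yield (kind, start, end, text) for each maximal same-case run.

    Coordinates are 1-based and inclusive, relative to the string passed in.
    """
    for m in re.finditer(r"[A-Z]+|[a-z]+", seq):
        kind = "upper" if m.group().isupper() else "lower"
        yield kind, m.start() + 1, m.end(), m.group()

# Short excerpt copied from the gene sequence (end of an uppercase run, start of a lowercase run).
excerpt = "AAATCTGCCGgtaaggcttctt"
segments = list(case_segments(excerpt))
for kind, start, end, text in segments:
    print(kind, start, end, text)
```

To apply this to the whole gene, the FASTA lines would first be joined into one string without the header; coordinates would then be relative to the start of the gene, not to chr12.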

protein sequence of NO12G02110.1

>NO12G02110.1-protein ID=NO12G02110.1-protein|Name=NO12G02110.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1084bp
MEGPIIEAVPPPPATPSLHQQFSEQQLEQPLALQQNPQQPTWQQCLDHAI
PASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTV
IDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGAR
VGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGGYNDWNTFYISAASS
TSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFK
ERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRL
GLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLC
TTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTIS
SAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRP
TPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMY
VNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVPVPPA
STTSSTSTGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVI
DKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYD
ASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTI
TKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAGGDVSSSN
GGGSSSSSSSSDNHHSGKNMVVALRLCYAYSHEGEIKQIYRSVSSDIVTE
TVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQA
TESEASLLVVRRIPTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQG
GSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLG
TDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRAS
FFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCY
QYWPTTEIVFNYELHEWRLHHYPARLKAVLHAA*
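
The FASTA headers on this page share one layout: a record identifier, a space, then `key=value` pairs separated by `|`. A minimal sketch for reading such a header into a dictionary follows; the function name `parse_header` is illustrative, and the layout is inferred from the headers shown here, not from any documented format.

```python
def parse_header(line: str) -> dict:
    """Split a '>id key=value|key=value|...' FASTA header into a dict."""
    if not line.startswith(">"):
        raise ValueError("not a FASTA header line")
    record_id, _, rest = line[1:].partition(" ")
    fields = {"record_id": record_id}
    for pair in rest.split("|"):
        key, _, value = pair.partition("=")
        fields[key] = value
    return fields

header = (">NO12G02110.1-protein ID=NO12G02110.1-protein|Name=NO12G02110.1|"
          "organism=Nannochloropsis oceanica|type=polypeptide|length=1084bp")
fields = parse_header(header)

# The page writes the length with a 'bp' suffix even for polypeptides; strip it to get a number.
declared_length = int(fields["length"].removesuffix("bp"))
print(fields["Name"], fields["type"], declared_length)
```

For the polypeptide records the declared length appears to count the terminal `*` as well: the record above has 1083 residue letters plus the stop symbol. `str.removesuffix` requires Python 3.9 or later.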

protein sequence of NO12G02110.2

>NO12G02110.2-protein ID=NO12G02110.2-protein|Name=NO12G02110.2|organism=Nannochloropsis oceanica|type=polypeptide|length=1084bp
MEGPIIEAVPPPPATPSLHQQFSEQQLEQPLALQQNPQQPTWQQCLDHAI
PASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTV
IDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGAR
VGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGGYNDWNTFYISAASS
TSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFK
ERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRL
GLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLC
TTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTIS
SAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRP
TPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMY
VNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVPVPPA
STTSSTSTGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVI
DKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYD
ASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTI
TKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAGGDVSSSN
GGGSSSSSSSSDNHHSGKNMVVALRLCYAYSHEGEIKQIYRSVSSDIVTE
TVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQA
TESEASLLVVRRIPTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQG
GSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLG
TDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRAS
FFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCY
QYWPTTEIVFNYELHEWRLHHYPARLKAVLHAA*

protein sequence of NO12G02110.3

>NO12G02110.3-protein ID=NO12G02110.3-protein|Name=NO12G02110.3|organism=Nannochloropsis oceanica|type=polypeptide|length=1084bp
MEGPIIEAVPPPPATPSLHQQFSEQQLEQPLALQQNPQQPTWQQCLDHAI
PASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTV
IDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGAR
VGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGGYNDWNTFYISAASS
TSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFK
ERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRL
GLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLC
TTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTIS
SAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRP
TPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMY
VNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVPVPPA
STTSSTSTGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVI
DKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYD
ASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTI
TKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAGGDVSSSN
GGGSSSSSSSSDNHHSGKNMVVALRLCYAYSHEGEIKQIYRSVSSDIVTE
TVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQA
TESEASLLVVRRIPTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQG
GSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLG
TDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRAS
FFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCY
QYWPTTEIVFNYELHEWRLHHYPARLKAVLHAA*

protein sequence of NO12G02110.4

>NO12G02110.4-protein ID=NO12G02110.4-protein|Name=NO12G02110.4|organism=Nannochloropsis oceanica|type=polypeptide|length=1084bp
MEGPIIEAVPPPPATPSLHQQFSEQQLEQPLALQQNPQQPTWQQCLDHAI
PASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTV
IDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGAR
VGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGGYNDWNTFYISAASS
TSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFK
ERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRL
GLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLC
TTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTIS
SAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRP
TPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMY
VNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVPVPPA
STTSSTSTGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVI
DKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYD
ASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTI
TKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAGGDVSSSN
GGGSSSSSSSSDNHHSGKNMVVALRLCYAYSHEGEIKQIYRSVSSDIVTE
TVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQA
TESEASLLVVRRIPTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQG
GSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLG
TDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRAS
FFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCY
QYWPTTEIVFNYELHEWRLHHYPARLKAVLHAA*

protein sequence of NO12G02110.5

>NO12G02110.5-protein ID=NO12G02110.5-protein|Name=NO12G02110.5|organism=Nannochloropsis oceanica|type=polypeptide|length=1084bp
MEGPIIEAVPPPPATPSLHQQFSEQQLEQPLALQQNPQQPTWQQCLDHAI
PASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTV
IDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGAR
VGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGGYNDWNTFYISAASS
TSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFK
ERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRL
GLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLC
TTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTIS
SAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRP
TPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMY
VNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVPVPPA
STTSSTSTGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVI
DKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYD
ASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTI
TKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAGGDVSSSN
GGGSSSSSSSSDNHHSGKNMVVALRLCYAYSHEGEIKQIYRSVSSDIVTE
TVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQA
TESEASLLVVRRIPTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQG
GSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLG
TDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRAS
FFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCY
QYWPTTEIVFNYELHEWRLHHYPARLKAVLHAA*

protein sequence of NO12G02110.6

>NO12G02110.6-protein ID=NO12G02110.6-protein|Name=NO12G02110.6|organism=Nannochloropsis oceanica|type=polypeptide|length=1084bp
MEGPIIEAVPPPPATPSLHQQFSEQQLEQPLALQQNPQQPTWQQCLDHAI
PASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTV
IDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGAR
VGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGGYNDWNTFYISAASS
TSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFK
ERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRL
GLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLC
TTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTIS
SAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRP
TPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMY
VNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVPVPPA
STTSSTSTGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVI
DKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYD
ASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTI
TKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAGGDVSSSN
GGGSSSSSSSSDNHHSGKNMVVALRLCYAYSHEGEIKQIYRSVSSDIVTE
TVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQA
TESEASLLVVRRIPTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQG
GSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLG
TDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRAS
FFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCY
QYWPTTEIVFNYELHEWRLHHYPARLKAVLHAA*

protein sequence of NO12G02110.7

>NO12G02110.7-protein ID=NO12G02110.7-protein|Name=NO12G02110.7|organism=Nannochloropsis oceanica|type=polypeptide|length=1084bp
MEGPIIEAVPPPPATPSLHQQFSEQQLEQPLALQQNPQQPTWQQCLDHAI
PASINIRVLQVKSFEATGSSYSYASGFVVSLKHRLILTNRHVVCGGAPTV
IDGVLHNKEELALKVLYIDPIHDFAFLSFDATQIKYHKLEEIELDPEGAR
VGREIRIMGNDAGEKIQILPGIIARLDRDSPFYGGGYNDWNTFYISAASS
TSGGSSGSPVLASDGKAVALNCGAAKTAASSFYLPLHRVKRALDLIIDFK
ERVRSDEGGEGEGELRGPRIPPPCSSSTLIPRGTLLTIFKHKAFDELCRL
GLTAEVQESVRAAFPSETGMLVVEQGLKNGPAEGILEPGDILLEVGGVLC
TTFLPLEDVLDSHVGEKVILKIQRGGRDRVVDVEIRDLHALMPHTLFTIS
SAVLHPVTVHMAKSYHLEAGTGVHLASTGYMFSRAGITIHCIILSVGGRP
TPTIAALEQVFAILPDGHRTSVSYHSLTDRFRVRSAIIQVDKKWFATGMY
VNAREDDDNTLCWTFQPCKASSEVCLPAVLPTTMSFTTPSGDVPVPVPPA
STTSSTSTGITRGTTTSNKPKDSLVMVSFAVPYLVDGMSGAEYVGAGIVI
DKSQGLVLVDRSTAPATIGDGLVTFGGVLEVPAKCVFVHPLHNIAVLQYD
ASSLEAEAQSGLVTVAEAEFAEAPPLHVGESLIFHGLSSPFIPVTQRVTI
TKKETLKLSDGRPPTFVSTNASVFHCDHAPGANGMFVLAPSAGGDVSSSN
GGGSSSSSSSSDNHHSGKNMVVALRLCYAYSHEGEIKQIYRSVSSDIVTE
TVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKAGMGLSETQFRRLSKKIQA
TESEASLLVVRRIPTELGALVKTGDILLSVDGEPVVQPVELERAVVAWQG
GSTSNSGKDASGTVEETREAAKEEGITLVVLREHKEVAIQVRTTPLSTLG
TDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVYVSRWMFGSPSHKYGLRAS
FFIESINGIATPDLNTFLSTVTNTLPDASIRIKGLDLDSKGKSYTLKPCY
QYWPTTEIVFNYELHEWRLHHYPARLKAVLHAA*

protein sequence of NO12G02110.8

>NO12G02110.8-protein ID=NO12G02110.8-protein|Name=NO12G02110.8|organism=Nannochloropsis oceanica|type=polypeptide|length=952bp
MVWRQSRYVSICLTSFLIHFPSPFNTGNDAGEKIQILPGIIARLDRDSPF
YGGGYNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSF
YLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPR
GTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPA
EGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVD
VEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMF
SRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFR
VRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCLPAVLPT
TMSFTTPSGDVPVPVPPASTTSSTSTGITRGTTTSNKPKDSLVMVSFAVP
YLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVP
AKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESL
IFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGA
NGMFVLAPSAGGDVSSSNGGGSSSSSSSSDNHHSGKNMVVALRLCYAYSH
EGEIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKA
GMGLSETQFRRLSKKIQATESEASLLVVRRIPTELGALVKTGDILLSVDG
EPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLR
EHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVY
VSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRI
KGLDLDSKGKSYTLKPCYQYWPTTEIVFNYELHEWRLHHYPARLKAVLHA
A*

protein sequence of NO12G02110.9

>NO12G02110.9-protein ID=NO12G02110.9-protein|Name=NO12G02110.9|organism=Nannochloropsis oceanica|type=polypeptide|length=952bp
MVWRQSRYVSICLTSFLIHFPSPFNTGNDAGEKIQILPGIIARLDRDSPF
YGGGYNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSF
YLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPR
GTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPA
EGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVD
VEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMF
SRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFR
VRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCLPAVLPT
TMSFTTPSGDVPVPVPPASTTSSTSTGITRGTTTSNKPKDSLVMVSFAVP
YLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVP
AKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESL
IFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGA
NGMFVLAPSAGGDVSSSNGGGSSSSSSSSDNHHSGKNMVVALRLCYAYSH
EGEIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKA
GMGLSETQFRRLSKKIQATESEASLLVVRRIPTELGALVKTGDILLSVDG
EPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLR
EHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVY
VSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRI
KGLDLDSKGKSYTLKPCYQYWPTTEIVFNYELHEWRLHHYPARLKAVLHA
A*

protein sequence of NO12G02110.10

>NO12G02110.10-protein ID=NO12G02110.10-protein|Name=NO12G02110.10|organism=Nannochloropsis oceanica|type=polypeptide|length=952bp
MVWRQSRYVSICLTSFLIHFPSPFNTGNDAGEKIQILPGIIARLDRDSPF
YGGGYNDWNTFYISAASSTSGGSSGSPVLASDGKAVALNCGAAKTAASSF
YLPLHRVKRALDLIIDFKERVRSDEGGEGEGELRGPRIPPPCSSSTLIPR
GTLLTIFKHKAFDELCRLGLTAEVQESVRAAFPSETGMLVVEQGLKNGPA
EGILEPGDILLEVGGVLCTTFLPLEDVLDSHVGEKVILKIQRGGRDRVVD
VEIRDLHALMPHTLFTISSAVLHPVTVHMAKSYHLEAGTGVHLASTGYMF
SRAGITIHCIILSVGGRPTPTIAALEQVFAILPDGHRTSVSYHSLTDRFR
VRSAIIQVDKKWFATGMYVNAREDDDNTLCWTFQPCKASSEVCLPAVLPT
TMSFTTPSGDVPVPVPPASTTSSTSTGITRGTTTSNKPKDSLVMVSFAVP
YLVDGMSGAEYVGAGIVIDKSQGLVLVDRSTAPATIGDGLVTFGGVLEVP
AKCVFVHPLHNIAVLQYDASSLEAEAQSGLVTVAEAEFAEAPPLHVGESL
IFHGLSSPFIPVTQRVTITKKETLKLSDGRPPTFVSTNASVFHCDHAPGA
NGMFVLAPSAGGDVSSSNGGGSSSSSSSSDNHHSGKNMVVALRLCYAYSH
EGEIKQIYRSVSSDIVTETVRLLKRGGPPPRTLDLLPIKFTQMQLSKVKA
GMGLSETQFRRLSKKIQATESEASLLVVRRIPTELGALVKTGDILLSVDG
EPVVQPVELERAVVAWQGGSTSNSGKDASGTVEETREAAKEEGITLVVLR
EHKEVAIQVRTTPLSTLGTDRVLLWCGLMLQEAHLPVSFLGYKPAQGAVY
VSRWMFGSPSHKYGLRASFFIESINGIATPDLNTFLSTVTNTLPDASIRI
KGLDLDSKGKSYTLKPCYQYWPTTEIVFNYELHEWRLHHYPARLKAVLHA
A*
Synonyms
Publications