EWM26231.1, cds4426 (CDS) Nannochloropsis gaditana

Overview
NameEWM26231.1
Unique Namecds4426
TypeCDS
OrganismNannochloropsis gaditana (N. gaditana B-31)
Alignment locationCM002463.1:586180..586222 +
Alignment locationCM002463.1:586495..589130 +

Link to JBrowse

Properties
Property NameValue
Protein idEWM26231.1
Productdna mismatch repair protein
Orig transcript idgnl|cribi|Naga_100025g56.7522.mrna
GeneMSH1B
GbkeyCDS
Mutants
Expression
No biomaterial libraries express this feature.
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
CM002463.1supercontigCM002463.1:586180..586222 +
CM002463.1supercontigCM002463.1:586495..589130 +
Analyses
This CDS is derived from or has results from the following analyses
Analysis NameDate Performed
GO annotation for N. gaditana B312020-04-08
BLAST analysis for N. gaditana B-312020-04-07
InterPro analysis for N. gaditana B-312020-04-06
Gene prediction for N. gaditana B-312014-02-18
Annotated Terms
The following terms have been associated with this CDS:
Vocabulary: Biological Process
TermDefinition
GO:0006950response to stress
GO:0006259DNA metabolic process
GO:0045910negative regulation of DNA recombination
GO:0006298mismatch repair
Vocabulary: Molecular Function
TermDefinition
GO:0004518nuclease activity
GO:0000166nucleotide binding
GO:0003677DNA binding
GO:0016887ATPase activity
GO:0005524ATP binding
GO:0030983mismatched DNA binding
Vocabulary: INTERPRO
TermDefinition
IPR005747MutS2
IPR027417P-loop_NTPase
IPR002625Smr/MutS2_C
IPR007696DNA_mismatch_repair_MutS_core
IPR000432DNA_mismatch_repair_MutS_C
Homology
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|585108230|gb|EWM26231.1| (dna mismatch repair protein [Nannochloropsis gaditana])

HSP 1 Score: 1806.19 bits (4677), Expect = 0.000e+0
Identity = 892/892 (100.00%), Postives = 892/892 (100.00%), Query Frame = 0
Query:    1 MVLSWLLINLFLTSELLRAARIRKNSNRYQKNKGSDDSDPQPADTAPIPVGLEDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKELARSPCFAENVEEVRRRYKAVEEIWQSADVIPLNDPMDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFVQAPVRKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDEYIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRNRNPIGNTLHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDVGLGDTKTSVEEALKELAAVKAKVAEESKPEVVHVPANLVLLKRDDILKEGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAARVNAGPYAGLAKNKDRKKKWTKADERNLKLLGQEVRHASTPAAITLNAMRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVSRHRAADDSDGGDAFTQVFLK 892
            MVLSWLLINLFLTSELLRAARIRKNSNRYQKNKGSDDSDPQPADTAPIPVGLEDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKELARSPCFAENVEEVRRRYKAVEEIWQSADVIPLNDPMDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFVQAPVRKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDEYIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRNRNPIGNTLHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDVGLGDTKTSVEEALKELAAVKAKVAEESKPEVVHVPANLVLLKRDDILKEGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAARVNAGPYAGLAKNKDRKKKWTKADERNLKLLGQEVRHASTPAAITLNAMRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVSRHRAADDSDGGDAFTQVFLK
Sbjct:    1 MVLSWLLINLFLTSELLRAARIRKNSNRYQKNKGSDDSDPQPADTAPIPVGLEDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKELARSPCFAENVEEVRRRYKAVEEIWQSADVIPLNDPMDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFVQAPVRKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDEYIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRNRNPIGNTLHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDVGLGDTKTSVEEALKELAAVKAKVAEESKPEVVHVPANLVLLKRDDILKEGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAARVNAGPYAGLAKNKDRKKKWTKADERNLKLLGQEVRHASTPAAITLNAMRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVSRHRAADDSDGGDAFTQVFLK 892          
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|299115580|emb|CBN75783.1| (MutS protein homolog 1B MutS-like ATPases involved in mismatch repair, family 1 [Ectocarpus siliculosus])

HSP 1 Score: 615.535 bits (1586), Expect = 0.000e+0
Identity = 373/880 (42.39%), Postives = 522/880 (59.32%), Query Frame = 0
Query:   58 NLHERTWASMDLHVVQDRLAGLCDTVRAKELARSPCFAENVEEVRRRYKAVEEIWQ-SADVIPLNDPMDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFVQAPV-----------RKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDE----YIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRNRNPIGNTLHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDVGLGDTKTSVEEALKELAAVKAKVAEESK-PEVVHVPANLVLLKRDDILKEGE-VVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAARVNAGPYAG-LAKNK---------------DRKKKWTKADERNLKLLGQEVRHASTPAAITLNA-----------MRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVSRHRAADDSDGGDAFTQVFLK 892
            +L  RTW S+D  VV +RL+  C T   +  A  P F   +EEV   Y+ V E+   + D +PL   M + P +A AA G+ LE  ++ ++A AL  L  LR+F    V           R  +  +L+  A EI L   L+ LL  AFD  G LS  +FP + RLR + D L   IK+T+ +LM     S M+ DE    Y+++  GRFV+P+  TYKR  +GIVHD S TG+T+YVEP QV+ PTN++  ++L+L  E  RI+ +MT  +A  +D+ILQSL A A +D+++AR +LG   GG  +P+V  +G I   +ARHPVLLLR + P+GN++ +D +   L+LTGPNAGGKT+VLKTLGL+AL+ R GIP+PA  G RVD F P+LADIGD+QSVTGDLSTFSGHL+V + VLS A +G+LVLMDEMGSGTDP QG A+AQ+LLEAL++ G+RVA+TTHY QLKELA  D+RF V AM+F++G+PTYR  +GA+GES+AL+VAERLELPA V+ RA+GL+     +V+ELI +LED+R+ LQ +++ A  RE E       V  Q  +L     E  +++  AK + A  +  +L   EKKL+ M        T   +  ++ E+ A+K +V +E+  P        L  LK+ D + +GE VVVC G  +  +  G+V+  S ++V+V       +  + F  + L+R P          P  G LAK +               D  ++ T    R  + L  +V   S+      N            MRT  NT+D+RG+ L +A+   D F   GI      VY+LHGHGTG LK G+R ++ R+  V++ R A   DGGDA+T V LK
Sbjct:   37 DLFARTWESLDFGVVLERLSRECRTEMGRSRALIPDFKTTLEEVHELYERVNEVLLLAGDAVPLRAGMAVEPQLAIAAAGSTLEPTEIAAVASALEGLFELREFFCGAVADAKSPGGMVDRSGKTPRLAAVAAEIKLDEGLLGLLRGAFDSQGELSAQRFPEIGRLRSKADSLRQGIKSTMSRLMAGGEFSGMLADEGREAYVSEIAGRFVIPVTPTYKRT-VGIVHDSSRTGKTLYVEPTQVVGPTNELVEVKLQLKVETQRILSQMTLKIAEHEDEILQSLAAAAEVDLALARGRLGAKTGG-TIPKVMNEGTIKLVNARHPVLLLRGKAPVGNSMSLDASMQALILTGPNAGGKTVVLKTLGLVALMARAGIPIPAAPGARVDLFDPVLADIGDLQSVTGDLSTFSGHLVVAKAVLSGARAGSLVLMDEMGSGTDPMQGAALAQSLLEALVDAGSRVALTTHYTQLKELAATDERFGVSAMQFVDGRPTYRLIKGAVGESFALQVAERLELPAFVVERARGLLDDNTRQVSELISKLEDERNQLQDQLDRAAKRETE-------VLQQLKKLAEEREEAAELRSMAKRDAAKEYAVKLDANEKKLKQMFDKARSEPTTDVIGSSIGEIRALKKEVQKEAAVPTYTAQDLGLTPLKKRDRVAKGEKVVVCDGSSIGWE--GEVLSTSNRDVEVFLPSA--EASMRFAFSQLSRLPPGGVKWPAQSPSGGSLAKKRYPGGIVGPSGGAAAADTPRRKTGTSRRVAQYLEDDVGSFSSGDGDDNNKRKKKRSGDNYFMRTERNTLDLRGMTLSEAQGDCDMFFSNGIMDGSDGVYLLHGHGTGVLKAGIRRWLPRNSMVAKWRPASQEDGGDAYTVVELK 903          
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|397576299|gb|EJK50176.1| (hypothetical protein THAOC_30884 [Thalassiosira oceanica])

HSP 1 Score: 500.36 bits (1287), Expect = 5.922e-157
Identity = 342/965 (35.44%), Postives = 518/965 (53.68%), Query Frame = 0
Query:   32 NKGSDDSDPQPADTAPIPVGLEDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKELARS---------------------PCFAENVEEVRRRYKAVEEI-----------WQSA----------------------DVIPLNDP------MDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFVQA------------------PVRKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDE----YIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRNRNPIGNTLHIDPAK-PGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARRG--VRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDV-GLGDTKTSVEEALKELAAVKAKVAEESKPEVVHVP-----------ANLVL-----LKRDDILKEGE-VVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAARVNAGPYAGLAKNKDRKKKWTKADERNLKLLGQEVRHASTPAAITLNAMRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRH-PYVSRHRAADDSDGGDAFTQVFLK 892
            N  + + DP   +T    + +ED   ++H+RT  ++D  V+Q  LA  C+T  A+ L                        P  A +VE V  RY AV+E+           W +A                        + L  P      +DI   ++    G VLE P++  ++  + +   + D+  A                   +++E   QL    ++I +  EL++LL+ AFD++G+LSG  FP + RLR ++  L   I +++  ++    M   +  E     +++ NGR V+P++  Y  + +GIVHD S +G+T YVEP +V++PTN+++  E EL  EEA++  ++T  +   +++I +++ +   +D+  AR KLG  + G  VPEV  +G +SA DA+HP+LLLR    +G+ + I   K   ++LTGPNAGGKTI+LK LGL A++ R GIPVP ++    RVDFF P+LADIGD+QSV  DLSTFSGH+LVCREVL++A   ALVL+DE GSGTDP QGVAIAQALLEALL+ G RVAITTH+L LK+LA  DDRF V  M+FL  +PTY+   G IGES+AL VAERL+LPASVL RA GL+     ++ EL+++LE+Q+  ++   E  + +E E+   +  + +Q+ +L A+++        A+ + A  F  +L+EKEK LE +++ + G G TK  + ++  EL  VK +V  E++    +VP           AN+ L     LK  + ++ G+ VVVC         A   VK  G+ ++   S GG+ + ++ K  ++A  PS    +    P A L +        +    R+                    +M+T  NT+D  GL   +++ K  N         R VVYILHGHGTG LK  +RE++K    +V   + AD +DGGDA T+V LK
Sbjct:  247 NPDTPEQDPVDHNT----LNVEDQNIDMHQRTLDTLDYPVIQRALADECETQFARNLITKSMNTQVQSRDIQDTDADVLTMPITASSVEGVHSRYGAVQEMQRLMGGRITGFWSTARRNALKHKISSSSRGISNKKKVQRLSLGTPPIDGYTLDIESIMSIIDEGKVLEGPEILDVSSMMELCLDVLDWSDALEEWNRDNVGGEESEFEFELQQEPFVQLPSLVKQIHMDEELIELLATAFDEEGKLSGTTFPSIGRLRSKIRTLKRGILSSIESILALPSMRNKLAVESGGSLMSEINGRIVIPVQQQY--SSVGIVHDASRSGKTSYVEPSEVVQPTNELRSAESELRAEEAKVWRQLTESIVKHREEIERNVASLGQLDVVKARVKLGRRLDG-TVPEVKNEGVVSAIDAKHPILLLRGMEVVGSDVEIGQGKNQAMILTGPNAGGKTIILKLLGLFAMMARDGIPVPTKQSEKARVDFFEPVLADIGDIQSVDADLSTFSGHMLVCREVLNDARKDALVLLDEPGSGTDPNQGVAIAQALLEALLDRGCRVAITTHFLDLKQLASSDDRFAVAGMQFLGNRPTYKLIPGMIGESFALAVAERLKLPASVLDRANGLLDSETRKMGELLRDLEEQKQEVEKTSEALKKKEFEMLELKAEMRSQQEKLEAKQLN-------ARRDEAARFSKKLEEKEKILEGILERLQGSGATKKVIADSWTELRIVKREVMSEAE----NVPGSIRQLNQLDEANVELIPISELKGINKVEVGKSVVVCKKGAFYGKDA--TVKKLGKKLE--LSVGGMPVRLSLK--EIAFPPSSGRGQT---PVANLDEGGSSSVDASSGSTRDKG-----------------TSMKTKSNTVDCLGLNFEESKRKCINSFSKAAMGNRSVVYILHGHGTGVLKRKIREWLKNERQFVKSFKPADQADGGDALTRVELK 1167          
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|224010241|ref|XP_002294078.1| (predicted protein [Thalassiosira pseudonana CCMP1335] >gi|220970095|gb|EED88433.1| predicted protein [Thalassiosira pseudonana CCMP1335])

HSP 1 Score: 494.197 bits (1271), Expect = 1.215e-153
Identity = 345/953 (36.20%), Postives = 512/953 (53.73%), Query Frame = 0
Query:   53 EDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKELA------------RS----------------------PCFAENVEEVRRRYKAVEEI----------WQSADVI---------------PLNDP------MDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDF--------------VQAPVRKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDE----YIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRN-RNPIGNTLHID-PAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAR--RGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDV-GLGDTKTSVEEALKELAAVKAKVAEESKPEVVHVPANLVLLKR------DDILK-------------EGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAARVNAGPYAGLAKNKDRKKKWTKADERNLKLLGQE---VRHASTPAAITLN-AMRTSFNTIDVRGLRLRDAEAK-VDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKR-HPYVSRHRAADDSDGGDAFTQVFLK 892
            ED   ++H+RT  ++D  +V   LA  C TV  KE+             RS                      P  A +VE V RR+ A++E+          W ++                  PL  P       D  P       G VLE P++  +   L I   + D+                + +++E   +L    + I++  EL DLL++AFD +GRLSG  FP +  LR +V      I  ++  L+    M   +  E       + NGR V+P++  Y+   +GIVHD S +G+T YVEP +++ PTN++++ E EL  EEAR+  ++T  +   + +I +++ +   +D+ + R KLG  + G VVP V  +G +S +DARHP+LLLR     +G+ + I      GL+LTGPN+GGKT++LK LGL A +VR GIPVP++     RVDFF+PILADIGD+QSV GDLSTFSGH+LVCREVL+NA   ALVLMDE+GSGTDP QGVAIAQALLEALL+ G RVAITTHY+ LK+LA  DDRF V  M+F+ G+PTY+   G IGES+AL VAERL+LP SV+ RA  L+      + ELI  LEDQ+  +  + E+ + RE E+   +  ++ Q+  L A+++        A+ E A  F A+L+EKE+ LE +++ + G G +K  V ++  ++  +K     E+  +  +VP  +  LK+      DD L              + +V+VC            VVK  G+ + V    GG+ + +T K  +++  PS       AG   G+ K KD +++ +   +R+++ L  E   V  +ST  AI    +MR   NT++  G    +++ K +D F K  +   R VV+ILHGHGTG LK+ +R ++     +V   + AD +DGG+A T+V LK
Sbjct:  358 EDQHLDMHQRTLDTLDYPLVLRALANECGTVPGKEIVLDSLMKSGDASKRSKLTRKKKSSTDASDLDGDILTMPLTATSVEGVHRRFGALQEMQRLMEGRVSGWITSSRSQQSSDSNKKKRPQRKPLGAPPIEGYSFDFQPIFEIVDEGKVLEGPEILEVTTMLEIAMDVLDWRYALKEFNEEIDNEADSDLKQEPFVELVSLTDSIEIDDELFDLLTNAFDDEGRLSGTTFPFIGVLRAKVRTFKRDILASIDSLLAMPSMKNKLAVESGGALTMEINGRLVIPVQQKYQ--NIGIVHDASRSGKTTYVEPTEIVGPTNELRQAEAELRSEEARVWRQLTETIVKHRAEIERNVASIGQLDVVIGRVKLGKKLNG-VVPTVREEGVVSVKDARHPILLLRELEGVVGSDVEIGIDGNQGLILTGPNSGGKTVILKLLGLYAFMVRDGIPVPSKAYEPARVDFFTPILADIGDLQSVDGDLSTFSGHMLVCREVLNNAQENALVLMDELGSGTDPNQGVAIAQALLEALLDRGCRVAITTHYMDLKQLASTDDRFAVAGMQFVGGRPTYKLIPGMIGESFALAVAERLKLPQSVIQRANELLDTETRTMGELISSLEDQKLLVDQKQEELKKREFEMLELKAEMKRQQERLEAKQI-------NARREEAAKFAAKLEEKERLLEDILEKLKGSGASKKVVADSWTDIRIIK----REALSDAENVPGVMQRLKQQQGQVGDDELVPISEMKGVNKVNIDDKVIVCKKGAFYGKEG--VVKEVGKKISVAV--GGVSVRLTTK--EISFLPS-------AG---GVQKAKDPEEQQSSRAKRDMEYLTDEEVFVDVSSTATAIDKGVSMRMDANTVNCIGKNFEESKRKCIDAFSKATM-SNRSVVFILHGHGTGVLKKKIRSWLSTDRQWVKSFKPADQADGGEALTRVELK 1279          
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|1210523157|dbj|GAX18208.1| (DNA mismatch repair protein MutS2 [Fistulifera solaris])

HSP 1 Score: 482.641 bits (1241), Expect = 1.261e-151
Identity = 343/946 (36.26%), Postives = 503/946 (53.17%), Query Frame = 0
Query:   53 EDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKELAR-----------------------SPCFAENVEEVRRRYKAVEEIWQ----------------SADVIPL-NDPMDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFV----QAPVRKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDE-----YIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLL-------------RNRNP-------------IGNTLHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARR-GVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDV-----------GLGDTKTSVEEALKELAAVKAKVA--EESKPEVVHVPANLVLL----KRDDILKEGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETA---ARVNA--GPYAGLAKNKDRKKKWTKADERNLKLL------GQEVRHASTPAAI-TLNAMRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQ-LKEGLREFMKRHPYVSRHRAADDSDGGDAFTQVFLK 892
            ED   +L++R+W ++D   +   L   C TV A+++ +                       +   A  VE  + RY+AV E+ +                   V PL    +++AP +    R  +LE PDL  I   L +L  + DF+    +   +   L  L++ A  I L     +LL +A D  GRLSG  FP + RLR  +  L   I + +  L+++  +   ++ +     Y     GR V+P++ T     +GIVHD S +GQT YVEP +++ PTN++K++E EL  EEARI   +T  V ++++ +  S+ A A +D+ +AR +LG+   GA+ P V   G IS  +A+HPVLLL             R+R+              +G+ + +     GLVLTGPN+GGKTI+LK LGL AL+ R GIP+P +    RVDFF PILADIGD+QSV GDLSTFSGH+LVC+ VL  A   ALVLMDE+GSGTDPAQGVAIAQALLEALLETG+RVAITTHY QLK+LA  D+RF V  M+F+ G+PTY+   G +GES+AL VAER+ LP SVL RA  L+     ++ +LI+ELEDQ+  L+ +  +   ++ E       +E  + +L    + L+K     + E A AF  +L+EKE+ LE +++ +              D K    +AL E   V + +A  E    E+    A+LV L     + +++   +++VC    +    A  +   SG+   V  S  G++  ++FK  ++A  PS T    A+ N   GP  G       +   +KA ER L +         + +  +TP  + +  A+RT  NTIDVRG  L  A++K ++     +   R VVYILHG+GTG  L+  +R ++K    V     A   DGGDAFT+V L+
Sbjct:  122 EDKELDLYQRSWDTLDFEPILQALQDECLTVPARKIVQHAIKVDTPQQKKSKADERHYDSSNSLMATTVEGCQERYRAVHELRRLLTSSKTFRNRNGKQAPLSVFPLAGHALNLAPLLEDTTR--LLEGPDLYDI---LSVLNVVEDFMLWNQELKEQHPELEYLNRMASNITLNTTFHELLQNALDDKGRLSGTTFPVVGRLRARLRALKSDILSRLETLLETPSIKTKLSLQSGGPLYSQVSGGRLVIPVE-TSSANQIGIVHDSSRSGQTSYVEPTEIVGPTNELKQVESELRAEEARIWRSLTAQVQLNREGLESSIQAMAQLDLVMARLRLGESWQGAI-PAVEDKGVISLRNAKHPVLLLKAMRKRKKTKLSIRSRSGGAQEDMTNAVNDIVGSDIDLGDRHQGLVLTGPNSGGKTIILKMLGLAALMARSGIPIPCKDDNPRVDFFDPILADIGDLQSVGGDLSTFSGHMLVCKAVLDQAGKNALVLMDEVGSGTDPAQGVAIAQALLEALLETGSRVAITTHYTQLKQLAVADERFAVAGMQFVRGRPTYKLLPGTVGESFALSVAERVGLPLSVLERANELLDSETRQMGDLIRELEDQKATLEEQAAELEEKKRE-------IEQIQFKLKEENLRLEKKMLNVRREEAKAFTKKLEEKEQVLEEILRKLKSDPSGKVVAKSWQDIKFVKRDALNEAENVPSVIARKEAEIKEMNEASADLVPLIEMRDKPNLVPGDKLIVCKKGAMFGREASFIKSLSGR---VEVSVNGMN--VSFKLAEVALPPSSTTTIIAKYNKSRGPQKG------SQSSISKAAERALDIESTSSGGNSKTKTQTTPEPVRSTVAIRTQSNTIDVRGCTLEQAKSKAESAFSSCLMSNRSVVYILHGYGTGGILRNKVRNWLKTSNVVKEWAPASAEDGGDAFTRVVLR 1042          
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|1210512179|dbj|GAX28752.1| (DNA mismatch repair protein MutS2 [Fistulifera solaris])

HSP 1 Score: 480.33 bits (1235), Expect = 1.034e-150
Identity = 345/946 (36.47%), Postives = 497/946 (52.54%), Query Frame = 0
Query:   53 EDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKELAR-----------------------SPCFAENVEEVRRRYKAVEEI----------------WQSADVIPL-NDPMDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFV----QAPVRKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDE-----YIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLL---RNRNP------------------------IGNTLHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARR-GVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDV-----------GLGDTKTSVEEALKELAAVKAKVA--EESKPEVVHVPANLVLL----KRDDILKEGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETA--ARVNA--GPYAGLAKNKDRKKKWTKADERNLKLLGQ------EVRHASTPAAI-TLNAMRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQ-LKEGLREFMKRHPYVSRHRAADDSDGGDAFTQVFLK 892
            ED   +L +R+W ++D   +   L   C TV A++L +                       S   A  VE  + RY+AV E+                     V PL    +++AP +    R  +LE PDL  I   L +L  + DF+    +   +   L  L++ A  I L     +LL +A D  GRLSG  FP + RLR  +  L   I   +  L+K+  +   ++ +     Y     GR V+P++ T     +GIVHD S +GQT YVEP +++ PTN++K++E EL  EEARI   +T  V ++++ +  S+ A A +D+ +AR +LG+   G  +P V  +G IS  +A+HPVLLL   R R                          +G+ + +     GLVLTGPN+GGKTI+LK LGL AL+ R GIP+P      RVDFF PILADIGD+QSV GDLSTFSGH+LVC+ VL  A   ALVLMDE+GSGTDPAQGVAIAQALLEALLETG+RVAITTHY QLK+LA  D+RF V  M+F+ G+PTY+   G +GES+AL VAER+ LP SVL RA  L+     ++ +LI+ELEDQ+  L+ +  +   ++ E       +E  + +L    + L+K     + E A AF  +L+EKE+ LE +++ +              D K    +AL E   V + +A  E    E+    A+LV L     + +++   +++VC    +    A  +   SG+   V  S  G++  ++FK  ++A  PS T   A+ N   GP  G       +   +KA ER L +         +V+  + P  + +  A+RT  NTIDVRG  L  A++K ++     +   R VVYILHG+GTG  L+  +R ++K    V     A   DGGDAFT+V L+
Sbjct:  122 EDKELDLFQRSWDTLDFEPILRALQDECLTVPARKLVQQAIKVDTPQQEKSKQDERQTNRNSSLMATTVEGCQERYRAVHELRTLLTSSKTFRNRNGKQAPLSVFPLAGHSLNLAPLLEDTTR--LLEGPDLYDI---LSVLNVMEDFILWNQELKEQHAELEHLNRMASSITLNTTFHELLQNALDDKGRLSGTTFPAVGRLRARLRALKSDILLRLETLLKTPSVKAKLSLQSGGPLYSQVSGGRLVIPVE-TSSANKIGIVHDSSRSGQTSYVEPTEIVGPTNELKQVESELRAEEARIWRSLTAQVQLNREGLELSIQAMAQLDLVMARLRLGESWEG-TIPVVEDNGVISLRNAKHPVLLLKAMRKRKKSKLSILSTKSGGNQEDMNDAMDEVVGSDIDLGGRHQGLVLTGPNSGGKTIILKMLGLAALMARSGIPIPCEDDSPRVDFFDPILADIGDLQSVGGDLSTFSGHMLVCKAVLDQAGKNALVLMDEVGSGTDPAQGVAIAQALLEALLETGSRVAITTHYTQLKQLAVADERFAVAGMQFVRGRPTYKLLPGTVGESFALSVAERVGLPLSVLERANELLDSETRQMGDLIRELEDQKATLEEQAAELEEKKRE-------IEQIQFKLKEENLRLEKKMLNVRREEAKAFTKKLEEKEQVLEEILRKLKSDPSGKVVAKSWQDIKFVKRDALNEAENVPSVIARKEAEIKEMNEASADLVPLIEMRDKPNLIPGDKLIVCKKGAMFGREASFIKSLSGR---VEVSVNGMN--VSFKLAEVALPPSATTRIAKYNKSRGPQKG------SQSSISKAAERALDIESTSSGGNAKVKAQTPPEPVRSAVAIRTQSNTIDVRGCTLEQAKSKAESAFSSCLMSNRSVVYILHGYGTGGILRNKVRNWLKTSSAVKEWAPASAEDGGDAFTRVVLR 1042          
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|219123265|ref|XP_002181948.1| (predicted protein [Phaeodactylum tricornutum CCAP 1055/1] >gi|217406549|gb|EEC46488.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1])

HSP 1 Score: 460.685 bits (1184), Expect = 9.440e-143
Identity = 334/899 (37.15%), Postives = 485/899 (53.95%), Query Frame = 0
Query:   84 RAKELARSPCFAENVEEVRRRYKAVEEI-W---------QSADVIPLN-----------------DPMDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFVQ-APVRKERLSQ------LSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDE-----YIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRN-RNPIGNTLHIDP-AKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARR---------------------GVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDVGLGDTKTSVEEALKELAAVKAKVAEESK--PEVV--HVPANLVL------------LKRDDILKEGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVIT--------FKKTDLARAPSETA--ARVNAGPYAGLAKNKDRKKKWTKADERNLKLLGQEVRHASTPAAITLNAMRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQ-LKEGLREFM-KRHPYVSRHRAADDSDGGDAFTQVFLK 892
            +  E A  P  A+ V   + RY+AV+E+ W           AD    N                 +  D+   +A A +G VLE  ++  +++ L  +  +R +     +   RL Q      L + A  I +   L DLL +AFD+D RLSG  FP L RLR  V  L   I  T+  L+    +   +  E     Y     GR VLP+   Y  + +GIVHD S +G+TVYVEP +++ PTN++++ E EL  EEAR+   +T  +  ++  +  S+ A   +D+ +AR  LG  + G  +P V  +G I   +A+HPVLLLR  +N +G+ + +      GLVLTGPN+GGKT++LK LGL+AL+ R GIPVPA R                       R+DFF+P+LADIGD+QSV GDLSTFSGH+LVCREVL+N+   ALVLMDE+GSGTDPAQGVAIAQALLEA+LETGARVAITTHY+QLK+LA  DDRF+V  M+F+ G+PTY+   G +GES+AL VAERL LP SV+ RA+ LM     ++ +LI+ELEDQ+  +  ++        EL   ++ +   + EL  + + L+K +   + E A  F  +L+EKE+ LE +++ +    T+  + ++  ++  VK     E++  P +V     AN VL            L+    LKEG+ V+         R   +VK+ G  V+V+ +   + + +T        F+ T     P+ T    R++ G  A  A   +R      A          +    S P+      MRT+ NT+DVRG  L +A+ ++ +     +   R VVY+LHGHGTG  LK  LR+++ K    V   + AD +DGGDAFT+V L+
Sbjct:  204 KGSERAFQPLTADTVLGTQERYRAVQELEWILQGGSGQINLADYSYRNRKSYKETLAGKPPPLGGNAFDLLAILAVAEQGKVLEGEEIFDVSQMLDRMQDVRLWSDDGLLNVNRLQQDIEFVELPKLASCIQVNTTLQDLLHNAFDKDDRLSGTTFPVLGRLRARVRSLKADIMGTLDSLLALPSIKNKLALESGGPIYSEVNGGRLVLPVAQKYA-SSVGIVHDTSRSGKTVYVEPTELVGPTNELRQAEGELRAEEARVWRSLTEQILKNQIVLETSVRAIGQLDLVMARLLLGRKLSG-TIPVVQDEGVIQLRNAKHPVLLLRQVKNVVGSDVDLGADGNQGLVLTGPNSGGKTVILKLLGLMALMSRGGIPVPADRPRVAVGAKSYGDEYDSNNDEFQPRIDFFNPVLADIGDIQSVGGDLSTFSGHMLVCREVLANSGRNALVLMDELGSGTDPAQGVAIAQALLEAILETGARVAITTHYMQLKQLAASDDRFSVAGMQFVQGRPTYKLLPGTVGESFALAVAERLNLPQSVIDRAEALMDSETRQLGDLIRELEDQKGLVDQQV-------LELEEKRQEIGKMRFELKEQGLRLEKKQLTVRREEARKFAKKLEEKEQVLENVLEKLKADPTRRVLAKSWDDIKFVKRDALNEAENIPSIVARKKKANAVLAAEQGELIPIAELRERPELKEGDKVIVCKQGPVFGREATIVKSLGSRVEVLVNNMNVGLKLTQVALPTASFRSTS---GPANTWGDGRLSIGRAAERALATER-----CAGPSTSSSSSSDTVAVSAPSKSRGVTMRTTSNTVDVRGCNLEEAKDRIRSAFSASLLAGRSVVYVLHGHGTGGVLKSKLRQWLPKEKTLVDSFQGADAADGGDAFTRVQLR 1085          
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|873240756|emb|CEL93116.1| (unnamed protein product [Vitrella brassicaformis CCMP3155])

HSP 1 Score: 442.195 bits (1136), Expect = 5.501e-139
Identity = 291/735 (39.59%), Postives = 422/735 (57.41%), Query Frame = 0
Query:   40 PQPADT-API------------PVGLEDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKELARSPCFAENVEEVRRRYKAVEEIWQSADVIPLNDPMDIAPAVAFAARG---NVLELPDLRSIAKALIILATLRDFVQAP-------VRKER---LSQLSQYAEEIDLPFELVDLLSDAFD--QDGR-LSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTD----EYIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRNRNPIGNTLHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDVGLGDTKTSVEEALKELAAVKAKVAEESKPEVVHVPANLVLLKRDDILKEGEVVVCAGDQLNSDRAGKVVKASGQN-VDVMFSKGG 740
            P PA T AP+            P+  + +  +L ERTW S+D  VV   LA    T   + LARS  FA + +E    Y+ V ++ + ++ +P    +DI   V  +  G   +   LPDL  I  AL  +  L+ ++ A         ++E    +  L +  E + L  EL DL + AF+  +D   LSG +FP L+RLR+ + RL  S++  + Q+ +   +   + D    ++     GRFV+ ++  Y R GLGI HD S +G+TVY+EP +++EPTN++    + L  E ARI  +M+ +V      I  +++    +D++ AR  LG+ +GG  +P VGT+G I  + +RHPVL LR   P  N + +      LVLTGPNAGGKT+VLKTLGL AL VR G+PVPA  G RVD+F+PILADIGD+Q+VTGD+STFSGHLLV + VL  A  GALVLMDEMG+GTDP+QG A+AQALLE L+++G +         LKELA  D RF +GAME++ G+PTYR + G +GES AL+VAERL LP+SVL RA+ L+     R+TEL+KELE ++       E  R+ ++EL+   +  EA + EL   + ++++ K     ETA   LA L+ +E +L +++  +   +     E AL+E   +K  +A+E   +   +P   V   + D  K  EV V  G  L  +R GK VK  G+  V V  +KGG
Sbjct:   27 PHPARTRAPLDRRWHAMAMSALPMSDDALHLDLFERTWQSLDWRVVMASLAEAASTSPGRRLARSASFAGSHDECLVLYEQVGDVRRLSEQLPFATSLDIEHLVNVSDEGRARSSFSLPDLYRIGTALDDVTHLKGWLSAAHDRLVAAAKRESDRGIGSLLELMEPVQLDSELTDLFNGAFEGSEDSPVLSGNRFPELRRLRESIARLEMSLQARIEQIARRPELQPKLADGSGGKWSRTDTGRFVIAVQRRY-RKGLGISHDFSGSGKTVYLEPAELVEPTNELMEARMSLRSEGARICSDMSWMVTRHAVAIANAVECAGRVDLAQARYLLGEKIGG-TIPTVGTEGKIHIDQSRHPVLALRGVEPTANDISLGFDYDALVLTGPNAGGKTVVLKTLGLFALFVRYGLPVPAMDGARVDWFNPILADIGDLQTVTGDVSTFSGHLLVSKAVLERAGRGALVLMDEMGTGTDPSQGAALAQALLETLVDSGCK---------LKELAASDRRFRIGAMEWMQGRPTYRLKLGMVGESLALDVAERLRLPSSVLDRARLLLDEDTRRLTELVKELEHEK-------ESVRSLKQELKRETKRTEALQRELEVAKEQVEQSKEEVLFETAKIRLATLESEEGELRSLVSRLKEEEDLKLAEAALREAERLKL-IAKEKVDKGRKLPTGWVKAAKVD--KGDEVYVLKG-PLAGNR-GKAVKRGGRGVVTVRVNKGG 738          
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|551655283|ref|XP_005830416.1| (hypothetical protein GUITHDRAFT_110559 [Guillardia theta CCMP2712] >gi|428174541|gb|EKX43436.1| hypothetical protein GUITHDRAFT_110559 [Guillardia theta CCMP2712])

HSP 1 Score: 400.979 bits (1029), Expect = 4.280e-122
Identity = 286/846 (33.81%), Postives = 443/846 (52.36%), Query Frame = 0
Query:   51 GLEDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKE-LARSPCFAENVEEVRRRYKAVEEIWQSAD---VIPLNDPMDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFVQAPVRKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDEYIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRNRNPIGNTLHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEELRAAQEAVEAQKVELGAREVELDKMKYRAKLETADAFLAQLQEKEKKLEAMMKDVGLGDTKTSVEEALKELAAVKAKVAEESKPEVVHVPANLVLLKRDDILKEGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAARVNAGPYAGLAKNKDRKKKWTKADERNLKLLGQEVRHASTPAAITLNAMRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVSRHRAADDSDGGDAFTQVFLK 892
            G ED + +L  +T  S+D   ++  L     T   +  L ++  F E VEEV R Y AVEEI    D    +PL+   D+   V  AA+G+VLEL +L    K +  +  + D +     +     L   A++I L   +V  L  +FD  G+LS   +P L+ LR E+D++  ++ +T+  ++K T ++  + D +   R  RFVLP+  T K    GIVH  S TG TVY+EP +VI+  N ++  E EL  EE RI+G +++ V      +  +  A   +D++ AR K  +++  AV PEV + G I     RHPVL+LR   P+ N + ++  KP +V++GPNAGGKTIVLKT+GL ALLV+ G  VP   G ++  F  +LA IGD Q+V  DLS+FS HL     +L +A  G L+L+DE+ SGTDP QG A+AQA+LE LL    ++ +TTHY QLK LA  D RF V AM+++NG PTYR   G  GES+A  +A+++ +   V+ RA+ LMG  A ++T+ ++ LE++R       ++A    E+LR   E +E ++ E+ +R  EL+K       E A  FL QL+  E+ +  ++K +        V++A K L  ++  +  + +     VP           +KEG+ V+     L+    G+V+       ++    G L   +  K   + +   +TA+   +   +G    K  KK  +K  +  L+                  A+RT  NT+D+RG R       +D+F+       +   +IL GHGTG +K+ ++E +    Y + +  A    GGDA T V LK
Sbjct:  100 GTEDALESLRRKTEESLDWKFLKATLVNCSVTSMGRSALQQARPFKE-VEEVERAYNAVEEIRMLNDDGTRLPLSQVGDVRELVTRAAKGDVLELDELYLCTKTMGAMREIEDVLHG---RNETPTLMDIADDIHLDGSVVLQLKRSFDNVGQLSTKMYPQLQDLRKEIDKIAAAVTSTMDAMLKDTKIASTLQDSFYTIRENRFVLPVSATNKNKINGIVHGVSGTGSTVYIEPQEVIDLNNKLRLAEGELKAEEIRIMGLLSKKVGSLARDVKLATSAVCQLDMAAAREKFAEMLK-AVRPEVSSGGEIDIRSGRHPVLVLRGIKPVANDMSMNGEKPAVVISGPNAGGKTIVLKTVGLCALLVQHGCWVPCEEGSKMALFRRVLASIGDQQTVEEDLSSFSSHLKTLNTMLQHADEGTLILLDEIASGTDPTQGAALAQAILEELLGKAPKMVVTTHYSQLKALATVDSRFGVAAMQYVNGAPTYRVLHGVSGESHAFSIAKKMGILEGVIERAESLMGEQA-KMTKTLEALEEERTRASVAAQEAVEEREKLRRKLERLEKREEEIRSRAKELEK-------EGAREFLKQLKSAEQSVAEVIKQLQQNPDFKEVDKAKKLLDGLRTNLTAQEEEGAGEVPEG---------IKEGDFVML----LDVGSEGEVISPPSSKGELQVRVGPL--TLRTKVDRVRKVEGKTASSSPSPRASGSLTQKRGKKTGSKEYKAALQ-----------------RAVRTPVNTLDLRGFRAEQVGDAIDSFLDKMTSANQPTAFILSGHGTGVVKKVVQEHLATCMYAAAYAPASFEQGGDALTVVALK 900          
BLAST of EWM26231.1 vs. NCBI_GenBank
Match: gi|1072237604|gb|OEU22331.1| (P-loop containing nucleoside triphosphate hydrolase protein, partial [Fragilariopsis cylindrus CCMP1102])

HSP 1 Score: 363.229 bits (931), Expect = 8.750e-113
Identity = 218/453 (48.12%), Postives = 296/453 (65.34%), Query Frame = 0
Query:  172 QLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDEY-------IAQRNG--RFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRNRNPIGNT---LHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARRG-------VRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASG----ALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGEMEDARAREEE 601
            ++    + I L   L +LL +AFD +G+LSG  FP L +LR +V  +   I  T+  +++   +   +  E        +A  NG  R VLPI   Y  A LGIVHD S +G+TVYVEP +++ PTND++ +E +L  EEAR+   +T  V  ++  +  S+ A A +D+ VAR  LG  + G V+P+V  +G IS  +A+HPVLLLR    +  +   L +D  K GLVLTGPN+GGKT++LK LGLLAL+ R GIP+PA  G        RVDFF P+LADIGD+QSV  DLSTFSGH+ +CREVL+    G    +LVLMDE+GSGTDP QGVAIAQALLEALL+TG RV ITTHY+ LK+LA  DDRF+VG M+F+ G+PTY+   G +GESYAL VAERL+LP +VL RA  L+     ++ +LI +LEDQ+  +  ++ +   R++E
Sbjct:   24 EIPHIVDGIKLNTTLQNLLEEAFDDEGKLSGKTFPFLGQLRAKVRTMKADILQTLDSIVQLPSIKSKLALESGGPLISEVASSNGAGRLVLPINPKYASA-LGIVHDSSRSGKTVYVEPSEIVGPTNDLRVVERDLEAEEARVWRLLTEQVWNNQRDLRASVQAVAQLDLCVARYTLGQRLEG-VIPDVQDEGIISLRNAKHPVLLLRKMEKVVGSDISLGVD-GKQGLVLTGPNSGGKTLILKLLGLLALMSRSGIPIPAEHGDMDGVYLPRVDFFDPVLADIGDIQSVDSDLSTFSGHMYICREVLALTKGGNGKNSLVLMDELGSGTDPNQGVAIAQALLEALLDTGCRVVITTHYMALKQLASSDDRFSVGGMQFVGGRPTYKLLPGVVGESYALAVAERLQLPQTVLDRASELLDSETRQMGDLISDLEDQKLLIDEQVVEIEERKKE 473          
The following BLAST results are available for this feature:
BLAST of EWM26231.1 vs. NCBI_GenBank
Analysis Date: 2020-04-07 (BLAST analysis for N. gaditana B-31)
Total hits: 10
Match NameE-valueIdentityDescription
gi|585108230|gb|EWM26231.1|0.000e+0100.00dna mismatch repair protein [Nannochloropsis gadit... [more]
gi|299115580|emb|CBN75783.1|0.000e+042.39MutS protein homolog 1B MutS-like ATPases involved... [more]
gi|397576299|gb|EJK50176.1|5.922e-15735.44hypothetical protein THAOC_30884 [Thalassiosira oc... [more]
gi|224010241|ref|XP_002294078.1|1.215e-15336.20predicted protein [Thalassiosira pseudonana CCMP13... [more]
gi|1210523157|dbj|GAX18208.1|1.261e-15136.26DNA mismatch repair protein MutS2 [Fistulifera sol... [more]
gi|1210512179|dbj|GAX28752.1|1.034e-15036.47DNA mismatch repair protein MutS2 [Fistulifera sol... [more]
gi|219123265|ref|XP_002181948.1|9.440e-14337.15predicted protein [Phaeodactylum tricornutum CCAP ... [more]
gi|873240756|emb|CEL93116.1|5.501e-13939.59unnamed protein product [Vitrella brassicaformis C... [more]
gi|551655283|ref|XP_005830416.1|4.280e-12233.81hypothetical protein GUITHDRAFT_110559 [Guillardia... [more]
gi|1072237604|gb|OEU22331.1|8.750e-11348.12P-loop containing nucleoside triphosphate hydrolas... [more]
back to top
Relationships

This CDS is a part of the following mRNA feature(s):

Feature NameUnique NameSpeciesType
rna4437rna4437Nannochloropsis gaditana (N. gaditana B-31)mRNA


Sequences
Synonyms
Publications