NO06G02300, NO06G02300 (gene) Nannochloropsis oceanica

Overview
Name: NO06G02300
Unique Name: NO06G02300
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 3381 bp
Alignment location: chr6:666880..670260 -
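
The reported sequence length is consistent with the alignment location if the coordinates are read as 1-based and inclusive. That convention is an assumption here, since the page does not state it. A minimal Python sketch of the arithmetic follows; `parse_location` and `feature_length` are hypothetical helper names used only for illustration, not part of any database API.

```python
import re

def parse_location(loc):
    """Split a location string such as 'chr6:666880..670260 -' into its parts."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)\s*([+-])?", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    # The strand is None when the string carries no trailing '+' or '-'.
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def feature_length(start, end):
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr6:666880..670260 -")
print(seqid, strand, feature_length(start, end))  # chr6 - 3381
```

The trailing `-` places the gene on the reverse strand of chr6.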

Properties
Property Name  Value
Description    DNA mismatch repair protein
Alignments
The following features are aligned
Aligned Feature  Feature Type  Alignment Location
chr6             genome        chr6:666880..670260 -
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                   Date Performed
GSE178672                                       2023-12-15
PRJNA933693                                     2023-10-26
PRJNA943449                                     2023-07-19
GSE149904                                       2020-10-10
NoIMET1_WT_HS                                   2020-09-29
GSE139615                                       2020-03-11
PRJNA241382                                     2020-03-11
BLASTP analysis of N. oceanica IMET1 genes      2019-07-11
InterPro analysis for N. oceanica IMET1 genes   2019-07-11
GO annotation for IMET1v2 genes                 2019-07-10
PRJNA182180                                     2019-04-25
Gene prediction for N. oceanica IMET1           2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term        Definition
GO:0016887  ATPase activity
GO:0004519  endonuclease activity
GO:0030983  mismatched DNA binding
GO:0005524  ATP binding
GO:0004518  nuclease activity
GO:0003677  DNA binding
GO:0000166  nucleotide binding
Vocabulary: Biological Process
Term        Definition
GO:0045910  negative regulation of DNA recombination
GO:0006298  mismatch repair
GO:0006950  response to stress
GO:0006259  DNA metabolic process
Vocabulary: INTERPRO
Term       Definition
IPR027417  P-loop_NTPase
IPR036063  Smr_dom_sf
IPR036187  DNA_mismatch_repair_MutS_sf
IPR005747  MutS2
IPR000432  DNA_mismatch_repair_MutS_C
IPR007696  DNA_mismatch_repair_MutS_core
IPR002625  Smr/MutS2_C
Homology
BLAST of NO06G02300 vs. NCBI_GenBank
Match: EWM26231.1 (dna mismatch repair protein [Nannochloropsis gaditana])

HSP 1 Score: 1349.3 bits (3491), Expect = 0.000e+0
Identity = 722/849 (85.04%), Positives = 790/849 (93.05%), Query Frame = 0
Query:  121 DGAANPVGLVDIVSNLHERTWASLDLHVVQEQLASLCDTVRAKDLARSSCFAEDVHEVHRRYKAVEEVWQSVEPIPLNDPMDIAPAVNFASRGNTLELPDLRSIAKALILLATLRDFMQAPQRAERVPQLAAYSEGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITDEYIAQRNGRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRGRNPVGNNIHIEKDKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQRDALAGERREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKLQAADTFLAQLQEKEKKLEGLLKGVGLGDEKATVEEALRELAAVRGQVAVDSKPEMVHVPSNIVLFKRDDIVREGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAALVNAGPYAGLAKNKEKKKKWTKADERNLKMLGKEMNTVSSATAVTLNAMRTSFNTIDVRGLRLRDAESKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVGRHRSADESDGGDAFTQVFLK 970
            D A  PVGL DIV NLHERTWAS+DLHVVQ++LA LCDTVRAK+LARS CFAE+V EV RRYKAVEE+WQS + IPLNDPMDIAPAV FA+RGN LELPDLRSIAKALI+LATLRDF+QAP R ER+ QL+ Y+E ID+P+ELV+LL+DAFD+DGRLSG KFP LKRLR+EVDRLYGSI+NTVGQLMKSTGMS M+TDEYIAQRNGRFVLPIKNTYKR+GLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRL+A+SKD+IL SLDA AL+DISVARAKLGD++GGAVVPEVGT+GCISAEDARHPVL+LR RNP+GN +HI+  KPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPA+RGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVC+EVLS AASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHY+QLKELAQKDDRF VGAMEFL+GKPTYRF+EGA+GESYALEVAERLELP++VL RA+GLM  G ++VTELIKELE+QRDAL G     XXXXXXXXXXXXXXXXXXXXXXXXXXXXXX    AKL+ AD FLAQLQEKEKKLE ++K VGLGD K +VEEAL+ELAAV+ +VA +SKPE+VHVP+N+VL KRDDI++EGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAA VNAGPYAGLAKNK++KKKWTKADERNLK+LG+E+   S+  A+TLNAMRTSFNTIDVRGLRLRDAE+KVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYV RHR+AD+SDGGDAFTQVFLK
Sbjct:   44 DTAPIPVGLEDIVRNLHERTWASMDLHVVQDRLAGLCDTVRAKELARSPCFAENVEEVRRRYKAVEEIWQSADVIPLNDPMDIAPAVAFAARGNVLELPDLRSIAKALIILATLRDFVQAPVRKERLSQLSQYAEEIDLPFELVDLLSDAFDQDGRLSGVKFPHLKRLRDEVDRLYGSIKNTVGQLMKSTGMSQMVTDEYIAQRNGRFVLPIKNTYKRAGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLVAVSKDKILQSLDAGALIDISVARAKLGDLMGGAVVPEVGTDGCISAEDARHPVLLLRNRNPIGNTLHIDPAKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPARRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCREVLSNAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYLQLKELAQKDDRFTVGAMEFLNGKPTYRFREGAIGESYALEVAERLELPASVLARAKGLMGGGALRVTELIKELEDQRDALQGXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXMKYRAKLETADAFLAQLQEKEKKLEAMMKDVGLGDTKTSVEEALKELAAVKAKVAEESKPEVVHVPANLVLLKRDDILKEGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAARVNAGPYAGLAKNKDRKKKWTKADERNLKLLGQEVRHASTPAAITLNAMRTSFNTIDVRGLRLRDAEAKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVSRHRAADDSDGGDAFTQVFLK 892          
BLAST of NO06G02300 vs. NCBI_GenBank
Match: CBN75783.1 (MutS protein homolog 1B MutS-like ATPases involved in mismatch repair, family 1 [Ectocarpus siliculosus])

HSP 1 Score: 565.8 bits (1457), Expect = 2.500e-157
Identity = 378/880 (42.95%), Positives = 517/880 (58.75%), Query Frame = 0
Query:  135 NLHERTWASLDLHVVQEQLASLCDTVRAKDLARSSCFAEDVHEVHRRYKAVEEV-WQSVEPIPLNDPMDIAPAVNFASRGNTLELPDLRSIAKALILLATLRDFM-------QAP----QRAERVPQLAAYSEGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITDE----YIAQRNGRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRGRNPVGNNIHIEKDKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQRDALAGERREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKLQAADTFLAQLQEKEKKLEGLLKGVGLGDEKATVEEALRELAAVRGQVAVDSK-PEMVHVPSNIVLFKRDDIVREGE-VVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAALVNAGPYAG-LAKNK--------------------------EKKKKWTKADERNLKMLGKEMNTVSSATAVTLNAMRTSFNTIDVRGLRLRDAESKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVGRHRSADESDGGDAFTQVFLK 970
            +L  RTW SLD  VV E+L+  C T   +  A    F   + EVH  Y+ V EV   + + +PL   M + P +  A+ G+TLE  ++ ++A AL  L  LR+F        ++P     R+ + P+LAA +  I +   L+ LL  AFD  G LS  +FP++ RLR + D L   I++T+ +LM     S M+ DE    Y+++  GRFV+P+  TYKR+ +GIVHD S TG+T+YVEP QV+ PTN++  ++L+L  E  RI+ +MT  IA  +D IL SL AAA VD+++AR +LG   GG  +P+V  EG I   +ARHPVL+LRG+ PVGN++ ++     L+LTGPNAGGKT+VLKTLGL+AL+ R GIP+PA  G RVD F P+LADIGD+QSVTGDLSTFSGHL+V K VLS A +G+LVLMDEMGSGTDP QG A+AQ+LLEAL++ G+RVA+TTHY QLKELA  D+RF V AM+F+DG+PTYR  +GAVGES+AL+VAERLELP+ V+ERARGL+D    +V+ELI +LE++R+ L      A        XXXXXXXXXXXXXXXXXXX       AK  AA  +  +L   EKKL+ +            +  ++ E+ A++ +V  ++  P        +   K+ D V +GE VVVC G  +  +  G+V+  S ++V+V       +  + F  + L+R P          P  G LAK +                                                            MRT  NT+D+RG+ L +A+   D F   GI      VY+LHGHGTG LK G+R ++ R+  V + R A + DGGDA+T V LK
Sbjct:   37 DLFARTWESLDFGVVLERLSRECRTEMGRSRALIPDFKTTLEEVHELYERVNEVLLLAGDAVPLRAGMAVEPQLAIAAAGSTLEPTEIAAVASALEGLFELREFFCGAVADAKSPGGMVDRSGKTPRLAAVAAEIKLDEGLLGLLRGAFDSQGELSAQRFPEIGRLRSKADSLRQGIKSTMSRLMAGGEFSGMLADEGREAYVSEIAGRFVIPVTPTYKRT-VGIVHDSSRTGKTLYVEPTQVVGPTNELVEVKLQLKVETQRILSQMTLKIAEHEDEILQSLAAAAEVDLALARGRLGAKTGG-TIPKVMNEGTIKLVNARHPVLLLRGKAPVGNSMSLDASMQALILTGPNAGGKTVVLKTLGLVALMARAGIPIPAAPGARVDLFDPVLADIGDLQSVTGDLSTFSGHLVVAKAVLSGARAGSLVLMDEMGSGTDPMQGAALAQSLLEALVDAGSRVALTTHYTQLKELAATDERFGVSAMQFVDGRPTYRLIKGAVGESFALQVAERLELPAFVVERARGLLDDNTRQVSELISKLEDERNQLQDXXXRA-------AXXXXXXXXXXXXXXXXXXXAAELRSMAKRDAAKEYAVKLDANEKKLKQMFDKARSEPTTDVIGSSIGEIRALKKEVQKEAAVPTYTAQDLGLTPLKKRDRVAKGEKVVVCDGSSIGWE--GEVLSTSNRDVEVFLPSA--EASMRFAFSQLSRLPPGGVKWPAQSPSGGSLAKKRYPGGIVGPSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXNYFMRTERNTLDLRGMTLSEAQGDCDMFFSNGIMDGSDGVYLLHGHGTGVLKAGIRRWLPRNSMVAKWRPASQEDGGDAYTVVELK 903          
BLAST of NO06G02300 vs. NCBI_GenBank
Match: EJK50176.1 (hypothetical protein THAOC_30884 [Thalassiosira oceanica])

HSP 1 Score: 479.2 bits (1232), Expect = 3.100e-131
Identity = 334/947 (35.27%), Positives = 491/947 (51.85%), Query Frame = 0
Query:  125 NPVGLVDIVSNLHERTWASLDLHVVQEQLASLCDTVRAKDLARSS---------------------CFAEDVHEVHRRYKAVEEV-----------W----------------------QSVEPIPLNDP------MDIAPAVNFASRGNTLELPDLRSIAKALILLATLRDFMQAPQ------------------RAERVPQLAAYSEGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITDE----YIAQRNGRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRGRNPVGNNIHIEKDK-PGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQRG--VRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQRDALAGERREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKLQAADTFLAQLQEKEKKLEGLLKGV-GLGDEKATVEEALRELAAVRGQVAVDSKPEMVHVPSNIVLFKRDDIVREGEVVVCAGDQLNSDRAGK---------------VVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAALVNAGPYAGLAKNKEKKKKWTKADERNLKMLGKEMNTVSSATAVTLNAMRTSFNTIDVRGLRLRDAESKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMK-RHPYVGRHRSADESDGGDAFTQVFLK 970
            N + + D   ++H+RT  +LD  V+Q  LA  C+T  A++L   S                       A  V  VH RY AV+E+           W                      + V+ + L  P      +DI   ++    G  LE P++  ++  + L   + D+  A +                  + E   QL +  + I +  EL+ELL  AFD++G+LSG  FP + RLR ++  L   I +++  ++    M N +  E     +++ NGR V+P++  Y  S +GIVHD S +G+T YVEP +V++PTN+++  E EL  EEA++  ++T  I   ++ I  ++ +   +D+  AR KLG  L G  VPEV  EG +SA DA+HP+L+LRG   VG+++ I + K   ++LTGPNAGGKTI+LK LGL A++ R GIPVP ++    RVDFF P+LADIGD+QSV  DLSTFSGH+LVC+EVL+ A   ALVL+DE GSGTDP QGVAIAQALLEALL+ G RVAITTH++ LK+LA  DDRF V  M+FL  +PTY+   G +GES+AL VAERL+LP++VL+RA GL+D    K+ EL+++LE Q       ++E                                   A+   A  F  +L+EKEK LEG+L+ + G G  K  + ++  EL  V+ +V  +++    +VP +I    + D      + +     +N    GK                VK  G+ +++  S GG+ + ++ K  ++A  PS         P A L             DE         ++  S +T     +M+T  NT+D  GL   +++ K  N         R VVYILHGHGTG LK  +RE++K    +V   + AD++DGGDA T+V LK
Sbjct:  259 NTLNVEDQNIDMHQRTLDTLDYPVIQRALADECETQFARNLITKSMNTQVQSRDIQDTDADVLTMPITASSVEGVHSRYGAVQEMQRLMGGRITGFWSTARRNALKHKISSSSRGISNKKKVQRLSLGTPPIDGYTLDIESIMSIIDEGKVLEGPEILDVSSMMELCLDVLDWSDALEEWNRDNVGGEESEFEFELQQEPFVQLPSLVKQIHMDEELIELLATAFDEEGKLSGTTFPSIGRLRSKIRTLKRGILSSIESILALPSMRNKLAVESGGSLMSEINGRIVIPVQQQY--SSVGIVHDASRSGKTSYVEPSEVVQPTNELRSAESELRAEEAKVWRQLTESIVKHREEIERNVASLGQLDVVKARVKLGRRLDG-TVPEVKNEGVVSAIDAKHPILLLRGMEVVGSDVEIGQGKNQAMILTGPNAGGKTIILKLLGLFAMMARDGIPVPTKQSEKARVDFFEPVLADIGDIQSVDADLSTFSGHMLVCREVLNDARKDALVLLDEPGSGTDPNQGVAIAQALLEALLDRGCRVAITTHFLDLKQLASSDDRFAVAGMQFLGNRPTYKLIPGMIGESFALAVAERLKLPASVLDRANGLLDSETRKMGELLRDLEEQ-------KQEVEKTSEALKKKEFEMLELKAEMRSQQEKLEAKQLNARRDEAARFSKKLEEKEKILEGILERLQGSGATKKVIADSWTELRIVKREVMSEAE----NVPGSIRQLNQLDEANVELIPISELKGINKVEVGKSVVVCKKGAFYGKDATVKKLGKKLEL--SVGGMPVRLSLK--EIAFPPSSGRGQT---PVANL-------------DEGG----SSSVDASSGSTRDKGTSMKTKSNTVDCLGLNFEESKRKCINSFSKAAMGNRSVVYILHGHGTGVLKRKIREWLKNERQFVKSFKPADQADGGDALTRVELK 1167          
BLAST of NO06G02300 vs. NCBI_GenBank
Match: XP_002294078.1 (predicted protein [Thalassiosira pseudonana CCMP1335] >EED88433.1 predicted protein [Thalassiosira pseudonana CCMP1335])

HSP 1 Score: 455.7 bits (1171), Expect = 3.600e-124
Identity = 308/816 (37.75%), Positives = 459/816 (56.25%), Query Frame = 0
Query:  202 DIAPAVNFASRGNTLELPDLRSIAKALILLATLRDFMQAPQ--------------RAERVPQLAAYSEGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITDE----YIAQRNGRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRG-RNPVGNNIHIEKD-KPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVP--AQRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQRDALAGERREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKLQAADTFLAQLQEKEKKLEGLLKGV-GLGDEKATVEEALRELAAVRGQVAVDSKPEMVHVPSNIVLFKR-------DDIVR------------EGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAALVNAGPYAGLAKNKEKKKKWTKADERNLKMLGKE---MNTVSSATAVTLN-AMRTSFNTIDVRGLRLRDAESK-VDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKR-HPYVGRHRSADESDGGDAFTQVFLK 970
            D  P       G  LE P++  +   L +   + D+  A +              + E   +L + ++ I+I  EL +LLT+AFD +GRLSG  FP +  LR +V      I  ++  L+    M N +  E       + NGR V+P++  Y+   +GIVHD S +G+T YVEP +++ PTN++++ E EL  EEAR+  ++T  I   +  I  ++ +   +D+ + R KLG  L G VVP V  EG +S +DARHP+L+LR     VG+++ I  D   GL+LTGPN+GGKT++LK LGL A +VR GIPVP  A    RVDFF+PILADIGD+QSV GDLSTFSGH+LVC+EVL+ A   ALVLMDE+GSGTDP QGVAIAQALLEALL+ G RVAITTHY+ LK+LA  DDRF V  M+F+ G+PTY+   G +GES+AL VAERL+LP +V++RA  L+D     + ELI  LE+Q+  L  +++E                                   A+ + A  F A+L+EKE+ LE +L+ + G G  K  V ++  ++  ++ +   D++    +VP  +   K+       D++V             + +V+VC            VVK  G+ + V  + GG+ + +T K         E + L +AG   G+ K K+ +++ +   +R+++ L  E   ++  S+ATA+    +MR   NT++  G    +++ K +D F K  +   R VV+ILHGHGTG LK+ +R ++     +V   + AD++DGG+A T+V LK
Sbjct:  495 DFQPIFEIVDEGKVLEGPEILEVTTMLEIAMDVLDWRYALKEFNEEIDNEADSDLKQEPFVELVSLTDSIEIDDELFDLLTNAFDDEGRLSGTTFPFIGVLRAKVRTFKRDILASIDSLLAMPSMKNKLAVESGGALTMEINGRLVIPVQQKYQ--NIGIVHDASRSGKTTYVEPTEIVGPTNELRQAEAELRSEEARVWRQLTETIVKHRAEIERNVASIGQLDVVIGRVKLGKKLNG-VVPTVREEGVVSVKDARHPILLLRELEGVVGSDVEIGIDGNQGLILTGPNSGGKTVILKLLGLYAFMVRDGIPVPSKAYEPARVDFFTPILADIGDLQSVDGDLSTFSGHMLVCREVLNNAQENALVLMDELGSGTDPNQGVAIAQALLEALLDRGCRVAITTHYMDLKQLASTDDRFAVAGMQFVGGRPTYKLIPGMIGESFALAVAERLKLPQSVIQRANELLDTETRTMGELISSLEDQK-LLVDQKQE------ELKKREFEMLELKAEMKRQQERLEAKQINARREEAAKFAAKLEEKERLLEDILEKLKGSGASKKVVADSWTDIRIIKREALSDAE----NVPGVMQRLKQQQGQVGDDELVPISEMKGVNKVNIDDKVIVCKKGAFYGKEG--VVKEVGKKISV--AVGGVSVRLTTK---------EISFLPSAG---GVQKAKDPEEQQSSRAKRDMEYLTDEEVFVDVSSTATAIDKGVSMRMDANTVNCIGKNFEESKRKCIDAFSKATM-SNRSVVFILHGHGTGVLKKKIRSWLSTDRQWVKSFKPADQADGGEALTRVELK 1279          
BLAST of NO06G02300 vs. NCBI_GenBank
Match: CEL93116.1 (unnamed protein product [Vitrella brassicaformis CCMP3155])

HSP 1 Score: 431.8 bits (1109), Expect = 5.600e-117
Identity = 251/551 (45.55%), Positives = 349/551 (63.34%), Query Frame = 0
Query:  135 NLHERTWASLDLHVVQEQLASLCDTVRAKDLARSSCFAEDVHEVHRRYKAVEEVWQSVEPIPLNDPMDIAPAVNFASRG---NTLELPDLRSIAKALILLATLRDFMQ---------APQRAER-VPQLAAYSEGIDIPYELVELLTDAF---DKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITD----EYIAQRNGRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRGRNPVGNNIHIEKDKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQRDAL 666
            +L ERTW SLD  VV   LA    T   + LARS+ FA    E    Y+ V +V +  E +P    +DI   VN +  G   ++  LPDL  I  AL  +  L+ ++          A + ++R +  L    E + +  EL +L   AF   +    LSG++FP+L+RLRE + RL  S++  + Q+ +   +   + D    ++     GRFV+ ++  Y R GLGI HD S +G+TVY+EP +++EPTN++    + L  E ARI  +M+ ++      I  +++ A  VD++ AR  LG+ +GG  +P VGTEG I  + +RHPVL LRG  P  N+I +  D   LVLTGPNAGGKT+VLKTLGL AL VR G+PVPA  G RVD+F+PILADIGD+Q+VTGD+STFSGHLLV K VL +A  GALVLMDEMG+GTDP+QG A+AQALLE L+++G +         LKELA  D RF +GAME++ G+PTYR K G VGES AL+VAERL LPS+VL+RAR L+D+   ++TEL+KELE++++++
Sbjct:   58 DLFERTWQSLDWRVVMASLAEAASTSPGRRLARSASFAGSHDECLVLYEQVGDVRRLSEQLPFATSLDIEHLVNVSDEGRARSSFSLPDLYRIGTALDDVTHLKGWLSAAHDRLVAAAKRESDRGIGSLLELMEPVQLDSELTDLFNGAFEGSEDSPVLSGNRFPELRRLRESIARLEMSLQARIEQIARRPELQPKLADGSGGKWSRTDTGRFVIAVQRRY-RKGLGISHDFSGSGKTVYLEPAELVEPTNELMEARMSLRSEGARICSDMSWMVTRHAVAIANAVECAGRVDLAQARYLLGEKIGG-TIPTVGTEGKIHIDQSRHPVLALRGVEPTANDISLGFDYDALVLTGPNAGGKTVVLKTLGLFALFVRYGLPVPAMDGARVDWFNPILADIGDLQTVTGDVSTFSGHLLVSKAVLERAGRGALVLMDEMGTGTDPSQGAALAQALLETLVDSGCK---------LKELAASDRRFRIGAMEWMQGRPTYRLKLGMVGESLALDVAERLRLPSSVLDRARLLLDEDTRRLTELVKELEHEKESV 597          
BLAST of NO06G02300 vs. NCBI_GenBank
Match: GAX18208.1 (DNA mismatch repair protein MutS2 [Fistulifera solaris])

HSP 1 Score: 422.5 bits (1085), Expect = 3.400e-114
Identity = 353/930 (37.96%), Positives = 496/930 (53.33%), Query Frame = 0
Query:  135 NLHERTWASLDLHVVQEQLASLCDTVRAKDLAR-----------------------SSCFAEDVHEVHRRYKAVEEVWQ------------------SVEPIPLNDPMDIAPAVNFASRGNTLELPDLRSIAKALILLATLRDFM----QAPQRAERVPQLAAYSEGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITDE-----YIAQRNGRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRG--------------------------RNPVGNNIHIEKDKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQ-RGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQRDALAGERREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKLQAADTFLAQLQEKEKKLE----GLLKGVGLGDEKATVEEALRELAAVRGQVAVDSK--PEMVHVPSNIV----LFKRDDIVREGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAALVNAGPYAGLAKNKEKKKKWTKADERNLKM----LGKEMNTVSSATAVTLN---AMRTSFNTIDVRGLRLRDAESKVDNFIKVGIPQKRKVVYILHGHGTGQ-LKEGLREFMKRHPYVGRHRSADESDGGDAFTQVFLK 970
            +L++R+W +LD   + + L   C TV A+ + +                       +S  A  V     RY+AV E+ +                  SV P+     +++AP +   +R   LE PDL  I   L +L  + DFM    +  ++   +  L   +  I +     ELL +A D  GRLSG  FP + RLR  +  L   I + +  L+++  +   ++ +     Y     GR V+P++ T   + +GIVHD S +GQT YVEP +++ PTN++K++E EL  EEARI   +T  + ++++ +  S+ A A +D+ +AR +LG+   GA +P V  +G IS  +A+HPVL+L+                            + VG++I +     GLVLTGPN+GGKTI+LK LGL AL+ R GIP+P +    RVDFF PILADIGD+QSV GDLSTFSGH+LVCK VL +A   ALVLMDE+GSGTDPAQGVAIAQALLEALLETG+RVAITTHY QLK+LA  D+RF V  M+F+ G+PTY+   G VGES+AL VAER+ LP +VLERA  L+D    ++ +                   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  A  +          E  +KL+    G +      D K    +AL E   V   +A       EM    +++V    +  + ++V   +++VC    +    A  +   SG+   V  S  G++  ++FK  ++A  PS T  ++ A         K  +   +KA ER L +     G    T +  T   +    A+RT  NTIDVRG  L  A+SK ++     +   R VVYILHG+GTG  L+  +R ++K    V     A   DGGDAFT+V L+
Sbjct:  127 DLYQRSWDTLDFEPILQALQDECLTVPARKIVQHAIKVDTPQQKKSKADERHYDSSNSLMATTVEGCQERYRAVHELRRLLTSSKTFRNRNGKQAPLSVFPL-AGHALNLAPLLEDTTR--LLEGPDLYDI---LSVLNVVEDFMLWNQELKEQHPELEYLNRMASNITLNTTFHELLQNALDDKGRLSGTTFPVVGRLRARLRALKSDILSRLETLLETPSIKTKLSLQSGGPLYSQVSGGRLVIPVE-TSSANQIGIVHDSSRSGQTSYVEPTEIVGPTNELKQVESELRAEEARIWRSLTAQVQLNREGLESSIQAMAQLDLVMARLRLGESWQGA-IPAVEDKGVISLRNAKHPVLLLKAMRKRKKTKLSIRSRSGGAQEDMTNAVNDIVGSDIDLGDRHQGLVLTGPNSGGKTIILKMLGLAALMARSGIPIPCKDDNPRVDFFDPILADIGDLQSVGGDLSTFSGHMLVCKAVLDQAGKNALVLMDEVGSGTDPAQGVAIAQALLEALLETGSRVAITTHYTQLKQLAVADERFAVAGMQFVRGRPTYKLLPGTVGESFALSVAERVGLPLSVLERANELLDSETRQMGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKAFTKXXXXXXXXXXEILRKLKSDPSGKVVAKSWQDIKFVKRDALNEAENVPSVIARKEAEIKEMNEASADLVPLIEMRDKPNLVPGDKLIVCKKGAMFGREASFIKSLSGR---VEVSVNGMN--VSFKLAEVALPPSSTTTII-AKYNKSRGPQKGSQSSISKAAERALDIESTSSGGNSKTKTQTTPEPVRSTVAIRTQSNTIDVRGCTLEQAKSKAESAFSSCLMSNRSVVYILHGYGTGGILRNKVRNWLKTSNVVKEWAPASAEDGGDAFTRVVLR 1042          
BLAST of NO06G02300 vs. NCBI_GenBank
Match: GAX28752.1 (DNA mismatch repair protein MutS2 [Fistulifera solaris])

HSP 1 Score: 414.8 bits (1065), Expect = 7.100e-112
Identity = 354/931 (38.02%), Positives = 488/931 (52.42%), Query Frame = 0
Query:  135 NLHERTWASLDLHVVQEQLASLCDTVRAKDLAR-----------------------SSCFAEDVHEVHRRYKAVEEVWQ------------------SVEPIPLNDPMDIAPAVNFASRGNTLELPDLRSIAKALILLATLRDFM----QAPQRAERVPQLAAYSEGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITDE-----YIAQRNGRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLR-----------------GRNP----------VGNNIHIEKDKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQ-RGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQRDALAGERREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKLQAADTFLAQLQEKEKKLE----GLLKGVGLGDEKATVEEALRELAAVRGQVAVDSK--PEMVHVPSNIV----LFKRDDIVREGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAALVNAGPYAGLAKNKEKKKKWTKADERNLKMLGKEMNTVSSATAVT-------LNAMRTSFNTIDVRGLRLRDAESKVDNFIKVGIPQKRKVVYILHGHGTGQ-LKEGLREFMKRHPYVGRHRSADESDGGDAFTQVFLK 970
            +L +R+W +LD   +   L   C TV A+ L +                       SS  A  V     RY+AV E+                    SV P+     +++AP +   +R   LE PDL  I   L +L  + DF+    +  ++   +  L   +  I +     ELL +A D  GRLSG  FP + RLR  +  L   I   +  L+K+  +   ++ +     Y     GR V+P++ T   + +GIVHD S +GQT YVEP +++ PTN++K++E EL  EEARI   +T  + ++++ +  S+ A A +D+ +AR +LG+   G  +P V   G IS  +A+HPVL+L+                 G N           VG++I +     GLVLTGPN+GGKTI+LK LGL AL+ R GIP+P +    RVDFF PILADIGD+QSV GDLSTFSGH+LVCK VL +A   ALVLMDE+GSGTDPAQGVAIAQALLEALLETG+RVAITTHY QLK+LA  D+RF V  M+F+ G+PTY+   G VGES+AL VAER+ LP +VLERA  L+D    ++ +                   XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX  A  +          E  +KL+    G +      D K    +AL E   V   +A       EM    +++V    +  + +++   +++VC    +    A  +   SG+   V  S  G++  ++FK  ++A  PS T  +       G    K  +   +KA ER L +        +   A T         A+RT  NTIDVRG  L  A+SK ++     +   R VVYILHG+GTG  L+  +R ++K    V     A   DGGDAFT+V L+
Sbjct:  127 DLFQRSWDTLDFEPILRALQDECLTVPARKLVQQAIKVDTPQQEKSKQDERQTNRNSSLMATTVEGCQERYRAVHELRTLLTSSKTFRNRNGKQAPLSVFPL-AGHSLNLAPLLEDTTR--LLEGPDLYDI---LSVLNVMEDFILWNQELKEQHAELEHLNRMASSITLNTTFHELLQNALDDKGRLSGTTFPAVGRLRARLRALKSDILLRLETLLKTPSVKAKLSLQSGGPLYSQVSGGRLVIPVE-TSSANKIGIVHDSSRSGQTSYVEPTEIVGPTNELKQVESELRAEEARIWRSLTAQVQLNREGLELSIQAMAQLDLVMARLRLGESWEG-TIPVVEDNGVISLRNAKHPVLLLKAMRKRKKSKLSILSTKSGGNQEDMNDAMDEVVGSDIDLGGRHQGLVLTGPNSGGKTIILKMLGLAALMARSGIPIPCEDDSPRVDFFDPILADIGDLQSVGGDLSTFSGHMLVCKAVLDQAGKNALVLMDEVGSGTDPAQGVAIAQALLEALLETGSRVAITTHYTQLKQLAVADERFAVAGMQFVRGRPTYKLLPGTVGESFALSVAERVGLPLSVLERANELLDSETRQMGDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKAFTKXXXXXXXXXXEILRKLKSDPSGKVVAKSWQDIKFVKRDALNEAENVPSVIARKEAEIKEMNEASADLVPLIEMRDKPNLIPGDKLIVCKKGAMFGREASFIKSLSGR---VEVSVNGMN--VSFKLAEVALPPSATTRIAKYNKSRG--PQKGSQSSISKAAERALDIESTSSGGNAKVKAQTPPEPVRSAVAIRTQSNTIDVRGCTLEQAKSKAESAFSSCLMSNRSVVYILHGYGTGGILRNKVRNWLKTSSAVKEWAPASAEDGGDAFTRVVLR 1042          
BLAST of NO06G02300 vs. NCBI_GenBank
Match: XP_002181948.1 (predicted protein [Phaeodactylum tricornutum CCAP 1055/1] >EEC46488.1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1])

HSP 1 Score: 387.1 bits (993), Expect = 1.600e-103
Identity = 328/852 (38.50%), Positives = 455/852 (53.40%), Query Frame = 0
Query:  179 HRRYKAVEEVWQSVEPIPLNDPMDIAPAVNFASRGNTLELPDLRSIAKALILLATLR----------DFMQAPQRAERVPQLAAYSEGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITDE-----YIAQRNGRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRG-RNPVGNNIHIEKD-KPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQRGVRVDFFSPILADIGDMQS----------------------VTGDLSTFSGHLLVCKEVLSKAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQRDALAGERREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKLQAADTFLAQLQEKEKKLEG----LLKGVGLGDEKATVEEALRELAAVRGQVAVDSKPEMVHVPSN-----IVLFKRDDIVREGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAALVNAGPYAGLAKNKEKKKKWTKADERNL---KMLGKEMNTVSSATAVTLNA--------MRTSFNTIDVRGLRLRDAESKVDNFIKVGIPQKRKVVYILHGHGT-GQLKEGLREFM-KRHPYVGRHRSADESDGGDAFTQVFLK 970
            +R  K+ +E      P    +  D+   +  A +G  LE  ++  +++ L  +  +R          + +Q       +P+LA+    I +   L +LL +AFDKD RLSG  FP L RLR  V  L   I  T+  L+    + N +  E     Y     GR VLP+   Y  S +GIVHD S +G+TVYVEP +++ PTN++++ E EL  EEAR+   +T  I  ++  +  S+ A   +D+ +AR  LG  L G  +P V  EG I   +A+HPVL+LR  +N VG+++ +  D   GLVLTGPN+GGKT++LK LGL+AL+ R GIPVPA R  RV   +    D  D  +                           STFSGH+LVC+EVL+ +   ALVLMDE+GSGTDPAQGVAIAQALLEA+LETGARVAITTHY+QLK+LA  DDRF V  M+F+ G+PTY+   G VGES+AL VAERL LP +V++RA  LMD    ++ +LI+ELE+Q+  +       XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                  KL+      +      D K    +AL E   +   VA   K   V          I   +    ++EG+ V+         R   +VK+ G  V+V+ +   + + +T      A   S +      G         + +    +A ER L   +  G   ++      V ++A        MRT+ NT+DVRG  L +A+ ++ +     +   R VVY+LHGHGT G LK  LR+++ K    V   + AD +DGGDAFT+V L+
Sbjct:  249 YRNRKSYKETLAGKPPPLGGNAFDLLAILAVAEQGKVLEGEEIFDVSQMLDRMQDVRLWSDDGLLNVNRLQQDIEFVELPKLASC---IQVNTTLQDLLHNAFDKDDRLSGTTFPVLGRLRARVRSLKADIMGTLDSLLALPSIKNKLALESGGPIYSEVNGGRLVLPVAQKY-ASSVGIVHDTSRSGKTVYVEPTELVGPTNELRQAEGELRAEEARVWRSLTEQILKNQIVLETSVRAIGQLDLVMARLLLGRKLSG-TIPVVQDEGVIQLRNAKHPVLLLRQVKNVVGSDVDLGADGNQGLVLTGPNSGGKTVILKLLGLMALMSRGGIPVPADR-PRVAVGAKSYGDEYDSNNXXXXXXXXXXXXXXXXXXXXXXXXXXXSTFSGHMLVCREVLANSGRNALVLMDELGSGTDPAQGVAIAQALLEAILETGARVAITTHYMQLKQLAASDDRFSVAGMQFVQGRPTYKLLPGTVGESFALAVAERLNLPQSVIDRAEALMDSETRQLGDLIRELEDQKGLVDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKLKADPTRRVLAKSWDDIKFVKRDALNEAENIPSIVARKKKANAVLAAEQGELIPIAELRERPELKEGDKVIVCKQGPVFGREATIVKSLGSRVEVLVNNMNVGLKLTQVALPTASFRSTSGPANTWG---------DGRLSIGRAAERALATERCAGPSTSSXXXXDTVAVSAPSKSRGVTMRTTSNTVDVRGCNLEEAKDRIRSAFSASLLAGRSVVYVLHGHGTGGVLKSKLRQWLPKEKTLVDSFQGADAADGGDAFTRVQLR 1085          
BLAST of NO06G02300 vs. NCBI_GenBank
Match: OEU22331.1 (P-loop containing nucleoside triphosphate hydrolase protein, partial [Fragilariopsis cylindrus CCMP1102])

HSP 1 Score: 377.1 bits (967), Expect = 1.600e-100
Identity = 218/430 (50.70%), Positives = 290/430 (67.44%), Query Frame = 0
Query:  255 EGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITDE-------YIAQRN--GRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRG-RNPVGNNIHIEKD-KPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQRG-------VRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASG----ALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQR 663
            +GI +   L  LL +AFD +G+LSG  FP L +LR +V  +   I  T+  +++   + + +  E        +A  N  GR VLPI   Y  S LGIVHD S +G+TVYVEP +++ PTND++ +E +L  EEAR+   +T  +  ++  +  S+ A A +D+ VAR  LG  L G V+P+V  EG IS  +A+HPVL+LR     VG++I +  D K GLVLTGPN+GGKT++LK LGLLAL+ R GIP+PA+ G        RVDFF P+LADIGD+QSV  DLSTFSGH+ +C+EVL+    G    +LVLMDE+GSGTDP QGVAIAQALLEALL+TG RV ITTHY+ LK+LA  DDRF VG M+F+ G+PTY+   G VGESYAL VAERL+LP  VL+RA  L+D    ++ +LI +LE+Q+
Sbjct:   30 DGIKLNTTLQNLLEEAFDDEGKLSGKTFPFLGQLRAKVRTMKADILQTLDSIVQLPSIKSKLALESGGPLISEVASSNGAGRLVLPINPKY-ASALGIVHDSSRSGKTVYVEPSEIVGPTNDLRVVERDLEAEEARVWRLLTEQVWNNQRDLRASVQAVAQLDLCVARYTLGQRLEG-VIPDVQDEGIISLRNAKHPVLLLRKMEKVVGSDISLGVDGKQGLVLTGPNSGGKTLILKLLGLLALMSRSGIPIPAEHGDMDGVYLPRVDFFDPVLADIGDIQSVDSDLSTFSGHMYICREVLALTKGGNGKNSLVLMDELGSGTDPNQGVAIAQALLEALLDTGCRVVITTHYMALKQLASSDDRFSVGGMQFVGGRPTYKLLPGVVGESYALAVAERLQLPQTVLDRASELLDSETRQMGDLISDLEDQK 457          
BLAST of NO06G02300 vs. NCBI_GenBank
Match: XP_005830416.1 (hypothetical protein GUITHDRAFT_110559 [Guillardia theta CCMP2712] >EKX43436.1 hypothetical protein GUITHDRAFT_110559 [Guillardia theta CCMP2712])

HSP 1 Score: 368.2 bits (944), Expect = 7.700e-98
Identity = 296/852 (34.74%), Positives = 439/852 (51.53%), Query Frame = 0
Query:  121 DGAANPVGLVDIVSNLHERTWASLDLHVVQEQLASLCDTVRAKDLARSSCFAEDVHEVHRRYKAVEEVWQSVEP---IPLNDPMDIAPAVNFASRGNTLELPDLRSIAKALILLATLRDFMQAPQRAERVPQLAAYSEGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIRNTVGQLMKSTGMSNMITDEYIAQRNGRFVLPIKNTYKRSGLGIVHDQSNTGQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGSLDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRGRNPVGNNIHIEKDKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQRGVRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASGALVLMDEMGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVGAMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVKVTELIKELENQRDALAGERREAXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXAKLQAADTFLAQLQEKEKKLEGLLKGVGLGDEKATVEEALRELAAVRGQVAVDSKPEMVHVPSNIVLFKRDDIVREGEVVVCAGDQLNSDRAGKVVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAALVNAGPYAGLAKNKEKKKKWTKADERNLKMLGKEMNTVSSATAVTLNAMRTSFNTIDVRGLRLRDAESKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVGRHRSADESDGGDAFTQVFLK 970
            DGAA   G  D + +L  +T  SLD   ++  L +   T   +   + +   ++V EV R Y AVEE+    +    +PL+   D+   V  A++G+ LEL +L    K +  +  + D +         P L   ++ I +   +V  L  +FD  G+LS   +PQL+ LR+E+D++  ++ +T+  ++K T +++ + D +   R  RFVLP+  T K    GIVH  S TG TVY+EP +VI+  N ++  E EL  EE RI+G +++ +      +  +  A   +D++ AR K  ++L  AV PEV + G I     RHPVLVLRG  PV N++ +  +KP +V++GPNAGGKTIVLKT+GL ALLV+ G  VP + G ++  F  +LA IGD Q+V  DLS+FS HL     +L  A  G L+L+DE+ SGTDP QG A+AQA+LE LL    ++ +TTHY QLK LA  D RF V AM++++G PTYR   G  GES+A  +A+++ +   V+ERA  LM +   K+T+ +                 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                    +          +   V++A + L  +R  +    +     VP  I         +EG+ V+     L+    G+V+       ++    G L +     K D  R      A  +  P A  +  +++ KK    + +                A    A+RT  NT+D+RG R       +D+F+       +   +IL GHGTG +K+ ++E +    Y   +  A    GGDA T V LK
Sbjct:   96 DGAA---GTEDALESLRRKTEESLDWKFLKATLVNCSVTSMGRSALQQARPFKEVEEVERAYNAVEEIRMLNDDGTRLPLSQVGDVRELVTRAAKGDVLELDELYLCTKTMGAMREIEDVLHG---RNETPTLMDIADDIHLDGSVVLQLKRSFDNVGQLSTKMYPQLQDLRKEIDKIAAAVTSTMDAMLKDTKIASTLQDSFYTIRENRFVLPVSATNKNKINGIVHGVSGTGSTVYIEPQEVIDLNNKLRLAEGELKAEEIRIMGLLSKKVGSLARDVKLATSAVCQLDMAAAREKFAEML-KAVRPEVSSGGEIDIRSGRHPVLVLRGIKPVANDMSMNGEKPAVVISGPNAGGKTIVLKTVGLCALLVQHGCWVPCEEGSKMALFRRVLASIGDQQTVEEDLSSFSSHLKTLNTMLQHADEGTLILLDEIASGTDPTQGAALAQAILEELLGKAPKMVVTTHYSQLKALATVDSRFGVAAMQYVNGAPTYRVLHGVSGESHAFSIAKKMGILEGVIERAESLMGE-QAKMTKTLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQ-------QNPDFKEVDKAKKLLDGLRTNLTAQEEEGAGEVPEGI---------KEGDFVML----LDVGSEGEVISPPSSKGELQVRVGPLTL---RTKVDRVRKVEGKTASSSPSPRASGSLTQKRGKKTGSKEYK----------------AALQRAVRTPVNTLDLRGFRAEQVGDAIDSFLDKMTSANQPTAFILSGHGTGVVKKVVQEHLATCMYAAAYAPASFEQGGDALTVVALK 900          
The following BLAST results are available for this feature:
BLAST of NO06G02300 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name      E-value     Identity (%)  Description
EWM26231.1      0.000e+0    85.04         dna mismatch repair protein [Nannochloropsis gadit...
CBN75783.1      2.500e-157  42.95         MutS protein homolog 1B MutS-like ATPases involved...
EJK50176.1      3.100e-131  35.27         hypothetical protein THAOC_30884 [Thalassiosira oc...
XP_002294078.1  3.600e-124  37.75         predicted protein [Thalassiosira pseudonana CCMP13...
CEL93116.1      5.600e-117  45.55         unnamed protein product [Vitrella brassicaformis C...
GAX18208.1      3.400e-114  37.96         DNA mismatch repair protein MutS2 [Fistulifera sol...
GAX28752.1      7.100e-112  38.02         DNA mismatch repair protein MutS2 [Fistulifera sol...
XP_002181948.1  1.600e-103  38.50         predicted protein [Phaeodactylum tricornutum CCAP ...
OEU22331.1      1.600e-100  50.70         P-loop containing nucleoside triphosphate hydrolas...
XP_005830416.1  7.700e-98   34.74         hypothetical protein GUITHDRAFT_110559 [Guillardia...
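
The identity values in this table match the identical-position counts on each hit's HSP line, expressed as a percentage of the alignment length (for example, 722/849 for EWM26231.1). A small Python sketch that recomputes the figure from an HSP line is given below; `identity_percent` is a hypothetical helper, and rounding to two decimals is an assumption chosen to match the values shown on this page.

```python
import re

def identity_percent(hsp_line):
    """Recompute percent identity from an HSP line like 'Identity = 722/849 (85.04%), ...'."""
    m = re.search(r"Identity\s*=\s*(\d+)/(\d+)", hsp_line)
    if m is None:
        raise ValueError("no 'Identity = a/b' field found")
    identical, aligned = int(m.group(1)), int(m.group(2))
    # Identical positions divided by the alignment length, gaps included.
    return round(100.0 * identical / aligned, 2)

print(identity_percent("Identity = 722/849 (85.04%), Query Frame = 0"))  # 85.04
```

Because the denominator is the alignment length rather than the full protein length, a short high-identity hit such as OEU22331.1 (218/430) can rank below longer hits with lower identity.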

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name  Unique Name  Species                                       Type
nonsL091      nonsL091     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ncniR028      ncniR028     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ngnoR147      ngnoR147     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name  Unique Name  Species                                       Type
NSK000230     NSK000230    Nannochloropsis salina (N. salina CCMP1776)   gene


The following polypeptide feature(s) derive from this gene:

Feature Name  Unique Name           Species                                       Type
NO06G02300.1  NO06G02300.1-protein  Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name               Unique Name  Species                                          Type
jgi.p|Nanoce1779_2|631984  gene_6546    Nannochloropsis oceanica (N. oceanica CCMP1779)  gene
MSH1B                      gene4281     Nannochloropsis gaditana (N. gaditana B-31)      gene


The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Species                                       Type
NO06G02300.1  NO06G02300.1  Nannochloropsis oceanica (N. oceanica IMET1)  mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO06G02300 ID=NO06G02300|Name=NO06G02300|organism=Nannochloropsis oceanica|type=gene|length=3381bp
CGCGGTGGGACTTTGCGTGGATTGAAGCAGAACACGAACACCAGCGCATG
CGTGCTTCGTTGCTCACCTTCTCCTCCTGCTGAAGGCCTTTCAGCTCTAC
GTCCTGAGCGTATTCTCAATTATTCACACTCACACAGGCACCAATACACA
GCACTTTGCATCGCAGCACCAAGGACTCATTCTAACTCGGCTTCAGTTCG
ACCTCCCCGCACCTCATCCCCCTCGCCTTCCCCTGTCGATCCGTACATTT
CACAGTCTTGACTAATGCAGGCGCCACGCCATCGGAGGGCTGTCGTCCTC
CTCGCACTGCTCCTGGCAGTTTTTGAAACATgtaagaggcgctgagtggg
gagggttaggatgagattttatcctgtgagccagcacaaagaggatcggc
ctttggtcatgccacgtacccaccctgccctctcatccctccactcgcat
acaacacaacagCTACTGCCTTCCTGCACAGCCTACCGACAACAAAGTTA
AGAGATGTCCTTGCCGAAGGATTGAGGGCCGCGCATACGAGACTACCACT
CCCCTTTCCTGCCGCTGCCACCGCTGCTGCCCCCTTCACCCCCCCAACCT
ACTCCGCCTCCCTTACCGCCGCCAAGGCCTCTTTATCTGACGCTGAACTC
CTGCGGCGTGCGCGCGACAACGCCAATAACAACCCAAAAAATAAGTATAA
AAAGAATACCAACGCCCCTCCCTCGACCTCGGCTTCGACGGGCCAAGGAC
AGCAAGACGGCGCTGCCAACCCCGTGGGGCTCGTCGACATTGTCAGTAAC
CTCCACGAACGCACCTGGGCCTCGCTCGATCTTCACGTGGTGCAAGAGCA
ATTAGCGAGCCTATGTGATACAGTCCGTGCAAAGGACTTGGCGCGGTCCT
CCTGCTTTGCTGAAGATGTGCATGAGGTCCACCGACGTTACAAGGCCGTA
GAGGAAGTCTGGCAATCCGTCGAACCCATCCCTTTGAACGACCCTATGGA
CATTGCTCCTGCCGTTAATTTTGCCTCGCGGGGAAATACATTAGAGTTGC
CTGACCTTCGTTCGATAGCAAAGGCGCTGATACTGCTGGCGACGCTGCGT
GATTTCATGCAGGCCCCTCAGAGGGCCGAGCGTGTGCCACAGCTAGCGGC
TTATTCGGAAGGTATTGATATACCCTATGAATTGGTGGAATTGTTGACGG
ACGCATTTGATAAGGATGGGCGGTTGAGTGGGGACAAATTCCCACAGTTG
AAGCGATTGAGGGAGGAGGTGGATAGGTTGTATGGGAGCATTAGAAATAC
TGTCGGGCAGCTGATGAAGAGCACAGGGATGAGTAACATGATCACGGACG
AGTATATTGCGCAGCGCAATGGTCGGTTCGTCTTGCCGATAAAAAACACG
TACAAAAGGTCTGGGTTGGGGATAGTGCATGATCAATCCAATACCGGCCA
AACCGTGTATGTGGAACCAGTCCAAGTTATAGAGCCCACCAACGACATGA
AACGCCTCGAGCTAGAGCTCCTCCAGGAAGAAGCACGCATAGTCGGGGAA
ATGACCCGGCTTATTGCCATCTCCAAAGACCGCATTCTCGGCAGTCTTGA
CGCAGCCGCGCTCGTGGACATCTCCGTCGCACGGGCCAAGCTAGGCGACG
TGCTCGGGGGGGCCGTAGTACCTGAGGTGGGTACTGAAGGCTGCATATCC
GCCGAGGATGCACGACACCCTGTCCTCGTCCTCCGAGGGCGTAACCCGGT
CGGGAATAACATTCACATTGAAAAAGATAAGCCAGGACTGGTGCTGACGG
GCCCGAACGCCGGAGGGAAAACGATTGTGTTAAAGACCTTGGGGCTACTT
GCCTTGTTGGTGCGCTTGGGCATCCCTGTCCCAGCACAGAGGGGGGTAAG
GGTTGATTTTTTCTCTCCCATTCTGGCGGATATTGGGGACATGCAAAGTG
TTACCGGCGACCTATCGACGTTCTCGGGGCATTTGCTAGTGTGTAAGGAG
GTATTGAGTAAGGCCGCATCGGGGGCGTTGGTGCTGATGGATGAGATGGG
GAGTGGGACGGATCCCGCACAGGGTGTTGCGATTGCGCAAGCGTTGTTGG
AGGCACTGCTGGAGACGGGTGCTAGGGTGGCAATCACCACGCACTATGTC
CAGTTGAAGGAACTCGCGCAGAAGGATGACAGGTTTGTGGTGGGGGCGAT
GGAGTTCCTGGACGGAAAACCGACGTATCGCTTTAAGGAAGGGGCCGTGG
GTGAAAGCTATGCACTGGAGGTGGCGGAAAGGCTCGAGTTACCGTCGGCG
GTGCTTGAAAGGGCTAGGGGGTTGATGGATAAGGGGGTGGTGAAGGTGAC
GGAACTGATCAAGGAGTTGGAGAACCAGCGGGATGCTCTGGCGGGGGAGC
GTAGGGAGGCGGAAGCGAGAGAGCAGGAGATGAGGAAGATGCAGGCAGAG
ATGGAGAAGAAGCAGAGGCAGCTGGAAGCAAGGGAGGTGGAGGTGGAAAA
AATGAAATATCGAGCGAAGCTGCAAGCGGCGGATACGTTTTTGGCCCAGC
TGCAGGAGAAGGAGAAGAAGCTGGAGGGGCTATTGAAGGGTGTGGGGTTG
GGAGATGAGAAGGCGACGGTGGAGGAGGCATTGAGAGAGCTGGCGGCAGT
GAGAGGGCAGGTGGCGGTGGATAGCAAGCCGGAGATGGTGCATGTGCCGT
CGAACATAGTGTTATTCAAACGGGATGATATTGTGAGAGAGGGGGAAGTG
GTGGTGTGCGCGGGGGATCAGTTGAATTCGGATAGGGCGGGGAAGGTTGT
GAAGGCATCGGGGCAGAACGTGGATGTGATGTTCAGTAAGGGAGGCTTGG
ATATTGTGATCACATTCAAGAAGACAGACCTTGCTCGCGCTCCTTCTGAA
ACGGCTGCCTTGGTGAATGCGGGTCCTTATGCAGGCTTGGCTAAGAATAA
GGAGAAGAAAAAGAAATGGACCAAGGCGGACGAGCGGAATTTGAAGATGC
TAGGGAAAGAGATGAACACGGTGTCGAGTGCTACGGCCGTCACTTTAAAC
GCAATGCGCACTTCATTCAACACGATTGATGTCCGGGGTTTGCGCTTGCG
TGATGCGGAGAGCAAAGTCGATAATTTCATCAAGGTGGGCATCCCCCAAA
AGCGCAAGGTGGTGTACATTTTGCATGGCCATGGCACGGGTCAATTGAAG
GAGGGGCTAAGGGAGTTTATGAAACGACATCCGTACGTGGGCCGGCACCG
CTCTGCGGATGAGTCGGATGGGGGCGATGCCTTTACGCAGGTGTTTTTGA
AGTAAAATGCGTGAATGCGGTGGAGTGTAACATACAAGGCTATCAAATAA
ATAATTTGTAACAATAATCTGGAATGAGCGG
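
A record in this layout can be checked programmatically. The following is a minimal Python sketch, not the database's own tooling: it reads the `key=value` fields from the header line, compares the declared `length` with the number of letters actually present, and lists runs of upper-case and lower-case letters. The function names and the toy record `TOY1` are hypothetical. The page does not state what the lower-case stretch in the gene sequence means; treating it as non-exonic (for example intronic) sequence is an assumption.

```python
import re


def parse_fasta_record(text):
    """Split one FASTA record into (header_fields, sequence)."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("record must start with '>'")
    header = lines[0][1:]
    # The first token is the record name; the rest is '|'-separated key=value pairs.
    name, _, rest = header.partition(" ")
    fields = {"record": name}
    for item in rest.split("|"):
        key, sep, value = item.partition("=")
        if sep:
            fields[key] = value
    return fields, "".join(lines[1:])


def declared_length(fields):
    """Return the leading integer of the 'length' field (e.g. '3381bp' -> 3381)."""
    match = re.match(r"(\d+)", fields.get("length", ""))
    return int(match.group(1)) if match else None


def split_by_case(sequence):
    """Return same-case runs as (is_upper, start, end), 0-based and half-open."""
    return [
        (m.group(0)[0].isupper(), m.start(), m.end())
        for m in re.finditer(r"[A-Z]+|[a-z]+", sequence)
    ]


# Hypothetical toy record in the same header layout as the listing above.
TOY = """>TOY1 ID=TOY1|Name=TOY1|organism=Example organism|type=gene|length=20bp
ACGTACGTgtaa
gcagACGT
"""

fields, seq = parse_fasta_record(TOY)
print(fields["Name"], declared_length(fields), len(seq))
for is_upper, start, end in split_by_case(seq):
    print("upper" if is_upper else "lower", start, end)
```

Applied to the gene record above, the same check should report a declared length of 3381 and, if the listing is complete, 3381 letters.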

protein sequence of NO06G02300.1

>NO06G02300.1-protein ID=NO06G02300.1-protein|Name=NO06G02300.1|organism=Nannochloropsis oceanica|type=polypeptide|length=970bp
MQAPRHRRAVVLLALLLAVFETSTAFLHSLPTTKLRDVLAEGLRAAHTRL
PLPFPAAATAAAPFTPPTYSASLTAAKASLSDAELLRRARDNANNNPKNK
YKKNTNAPPSTSASTGQGQQDGAANPVGLVDIVSNLHERTWASLDLHVVQ
EQLASLCDTVRAKDLARSSCFAEDVHEVHRRYKAVEEVWQSVEPIPLNDP
MDIAPAVNFASRGNTLELPDLRSIAKALILLATLRDFMQAPQRAERVPQL
AAYSEGIDIPYELVELLTDAFDKDGRLSGDKFPQLKRLREEVDRLYGSIR
NTVGQLMKSTGMSNMITDEYIAQRNGRFVLPIKNTYKRSGLGIVHDQSNT
GQTVYVEPVQVIEPTNDMKRLELELLQEEARIVGEMTRLIAISKDRILGS
LDAAALVDISVARAKLGDVLGGAVVPEVGTEGCISAEDARHPVLVLRGRN
PVGNNIHIEKDKPGLVLTGPNAGGKTIVLKTLGLLALLVRLGIPVPAQRG
VRVDFFSPILADIGDMQSVTGDLSTFSGHLLVCKEVLSKAASGALVLMDE
MGSGTDPAQGVAIAQALLEALLETGARVAITTHYVQLKELAQKDDRFVVG
AMEFLDGKPTYRFKEGAVGESYALEVAERLELPSAVLERARGLMDKGVVK
VTELIKELENQRDALAGERREAEAREQEMRKMQAEMEKKQRQLEAREVEV
EKMKYRAKLQAADTFLAQLQEKEKKLEGLLKGVGLGDEKATVEEALRELA
AVRGQVAVDSKPEMVHVPSNIVLFKRDDIVREGEVVVCAGDQLNSDRAGK
VVKASGQNVDVMFSKGGLDIVITFKKTDLARAPSETAALVNAGPYAGLAK
NKEKKKKWTKADERNLKMLGKEMNTVSSATAVTLNAMRTSFNTIDVRGLR
LRDAESKVDNFIKVGIPQKRKVVYILHGHGTGQLKEGLREFMKRHPYVGR
HRSADESDGGDAFTQVFLK*
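
The protein listing ends with `*`, the usual FASTA symbol for a translation stop, so the number of characters in the listing and the number of amino-acid residues differ by one. The sketch below is a hedged illustration with hypothetical helper names: it counts residues without the trailing `*` and derives the minimum coding length (residues plus one stop codon, three nucleotides each).

```python
def residue_count(protein):
    """Count amino-acid residues, ignoring whitespace and one trailing '*'."""
    seq = "".join(protein.split())
    if seq.endswith("*"):
        seq = seq[:-1]
    if "*" in seq:
        raise ValueError("internal stop symbol found")
    return len(seq)


def min_cds_length(protein):
    """Nucleotides needed to encode the residues plus one stop codon."""
    return 3 * (residue_count(protein) + 1)


# Hypothetical toy protein wrapped over two lines, ending in a stop symbol.
toy = "MQAPRHRRAV\nVLK*"
print(residue_count(toy), min_cds_length(toy))
```

If the listing above is complete, the same arithmetic gives 969 residues for NO06G02300.1, so the 970 in the header appears to include the `*`, and the coding sequence would need 2910 nucleotides. That is shorter than the 3381-letter gene sequence, which would fit a gene that also includes untranslated and lower-case (assumed non-exonic) sequence.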
Synonyms
Publications