NO21G01170, NO21G01170 (gene) Nannochloropsis oceanica

Overview
Name: NO21G01170
Unique Name: NO21G01170
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 5177
Alignment location: chr21:355384..360560 +
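
The reported sequence length can be checked against the alignment location. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the length given in this record; the helper name `parse_location` is hypothetical and not part of any database API.

```python
import re

# Hypothetical helper: parses a location string such as "chr21:355384..360560 +".
# Assumption: coordinates are 1-based and inclusive at both ends.
LOCATION_RE = re.compile(r"^([^:]+):(\d+)\.\.(\d+)\s*([+-])?$")

def parse_location(text):
    """Split a location string into sequence id, start, end, strand and length."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    start, end = int(start), int(end)
    return {
        "seqid": seqid,
        "start": start,
        "end": end,
        "strand": strand,
        # Inclusive coordinates: both endpoints count toward the length.
        "length": end - start + 1,
    }

location = parse_location("chr21:355384..360560 +")
print(location["length"])  # 5177, matching the sequence length in this record
```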

Properties
Property Name   Value
Description     DNA polymerase theta
Alignments
The following features are aligned:
Aligned Feature   Feature Type   Alignment Location
chr21             genome         chr21:355384..360560 +
Analyses
This gene is derived from or has results from the following analyses:
Analysis Name                                    Date Performed
PRJNA769958                                      2024-08-13
GSE178672                                        2023-12-15
PRJNA933693                                      2023-10-26
PRJNA943449                                      2023-07-19
GSE149904                                        2020-10-10
NoIMET1_WT_HS                                    2020-09-29
GSE139615                                        2020-03-11
PRJNA241382                                      2020-03-11
BLASTP analysis of N. oceanica IMET1 genes       2019-07-11
InterPro analysis for N. oceanica IMET1 genes    2019-07-11
GO annotation for IMET1v2 genes                  2019-07-10
PRJNA182180                                      2019-04-25
Gene prediction for N. oceanica IMET1            2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Biological Process
Term         Definition
GO:0006261   DNA-dependent DNA replication
GO:0071897   DNA biosynthetic process
Vocabulary: Molecular Function
Term         Definition
GO:0003887   DNA-directed DNA polymerase activity
GO:0003676   nucleic acid binding
GO:0005524   ATP binding
Vocabulary: INTERPRO
Term         Definition
IPR027417    P-loop_NTPase
IPR002298    DNA_polymerase_A
IPR011545    DEAD/DEAH_box_helicase_dom
IPR014001    Helicase_ATP-bd
IPR001650    Helicase_C
Homology
BLAST of NO21G01170 vs. NCBI_GenBank
Match: EWM22669.1 (dna polymerase theta [Nannochloropsis gaditana])

HSP 1 Score: 979.9 bits (2532), Expect = 9.400e-282
Identity = 576/1104 (52.17%), Positives = 700/1104 (63.41%), Query Frame = 0
Query:  553 EGLGIWRRVGEGGXXXXXSVGRRTAV-GSAYLGRG-----RGVRRRGGVADKVYLALNGDRLRGREAVERSKEDLLKMRGGLTFNAQRAAIQKSLVTSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXKAHLSYPAWGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKLRMAASL-------------------------SPSLAGAV-------GGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEEL--XXXXXXXXXXXXXXXXSSWTRVRDLPPYHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAISVGLKDDLLREKVRQGRKELLIRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSALQTGMGGGLERALLEVVMGKLAVGEEEE------EXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREALVYLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLAL-----GGRSWPMSREG---GREKE-REEEEGVARRFVYGLMLHEMLQEVPLVRLERGEVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPFEAE--TKGEXXXXXXXXXXXXXXDRVAERIAGLIVRKARGVVRKQL 1600
            EGLGIWRR           VG R  V G+    RG      G           YLALNG+RL+G  A+ER ++D  K++ GLTF AQR+A++ +L+                                         K  LSYP WG+PEV+VRSYEE+GV +LFPWQV CLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLM+RALTHAVE GTA+FVVPF+ALAEEKA YFR++WAG++LGVK+FHSDAVD++L+E+VHVAVCTIERANALVNRLLERGEL R+KVVVVDELHM+GD  RGFLIEVML+K+R+AASL                         +    GA            +QI+GLSATLPNL Q+A+WL+AHLY+T FRPVQLSLLLC GK LE L                  S+W  +R+LP   SD+TQAV+HLCLE+VS G GVL+FC++RAWT+RCA  +ARA +  L       KV++GR ELL RL LT VGL+ +LE  V+ GVAFHHAG+TMEERTL+E GFKTG+LS +VATSTLAAGVNLPARRVIVR+R GF+G E+   +F QMCGRAGR+GID +GEAILMT E ++  ARAFA+R LPPM SAL  G GGG+ERALLEVVMG+LA  +EEE      E                                      R  L YLV++R+I GR S+   +                                       + +V      PAA      GA P        L   R P     GTQLGAATFF+ MSP DAT ML +LSQA   G+IL+SDLHLL+LCVPPNR YFTPDW+DL RRSQRWAPEVA+VAAA+GV+E  +ER+       G R+  +   G   G +K    +E  + RRFVY L+LH+MLQEVPL++LE G V RGTLQAL+S+AR+FCGM+ VFCKHL+W+ LARLI+GLS+RLE+ VGE+L+ LCR+ PEL+HA RARALF+AGFQ  +++A A VEDVA VL+ SMPF AE  +                  RVA RIA LIVRKA+ V R+++
Sbjct:  439 EGLGIWRRGRGDPRKVGRGVGHRKGVLGNVEANRGSVKGPAGXXXXXXXXAMTYLALNGERLQGPAAMERHRQDTAKIQQGLTFTAQRSALRAALLA-------RQPTAVGEGGVDGSSHGGVPIPPGRASTEETDAKNLLSYPGWGIPEVIVRSYEELGVRRLFPWQVACLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMLRALTHAVEGGTALFVVPFVALAEEKAGYFRRVWAGVDLGVKAFHSDAVDATLSEDVHVAVCTIERANALVNRLLERGELARVKVVVVDELHMVGDAGRGFLIEVMLSKVRLAASLWSQASLMASDHATGNAVRLDPAAEVNDQRGGATVDCTTRKNAARVQIIGLSATLPNLPQVAAWLDAHLYVTDFRPVQLSLLLCAGKRLERLRRAPPPNPSKASSASASLSAWEPLRELPSTISDETQAVLHLCLESVSQGHGVLVFCTSRAWTQRCAKSIARAFAACLGPWTEAVKVQEGRNELLARLRLTSVGLAKELEESVKHGVAFHHAGLTMEERTLLEGGFKTGVLSTLVATSTLAAGVNLPARRVIVRSRIGFDGQEMSVAQFQQMCGRAGRFGIDETGEAILMTREADVQAARAFAARCLPPMMSALHVGEGGGVERALLEVVMGRLAGCKEEEGGEGWLETFARCTLYAVQAHEEEGKEENGEGLGVEGQACKVLQRFRRGLDYLVQNRYI-GRRSIAAVKEVSGSTVRDGETSGGSIVGPDESHLDPERGGGQGILQGANAAVDGTRPVPAATREHAGGAGP-------SLPTAR-PKTCYFGTQLGAATFFAGMSPRDATAMLASLSQATTKGVILSSDLHLLFLCVPPNRRYFTPDWHDLARRSQRWAPEVASVAAAIGVSESAIERVVRPRSTGGRRNGGLGERGDGLGNDKSVGGQEVSLLRRFVYSLVLHDMLQEVPLLQLETGGVGRGTLQALQSEARIFCGMIVVFCKHLQWVTLARLIEGLSLRLEHRVGEDLLPLCRMGPELVHAARARALFEAGFQAPEDVAAARVEDVAAVLLTSMPFAAEHDSAKPRQGPKDGGRQGGGDKRVARRIADLIVRKAQEVARQKV 1526          
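
The percentages on the statistics line of each HSP follow directly from the reported counts. The sketch below recomputes them, assuming the "Identity = a/b (p%)" layout used in this record; the function name `parse_hsp_stats` is hypothetical.

```python
import re

# Assumption: the statistics line has the layout used in this record.
# The pattern accepts both "Positives" and the "Postives" spelling that
# some BLAST report renderings emit.
HSP_STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \([\d.]+%\), Posi?tives = (\d+)/(\d+) \([\d.]+%\)"
)

def parse_hsp_stats(line):
    """Recompute identity and positive percentages from the raw counts."""
    match = HSP_STATS_RE.search(line)
    if match is None:
        raise ValueError(f"not an HSP statistics line: {line!r}")
    identical, aligned, positives, aligned_pos = (int(g) for g in match.groups())
    if aligned != aligned_pos:
        raise ValueError("identity and positives report different alignment lengths")
    return {
        "aligned_length": aligned,
        "identity_pct": round(100 * identical / aligned, 2),
        "positives_pct": round(100 * positives / aligned, 2),
    }

stats = parse_hsp_stats(
    "Identity = 576/1104 (52.17%), Positives = 700/1104 (63.41%), Query Frame = 0"
)
print(stats)
```

For the first hit this gives 576/1104 = 52.17% identity and 700/1104 = 63.41% positives over 1104 aligned positions, in agreement with the values printed in the report.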
BLAST of NO21G01170 vs. NCBI_GenBank
Match: XP_015196626.1 (PREDICTED: DNA polymerase theta isoform X1 [Lepisosteus oculatus])

HSP 1 Score: 412.9 bits (1060), Expect = 4.600e-111
Identity = 300/900 (33.33%), Positives = 440/900 (48.89%), Query Frame = 0
Query:  691 AWGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKLRMAA----SLSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEELXXXXXXXXXXXXXXXXSSWTRVRDLPP--YHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAI----SVGLKDDLLREK---------VRQGRKELLIRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSALQTGMGGGLE----RALLEVVMGKLAVGEEEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXR-----EALV-YLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLVRL-ERGEVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPFEAETK 1561
            +WGLP+ ++  Y+ +GV ++F WQ ECL  GQ   + EG+NLVYSAPTS GKTLV+ELL+++ +        A+F++PF+++A+EK  Y + I+    + V+ +      +    ++ VAVCTIE+AN L+NRL+E   +  L +VVVDELHM+GD  RG+L+E++L K+R       S   S +       +QIVG+SATLPNL  +A WLNA LY T +RPV L   +  GK++ +                  S   VR+  P      D   ++ LC ET+  G  VL+FC ++ W E+ A  + R         LK     +K           +G  ++L +L  +P GL + L+  V+ GVAFHHAG+T EER +IE  F+ G + V+VATSTL++GVNLPARRVI+RT   FNG  +    + QM GRAGR G+D  GE+IL+  E E +   A     L P+RS L    G G+     RA+LE+++G +A      +                                      +     EA V +L+E+ FI+ +                                                                      E+G +EQ   K  P      T LGAAT  SS+SP +A L + A  Q A  G +L +DLH+LY   P    + T DW       +     +  VA  VG+ E  L R ++GG+         + + +  +  + +RF   L+L +++ EVPL  + ++   +RG LQ+L+  A  + GM+ VFC  L W  L  L+     RL +G+  EL  L RI   L++A RARAL++AGF    ELA AS  DV   L K++PF++  K
Sbjct:  360 SWGLPKPVLEKYQRLGVVQMFEWQAECLTLGQ---VLEGKNLVYSAPTSAGKTLVSELLILKRVLETRRK--ALFILPFVSVAKEKMYYLQNIFQEAGVRVEGYMGSTSAAGGFSSLDVAVCTIEKANGLINRLIEENRMDLLGIVVVDELHMLGDSGRGYLLELLLTKIRYVTQKTLSRESSKSTPSFREEVQIVGMSATLPNLDLLAKWLNADLYHTDYRPVPLMEWVKIGKNVYD-----------------GSLALVREFKPALQIKGDDDHIVSLCFETIQSGHSVLLFCPSKNWCEKLADSIGREFYNFHQRALKSAEGGDKNASVPPLSLDEEGLLDVLAQLKRSPAGLDSVLKRTVQLGVAFHHAGLTFEERDIIEGAFRQGYIRVLVATSTLSSGVNLPARRVIIRT-PVFNGRPLDMLTYKQMAGRAGRKGVDSMGESILVCKESERTKGTALLQGSLKPIRSCLIKKEGEGVTTSMIRAILEIIVGGVASTPGHVKMYASCTLLAASLSKDVTELGCSEGEMNMERTARKRKSSKQTSPIEACVDWLIENEFIQIQ----------------------------------------------------------------------EEGDKEQKIEKYCP------THLGAATLSSSLSPPEA-LGIFADLQRAMKGFVLENDLHILYQITPVCADWATIDWYQFFCLWEHLPTSMKRVAEMVGIEEGFLAR-SVGGKII------AKTERQHRQMAIHKRFFTSLVLLDLISEVPLETVAKKYSCSRGQLQSLQQSASTYAGMVTVFCNRLGWHNLELLLSQFQSRLSFGIQRELCDLVRI--SLLNAQRARALYNAGFITVSELARASAADVEIALRKAVPFKSSRK 1150          
BLAST of NO21G01170 vs. NCBI_GenBank
Match: XP_015196627.1 (PREDICTED: DNA polymerase theta isoform X2 [Lepisosteus oculatus])

HSP 1 Score: 412.9 bits (1060), Expect = 4.600e-111
Identity = 300/900 (33.33%), Positives = 440/900 (48.89%), Query Frame = 0
Query:  691 AWGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKLRMAA----SLSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEELXXXXXXXXXXXXXXXXSSWTRVRDLPP--YHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAI----SVGLKDDLLREK---------VRQGRKELLIRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSALQTGMGGGLE----RALLEVVMGKLAVGEEEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXR-----EALV-YLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLVRL-ERGEVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPFEAETK 1561
            +WGLP+ ++  Y+ +GV ++F WQ ECL  GQ   + EG+NLVYSAPTS GKTLV+ELL+++ +        A+F++PF+++A+EK  Y + I+    + V+ +      +    ++ VAVCTIE+AN L+NRL+E   +  L +VVVDELHM+GD  RG+L+E++L K+R       S   S +       +QIVG+SATLPNL  +A WLNA LY T +RPV L   +  GK++ +                  S   VR+  P      D   ++ LC ET+  G  VL+FC ++ W E+ A  + R         LK     +K           +G  ++L +L  +P GL + L+  V+ GVAFHHAG+T EER +IE  F+ G + V+VATSTL++GVNLPARRVI+RT   FNG  +    + QM GRAGR G+D  GE+IL+  E E +   A     L P+RS L    G G+     RA+LE+++G +A      +                                      +     EA V +L+E+ FI+ +                                                                      E+G +EQ   K  P      T LGAAT  SS+SP +A L + A  Q A  G +L +DLH+LY   P    + T DW       +     +  VA  VG+ E  L R ++GG+         + + +  +  + +RF   L+L +++ EVPL  + ++   +RG LQ+L+  A  + GM+ VFC  L W  L  L+     RL +G+  EL  L RI   L++A RARAL++AGF    ELA AS  DV   L K++PF++  K
Sbjct:  358 SWGLPKPVLEKYQRLGVVQMFEWQAECLTLGQ---VLEGKNLVYSAPTSAGKTLVSELLILKRVLETRRK--ALFILPFVSVAKEKMYYLQNIFQEAGVRVEGYMGSTSAAGGFSSLDVAVCTIEKANGLINRLIEENRMDLLGIVVVDELHMLGDSGRGYLLELLLTKIRYVTQKTLSRESSKSTPSFREEVQIVGMSATLPNLDLLAKWLNADLYHTDYRPVPLMEWVKIGKNVYD-----------------GSLALVREFKPALQIKGDDDHIVSLCFETIQSGHSVLLFCPSKNWCEKLADSIGREFYNFHQRALKSAEGGDKNASVPPLSLDEEGLLDVLAQLKRSPAGLDSVLKRTVQLGVAFHHAGLTFEERDIIEGAFRQGYIRVLVATSTLSSGVNLPARRVIIRT-PVFNGRPLDMLTYKQMAGRAGRKGVDSMGESILVCKESERTKGTALLQGSLKPIRSCLIKKEGEGVTTSMIRAILEIIVGGVASTPGHVKMYASCTLLAASLSKDVTELGCSEGEMNMERTARKRKSSKQTSPIEACVDWLIENEFIQIQ----------------------------------------------------------------------EEGDKEQKIEKYCP------THLGAATLSSSLSPPEA-LGIFADLQRAMKGFVLENDLHILYQITPVCADWATIDWYQFFCLWEHLPTSMKRVAEMVGIEEGFLAR-SVGGKII------AKTERQHRQMAIHKRFFTSLVLLDLISEVPLETVAKKYSCSRGQLQSLQQSASTYAGMVTVFCNRLGWHNLELLLSQFQSRLSFGIQRELCDLVRI--SLLNAQRARALYNAGFITVSELARASAADVEIALRKAVPFKSSRK 1148          
BLAST of NO21G01170 vs. NCBI_GenBank
Match: KFQ33857.1 (DNA polymerase theta, partial [Mesitornis unicolor])

HSP 1 Score: 411.4 bits (1056), Expect = 1.300e-110
Identity = 304/938 (32.41%), Positives = 454/938 (48.40%), Query Frame = 0
Query:  692 WGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKLR-----MAASLSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEELXXXXXXXXXXXXXXXXSSWTRVRDLPP--YHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAI------SVGLKDDLLREKV--RQGRKELLIRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSAL----QTGMGGGLERALLEVVMGKLAVGEEEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREALVYLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLVRLERG-EVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPF--------EAETKGEXXXXXXXXXXXXXXDRVAERIAGLIVRKARGVVRKQLAL 1602
            WGLP+ ++  Y  +GV ++F WQ ECL  GQ   + EG+NLVYSAPTS GKTLVAELL+++ +    +   A+ ++PF+++A+EK RY + ++  +++ V+ +      +     + VAVCTIE+AN L+NRL+E  ++  L VVVVDELHM+GD  RG+L+E++L K+R     +A   +       GG  IQIVG+SATLPNL  +ASWL+A LY T FRPV L   +  G ++ +                 SS   VR+  P      D   V+ LC ETV  G  VL+FC ++ W E+ A  +AR            KD  L   V  R+G  E+L +L  +  GL + L+  +  GVAFHHAG+T +ER +IE GF+ G++ V+ ATSTL++GVNLPARRVI+RT   F G  +    + QM GRAGR G+D  GE+IL+    E S   A     L P+ S L      G+   ++RA+LE+++G +A   ++ +                                         + +L+E+ FI+   +                                                             G+G           + AK         T LG+AT  SS+SP +A  +   L +A +   +L +DLH++YL  P    + T DW       ++    +  VA  VG+ E  L R   G       +   + ++++ +  + +RF   L L +++ EVPL+ + R    +RG LQ+L+  A  + GM+ VFC  L W  +  L+     RL +GV  EL  L R+   L++A RAR L++AGF    +LA AS +DVA  L  S+PF        E E   E                     A L+V +ARG++++ LAL
Sbjct:   14 WGLPKAVLEKYHSLGVVQMFEWQAECLMLGQ---VLEGKNLVYSAPTSAGKTLVAELLILKRVLETRKK--ALLILPFVSVAKEKKRYLQALFQEVDVRVEGYMGSMSPAGRFSALDVAVCTIEKANGLINRLIEENKMDLLGVVVVDELHMLGDSHRGYLLELLLTKVRYVTEKVAKRQAKKAHPGFGG--IQIVGMSATLPNLGLLASWLDAELYCTDFRPVPLKEWVKIGSNIYD-----------------SSMNLVREFRPKLQLKGDEDHVVSLCYETVCDGHSVLLFCPSKNWCEKLADIIAREFYSLQQAESSAKDSALAPVVVDREGIDEVLDQLKRSISGLDSVLQRTLPWGVAFHHAGLTFDERDIIEGGFRQGLIRVLAATSTLSSGVNLPARRVIIRTPM-FGGKLLDILTYKQMAGRAGRKGVDTEGESILVCKPSERSKGIALLQGSLKPVCSCLLRREGEGVASSMKRAILEIIVGGVANSPDDVQ-------TYASCTLLASSLKESKWGNEKAQDQAQTGPIEACVAWLLENEFIQVLDA-------------------------------------------------------------GNG-----------VKAK-----VYHPTHLGSATLSSSLSPTEAMEIFADLQRAMK-SFVLENDLHIVYLVTPVYEEWTTIDWYQFFCLWEKLPASMKRVAELVGIEECFLARSVKG-------KIIAKTEKQQRQMAIHKRFFTSLALLDLISEVPLMDMTRKYGCSRGQLQSLQQSAATYAGMVTVFCNRLGWHNMELLLSQFQSRLTFGVHRELCDLVRV--SLLNAQRARTLYNAGFVTVADLAKASPDDVAAALKNSVPFKSVRRAVDEDEESAEERRTVRSIWMAGMKGLTEREAASLVVDEARGLLQQDLAL 832          
BLAST of NO21G01170 vs. NCBI_GenBank
Match: XP_010140554.1 (PREDICTED: DNA polymerase theta, partial [Buceros rhinoceros silvestris])

HSP 1 Score: 411.4 bits (1056), Expect = 1.300e-110
Identity = 306/943 (32.45%), Positives = 450/943 (47.72%), Query Frame = 0
Query:  691 AWGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKLRMAAS---------LSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEELXXXXXXXXXXXXXXXXSSWTRVRDLPP--YHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAI------SVGLKDDLLREKV--RQGRKELLIRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSAL----QTGMGGGLERALLEVVMGKLAVGEEEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREALVYLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLVRL-ERGEVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPF--------EAETKGEXXXXXXXXXXXXXXDRVAERIAGLIVRKARGVVRKQLAL 1602
            +WGLP+ ++  Y  +GV ++F WQ ECL  GQ   + EG+NLVYSAPTS GKTLVAELL+++ +    +   A+ ++PF+++A+EK  Y + ++  +++ V+ +      +     + VAVCTIE+AN+L+NRL+E  ++  L VVVVDELHM+GD  RG+L+E++L K+R              SPS  G      IQIVG+SATLPNL  +ASWL+A LY T FRPV L   +  G ++ +                 SS   VR+  P      D   V+ LC ETV  G  VL+FC ++ W E+ A  VAR            K+  L   V  R+G  E+L +L  +  GL + L+  +  GVAFHHAG+T +ER +IE  F+ G++ V+ ATSTL++GVNLPARRVI+RT   F G  +    + QM GRAGR G+D  GE+IL+    E S   A     L P+ S L      G+   ++RA+LE+++G +A   ++ +                                         + +L+++ FI+   S  G                                                                        + AK     A   T LG+AT  SS+SP +A L + A  Q A    +L +DLH++YL  P    + T DW       ++    +  VA  VG+ E  L R   GG          + +++  +  + +RF   L L +++ EVPL  + ++   +RG LQ+L+  A  + GM+ VFC  L W  +  L+     RL +GV  EL  L R+   L++A RAR L++AGF    +LA AS  DVA  L  S+PF        E E   E                     A LIV +ARG++++ LAL
Sbjct:   19 SWGLPKAVLEKYHSLGVVQMFEWQAECLMLGQ---VLEGKNLVYSAPTSAGKTLVAELLILKRVLETRKK--ALLILPFVSVAKEKKCYLQALFQEVDVRVEGYMGSMSPAGRFSALDVAVCTIEKANSLINRLIEENKMDSLGVVVVDELHMLGDSHRGYLLELLLTKVRYVTEKVAKRQVKMTSPSFGG------IQIVGMSATLPNLGLLASWLDAELYCTDFRPVPLKEQVKIGSNIYD-----------------SSMNLVREFQPKLQPKGDEDHVVSLCYETVCDGHSVLLFCPSKNWCEKLADIVAREFYSLQQAESSAKNSALAPVVVDREGIDEVLDQLKRSVSGLDSVLQRTLPWGVAFHHAGLTFDERDIIEAAFRQGLIRVLAATSTLSSGVNLPARRVIIRT-PVFGGKLLDILTYKQMAGRAGRKGVDTEGESILVCKPSERSKGTALLQGSLKPICSCLLRREGEGVASSMKRAILEIIVGGVANTPDDVQ-------TYASCTLLACSLKESKQGKEKAQDKVQTGPIEACVAWLLKNEFIQVLDSDNG------------------------------------------------------------------------VKAK-----AYHPTHLGSATLSSSLSPTEA-LEIFADLQRAMKSFVLENDLHIVYLVTPVYEEWTTIDWYQFFCLWEKLPASMKRVAELVGIEEGFLARSVKGGII-------AKTEKQHRQMAIHKRFFTSLALLDLISEVPLNNMTKKYGCSRGQLQSLQQSAATYAGMVTVFCNRLGWHNMELLLSQFQSRLTFGVHRELCDLVRV--SLLNAQRARTLYNAGFVTVADLAKASPGDVATALKNSVPFKSVRRAVDEDEESAEERRTVRSIWLAGMKGLTEREAASLIVEEARGLLQQDLAL 838          
BLAST of NO21G01170 vs. NCBI_GenBank
Match: XP_010180926.1 (PREDICTED: DNA polymerase theta, partial [Mesitornis unicolor])

HSP 1 Score: 411.4 bits (1056), Expect = 1.300e-110
Identity = 304/938 (32.41%), Positives = 454/938 (48.40%), Query Frame = 0
Query:  692 WGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKLR-----MAASLSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEELXXXXXXXXXXXXXXXXSSWTRVRDLPP--YHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAI------SVGLKDDLLREKV--RQGRKELLIRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSAL----QTGMGGGLERALLEVVMGKLAVGEEEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREALVYLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLVRLERG-EVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPF--------EAETKGEXXXXXXXXXXXXXXDRVAERIAGLIVRKARGVVRKQLAL 1602
            WGLP+ ++  Y  +GV ++F WQ ECL  GQ   + EG+NLVYSAPTS GKTLVAELL+++ +    +   A+ ++PF+++A+EK RY + ++  +++ V+ +      +     + VAVCTIE+AN L+NRL+E  ++  L VVVVDELHM+GD  RG+L+E++L K+R     +A   +       GG  IQIVG+SATLPNL  +ASWL+A LY T FRPV L   +  G ++ +                 SS   VR+  P      D   V+ LC ETV  G  VL+FC ++ W E+ A  +AR            KD  L   V  R+G  E+L +L  +  GL + L+  +  GVAFHHAG+T +ER +IE GF+ G++ V+ ATSTL++GVNLPARRVI+RT   F G  +    + QM GRAGR G+D  GE+IL+    E S   A     L P+ S L      G+   ++RA+LE+++G +A   ++ +                                         + +L+E+ FI+   +                                                             G+G           + AK         T LG+AT  SS+SP +A  +   L +A +   +L +DLH++YL  P    + T DW       ++    +  VA  VG+ E  L R   G       +   + ++++ +  + +RF   L L +++ EVPL+ + R    +RG LQ+L+  A  + GM+ VFC  L W  +  L+     RL +GV  EL  L R+   L++A RAR L++AGF    +LA AS +DVA  L  S+PF        E E   E                     A L+V +ARG++++ LAL
Sbjct:   17 WGLPKAVLEKYHSLGVVQMFEWQAECLMLGQ---VLEGKNLVYSAPTSAGKTLVAELLILKRVLETRKK--ALLILPFVSVAKEKKRYLQALFQEVDVRVEGYMGSMSPAGRFSALDVAVCTIEKANGLINRLIEENKMDLLGVVVVDELHMLGDSHRGYLLELLLTKVRYVTEKVAKRQAKKAHPGFGG--IQIVGMSATLPNLGLLASWLDAELYCTDFRPVPLKEWVKIGSNIYD-----------------SSMNLVREFRPKLQLKGDEDHVVSLCYETVCDGHSVLLFCPSKNWCEKLADIIAREFYSLQQAESSAKDSALAPVVVDREGIDEVLDQLKRSISGLDSVLQRTLPWGVAFHHAGLTFDERDIIEGGFRQGLIRVLAATSTLSSGVNLPARRVIIRTPM-FGGKLLDILTYKQMAGRAGRKGVDTEGESILVCKPSERSKGIALLQGSLKPVCSCLLRREGEGVASSMKRAILEIIVGGVANSPDDVQ-------TYASCTLLASSLKESKWGNEKAQDQAQTGPIEACVAWLLENEFIQVLDA-------------------------------------------------------------GNG-----------VKAK-----VYHPTHLGSATLSSSLSPTEAMEIFADLQRAMK-SFVLENDLHIVYLVTPVYEEWTTIDWYQFFCLWEKLPASMKRVAELVGIEECFLARSVKG-------KIIAKTEKQQRQMAIHKRFFTSLALLDLISEVPLMDMTRKYGCSRGQLQSLQQSAATYAGMVTVFCNRLGWHNMELLLSQFQSRLTFGVHRELCDLVRV--SLLNAQRARTLYNAGFVTVADLAKASPDDVAAALKNSVPFKSVRRAVDEDEESAEERRTVRSIWMAGMKGLTEREAASLVVDEARGLLQQDLAL 835          
BLAST of NO21G01170 vs. NCBI_GenBank
Match: EAW79510.1 (polymerase (DNA directed), theta, isoform CRA_a [Homo sapiens])

HSP 1 Score: 410.2 bits (1053), Expect = 3.000e-110
Identity = 294/889 (33.07%), Positives = 431/889 (48.48%), Query Frame = 0
Query:  692 WGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKL----RMAASLSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEELXXXXXXXXXXXXXXXXSSWTRVRDLPP--YHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAI------SVGLKDDLLREKVRQGRKELL---IRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSALQTGMG----GGLERALLEVVMGKLAVGEEEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREALVYLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLVRL-ERGEVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPFEAETK 1561
            WGLP+ ++  Y   GV K+F WQ ECL  GQ   + EG+NLVYSAPTS GKTLVAELL+++ +    +   A+F++PF+++A+EK  Y + ++  + + V  +      S    ++ +AVCTIERAN L+NRL+E  ++  L +VVVDELHM+GD  RG+L+E++L K+    R +AS    LA ++    +QIVG+SATLPNL  +ASWLNA LY T FRPV L   +  G  + +                 SS   VR+  P      D   V+ LC ET+     VL+FC ++ W E+ A  +AR        + GL        V   +KELL    +L   P GL + L+  V  GVAFHHAG+T EER +IE  F+ G++ V+ ATSTL++GVNLPARRVI+RT   F G  +    + QM GRAGR G+D  GE+IL+    E S   A     L P+RS LQ   G    G + RA+LE+++G +A   ++                                           +++L+E+ FI+   +  G E                                                                                   T LG+AT  SS+SP D TL + A  Q A  G +L +DLH+LYL  P    + T DW       ++    +  VA  VGV E  L R   G       +   R + +  +  + +RF   L+L +++ EVPL  + ++   NRG +Q+L+  A ++ GM+ VF   L W  +  L+     RL +G+  EL  L R+   L++A RAR L+ +GF    +LA A++ +V  +L  ++PF++  K
Sbjct:  209 WGLPKAVLEKYHSFGVKKMFEWQAECLLLGQ---VLEGKNLVYSAPTSAGKTLVAELLILKRVLEMRKK--ALFILPFVSVAKEKKYYLQSLFQEVGIKVDGYMGSTSPSRHFSSLDIAVCTIERANGLINRLIEENKMDLLGMVVVDELHMLGDSHRGYLLELLLTKICYITRKSASCQADLASSLSNA-VQIVGMSATLPNLELVASWLNAELYHTDFRPVPLLESVKVGNSIYD-----------------SSMKLVREFEPMLQVKGDEDHVVSLCYETICDNHSVLLFCPSKKWCEKLADIIAREFYNLHHQAEGLVKPSECPPVILEQKELLEVMDQLRRLPSGLDSVLQKTVPWGVAFHHAGLTFEERDIIEGAFRQGLIRVLAATSTLSSGVNLPARRVIIRT-PIFGGRPLDILTYKQMVGRAGRKGVDTVGESILICKNSEKSKGIALLQGSLKPVRSCLQRREGEEVTGSMIRAILEIIVGGVASTSQD-------MHTYAACTFLAASMKEGKQGIQRNQESVQLGAIEACVMWLLENEFIQSTEASDGTEGK-----------------------------------------------------------------------------VYHPTHLGSATLSSSLSPAD-TLDIFADLQRAMKGFVLENDLHILYLVTPMFEDWTTIDWYRFFCLWEKLPTSMKRVAELVGVEEGFLARCVKG-------KVVARTERQHRQMAIHKRFFTSLVLLDLISEVPLREINQKYGCNRGQIQSLQQSAAVYAGMITVFSNRLGWHNMELLLSQFQKRLTFGIQRELCDLVRV--SLLNAQRARVLYASGFHTVADLARANIVEVEVILKNAVPFKSARK 979          
BLAST of NO21G01170 vs. NCBI_GenBank
Match: XP_010205958.1 (PREDICTED: DNA polymerase theta, partial [Colius striatus])

HSP 1 Score: 410.2 bits (1053), Expect = 3.000e-110
Identity = 299/943 (31.71%), Positives = 451/943 (47.83%), Query Frame = 0
Query:  691 AWGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKL---------RMAASLSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEELXXXXXXXXXXXXXXXXSSWTRVRDLPP--YHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAISVGLKDDLLREKV--------RQGRKELLIRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSAL----QTGMGGGLERALLEVVMGKLAVGEEEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREALVYLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLVRL-ERGEVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPF--------EAETKGEXXXXXXXXXXXXXXDRVAERIAGLIVRKARGVVRKQLAL 1602
            +WGLP+ ++  Y  +GV ++F WQ ECL  GQ   + EG+NLVYSAPTS GKTLVAELL+++ +    +   A+F++PF+++A+EK  Y + ++  +++ V+ +      +     + VAVCTIE+AN+L+NRL+E   +  L VVVVDELHM+GD  RG+L+E++L K+         R     +PS  G      IQI+G+SATLPNL  +ASWL+A LY T FRPV L   +  G ++ +                 SS   VR+  P      D   V+ LC ETV  G  VL+FC ++ W E+ A  +AR      +D+   +K         R+G  E+L +L  +  GL + L+  +  GVAFHHAG+T +ER +IE  F+ G++ V+ ATSTL++GVNLPARRVI+RT   F G  +    + QM GRAGR G+D  GE+IL+    E S   A     L P+RS L      G+   ++RA+LE+++G +A   ++ +                                         + +L+E+ FI+   S K                                                                         + AK         T LG+A+  SS+SP +A  +   L +A +   +L +DLH++YL  P    +   DW       ++    +  VA  VG+ E  L R   G       +   + +++  +  + +RF   L L +++ EVPL  + E+   +RG LQ+L+  A  + GM+ VFC  L W  +  L+     RL +GV  EL  L R+   +++A RAR L++AGF    +LA AS +DVA  L  S+PF        E E   E                     A LIV +ARG++++ LAL
Sbjct:   21 SWGLPKAVLEKYHSLGVVQMFEWQAECLMLGQ---VLEGKNLVYSAPTSAGKTLVAELLILKRVLETRKK--ALFILPFVSVAKEKKCYLQALFQEVDVRVEGYMGSTSPAGRFSALDVAVCTIEKANSLINRLIEENNMDSLGVVVVDELHMLGDSHRGYLLELLLTKVRYVTEKVTKRQVKKTNPSFGG------IQIIGMSATLPNLGLLASWLDAELYCTNFRPVPLKEWVKIGSNIYD-----------------SSMNLVREFEPKLQLKGDEDHVVSLCYETVCEGHSVLLFCPSKNWCEKLADIIAREFYSLQQDESSAKKSALAPVVVDREGINEVLDQLKRSISGLDSVLQRTLPWGVAFHHAGLTFDERDIIEGAFRQGLIRVLAATSTLSSGVNLPARRVIIRTPM-FGGKLLDVLTYKQMAGRAGRKGVDTEGESILVCKPSERSKGTALLQGYLKPVRSCLLRREGEGVSSSMKRAILEIIVGGVANTPDDVQ-------TYASCTLLASSLKESQWGNEKAKDKVQTGPIEACVAWLLENEFIQVLDSGK------------------------------------------------------------------------DVKAK-----IYHPTHLGSASLSSSLSPTEAMEIFADLQRAMK-SFVLENDLHIVYLVTPVYEDWTIIDWYQFFCLWEKLPASMKRVAELVGIEEGFLARSIKG-------KITAKTEKQHRQMAIHKRFFTSLALLDLISEVPLKDMTEKYGCSRGQLQSLQQSAATYAGMVTVFCNRLGWHNMELLLSQFQSRLTFGVHRELCDLVRV--SVLNAQRARTLYNAGFVTVADLAKASPDDVATALKNSVPFKSVRRAVDEDEESAEERRTVCSIWMAGMKGLTEREAASLIVEEARGLLQEDLAL 840          
BLAST of NO21G01170 vs. NCBI_GenBank
Match: 5AGA_A (Chain A, Crystal Structure Of The Helicase Domain Of Human Dna Polymerase Theta In Complex With Amppnp >5A9F_A Chain A, Crystal Structure Of The Helicase Domain Of Human Dna Polymerase Theta In Complex With Adp)

HSP 1 Score: 410.2 bits (1053), Expect = 3.000e-110
Identity = 294/889 (33.07%), Positives = 431/889 (48.48%), Query Frame = 0
Query:  692 WGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKL----RMAASLSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEELXXXXXXXXXXXXXXXXSSWTRVRDLPP--YHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAI------SVGLKDDLLREKVRQGRKELL---IRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSALQTGMG----GGLERALLEVVMGKLAVGEEEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREALVYLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLVRL-ERGEVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPFEAETK 1561
            WGLP+ ++  Y   GV K+F WQ ECL  GQ   + EG+NLVYSAPTS GKTLVAELL+++ +    +   A+F++PF+++A+EK  Y + ++  + + V  +      S    ++ +AVCTIERAN L+NRL+E  ++  L +VVVDELHM+GD  RG+L+E++L K+    R +AS    LA ++    +QIVG+SATLPNL  +ASWLNA LY T FRPV L   +  G  + +                 SS   VR+  P      D   V+ LC ET+     VL+FC ++ W E+ A  +AR        + GL        V   +KELL    +L   P GL + L+  V  GVAFHHAG+T EER +IE  F+ G++ V+ ATSTL++GVNLPARRVI+RT   F G  +    + QM GRAGR G+D  GE+IL+    E S   A     L P+RS LQ   G    G + RA+LE+++G +A   ++                                           +++L+E+ FI+   +  G E                                                                                   T LG+AT  SS+SP D TL + A  Q A  G +L +DLH+LYL  P    + T DW       ++    +  VA  VGV E  L R   G       +   R + +  +  + +RF   L+L +++ EVPL  + ++   NRG +Q+L+  A ++ GM+ VF   L W  +  L+     RL +G+  EL  L R+   L++A RAR L+ +GF    +LA A++ +V  +L  ++PF++  K
Sbjct:   10 WGLPKAVLEKYHSFGVKKMFEWQAECLLLGQ---VLEGKNLVYSAPTSAGKTLVAELLILKRVLEMRKK--ALFILPFVSVAKEKKYYLQSLFQEVGIKVDGYMGSTSPSRHFSSLDIAVCTIERANGLINRLIEENKMDLLGMVVVDELHMLGDSHRGYLLELLLTKICYITRKSASCQADLASSLSNA-VQIVGMSATLPNLELVASWLNAELYHTDFRPVPLLESVKVGNSIYD-----------------SSMKLVREFEPMLQVKGDEDHVVSLCYETICDNHSVLLFCPSKKWCEKLADIIAREFYNLHHQAEGLVKPSECPPVILEQKELLEVMDQLRRLPSGLDSVLQKTVPWGVAFHHAGLTFEERDIIEGAFRQGLIRVLAATSTLSSGVNLPARRVIIRT-PIFGGRPLDILTYKQMVGRAGRKGVDTVGESILICKNSEKSKGIALLQGSLKPVRSCLQRREGEEVTGSMIRAILEIIVGGVASTSQD-------MHTYAACTFLAASMKEGKQGIQRNQESVQLGAIEACVMWLLENEFIQSTEASDGTEGK-----------------------------------------------------------------------------VYHPTHLGSATLSSSLSPAD-TLDIFADLQRAMKGFVLENDLHILYLVTPMFEDWTTIDWYRFFCLWEKLPTSMKRVAELVGVEEGFLARCVKG-------KVVARTERQHRQMAIHKRFFTSLVLLDLISEVPLREINQKYGCNRGQIQSLQQSAAVYAGMITVFSNRLGWHNMELLLSQFQKRLTFGIQRELCDLVRV--SLLNAQRARVLYASGFHTVADLARANIVEVEVILKNAVPFKSARK 780          
BLAST of NO21G01170 vs. NCBI_GenBank
Match: XP_011510645.1 (DNA polymerase theta isoform X2 [Homo sapiens])

HSP 1 Score: 410.2 bits (1053), Expect = 3.000e-110
Identity = 294/889 (33.07%), Positives = 431/889 (48.48%), Query Frame = 0
Query:  692 WGLPEVLVRSYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLMVRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVDSSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRGFLIEVMLAKL----RMAASLSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNAHLYITKFRPVQLSLLLCTGKHLEELXXXXXXXXXXXXXXXXSSWTRVRDLPP--YHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAI------SVGLKDDLLREKVRQGRKELL---IRLNLTPVGLSADLESLVREGVAFHHAGVTMEERTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFEFHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSALQTGMG----GGLERALLEVVMGKLAVGEEEEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXREALVYLVEHRFIEGRSSLKGEEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXDVSVVSLAGPPAAKPGPGSGATPVEQGQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGLILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTERTLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLVRL-ERGEVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLEYGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLVKSMPFEAETK 1561
            WGLP+ ++  Y   GV K+F WQ ECL  GQ   + EG+NLVYSAPTS GKTLVAELL+++ +    +   A+F++PF+++A+EK  Y + ++  + + V  +      S    ++ +AVCTIERAN L+NRL+E  ++  L +VVVDELHM+GD  RG+L+E++L K+    R +AS    LA ++    +QIVG+SATLPNL  +ASWLNA LY T FRPV L   +  G  + +                 SS   VR+  P      D   V+ LC ET+     VL+FC ++ W E+ A  +AR        + GL        V   +KELL    +L   P GL + L+  V  GVAFHHAG+T EER +IE  F+ G++ V+ ATSTL++GVNLPARRVI+RT   F G  +    + QM GRAGR G+D  GE+IL+    E S   A     L P+RS LQ   G    G + RA+LE+++G +A   ++                                           +++L+E+ FI+   +  G E                                                                                   T LG+AT  SS+SP D TL + A  Q A  G +L +DLH+LYL  P    + T DW       ++    +  VA  VGV E  L R   G       +   R + +  +  + +RF   L+L +++ EVPL  + ++   NRG +Q+L+  A ++ GM+ VF   L W  +  L+     RL +G+  EL  L R+   L++A RAR L+ +GF    +LA A++ +V  +L  ++PF++  K
Sbjct:   74 WGLPKAVLEKYHSFGVKKMFEWQAECLLLGQ---VLEGKNLVYSAPTSAGKTLVAELLILKRVLEMRKK--ALFILPFVSVAKEKKYYLQSLFQEVGIKVDGYMGSTSPSRHFSSLDIAVCTIERANGLINRLIEENKMDLLGMVVVDELHMLGDSHRGYLLELLLTKICYITRKSASCQADLASSLSNA-VQIVGMSATLPNLELVASWLNAELYHTDFRPVPLLESVKVGNSIYD-----------------SSMKLVREFEPMLQVKGDEDHVVSLCYETICDNHSVLLFCPSKKWCEKLADIIAREFYNLHHQAEGLVKPSECPPVILEQKELLEVMDQLRRLPSGLDSVLQKTVPWGVAFHHAGLTFEERDIIEGAFRQGLIRVLAATSTLSSGVNLPARRVIIRT-PIFGGRPLDILTYKQMVGRAGRKGVDTVGESILICKNSEKSKGIALLQGSLKPVRSCLQRREGEEVTGSMIRAILEIIVGGVASTSQD-------MHTYAACTFLAASMKEGKQGIQRNQESVQLGAIEACVMWLLENEFIQSTEASDGTEGK-----------------------------------------------------------------------------VYHPTHLGSATLSSSLSPAD-TLDIFADLQRAMKGFVLENDLHILYLVTPMFEDWTTIDWYRFFCLWEKLPTSMKRVAELVGVEEGFLARCVKG-------KVVARTERQHRQMAIHKRFFTSLVLLDLISEVPLREINQKYGCNRGQIQSLQQSAAVYAGMITVFSNRLGWHNMELLLSQFQKRLTFGIQRELCDLVRV--SLLNAQRARVLYASGFHTVADLARANIVEVEVILKNAVPFKSARK 844          
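The percentages in the HSP header follow directly from the match counts over the alignment length (889 columns, gaps included). A minimal Python sketch of that arithmetic, using only the counts reported for this HSP; the helper name `percent` is illustrative and is not part of BLAST:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Return matches as a percentage of the alignment length, rounded to 2 places."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


# Counts taken from the HSP header for XP_011510645.1 on this page.
identity = percent(294, 889)   # identical residues / alignment columns
positives = percent(431, 889)  # identical or conservatively substituted residues

print(f"Identity = {identity}%, Positives = {positives}%")
```

Because the denominator is the full alignment length, gap columns lower both percentages.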
The following BLAST results are available for this feature:
BLAST of NO21G01170 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name      E-value     Identity  Description
EWM22669.1      9.400e-282  52.17     dna polymerase theta [Nannochloropsis gaditana]
XP_015196626.1  4.600e-111  33.33     PREDICTED: DNA polymerase theta isoform X1 [Lepiso...
XP_015196627.1  4.600e-111  33.33     PREDICTED: DNA polymerase theta isoform X2 [Lepiso...
KFQ33857.1      1.300e-110  32.41     DNA polymerase theta, partial [Mesitornis unicolor...
XP_010140554.1  1.300e-110  32.45     PREDICTED: DNA polymerase theta, partial [Buceros ...
XP_010180926.1  1.300e-110  32.41     PREDICTED: DNA polymerase theta, partial [Mesitorn...
EAW79510.1      3.000e-110  33.07     polymerase (DNA directed), theta, isoform CRA_a [H...
XP_010205958.1  3.000e-110  31.71     PREDICTED: DNA polymerase theta, partial [Colius s...
5AGA_A          3.000e-110  33.07     Chain A, Crystal Structure Of The Helicase Domain ...
XP_011510645.1  3.000e-110  33.07     DNA polymerase theta isoform X2 [Homo sapiens]

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name  Unique Name  Species                                       Type
nonsL026      nonsL026     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ncniR040      ncniR040     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ngnoR062      ngnoR062     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region


The following polypeptide feature(s) derive from this gene:

Feature Name  Unique Name           Species                                       Type
NO21G01170.1  NO21G01170.1-protein  Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name               Unique Name  Species                                          Type
jgi.p|Nanoce1779_2|559534  gene_8073    Nannochloropsis oceanica (N. oceanica CCMP1779)  gene


The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Species                                       Type
NO21G01170.1  NO21G01170.1  Nannochloropsis oceanica (N. oceanica IMET1)  mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO21G01170 ID=NO21G01170|Name=NO21G01170|organism=Nannochloropsis oceanica|type=gene|length=5177bp
ATGCAACCGCCGCCGCCGCCACCGTCACCCATAGATGCCATTCAGCCAGC
CGACCCCACTTCCCCTTCCGCTTCCGCCTCCTCCTGGACTCCGCCCATCC
TACCAAACGCCATGCGTGCGCCGTTGGGTCGCAATGGACAGTCTCCTCTC
CAGATGCCTTCTCCGCGGGGTACAAAACGGCTGTTGGCTTCGCCCGGAGC
AGCAGTGCAGGCCGACAGCTCTGCCAGTCTCGCGCTCGTCCCACCACTTA
AGTTGCCCCCGCCGCCTCCAGCTCTTGCCTCTGCCTTGTCATACAATGCC
TTGTTGAGTGATGACGACCGCGACTTTATCAATTGCGCAGAGTTTGTATC
CTTCTTTGCCGAGGATGTCGAGGAAGGGGAGCAATGGGAAGGGGAAAGGA
AGAAGGAATGCCGGCATGAGAGAGTATATGAGGAAGCCGCCGCCCCAACA
GCAGGTCCAATATCTGAAGCAGCAACATTTCAACTACCAGTCTCCGTAAC
AGCGCCTTCTCTAATTCCTTTGGAGCGAGCGGTCATGCTCCAACATCTCC
CCGCACATGGTCGCTACATCTCCCAACGTGGTCGCGAGGAGGGGACGGAG
AAGGAGGAAGCAGTGTCCCCAGGACCACCACCCGAAGCAGAGTCATCACC
TCTACCACGAGCTGCCCTCGAAGCATCGCCTCCTCGACCTCCTCCTTCTT
CCTCCACACCATCTTTTCGTCGATACATATCCCAGCGTGGCTGTGAGCTG
CTTGCTCAGAGAAAAGCGATGCAGCTGACACACTCACTAGGCCGAGGACC
AACCGCTCCTTCCCCCACCGGCAAAAAACGCATGCAGCACAGTCCTTGCC
AACAAACGCTTCCACGATCCTCTTCTTTCTCTCACGCATATTCATCAAGC
TCTTCACATCCTCCTCGTCCTCTCCTTCCTCCTGTCCCTGTGCAAGACTC
GCCGCAAGGCTTGAAATTGCATGCACCAGCATGCGCCTCGGCCCCCCCTC
CTCCCCTTCCTCCCTCTTGTCCCCATACTTCTTTCTCGTCAAACTTGCCT
CCTTTCCTCCCTCCTCCTTTACCCTTTTCACGGCACAAATCAACATCGGC
AGTGACCTCGGCAGTACTAGCAACAGCACAAACGGAAGCTGCGAAATCGA
CAATAGGAGCTGCAGCAGCAGCATTAGTCACCGCTCTACCAGCAGCGGAA
GAAGAAAAAAGGCAACCATCACAGCAAGAAGGACACGGTCATCTTCTTCC
TCCGCCTCCGCCTCCTCTTCATCGTCCCTATCGATCTTTCCGTGTTCAGC
TGCTGCAGCAGCAGCAGCACACACAACGACATGAAGCCCCCTCCGAGAAA
CATCAATTGCAGCCCTCGCCGCCGCAGCCTGAGATTTCCTATCCTTTCTC
TCATTCGCACGTGTCGCCCTATCGACAGCAGCAGCAGCAGCGACATCAGC
AACAGCTGAAGGACGAAGAGGAGGTTTCAAACTTTTCTCCTTCACTCGCA
CCACGTTTCGCACCATCGCCCTGCCACGGAGGTTCCCCCCCACCTATTCC
TCCCCCGTCACAAGCGTCAGAAGGAGCAGAAGGAGGTGGAGGAAGAATAA
GAGGAGTGAAGGAGGGAGGGCAAGGACGGGGGAGGGGGAGGGGGGGCATG
CTGCAGGAAGGGCTCGGGATCTGGCGTCGTGTTGGAGAAGGAGGAGGCAG
AGGGGGAGGTAGTGTAGGACGACGAACTGCTGTTGGTAGTGCTTATCTTG
GACGAGGTAGAGGGGTAAGAAGAAGAGGGGGGGTGGCTGACAAGGTTTAC
TTGGCTTTGAATGGAGATAGATTGAGGGGGAGGGAGGCGGTGGAGAGGTC
GAAGGAGGACCTGCTGAAGATGAGAGGGGGACTGACTTTCAATGCGCAGA
GAGCTGCAATTCAAAAAAGTCTCGTCACGAGCAGCGGCAGCAGCAGTGGC
AGTGTGGAAAAGAGGAGTCTGCAAGGAGCGAAGGAGGGAGGGAACGGGGA
AAAGAAGAAGAAGAAGAAGAAGGAGGAGGAGGAGGAGGAGGAGGAGGCCA
AGGCGCATTTGTCTTACCCGGCATGGGGGTTGCCTGAAGTGCTCGTGCGT
TCTTACGAAGAAATGGGGGTCCACAAGCTCTTTCCCTGGCAAGTGGAATG
TTTGGAGGCGGGACAAGGGAGAGTGCTGAGGGAGGGGAGGAATCTCGTGT
ATAGTGCCCCAACGAGTGGGGGAAAGACGCTGGTGGCCGAGTTGTTAATG
GTGCGGGCCTTGACCCATGCGGTGGAGAGTGGGACCGCGATGTTCGTCGT
GCCCTTCATTGCGCTTGCAGAGGAAAAAGCACGGTATTTTCGACAGATCT
GGGCGGGTCTGGAGCTAGGGGTGAAAAGCTTTCATAGCGATGCGGTTGAT
TCCAGCTTGACGGAGAACGTGCACGTGGCGGTGTGTACAATCGAGAGGGC
AAACGCCCTGGTGAATCGGCTGTTGGAACGGGGAGAGTTGGGCAGGTTGA
AGGTCGTGGTGGTGGATGAACTGCATATGATTGGGGATGATAGCCGAGGC
TTTCTGATCGAGGTAATGCTGGCCAAGCTCCGGATGGCGGCCTCACTCTC
TCCCTCCCTCGCAGGAGCCGTAGGAGGGACACACATTCAAATTGTGGGTT
TGAGTGCCACTCTTCCCAATCTCACTCAAATCGCCTCTTGGTTGAATGCT
CATCTGTATATCACCAAATTCCGCCCCGTCCAACTCTCCCTCCTTCTCTG
CACGGGCAAGCACCTCGAAGAACTCAAGCCCCTCCCTCCCGCCCACCCAC
CCACCCCTCCCTCCCTCCCTACATCGTCCTGGACACGAGTTCGTGACTTG
CCTCCTTATCATTCTGATGACACCCAGGCGGTCATTCATTTGTGTTTGGA
AACGGTGTCGCTGGGACAGGGGGTCTTGATTTTTTGTAGCAACCGCGCGT
GGACAGAGCGATGTGCAAGTCAGGTGGCGAGGGCTATTTCTGTGGGTTTG
AAGGACGATCTCCTGCGGGAGAAGGTGAGGCAGGGAAGGAAAGAGCTCTT
GATACGGTTGAATCTGACCCCTGTGGGACTGTCGGCAGACTTGGAGAGTC
TCGTGCGGGAGGGAGTGGCGTTCCATCATGCGGGAGTGACGATGGAAGAG
AGGACGTTGATTGAAGAGGGTTTCAAGACAGGGATTTTGAGTGTTATCGT
CGCAACGAGTACGTTGGCGGCGGGAGTGAACCTACCGGCAAGGAGAGTGA
TTGTGCGGACCAGGAAGGGCTTTAATGGCGCGGAAATTAAGGCGTTTGAA
TTCCACCAGATGTGTGGAAGGGCGGGGAGGTACGGCATTGACGGGAGCGG
GGAAGCGATCCTCATGACGAGTGAAAAAGAATTGAGCCTCGCGCGGGCTT
TTGCGTCGAGACCACTACCGCCGATGCGAAGCGCGTTACAGACAGGGATG
GGAGGAGGGCTGGAACGAGCGTTGCTCGAGGTAGTGATGGGAAAATTGGC
GGTGGGGGAGGAGGAGGAAGAGACGGGCGAGGGGAAGGGGTGGATGGAGG
TGTTTGCACGGTGTACTTTGTTTGCCACGCAGATGGGAGAGGGAGGGGAG
GAAGCCGGGGAAGAGGGAGGGGTGATGGGGCGATTTCGAGAGGCGCTGGT
CTACTTGGTGGAGCATCGTTTTATTGAGGGACGGTCGAGTCTGAAGGGAG
AGGAGAGGGAGGAGGACAAGATGAAGATGAAGACGAATCAGGAGGAGGGG
CGGGAGCGGGAGGAGGAGGAAGAGGCAGAAAAAAATTGGAAGGAAGAGAA
AAAGGGGAGAGGCAGCGCGAAGGACGTCAGTGTCGTTTCTTTAGCAGGGC
CTCCAGCAGCAAAGCCAGGCCCGGGCTCAGGCGCAACTCCCGTCGAGCAG
GGACAGCAAGAGCAGCTATCGGCCAAACGCACACCTCTGTTAGCCCTGCG
TGGTACGCAACTGGGTGCAGCTACCTTCTTTTCCAGTATGTCTCCCTATG
ATGCCACGCTCATGCTCACGGCGCTCTCACAAGCAGCGCGTGGAGGCCTC
ATTCTCACCAGTGACTTGCATCTTCTCTACTTATGCGTACCTCCGAATCG
CACGTATTTCACCCCTGACTGGAATGATCTCGTCAGACGCTCTCAGAGGT
GGGCACCAGAAGTAGCTACAGTAGCAGCAGCAGTGGGAGTGACTGAACGA
ACGTTGGAACGACTGGCTTTGGGGGGGAGGAGTTGGCCGATGAGTCGGGA
AGGTGGGAGGGAGAAGGAGAGGGAGGAGGAGGAGGGGGTGGCGAGGCGGT
TTGTATATGGACTAATGTTGCATGAGATGCTGCAGGAGGTGCCGCTGGTG
CGGTTGGAAAGGGGGGAAGTGAACAGGGGGACGTTGCAGGCGTTGAGGAG
CGATGCCCGGATGTTTTGCGGGATGATGGGGGTGTTCTGCAAGCACTTGA
AGTGGATGGCGTTGGCCCGATTGATACAGGGGTTGAGCGTGCGACTCGAG
TATGGGGTGGGGGAGGAGTTGGTAGGGCTGTGTCGGATAGATCCGGAGCT
TATGCATGCGTTGAGGGCACGTGCCTTGTTTGATGCCGGGTTTCAAGGTG
CCGACGAGCTGGCGGTGGCAAGTGTGGAGGACGTCGCGCGGGTTTTGGTG
AAAAGTATGCCTTTTGAGGCAGAAACTAAAGGAGAGGGAGGCAGAGGAGG
AGGGGGAGGGGGGGAACGAGGACGGGCAGATCGTGTGGCGGAACGGATTG
CAGGTTTGATCGTTCGAAAGGCACGAGGGGTGGTGAGGAAGCAATTGGCG
CTGACGACTGCAGGAGCATGTGGAAAGTATGAGGAAGGATGGAGAAAGCA
GAGAGCAGAGTGTGGTAAAGACGGGGAAAGGGAATTAATAGAGGACAAGA
GAGGGAGTTTGGAAAAGCGTAACATAACGTTTTAGTTGTGTGTCGACTGA
TGGATGCTGCCTGGTTTCATGCCAAAAAAGGGAGGTATAGTATGAGGAAA
TGTAACATACATATCGTTTACCATGTCTGTAGCTCGAGATCGATTGTAAA
TAGATCCATGTATATATTTGTGTGTGTATGTGTGTGTGTGTGTGTAAATG
ACCGAAAGACAGCAAGCAAGACAGTAAAAACTGTGCATGAGAGCAAAGGA
AAAAAAATAAGGAACGTGACGTCGGGG
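
As a quick consistency check, the length stated in the FASTA header can be compared with the number of residues in the record and with the span of the alignment location (chr21:355384..360560, inclusive 1-based coordinates). The sketch below is a minimal illustration in Python; the function names and the short toy record are hypothetical, and it does not rely on any bioinformatics library:

```python
def fasta_length(record: str) -> int:
    """Count sequence characters in a single FASTA record, skipping the '>' header line."""
    lines = [line.strip() for line in record.strip().splitlines()]
    return sum(len(line) for line in lines if line and not line.startswith(">"))


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate range such as 355384..360560."""
    return end - start + 1


# Toy record for illustration only (not the real gene sequence).
toy = """>toy_gene length=12bp
ATGCAA
CCGCCG
"""

print(fasta_length(toy))            # residues counted in the toy record
print(span_length(355384, 360560))  # span of the alignment location on chr21
```

Applied to the full record above, `fasta_length` should agree with both the `length=5177bp` header field and the coordinate span.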

protein sequence of NO21G01170.1

>NO21G01170.1-protein ID=NO21G01170.1-protein|Name=NO21G01170.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1645bp
MQPPPPPPSPIDAIQPADPTSPSASASSWTPPILPNAMRAPLGRNGQSPL
QMPSPRGTKRLLASPGAAVQADSSASLALVPPLKLPPPPPALASALSYNA
LLSDDDRDFINCAEFVSFFAEDVEEGEQWEGERKKECRHERVYEEAAAPT
AGPISEAATFQLPVSVTAPSLIPLERAVMLQHLPAHGRYISQRGREEGTE
KEEAVSPGPPPEAESSPLPRAALEASPPRPPPSSSTPSFRRYISQRGCEL
LAQRKAMQLTHSLGRGPTAPSPTGKKRMQHSPCQQTLPRSSSFSHAYSSS
SSHPPRPLLPPVPVQDSPQGLKLHAPACASAPPPPLPPSCPHTSFSSNLP
PFLPPPLPFSRHKSTSAVTSAVLATAQTEAAKSTIGAAAAALVTALPAAE
EEKRQPSQQEGHGHLLPPPPPPLHRPYRSFRVQLLQQQQHTQRHEAPSEK
HQLQPSPPQPEISYPFSHSHVSPYRQQQQQRHQQQLKDEEEVSNFSPSLA
PRFAPSPCHGGSPPPIPPPSQASEGAEGGGGRIRGVKEGGQGRGRGRGGM
LQEGLGIWRRVGEGGGRGGGSVGRRTAVGSAYLGRGRGVRRRGGVADKVY
LALNGDRLRGREAVERSKEDLLKMRGGLTFNAQRAAIQKSLVTSSGSSSG
SVEKRSLQGAKEGGNGEKKKKKKKEEEEEEEEAKAHLSYPAWGLPEVLVR
SYEEMGVHKLFPWQVECLEAGQGRVLREGRNLVYSAPTSGGKTLVAELLM
VRALTHAVESGTAMFVVPFIALAEEKARYFRQIWAGLELGVKSFHSDAVD
SSLTENVHVAVCTIERANALVNRLLERGELGRLKVVVVDELHMIGDDSRG
FLIEVMLAKLRMAASLSPSLAGAVGGTHIQIVGLSATLPNLTQIASWLNA
HLYITKFRPVQLSLLLCTGKHLEELKPLPPAHPPTPPSLPTSSWTRVRDL
PPYHSDDTQAVIHLCLETVSLGQGVLIFCSNRAWTERCASQVARAISVGL
KDDLLREKVRQGRKELLIRLNLTPVGLSADLESLVREGVAFHHAGVTMEE
RTLIEEGFKTGILSVIVATSTLAAGVNLPARRVIVRTRKGFNGAEIKAFE
FHQMCGRAGRYGIDGSGEAILMTSEKELSLARAFASRPLPPMRSALQTGM
GGGLERALLEVVMGKLAVGEEEEETGEGKGWMEVFARCTLFATQMGEGGE
EAGEEGGVMGRFREALVYLVEHRFIEGRSSLKGEEREEDKMKMKTNQEEG
REREEEEEAEKNWKEEKKGRGSAKDVSVVSLAGPPAAKPGPGSGATPVEQ
GQQEQLSAKRTPLLALRGTQLGAATFFSSMSPYDATLMLTALSQAARGGL
ILTSDLHLLYLCVPPNRTYFTPDWNDLVRRSQRWAPEVATVAAAVGVTER
TLERLALGGRSWPMSREGGREKEREEEEGVARRFVYGLMLHEMLQEVPLV
RLERGEVNRGTLQALRSDARMFCGMMGVFCKHLKWMALARLIQGLSVRLE
YGVGEELVGLCRIDPELMHALRARALFDAGFQGADELAVASVEDVARVLV
KSMPFEAETKGEGGRGGGGGGERGRADRVAERIAGLIVRKARGVVRKQLA
LTTAGACGKYEEGWRKQRAECGKDGERELIEDKRGSLEKRNITF*
Synonyms
Publications