NO04G03050, NO04G03050 (gene) Nannochloropsis oceanica

Overview
NameNO04G03050
Unique NameNO04G03050
Typegene
OrganismNannochloropsis oceanica (N. oceanica IMET1)
Sequence length16446
Alignment locationchr4:817774..834219 -

Link to JBrowse

Properties
Property NameValue
DescriptionBeta-alanine--pyruvate aminotransferase
Mutants
Expression

Hover the mouse over a column in the graph to view expression values.
Sort Descending | Sort Ascending | Only Non-Zero Values | Tile/Chart | Reset

back to top
Alignments
The following features are aligned
Aligned FeatureFeature TypeAlignment Location
chr4genomechr4:817774..834219 -
Analyses
This gene is derived from or has results from the following analyses
Analysis NameDate Performed
GSE1786722023-12-15
PRJNA9336932023-10-26
PRJNA9434492023-07-19
PXD0160542021-01-14
PXD0166992021-01-08
GSE1499042020-10-10
NoIMET1_WT_HS2020-09-29
PXD0100302020-03-16
GSE1396152020-03-11
PRJNA2413822020-03-11
BLASTP analysis of N. oceanica IMET1 genes2019-07-11
InterPro analysis for N. oceanica IMET1 genes2019-07-11
GO annotation for IMET1v2 genes2019-07-10
PXD0087212019-04-30
PRJNA1821802019-04-25
Gene prediction for N. oceanica IMET12017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
TermDefinition
GO:0030170pyridoxal phosphate binding
GO:0008483transaminase activity
Vocabulary: INTERPRO
TermDefinition
IPR015424PyrdxlP-dep_Trfase
IPR005814Aminotrans_3
Vocabulary: Cellular Component
TermDefinition
GO:0016020membrane
Homology
BLAST of NO04G03050 vs. NCBI_GenBank
Match: EWM28721.1 (hypothetical protein Naga_100002g88 [Nannochloropsis gaditana])

HSP 1 Score: 1944.1 bits (5035), Expect = 0.000e+0
Identity = 995/1692 (58.81%), Postives = 1257/1692 (74.29%), Query Frame = 0
Query:   28 LRSQHSLRHKLQKWRGTQWMSRGAGEINLDESQSLSLLDVRGRIRACGAGLWQRRSPMCMGFSFIFLAVTIMMCVLLVDVLRLIPCMNPNSPCFNLALAEFQNICSFERMPVRLTTTAMLPHIYSRLHVKSAVIDVALRGDHGPGHVATSSFTEKDALSSMVLKGGEQNLTVNSIMVVTNTTGLAIMFAYLLNEIDFQVAITAKIQVVVQTPLRVRIKAPINSNFFLTCGHLPCKESVCKYICQFGNYPLKDPDPGNPYPYVTRVEKVRLAYDEEGEYIISPQVDVWLRDFKASASFPKSILDMYFYNSTKPGSISGFRERHDLDEDHKLFRVTLHEFLLQSLESSGGKPFRMKMDLRLLNNTAGQAATARDFTTRFLGKEPMYIYLAATPIDTTCPQLRNPLLMVPPAGLSLNTNLSTKDSSDSSLSNMGALTDVFRVNSVDLFSLNNTILRSFANITLMLPFALEGELPALDMDGFAGPDLVGHFKAFPVTLPPVVPRQGFERDRAPAHTFGKEAMSLFAPKGEATVVLFTEAELSQWGIEKLETIFPNVLSQLEELDIDLVGSRPAEPGNVMQTFLNGVVIKLASLEDRLIQHGEANEDRMFFTPADSPIKPRVHMEVKSIPYVHSNDTGLLIYLNFTYPDAFQGRFDIRGGSMHFLLADWNSTALATVTVLDSLLSRELTHLEAEVFISDRTQAEAVHNLIDALSKKMPSGLRISEGVVDSAPGLKQRVPISYTDVEWGCAEDIVVHCPNLRPSETRACLNQALDGRILSHQCLKVLLPRGELDLEIRDLAKVMDVGRGIANGLLEQVKTTASKVAVNLIEVLSLSTPEEAIDLQVILEENKLDIGMPEGTPVEVRVRANLSIGIFDSIDLTFVVPPLAVEISTPL-PKNLAERLKVPAPVGEKKAVKATKRDQYXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPVTMDYNADDKEYAATTSLLTLVLPAMDKEYVHSSLLLVDSAVRISNIYHAMCWLEDIMADLTDKYVIVHPASTGDVWSYIMAPLRLKIDILTAINSTSGQPPSDTDESGAPATFLQENINETWSKILFRAASFPELIVTESQLAFSQAWSEAVDVRIPAMGMSVYTSFVEEGDRHLVPLLTDDTICLKDAHACLLGDFGAGTFQTTRGAMVMVDTNLNMTAEDGGRFWGHILSDYMGGARVGLYLKTKTGAAGDDGKALFDMDASIILPKMPPLNIPLNDIVSQVGEITRRALQESRRQLSLAEIWSSLDPIILSMTQPVSIKVDTVTLENFESNTIDPLDPASIALGGDLLARSIVTIPDILQVDLQIPPLTMQIHNGNLSKLLLPAVPGTEGDGVTGGHSLMHLPWASAQVSEFVYDSQRDNTSTVKLHIDILRVKQALKLMEDVLMGGRNVTMTVVGTEGTSNSLFSHIFSHMKFQLDMFSKPVPCVTADKPDANDDDRVATPLGENASHKSARTIPTERISISAHLSSMGTALWGDAEIFIPQQTNPLVMTVLAGAMELRMVLKGIPEATEPLAYISLTPMVLSQERDMHVEAKMVMEPANVETLRILVARLKAREVTEVVLKGTVGNAPIGRFKTSLLLTPALVDKLGIFPDGGETFIKVSLDKLLRS-NTPFRFDDVAIVGGEDGVGSSVSLPCLLKGWCDG 1718
            +  + +L+ KLQ W G QW+SR   E+NLDESQS+SLL VR R  AC  G    RS + +    +F+ V +++CVLL+D  RL+PC+ P+SPCF LA   F++IC+FERMP+RL +TA +PH+YSRLHVKSA+ID++LRG  G  H+A  SFT+ DAL  M+L GG QNLTV+SI+ VTNTT LA M  Y LNE++FQ  IT +I                 SNF+LTCGH+PCK +VCKYICQF +YP++DPDP NPYPYVTRVEKV++AYD E +Y+ISP+V+VWL +FK +A+FPKS LD+YFYN+TKPGS  G R+R DLDEDH+LF++TLHEFLLQSLE+SGG+PF M MDLRL N T GQAATARDFTTRFLGKEPMYIYLA +  D+TCPQLR  +L +PPAGLSLNTN STKDSSDSS+SNMGALTDVFRVNSVDLFSLNNTILRSF N+TLMLPFALEGELP LDMDGFAGP+L+GHFKAF  TLPPV PR+G + ++ P H+FGK AMS FAP+GE T++LFTEA+LS WG+EK+E I PN LSQLEELDIDL+GSRP+ PGNVMQTFL+GVVIKLASLEDR +QHG A   R+F+TP +S IKPRVH EV+SI Y HSN TGLL Y+NFTYP+AFQGRF+IRGGSMHF+LAD NSTALATVTVL+ LLS+ELTHLEAEVFISD TQA+AVH+LID+ +K +P  +++  G +D+AP LK+RVPI + D++  CA DI  +CP L   +TR CLN+A+  RIL+++C K LLPRG L+L I DLAK +++GRG+A G+L+++ +TASKVAVN +E+LSL TPE  +DLQ+ILEE+    GMP+GTPV+V  RAN+++GIF+ +DLTFVVPPLAVE+  PL  K     +  P  V +  A+   +                                                                    P ++ Y+A D       SL TL +PAM+K+   SS+LL++ A+RISNIY AM WLEDIM+DLTDKYV+  P STGDVWSY+M+P+++K+D+LTAINSTS Q      E+ +P + L++ +N+TWS+I+FR+ASF ++IVTE+QL FS+ WSEAVDV IP   ++VY S+V+EGDRH  P+LTDD  C KD  AC LG F AG F+TTRGAMV++DTNLNMT++D GRFWGH+LSDY+ GARV L+LKT+ GA   DG+ALFD+DAS ILPKMPPLN PL D+V+ VGE  + A Q++ R+LSLAEIW+SLDP ILS+ QPVS+K++TV L+NF SN IDPLD AS  LG DL+A S VTIP ILQV L IPPL+M I+NGN+++ L+ ++   EG+      + M  PWASA++SEFVYDS+  N+S V LH+D+LRVKQAL+L ED+L+ G NVTMT+ GT   S SLFS I SHM  Q+D++S     +  D+  A   + ++ P+  +   K A  + TER+  SA ++S  + L  DA+I IPQQ NPLVMT+LAGA++LR++L+ +PEATEPLA IS  P  LSQERDM V+ ++ M+ + V TL +L+ RL+AR+ TEV+L+G V N   GRFKTS++LTPAL  KLG+FPD   TF+KV LD+LLR+  TPFR D VAIVGG DGVGS VSLPCLLKGWCDG
Sbjct:   22 VEERQTLQQKLQGWFGGQWLSRCREEVNLDESQSISLLGVRARAAACLWGHSHWRSILGVAIFLLFVPVLVVVCVLLLDAARLLPCLRPSSPCFRLAELRFEDICTFERMPLRLVSTAFIPHVYSRLHVKSAMIDISLRGPKGLEHLADLSFTDADALVPMILTGGLQNLTVDSILEVTNTTRLAYMVGYQLNEVEFQAVITTRIXXXXXXXXXXXXXXXXRSNFYLTCGHVPCKAAVCKYICQFKDYPIEDPDPENPYPYVTRVEKVKMAYDSEMQYVISPKVNVWLSNFKVAATFPKSTLDLYFYNATKPGSTLGLRDRQDLDEDHRLFQMTLHEFLLQSLENSGGQPFTMNMDLRLCNTTPGQAATARDFTTRFLGKEPMYIYLAQSQNDSTCPQLRKAMLRIPPAGLSLNTNYSTKDSSDSSISNMGALTDVFRVNSVDLFSLNNTILRSFVNLTLMLPFALEGELPMLDMDGFAGPNLLGHFKAFHSTLPPVPPREGHDYNKPPEHSFGKMAMSEFAPRGEDTILLFTEAQLSPWGVEKVEAILPNFLSQLEELDIDLIGSRPSNPGNVMQTFLSGVVIKLASLEDRFVQHGGATGGRLFYTPLESHIKPRVHTEVRSISYEHSNSTGLLAYVNFTYPEAFQGRFNIRGGSMHFVLADGNSTALATVTVLNFLLSKELTHLEAEVFISDNTQAKAVHDLIDSYTKHLPLSIKVLNGFIDTAPNLKERVPIPHADLKEQCASDIETYCPGLSLFDTRTCLNRAMGSRILTYKCTKALLPRGALELGIADLAKFVNLGRGLATGVLDRITSTASKVAVNQVEILSLGTPEAPMDLQIILEEDNFATGMPDGTPVDVTARANMTLGIFEIVDLTFVVPPLAVEVLAPLHEKQNVRNVSSPLTVNQTGALYEAR--------------------------------------------------------------------PKSIIYSAGDVVDDDLRSLFTLAMPAMEKDKTSSSMLLINPAIRISNIYDAMFWLEDIMSDLTDKYVVARPVSTGDVWSYMMSPIQIKVDVLTAINSTSSQSTGHPGEARSPTSLLKQPVNDTWSRIVFRSASFLDVIVTETQLVFSEPWSEAVDVSIPPTRLAVYASYVKEGDRHFAPILTDDAPCQKDTRACPLGVFKAGAFRTTRGAMVVLDTNLNMTSDDDGRFWGHLLSDYVAGARVALHLKTENGAK-SDGEALFDVDASFILPKMPPLNFPLTDLVNPVGEAKQHAQQDAGRKLSLAEIWTSLDPFILSVMQPVSVKIETVQLQNFYSNPIDPLDLASTTLGADLVACSTVTIPGILQVALSIPPLSMHIYNGNITENLI-SLKNEEGE-----RTPMPSPWASARLSEFVYDSKLGNSSAVDLHVDVLRVKQALQLAEDMLLRGENVTMTISGTVDPSASLFSRIVSHMSIQVDIYSDEPVVLDIDEQGA---EEMSPPVKPSLFEKKATKL-TERVFASARVASTPSELLMDADIVIPQQGNPLVMTLLAGAIDLRLILERVPEATEPLARISFDPFFLSQERDMQVKMRVAMDMSCVRTLHVLLERLRARKTTEVMLEGIVNNTAPGRFKTSVILTPALTTKLGLFPDDTATFVKVLLDELLRNPMTPFRMDSVAIVGGADGVGSPVSLPCLLKGWCDG 1634          
BLAST of NO04G03050 vs. NCBI_GenBank
Match: EWM28828.1 (aminotransferase class-iii [Nannochloropsis gaditana])

HSP 1 Score: 822.4 bits (2123), Expect = 3.400e-234
Identity = 402/487 (82.55%), Postives = 432/487 (88.71%), Query Frame = 0
Query: 1741 SAFWRTTKASARAAIALPIHRR-CARTSLATSTRMTEPPFEAATAA--DQDAMAAFWMPFTSNKTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQKQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTMCGSTSVDTALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVIVEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGAEYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAPGMIELFHGYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNVIDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSPPLIIEKSQVDQIVGTLSQTLHEVA 2225
            S+  R T+     A A  + R+   R  LATSTRM+ P     T    DQ  + AFWMPFT+NK FK+SPRIL KAKGMHYWT++G KVLDGTAGLWCVNAGHC+E IV AIQKQAAA+DFAPTFNMGHPLAFEYAHRLC ELMPG+GLDQVFFTMCGST+VDTALK+ALAYHKARGEG R+RLIGRERAYHGVGFGGISVGGM+PNRR FGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHD+STIAAVIVEPVSGSAGVLVPP+GYLKRLREICD HGILLIFDEVITAFGRLGKGMGAEYFGV+PDMVTCAKGLTNAAVPAGATFVQKHVYD FM      G IELFHGYTYSGHPLAMAAGLATLDVYQ EGLFER+A +APYWE+ LHALKGHP+V+DIRNLGLMGAVEL PR GKPGQRAFQ+FTTAL+KGVMVRVTG+TIALSPPLI+EK Q+DQIV TLSQ LHEVA
Sbjct:    3 SSLTRATRPPVALAAAAQLPRQWLPRAFLATSTRMSAPSTAPETNCNIDQATLGAFWMPFTANKAFKRSPRILAKAKGMHYWTEDGHKVLDGTAGLWCVNAGHCNERIVSAIQKQAAALDFAPTFNMGHPLAFEYAHRLCQELMPGKGLDQVFFTMCGSTAVDTALKVALAYHKARGEGSRFRLIGRERAYHGVGFGGISVGGMLPNRRAFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDASTIAAVIVEPVSGSAGVLVPPYGYLKRLREICDAHGILLIFDEVITAFGRLGKGMGAEYFGVIPDMVTCAKGLTNAAVPAGATFVQKHVYDTFMGATPGDG-IELFHGYTYSGHPLAMAAGLATLDVYQDEGLFERAAALAPYWEETLHALKGHPHVVDIRNLGLMGAVELVPRAGKPGQRAFQVFTTALHKGVMVRVTGETIALSPPLIVEKGQIDQIVTTLSQALHEVA 488          
BLAST of NO04G03050 vs. NCBI_GenBank
Match: WP_089288237.1 (MULTISPECIES: aspartate aminotransferase family protein [Azospirillum] >SNS46973.1 beta-alanine--pyruvate transaminase [Azospirillum sp. RU38E] >SNS66132.1 beta-alanine--pyruvate transaminase [Azospirillum sp. RU37A])

HSP 1 Score: 611.7 bits (1576), Expect = 9.100e-171
Identity = 292/432 (67.59%), Postives = 346/432 (80.09%), Query Frame = 0
Query: 1792 AFWMPFTSNKTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQKQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTMCGSTSVDTALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVIVEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGAEYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAPGMIELFHGYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNVIDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSPPLIIEKSQVDQIVGTLSQTLHEV 2224
            AFWMPFT+N+ FKQ+PR+L  A GM+Y +++G+++LDGTAGLWCVNAGHC   I  AIQKQAA MDFAPTF MGHP+AFE+A +L   L P  GLD VFFT  GS SVDTALK+ALAYH++RGEG R R IGRER YHGVGFGGISVGG+V NR+ FG  LPGVDHL  TYDR HQAFS+GQPEWGAHLA++LER+V LHD+STIAAVIVEPV+ S GVLVPP GYL++LR ICD HG+LLIFDEVIT FGRLG     +YFGV+PD++T AKGLTNA VP GA F QK +YDAFM   +   +IELFHGYTYSGHPLA AAGLATLD+Y+ EGL  R+A +A YWE A+H+L+   +V+DIRNLGL+GA+EL P  G+P +RAF  F     KG+++R TGDTIALSPPLI+EK Q+DQIV T+   L EV
Sbjct:   26 AFWMPFTANRQFKQAPRLLVSASGMYYRSEDGRQILDGTAGLWCVNAGHCRAEITAAIQKQAAEMDFAPTFQMGHPVAFEFASQLVQLLPP--GLDHVFFTNSGSESVDTALKVALAYHRSRGEGQRTRFIGRERGYHGVGFGGISVGGIVTNRKAFGAQLPGVDHLPHTYDRAHQAFSRGQPEWGAHLADELERIVALHDASTIAAVIVEPVACSTGVLVPPQGYLEKLRAICDKHGLLLIFDEVITGFGRLGAPFATDYFGVMPDIITAAKGLTNATVPMGAVFFQKKIYDAFMTGPE--NVIELFHGYTYSGHPLAAAAGLATLDLYRNEGLLTRAAELAGYWEVAMHSLRDAAHVVDIRNLGLIGAIELEPIAGQPTRRAFSAFLKCFEKGLLIRTTGDTIALSPPLIVEKGQIDQIVDTIRTVLTEV 453          
BLAST of NO04G03050 vs. NCBI_GenBank
Match: EGY01753.1 (beta alanine--pyruvate transaminase [Nitrospirillum amazonense Y2])

HSP 1 Score: 609.8 bits (1571), Expect = 3.500e-170
Identity = 293/434 (67.51%), Postives = 346/434 (79.72%), Query Frame = 0
Query: 1790 MAAFWMPFTSNKTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQKQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTMCGSTSVDTALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVIVEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGAEYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAPGMIELFHGYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNVIDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSPPLIIEKSQVDQIVGTLSQTLHEV 2224
            M AFWMPFT+N+ FK++PR+   AK MHY TD+G++VLDGTAGLWCVNAGHC   IV A+Q Q AAMD+AP F MGHP AFE A +L   L+P  G+D+VFFT  GS SVDTALK+ALAYH+ARGEG R R IGRER YHGVGFGGISVGG+V NR+ FGPML GVDHL  T+D+ HQAF+KGQPEWGAHLA+DLER+V LHD+STIAAVIVEPV+GS GVL+PP GYL++LR ICD HGILLIFDEVIT FGRLG     +YFGVVPD++T AKGLTNA VP GA F + H+YDAFM   +    IELFHGYTYSGHPLA AAGLATL +Y++EGL  R A +A YW++A H+L+G P+VID+RNLGL+  +EL P  GKP  RAF  F  A  KG+++R TGD IALSPPLIIEKS +DQI GTL++ L E+
Sbjct:    9 MDAFWMPFTANRQFKKAPRLFVGAKDMHYTTDDGRQVLDGTAGLWCVNAGHCRPEIVQAVQAQVAAMDYAPAFQMGHPGAFELAAQLA-ALLP-TGMDKVFFTNSGSESVDTALKVALAYHRARGEGTRTRFIGRERGYHGVGFGGISVGGIVTNRKFFGPMLAGVDHLPHTHDKTHQAFTKGQPEWGAHLADDLERIVTLHDASTIAAVIVEPVAGSTGVLLPPKGYLEKLRAICDRHGILLIFDEVITGFGRLGAPFATDYFGVVPDIITAAKGLTNATVPMGAVFFKNHIYDAFMTGPE--NAIELFHGYTYSGHPLATAAGLATLKIYKEEGLLTRGAELASYWQEAAHSLRGLPHVIDVRNLGLIAGIELEPIAGKPTARAFDAFLRAFEKGLLIRTTGDIIALSPPLIIEKSHIDQIFGTLAEVLREL 438          
BLAST of NO04G03050 vs. NCBI_GenBank
Match: WP_040844835.1 (aspartate aminotransferase family protein [Nitrospirillum amazonense])

HSP 1 Score: 609.8 bits (1571), Expect = 3.500e-170
Identity = 293/434 (67.51%), Postives = 346/434 (79.72%), Query Frame = 0
Query: 1790 MAAFWMPFTSNKTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQKQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTMCGSTSVDTALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVIVEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGAEYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAPGMIELFHGYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNVIDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSPPLIIEKSQVDQIVGTLSQTLHEV 2224
            M AFWMPFT+N+ FK++PR+   AK MHY TD+G++VLDGTAGLWCVNAGHC   IV A+Q Q AAMD+AP F MGHP AFE A +L   L+P  G+D+VFFT  GS SVDTALK+ALAYH+ARGEG R R IGRER YHGVGFGGISVGG+V NR+ FGPML GVDHL  T+D+ HQAF+KGQPEWGAHLA+DLER+V LHD+STIAAVIVEPV+GS GVL+PP GYL++LR ICD HGILLIFDEVIT FGRLG     +YFGVVPD++T AKGLTNA VP GA F + H+YDAFM   +    IELFHGYTYSGHPLA AAGLATL +Y++EGL  R A +A YW++A H+L+G P+VID+RNLGL+  +EL P  GKP  RAF  F  A  KG+++R TGD IALSPPLIIEKS +DQI GTL++ L E+
Sbjct:    1 MDAFWMPFTANRQFKKAPRLFVGAKDMHYTTDDGRQVLDGTAGLWCVNAGHCRPEIVQAVQAQVAAMDYAPAFQMGHPGAFELAAQLA-ALLP-TGMDKVFFTNSGSESVDTALKVALAYHRARGEGTRTRFIGRERGYHGVGFGGISVGGIVTNRKFFGPMLAGVDHLPHTHDKTHQAFTKGQPEWGAHLADDLERIVTLHDASTIAAVIVEPVAGSTGVLLPPKGYLEKLRAICDRHGILLIFDEVITGFGRLGAPFATDYFGVVPDIITAAKGLTNATVPMGAVFFKNHIYDAFMTGPE--NAIELFHGYTYSGHPLATAAGLATLKIYKEEGLLTRGAELASYWQEAAHSLRGLPHVIDVRNLGLIAGIELEPIAGKPTARAFDAFLRAFEKGLLIRTTGDIIALSPPLIIEKSHIDQIFGTLAEVLREL 430          
BLAST of NO04G03050 vs. NCBI_GenBank
Match: WP_088874047.1 (aspartate aminotransferase family protein [Nitrospirillum amazonense] >ASG23558.1 aspartate aminotransferase family protein [Nitrospirillum amazonense CBAmc])

HSP 1 Score: 607.1 bits (1564), Expect = 2.200e-169
Identity = 291/434 (67.05%), Postives = 346/434 (79.72%), Query Frame = 0
Query: 1790 MAAFWMPFTSNKTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQKQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTMCGSTSVDTALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVIVEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGAEYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAPGMIELFHGYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNVIDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSPPLIIEKSQVDQIVGTLSQTLHEV 2224
            M AFWMPFT+N+ FK++PR+   AK MHY TD+G++VLDGTAGLWCVNAGHC   IV A+Q Q AAMD+AP F MGHP AFE A +L   L+P  G+D+VFFT  GS SVDTALK+ALAYH+ARGEG R R IGRER YHGVGFGGISVGG+V NR+ FGPML GVDHL  T+D+ HQAF+KGQPEWGAHLA++LER+V LHD+STIAAVIVEPV+GS GVL+PP GYL++LR ICD HGILLIFDEVIT FGRLG     +YFGVVPD++T AKGLTNA VP GA F + H+YDAFM   +    IELFHGYTYSGHPLA AAGLATL +Y++EGL  R A +A YW++A H+L+G P+VID+RNLGL+  +EL P  GKP  RAF  F  A  KG+++R TGD IALSPPLIIEKS +DQI GTL++ L ++
Sbjct:    1 MDAFWMPFTANRQFKKAPRLFVGAKDMHYTTDDGRQVLDGTAGLWCVNAGHCRPEIVQAVQAQVAAMDYAPAFQMGHPGAFELAAQLA-ALLP-TGMDKVFFTNSGSESVDTALKVALAYHRARGEGTRTRFIGRERGYHGVGFGGISVGGIVTNRKFFGPMLAGVDHLPHTHDKAHQAFTKGQPEWGAHLADELERIVTLHDASTIAAVIVEPVAGSTGVLLPPKGYLEKLRAICDRHGILLIFDEVITGFGRLGAPFATDYFGVVPDIITAAKGLTNATVPMGAVFFKNHIYDAFMTGPE--NAIELFHGYTYSGHPLATAAGLATLKIYKEEGLLTRGAELASYWQEAAHSLRGLPHVIDVRNLGLIAGIELEPIAGKPTARAFDAFLRAFEKGLLIRTTGDIIALSPPLIIEKSHIDQIFGTLAEVLRDL 430          
BLAST of NO04G03050 vs. NCBI_GenBank
Match: WP_051330067.1 (aspartate aminotransferase family protein [Niveispirillum irakense])

HSP 1 Score: 603.6 bits (1555), Expect = 2.500e-168
Identity = 290/429 (67.60%), Postives = 347/429 (80.89%), Query Frame = 0
Query: 1795 MPFTSNKTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQKQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTMCGSTSVDTALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVIVEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGAEYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAPGMIELFHGYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNVIDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSPPLIIEKSQVDQIVGTLSQTLHEV 2224
            MPFT+N+ FKQ+PR+L  A GM+Y +D+G+++LDGTAGLWCVNAGHC   IV AIQ QAA MD+APTF MGHP+AFE+A RL  +L+P  G+D VFFT  GS SVDTALK+ALAYH+ARGEG R R IGRER YHGVGFGGISVGG+V NR+ FG MLPGVDHL  TYDR HQAF++GQP+WGAHLA++LER+V LHD+ST+A VIVEPV+ S GVLVPP GYL++LR ICD HG+LLIFDEVIT FGRLG     +YFGVVPD++T AKGLTNA VP GA F QK +YDAFM+  +   +IELFHGYTYSGHPLA AAGLATLDVYQQEGL  R+  +A YWE A+H+L+   +VIDIRNLGL+GA+EL P  G+P +RAF  F     +GV++R TGDTIALSPPLI+EK+Q+DQIV T+   L +V
Sbjct:    1 MPFTANRQFKQAPRLLVSASGMYYRSDDGREILDGTAGLWCVNAGHCRPEIVAAIQAQAAQMDYAPTFQMGHPVAFEFASRLV-QLLPD-GMDHVFFTNSGSESVDTALKVALAYHRARGEGQRTRFIGRERGYHGVGFGGISVGGIVTNRKVFGTMLPGVDHLPHTYDRTHQAFTRGQPDWGAHLADELERIVALHDASTVAGVIVEPVACSTGVLVPPKGYLEKLRAICDRHGLLLIFDEVITGFGRLGTPFATDYFGVVPDIITSAKGLTNATVPMGAVFFQKAIYDAFMNGPE--HVIELFHGYTYSGHPLAAAAGLATLDVYQQEGLLTRAGDLAGYWEVAMHSLRDARHVIDIRNLGLIGAIELEPLSGQPTKRAFSTFLKCFEQGVLIRTTGDTIALSPPLIVEKAQIDQIVDTIRAALAQV 425          
BLAST of NO04G03050 vs. NCBI_GenBank
Match: WP_008944101.1 (aspartate aminotransferase family protein [Oceanibaculum indicum] >EKE76463.1 beta alanine--pyruvate transaminase [Oceanibaculum indicum P24])

HSP 1 Score: 600.5 bits (1547), Expect = 2.100e-167
Identity = 286/432 (66.20%), Postives = 347/432 (80.32%), Query Frame = 0
Query: 1790 MAAFWMPFTSNKTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQKQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTMCGSTSVDTALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVIVEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGAEYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAP-GMIELFHGYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNVIDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSPPLIIEKSQVDQIVGTLSQTL 2221
            + +FWMPFT+N+ FK++PR+L +A+GMHY T +G+++LDGTAGLWCVNAGH  E IV AIQKQA  +D+AP+F MGHPL+FE A RL   L    G D+VFFT  GS SVDTALKIA+AYH+ RGEG R RLIGRER YHGVGFGGISVGG+V NR+ FG +LPGVDHL  T+  E+QAF++GQP WG HLA++LER+V LHD+ST+AAVIVEPV+GS GVL+PP GYL++LR+ICD HGILLIFDEVIT FGRLG     E+FGV PD++T AKGLTN  VP GA F ++ +YD FMD  KAP G IELFHGYTYSGHPLA AAGLATLDVY++EGLFER+A +APYWEDA+H+LKG   VID+RNLG++  +EL   PGKP  RA + FT A  KG+++R TGD IALSPPLI+EKS +D++ GTL   L
Sbjct:   14 LESFWMPFTANRAFKKNPRMLVEAEGMHYTTADGRRILDGTAGLWCVNAGHRREKIVKAIQKQAEKLDYAPSFQMGHPLSFELASRLTTML---PGYDRVFFTGGGSESVDTALKIAIAYHRVRGEGARTRLIGRERGYHGVGFGGISVGGIVANRKFFGSLLPGVDHLPHTHSPENQAFTRGQPAWGGHLADELERIVALHDASTVAAVIVEPVAGSTGVLIPPQGYLQKLRQICDKHGILLIFDEVITGFGRLGSSFATEHFGVKPDLITTAKGLTNGCVPMGAVFAKQGIYDTFMD--KAPEGTIELFHGYTYSGHPLAAAAGLATLDVYKEEGLFERAAELAPYWEDAVHSLKGTGPVIDLRNLGIIAGIELEGIPGKPTARAMEAFTKAYEKGLLIRTTGDIIALSPPLILEKSHIDEMFGTLGDVL 440          
BLAST of NO04G03050 vs. NCBI_GenBank
Match: WP_034851442.1 (aspartate aminotransferase family protein [Inquilinus limosus])

HSP 1 Score: 598.2 bits (1541), Expect = 1.000e-166
Identity = 297/435 (68.28%), Postives = 348/435 (80.00%), Query Frame = 0
Query: 1790 MAAFWMPFTSNKTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQKQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTM-CGSTSVDTALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVIVEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGAEYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAPGMIELFHGYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNVIDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSPPLIIEKSQVDQIVGTLSQTLHEV 2224
            + +FWMPFT+N++FK +PR+   AKGM+Y T +G++VLDGTAGLWCVNAGHCH  IV AIQ QAA MD+AP+F MGHP AFE A R+  +L+PG  LD VFF    GS +VD+ALKIALAY +  G+G R RLIGRER YHGVGFGGISVGGMV NRRTFG ML GVDHLR T+D    AFS+G PE GA LA+DLER+V LHD+STIAAVIVEPV+GS GVL+PP GYL+RLREICD HGILLIFDEVIT FGRLG    A++FGV PDMVT AKGLTNAAVPAGA  V+KH+YDAFM   +   MIELFHGYTYS HPLA AAGLATLDVY++EGLFER A +APYWE ALH+LKG  +VIDIRNLGL+GAVEL P PG+P  RA+ +F     KGV +R TGD +ALSPP IIEK+++D++V TL +TL  +
Sbjct:   16 LESFWMPFTANRSFKAAPRLYASAKGMYYTTVDGRQVLDGTAGLWCVNAGHCHPKIVQAIQDQAAEMDYAPSFQMGHPKAFELASRVA-QLLPG-DLDHVFFAAGGGSEAVDSALKIALAYQRQIGQGTRTRLIGRERGYHGVGFGGISVGGMVANRRTFGTMLTGVDHLRHTHDLARNAFSRGLPEHGAELADDLERIVALHDASTIAAVIVEPVAGSTGVLIPPKGYLQRLREICDKHGILLIFDEVITGFGRLGASFAADWFGVQPDMVTMAKGLTNAAVPAGAVGVRKHIYDAFMTGPE--HMIELFHGYTYSAHPLACAAGLATLDVYREEGLFERVAGLAPYWEQALHSLKGTRHVIDIRNLGLIGAVELEPIPGRPTARAYDLFVKCFQKGVGIRTTGDIVALSPPFIIEKAEIDRLVDTLRETLQSL 446          
BLAST of NO04G03050 vs. NCBI_GenBank
Match: WP_028792763.1 (MULTISPECIES: aspartate aminotransferase family protein [Thalassobaculum] >SDF64233.1 beta-alanine--pyruvate transaminase [Thalassobaculum litoreum DSM 18839])

HSP 1 Score: 597.4 bits (1539), Expect = 1.800e-166
Identity = 288/447 (64.43%), Postives = 351/447 (78.52%), Query Frame = 0
Query: 1778 PFEAATAADQDAMAAFWMPFTSNKTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQKQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTMCGSTSVDTALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGPMLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVIVEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGAEYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAPGMIELFHGYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNVIDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSPPLIIEKSQVDQI---VGTLSQTLH 2222
            P  A  +   + + +FWMPFTSN+ FKQ+PR+  +A+GMHY T +G++VLDGT+GLWC NAGH  + IV AI+KQA  +DFAP F MGHPL+FE+A RL   +    G D VFFT  GS SVDTALKIALAY +ARG+G R RLIGRER YHGVGFGGISVGG+V NR+ FG +L GVDH+R T+D EH A++KG+P+WGAHLA+DLER+V LHD+STIAAVIVEPV+GS GVL+PP GYL+RLR+ICD HGILLIFDEVIT FGRLG   G++YFGV PD+ T AKG+TNA VP GA F +  +YDAFMD  +  G IELFHGYTYSGHPLA AAGLAT+D+Y+ EGLFER+A +APYWEDA+H+LKG  +VIDIRNLG++GA+EL    GKP  RA   F    +KG++VR TGD IALSPPLI+EK Q+D I   +G + +TLH
Sbjct:    2 PLAANESTKPNDLESFWMPFTSNRDFKQNPRLFVEAEGMHYITADGRRVLDGTSGLWCSNAGHRRKPIVDAIKKQAEVLDFAPAFQMGHPLSFEFASRLTQMI---EGFDHVFFTNSGSESVDTALKIALAYQRARGQGTRTRLIGRERGYHGVGFGGISVGGIVGNRKQFGTLLNGVDHMRHTHDMEHNAYTKGEPDWGAHLADDLERIVALHDASTIAAVIVEPVAGSTGVLIPPKGYLQRLRDICDKHGILLIFDEVITGFGRLGTPFGSDYFGVKPDIFTTAKGITNATVPMGAVFCRDGIYDAFMDAPE--GAIELFHGYTYSGHPLACAAGLATIDLYRDEGLFERAAELAPYWEDAMHSLKGSNHVIDIRNLGMVGAIELEGIAGKPTARAMDAFKQCYDKGLLVRTTGDIIALSPPLIVEKGQIDFIADTIGDVLRTLH 443          
The following BLAST results are available for this feature:
BLAST of NO04G03050 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match NameE-valueIdentityDescription
EWM28721.10.000e+058.81hypothetical protein Naga_100002g88 [Nannochlorops... [more]
EWM28828.13.400e-23482.55aminotransferase class-iii [Nannochloropsis gadita... [more]
WP_089288237.19.100e-17167.59MULTISPECIES: aspartate aminotransferase family pr... [more]
EGY01753.13.500e-17067.51beta alanine--pyruvate transaminase [Nitrospirillu... [more]
WP_040844835.13.500e-17067.51aspartate aminotransferase family protein [Nitrosp... [more]
WP_088874047.12.200e-16967.05aspartate aminotransferase family protein [Nitrosp... [more]
WP_051330067.12.500e-16867.60aspartate aminotransferase family protein [Niveisp... [more]
WP_008944101.12.100e-16766.20aspartate aminotransferase family protein [Oceanib... [more]
WP_034851442.11.000e-16668.28aspartate aminotransferase family protein [Inquili... [more]
WP_028792763.11.800e-16664.43MULTISPECIES: aspartate aminotransferase family pr... [more]

Pages

back to top
Relationships

This gene is member of the following syntenic_region feature(s):

Feature NameUnique NameSpeciesType
nonsL081nonsL081Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ncniR058ncniR058Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR111ngnoR111Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region
ngnoR106ngnoR106Nannochloropsis oceanica (N. oceanica IMET1)syntenic_region


This gene is orthologous to the following gene feature(s):

Feature NameUnique NameSpeciesType
NSK009510NSK009510Nannochloropsis salina (N. salina CCMP1776)gene


The following polypeptide feature(s) derives from this gene:

Feature NameUnique NameSpeciesType
NO04G03050.1NO04G03050.1-proteinNannochloropsis oceanica (N. oceanica IMET1)polypeptide


The following gene feature(s) are orthologous to this gene:

Feature NameUnique NameSpeciesType
jgi.p|Nanoce1779_2|578955gene_2076Nannochloropsis oceanica (N. oceanica CCMP1779)gene
Naga_100002g26gene1903Nannochloropsis gaditana (N. gaditana B-31)gene
Naga_100002g88gene1844Nannochloropsis gaditana (N. gaditana B-31)gene


The following mRNA feature(s) are a part of this gene:

Feature NameUnique NameSpeciesType
NO04G03050.1NO04G03050.1Nannochloropsis oceanica (N. oceanica IMET1)mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO04G03050 ID=NO04G03050|Name=NO04G03050|organism=Nannochloropsis oceanica|type=gene|length=16446bp
ATGTCCCCGGCTGGGGCAGGGCATGGTGGTGGCGATGATGGCAGCAGAAA
GGAGAGGGATGGTGGACTCCGCGTCAAGGAGTTGCGGAGCCAGCACTCGC
TGCGACACAAGCTACAAAAATGGCGCGGCACGCAATGGATGTCGCGTGGA
GCAGGGGAGATCAATCTGGACGAAAGTCAGTCCCTGAGCCTCTTGGATGT
CAGGGGAAGGATCAGGGCATGTGGGGCTGGTCTCTGGCAGCGGCGTTCGC
CCATGTGCATGGGGTTTTCGTTCATCTTTCTGGCTGTGACGATTATGATG
TGCGTGCTCCTCGTGGACGTTTTGCGCCTCATACCCTGTATGAATCCCAA
CTCCCCGTGCTTTAATCTGGCTCTGGCCGAGTTTCAGAACATTTGCAGTT
TTGAACGCATGCCGGTGCGCCTGACGACAACAGCCATGCTACCCCATATC
TACTCTCGCCTTCATGTCAAATCGGCAGTCATCGATGTGGCTCTGCGCGG
TGACCATGGGCCAGGGCACGTGGCTACCTCTTCCTTTACGGAAAAGGATG
CACTATCTTCAATGGTACTGAAGGGAGGGGAGCAGAATTTGACAGTCAAT
TCAATCATGGTGGTGACCAACACCACGGGGCTGGCCATCATGTTCGCCTA
CCTGTTGAACGAGATCGATTTCCAGGTGGCCATCACGGCCAAGATCCAGG
TTGTGGTGCAGACGCCTTTGCGAGTGCGCATAAAAGCCCCCATTAATTCA
AATTTCTTCCTGACTTGCGGTCACCTTCCCTGCAAGGAGTCCGTCTGCAA
GTACATTTGTCAATTTGGAAATTACCCTCTGAAAGACCCCGACCCCGGCA
ATCCTTACCCTTATGTGACCCGCGTGGAGAAGGTGCGCCTGGCTTACGAC
GAGGAAGGGGAATACATTATCTCCCCGCAAGTGGACGTGTGGCTGCGGGA
CTTCAAGGCTTCGGCTTCTTTCCCAAAATCTATTTTGGACATGTACTTCT
ACAACAGCACCAAGCCAGGGAGCATATCGGGATTTCGGGAGCGACACGAC
TTGGACGAGGACCACAAGCTTTTCCGGGTGACGTTGCATGAATTCTTGCT
GCAGAGCCTCGAAAGCTCGGGTGGCAAGCCCTTTAGGATGAAAATGGACT
TGCGGCTCTTAAACAACACGGCCGGTCAAGCTGCCACCGCACGGGACTTC
ACCACGCGCTTCCTGGGAAAGGAACCCATGTATATATACCTTGCGGCCAC
CCCTATTGATACCACCTGTCCTCAGCTCCGCAATCCACTGCTGATGGTTC
CACCAGCTGGTTTGTCCCTGAACACAAATTTGAGCACCAAGGACTCATCA
GACAGCAGCCTCAGCAACATGGGCGCCTTGACTGACGTTTTTCGTGTAAA
TTCCGTGGATCTTTTCAGCTTGAACAATACAATCCTGCGATCATTTGCCA
ACATTACGCTTATGCTGCCCTTTGCGCTCGAAGGCGAATTGCCTGCTCTG
GACATGGACGGATTTGCCGGTCCCGATTTGGTGGGCCATTTCAAGGCCTT
TCCAGTCACTCTACCCCCAGTAGTGCCGCGACAAGGATTTGAGCGGGACA
GGGCGCCTGCTCACACGTTCGGCAAGGAGGCCATGTCGTTGTTTGCGCCC
AAGGGGGAGGCGACAGTTGTATTATTTACTGAGGCTGAGCTTTCACAATG
GGGTATAGAAAAGCTGGAGACGATTTTCCCGAATGTGTTGTCGCAGTTGG
AGGAGCTAGACATAGACTTGGTGGGTTCTCGACCCGCTGAGCCGGGGAAT
GTGATGCAGACTTTTTTGAATGGTGTGGTAATCAAACTAGCCAGCCTTGA
GGACCGGCTGATCCAGCACGGGGAGGCAAATGAGGACCGGATGTTCTTCA
CGCCGGCCGACAGCCCGATCAAGCCCCGCGTGCACATGGAGGTAAAATCC
ATCCCCTATGTGCACAGCAACGACACGGGCCTGCTGATATATTTAAATTT
CACCTACCCTGACGCCTTTCAAGGCCGCTTTGACATTCGCGGGGGATCTA
TGCATTTCTTGCTCGCCGACTGGAATTCGACGGCCCTGGCAACGGTCACT
GTGCTGGACTCCTTACTCTCCCGGGAGCTCACGCACCTGGAGGCAGAGGT
CTTCATCTCTGACAGGACACAGGCCGAGGCCGTGCATAACTTGATTGATG
CCCTCTCAAAAAAAATGCCCTCGGGTCTGCGCATTTCAGAGGGTGTAGTG
GACTCGGCTCCAGGGCTGAAACAGCGAGTGCCCATTTCATATACGGACGT
CGAGTGGGGATGTGCCGAGGACATAGTCGTGCATTGTCCGAACCTACGCC
CCTCGGAAACCCGAGCCTGTTTGAACCAAGCTTTGGACGGGCGAATCCTC
TCCCACCAATGCCTCAAAGTGCTGCTGCCCCGCGGAGAGCTGGATTTGGA
GATCCGTGACCTAGCCAAAGTGATGGATGTGGGCCGCGGCATCGCCAACG
GATTGTTGGAGCAGGTCAAAACCACGGCCAGCAAAGTCGCAGTAAATTTG
ATTGAGGTCTTGAGTCTAAGCACGCCAGAAGAAGCGATTGACCTGCAGGT
CATTTTAGAGGAGAATAAACTCGACATTGGCATGCCTGAAGGGACCCCTG
TAGAGGTGCGGGTTCGCGCCAATCTCTCCATTGGCATCTTTGATAGCATT
GACCTTACCTTTGTGGTTCCTCCTTTGGCCGTTGAGATTTCGACGCCTTT
GCCCAAAAACCTGGCTGAAAGACTAAAAGTGCCGGCCCCGGTTGGGGAAA
AGAAGGCTGTGAAAGCGACGAAAAGGGATCAATATGATGATGAAGAAGAG
GAAAAGAAGGATGATGATGATGATGATGATGATGATGATGATGATGATGA
TGATGATGATGGTGATGGTGATGATAAGGAGGGGGAGGAGGATGAGGTCA
AGGAGGGAGAGAAGGAAGAGGATGGGGGAACGGAGGAGGAGGAGGAGAAG
GAAGAGGGAGAGGGAGAGGAGGAGACACCAGTTACGATGGATTACAATGC
TGATGACAAGGAATATGCCGCGACAACGTCTCTCTTAACGCTAGTGCTTC
CCGCGATGGATAAGGAGTACGTGCACAGTTCCCTTTTGCTGGTGGACTCA
GCGGTTCGAATCAGTAATATTTACCATGCAATGTGCTGGTTAGAAGACAT
CATGGCCGACTTGACAGACAAGTACGTGATTGTCCACCCTGCGAGCACTG
GTGACGTGTGGTCCTACATAATGGCGCCTCTCCGACTTAAGATTGACATT
TTAACGGCCATCAACTCCACTTCAGGGCAACCTCCGTCGGACACGGACGA
GTCAGGTGCTCCAGCAACCTTTTTGCAGGAGAATATTAACGAGACTTGGA
GCAAAATTTTGTTTCGCGCCGCCTCCTTCCCGGAGCTAATTGTGACAGAG
TCCCAGCTGGCATTCTCTCAGGCTTGGAGCGAAGCTGTAGATGTACGTAT
CCCAGCGATGGGTATGTCTGTGTATACCTCTTTTGTGGAGGAGGGCGATC
GGCATTTGGTCCCGCTCTTGACTGATGATACGATTTGCTTGAAGGACGCG
CATGCTTGTCTTCTGGGTGATTTCGGGGCGGGCACCTTCCAAACCACGCG
CGGGGCTATGGTCATGGTGGACACCAATCTGAACATGACCGCGGAGGATG
GTGGGCGCTTTTGGGGGCACATTCTGAGTGATTACATGGGCGGGGCTCGC
GTCGGATTGTACCTGAAGACAAAAACTGGGGCGGCGGGCGATGATGGCAA
GGCACTCTTTGACATGGATGCTTCCATCATCCTGCCCAAGATGCCACCGC
TTAACATCCCCTTGAATGACATCGTGAGCCAGGTGGGGGAGATTACCCGA
CGTGCCTTACAGGAGAGCAGGCGGCAACTTTCTTTGGCGGAGATATGGTC
GTCTTTGGATCCTATCATTCTCTCCATGACGCAGCCCGTAAGTATCAAAG
TAGATACGGTCACGCTGGAGAATTTCGAGTCCAACACCATCGACCCCCTC
GACCCGGCCTCTATTGCCCTTGGTGGTGATTTACTGGCCCGCAGCATCGT
AACAATACCCGACATCCTTCAAGTCGACCTCCAAATCCCACCCTTGACCA
TGCAAATTCACAATGGCAACCTCAGCAAATTGCTCTTGCCGGCTGTCCCC
GGCACCGAGGGCGACGGCGTGACTGGTGGGCATTCCCTTATGCACTTACC
ATGGGCCTCGGCTCAAGTTTCGGAGTTTGTCTACGACAGTCAACGCGACA
ACACGTCCACTGTGAAGCTCCATATTGACATTTTGCGTGTCAAACAGGCG
CTTAAGCTGATGGAGGACGTGCTGATGGGCGGGAGGAATGTGACCATGAC
GGTAGTCGGGACTGAGGGGACCTCGAATTCGCTCTTCTCCCACATCTTCT
CGCACATGAAGTTCCAGTTGGACATGTTCTCCAAGCCCGTGCCTTGTGTT
ACTGCTGACAAGCCTGATGCCAACGACGATGACAGAGTGGCAACACCACT
CGGTGAGAATGCTTCCCACAAATCGGCTAGGACCATCCCCACGGAGCGCA
TATCCATATCGGCCCACCTCTCCTCCATGGGCACAGCGCTATGGGGGGAC
GCGGAAATTTTCATCCCGCAGCAGACGAATCCGCTGGTTATGACCGTACT
TGCGGGTGCCATGGAGCTGAGGATGGTATTAAAGGGTATTCCTGAGGCCA
CTGAGCCGCTTGCTTACATTTCCCTTACTCCAATGGTGCTAAGCCAAGAG
CGAGACATGCATGTGGAGGCGAAGATGGTGATGGAGCCGGCGAATGTGGA
GACGCTTCGCATCTTGGTGGCGCGCCTAAAGGCTCGGGAGGTGACGGAAG
TGGTGCTGAAGGGCACCGTAGGCAATGCACCCATAGGACGCTTCAAGACG
AGCCTGCTGCTGACGCCGGCATTGGTAGATAAGCTGGGCATATTCCCGGA
CGGCGGGGAGACATTTATCAAGGTTTCGCTCGACAAACTGCTTCGAAGTA
ATACTCCTTTCCGTTTTGACGATGTGGCTATTGTGGGCGGGGAGGACGGT
GTGGGCTCTTCTGTATCTTTGCCTTGTTTGCTCAAAGGGTGGTGCGATGG
TGGGGTCGGACCCGGAGGCCCAAAAGGAAATGCGGCAGgtatgaccacga
acttactggcgttggtctcggctactgtttcgggtctctcaaacattttg
ggcttacccgaggatttcctgggattgaaacttgacctaccgcgactagc
cttcaacgtggcggtggaccctacctctgaagacctgggcacggtcgttt
tggagccggtccggctgactcttcgggatggggccaacatctcgatgatg
gcgcaggcgaagctgacggatgctgatgggctgcaaggcactgtgtttag
tatttggcagcgcgcttttactgtgttcctcgttggcggcgccgacggga
gtgacaacgtcctgaactcagtcatccgcctcttaccaattgcaatcgac
gtgtcagccccagcagtgggagctaaggagtatgtagatagcctgcggac
gctacctacagtttgtgatgggcgttggaccatggaggagaccacggcct
caagcttcactgcccgcatcgacctgccgcacctggtgagtcccatacca
gtgatgatgcagcaattcactgctactgtctattatcacgacatggccat
cctgcgggccgacaccgcagatggcacattctttatgggacctgaaggga
gtactgaccatttgtccgttaccacactcacggcctcgccgggcgcaaga
gctgggacatgtgactacttgtcgacgccggagctctgcatcctagggga
agccctgggcaagcttatgacgtttggagattcaggcccttttgaggggg
agattttggtgacgtaccttaatccggcaaaacctagttcgatacagtct
ttgcgtatgcccattcagctctacggctcctccgtggatggtgcagatca
ataccccccgtattggggctggcatgattacacgcctcagccggccaagc
ccaaacccatccccgggtccagcctctcctgcacggcggcccgcgagctg
atccaggacatttacatcaatttcgaggagacggtagggggtagtctgcg
gttctgggagacagtggaggtagcaatgaaagtctttatggtcaacatct
tctcgttccctctggaggtctcccacctccgactcacaatgtttttccgg
gacccagacggcgtgttcgattccaggccctccctcatggcttacccgcc
gtcctacgattatagcttgttctacaaagtagacctgcccactccaggtt
tctttattgcgccgggagaagggaaatggacaccgatcctacggccgcgc
atgaatgaacgaaaactgatggaaagcttggcgaggctctttgacgaggt
ggttgttcacaagcgtttgtgtgtcgacattgtggactcggtcattgggg
tgaccattcgaagcaaggattatgttccgtcggaagagccgtttgttata
aatttgccagtctctatccgctccattccattttatagcgctgatgcttg
tgggacgacgccggccacagcggctccggcggcgaaattgcaacagttac
aggagctgaaagagcagaagcaccagcatttacataatatgaagcaattg
aaggagagcgaagaaggagatgaaccgtggaagaatattataggcggaca
gagagagactttcgacgaaagcgtggccactgacattggctcagacggcc
gaatgctgcgggtgaacccaggcacggccacagtactagaagcggagcgg
gatggtaggatcataaataaacacacccactcgcggacaaggaattacca
gtagatgaaggaatgactggtaaatgaaaaaagtgcatgtgatatagtaa
ttttgtcagaaaaaacaaatgatgaaattgagggtgatgatgcaagaaat
tctatttgaaggaacgtctggaagagcctcattacttgaaaccccaccta
ctcattcatctggtgaaaacacctcaaatgtatataaatagtgcgtacgt
gtctttaaaccgcaacaagaaaagagaacgaaaaagaaacccgagcttta
cttttcttttttggtacattttataaatctagatagaattttatttattt
cttctatgtgcccatcagcacaaatttgacaaggacagcaacgtggtcag
aggcgaacacaggcccatttcgttcaagcatgcctccacccggactgtcc
gaatccttgctttcctccgtaggccccgccccatgtaaacgcgcaccaat
gggccgaagacttttgcttcggaacaaaatgtagtctattctctttacag
ggtcgcacgtaggaaaggtgaaggaagctttagctagcctttcctgcacc
tttttctctcttcctctcacactccttacgcaggcatctttttctgtagc
gcctgcgaggccctctctcctcttgtaagtctcaccttcctttattccct
cttcctcctttagctcattttggttatctacctcgcacacctcggccggc
gtcggtacgaaaggatgcatctcatcccacgcgtcatggaaccctccttc
atgaagaacccgcaaactctcctcctcaggctcagcattaaggtccccca
tcaacaccacatgctcatgcccctgctctgcagcccacgcgactatactt
ccagcagccgcatcgcgcgcgggagaactcaaggaaaagtgcgtgacgta
cacgtccacaaatggaccccctggcagaggctccacttgcgcatgcaaca
ctatgcgttgatgcccgtcttgggggtcagaaaaatcccgccgcagcaaa
agataatctgtccgacgtactggaaaccgactcaggatggcaaccccttc
ctcctcccgtaacacatggccttggtgccctgcaaaagacatggccggct
gatacacatattggtaaggcgccgactgcccctgactcccgccttctttt
cccttattttcctcaccaacagctagggcccccagcaagtccagcaagtg
actcaatcccgaatgatatcccgcaggaccgaacgtctgatcatatctaa
cttcctgcagcccaactacgtccgcaccgctcgcccaaatgatcgaggca
aggtacgctagcctctcccagtacttggaccagcgcgttgaggcgtcagg
atacacccatgctgctggttggttatgccacatattataggagaggcacg
atatggaggggaactccttctttgtgcttgatgcggataggtcggaggga
tcaaaatgtgtatgctgcaagatggaaaggtgctgatggctatcgcgcga
aactgtgacgtgatgccgccgggagggcttatcctgctgcaactcgacat
gatagatgctacctgtagagggacgataccataatttttgtggttctacg
tcagggccgtacaccactgcctccatcaccagctgctcgcacattggccc
atcgatcgcaatctcacagtccaaccgccgtgcgcccaacaagatcacag
gtggtggtgatgctgctgctgctgctgctgccgccccttgctcactttct
tcagagtaactgtcgaaggtacattcctccaatgcaggtggaactgcgaa
gagaccttgttgctgtgcaaacgccaccatctcctctacttccccggacc
gaaactcttcttgaccatgaaagctggccgaggtgccgccataggccagg
ggcgaggagctgttatagaaagggaatagggaaacgacgtgtgctcgagt
gtgccatgtatagtcctgtgttcgtcggatgatgtaagagataatgtatt
gttgtgttgctacagcgacaaggggttagaaagatgtgtcgtgcgtgcag
gtcatcacttgaataaaagactctatattttgagtcgggggggggggaaa
gaaatcacggccctctcaattctgattgctgaaaaagcttgaactatgaa
catacattactggccccactagatgcgcgcacgtaccgtcatgttggccg
gcacctttgcctttcgccccgtgaaacggatcccacgcttagccgcagcg
agagtaggggtggctataaactggagaagagcgagcaaggctacgaccgt
agtgggtaatgatggggcagacaggccctcgaaagctccgcggggcctcc
tattctccaccatgagaagaaaggcgatgggcctgggaggcctgtctacc
ttgctgagaggggggagcgaaggcggccctggcaccagactgtcgtggtg
gcgcgtcctaatcgttgtacatgcggtggggatgtgtctgtatgaccgta
tgatgattgtgttaaaagaggaggtgtcaggtcatcttaccgctttccgc
gatgggttgataacaccgagcggtgtaccgctctcttccatatacatata
tacatttacatatttacatatacatatataatgtatgtattcccccgata
ctatcactgacggtgaggacaaggagactggctatggtacagcccttgta
cacagtcgtcgcgctgtgtgcattgggaagtgcaggacggtgtcccatcc
tcgcatcaaacctcgcgctcatagattaaaaaggtcttgtgcaatcacac
taccgcacaggcatccatctacgacctcgagcatggagggcgtatcggca
ttcatcaagaagcacatctactggtaccggctatggacaggtatgttttt
gttgtgcatgccgtgcacacgttggaccagtagcacttgttcatgtgtct
tccgtcttgtcctgtgatcaagcgcgaagtgcattcactactctgtcagg
gcgtagatttaaatgaatgcacgaagttcggtctgcacttcgactgggtg
tgaatgcattggacggagtgagggattacaacagatctggacgaactaca
cggatagaaaaataaattcgtgtctagctgttagtttttgcccaagataa
tttccatcttcagggattgcactcgtgcaattctcgctgagcacaaattt
gtttgagggacttatcggactggaagtactccacgagccatgttgtggga
atccgtgtgaaaaaattccatagactgagtgtgtgtgtgtatatgcatga
aatcctgcaagtttgacaggttgatgatattattttgcggcattgtatag
tgacacttgaaatgtgtctcgtccctgcctccctcctcgtttcctgttcc
ccaggcagtctgtcatggtggagatgggctccggaatcccaatctcgctt
ttttcaaacgtgaagacgctgaagctaacgtgtttacaaaactgcgacgg
tctgctcgtctcacggcataactggctgtcaaaacaatttagatccttgg
aaaacatgttcagtacactcgcaatggggcgtcacccaggtaatcgcatt
agtattcccagatggtcttaccgccagaaaattcgtccaacatgtgaccc
acacatgatatctttggatatggacgagttacacgatttgctgagacctc
ttgtccggggccttatttgcagccgtttctcccgaggtgatgctctgtag
ttttttttacactagctcaccctagatagtaatagaaagtggacacctac
ttttttaaaaggtcccacaatctgtgtatgagttcacttcatcctttgac
cttgtctacatacgttccctgaccgcatgaacacgtgctgtcgacggcgc
ttgcataaaaaagattttggggggtattgcaaagccctccctcctaccca
ccccaaggggtgtatgtcggcatgtctgccatggccataaggtcgtccac
aatgcaatcatttgtctgatttatttatgacatcgttcttcagtgtgtat
tctttcagtatgaggtgtttagcttgatttgaggcttagacattgcacgg
caggtaggcctgcaaggggacaggtcggcacttgcaccagcgcttcggtc
acctcccagccacctcggtggacccatccatcaaccaaattcattcattt
ctttgcagtctctgatttttaaattcatgaggaaccccagcttgagtgcg
gtctcataaaaccgttcacttcctgacttattcacaaatatgtcaatagt
ccctcccgcccggaggatgcccatatttggcattccccacttccggcagg
atacatttaccttttctcaccacacgcgcacgcatctcacacaggcacct
atatgctggcatggtgggagcagatacttttcagtaagtggtacgcgcgt
tatgcttgtctttgttcgcaaccttcgggctcaggccagtgccttttttc
gctctctatatttcctgaccccttctttctaccgccccaagtttgttgtt
gtgttcctatatttcacagtgctctttttccatccgcctttacccattgc
cgcttacagacgtgctttttatggctatcacggccccgtttttctactac
accgtcaagacagcccggccactcttccaatttgttaagtatcagttgat
gtgtgggtaagctaaggaactgctgatgcttgtcagattttattctcttt
aagtctcaaggttcatacgagcatgaaaatggtggtgttatgtacaggat
tcaaaatttagtgcgctcgggggaacaggcaagaagcacgcattgggaaa
agaaggaaaatttcacttaataaagggaaaatcggtatcggtaatcgtaa
ctagtaataagtggtaaaaaattgatgaagaacccacagtagttacagat
gtactttgcttgatgcaatggaaaagaacaaccaatttatcccaatactt
gtagcattattgagtcttttgtacgcaattctcaatagatatgtattgta
agtaaatctgtagctagtccccttgaaaagatttaaaacttactacaacg
ataggtacggcggaaggtttatcgcgcaccaatcaagtgttcttatcccc
cgaaagacggcctgcttccctagcaagcatgtactccctccacccttccc
tgctaacctcaaaatcgttcccaataatgggcattagctggccgtggtgg
tgggtcgtcatacaggatgtccggcgtgcggttaaaggcactgccgccat
gatgtagacccatcaacttattggtgtgcatgcaggtaataattgcagaa
ttgcttcacgggcaaaccatacatcgcctctacggggctcttgcattata
ttccgaagctcacccttggcggtaaaattgcgatgtcggttctccttcac
ataccataggtctttcgagcgaaatgacggcggcactggcaaataaggac
tcgtccgctatagaggtgtgcctcatgctatacagtaatccttttacctc
tggcgtgctgaccaacttctccagcatcgagcggtgaaagacgtgccaca
gggagcagcccaccattcgtgccatcttcatccgcgcccggtattaccgt
tcgtctcattttggcgcagattaaatgcgatgacgaatttatgcgctccc
ttctcgggcagtacataggtttggcggaactcaattggcactgtcgtggc
ggaactcactgatattatagtggctgcggatccagctcatgctaacggtg
ccacacgccgcagggacctccgctgtcgcaaggtagggtttaaaatactc
ggtgacagcgtcgacgacgggaatgtcactgcctgaaagattgaaaaata
gccccacttccgacgtgccgactggtggttgagccggagccaaagaagga
cctcagtgtaaacgagcgtcatgtctcaccacaagctgtagatggacgca
gtttcctccaggtggacgcggtcccagaccaagaccgtggtgaattaaag
agttgtggcagcaatgatggcaaggatgtcctttcgtcgttgagaatcct
tagctttctctgcctcagcctcagcaacagcagcagctgcatcagcagaa
ccagctgcaggaagagaggcgttcttctcctccacgtgcttgtcgaggtg
agtgatgaaatgatgatcttcggagaagagggcttgccactgggcctcaa
atttggagacgtcatcgtgtagcatcagcccaaagacgatagtagatgcc
taattctcctcttcctccttatccatgctgctcttttcgacggcccgagg
caccatagtcgatctctgcgccgcatgctgccggttcttgtgaaatgtca
caagaatctgtttgtccacggccgaccaaggccctaaggacgtccacgtg
tctggcacagtcacaaggttggacgatctgctgttgctgctggtgctgcc
gctgctgctgctgcttcgaagactactgatgctgctctggtaggactggg
caagagcacctctccatcatctagccaaacaagaaaaaatggcagcttgg
ccttgtaatagtcgagactgtcgtcgccatgtgtctttgcttgtttcaaa
agccggaaattgaggggttttggcggggcctaagcgtacatacccatcat
tcttttgtctgggaaaagggtggagagtaaagaaaaatattgtataagcg
atcggaaaaaaaagatcagagatagatcggaaagggggtcatagtctttg
agtctgagtgcgtgtagagactctaaagtaccttcacaactccagataaa
aatatggagaaggaaggggctctacaaagaaagcgtccagacacgcacct
acaccgcacactgtgtaatctcggactctagtcagtagttactcgtcgag
cgtctggcatcgaagttcttgtgtgccccgcccacaaatacagccgtcat
tgtcgagctccaggccaccgtgtcggcaacaggcttcttccgatggcgag
aacctccagcttcggagccgccatgggggaaacagtcggcaaatttacgt
ccctcaagacctcatcaataatgatgacctgctcgctcgttgcccgggct
gtggttcttgttgctgcactcctcaacatgtctgaggatcaggatgacga
cacaggcgaagtagaccccaatgctctgtccatgatttccaagggctgct
tatttcgtaaggtatacgagacgctgcacagttcaattatggcgacggca
tagaggcctcgtgggtgaagttgacggcagtaaaggcccaagtgtcgaat
cgaagacgcggcccccagtaggtcaagcgtcattcttactcgtctaattg
gttgaaattggttgtgctcgtcacggggcaggcctgcgtgagcgtggcat
ggtttatgtagagctggttagacgatcgtcgtagggcaccgatacaaatg
tcttcgtgtgtgtggatgagccgcgcgatgcgtcggccaaaaactcgtcc
tgtgatcgctattactcttcattcttttttcctttactaagtttgatatc
atcaaatatgtcacagatgactcccagaagattgaccaaccttcgggggt
accaaaaattcttcgcgtagtcaaatttcaacttcatggacccaaaatcg
gcgcacacacacacatttcccaaccaaactcgcccgacacagtcggtata
cctacaagaggccaacgaactgcagctaaataaccttgcccatctatctc
cccattcccccatctttttttgtccagCCTCGTTGCCGGCGAGCAAGATG
ATGTCCTCCTCCGCCTTCTGGCGGACGACCAAGGCCTCCGCCCGGGCTGC
CATAGCGCTTCCCATCCACAGACGATGTGCACGTACTTCTCTGGCCACGT
CGACCCGCATGACGGAGCCGCCGTTTGAGGCAGCCACGGCCGCTGACCAA
GACGCGATGGCGGCCTTTTGGATGCCCTTCACCTCGAACAAAACCTTTAA
GCAGTCCCCGCGCATCTTGACCAAGGCCAAGGGCATGCACTACTGGACTG
ACGAGGGCCAAAAGGTCCTAGATGGTACGGCCGGGCTGTGGTGCGTCAAC
GCAGGTCATTGTCACGAGCACATTGTGGGCGCCATCCAGAAGCAAGCGGC
CGCGATGGACTTCGCCCCGACATTCAACATGGGCCATCCTTTGGCCTTTG
AGTACGCGCACCGCCTCTGCAATGAGCTCATGCCGGGCAGGGGTCTGGAT
CAGGTCTTCTTTACCATGTGCGGATCCACGTCGGTGGACACAGCATTGAA
GATTGCATTAGCCTACCACAAGGCCCGCGGGGAAGGTGGGCGATACCGGC
TGATAGGCAGGGAACGCGCATATCACGGTGTGGGCTTTGGTGGCATTTCC
GTGGGTGGCATGGTGCCGAACCGGCGGACGTTTGGCCCCATGCTGCCTGG
GGTGGACCATTTACGCACCACTTACGACCGTGAGCATCAGGCGTTCTCGA
AGGGGCAGCCGGAGTGGGGAGCGCATTTGGCAGAGGATTTGGAACGTTTG
GTAGGGCTGCATGATTCTTCGACGATTGCCGCTGTGATTGTGGAGCCCGT
TTCTGGATCGGCGGGCGTGCTCGTGCCACCTCATGGGTACTTGAAGCGAC
TGCGCGAGATTTGTGATACCCATGGGATTTTGCTAATTTTTGACGAGGTC
ATCACTGCCTTCGGTAGGTTGGGAAAAGGGATGGGCGCCGAGTACTTTGG
CGTAGTGCCCGACATGGTGACGTGTGCGAAGGGCTTGACCAATGCAGCAG
TCCCGGCAGGGGCGACCTTCGTTCAGAAGCATGTGTATGATGCATTCATG
GATACGGCAAAGGCCCCGGGAATGATTGAGCTCTTCCACGGTTACACCTA
CTCGGGGCACCCTCTGGCGATGGCCGCTGGTCTTGCCACTTTGGATGTGT
ACCAACAGGAAGGGTTATTTGAACGGTCTGCATTGATGGCACCGTATTGG
GAGGATGCATTGCATGCATTGAAGGGGCATCCGAATGTCATTGACATTCG
GAATCTTGGGCTCATGGGCGCTGTGGAATTGGCCCCGCGGCCAGGTAAGC
CGGGTCAGCGGGCCTTCCAAATCTTCACTACGGCTTTAAATAAGGGCGTG
ATGGTGCGTGTGACGGGTGATACCATTGCGTTGTCTCCGCCTCTCATCAT
TGAGAAGAGTCAGGTTGACCAGATTGTCGGCACCTTGAGCCAGACGCTGC
ATGAGGTTGCGTAAGAAAGAATGTTGATGACGAAGGAGAACGGAAAGATG
GAGTGAGTTTGTAAATAAGGAGCCGAGACATTCAAAGACAAAGAAATAAG
ATACGAAGTTCTGAGAGGGACGAGATAAGAACGATTTTTAAATAAGGAGC
TGAGACATTCAAAGACAAAGAATATACGAAGTTCTGAGAGGGACGA
back to top

protein sequence of NO04G03050.1

>NO04G03050.1-protein ID=NO04G03050.1-protein|Name=NO04G03050.1|organism=Nannochloropsis oceanica|type=polypeptide|length=2225bp
MSPAGAGHGGGDDGSRKERDGGLRVKELRSQHSLRHKLQKWRGTQWMSRG
AGEINLDESQSLSLLDVRGRIRACGAGLWQRRSPMCMGFSFIFLAVTIMM
CVLLVDVLRLIPCMNPNSPCFNLALAEFQNICSFERMPVRLTTTAMLPHI
YSRLHVKSAVIDVALRGDHGPGHVATSSFTEKDALSSMVLKGGEQNLTVN
SIMVVTNTTGLAIMFAYLLNEIDFQVAITAKIQVVVQTPLRVRIKAPINS
NFFLTCGHLPCKESVCKYICQFGNYPLKDPDPGNPYPYVTRVEKVRLAYD
EEGEYIISPQVDVWLRDFKASASFPKSILDMYFYNSTKPGSISGFRERHD
LDEDHKLFRVTLHEFLLQSLESSGGKPFRMKMDLRLLNNTAGQAATARDF
TTRFLGKEPMYIYLAATPIDTTCPQLRNPLLMVPPAGLSLNTNLSTKDSS
DSSLSNMGALTDVFRVNSVDLFSLNNTILRSFANITLMLPFALEGELPAL
DMDGFAGPDLVGHFKAFPVTLPPVVPRQGFERDRAPAHTFGKEAMSLFAP
KGEATVVLFTEAELSQWGIEKLETIFPNVLSQLEELDIDLVGSRPAEPGN
VMQTFLNGVVIKLASLEDRLIQHGEANEDRMFFTPADSPIKPRVHMEVKS
IPYVHSNDTGLLIYLNFTYPDAFQGRFDIRGGSMHFLLADWNSTALATVT
VLDSLLSRELTHLEAEVFISDRTQAEAVHNLIDALSKKMPSGLRISEGVV
DSAPGLKQRVPISYTDVEWGCAEDIVVHCPNLRPSETRACLNQALDGRIL
SHQCLKVLLPRGELDLEIRDLAKVMDVGRGIANGLLEQVKTTASKVAVNL
IEVLSLSTPEEAIDLQVILEENKLDIGMPEGTPVEVRVRANLSIGIFDSI
DLTFVVPPLAVEISTPLPKNLAERLKVPAPVGEKKAVKATKRDQYDDEEE
EKKDDDDDDDDDDDDDDDDDGDGDDKEGEEDEVKEGEKEEDGGTEEEEEK
EEGEGEEETPVTMDYNADDKEYAATTSLLTLVLPAMDKEYVHSSLLLVDS
AVRISNIYHAMCWLEDIMADLTDKYVIVHPASTGDVWSYIMAPLRLKIDI
LTAINSTSGQPPSDTDESGAPATFLQENINETWSKILFRAASFPELIVTE
SQLAFSQAWSEAVDVRIPAMGMSVYTSFVEEGDRHLVPLLTDDTICLKDA
HACLLGDFGAGTFQTTRGAMVMVDTNLNMTAEDGGRFWGHILSDYMGGAR
VGLYLKTKTGAAGDDGKALFDMDASIILPKMPPLNIPLNDIVSQVGEITR
RALQESRRQLSLAEIWSSLDPIILSMTQPVSIKVDTVTLENFESNTIDPL
DPASIALGGDLLARSIVTIPDILQVDLQIPPLTMQIHNGNLSKLLLPAVP
GTEGDGVTGGHSLMHLPWASAQVSEFVYDSQRDNTSTVKLHIDILRVKQA
LKLMEDVLMGGRNVTMTVVGTEGTSNSLFSHIFSHMKFQLDMFSKPVPCV
TADKPDANDDDRVATPLGENASHKSARTIPTERISISAHLSSMGTALWGD
AEIFIPQQTNPLVMTVLAGAMELRMVLKGIPEATEPLAYISLTPMVLSQE
RDMHVEAKMVMEPANVETLRILVARLKAREVTEVVLKGTVGNAPIGRFKT
SLLLTPALVDKLGIFPDGGETFIKVSLDKLLRSNTPFRFDDVAIVGGEDG
VGSSVSLPCLLKGWCDGGVGPGGPKGNAAASLPASKMMSSSAFWRTTKAS
ARAAIALPIHRRCARTSLATSTRMTEPPFEAATAADQDAMAAFWMPFTSN
KTFKQSPRILTKAKGMHYWTDEGQKVLDGTAGLWCVNAGHCHEHIVGAIQ
KQAAAMDFAPTFNMGHPLAFEYAHRLCNELMPGRGLDQVFFTMCGSTSVD
TALKIALAYHKARGEGGRYRLIGRERAYHGVGFGGISVGGMVPNRRTFGP
MLPGVDHLRTTYDREHQAFSKGQPEWGAHLAEDLERLVGLHDSSTIAAVI
VEPVSGSAGVLVPPHGYLKRLREICDTHGILLIFDEVITAFGRLGKGMGA
EYFGVVPDMVTCAKGLTNAAVPAGATFVQKHVYDAFMDTAKAPGMIELFH
GYTYSGHPLAMAAGLATLDVYQQEGLFERSALMAPYWEDALHALKGHPNV
IDIRNLGLMGAVELAPRPGKPGQRAFQIFTTALNKGVMVRVTGDTIALSP
PLIIEKSQVDQIVGTLSQTLHEVA*
back to top
Synonyms
Publications