NO01G02870, NO01G02870 (gene) Nannochloropsis oceanica

Overview
Name: NO01G02870
Unique Name: NO01G02870
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 5397
Alignment location: chr1:742551..747947 -
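
The sequence length listed above can be cross-checked against the alignment location. The following is a minimal Python sketch, assuming the location string follows the `seqid:start..end strand` layout seen in this record and that the coordinates are inclusive (which is consistent with the listed length). The names `parse_location` and `span_length` are illustrative and not part of the database.

```python
import re


def parse_location(text):
    """Parse a location such as 'chr1:742551..747947 -' into its parts."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])?", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end, strand = match.groups()
    # A missing strand symbol is reported as '.', meaning "not stated".
    return seqid, int(start), int(end), strand or "."


def span_length(start, end):
    """Length of an inclusive coordinate range."""
    return end - start + 1


seqid, start, end, strand = parse_location("chr1:742551..747947 -")
print(seqid, start, end, strand, span_length(start, end))
```

For this record the computed span is 5397, matching the sequence length, and the trailing `-` is read as the minus strand.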

Properties
Property Name | Value
Description | Isoleucyl-tRNA synthetase

Alignments
The following features are aligned:
Aligned Feature | Feature Type | Alignment Location
chr1 | genome | chr1:742551..747947 -
Analyses
This gene is derived from or has results from the following analyses:
Analysis Name | Date Performed
GSE178672 | 2023-12-15
PRJNA933693 | 2023-10-26
PRJNA943449 | 2023-07-19
PXD016054 | 2021-01-14
PXD016699 | 2021-01-08
GSE149904 | 2020-10-10
NoIMET1_WT_HS | 2020-09-29
PXD010030 | 2020-03-16
GSE139615 | 2020-03-11
PRJNA241382 | 2020-03-11
BLASTP analysis of N. oceanica IMET1 genes | 2019-07-11
InterPro analysis for N. oceanica IMET1 genes | 2019-07-11
GO annotation for IMET1v2 genes | 2019-07-10
PXD008721 | 2019-04-30
PRJNA182180 | 2019-04-25
Gene prediction for N. oceanica IMET1 | 2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: Molecular Function
Term | Definition
GO:0002161 | aminoacyl-tRNA editing activity
GO:0004812 | aminoacyl-tRNA ligase activity
GO:0003723 | RNA binding
GO:0000166 | nucleotide binding
GO:0004822 | isoleucine-tRNA ligase activity
GO:0005524 | ATP binding
GO:0016787 | hydrolase activity
Vocabulary: Biological Process
Term | Definition
GO:0006418 | tRNA aminoacylation for protein translation
GO:0006428 | isoleucyl-tRNA aminoacylation
GO:0006412 | translation
GO:0006139 | nucleobase-containing compound metabolic process
Vocabulary: Cellular Component
Term | Definition
GO:0005737 | cytoplasm
Vocabulary: INTERPRO
Term | Definition
IPR009080 | tRNAsynth_1a_anticodon-bd
IPR009008 | Val/Leu/Ile-tRNA-synth_edit
IPR012340 | NA-bd_OB-fold
IPR036612 | KH_dom_type_1_sf
IPR013155 | M/V/L/I-tRNA-synth_anticd-bd
IPR002300 | aa-tRNA-synth_Ia
IPR004088 | KH_dom_type_1
IPR002301 | Ile-tRNA-ligase
Homology
BLAST of NO01G02870 vs. NCBI_GenBank
Match: EWM29120.1 (isoleucyl-trna synthetase [Nannochloropsis gaditana])

HSP 1 Score: 1739.9 bits (4505), Expect = 0.000e+0
Identity = 831/1070 (77.66%), Positives = 943/1070 (88.13%), Query Frame = 0
Query:  293 RQPLGRHRAVVMSRRAVSTFVMTTCLLLLQRSSTHVVWRRSVQTKSSSSMRGRG-GLGSPAFLLSASRLRHTGPIH-AKKQPDDPSDPANLYRDTVALPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSSLESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVVSHPELGEQAFVVAKDLVGKLAESVFKVQDKGGLTVLATLKGRDLVGLKYRHPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEEVGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDGATLMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGDSLSFPADVYLEGVDQHRGWFQSSLLTCVAATGMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDFDPNKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYARPTTSIFQAGWIRKDRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGDATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALSQLRDTASGAVLGIKRASGSKCSRCWYYGDLSPADAELPHICPRCSHAVQARGGLK 1361
            R+ L +    VMSR  + T+ + TCLL L RS   V WRR  Q  S   +R       + +F LS +RLR    +H AKKQ DDPSDP+N+YRDTV LP+S+FDQRANA+VREPQ+H FWE+E IY+ LHE  RK GAPKF LHDGPPYANGDLHIGHALNKILKDFINRYK+L+GFEVRYVPGWDCHGLPIELKVLQSLKQ +RQALTP++LR +A DFA ETV+RQR SFKRYG+ G+W +PY+TLQPEYEAAQ+R FGDMVLKGHIYRG KPVHWSPSSRTALAEAELEYPE+H+SKSIYV FPV+SLE C GAA L+ AHP G QALRVA+WTTTPWTIP+NLA+AVN E++YS+V+H ++  +A++VA DLV KLAE+VFKV  KGGL  LATLKGRDLVGL+YRHPL DRESQ++EGGDYITTE+GTGLVHTAPGHGQEDYLTG+KYGLPLLSPVDDAGRFTEE G RFEGKDVLGDGN+EV+QALNETG+LILMEPY+HKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAI+TV W+P+VGKKRIS MT SRSDWCISRQRSWGVPIPVFY+ +DG+ LM++KT++HLEGVFR HGSDAWWTM+  DLLP EY+E++ L+ +GTDTMDVWFDSGSSWAGV+K R D L FPADVYLEGVDQHRGWFQSSLLT VAATGMAPYK VVTHGFVLDEKGYKMSKSLGNVVDP+KVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVG NILKQ+GD+YRKLRNTARYL+GNLHDFDP++DAV YD+LPRLDKYILGRLS+M+QEVE+A++++QFSRA+QALQRFAV DLSNFY+DVAKDRLYIAA +D RRRTCQTTM+LLLEGMATAMSP+LPHMAEDIWQNLPYARPTTSIFQAGWI + R++PSYE++ W+AVLRLRDDVNKCMEA RREG+IGASLEAAVYVYA D EL  +LDGLMGD + ++PP +SN VDDLRFLLLASQVH+V +  EVT+ C+  L++  D ASGA +GI+RASG KCSRCWYYGDLSPADA  PHIC RC HAV ARGGLK
Sbjct:    3 RRLLPQATTSVMSR--LPTYALMTCLLFLHRSGAQVTWRRGAQCGSFIFLRNHHVKYRAVSFDLSTTRLRQN--LHSAKKQLDDPSDPSNIYRDTVTLPKSTFDQRANAVVREPQIHQFWETEGIYRKLHEGSRKKGAPKFVLHDGPPYANGDLHIGHALNKILKDFINRYKVLQGFEVRYVPGWDCHGLPIELKVLQSLKQADRQALTPISLRLKARDFALETVERQRASFKRYGIMGDWQQPYLTLQPEYEAAQIRAFGDMVLKGHIYRGLKPVHWSPSSRTALAEAELEYPEDHVSKSIYVGFPVTSLEGCPGAAALSTAHPAGAQALRVAVWTTTPWTIPANLAVAVNTEIDYSLVTHSDMPGEAYLVATDLVDKLAETVFKV-SKGGLERLATLKGRDLVGLRYRHPLIDRESQIVEGGDYITTETGTGLVHTAPGHGQEDYLTGIKYGLPLLSPVDDAGRFTEEAGERFEGKDVLGDGNLEVIQALNETGALILMEPYHHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIDTVTWLPDVGKKRISTMTASRSDWCISRQRSWGVPIPVFYHVDDGSILMDDKTISHLEGVFREHGSDAWWTMEVADLLPIEYRERASLYLRGTDTMDVWFDSGSSWAGVLKNRKDDLQFPADVYLEGVDQHRGWFQSSLLTSVAATGMAPYKAVVTHGFVLDEKGYKMSKSLGNVVDPIKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGGNILKQVGDAYRKLRNTARYLVGNLHDFDPDRDAVPYDSLPRLDKYILGRLSMMIQEVEDAFETFQFSRANQALQRFAVADLSNFYLDVAKDRLYIAASNDERRRTCQTTMRLLLEGMATAMSPVLPHMAEDIWQNLPYARPTTSIFQAGWIPEQRRFPSYENEEWSAVLRLRDDVNKCMEAARREGLIGASLEAAVYVYAHDPELCKILDGLMGDNSLKYPPTRSNGVDDLRFLLLASQVHIVSTAAEVTSKCEIVLNKGGDAASGATVGIRRASGKKCSRCWYYGDLSPADATFPHICSRCVHAVTARGGLK 1067          
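
The percentages in an HSP summary line are the match counts divided by the alignment length. The short Python sketch below recomputes them for the hit above. The regular expression and the function name `hsp_percentages` are illustrative assumptions and do not come from any BLAST tool.

```python
import re


def hsp_percentages(line):
    """Return {label: (matches, length, percent)} for 'label = n/m' fields."""
    stats = {}
    for label, n, m in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(n), int(m)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats


line = "Identity = 831/1070 (77.66%), Positives = 943/1070 (88.13%)"
for label, (matches, length, pct) in hsp_percentages(line).items():
    print(f"{label}: {matches}/{length} = {pct:.2f}%")
```

Run on the first hit, this reproduces the reported 77.66% identity and 88.13% positives over 1070 aligned columns.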
BLAST of NO01G02870 vs. NCBI_GenBank
Match: XP_002176534.1 (predicted protein [Phaeodactylum tricornutum CCAP 1055/1] >EEC50997.1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1])

HSP 1 Score: 1268.8 bits (3282), Expect = 0.000e+0
Identity = 598/984 (60.77%), Positives = 750/984 (76.22%), Query Frame = 0
Query:  378 PANLYRDTVALPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSSLESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVVSHPELGEQAFVVAKDLVGKLAESVFKVQDKGGLTVLATLKGRDLVGLKYRHPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEEVGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDG-ATLMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGDSLSFPADVYLEGVDQHRGWFQSSLLTCVAATGMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDFDP----NKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYARPTTSIFQAGWIRKDRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGDATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALSQLRDTASGAVLGIKRASGSKCSRCWYYG-DLSPADAELPHICPRCSHAVQA 1356
            P N Y  T+ LP+++F QRANA++REP+L   W+   +Y  L    +++GA +F LHDGPPYANGDLH GHALNKILKDFINR +IL G +V Y+PGWDCHGLPIELKVLQ++K  ER+ALTPVTLR++A +FAKETV++Q  +F+RYG++G++++PY+TLQPE+EAAQ+RVFG+M  KG+I+RGRKPVHWSPSSRTALAEAELEYPE H+SKSIYVAF V      + +  LA+ H    + L+VAIWTTTPWT+P+N+A+AVN EL YSVV H + G+   +VA DL   LA   F + +   L +LAT  G+DLVG  Y+HP++DR+S VL GGDYITTESGTGLVHTAPGHGQEDYLTG+K GL +LSPVDD G+FT E G R+ G  VL +GN+ ++ AL E GSL+  E Y HKYPYDWRTKKPTIFRATDQWFASV  FR  AL A+E V W+PEVGK RI    + R DWCISRQRSWGVPIPVFY+K  G   L+N+ TLAH++ +F  HGSD WW MD  +LLP +++ ++  W+KGTDTMDVWFDSGSSWAGV + R D L++PAD+YLEG DQHRGWFQSSLLT VA   +APYKTV+THGFVLDEKG+KMSKSLGNVVDP+KVIEGGNNKKQDPAYGADVLRLWV++ DYS DV +G NI+KQ  DSYRKLRNTARYLIGNL DF P    + +AVAYD LP +DK++LGRLS +L+ V  A D +Q+ RA Q L RFA  DLSNFY+DVAKDRLYI+A DD RRR+CQT +   LEG   A+SP+LPHMAEDIWQNLPY + T S+F+ GW      Y  ++   W  V  +RDDVNK +E+ R + ++GASL+AA Y+Y PD+E + +L+ L+GDA+   PP K+N VD+LR  L+ SQ+H+V++  +V   C+      RDT+SG ++G+ RA G+KC+RCW+Y  ++         +C RC+ A+ +
Sbjct:   57 PKNAYAPTIVLPETAFSQRANAVIREPELQALWKETNLYHKLSSQAKEAGAERFVLHDGPPYANGDLHCGHALNKILKDFINRKQILNGKQVHYIPGWDCHGLPIELKVLQTMKSKEREALTPVTLREKAAEFAKETVEKQSVAFQRYGIFGDFEKPYLTLQPEFEAAQIRVFGEMFQKGYIFRGRKPVHWSPSSRTALAEAELEYPEGHVSKSIYVAFNVD-----TPSEVLASFH-STEEPLKVAIWTTTPWTMPANMAVAVNPELSYSVVQHEKTGK--LLVATDLASSLARK-FDLPEDETLDILATFPGKDLVGTIYKHPMYDRKSPVLAGGDYITTESGTGLVHTAPGHGQEDYLTGLKNGLEILSPVDDVGKFTIEAGERYVGMRVLAEGNLAMIDALQEAGSLLKAEDYGHKYPYDWRTKKPTIFRATDQWFASVAGFRDQALDAVEKVKWVPEVGKNRIRSFVDGRGDWCISRQRSWGVPIPVFYDKATGKEVLLNKDTLAHVQTLFAEHGSDCWWKMDESELLPEKFRAEADKWQKGTDTMDVWFDSGSSWAGVAQSR-DELAYPADMYLEGSDQHRGWFQSSLLTSVANNNIAPYKTVLTHGFVLDEKGFKMSKSLGNVVDPLKVIEGGNNKKQDPAYGADVLRLWVANSDYSGDVLIGDNIIKQTFDSYRKLRNTARYLIGNLADFVPSDSTDSNAVAYDDLPSMDKWMLGRLSAVLRTVNEAMDDFQYQRAIQELLRFASADLSNFYLDVAKDRLYISAMDDARRRSCQTVLYACLEGFTKAISPILPHMAEDIWQNLPYQKSTDSVFEGGWPTNLMSYAEFDSQTWDLVRLVRDDVNKMLESARSDKLVGASLDAAAYIYVPDSEKKAILEKLVGDASLVAPPVKTNGVDELRTALMLSQIHLVENENDVREACEDKYVSSRDTSSGCIVGVGRAVGTKCARCWFYDEEVGNHSLTYADVCQRCNEAISS 1030          
BLAST of NO01G02870 vs. NCBI_GenBank
Match: CBJ33425.1 (Isoleucyl-tRNA Synthetase [Ectocarpus siliculosus])

HSP 1 Score: 1250.7 bits (3235), Expect = 0.000e+0
Identity = 596/989 (60.26%), Positives = 741/989 (74.92%), Query Frame = 0
Query:  388 LPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSSLESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVVSHPELGEQAFVVAKDLVGKLAESVFKVQDKGGLTVLATLKGRDLVGLKYRHPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEEVGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDGATLMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGDSLSFPADVYLEGVDQHRGWFQSSLLTCVAATGMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDFDPNKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYARP--TTSIFQAGWIRKDRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGDATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALS-QLRDTASGAVLGIKRASGSKCSRCWYYGDLSPADAELPHICPRCSHAVQARGGLKPAA--AQPAEVGA 1372
            LPQ+ F+QRANA+ REP+L  FW+ ERIY+ + +    +   K+ LHDGPPYANGDLHIGHALNKILKDFINRY++LRG +V YVPGWDCHGLPIELKVLQS+K  ER+ +TPV LR++A +FA+ TV +Q  SF+RYGVWG++++PY+TLQPEYEAAQ+ VFG M  +GHI+RGRKPV+WSPSSRTALAEAELEYP  H S S+Y AF V        A      + G    L V++WTTTPWT+P+NLA+AV+  + YSVV H  LG++  +VA DLVG ++  +  +++   L V+ T  G+DL+G KY+HPL DR S+V+ GGDYITTE+GTGLVHTAPGHGQEDY+TG KYGLPLLSPVDDAGRFT E G RF+G +VL +GN EV+ AL E G L+  E Y HKYPYDWRTKKPTIFRATDQWFASVD FR  AL+AIE V W+PEVG++RI+ M E R+DWCISRQRSWG+PIPVFYN E G  ++  +++ H+  +   HGSDAWW M+  DLLP   + ++  W +GTDTMDVWFDSGSSWAGV K R + L +PAD+YLEG DQHRGWFQSSLLT VAA G APYKTV+THGFVLDEKGYKMSKSLGN +DP +VIEGGNNKKQ PAYGADVLRLWVSSVDYS DVCVG  I+KQ  +SYRKLRNT RYL G+L DFDP KD+V Y+ LP LDKY+LG LS +L+EVENA+D YQF +ASQ LQRFA  DLSNFY+D AKDRLYI++ DD RRR+CQT +  +L  ++TAM+P+LPHMAED+WQ LP+       SIFQAGW     ++  +E + W  +  LR+DVNK +E  R   V+G+SLE  V++Y  D  L+  L+ L+GD T +  P  SN VDDLRF+L+ S++++V S   V + C + ++   +D+ SG V+G+ RA G+KC RCWY+ D   +  + P +C RC   V+  G   P A   +PA V A
Sbjct:   84 LPQTGFEQRANAVKREPELQAFWDEERIYERVLDD---NTGEKYVLHDGPPYANGDLHIGHALNKILKDFINRYQMLRGRKVGYVPGWDCHGLPIELKVLQSMKSNERRGMTPVQLRKKAAEFARNTVSKQSASFRRYGVWGDFEKPYLTLQPEYEAAQIEVFGKMFTEGHIFRGRKPVNWSPSSRTALAEAELEYPAGHTSTSVYAAFEVKE------ATPALEKYMGSGGGLAVSVWTTTPWTLPANLAVAVSETIRYSVVEHASLGDRKLLVATDLVGAVSAKI-GLEEGDTLKVVETFTGKDLIGTKYQHPLCDRVSEVVVGGDYITTETGTGLVHTAPGHGQEDYVTGQKYGLPLLSPVDDAGRFTVEAGDRFKGLNVLKEGNEEVLVALEEAGGLLAREAYQHKYPYDWRTKKPTIFRATDQWFASVDRFRDTALSAIEKVQWIPEVGQRRITTMVEGRNDWCISRQRSWGLPIPVFYNIESGEPMLTTESIEHIRAIVAEHGSDAWWEMEVADLLPPSLQGEADQWRRGTDTMDVWFDSGSSWAGVAKAR-EELDYPADLYLEGSDQHRGWFQSSLLTSVAANGHAPYKTVLTHGFVLDEKGYKMSKSLGNTLDPTEVIEGGNNKKQKPAYGADVLRLWVSSVDYSGDVCVGDQIIKQASESYRKLRNTLRYLTGSLFDFDPEKDSVPYEDLPSLDKYMLGMLSSVLEEVENAFDGYQFYKASQVLQRFATADLSNFYLDGAKDRLYISSSDDFRRRSCQTVLHTMLMSLSTAMAPILPHMAEDVWQTLPFKPEGGQRSIFQAGW--ATGRHAQHEAEKWGRLRTLRNDVNKALEQARGAKVLGSSLEGQVFIYCSDDGLKAELESLLGDETLKREPELSNGVDDLRFVLMVSKINLVSSTEAVDSACAEGMTVSAKDSESGCVVGVSRAPGNKCERCWYHCDSVGSHEDHPSLCSRCHGVVEGLGLAPPPAVETEPAAVSA 1059          
BLAST of NO01G02870 vs. NCBI_GenBank
Match: XP_002286281.1 (isoleucine-trna synthetase [Thalassiosira pseudonana CCMP1335] >EED95922.1 isoleucine-trna synthetase [Thalassiosira pseudonana CCMP1335])

HSP 1 Score: 1244.2 bits (3218), Expect = 0.000e+0
Identity = 622/1071 (58.08%), Positives = 788/1071 (73.58%), Query Frame = 0
Query:  303 VMSRRAVSTFVMTTCLLLLQRSSTHVVWRRSVQTKS-SSSMRGRGGLGSPAFLLSASRLRHTGPIHAKKQPDDPSDPANLYRDTVALPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSSLESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVVSHPEL---GEQAFVVAKDLVGKLAESVFKVQDKGGLTVLATLKGRDLVGLKYRHPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEE------VGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDGAT--LMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGD---SLSFPADVYLEGVDQHRGWFQSSLLTCVAAT-GMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDFDPNKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYARPT-TSIFQAGWIRKDRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGDATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALSQLRDTASGAVLGIKRASGSKCSRCWYYGDLSPADAELPH---ICPRCSHAV 1354
            +++ +   TF+M+  L L   +     ++ S +  S ++S RGR  + + +F  SA +                  P N+Y DT+ LPQ+ F+QRANAI+REP+L  +W S  +Y  L ++   + A +F LHDGPPYANGDLHIGHALNK+LKDFINR++IL+G +V YVPGWDCHGLPIELKVLQ++K  ER+ALTPVTLR++A  FAKETV++Q  SF+RYGV G++  PY+TL PEYEAAQ+RVFG+M   GHI+RGRKPVHWSPSS+TALAEAELEYPE H+SKSIYVA     L+  S +  L      G + L+VAIWTTTPWTIP+NLA+A+N +LEY VV   E    G+   +VA  LV K  ES F++ +   L V+AT  G  LVG  Y+HPL++R S V+ GGDYITTESGTGLVHTAPGHGQEDYLTG+KYGL LLSPVDD G+FT E      VG  F G  VLG+GN+ V++AL + G+LI  E Y HKYPYDWRTKKPTIFRAT QWFASV+ FR+DAL A++TV W+P+ GK RI    ESR DWCISRQRSWGVPIPVFY+KE G T  L++E TL H++ VF  HGSDAWW +DTVDLLP +YK+Q+  W KG+DTMDVWFDSGSSWAGV ++R D    L +PAD+YLEG DQHRGWFQSSLLT VAA  G APYK+V+THGFVLDEKG+KMSKSLGNVV+P++VIEGGNNKK +PAYGADVLRLWV+SVDY+ DV VG+NI+KQ  +SYRKLRNTARYLIGNL DF+P  D++ Y+ LP +DK++LG LS +L EV++A   YQFSRA+Q L RF+ +DLSNFY+DVAKDRLYI+A +D RRR+CQT +  LLEG A +++P+LPHMAEDIWQNLPY + + +S+F+ G   K   YP+++ D W  V  +R DVN+ +E  RR+ ++GASL++A Y+Y  D ++R VL+GL GD +   P  K+N VD+LR  L+ SQV++VDS   +T+ CD A    +   SG ++G+K+A G KC RCW+Y D       L +   +C RC  A+
Sbjct:    3 ILAPQMKKTFIMSLLLALALETKLAASFQPSARLMSTATSSRGR-SIATRSF--SALQAEXXXXXXXXXXXXXXXAPKNVYADTIILPQTDFNQRANAIIREPELQQYWSSTNLYSKLSKAAAANSAERFILHDGPPYANGDLHIGHALNKLLKDFINRHQILKGKQVHYVPGWDCHGLPIELKVLQTMKSKEREALTPVTLREKAASFAKETVEKQSASFQRYGVIGDFGNPYLTLLPEYEAAQIRVFGEMYKAGHIFRGRKPVHWSPSSKTALAEAELEYPEGHVSKSIYVA-----LDVVSPSEELKEYMEEG-EKLKVAIWTTTPWTIPANLAVAINPDLEYCVVDTHESVLDGKSKLLVATGLV-KTLESKFQLPEGEHLKVVATFPGSTLVGTTYQHPLYERTSPVVAGGDYITTESGTGLVHTAPGHGQEDYLTGLKYGLELLSPVDDVGKFTVEAGSSTVVGDAFVGLSVLGEGNLAVIEALEQAGALIRAENYGHKYPYDWRTKKPTIFRATAQWFASVEGFREDALKAVDTVKWIPDTGKNRIRSFVESRGDWCISRQRSWGVPIPVFYDKETGGTEILLDEDTLNHIQSVFAEHGSDAWWKLDTVDLLPEKYKDQADKWTKGSDTMDVWFDSGSSWAGVTQQRADEDMGLGYPADIYLEGSDQHRGWFQSSLLTSVAANKGQAPYKSVLTHGFVLDEKGFKMSKSLGNVVNPLQVIEGGNNKKLEPAYGADVLRLWVASVDYAGDVRVGANIIKQTFESYRKLRNTARYLIGNLADFNPATDSIPYEDLPSMDKWMLGTLSSVLNEVDDAMSQYQFSRATQELLRFSTSDLSNFYLDVAKDRLYISAVNDARRRSCQTVLHSLLEGFAKSIAPILPHMAEDIWQNLPYKKESDSSVFEGGIPEKLTSYPAFDSDKWDLVRDVRTDVNQVLELARRDKLVGASLDSAAYIYTADEKMREVLNGLDGDESLITPSVKTNGVDELRTALMISQVNLVDSEDAITSACD-ATYIAKGELSGCMIGVKKADGVKCGRCWFY-DKEVGKHGLRYGEDLCQRCDDAI 1061          
BLAST of NO01G02870 vs. NCBI_GenBank
Match: EJK73254.1 (hypothetical protein THAOC_05131 [Thalassiosira oceanica])

HSP 1 Score: 1241.5 bits (3211), Expect = 0.000e+0
Identity = 605/997 (60.68%), Positives = 748/997 (75.03%), Query Frame = 0
Query:  378 PANLYRDTVALPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSSLESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVV-SHPEL--GEQAFVVAKDLVGKLAESVFKVQDKGGLTVLATLKGRDLVGLKYRHPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEE------VGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDGAT--LMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGD---SLSFPADVYLEGVDQHRGWFQSSLLTCVAAT-GMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDFDPNKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYAR--PTTSIFQAGWIRKDRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGDATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALSQLRDTASGAVLGIKRASGSKCSRCWYYG-DLSP-ADAELPHICPRCSHAVQA 1356
            P N Y DT+ LPQ+ F QRANAI REP+L ++W+S  +Y  L        A +F LHDGPPYANGDLHIGHALNK+LKDFINR++IL+G +V YVPGWDCHGLPIELKVLQ++K  ERQ LTP+ LR+RA  FAKETV++Q  SF+RYGV G++D PY+TL PEYEAAQ+RVFG+M  +G I+RGRKPVHWSPSSRTALAEAELEYPE H+SKSIYVA  V S       +     H G    L+VAIWTTTPWTIP+NLA+A+N ELEY VV +HP +  G+   +VAK LV  L E+ F + D   L V+A+  G  LVG  Y+HPL++R+S V+ GGDYITTESGTGLVHTAPGHGQEDYLTG+K GL LLSPVDD GRFT E      VG    GK VLG+GN+ V++AL E G+L+  E Y HKYPYDWRTKKPTIFRATDQWFASV+ FR DAL A+++V W+P+VGK RI+   ESR DWCISRQRSWGVPIPVFY+KE G T  L++E TL H++ +F   GSDAWW +D VDLLP +YK ++  W KGTDTMDVWFDSGSSWAGVV+ER +    LS+PAD+YLEG DQHRGWFQSSLLT VAA  G APYK V+THGFVLDEKG+KMSKSLGNVV+P++VIEGGNN+K +PAYGADVLRLWVSSVDY+ DV VGSNI+KQ  +SYRKLRNTARYLIGNL D++P+ DA+ YD LP +DK++LG L+ +L EV++AY +YQFSRA+  + RFA  DLSNFY+DVAKDRLYI+A DD RRR+CQT +  LLEG A + +P+LPHMAEDIWQNLPY      +S+F+ G   +   Y  ++ + W  +  +R D N+ +E  R++ ++GASL+AA Y+Y  D  +R VL  L GD     P  K+N VD+LR  L+ SQVH+VDS ++VT+ CD   +  +   SG  +G+K+A G KC RCW+Y  ++ P        +C +C +A+ +
Sbjct:   59 PKNAYADTILLPQTDFSQRANAIKREPELQEYWKSIDLYSKLSSDAESRSAERFVLHDGPPYANGDLHIGHALNKLLKDFINRHQILKGKQVHYVPGWDCHGLPIELKVLQTMKSKERQGLTPIALRERAATFAKETVEKQSASFQRYGVIGDFDNPYLTLLPEYEAAQIRVFGEMYKQGFIFRGRKPVHWSPSSRTALAEAELEYPEGHVSKSIYVALDVVS------PSEELQQHVGS-GNLKVAIWTTTPWTIPANLAVAINPELEYCVVDAHPNVLDGQSKLLVAKGLVESL-ETKFDLSDGDNLKVIASFNGSSLVGTSYQHPLYERQSPVIAGGDYITTESGTGLVHTAPGHGQEDYLTGLKNGLELLSPVDDVGRFTVEAGSSTIVGDDLVGKSVLGEGNIAVIEALEEAGALLKAEDYGHKYPYDWRTKKPTIFRATDQWFASVEGFRDDALKAVDSVQWIPDVGKNRINSFIESRGDWCISRQRSWGVPIPVFYDKETGGTQVLLDEDTLEHIQSIFAEQGSDAWWKLDVVDLLPNKYKSEADKWTKGTDTMDVWFDSGSSWAGVVQERANKEKGLSYPADIYLEGSDQHRGWFQSSLLTSVAANKGQAPYKAVLTHGFVLDEKGFKMSKSLGNVVNPLQVIEGGNNRKLEPAYGADVLRLWVSSVDYAGDVRVGSNIIKQTFESYRKLRNTARYLIGNLDDYNPSTDAIPYDDLPSMDKWMLGTLTKVLNEVDDAYSNYQFSRATNEILRFATADLSNFYLDVAKDRLYISAIDDFRRRSCQTVLHKLLEGFAKSFAPILPHMAEDIWQNLPYKAEGDNSSVFEGGVPAELLSYSEFDKERWELIRDVRTDANQVLELARQDKLVGASLDAAAYIYTEDEIIRKVLGELDGDEHIISPAVKTNGVDELRTALMISQVHLVDSELDVTSACDSTYT-AKGELSGCTIGVKKAEGKKCGRCWFYDTNVGPHGKCHGDDLCQKCVNAIDS 1046          
BLAST of NO01G02870 vs. NCBI_GenBank
Match: XP_005832559.1 (hypothetical protein GUITHDRAFT_71180 [Guillardia theta CCMP2712] >EKX45579.1 hypothetical protein GUITHDRAFT_71180 [Guillardia theta CCMP2712])

HSP 1 Score: 1241.5 bits (3211), Expect = 0.000e+0
Identity = 598/1001 (59.74%), Positives = 751/1001 (75.02%), Query Frame = 0
Query:  363 TGPIHAKKQPDDPSDPANLYRDTVALPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSSLESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVVSHPELGEQAFVVAKDLVGKLAESVFK-VQDKGGLTVLATLKGRDLVGLKYRHPLF-DRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEEVGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDGATLMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKE--RGDSLSFPADVYLEGVDQHRGWFQSSLLTCVAATGMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDFDPNKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYARPTTSIFQAGWIRKDRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGDATF--QHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALS-QLRDTASGAVLGIKRASGSKCSRCWYYGDLSPADAELPHICPRCSHAVQAR 1357
            T P  +KK+  +  +   +Y  TV LP ++F QRAN++VREP++  FWE  RIY+ L+ES   +   KF LHDGPPYANG+LHIGHALNKILKDFINRYK++ G +V++VPGWDCHGLPIELKVLQSLK+ E++ +TP++LR++A +FAK+TVD+QR SFKRYGVW +W EPY+TL P+YEAAQ+  FG M+  GHIYRGRKPV+WSPSSRTALAEAELEYPE H S+S+Y AF V             A  P   + L+VAIWTTTPWTIP+NLA+AVN +LEYS+V H  +    +VVAKDL G LA  + K  +D+  L  +A + G++L G KY+HPL+ +R + V+ GGDYITTESGTGLVHTAPGHG EDY TG+KYGL LLSPVDDAGRFT E GP  EGKDVL DGN  V++ + E G+LI  E Y HKYPYDWRTKKPTIFRATDQWFASV++FR+ A+ AI+TV W+PEVGK RIS M E RSDWCISRQRSWGVPIPVFY+KE G  L+  +++ H+  +   HGSDAWW  D  +LLP  +K+++  WE+G DTMDVWFDSGSSW GVVK    G +L FPAD+YLEG DQHRGWFQSSLLT VAA G APYKTV+THGFVLDEKG+KMSKSLGNVVDP  VI GG N+K +PA+GAD LRLWV+SVDY+ DV VG NI+KQI DSYRK+RNT RYL+GNLH FDP K +V Y+ LP +DK++LGRL+ + +EV +AYD++QF RA QA+ +  +T+LSNFY+D+AKDRLYI+A DD RRR+CQT +   LE + TA+SP+LPHMAED WQNLP+    +S+F+ GW  K   YPS+E + W  +  LRDDVN+CMEA R + +IGA+LE +V +Y  D E + ++  L+ + +          N VDDLRFL L S V +VD+  +VT+ C++  +  + D+ASG  +G+ RA G KC RCW+Y D       L +ICPRC+ AV+ +
Sbjct:    4 TSPKGSKKKGGN-DEEEGIYSSTVNLPVTNFQQRANSVVREPEIQKFWEENRIYEKLYES---NPGTKFVLHDGPPYANGNLHIGHALNKILKDFINRYKVISGHKVKFVPGWDCHGLPIELKVLQSLKKEEKENMTPISLRKKAAEFAKQTVDQQRESFKRYGVWADWSEPYLTLDPKYEAAQIETFGAMLKGGHIYRGRKPVNWSPSSRTALAEAELEYPEGHKSRSMYAAFTV--------VEPSEAVKPHS-ENLKVAIWTTTPWTIPANLAVAVNEKLEYSIVEHHGV---KYVVAKDLKGALAAKLKKGEEDEVELKEIAVITGKELAGTKYQHPLYKERINPVVIGGDYITTESGTGLVHTAPGHGVEDYQTGLKYGLELLSPVDDAGRFTAEAGPLLEGKDVLKDGNERVLELMEEAGALIKEEEYGHKYPYDWRTKKPTIFRATDQWFASVENFREKAMGAIDTVKWIPEVGKNRISAMVEGRSDWCISRQRSWGVPIPVFYHKETGEPLLTAESIDHIRDIIAEHGSDAWWEKDVAELLPPSHKQEADKWERGRDTMDVWFDSGSSWNGVVKSWGEGKALDFPADMYLEGSDQHRGWFQSSLLTSVAAQGTAPYKTVLTHGFVLDEKGFKMSKSLGNVVDPALVINGGKNQKTEPAFGADTLRLWVASVDYTGDVRVGGNIMKQISDSYRKIRNTLRYLLGNLHGFDPKKHSVKYEDLPSVDKWMLGRLARVQEEVRDAYDTFQFYRAYQAILQVCITELSNFYLDIAKDRLYISAEDDVRRRSCQTVLHAALEMLTTAISPMLPHMAEDAWQNLPWEH-VSSVFEHGW--KQLDYPSHEGERWDYIRSLRDDVNQCMEAARADKLIGATLEGSVAIYVADEEKKQLVTSLLHEVSMLTSETSPGFNKVDDLRFLFLVSDVKIVDTPEDVTSLCNEKHTLSISDSASGCFVGVARAQGKKCERCWFYSDTVGTHEGLENICPRCAQAVKQK 985          
BLAST of NO01G02870 vs. NCBI_GenBank
Match: GAX16043.1 (isoleucyl-tRNA synthetase [Fistulifera solaris])

HSP 1 Score: 1235.3 bits (3195), Expect = 0.000e+0
Identity = 594/985 (60.30%), Positives = 732/985 (74.31%), Query Frame = 0
Query:  380 NLYRDTVALPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAF----PVSSLESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVVSHPELGEQAFVVAKDLVGKLAESVFKVQDKGGLTVLATLKGRDLVGLKYRHPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEEVGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDG-ATLMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGDSLSFPADVYLEGVDQHRGWFQSSLLTCVAATGMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDF----DPNKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYARPTTSIFQAGWIRKDRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGDATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALSQLRDTASGAVLGIKRASGSKCSRCWYYGDLSPADAELPH--ICPRCSHAV 1354
            N++  T+ LP + F QRANA+VREP+L  +W+   +Y  L E    SG  +F LHDGPPYANGDLH GHALNKILKDF+NR ++L G +V Y+PGWDCHGLPIELKVLQ++   ER  LTPVTLRQ+A  FAKETV++Q  +F+R+G++G++++PY+TLQPE+EAAQ+RVFG+M  KG+I+RGRKPVHWSPSSRTALAEAELEYP+ H+SKSIYVAF    P  +LE    A             L+VAIWTTTPWT+P+NLA+AVN++L YSVV H + G+   +VA DL   LA+  F + +     V+ T +G DLVG  Y+HPL++R+S VL GGDYITTESGTGLVHTAPGHGQEDYLTG+K GL +LSPVDD G+FT + G +F G  VL +GN  ++ AL ETG+L+  E Y HKYPYDWRTKKPTIFRATDQWFASV+ FR+ AL AI+TV W+P+VGK RI+     R DWCISRQRSWGVPIPVFY++  G   L+NE TL H++ +F  HGSD WWTMD  DLLP  YK ++  W KGTDTMDVWFDSGSSWAGV + R + L++PAD+YLEG DQHRGWFQSSLLT VA    APYKTV+THGFVLDEKG+KMSKSLGNVVDPM+VI+GGNN+K +PAYGADVLRLWV++ DYSSDV +G NI+KQ  +SYRKLRNTARY+IGNL DF     P  +AV YD LP +DK++LGRLS +L+ V+ A + YQF RA Q + RFA  DLSNFY+DVAKDRLYI+  DD RRR+CQT +Q  LEG A AMSPLLPHMAEDIWQNLPY     S+F+ GW      YP +  D W  V  LRDDVNK +EA R + ++GASL+ A +VY  D   R +L+ L+GD     PP K+N VD+LR  L+ SQV +V S   +   CD+     +DTASG ++G+++ASG KC RCW+Y D      +LPH   C RC+ A+
Sbjct:   49 NIFASTIILPDTPFSQRANAVVREPELQAYWKESGLYHKLSEQNAASG--RFVLHDGPPYANGDLHCGHALNKILKDFVNRKQLLNGKQVHYIPGWDCHGLPIELKVLQTMSSKERSGLTPVTLRQKAAAFAKETVEKQSVAFQRFGIYGDFEKPYLTLQPEFEAAQIRVFGEMFKKGYIFRGRKPVHWSPSSRTALAEAELEYPDGHVSKSIYVAFNVDKPSQALEKYHSAQ----------DPLKVAIWTTTPWTMPANLAVAVNSDLSYSVVYHEKTGK--LLVATDLAETLAKK-FALPEGEVFDVMGTFEGSDLVGTTYQHPLYERKSPVLAGGDYITTESGTGLVHTAPGHGQEDYLTGLKNGLEILSPVDDVGKFTADAGEKFAGLSVLAEGNQAIIDALAETGALLKAEDYGHKYPYDWRTKKPTIFRATDQWFASVEGFRESALKAIDTVKWVPDVGKNRITAFVSGRGDWCISRQRSWGVPIPVFYDRATGKEVLLNEDTLQHIQELFVKHGSDCWWTMDEKDLLPESYKPEAEKWVKGTDTMDVWFDSGSSWAGVAQSR-EELAYPADMYLEGSDQHRGWFQSSLLTSVANNDQAPYKTVLTHGFVLDEKGFKMSKSLGNVVDPMQVIKGGNNQKLEPAYGADVLRLWVANCDYSSDVSIGPNIIKQTFESYRKLRNTARYMIGNLADFVPEGQPGSNAVPYDDLPSMDKWMLGRLSAVLKLVDEAMNEYQFQRAVQEILRFATADLSNFYLDVAKDRLYISGIDDFRRRSCQTVIQKCLEGFAKAMSPLLPHMAEDIWQNLPYETRHESVFEGGWPADLAAYPEFAVDEWDLVRELRDDVNKVLEAARTDKLVGASLDGAAFVYVADDRKRAILEKLVGDDNLISPPVKTNGVDELRTALMLSQVSLVGSETALKEACDEKYISGKDTASGCIVGVRKASGEKCGRCWFY-DEQIGKHDLPHADACQRCNEAI 1016          
BLAST of NO01G02870 vs. NCBI_GenBank
Match: GAX15769.1 (isoleucyl-tRNA synthetase [Fistulifera solaris])

HSP 1 Score: 1230.3 bits (3182), Expect = 0.000e+0
Identity = 590/981 (60.14%), Positives = 725/981 (73.90%), Query Frame = 0
Query:  380 NLYRDTVALPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSSLESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVVSHPELGEQAFVVAKDLVGKLAESVFKVQDKGGLTVLATLKGRDLVGLKYRHPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEEVGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDG-ATLMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGDSLSFPADVYLEGVDQHRGWFQSSLLTCVAATGMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDF----DPNKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYARPTTSIFQAGWIRKDRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGDATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALSQLRDTASGAVLGIKRASGSKCSRCWYYGDLSPADAELPH--ICPRCSHAV 1354
            N++  T+ LP + F QRANA+VREP+L  +W+   +Y  L +    SG  +F LHDGPPYANGDLH GHALNKILKDF+NR ++L G +V Y+PGWDCHGLPIELKVLQ++   ER  LTPVTLRQ+A  FAKETV++Q  +F+R+G++G++D+PY+TLQPE+EAAQ+RVFG+M  KG+I+RGRKPVHWSPSSRTALAEAELEYP+ H+SKSIYVAF V         +     H      L+VAIWTTTPWT+P+NLA+AVN++L YSVV H + G+   +VA DL   LA+  F + +     V  T +G DLVG  Y+HPL++R S VL GGDYITTESGTGLVHTAPGHGQEDYLTG+K GL +LSPVDD G+FT + G +F G  VL +GN  ++ AL ETG+L+  E Y HKYPYDWRTKKPTIFRATDQWFASV  FR  AL AI+TV W+P+VGK RI+     R DWCISRQRSWGVPIPVFY++  G   L+NE TL H++ +F  HGSD WWTMD  DLLP  YK ++  W KGTDTMDVWFDSGSSWAGV + R + L++PAD+YLEG DQHRGWFQSSLLT VA    APYKTV+THGFVLDEKG+KMSKSLGNVVDPM+VI+GGNN+K +PAYGADVLRLWV++ DYSSDV +G NI+KQ  +SYRKLRNTARY+IGNL DF     P  +AV YD LP +DK++LGRL+ +L+ V+ A + YQF RA Q + RFA  DLSNFY+DVAKDRLYI+  DD RRR+CQT +Q  LEG A AMSPLLPHMAEDIWQNLPY     S+F+ GW      YP +  D W  V  LRDDVNK +EA R + ++GASL+ A +VY  D   R +L+ L+GD     PP K+N VD+LR  L+ SQV +V S   +   C++     +DTASG ++G+++ASG KC RCW+Y D       LPH   C RC+ A+
Sbjct:   49 NVFASTIILPDTPFSQRANAVVREPELQAYWKESGLYHKLSKQNAASG--RFVLHDGPPYANGDLHCGHALNKILKDFVNRKQLLNGKQVHYIPGWDCHGLPIELKVLQTMSSKERSGLTPVTLRQKAAAFAKETVEKQSVAFQRFGIYGDFDKPYLTLQPEFEAAQIRVFGEMFKKGYIFRGRKPVHWSPSSRTALAEAELEYPDGHVSKSIYVAFNVDK------PSQALVEHHSAQDPLKVAIWTTTPWTMPANLAVAVNSDLSYSVVFHEKTGK--LLVATDLAETLAKK-FALPEGEVFDVRGTFQGSDLVGTTYQHPLYERRSPVLAGGDYITTESGTGLVHTAPGHGQEDYLTGLKNGLEILSPVDDVGKFTADAGEKFVGLSVLAEGNQAIIDALAETGALLKAEDYGHKYPYDWRTKKPTIFRATDQWFASVQGFRDSALKAIDTVKWVPDVGKNRITAFVSGRGDWCISRQRSWGVPIPVFYDRATGKEVLLNEDTLQHIQELFVKHGSDCWWTMDEKDLLPESYKHEAEKWVKGTDTMDVWFDSGSSWAGVAQSRNE-LAYPADMYLEGSDQHRGWFQSSLLTSVANNDQAPYKTVLTHGFVLDEKGFKMSKSLGNVVDPMQVIKGGNNQKLEPAYGADVLRLWVANCDYSSDVSIGPNIIKQTFESYRKLRNTARYMIGNLADFVPEGQPGSNAVPYDDLPSMDKWMLGRLTAVLKLVDEAMNEYQFQRAVQEILRFATADLSNFYLDVAKDRLYISGIDDFRRRSCQTVIQKCLEGFAKAMSPLLPHMAEDIWQNLPYETSHDSVFEGGWPADLAAYPEFAVDEWDLVRELRDDVNKVLEAARNDKLVGASLDGAAFVYVADDRKRAILEKLVGDENLISPPVKTNGVDELRTALMLSQVSLVGSETALKEACEEKYISGKDTASGCIVGVRKASGEKCGRCWFY-DEQIGKHNLPHGDACQRCNEAI 1016          
BLAST of NO01G02870 vs. NCBI_GenBank
Match: CEL95501.1 (unnamed protein product [Vitrella brassicaformis CCMP3155])

HSP 1 Score: 1183.3 bits (3060), Expect = 0.000e+0
Identity = 576/1021 (56.42%), Positives = 736/1021 (72.09%), Query Frame = 0
Query:  380 NLYRDTVALPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSSLESCSGAATL----AAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVVSHPELGEQAFVVAKDLVGKLAESVFKVQD--KGGLTVLATLKGRDLVGLKYRHPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEEVGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDGATLMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGDSLSF--PADVYLEGVDQHRGWFQSSLLTCVAATGMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDFDPNKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPY----ARPTTSIFQAGWIRKDRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGD------ATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALSQLRDTASGAVLGIKRASGSKCSRCWYYGD------LSPADAELP----HICPRCSHAVQARGGLKPAAAQPAEVGAP 1373
            N + D++ LPQ++F QRANA+ REP+L  FW  +R+Y+   ES   +    F LHDGPPYANG+LH+GHALNKILKDFINRY+ LRG +VRYVPGWDCHGLPIEL VL+S+K  ERQ LTP+ LR++A  FA   VD+QR SFKRYGV GEW++PY+TL PEYEAAQ+ + G M+LKGH+Y+G+KPVHWSPSS+TALAEAELEYP+ H+SKSIY +F V+S     G A L              + VA+WTTTPWTIP N+A+AVN EL+Y++V  P+  +   +VAKDLV ++++++  + +    GL VLAT+KG DLV  +Y HPL+ R S+V+ GGDYITT++GTGLVHTAPGHGQED+ TG KYGL  LSPVD+ GRFT E G RFEG  VL +GN  ++QAL+E GSL+  E Y H+YPYDWRTKKPTIFRAT+QWF SVD+FR+ ALAAI+ V W+PE+GK RI+ MT SRSDWCISRQRSWGVPIPVFY       L+N  T+ H++ VFR HG+DAWWTMDT  LLP  Y+ +  LW +G DT+DVWFDSGSSWAGVV  R D+L    P ++YLEG DQHRGWFQSSLLT VA  G  PY+TV+THGFVLDEKGYKMSKSLGNV+ P+ +IEGG++KK+ PAYGADVLRLWV++VDYSSDV VG  I+KQ+ D+YRKLRNTAR+LIGN+HDFD  +D VAY+ LP +DK++LG+ + +++ V  AY+SYQF R +Q L  F+   LSNFY+DV+KDRLYI+ P+D RRR CQ+ ++++ E  A  ++PLLPHMAEDIWQNLP+     R + S+F+AGW       P + +D W  V+ LRDDVNK +E+ R   ++GA+ EAAV+++ PD +  + L  L+ D       T      + + VDDLRFL L SQVH+ DS   V  +C +       T SG  +G+ +A G+KC RCW +G+       SP   + P     +CPRC  AV+     + A A+      P
Sbjct:  142 NPFSDSINLPQTTFTQRANAVTREPELQAFWAEQRVYERQAES---NPGEVFVLHDGPPYANGELHMGHALNKILKDFINRYQSLRGRKVRYVPGWDCHGLPIELAVLKSMKSSERQQLTPLDLRRKAASFALSQVDKQRDSFKRYGVMGEWEKPYLTLDPEYEAAQIDILGKMLLKGHVYKGKKPVHWSPSSQTALAEAELEYPDGHVSKSIYASFKVTS--PSDGLARLLPDXXXXXXXXXXXVGVAVWTTTPWTIPGNMAVAVNPELQYALVRRPD-SDAVLIVAKDLVKEVSKALGALDELTGEGLEVLATIKGEDLVDTQYEHPLYGRVSRVVAGGDYITTDTGTGLVHTAPGHGQEDFETGQKYGLEPLSPVDNYGRFTAEAGERFEGLSVLKEGNEAIIQALDECGSLLKAEDYPHRYPYDWRTKKPTIFRATEQWFISVDAFREAALAAIDEVRWIPEMGKTRIASMTSSRSDWCISRQRSWGVPIPVFYQAHTNEPLINNSTITHIKEVFRQHGTDAWWTMDTQQLLPEPYRSEWSLWVRGNDTIDVWFDSGSSWAGVVNTR-DNLRHDNPVELYLEGSDQHRGWFQSSLLTSVAVRGAPPYQTVLTHGFVLDEKGYKMSKSLGNVLSPLTIIEGGSDKKKQPAYGADVLRLWVATVDYSSDVLVGGTIIKQVFDAYRKLRNTARFLIGNIHDFDKEQDGVAYEDLPEVDKWVLGKTASLIRRVREAYESYQFYRVTQELLAFSNQLLSNFYLDVSKDRLYISTPNDRRRRACQSVIRVVTEAFALLLAPLLPHMAEDIWQNLPHDGEGVRRSGSVFEAGWPSGAEASPPFREDKWDLVMTLRDDVNKVIESARTAKLVGANSEAAVHIHCPDPQTTSWLRELLPDQSPLLATTTTTTDGRISAVDDLRFLFLTSQVHLADSPGAVEGSCPEFRLSAEQTGSGCTVGVSKAVGTKCERCWMWGEDVSEWQGSPRSQDDPPSRHSLCPRCRAAVEQFLDRRRARAEGGRTNQP 1155          
BLAST of NO01G02870 vs. NCBI_GenBank
Match: WP_015203168.1 (isoleucine--tRNA ligase [Crinalium epipsammum] >AFZ13052.1 Isoleucyl-tRNA synthetase [Crinalium epipsammum PCC 9333])

HSP 1 Score: 1171.4 bits (3029), Expect = 0.000e+0
Identity = 570/978 (58.28%), Positives = 716/978 (73.21%), Query Frame = 0
Query:  382 YRDTVALPQSSFDQRANAIVREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHALNKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTPVTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVFGDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSSLESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVVSHPELGEQAFVVAKDLVGKLAESVFKVQDKGGLTVLATLKGRDLVGLKYRHPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSPVDDAGRFTEEVGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPYDWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTESRSDWCISRQRSWGVPIPVFYNKEDGATLMNEKTLAHLEGVFRTHGSDAWWTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGDSLSFPADVYLEGVDQHRGWFQSSLLTCVAATGMAPYKTVVTHGFVLDEKGYKMSKSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNILKQIGDSYRKLRNTARYLIGNLHDFDPNKDAVAYDTLPRLDKYILGRLSLMLQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRRRTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYARPTTSIFQAGWIRKDRQYPSYE-DDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDAELRTVLDGLMGDATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANCDQALSQLRDTASGAVL--GIKRASGSKCSRCWYYGDLSPADAELPHICPRCSHAVQAR 1357
            Y++TV LP++ FD RANA+ REP++  FW  E+IY+ L ++   +    F LHDGPPYANG LHIGHALNKILKD IN+Y++L+G +VRYVPGWDCHGLPIELKVLQ++KQ ERQ LTP+TLR++A +FA +TVD QR  FKRYGVWG+WD PY+TL+P YEAAQ+ VFG MVLKG+IYRG KPVHWSPSS+TALAEAELEYPE HIS+S+YVAFPV+ L      +         +  L VAIWTTTPWTIP+NLA++VN EL+Y+VV+         +VA DLV  L+E++ K        V AT+ G+DL    YRHPLFDRES V+ GGDY+TTESGTGLVHTAPGHGQEDYL G +YGLP+LSPVD  G FTEE G +F G +VLG+GN  V+ AL E G+L+  E Y+HKYPYDWRTKKPTI+RAT+QWFASV+ FR +AL+AI +V W+P  G+ RI+ M   RSDWCISRQRSWGVPIPVFY++  G  L+N++T+ + + +    GSDAWW +   +LLP  Y+     + KGTDTMDVWFDSGSSWAGV++ER + L +PAD+YLEG DQHRGWFQSSLLT VA  G APYKTV+THGF LDE+G KMSKS+GNV+DP  VIEGG N+K++P YGADVLRLWVSSVDYSSDV +  +ILKQ+GD   K+RNTARYL+G+LHDFDP KDAV Y+ LP LD+Y+L R++ + ++V  A+DSYQF R  Q +Q F V DLSNFY+D+AKDRLYI+A +  RRR+CQT M + LE +A A++P+L HMAEDIWQ +PYA P  S+F+AGW+  + ++   E    W  + ++R DVNK +E  R E +IG+SLE+ V +Y P+ + R +L         Q  P   N VD+LR+L L+SQV ++DSV        +AL  ++    G  L  GI +A G KC RCW Y       AE P IC RC  A+  R
Sbjct:    8 YKNTVNLPKTKFDMRANAVKREPEIQKFWAEEQIYERLSQN---NPGELFVLHDGPPYANGALHIGHALNKILKDIINKYQLLKGRKVRYVPGWDCHGLPIELKVLQNMKQQERQELTPLTLRRKAKEFALKTVDEQRQGFKRYGVWGDWDNPYLTLKPSYEAAQIGVFGQMVLKGYIYRGLKPVHWSPSSKTALAEAELEYPEGHISRSLYVAFPVTKLGDAVNESLQQF-----LPNLSVAIWTTTPWTIPANLAVSVNPELKYAVVAVEGEQSNYLIVAADLVETLSETLGK-----NFQVKATVVGKDLENTTYRHPLFDRESPVVIGGDYVTTESGTGLVHTAPGHGQEDYLVGQRYGLPILSPVDADGNFTEEAG-QFAGLNVLGEGNTAVITALTEVGALLKEEAYSHKYPYDWRTKKPTIYRATEQWFASVEGFRDEALSAIASVKWIPSQGENRITPMVSERSDWCISRQRSWGVPIPVFYDEATGEPLLNQETINYAQAIIAEKGSDAWWELSVEELLPESYRNNGKSYRKGTDTMDVWFDSGSSWAGVLQER-EELKYPADIYLEGSDQHRGWFQSSLLTSVATNGYAPYKTVLTHGFTLDEQGRKMSKSVGNVIDPAIVIEGGKNQKEEPPYGADVLRLWVSSVDYSSDVSISKSILKQMGDVRGKIRNTARYLLGSLHDFDPAKDAVPYEQLPELDRYLLHRMTEVFKDVTEAFDSYQFFRFFQTVQNFCVVDLSNFYLDIAKDRLYISAENSLRRRSCQTVMAIALENLAKAIAPVLSHMAEDIWQYIPYATPCKSVFEAGWVNLEEEWHKPELTQPWQMLRQVRTDVNKVLEQARAEKMIGSSLESKVLLYIPNVDQRQLLQ--------QLNPEAGNGVDELRYLFLSSQVELLDSV--------EALQDMQYKMQGDNLSVGIVKADGEKCDRCWNYSTQVGKIAEHPVICERCVAALSDR 954          
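The percentages in the HSP statistics line can be recomputed from the match counts. The following is a minimal sketch, not part of the database output: the helper name `parse_hsp_stats` is hypothetical, and the input string is the statistics line for the WP_015203168.1 hit shown above.

```python
import re

def parse_hsp_stats(line):
    """Recompute percentages from 'Name = matched/length' fields of an HSP line.

    Hypothetical helper for illustration; it only reads the count fractions
    and ignores the percentages already printed in parentheses.
    """
    stats = {}
    for name, matched, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        stats[name] = round(100 * int(matched) / int(length), 2)
    return stats

# Statistics line of the WP_015203168.1 HSP
line = "Identity = 570/978 (58.28%), Positives = 716/978 (73.21%), Query Frame = 0"
stats = parse_hsp_stats(line)
print(stats)
```

The recomputed identity agrees with the value listed for this hit in the summary table.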
The following BLAST results are available for this feature:
BLAST of NO01G02870 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name      E-value   Identity (%)  Description
EWM29120.1      0.000e+0  77.66         isoleucyl-trna synthetase [Nannochloropsis gaditan...
XP_002176534.1  0.000e+0  60.77         predicted protein [Phaeodactylum tricornutum CCAP ...
CBJ33425.1      0.000e+0  60.26         Isoleucyl-tRNA Synthetase [Ectocarpus siliculosus]
XP_002286281.1  0.000e+0  58.08         isoleucine-trna synthetase [Thalassiosira pseudona...
EJK73254.1      0.000e+0  60.68         hypothetical protein THAOC_05131 [Thalassiosira oc...
XP_005832559.1  0.000e+0  59.74         hypothetical protein GUITHDRAFT_71180 [Guillardia ...
GAX16043.1      0.000e+0  60.30         isoleucyl-tRNA synthetase [Fistulifera solaris]
GAX15769.1      0.000e+0  60.14         isoleucyl-tRNA synthetase [Fistulifera solaris]
CEL95501.1      0.000e+0  56.42         unnamed protein product [Vitrella brassicaformis C...
WP_015203168.1  0.000e+0  58.28         isoleucine--tRNA ligase [Crinalium epipsammum] >AF...

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name  Unique Name  Species                                       Type
nonsL005      nonsL005     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ncniR047      ncniR047     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ngnoR091      ngnoR091     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name  Unique Name  Species                                       Type
NSK008041     NSK008041    Nannochloropsis salina (N. salina CCMP1776)   gene


The following polypeptide feature(s) derive from this gene:

Feature Name  Unique Name           Species                                       Type
NO01G02870.1  NO01G02870.1-protein  Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name               Unique Name  Species                                          Type
jgi.p|Nanoce1779_2|576796  gene_281     Nannochloropsis oceanica (N. oceanica CCMP1779)  gene
Naga_100007g122            gene1409     Nannochloropsis gaditana (N. gaditana B-31)      gene


The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Species                                       Type
NO01G02870.1  NO01G02870.1  Nannochloropsis oceanica (N. oceanica IMET1)  mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO01G02870 ID=NO01G02870|Name=NO01G02870|organism=Nannochloropsis oceanica|type=gene|length=5397bp
ATGAGGAAACCCAGCGATCAGAATGACGAGGAGGAGGAGGGCGAGGTGCG
CGAAGgtaagtcccgcggacaaaatctaaaatacggagcatcacgaaaga
ggttaggccctcagggaggtttgatattattccccctccatccaatctac
acagACCAAGACAGTCACACCACTGCCATGGACGAGCAAGACAACGATTT
TGTGGAGGAGGAGGGGGGGATGGACGTGTCAGAGGCGGCAGCGGGGCTTG
TCCAAGCCGCACTCAACATAGCAGCGCCCGAGGTCCTGCTTCCGGGAGAC
GTGAAGGAGGTGCCAGAGCACTTCTCAAGGGTCAAGCTAGgtgagacggt
gacggtaatgtcacatactgccacatatgtgaagaaattatgcctaatta
tactatcccttatcttgtgtactaattctctttctccctctcctccttcc
ccctcctatttgacagGCAATGGGCTAGTCAGCGCGCCCTCGCCACCTGG
TGCGGCCAAGGCAACCAAGGCTGGCATCCTGCGTTTCCAACCGCCGAATA
CGTATCGAATCGATAACCAACAACGACGGgtaagtgatgaaacacctcaa
atgtgatggtgaaaaggaaaggaactcaaatccagttctatctctttgct
atccatctctgtcccctgtcctctccttcatgccatctacctttcttagT
ACATACCTAAGGGCGACGACACCGTGCTGGGCATTGTGGAAGACCGCGTG
GCCGAGCACTACCGCATCAACATTTTCGGAAGgtaggtgccgaaacaagc
cgagggatgaaggaaggctaagtaacagcatggcaagaccactggttcaa
aagtgaaaagggactgcttcaatatgctcattcttacttccactcctcat
ctctgccccttatatatttagCTCTCCGGGTCTGTTGCCCCACTTGGCAT
TTGACGGTGCCTCGAAGCGGAACAAGCCGAATTTGAAGATTGGGGCCTTG
GTGTACTGCCGCGTGGCCGTTGCCAATAAAGACATGGATAGCGAACTAAG
CTGCATGGgtaatgaggcagttgggaaagggaacgagggagggatggctc
ggggcagtatgcttattcaaatttgaactgagcggaaatcacttcccgtg
ccttttcagttcctgcaacacatgttttcattttctctggttcgcacatt
ttttttcaattcattcacagTAACACAAGGCATCAAAAAGGATTGGATGA
CGGGGGAGTCGACGTTTGGGGAGCTGAAAGACGGCACCGTCACACGATGT
TCCCTAGGCCTCTGCCGACAgtatgtagctttgttcaatggttcatttcc
gctttccccttccatctcttccccctttgtccccacatctttctaagata
agctctgtaagcaatgttcgacgcgctcatttctttccatctttcccgtt
atttccttcccagGTTACTGAGTAAGAACTGCCGAGTGCTCCAGGCGTTG
GCAAAGCAGCAGGTCCCTTTTGAGGTGGCCGTCGGTCGAAATGGGGCCTT
TTATGTGAACGCCGACGATCCACGGATTATCGTGGCTGTGGTTAACATTC
TGTTGAACTCAGAGATGATTCATGACGGGCACGATGCAATGGTGCGgtcc
gttctggcctcagtctctcataagcggaaagcaagaaggccagcgtcata
aagaaattatgataaaaattcagaaatgtgggagtactttttttttgcat
cgtttcttaaccgttatgtcatagttggtcgttctagactcgtattgttt
agttgttggttgcgtttgtgcaggtggggagattgaatgtgggccgttgg
cgccttgtgtgtgaggtggcccttgtgtgtggtcgaggcacatttaagat
cagcctacgataattggcaaatggtcgccctgtgcaacttgaattcgaca
cgtgtaggccaccacttcctcaaggtcaaagctagggctgctggcacaag
agagacaaaagatactcacgttgctgtggtcttgtacactccagCCACTT
CAGCACCCACAAAAGGCAACCCTTAGgtactggacatgcataagcgctgt
ggactttcacatatgatctcttcactcatttcatcttttcccctccccca
atacactcccaaagGGCGGCACCGCGCAGTGGTAATGTCCCGGCGGGCGG
TCTCCACGTTCGTCATGACGACTTGCCTGCTCCTTCTGCAGAGGTCAAGC
ACCCATGTTGTATGGCGCCGCAGTGTGCAAACTAAAAGCAGCAGCAGCAT
GCGAGGACGTGGTGGCTTGGGAAGCCCGGCGTTCCTGCTCTCTGCGTCAC
GACTTCGTCACACGGGCCCGATCCACGCCAAGAAGCAACCCGACGATCCG
TCCGACCCCGCGAACCTCTACCGCGACACAGTGGCCCTCCCCCAGAGTAG
CTTTGACCAGCGGGCCAATGCCATTGTCCGCGAACCCCAACTCCACGACT
TTTGGGAATCAGAGCGCATTTACCAAGGGTTGCACGAGAGCCGGAGGAAG
AGCGGGGCCCCTAAGTTCACTCTGCACGACGGGCCTCCCTACGCCAATGG
GGACCTGCACATTGGCCACGCGCTCAACAAAATTTTGAAGGACTTCATCA
ACCGCTATAAAATCCTGCGGGGGTTCGAGGTCCGGTATGTTCCCGGGTGG
GACTGCCACGGGCTGCCTATCGAGCTCAAGGTGCTGCAGAGCCTGAAGCA
GGTGGAGCGCCAGGCATTGACGCCCGTCACGCTGCGACAACGCGCTGGAG
ATTTTGCCAAGGAGACGGTGGACCGACAACGGACGTCGTTCAAACGCTAT
GGGGTATGGGGCGAGTGGGATGAGCCTTACATGACATTGCAGCCTGAGTA
CGAGGCAGCGCAAGTTCGCGTGTTTGGGGATATGGTCCTCAAGGGCCATA
TCTATCGAGGCCGCAAACCCGTGCATTGGAGTCCCTCTTCCCGCACTGCC
CTGGCGGAAGCCGAGCTCGAATACCCCGAAAACCACATTTCAAAGTCTAT
ATACGTCGCCTTTCCCGTCAGCTCCCTCGAAAGCTGCTCCGGGGCAGCCA
CGCTTGCTGCTGCGCACCCTGGAGGGATCCAGGCGCTGCGTGTCGCCATT
TGGACGACAACGCCGTGGACCATCCCTTCAAACCTGGCAATTGCAGTAAA
TGCCGAACTGGAGTATTCAGTAGTTTCTCATCCAGAACTCGGCGAGCAGG
CTTTTGTGGTCGCAAAGGACTTGGTGGGCAAGCTGGCGGAGAGCGTGTTC
AAGGTGCAGGACAAGGGTGGGTTGACGGTCCTTGCCACGCTGAAGGGTCG
AGACCTGGTAGGACTTAAGTACCGCCATCCGCTGTTTGATCGTGAATCAC
AGGTCTTGGAAGGAGGGGATTATATCACGACGGAGAGTGGGACGGGATTA
GTGCATACGGCACCCGGGCATGGGCAGGAGGATTATCTGACTGGAATGAA
ATATGGTTTACCGTTGCTGAGTCCTGTGGATGATGCGGGGCGTTTCACGG
AGGAAGTGGGACCGAGGTTTGAGGGTAAGGATGTGCTGGGCGATGGGAAC
ATGGAGGTAGTTCAGGCGTTGAATGAGACGGGGTCCTTGATTTTGATGGA
GCCGTATAATCATAAGTACCCTTATGACTGGCGCACGAAGAAGCCGACGA
TTTTCCGGGCTACGGACCAATGGTTTGCGTCAGTGGATTCATTCCGACAG
GATGCCCTGGCAGCAATTGAAACGGTACATTGGATGCCGGAAGTGGGGAA
GAAGCGAATTTCGATCATGACGGAATCGAGGTCTGATTGGTGCATTTCGA
GGCAACGGTCTTGGGGGGTGCCGATCCCTGTGTTTTATAACAAAGAGGAT
GGCGCGACCTTGATGAATGAGAAGACCCTCGCTCATCTGGAGGGTGTGTT
TCGTACACATGGTTCGGATGCGTGGTGGACGATGGACACGGTTGATTTGC
TTCCTGCTGAGTATAAAGAGCAGTCGGGGCTGTGGGAGAAGGGGACTGAC
ACCATGGACGTGTGGTTTGACAGTGGGTCGTCATGGGCCGGTGTGGTAAA
GGAACGCGGGGATTCTTTGTCGTTTCCTGCTGATGTATATCTGGAAGGGG
TCGATCAGCACCGAGGCTGGTTCCAGTCTTCCCTTTTGACGTGCGTGGCG
GCTACGGGGATGGCACCTTACAAAACGGTAGTAACGCACGGCTTTGTGCT
TGATGAAAAAGGGTACAAGATGAGTAAGAGCTTGGGTAATGTCGTTGACC
CGATGAAGGTCATCGAGGGAGGAAATAACAAAAAGCAAGACCCGGCATAT
GGCGCAGACGTGTTGCGGTTGTGGGTGTCGTCGGTGGACTACTCGAGTGA
CGTGTGCGTTGGGTCAAATATTTTGAAGCAAATCGGCGATTCCTACCGGA
AGCTCCGCAACACCGCCCGATACCTCATCGGAAATCTCCACGACTTTGAT
CCAAACAAAGATGCCGTCGCCTACGACACTCTTCCTCGCCTGGATAAGTA
CATCTTGGGCCGGCTCTCTTTGATGTTGCAAGAGGTGGAGAACGCTTATG
ATAGCTACCAGTTCAGTCGTGCCAGCCAGGCCTTGCAGCGCTTTGCCGTC
ACTGATCTCTCTAATTTTTACATGGATGTGGCCAAGGACCGCTTGTATAT
CGCCGCCCCCGATGACACTCGTCGACGTACATGCCAAACTACGATGCAGC
TGCTTTTGGAGGGGATGGCGACGGCGATGAGCCCGCTCTTGCCTCATATG
GCTGAGGATATCTGGCAAAACCTGCCTTATGCCCGGCCGACCACCAGTAT
CTTCCAAGCAGGCTGGATAAGAAAAGACCGTCAGTATCCCTCGTATGAGG
ATGACGCCTGGGCTGCCGTCTTGAGGTTGAGGGATGATGTGAACAAGTGC
ATGGAAGCTGGACGACGGGAAGGCGTGATTGGCGCGTCATTGGAGGCAGC
TGTCTATGTCTATGCCCCTGACGCCGAGCTCCGCACCGTGTTGGATGGCT
TGATGGGAGATGCGACATTTCAACATCCTCCGGCGAAAAGCAACAACGTC
GACGACCTCCGCTTTTTGTTGCTTGCGTCCCAGGTGCATGTGGTGGACTC
CGTAGTGGAGGTGACGGCGAATTGCGACCAAGCGTTGAGCCAGCTGCGGG
ATACGGCCAGCGGGGCCGTATTAGGAATTAAGCGGGCGAGCGGCAGCAAA
TGCAGCCGATGTTGGTATTACGGTGATCTGTCTCCGGCGGACGCCGAATT
GCCCCATATCTGCCCGCGCTGCTCGCACGCCGTCCAAGCCAGGGGCGGCC
TAAAGCCAGCAGCAGCTCAGCCAGCGGAAGTCGGCGCCCCACAGTAA
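The gene sequence above mixes uppercase and lowercase letters. A common convention, assumed here and not stated on this page, is that uppercase runs are exons and lowercase runs are introns. Under that assumption, the following minimal sketch splits a sequence into runs by letter case; `split_by_case` is a hypothetical helper, and `head` is simply the first two sequence lines joined.

```python
import re

def split_by_case(seq):
    """Split a mixed-case sequence into (kind, start, end) runs.

    ASSUMPTION: uppercase = exon, lowercase = intron. Coordinates are
    0-based, end-exclusive, relative to the start of the given string.
    """
    runs = []
    for m in re.finditer(r"[A-Z]+|[a-z]+", seq):
        kind = "exon" if m.group().isupper() else "intron"
        runs.append((kind, m.start(), m.end()))
    return runs

# First two lines of the gene sequence, joined
head = ("ATGAGGAAACCCAGCGATCAGAATGACGAGGAGGAGGAGGGCGAGGTGCG"
        "CGAAGgtaagtcccgcggacaaaatctaaaatacggagcatcacgaaaga")
runs = split_by_case(head)
for kind, start, end in runs:
    print(kind, start, end)
```

In this excerpt the lowercase run begins with `gt`, which is consistent with a canonical splice donor site and with the intron reading of lowercase letters.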

protein sequence of NO01G02870.1

>NO01G02870.1-protein ID=NO01G02870.1-protein|Name=NO01G02870.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1374bp
MRKPSDQNDEEEEGEVREDQDSHTTAMDEQDNDFVEEEGGMDVSEAAAGL
VQAALNIAAPEVLLPGDVKEVPEHFSRVKLGNGLVSAPSPPGAAKATKAG
ILRFQPPNTYRIDNQQRRYIPKGDDTVLGIVEDRVAEHYRINIFGSSPGL
LPHLAFDGASKRNKPNLKIGALVYCRVAVANKDMDSELSCMVTQGIKKDW
MTGESTFGELKDGTVTRCSLGLCRQLLSKNCRVLQALAKQQVPFEVAVGR
NGAFYVNADDPRIIVAVVNILLNSEMIHDGHDAMVRHFSTHKRQPLGRHR
AVVMSRRAVSTFVMTTCLLLLQRSSTHVVWRRSVQTKSSSSMRGRGGLGS
PAFLLSASRLRHTGPIHAKKQPDDPSDPANLYRDTVALPQSSFDQRANAI
VREPQLHDFWESERIYQGLHESRRKSGAPKFTLHDGPPYANGDLHIGHAL
NKILKDFINRYKILRGFEVRYVPGWDCHGLPIELKVLQSLKQVERQALTP
VTLRQRAGDFAKETVDRQRTSFKRYGVWGEWDEPYMTLQPEYEAAQVRVF
GDMVLKGHIYRGRKPVHWSPSSRTALAEAELEYPENHISKSIYVAFPVSS
LESCSGAATLAAAHPGGIQALRVAIWTTTPWTIPSNLAIAVNAELEYSVV
SHPELGEQAFVVAKDLVGKLAESVFKVQDKGGLTVLATLKGRDLVGLKYR
HPLFDRESQVLEGGDYITTESGTGLVHTAPGHGQEDYLTGMKYGLPLLSP
VDDAGRFTEEVGPRFEGKDVLGDGNMEVVQALNETGSLILMEPYNHKYPY
DWRTKKPTIFRATDQWFASVDSFRQDALAAIETVHWMPEVGKKRISIMTE
SRSDWCISRQRSWGVPIPVFYNKEDGATLMNEKTLAHLEGVFRTHGSDAW
WTMDTVDLLPAEYKEQSGLWEKGTDTMDVWFDSGSSWAGVVKERGDSLSF
PADVYLEGVDQHRGWFQSSLLTCVAATGMAPYKTVVTHGFVLDEKGYKMS
KSLGNVVDPMKVIEGGNNKKQDPAYGADVLRLWVSSVDYSSDVCVGSNIL
KQIGDSYRKLRNTARYLIGNLHDFDPNKDAVAYDTLPRLDKYILGRLSLM
LQEVENAYDSYQFSRASQALQRFAVTDLSNFYMDVAKDRLYIAAPDDTRR
RTCQTTMQLLLEGMATAMSPLLPHMAEDIWQNLPYARPTTSIFQAGWIRK
DRQYPSYEDDAWAAVLRLRDDVNKCMEAGRREGVIGASLEAAVYVYAPDA
ELRTVLDGLMGDATFQHPPAKSNNVDDLRFLLLASQVHVVDSVVEVTANC
DQALSQLRDTASGAVLGIKRASGSKCSRCWYYGDLSPADAELPHICPRCS
HAVQARGGLKPAAAQPAEVGAPQ*
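As a consistency check between the two sequences, the start of the gene sequence can be translated and compared with the start of the protein. This is a minimal sketch assuming the standard genetic code and assuming the first uppercase run of the gene sequence is coding; `translate` and `CODON_TABLE` are names introduced here for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here)
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna):
    """Translate complete codons of an uppercase DNA string; trailing bases are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First uppercase run of the gene sequence (55 bases, so 18 full codons)
first_exon = "ATGAGGAAACCCAGCGATCAGAATGACGAGGAGGAGGAGGGCGAGGTGCGCGAAG"
peptide = translate(first_exon)
print(peptide)
```

The translated peptide corresponds to the first 18 residues of the protein sequence above; the single leftover base would be completed by the next uppercase run.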