NO20G01910, NO20G01910 (gene) Nannochloropsis oceanica

Overview
Name: NO20G01910
Unique Name: NO20G01910
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 4623
Alignment location: chr20:572753..577375 -
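
The alignment location and the sequence length above can be cross-checked with a short calculation. The sketch below is illustrative only: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which is consistent with the stated length of 4623.

```python
import re

def parse_location(loc: str):
    """Parse a string like 'chr20:572753..577375 -' into its parts.

    Returns (seqid, start, end, strand); strand is '.' when absent.
    """
    m = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)\s*([+-])?", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end, strand = m.groups()
    return seqid, int(start), int(end), strand or "."

def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (an assumption, see above)."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr20:572753..577375 -")
print(seqid, start, end, strand, span_length(start, end))
```

Under that assumption, 577375 - 572753 + 1 gives 4623, matching the sequence length listed in the overview.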


Properties
Property Name    Value
Description      purine biosynthesis 4
Mutants
Expression

Alignments
The following features are aligned
Aligned Feature    Feature Type    Alignment Location
chr20              genome          chr20:572753..577375 -
Analyses
This gene is derived from or has results from the following analyses
Analysis Name                                    Date Performed
GSE178672                                        2023-12-15
PRJNA933693                                      2023-10-26
PRJNA943449                                      2023-07-19
GSE149904                                        2020-10-10
NoIMET1_WT_HS                                    2020-09-29
GSE139615                                        2020-03-11
PRJNA241382                                      2020-03-11
BLASTP analysis of N. oceanica IMET1 genes       2019-07-11
InterPro analysis for N. oceanica IMET1 genes    2019-07-11
GO annotation for IMET1v2 genes                  2019-07-10
PRJNA182180                                      2019-04-25
Gene prediction for N. oceanica IMET1            2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term         Definition
IPR036604    PurS-like_sf
IPR029062    Class_I_gatase-like
IPR036921    PurM-like_N_sf
IPR036676    PurM-like_C_sf
IPR017926    GATASE
IPR010918    AIR_synth_C_dom
Vocabulary: Biological Process
Term          Definition
GO:0009987    cellular process
GO:0008152    metabolic process
Homology
BLAST of NO20G01910 vs. NCBI_GenBank
Match: EWM22060.1 (phosphoribosylformylglycinamidine synthase [Nannochloropsis gaditana])

HSP 1 Score: 1338.2 bits (3462), Expect = 0.000e+0
Identity = 689/971 (70.96%), Positives = 780/971 (80.33%), Query Frame = 0
Query:  588 MPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEGDPELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQGSQALATLSRICHREATPMEVVGHITGHGAIRVYENEEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTR-------------GGIEKAASPMALRDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMATDCDDGVRVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLP---PHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIGTSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFPPPRATTTPALSFSARPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEAS-NRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAG--------GQVGEG-EAVYTPWFRLFQNAFDFSM 1533
            MPAW+GAEFLKPIMFSAG+GK+PAEA+ K  P VGMAVVKLGGPAYRVGLGGGAASSKV GDPEL+LQAVQRGDPEMGNKVGRVVRAC+EM H+S SL+LESVHDQGAGGNANVLKELVAP GADVFL+RI+RGDSSLTPLEVWGAEYQESQG+LV GSQALA L+RIC RE+ PM VVGH+TG+GAIRV+E+EE++  G    + LVDLPLEPLLS RPKRVIQA   R G +             G   +     AL+D L  +LS VSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWG +GVA+ALGECPGIG+AG    I+AMVRMAVGEALTNLVSAP+T WADIKLQANWMWPGREG +AGQLY+GVEALR  LL+LGLALDGGKDSLSMAT C+DG+RVP PATVV+TAYAPC DVSRVLTPD+KVP   +G+EGVL+ L++SG     + LGGSAWA+I   D +  TPDM DP+L+ RAF AVQ L++E  +QACHDVSSGGLVTTVLEMA+SGDAGARL LP   PH    S     EA+LF+EEL LVLELK QD+  V+  L+ Q +P+  IG STP+R++ I  A+G+ +L E+ + L A+W A S QLE+LQA+PACIEAEEA+L   RRP W  PP   T  P+ S   R + RVGVLREQGSNGDREMAAALHSAGF VWDLTV DLLQ QVDL SFHGLAFVGGFSYGD LGSARGWRSVL GNPR EAQ+R FFARPETWSLGLCNGCQLLVALGIVP  D+ ++ +P+  +  +   WLGEN+SGRFESRFVTV VGPSPAVLLKGLEG V+GIWSAH EG +   ++ +++  L +GLAPVRY DP   GG   GTE+YPFNPNGSPRGVAALCS DGRHLAMMPHPERSWL WQMPWM  AG        G+V  G E VYTPWFRLFQNAF FS+
Sbjct:    1 MPAWRGAEFLKPIMFSAGLGKVPAEALLKRSPSVGMAVVKLGGPAYRVGLGGGAASSKVGGDPELDLQAVQRGDPEMGNKVGRVVRACVEMQHASNSLVLESVHDQGAGGNANVLKELVAPAGADVFLDRIERGDSSLTPLEVWGAEYQESQGLLVNGSQALAALNRICRRESAPMAVVGHVTGNGAIRVFESEEQSGDG-QAVQALVDLPLEPLLSHRPKRVIQATRERTGAQSSNADVEAGEGIGGRRGREGDKTALQDTLYKVLSVVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGVDGVATALGECPGIGIAGGNEGISAMVRMAVGEALTNLVSAPLTGWADIKLQANWMWPGREGASAGQLYDGVEALRGVLLELGLALDGGKDSLSMATLCEDGIRVPSPATVVVTAYAPCCDVSRVLTPDVKVPGECDGQEGVLLFLDVSGSGPRPKELGGSAWAEIKRIDGDQRTPDMRDPALLRRAFEAVQGLSKEGKIQACHDVSSGGLVTTVLEMAISGDAGARLVLPAGDPHDPLLSSQRFTEASLFSEELGLVLELKGQDLALVQNDLQDQGIPYSTIGFSTPERKVEIRGADGRILLSEETEALHAKWCAMSLQLERLQASPACIEAEEAVLSSLRRPVWAIPPSLTTARPSPSTPNRTRLRVGVLREQGSNGDREMAAALHSAGFQVWDLTVADLLQGQVDLGSFHGLAFVGGFSYGDALGSARGWRSVLLGNPRVEAQLRHFFARPETWSLGLCNGCQLLVALGIVP--DLALEDRPNPINCKKPLVWLGENDSGRFESRFVTVKVGPSPAVLLKGLEGLVMGIWSAHAEGKMHFSDETLMNQILDRGLAPVRYTDPWDRGGERGGTEQYPFNPNGSPRGVAALCSADGRHLAMMPHPERSWLRWQMPWMAQAGYDSAKSPSGKVNAGHEEVYTPWFRLFQNAFHFSV 968          
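
The percentages on each HSP summary line are ratios over the aligned length, so they can be recomputed from the counts. The following is a minimal sketch, not part of the database tooling: `HSP_RE` and `parse_hsp_stats` are hypothetical names, and the pattern accepts the positives label spelled either "Positives" or "Postives", since exports vary.

```python
import re

# Matches e.g. "Identity = 689/971 (70.96%), Positives = 780/971 (80.33%)"
HSP_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)

def parse_hsp_stats(line: str) -> dict:
    """Extract identity/positive counts and recompute their percentages."""
    m = HSP_RE.search(line)
    if m is None:
        raise ValueError(f"not an HSP summary line: {line!r}")
    ident, ident_len, _, pos, pos_len, _ = m.groups()
    ident, ident_len, pos, pos_len = map(int, (ident, ident_len, pos, pos_len))
    return {
        "identical": ident,
        "positives": pos,
        "aligned_length": ident_len,
        "pct_identity": round(100 * ident / ident_len, 2),
        "pct_positives": round(100 * pos / pos_len, 2),
    }

stats = parse_hsp_stats(
    "Identity = 689/971 (70.96%), Positives = 780/971 (80.33%), Query Frame = 0"
)
print(stats)
```

For the first hit, 689 of 971 aligned positions are identical, which rounds to the 70.96% reported above.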
BLAST of NO20G01910 vs. NCBI_GenBank
Match: OGW32091.1 (phosphoribosylformylglycinamidine synthase [Nitrospirae bacterium GWF2_44_13] >OGW65826.1 phosphoribosylformylglycinamidine synthase [Nitrospirae bacterium RIFOXYA2_FULL_44_9])

HSP 1 Score: 981.9 bits (2537), Expect = 2.300e-282
Identity = 556/1333 (41.71%), Positives = 794/1333 (59.56%), Query Frame = 0
Query:  226 PRPSFRSVWSSNVRAILEARGCLAQAGESAGDRLQELEMGRWYHIVSR----SEALLPEAEALLYDRMTESPFAMPAAAVWGEADKTWMLAPTDRLWTEREKEPSAIVLPLHEEGPEVLYTVSEKYGLGLGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDGQAMPKSLMEMVREPWQXXXXXXXXXXXXXXXXXXXXXXXXNSLLAFCDNSSAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGAQTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHHRPRHVASALEILLRASDGASDYANKYGEPLVGGFARA----MPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEGD--PELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQGSQALATLSRICHREATPMEVVGHITGHGAIRVYENEEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTRGGIEKAASPMALRDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMAT-----------------DCDDGVRVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIGTSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFPPPRATTTPALSFSA----RPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAGGQVGEGEAVYTPWFRLFQNA 1528
            PR +F + WS+N  ++  A G           +++ +E  R Y +V      +++ +    AL++DRMTE P+                    +   T  + EP A  +PL EEGP  L  ++ + GLGL  W++ +   LFV  + R+PT VE FD++QS SEH RHW F GQ +VDG+ + ++LM+++++P +                        NS++AF DNSS IKG ++    P    R SR+       HL LTAETHNFP  +APFPGA+TG GGRIRD+ ATGRG + +AG A Y VG L + G    K          P ++AS L+I + AS+GASDY NK+GEP++ GF R+    +   +  E++KPIMF+AG+G++ A  + K +P+ GM V K+GGPAYR+G+GGGAASS ++G+   EL+  AVQRGD EM  K+ RV+RAC+E+   +  +   S+HDQGAGGN NV+KE++ P GA + + +IQ GD++L+ LE+WGAEYQE   +L++  +A      +C RE  P   +G ITG G I +++  +    G  P    V+L LE +L   P++  +    +P       K    + +RD L  +L  VSVGSK FLTNKVDRSVTGL+A Q C GPLHL V DV V A S +G  G A ++GE P      T  + +A  R++VGEALTN+V A I++  DIK   NWMW  +      +LY+   ALR+ +L+LG+A+DGGKDSLSMA                   CD  + V  P T+VI+AYA C D+++V+TPD+K P      +  L+ ++L    +   RLGG+A AQ+  +    E+PD+ DP L+ +AF AVQ+L  + L+ A HD S GGL+TT+LEMA +G+ G  + +          + +   LF+EEL LV+E  P++ K++ A L++Q+VP  ++G +T  ++ + V++NG+ +L EDM VLR  W+ TS+QLE+LQ NP C + E+  +  +R P++     + T  P ++ SA    + KP V ++RE+GSN DREM +A + AGFDVWD T+ D  + +V L +F G+AFVGGFSY DVL SA+GW  V+  N     Q ++F+ RP+T+SL +CNGCQL   LG +P   ++   +P             N SGRFESRF TV + PSP+++LKG+EG+ +GIW AHGEG      + +L+   +  LAPVRYVD        + TE YPFNPNGS  G+AALCSPDGRHLA+MPHPER++  W   WMP    +  +     +PW RLFQNA
Sbjct:   69 PRMNFTTAWSTNAVSVCHACGL---------KKIRRIERSRRYRLVGSIHPFTDSSIHRFLALVHDRMTECPYP----------------ETLETFETGIKPEP-AYTVPLIEEGPSALKKINTEMGLGLDDWDIEYYYSLFVKDLKRNPTNVECFDLSQSNSEHSRHWFFRGQLIVDGKEISENLMQIIKQPLK--------------------ANPNNSVIAFKDNSSGIKGYKIKTIIPENVGRHSRFKEAALKYHLILTAETHNFPSGVAPFPGAETGTGGRIRDVQATGRGAHVMAGTAAYCVGNLRIPG---YKLPWEDESFEYPNNLASPLQIEIEASNGASDYGNKFGEPVIQGFTRSFGMRLADGERREWIKPIMFTAGIGQMDARHIEKGQPEKGMLVTKIGGPAYRIGMGGGAASSMIQGENIAELDFNAVQRGDAEMEQKLNRVIRACVELGSDNPII---SIHDQGAGGNCNVVKEIIYPAGAKIEIRKIQIGDNTLSVLEIWGAEYQEQNALLIKPDKA-NIFEELCRREKVPFSFIGQITGDGYIVLHDEND----GSAP----VNLDLEKILGDMPQKTFKLERVQPKLEP--LKLPEDITVRDALDRVLRLVSVGSKRFLTNKVDRSVTGLIARQQCAGPLHLTVSDVAVIAQSHFGLTGAAISIGEQP----IKTLINPSATARLSVGEALTNIVWAKISKLEDIKCSGNWMWAAKLPGEGARLYDAAVALRDLMLELGIAIDGGKDSLSMAAIIPKSSGGGSHRGDSSPTCD--LTVKSPGTLVISAYATCPDITKVVTPDIKKPG-----KSRLLFIDLG---NSKNRLGGTALAQVYNQIGN-ESPDVDDPKLLKKAFNAVQKLISDNLIIAGHDRSDGGLITTLLEMAFAGNCGVEVEMQDAGCRMQDEIMSR--LFSEELGLVVEYLPKNEKKILAELKNQKVPCHVVGRTTVNKK-IKVKSNGKLILNEDMRVLRGIWEETSYQLERLQMNPDCADEEKKNIYARRSPQF-----KITFKPDITSSAIIKRKNKPSVAIVREEGSNSDREMTSAFYQAGFDVWDTTMTDFFEGKVSLDNFKGMAFVGGFSYADVLDSAKGWAGVIKFNKEIYEQFQKFYNRPDTFSLSVCNGCQLAALLGWIPWQGIEDKYQPR---------FIHNVSGRFESRFSTVKIFPSPSIMLKGMEGSTLGIWVAHGEGRAYFPKEDILNKIGKDSLAPVRYVD-----DKGKITESYPFNPNGSVNGIAALCSPDGRHLAIMPHPERTFQKWNWAWMP----EEWKKNLKASPWLRLFQNA 1297          
BLAST of NO20G01910 vs. NCBI_GenBank
Match: KPK82456.1 (phosphoribosylformylglycinamidine synthase [Gemmatimonas sp. SM23_52])

HSP 1 Score: 979.9 bits (2532), Expect = 8.800e-282
Identity = 575/1320 (43.56%), Positives = 781/1320 (59.17%), Query Frame = 0
Query:  226 PRPSFRSVWSSNVRAILEARGCLAQAGESAGDRLQELEMGRWYHIVSRSEALLPEAEALL---YDRMTESPFAMPAAAVWGEADKTWMLAPTDRLWTEREKEPSAI-VLPLHEEGPEVLYTVSEKYGLGLGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDGQAMPKSLMEMVREPWQXXXXXXXXXXXXXXXXXXXXXXXXNSLLAFCDNSSAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGAQTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHHR-PRHVASALEILLRASDGASDYANKYGEPLVGGFARA----MPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEGD--PELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQGSQALATLSRICHREATPMEVVGHITGHGAIRVYENEEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTRGGIEKAASPMALRDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMA------TDCDDGVRVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIGTSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFP-PPRATTTPALSFSARPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAGGQVGEGEAVYTPWFRLFQNA 1528
            PR +F + WS+N  +I  A G           +++ +E  R Y +   S+     A A L   +DRMTE+P+  P  +                   E   +P  I  +P+ EEG   L  ++ + GL    W++ + TDLF++ +GRDPT VE FD+AQS SEH RHW F G+ V+DG+ +PK L+ +++EP +                        NS++AF DNSS I+G  +    PA P  P  +   +   H+  TAETHNFP  +APFPGA+TG GGRIRD+ ATGRG   +AG A Y VG L + G  +   ED   P  R P ++AS L+I + AS+GASDY NK+GEP++ G+ R+    +P  +  E++KPIMF+ G+G++ A    K+ P+  M VVK+GGPAYR+G+GGGAASS V+G+   EL+  AVQRGD EM  KV RV+RAC+E+   +  +   S+HDQGAGGN NVLKE+V P GA + +  I  GD +L+ LE+WGAEYQE+  +L++   A  T   +C RE  P+  VG ITG G I V++   ++          V+L LE +L   P++  +    R   +         + ++D L  +L  +SVGSK FLT KVDRSVTGL+A Q C GPL L V DV V A S +G  G A+A+GE P  GL  T    AAM RMAVGEAL NLV A ++   D+    NWMW  +       LY+   A+R+ +L+LG+A+DGGKDS+SMA      T  D+ V+   P  +VI+AY  C D+++ +TPDLK+P +     G L+ ++L+   SG  RLGGSA AQ+ G+  + E+PD+ D  ++ RAF  VQ+L     + A HD S GGL+TT+LEMA +G+ G  + L       + + SA   LF+EEL LVLE+  +    V AA     VP + IG ST     V V  NG++VLR+D+  LR  W+ATSFQL++LQAN  C+E EE  L  +  P+++ P  P+ T T  L    + K  V +LRE+GSNGDREM +A ++AGF+ WD+ + DLL  ++DL+ F G+  VGGFSY DVL SA+GW  V+  N     Q + F+ R +T+SLG+CNGCQLL  LG VP       G P E   R      +N SGRFESRFV V +  SPA++L+G+EGA +GIW AHGEG     +  VLD  +  GLAPVR+VD          TE YPFNPNGSP G+A LCSPDGRHL MMPHPERS+L WQ  WMP    +  E     +PW R+FQNA
Sbjct:   61 PRMNFSTAWSTNAVSICHACGL---------KKIRRIERSRRYLLKGVSKLSEDRAWAFLAEVHDRMTETPYPEPLQSF------------------ETGVKPEPIEKIPVMEEGRAALERINREMGLAFDAWDLDYYTDLFLNRVGRDPTNVECFDIAQSNSEHSRHWFFKGRLVIDGEEIPKHLIALIKEPLE--------------------ANPNNSVIAFKDNSSGIRGYPIRTIVPAHPGEPCPFVAADLDYHIIFTAETHNFPSGVAPFPGAETGTGGRIRDVHATGRGSLVVAGTAAYCVGNLRIPG-YELPWED---PGFRYPSNLASPLQIEIEASNGASDYGNKFGEPVIQGYTRSFGLRLPNGERREWIKPIMFTGGIGQIDARHTEKDSPEPEMWVVKIGGPAYRIGMGGGAASSMVQGENVEELDFNAVQRGDAEMEQKVNRVIRACVELGDRNPII---SIHDQGAGGNCNVLKEIVDPAGAQIEIREIPLGDETLSVLEIWGAEYQENDALLLRPEHA-DTFRALCEREKIPVAFVGKITGDGRIVVHDEVSDSTP--------VNLDLEAVLGHMPQKTFEF--ERIPAKLEPLHLPDDLTVQDALERVLRLLSVGSKRFLTTKVDRSVTGLIAQQQCTGPLQLTVADVAVIAQSHFGTTGGATAVGEQPMKGLLNT----AAMGRMAVGEALINLVWAQVSALEDVCCSGNWMWAAKLPGEGAALYDAAAAMRDVMLELGIAVDGGKDSISMAAIAPGPTGADETVK--APGELVISAYVTCPDITKTVTPDLKLPGS-----GRLLYVDLA---SGMHRLGGSALAQVYGQIGD-ESPDVEDAGVLKRAFNVVQQLIGAGKIAAGHDRSDGGLITTLLEMAFAGNHGIEVDLQD-----ADATSAIPFLFSEELGLVLEVSSEAEAEVLAAFRQANVPCIAIG-STTNGTSVTVRYNGEKVLRDDLRYLRDLWEATSFQLDRLQANLECVEEEERGLRERTGPKYVVPFTPKHTPTAIL--EKKEKIPVAILREEGSNGDREMTSAFYAAGFEPWDVVMSDLLSGRIDLRRFRGVVGVGGFSYADVLDSAKGWAGVIRFNGDLWRQFQEFYERSDTFSLGICNGCQLLALLGRVPW-----SGIPDERQPRFI----QNASGRFESRFVAVQIQKSPAIMLQGIEGATLGIWVAHGEGRAFFPDATVLDRVIEDGLAPVRFVD-----DDNRITEVYPFNPNGSPHGIAGLCSPDGRHLVMMPHPERSFLKWQWGWMPDEMKRTLEA----SPWLRMFQNA 1274          
BLAST of NO20G01910 vs. NCBI_GenBank
Match: KRT73468.1 (phosphoribosylformylglycinamidine synthase, phosphoribosylformylglycinamidine synthase [Deltaproteobacteria bacterium CSP1-8])

HSP 1 Score: 978.8 bits (2529), Expect = 2.000e-281
Identity = 567/1315 (43.12%), Positives = 776/1315 (59.01%), Query Frame = 0
Query:  226 PRPSFRSVWSSNVRAILEARGCLAQAGESAGDRLQELEMGRWYHIVSRSEALLPEAEALL---YDRMTESPFAMPAAAVWGEADKTWMLAPTDRLWTEREKEPSAI-VLPLHEEGPEVLYTVSEKYGLGLGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDGQAMPKSLMEMVREPWQXXXXXXXXXXXXXXXXXXXXXXXXNSLLAFCDNSSAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGAQTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHHRPRHVASALEILLRASDGASDYANKYGEPLVGGFARA----MPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEGD--PELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQGSQALATLSRICHREATPMEVVGHITGHGAIRVYENEEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTRGGIEKAASPMALRDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMATDCDDG---VRVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIGTSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFPPPRATTTPALSFSARPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAGGQVGEGEAVYTPWFRLFQNA 1528
            PR +F + WS+N  ++ +A G          D+++ +E  R Y  ++ ++    + E LL   +DRMTE P+                  P      E    P A+  +PL EEG   L  ++   GLGL  W++ +  DLF   + R+PT+VE FD++QS SEH RHW F G+ VV+G+ +P++LM++V+ P                          NS++AF DNSSAI+G R+   +P  P R SR+       HL  TAETHNFP  +APFPGA+TG GGRIRDI ATGRGG  +AG A Y VG L + G   R   ++P   + P ++A+ L I ++AS+GASDY NK+GEP++ GF R+    +P  +  E++KPIMF+ G+G++ +  V K  P+ GM + K+GGPAYR+GLGGGAASS ++G+   EL+  AVQRGD EM  KV RV+RAC+EM  ++    + S+HDQGAGGN NV+KE++ P GA + + +IQ GD +L+ LE+WGAEYQE   +L++   A     ++C RE  P   +G I+G G I + +  +    G  P    VDL LE +L   P++  +     P       K    +++   L  +L  VSVGSK FLTNKVDRSVTGLVA Q C GPL L V DV V A S +G  G A ++GE P      T    AAM R++VGEALTNLV A I+R  D+K   NWMW  +      +LYE   ALR+ +L+LG+A+DGGKDSLSMA     G     V  P T+VI++YAPC D+++V TPDLK+P      +  L+ ++L    +G  RLGGSA AQ+  +   +E+PD+ DP L+ RAF AVQ L  + L+ + HD S GGL TT+LEMA SG+ G  + +            A   LF+EEL LV+E  P+D   +   L   +VP  I+G +T  +  + V  NG+ VL E+M  LR  W+ TS +LE+LQANP CI  E+  +  ++ P +    P    +PA+    R  P V V+RE+GSN DREM++A + AGFDVWD+T+ D L  +VDL  F G+AFVGGFSY DVL SA+GW  V+  N     Q RRF+ R +T+SLG+CNGCQL+  LG +P   ++   +P             N SGRFESRF TV + PSP+++L G++G+ +GIW AHGEG        +L       LAPVRYVD        E T +YPFNPNGS  G+AALCS DGRHLA+MPHPER++L WQ  WMP    +        +PW ++FQNA
Sbjct:   86 PRMNFTTAWSTNAVSVFQACGL---------DKIKRIERSRRYKWITDAKLGKDQIERLLSEVHDRMTECPY------------------PATLTTFETGIRPEAVRTVPLIEEGIPALQNINATLGLGLDAWDIEYYYDLFAKDLKRNPTDVECFDLSQSNSEHSRHWFFRGKLVVEGKEVPETLMQIVKSP--------------------LMANPGNSVIAFRDNSSAIQGYRIKTIRPEHPGRCSRFEEANPTYHLIFTAETHNFPSGVAPFPGAETGTGGRIRDIQATGRGGLVVAGTAAYCVGNLRIPGY--RLPWEDPSFRY-PGNLATPLSIEIQASNGASDYGNKFGEPVIQGFTRSFGLRLPDGERREWIKPIMFTGGIGQMDSRHVEKGIPEKGMLLAKIGGPAYRIGLGGGAASSMIQGENIAELDFNAVQRGDAEMEQKVNRVIRACIEMGENNP---IVSIHDQGAGGNCNVVKEIIYPAGARIEIRKIQSGDDTLSVLELWGAEYQEQNALLLRPGNA-GRFEKMCRREKVPCAFIGRISGDGRIVLVDETD----GSTP----VDLDLEKILGDMPQKTFRLDRIAPEREP--LKLPGNLSVGGALDRVLRLVSVGSKRFLTNKVDRSVTGLVARQQCAGPLQLTVSDVAVIAQSHFGLTGAAISIGEQP----IKTLIDPAAMARLSVGEALTNLVWAKISRLEDVKCSGNWMWAAKLPGEGTRLYEAAVALRDIMLELGIAIDGGKDSLSMAAKVVSGEASEMVKSPGTLVISSYAPCPDITKVATPDLKMPG-----KSRLLFIDLG---NGRDRLGGSALAQVYNQ-VGVESPDVDDPGLLKRAFNAVQGLISKGLILSGHDRSDGGLATTLLEMAFSGNCGLDIAV--------GGAGAIPCLFSEELGLVIEYLPKDETTITFRLRKAKVPFRILGRTTTGKR-IRVRFNGRIVLDEEMRGLRETWEETSHRLERLQANPKCITEEKKNIHDRQGPSYKVTFPPKAASPAV-IRKRKNPAVAVIREEGSNSDREMSSAFYQAGFDVWDVTMTDFLGGKVDLDRFRGMAFVGGFSYADVLDSAKGWAGVIRFNRDIFEQFRRFYDRTDTFSLGVCNGCQLMALLGWIPWTGIEDRHQPR---------FIHNLSGRFESRFSTVRIFPSPSIMLAGMQGSTLGIWVAHGEGRAYFPRKEILRKVESLSLAPVRYVD-----DRREPTMKYPFNPNGSANGIAALCSTDGRHLAIMPHPERTFLKWQWAWMP----EEWRKNLKASPWIKMFQNA 1295          
BLAST of NO20G01910 vs. NCBI_GenBank
Match: OGP79185.1 (phosphoribosylformylglycinamidine synthase [Deltaproteobacteria bacterium RBG_16_64_85])

HSP 1 Score: 978.8 bits (2529), Expect = 2.000e-281
Identity = 586/1407 (41.65%), Positives = 801/1407 (56.93%), Query Frame = 0
Query:  164 VQVQEFYFLSVLHEQKQEDNLI---ENISNVLREAATPWTHALLPLDDQSTASHHSLR--------------------QHYRAYAPRPSFRSVWSSNVRAILEARGCLAQAGESAGDRLQELEMGRWYHIVSRSEALLPEAEALL---YDRMTESPFAMPAAAVWGEADKTWMLAPTDRLWTEREKEPSAI-VLPLHEEGPEVLYTVSEKYGLGLGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDGQAMPKSLMEMVREPWQXXXXXXXXXXXXXXXXXXXXXXXXNSLLAFCDNSSAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGAQTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHHRPRHVASALEILLRASDGASDYANKYGEPLVGGFARA----MPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEGD--PELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQ-GSQALATLSRICHREATPMEVVGHITGHGAIRVYENEEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTRGGIEKAASPMAL------RDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMATDCDDG---VRVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIGTSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFPPPRATTTPALSFSARPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAGGQVGEGEAVYTPWFRLFQNA 1528
            +++  FY    L E K ED L    +N+S  +R   T +   +   +  + A   +LR                           PR +F + WS+N  ++  A G          D+++ +E  R Y +V+  E      E  L   +DRMTE P+  P +                    E    P ++  +PL EEG   L  ++ + GLGL  W++ +  DLF   + R+PT+VE FD++QS SEH RHW F G+ +VDG  +P++LM +V++P +                        NS++AF DNSSAI+G R+    P  P   SR T      HL  TAETHNFP  +APFPGA+TG GGRIRD+ ATGRGG  +AG A Y VG L + G   R   ++P   + P ++AS L I + AS+GASDY NK+GEP++ GF R+    +P  +  E++KPIMF+ G+G++ +  V K  P+ GM V K+GGPAYR+GLGGGAASS ++G+   EL+  AVQRGD EM  KV RV+RAC+EM   +    + S+HDQGAGGN NV+KE++ P GA + + +IQ GD +L+ LE+WGAEYQE   +L++ GS+   T   +C RE  P   +G I+G G I +++  +    G  P    VDL LE +L   P++  +     P        A SP+AL      R  L  +L  +SVGSK FLTNKVDRSVTGLVA Q C GPL L V DV V A S +G+ G A A+GE P      T    AAM R+ V E LTNLV A I R  D+K   NWMW  +      +LY+   ALRE L++LG+A+DGGKDSLSMA     G     V  P T+VI+AYAPC D+++V+TPD+K P      +  L+L++L     G  RLGGSA AQ  G+  + E+PD+ DP L+ RAF AVQ L  + LV + HD S GGLV T+LEMA +G+ G  ++        +        LF+EE  +V+E  P+D K + + L+  ++P  ++G +T  +  + V+A G+ VL EDM  LR+ W+ TS +LE+LQANP C   E+  +  +  P +         +PA+    R KP V V+RE+GSN DREMA+A H AGF VWD+T+ D L+ +VDL  F G+AFVGGFSY DVL SA+GW  V+  N       ++F+ R +T+SLG+CNGCQL+  LG +P   ++         +R       N SGRFESRF  V + PSP++LLKG+ G+ +GIW AHGEG     +  +L       LAPVRYVD        E T  YPFNPNGS  G+AALCSPDGRHLA+MPHPER++L WQ  WMP    +  +     +PW +LF NA
Sbjct:    1 MRLLHFYRYPALSETKAEDLLAFARKNVSPRVRRIETEYCFNIEASEPLNDAEIETLRWLLAETFEPDRFAPETLFHGDRVLEVGPRMNFTTAWSTNAVSVCRACGL---------DKIRRIERSRRYRLVADREPAGDRIEPFLSKVHDRMTECPYPAPLSTF------------------ETGIRPESVRTVPLIEEGLPALQRINAELGLGLDAWDIEYYFDLFAKDLQRNPTDVECFDLSQSNSEHSRHWFFRGKLIVDGTKIPETLMRIVKDPLK--------------------ANPANSVIAFKDNSSAIRGYRIRTLLPEHPGTCSRMTEARPAYHLIFTAETHNFPSGVAPFPGAETGTGGRIRDVQATGRGGLVVAGTAAYCVGNLRIPGY--RLPWEDPSFAY-PGNLASPLSIGIDASNGASDYGNKFGEPVIQGFTRSFGLRLPGGERREWIKPIMFTGGIGQMDSRHVEKGAPETGMLVTKIGGPAYRIGLGGGAASSMIQGENVAELDFNAVQRGDAEMEQKVNRVIRACVEMGDRNP---IVSIHDQGAGGNCNVVKEIIYPAGARIEVRKIQSGDDTLSALELWGAEYQEQNALLIRPGSR--ETFEAVCRREKVPCAFIGRISGDGRIVLHDERD----GSTP----VDLDLEKVLGDMPQKTFRLDRIAP--------ALSPLALPRSVSVRGALDRVLRLLSVGSKRFLTNKVDRSVTGLVARQQCAGPLQLTVSDVAVIAQSHFGQTGAAIAIGEQP----VKTLIDPAAMARLTVAEMLTNLVWAKIRRLGDVKCSGNWMWAAKLPGEGARLYDAAVALREILIELGIAIDGGKDSLSMAAKVGSGETAETVKSPGTLVISAYAPCPDITKVVTPDIKRPG-----KSRLLLIDLG---KGNYRLGGSALAQAYGQVGD-ESPDVDDPKLLRRAFNAVQGLIGKGLVLSGHDRSDGGLVATLLEMAFAGNCGLEIS--------TGGADPLPFLFSEEPGMVIEYLPKDEKHLLSRLQKARLPFRVLGATTAGKR-IKVKAGGRIVLGEDMRTLRSIWEETSHRLERLQANPGCAAQEKRNIHDRPGPSYKLSFTPEAASPAV-LRKRSKPAVAVVREEGSNSDREMASAFHQAGFTVWDVTMTDFLEGKVDLDRFRGMAFVGGFSYADVLDSAKGWAGVIRFNRSVFEPFQKFYDRADTFSLGVCNGCQLMALLGWIPWAGIE---------DRRQPRFIRNLSGRFESRFSAVRIFPSPSILLKGMAGSTLGIWVAHGEGRAFFPDRKILKNVESLSLAPVRYVD-----DRREVTMRYPFNPNGSANGIAALCSPDGRHLAIMPHPERTFLTWQWAWMP----EEWKRSLKASPWLKLFLNA 1295          
BLAST of NO20G01910 vs. NCBI_GenBank
Match: OGP35824.1 (phosphoribosylformylglycinamidine synthase [Deltaproteobacteria bacterium GWC2_65_14])

HSP 1 Score: 976.1 bits (2522), Expect = 1.300e-280
Identity = 573/1314 (43.61%), Positives = 775/1314 (58.98%), Query Frame = 0
Query:  226 PRPSFRSVWSSNVRAILEARGCLAQAGESAGDRLQELEMGRWYHIVSRSEALLPEAEALL---YDRMTESPFAMPAAAVWGEADKTWMLAPTDRLWTEREKEPSAIVLPLHEEGPEVLYTVSEKYGLGLGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDGQAMPKSLMEMVREPWQXXXXXXXXXXXXXXXXXXXXXXXXNSLLAFCDNSSAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGAQTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHHRPRHVASALEILLRASDGASDYANKYGEPLVGGFARA----MPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEGD--PELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQGSQALATLSRICHREATPMEVVGHITGHGAIRVYENEEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTRGGIEKAASPMALRDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMATDCDDGV---RVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIGTSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFPPPRATTTPALSFSARPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAGGQVGEGEAVYTPWFRLFQNA 1528
            PR +F + WS+N  ++  A G           +++ +E  R Y +++  +     A   L   +DRMTE P+  P +                   T    EP  +V PL EEG   L  ++ + GLGL  W+V +  DLFV  + R+PT+VE FD++QS SEH RHW F G+  VDG+ +P++L  +V++P +                        NSL+AF DNSSAI+G R+    P  P R SR+       HL  TAETHNFP  +APFPGA+TG GGRIRD+ ATGRGG  +AG A Y VG L + G     ++D       P ++AS L I + AS+GASDY NK+GEP++ GF R+    +P  +  E++KPIMF+ G+G++ A  V K  P+  M V K+GGPAYR+G+GGGAASS ++G+   +L+  AVQRGD EM  KV RV+RAC+EM   +    + S+HDQGAGGN NV+KE+V P GA + + +IQ GD +L+ LE+WGAEYQE   +L++   A    + +C RE  P   +G ITG G I +++  +    G  P    VDL LE +L   P++  + +   P     + +    + +R  L  +L  +SVGSK FLTNKVDRSVTGLVA Q C GPL L V DV V A S +GK G A ++GE P      T    AAM R+ V EA+TNLV A I R  D+K   NWMW  +      +LY+   ALRE L+DLG+A+DGGKDSLSMA     G     V  P T+VI+AYAPC D+++V TPD+K P      +  L+L++L G   G  RLGGSA AQ+ G+   +E PD+ DP L+ RAF A+QRL  + LV + HD S GGLVTT+LEMA +G+ G  +++            A  +LF+EE  LV+E  P D + V + L+  +V   I+G +T  + + I  AN + VL EDM VLR  W+ TS +LE+LQA+P C  AE+     ++ P +         +PAL    + KP V V+RE+GSN DREM++A H AGF+VWD+T+ D L+  VDL  F G AFVGGFSY DVL SA+GW  V+  N   + Q RRF+ RP+T+SLG+CNGCQL+  LG VP   +    +P    NR         SGRFESRF TV +  SP++LL+G+ G+ +GIW AHGEG       ++L     Q LAPVRYVD        + T EYPFNPNG+  G+AALCSPDGRHLA+MPHPER++L WQ  +MP    +  +     +PW  +F NA
Sbjct:   86 PRMNFTTAWSANAVSVCHACGL---------GKIRRIERSRRYRLIAGRKIGNDRAARFLSEVHDRMTECPYPKPLST----------------FETGIRPEPFRVV-PLIEEGIPALQRINTELGLGLDAWDVDYYHDLFVKDLRRNPTDVECFDLSQSNSEHSRHWFFRGKLEVDGKRIPETLFRIVKDPLR--------------------ANPGNSLIAFKDNSSAIRGYRIRTILPEEPGRCSRFLEAGPTYHLIFTAETHNFPSGVAPFPGAETGTGGRIRDVQATGRGGLVVAGTAAYCVGNLRIPGYRLPWEDDT---FAYPDNLASPLAIEIEASNGASDYGNKFGEPVIQGFTRSFGMRLPGGERREWIKPIMFTGGIGQMDARHVDKGLPEPEMLVTKIGGPAYRIGMGGGAASSMIQGENVADLDFNAVQRGDAEMEQKVNRVLRACVEMGDGNP---IVSIHDQGAGGNCNVVKEIVYPAGARIEVRKIQSGDDTLSVLELWGAEYQEQNALLIRPESA-PLFTAVCRREKVPCAYIGKITGDGRIVLHDEND----GSTP----VDLDLEKVLGDMPQKKFR-LDRIPPVLAPL-RLPRGLTVRQALDRVLRLLSVGSKRFLTNKVDRSVTGLVARQQCAGPLQLTVSDVAVIAQSHFGKTGAAISIGEQP----LKTLIDPAAMARITVAEAVTNLVWAKIRRLEDVKCSGNWMWAAKLPGEGAKLYDAAVALREVLIDLGIAIDGGKDSLSMAARVGSGTAAETVKSPGTLVISAYAPCPDITKVATPDIKRPG-----KSRLLLIDLGG---GRDRLGGSALAQVYGQ-VGIEPPDLDDPKLLKRAFDAIQRLIGKGLVLSGHDRSDGGLVTTLLEMAFAGNCGLSVSVEGR--------GAIPSLFSEEPGLVIEYLPGDERAVLSQLKKTKVHFRILGGTTTAKRIRIRFAN-RTVLNEDMRVLRETWEETSHRLERLQADPKCARAEKRNSYDRKGPSYKLAFDPRPPSPAL-LRRKDKPAVAVIREEGSNSDREMSSAFHQAGFEVWDVTMTDFLEGTVDLSRFRGAAFVGGFSYADVLDSAKGWAGVIRFNRSIQEQFRRFYERPDTFSLGVCNGCQLMALLGWVPWQGIGDRHQPRFIRNR---------SGRFESRFSTVRILESPSILLQGMAGSTLGIWVAHGEGRAFFPKRSILSKVEAQSLAPVRYVD-----DRGKVTTEYPFNPNGAANGIAALCSPDGRHLAIMPHPERTFLTWQWGFMP----EEWKKNLKISPWLTMFLNA 1295          
BLAST of NO20G01910 vs. NCBI_GenBank
Match: XP_005648260.1 (AIR synthase-related protein [Coccomyxa subellipsoidea C-169] >EIE23716.1 AIR synthase-related protein [Coccomyxa subellipsoidea C-169])

HSP 1 Score: 969.9 bits (2506), Expect = 9.100e-279
Identity = 585/1401 (41.76%), Positives = 806/1401 (57.53%), Query Frame = 0
Query:  140 PPMSGPREAILLRRL-KALDAAIEAVQVQEFYFLSVLHEQKQEDNLIENISNVLREAATPWTHALLPLDDQSTASHHSLRQHYRAYAPRPSFRSVWSSNVRAILEARGCLAQAGESAGDRLQELEMGRWYHIVSRSEALLPEA----EALLYDRMTESPFAMPAAAVWGEADKTWMLAPTDRLWTEREKEPSAIVLPLHEEGPEVLYTVSEKYGLGLGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDGQAMPKSLMEMVREPWQXXXXXXXXXXXXXXXXXXXXXXXXNSLLAFCDNSSAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGAQTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHHRPRHVASALEILLRASDGASDYANKYGEPLVGGFARA----MPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEGD--PELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQ-GSQALATLSRICHREATPMEVVGHITGHGAIRVYENEEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTRG-GIEKAASPMALRDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMATDCDDGVRVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIGTSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFPPPRATTTPALSFSARPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAGGQVGEGEAVYTPWFRLFQNA 1528
            P +S      L+R++ + + + IE++   E  F   L E   ED     ++ +LRE   P    LL  D +    + ++ +      PR SF+S WS+N  +I  + G  A         +  LE+ R Y + SRS +L P+      AL++DRMTE  +  P                  R +T          +P+ E+G   L  ++E+ GL     ++A+ T +F   + RDPT VE+FD+AQS SEH RHW F  Q V+DGQA P++LM+MV+   +                        NS++ F DNSSAI+G  V    P     PS     E    L LTAETHNFPC++AP+PGA+TG GGR+RD  ATG G    AG AGY VG L ++G               P  +AS L+IL+ AS+GASDY NK+GEPL+ G+ R     MP+ +  E++KPIMFSAG+G++    + KN+P++GM +VK+GGPAYR+G+GGGAASS   G    +L+  AVQRGD EM  K+ RVVRAC+EM   +    ++ +HDQGAGGN NV+KE++ P+GA + +  I  GD +++ LE+WGAEYQE+  +L++ G + L  L  +  RE   ++++G I+G G I + +      H  P +   VDL LE +L   P++  +       T    +   A+P      L  +L   +V SK FLT KVDR VTGLVA Q C GPL LPV DV V A S  G  G A+++GE P  GL       AAM RMA+GEA+TNL+ A  T  AD+K   NWM+  +       +Y+   ALR+A+++LGLA DGGKDSLSMA     G  V  P  +VI+AY  C D++  +TPDLK+P +     G L+ ++LSG   G +RLGGSA A    +  + E PD+  P+ +  A+ A Q L ++  + A HD+S GG+   +LEM  SG+ G  + LP        S  A  ALFAEEL LVLE+ P+D + V+AA  ++ +  + +G+    R  V +   G+  +  D+  LR  W+AT F+LE+ QA    +EAE   L  +  P W   P   T TP  + SA  KPRV +LRE+GSNGDREMAAA+H+AG + WD+T+ DLL  +  L SF G+ FVGGFSY DVL SA+GW   +  N R  AQ + F+ RP+++SLG+CNGCQL+  LG +PG D+    +P             N SGRFE R+ TV + PSP++LLKG+EG+V+GIW AHGEG  +  +DAV  + L+QGLAP+RY D      +   TE YPFNPNGSP G+AAL SPDGRHLA+MPHPER +L WQ PW P   G   +G    +PW +LFQNA
Sbjct:   48 PGLSASATKTLIRKVQQKVSSDIESIDT-ELCFNVALKEPLTEDQ-AATLTWLLRETYEP---ELLTPDSRLQEGNGTVLE----VGPRMSFQSAWSTNAVSICRSCGLNA---------VSRLEVSRRYLLRSRS-SLSPDTLAAFSALVHDRMTEQVYLEPL-----------------RSFTTSVTPGPVFTIPVLEKGRAALEAINEELGLAFDEQDLAYYTRMFQEEMKRDPTNVELFDIAQSNSEHSRHWFFGAQLVLDGQAAPETLMQMVKATLK--------------------ANPNNSVIGFKDNSSAIRGGPVQPMLPLACGAPSALAPQERDWDLLLTAETHNFPCAVAPYPGAETGAGGRMRDTHATGIGSIMGAGTAGYCVGNLNIEGT---PLPGEDLSFEYPESLASPLQILIDASNGASDYGNKFGEPLIAGYTRTFGLRMPSGERREWIKPIMFSAGLGQIDHSHLHKNDPELGMLIVKIGGPAYRIGMGGGAASSVPSGSNRADLDFNAVQRGDAEMAQKLWRVVRACVEMGGRNP---IQQIHDQGAGGNCNVVKEIIYPLGATIDVRSIALGDDTMSVLEIWGAEYQENDCLLIKPGDRGL--LEAVAARERCILQIIGSISGSGRITLVDK-----HAPPDSPTPVDLDLEKVLGDMPQKTFEFTRRAEATHPLDLPSTATP---EQALDRVLRLPAVASKRFLTTKVDRCVTGLVAQQQCCGPLQLPVADVAVMAQSHQGLTGAATSIGEQPLKGLIDP----AAMARMALGEAVTNLIWAAATGLADVKASVNWMYAAKMEHEGAAMYDAAAALRDAMIELGLACDGGKDSLSMAAGA-GGEVVKAPGNLVISAYVACPDITLTVTPDLKLPGS-----GRLLWVDLSG---GRKRLGGSALAHAYSQIGD-EVPDVA-PAALKGAWEATQELLRQRKISAGHDISDGGIAVALLEMGFSGNVGLTVDLPK---PEQDSTGALGALFAEELGLVLEVAPEDEEAVQAAYAARGLSAVAVGSVAADR-AVSISVGGEPSISGDVAALRDVWEATGFRLEREQAAEETVEAERRGLAAREAPAWTL-PYTPTWTPEEALSAADKPRVAILREEGSNGDREMAAAVHAAGMEPWDITMSDLLAGRASLDSFAGIVFVGGFSYADVLDSAKGWAGTIRRNSRLWAQFQAFYDRPDSFSLGVCNGCQLMALLGWIPGGDLPDTRQPR---------FVHNASGRFECRWATVRIEPSPSILLKGMEGSVVGIWCAHGEGQAKFPDDAVRASVLQQGLAPIRYCD-----ASGATTEAYPFNPNGSPDGIAALTSPDGRHLALMPHPERCFLTWQNPWYPKDVGLQPDGP---SPWLKLFQNA 1339          
BLAST of NO20G01910 vs. NCBI_GenBank
Match: EAY96218.1 (hypothetical protein OsI_18107 [Oryza sativa Indica Group])

HSP 1 Score: 969.1 bits (2504), Expect = 1.600e-278
Identity = 565/1320 (42.80%), Positives = 780/1320 (59.09%), Query Frame = 0
Query:  226 PRPSFRSVWSSNVRAILEARGCLAQAGESAGDRLQELEMGRWYHIV------SRSEALLPEAEALLYDRMTESPFAMPAAAVWGEADKTWMLAPTDRLWTEREKEPSAIVLPLHEEGPEVLYTVSEKYGLGLGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDGQAMPKSLMEMVREPWQXXXXXXXXXXXXXXXXXXXXXXXXNSLLAFCDNSSAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGAQTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHHRPRHVASALEILLRASDGASDYANKYGEPLVGGFAR----AMPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEG--DPELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQGSQALATLSRICHREATPMEVVGHITGHGAIRVYEN---EEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTRGGIEKAASPMALRDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMATDCDDGVRVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIG--TSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFP-PPRATTTPALSFSARPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAGGQVGEGEAVYTPWFRLFQNA 1528
            PR +F + +S+N  +I ++   +          +  LE  R Y +          E+ L +  AL++DRMTE  +     +   +                   EP  IV P+ E G E L  ++ K GL     ++ + T LF   I R+PT VE+FD+AQS SEH RHW F+G+ V+DG+ MP++L ++V+ P +                        NS++ F DNSSAIKG      +P  P   S  + +     +  TAETHNFPC++AP+PGA+TG GGRIRD  ATG+G + +A  AGY VG L ++GA    ++ +      P ++AS L+IL+ ASDGASDY NK+GEPL+ GF R     +   +  E+LKPIMFS  +G++    + K +P++GM VVK+GGPAYR+G+GGGAASS V G  D EL+  AVQRGD EM  K+ RVVRAC EM  S+  +   S+HDQGAGGN NV+KE++ P GA++ +  I  GD +L+ LE+WGAEYQE   +LV+  ++ + L  +C RE   M V+G I G G I + ++   E   ++G PP  P+ DL LE +L   P++  +    R          A  + + D L+ +LS  SV SK FLT KVDR VTGLVA Q  VGPL LP+ DV V A ++    G A A+GE P  GL   K    AM R+A+GEALTNLV A ++  +D+K   NWM+  +       +Y+   AL + ++ LG+A+DGGKDSLSMA  C DG  V  P  +VI+AY  C D++  +TPDLK+     G++GVL+ ++LS    G RRLGGSA AQ   +    + PD+ D   + +AF AVQ L  E L+ A HD+S GGL+ +VLEMA +G+ G +L +       S   S   ALFAEEL L+LE+  +D+  V+  L++  +   +IG  T++P  E+V+   +G+  L+E    LR  W+ TSFQLE LQ   +C+  E+  L  +  P W     P+ T    L+ S+  KP+V +LRE+GSNGDREMAAA ++AGF+ WD+T+ DLL  +  L+ + G+AFVGGFSY DVL SA+GW + +  N     Q + F+ RP+T+SLG+CNGCQL+  LG VPG   DV G      + +      NESGRFE RF +V +G SPA++ KG+EG+ +GIWSAHGEG     ++ VL + ++  LAPVRY D      A   TE YPFNPNGSP G+AALCSPDGRHLAMMPHPER ++ WQ PW P        G    +PW R+FQNA
Sbjct:  131 PRMTFSTAFSTNAVSICKSLSLM---------EVTRLERSRRYLLCLDPGYGPLDESQLNDFTALVHDRMTECVYPKKLTSFHSDV----------------VPEPVRIV-PVIERGREALEEINVKMGLAFDEQDIKYYTHLFRDDIKRNPTTVELFDIAQSNSEHSRHWFFNGKLVIDGETMPRTLFQLVKSPLK-------------------ANPDNNSVIGFNDNSSAIKGYPANQLRPTVPGSTSPLSVMMRELDILFTAETHNFPCAVAPYPGAETGAGGRIRDTHATGKGSFVVASTAGYCVGNLRIEGAYAPWEDPS---FSYPSNLASPLQILIDASDGASDYGNKFGEPLIQGFTRNFGTRLLNGERREWLKPIMFSGAIGQIDHAHISKGDPEIGMLVVKIGGPAYRIGMGGGAASSMVSGQNDAELDFNAVQRGDAEMAQKLYRVVRACAEMGESNPII---SIHDQGAGGNCNVVKEIIYPKGAEIDIRSIVVGDHTLSVLEIWGAEYQEQDALLVK-PESRSLLESLCERERVSMAVIGTINGCGKIVLIDSAAVEHAKLNGLPPPTPVEDLELEKVLGDMPQKTFEF--KRVSVVSEPLDIARGVTIMDALKRVLSLPSVCSKRFLTTKVDRCVTGLVAQQQTVGPLQLPLADVAVIAQTYTDLTGGACAIGEQPTKGLLNPK----AMARLAIGEALTNLVWAKVSSLSDVKASGNWMYAAKLDGEGADMYDAAVALADCMIQLGIAIDGGKDSLSMAAQC-DGEVVKAPGNLVISAYVTCPDITLTVTPDLKL-----GKDGVLLHIDLS---KGKRRLGGSALAQAFDQIGN-DCPDIDDVLYLKKAFEAVQELLGERLISAGHDISDGGLIVSVLEMAFAGNCGVKLNID------SEDSSLLQALFAEELGLLLEVHLKDLSVVKQKLQAGGISANVIGKVTASPDIELVV---DGRLHLKEKTSDLRDIWEETSFQLEGLQRLKSCVRLEKEGLKHRTSPSWSLSFTPKFTDEKLLTASS--KPKVAILREEGSNGDREMAAAFYAAGFEPWDITMSDLLAGKSSLEDYRGIAFVGGFSYADVLDSAKGWAASIRFNQPLIQQFQNFYNRPDTFSLGVCNGCQLMALLGWVPG--SDVGGSLGSGGDMSQPRFIHNESGRFECRFTSVSIGASPAIMFKGMEGSTMGIWSAHGEGRAFFPDENVLASVVKSNLAPVRYCD-----DANNITEVYPFNPNGSPLGIAALCSPDGRHLAMMPHPERCFMMWQYPWSPKDWQLEKSGP---SPWLRMFQNA 1361          
BLAST of NO20G01910 vs. NCBI_GenBank
Match: OAY73949.1 (putative phosphoribosylformylglycinamidine synthase, chloroplastic/mitochondrial [Ananas comosus])

HSP 1 Score: 968.8 bits (2503), Expect = 2.000e-278
Identity = 571/1346 (42.42%), Positives = 795/1346 (59.06%), Query Frame = 0
Query:  201 HALLPLDDQSTASHHSLRQHYRAYAPRPSFRSVWSSNVRAILEARGCLAQAGESAGDRLQELEMGRWY--HIVSRSEAL----LPEAEALLYDRMTESPFAMPAAAVWGEADKTWMLAPTDRLWTEREKEPSAI-VLPLHEEGPEVLYTVSEKYGLGLGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDGQAMPKSLMEMVREPWQXXXXXXXXXXXXXXXXXXXXXXXXNSLLAFCDNSSAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGAQTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHHRPRHVASALEILLRASDGASDYANKYGEPLVGGFARA----MPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEG--DPELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQGSQALATLSRICHREATPMEVVGHITGHGAIRVYEN---EEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTRGGIEKAASPMALRDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMATDCDDGVRVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIG--TSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFP-PPRATTTPALSFSARPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAGGQVGEGEAVYTPWFRLFQNA 1528
            H+ L  ++  T + +S+        PR SF + WS+N  +I +A          +   +  LE  R Y  H+   S  L    + +  A+++DRMTE  +     +      KT  +             P A+ V+P+ E G E L  ++ K GL     ++ + T LF   I R+PT VE+FD+AQS SEH RHW F+G+ V+DG+ M K+LM++V+   +                        NS++ F DNSSAIKG +V   +PA P        +     +  TAETHNFPC++AP+PGA+TG GGRIRD  ATGRG + +A  AGY VG L ++G+    ++ +      P ++A  L+IL+ ASDGASDY NK+GEPL+ GF R     +P+ +  E+LKPIMFS G+G++    + K EPD+GM VVK+GGPAYR+G+GGGAASS V G  D EL+  AVQRGD EM  K+ RVVRAC EM   +  +   S+HDQGAGGN NV+KE++ P GA++ +  I  GD +++ LE+WGAEYQE   +LV+  ++   L  IC RE   M V+G I+G G I + ++   EE   +G PP  P+ DL LE +L   P++  +     P  R  ++  A    L D L+ +L   SV SK FLT KVDR VTGLVA Q  VGPL LP+ DV V A ++    G A A+GE P  GL  +K    AM RMAVGEALTNLV A +T  AD+K   NWM+  +       +Y+   AL E+++ LG+A+DGGKDSLSMA     G  V  P  +VI+AY  C D++  +TPDLK+ +     +GVL+ ++L+    G RRLGGSA AQ   +  + + PD+ D   +   F +VQ L  E L+ A HD+S GGL+   LEMA +G+ G +L L       S   S    LFAEEL L+LE+  +D+  V+  L++  V   +IG  +++P  E+V+   +G   L+E+   LR  W+ TSFQLE LQ   +C++ E+  L  ++ P W     P+ T +  ++ S+  KP+V ++RE+GSNGDREM+AA ++AGF+ WD+T+ DLL  ++ L  F G+AFVGGFSY DVL SA+GW + +  N     Q ++F+ RP+T+SLG+CNGCQL+  LG VPG   DV G      + +      NESGRFE RF  V +G SPA++ KG+EG+ +G+W+AHGEG     ++ +L + L+  LAPVRY D        + TE YPFNPNGSP G+AALCSPDGRHLAMMPHPER ++ WQ PW P       +G    +PW R+FQNA
Sbjct:  139 HSFLEEEEALTGAQNSV---LIEVGPRMSFTTAWSANAVSICQA---------CSLTEITRLERSRRYLLHLRPGSSPLDVNQINDFAAMVHDRMTECVYPQKLTSF-----KTSAI-------------PEAVSVVPVIERGREALEEINVKMGLAFDEQDIKYYTALFKDDIKRNPTTVELFDIAQSNSEHSRHWFFNGKLVIDGETMSKTLMQIVKSTLK--------------------ANPNNSVIGFKDNSSAIKGYQVNQLRPAFPGSTCPLDMIIRELDILFTAETHNFPCAVAPYPGAETGAGGRIRDTHATGRGSFVVAATAGYCVGNLRIEGSFAPWEDSS---FLYPSNLAPPLQILVDASDGASDYGNKFGEPLIQGFTRTFGMRLPSGERREWLKPIMFSGGIGQIDHAHISKGEPDIGMLVVKIGGPAYRIGMGGGAASSMVSGQNDAELDFNAVQRGDAEMAQKLYRVVRACAEMGEKNPII---SIHDQGAGGNCNVVKEIIYPKGAEIDIRSIVVGDHTMSVLEIWGAEYQEQDALLVK-PESRDLLQVICERERVSMAVIGTISGSGKIVLIDSSAIEESKSNGLPPXXPVEDLELEKVLGDMPQKCFE-FSRIPQLREPLD-IAPGTTLMDSLKRVLKLPSVCSKRFLTTKVDRCVTGLVAQQQTVGPLQLPLSDVAVIAQTYTDLTGGACAIGEQPIKGLLNSK----AMARMAVGEALTNLVWAKVTSLADVKASGNWMYAAKLDGEGADMYDAAIALSESMIQLGIAIDGGKDSLSMAAHA-GGEVVKAPGNLVISAYVTCPDITLTVTPDLKLTN-----DGVLLHIDLA---KGKRRLGGSALAQAFDQVGD-DCPDLDDVLYLKSVFESVQDLLSERLISAGHDISDGGLIVCALEMAFAGNCGLKLNLS------SGGHSILHTLFAEELGLILEINKKDIDIVKKKLKTMGVSSEVIGEVSASPVIELVV---DGDLRLKEETSYLRDLWEETSFQLESLQRLASCVKLEKEGLKHRQSPSWSLSFTPKFTNSKLIAASS--KPKVAIIREEGSNGDREMSAAFYAAGFEPWDVTMSDLLNGKISLDDFRGVAFVGGFSYADVLDSAKGWSASIRFNLPLLQQFQKFYNRPDTFSLGVCNGCQLMALLGWVPG--GDVGGSSGVGGDLSQPRFVHNESGRFECRFTGVTIGDSPAIMFKGMEGSTLGVWAAHGEGRAYFPDNDILGSVLKSNLAPVRYCD-----DESKITEVYPFNPNGSPLGIAALCSPDGRHLAMMPHPERCFMMWQYPWYPKEWNVDKKGP---SPWLRMFQNA 1390          
BLAST of NO20G01910 vs. NCBI_GenBank
Match: XP_020083748.1 (probable phosphoribosylformylglycinamidine synthase, chloroplastic/mitochondrial [Ananas comosus] >XP_020083757.1 probable phosphoribosylformylglycinamidine synthase, chloroplastic/mitochondrial [Ananas comosus])

HSP 1 Score: 968.8 bits (2503), Expect = 2.000e-278
Identity = 571/1346 (42.42%), Positives = 795/1346 (59.06%), Query Frame = 0
Query:  201 HALLPLDDQSTASHHSLRQHYRAYAPRPSFRSVWSSNVRAILEARGCLAQAGESAGDRLQELEMGRWY--HIVSRSEAL----LPEAEALLYDRMTESPFAMPAAAVWGEADKTWMLAPTDRLWTEREKEPSAI-VLPLHEEGPEVLYTVSEKYGLGLGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDGQAMPKSLMEMVREPWQXXXXXXXXXXXXXXXXXXXXXXXXNSLLAFCDNSSAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGAQTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHHRPRHVASALEILLRASDGASDYANKYGEPLVGGFARA----MPAWQGAEFLKPIMFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEG--DPELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNANVLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQGSQALATLSRICHREATPMEVVGHITGHGAIRVYEN---EEEAVHGWPPARPLVDLPLEPLLSRRPKRVIQAIGTRPGTRGGIEKAASPMALRDVLRAILSTVSVGSKAFLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGECPGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGREGEAAGQLYEGVEALREALLDLGLALDGGKDSLSMATDCDDGVRVPCPATVVITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGSAWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGLVTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQDVKRVRAALESQQVPHLIIG--TSTPQREMVIVEANGQEVLREDMDVLRAEWQATSFQLEKLQANPACIEAEEALLPRQRRPEWIFP-PPRATTTPALSFSARPKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHGLAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNGCQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVGPSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDPCCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQMPWMPPAGGQVGEGEAVYTPWFRLFQNA 1528
            H+ L  ++  T + +S+        PR SF + WS+N  +I +A          +   +  LE  R Y  H+   S  L    + +  A+++DRMTE  +     +      KT  +             P A+ V+P+ E G E L  ++ K GL     ++ + T LF   I R+PT VE+FD+AQS SEH RHW F+G+ V+DG+ M K+LM++V+   +                        NS++ F DNSSAIKG +V   +PA P        +     +  TAETHNFPC++AP+PGA+TG GGRIRD  ATGRG + +A  AGY VG L ++G+    ++ +      P ++A  L+IL+ ASDGASDY NK+GEPL+ GF R     +P+ +  E+LKPIMFS G+G++    + K EPD+GM VVK+GGPAYR+G+GGGAASS V G  D EL+  AVQRGD EM  K+ RVVRAC EM   +  +   S+HDQGAGGN NV+KE++ P GA++ +  I  GD +++ LE+WGAEYQE   +LV+  ++   L  IC RE   M V+G I+G G I + ++   EE   +G PP  P+ DL LE +L   P++  +     P  R  ++  A    L D L+ +L   SV SK FLT KVDR VTGLVA Q  VGPL LP+ DV V A ++    G A A+GE P  GL  +K    AM RMAVGEALTNLV A +T  AD+K   NWM+  +       +Y+   AL E+++ LG+A+DGGKDSLSMA     G  V  P  +VI+AY  C D++  +TPDLK+ +     +GVL+ ++L+    G RRLGGSA AQ   +  + + PD+ D   +   F +VQ L  E L+ A HD+S GGL+   LEMA +G+ G +L L       S   S    LFAEEL L+LE+  +D+  V+  L++  V   +IG  +++P  E+V+   +G   L+E+   LR  W+ TSFQLE LQ   +C++ E+  L  ++ P W     P+ T +  ++ S+  KP+V ++RE+GSNGDREM+AA ++AGF+ WD+T+ DLL  ++ L  F G+AFVGGFSY DVL SA+GW + +  N     Q ++F+ RP+T+SLG+CNGCQL+  LG VPG   DV G      + +      NESGRFE RF  V +G SPA++ KG+EG+ +G+W+AHGEG     ++ +L + L+  LAPVRY D        + TE YPFNPNGSP G+AALCSPDGRHLAMMPHPER ++ WQ PW P       +G    +PW R+FQNA
Sbjct:  167 HSFLEEEEALTGAQNSV---LIEVGPRMSFTTAWSANAVSICQA---------CSLTEITRLERSRRYLLHLRPGSSPLDVNQINDFAAMVHDRMTECVYPQKLTSF-----KTSAI-------------PEAVSVVPVIERGREALEEINVKMGLAFDEQDIKYYTALFKDDIKRNPTTVELFDIAQSNSEHSRHWFFNGKLVIDGETMSKTLMQIVKSTLK--------------------ANPNNSVIGFKDNSSAIKGYQVNQLRPAFPGSTCPLDMIIRELDILFTAETHNFPCAVAPYPGAETGAGGRIRDTHATGRGSFVVAATAGYCVGNLRIEGSFAPWEDSS---FLYPSNLAPPLQILVDASDGASDYGNKFGEPLIQGFTRTFGMRLPSGERREWLKPIMFSGGIGQIDHAHISKGEPDIGMLVVKIGGPAYRIGMGGGAASSMVSGQNDAELDFNAVQRGDAEMAQKLYRVVRACAEMGEKNPII---SIHDQGAGGNCNVVKEIIYPKGAEIDIRSIVVGDHTMSVLEIWGAEYQEQDALLVK-PESRDLLQVICERERVSMAVIGTISGSGKIVLIDSSAIEESKSNGLPPXXPVEDLELEKVLGDMPQKCFE-FSRIPQLREPLD-IAPGTTLMDSLKRVLKLPSVCSKRFLTTKVDRCVTGLVAQQQTVGPLQLPLSDVAVIAQTYTDLTGGACAIGEQPIKGLLNSK----AMARMAVGEALTNLVWAKVTSLADVKASGNWMYAAKLDGEGADMYDAAIALSESMIQLGIAIDGGKDSLSMAAHA-GGEVVKAPGNLVISAYVTCPDITLTVTPDLKLTN-----DGVLLHIDLA---KGKRRLGGSALAQAFDQVGD-DCPDLDDVLYLKSVFESVQDLLSERLISAGHDISDGGLIVCALEMAFAGNCGLKLNLS------SGGHSILHTLFAEELGLILEINKKDIDIVKKKLKTMGVSSEVIGEVSASPVIELVV---DGDLRLKEETSYLRDLWEETSFQLESLQRLASCVKLEKEGLKHRQSPSWSLSFTPKFTNSKLIAASS--KPKVAIIREEGSNGDREMSAAFYAAGFEPWDVTMSDLLNGKISLDDFRGVAFVGGFSYADVLDSAKGWSASIRFNLPLLQQFQKFYNRPDTFSLGVCNGCQLMALLGWVPG--GDVGGSSGVGGDLSQPRFVHNESGRFECRFTGVTIGDSPAIMFKGMEGSTLGVWAAHGEGRAYFPDNDILGSVLKSNLAPVRYCD-----DESKITEVYPFNPNGSPLGIAALCSPDGRHLAMMPHPERCFMMWQYPWYPKEWNVDKKGP---SPWLRMFQNA 1418          
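The percentages in each HSP header follow directly from the residue counts printed beside them. A minimal Python sketch of that check, assuming the header line is available as plain text; the function name `recompute_percentages` is hypothetical and not part of any BLAST tool:

```python
import re

# Matches fragments such as "565/1320 (42.80%)" anywhere on a header line.
FRACTION = re.compile(r"(\d+)/(\d+) \((\d+(?:\.\d+)?)%\)")


def recompute_percentages(hsp_line):
    """Return (count, aligned_length, reported_pct, recomputed_pct) per fraction."""
    results = []
    for num, den, reported in FRACTION.findall(hsp_line):
        # Recompute the percentage from the raw counts, rounded to two decimals.
        recomputed = round(100.0 * int(num) / int(den), 2)
        results.append((int(num), int(den), float(reported), recomputed))
    return results


# Example with the identity figures reported for the Ananas comosus hits.
for row in recompute_percentages("Identity = 571/1346 (42.42%)"):
    print(row)
```

A reported value that differs from the recomputed one by more than rounding error would suggest the line was damaged during extraction.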
The following BLAST results are available for this feature:
BLAST of NO20G01910 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 20
Match Name      E-value     Identity  Description
EWM22060.1      0.000e+0    70.96     phosphoribosylformylglycinamidine synthase [Nannoc...
OGW32091.1      2.300e-282  41.71     phosphoribosylformylglycinamidine synthase [Nitros...
KPK82456.1      8.800e-282  43.56     phosphoribosylformylglycinamidine synthase [Gemmat...
KRT73468.1      2.000e-281  43.12     phosphoribosylformylglycinamidine synthase, phosph...
OGP79185.1      2.000e-281  41.65     phosphoribosylformylglycinamidine synthase [Deltap...
OGP35824.1      1.300e-280  43.61     phosphoribosylformylglycinamidine synthase [Deltap...
XP_005648260.1  9.100e-279  41.76     AIR synthase-related protein [Coccomyxa subellipso...
EAY96218.1      1.600e-278  42.80     hypothetical protein OsI_18107 [Oryza sativa Indic...
OAY73949.1      2.000e-278  42.42     putative phosphoribosylformylglycinamidine synthas...
XP_020083748.1  2.000e-278  42.42     probable phosphoribosylformylglycinamidine synthas...

Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name  Unique Name  Species                                       Type
nonsL021      nonsL021     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ncniR020      ncniR020     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region
ngnoR067      ngnoR067     Nannochloropsis oceanica (N. oceanica IMET1)  syntenic_region


This gene is orthologous to the following gene feature(s):

Feature Name  Unique Name  Species                                      Type
NSK002730     NSK002730    Nannochloropsis salina (N. salina CCMP1776)  gene


The following polypeptide feature(s) derive from this gene:

Feature Name  Unique Name           Species                                       Type
NO20G01910.1  NO20G01910.1-protein  Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name               Unique Name  Species                                          Type
jgi.p|Nanoce1779_2|594935  gene_6411    Nannochloropsis oceanica (N. oceanica CCMP1779)  gene
Naga_100052g21             gene8390     Nannochloropsis gaditana (N. gaditana B-31)      gene


The following mRNA feature(s) are a part of this gene:

Feature Name  Unique Name   Species                                       Type
NO20G01910.1  NO20G01910.1  Nannochloropsis oceanica (N. oceanica IMET1)  mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO20G01910 ID=NO20G01910|Name=NO20G01910|organism=Nannochloropsis oceanica|type=gene|length=4623bp
ATGGGATCCTTGAAGGCGCGCAGGCGGCGGCAGCGGCGGAAAAGGTGCCT
TGGTTATGATGCTAATGGGAATGAAAGAGGAGTCCAGGCGAAAATGGAAA
AGCAAACGCTCAAATTCGTCCAAGCGCGGGGGCGGCGGGTGCCGCCCCGG
GTCCTGCAGATCGCGCTGGCGGCACTGTTAATGCTGCTCTCTAGGCCATC
TTACGCTGCCCGACAGTGCGTCCAGATGTTCCTCCCTTTGTTTGTGTCGT
CTCCTCGTTCCGTCCCACTACGAACCCATCCCCTCGTTGTGCGGAGGGCC
ATGACGACGCCGAACCCCAAGCAGCGCCGGCCAGCAGAAGACTGGACAAG
AGACCCAACACCACCTCCACCATGGCATCCATCATCTATTCCATCTACAC
GCGTTTTTACCGTGGGACCACCTATGTCCGGGCCGCGCGAGGCCATTCTC
TTGCGACGCCTCAAAGCACTGGACGCAGCCATCGAGGCCGTGCAGGTTCA
AGAATTCTACTTCCTCAGTGTCTTGCACGAGCAGAAGCAAGAAGACAATT
TGATAGAAAACATTTCGAACGTATTGCGTGAAGCTGCCACCCCTTGGACA
CATGCGCTTTTGCCCTTGGACGATCAATCCACGGCCTCTCACCACTCCCT
CCGTCAGCACTATCGCGCGTACGCCCCTCGTCCTTCCTTCCGTAGCGTTT
GGAGCAGCAACGTCCGTGCAATTCTAGAAGCACGGGGCTGTCTAGCTCAA
GCTGGCGAGAGTGCAGGCGACAGGCTGCAGGAACTGGAGATGGGGCGGTG
GTACCATATCGTTAGTCGCTCCGAGGCCTTGTTGCCTGAAGCCGAAGCCC
TTCTCTACGACCGCATGACCGAGAGCCCCTTTGCCATGCCAGCTGCAGCT
GTCTGGGGAGAGGCCGACAAAACATGGATGCTAGCACCCACTGACAGGCT
GTGGACCGAGAGGGAGAAGGAGCCGAGTGCGATTGTGCTGCCCTTGCATG
AGGAGGGTCCTGAGGTGCTGTATACCGTGAGTGAGAAGTACGGCCTGGGT
TTGGGGGTATGGGAAGTGGCTTTTCTCACAGACTTGTTCGTGTCGGGCAT
TGGACGAGATCCAACCGAGGTGGAAATTTTTGACGTGGCTCAGTCGCTGT
CTGAGCATTGCCGGCACTGGACCTTTGACGGGCAGTTTGTGGTTGACGGA
CAGGCCATGCCCAAGAGCCTGATGGAAATGGTTCGGGAACCCTGGCAGCG
ACAACAGCAGCAGCAACAGCAACAGCAACAGCAACAGCAGCAGAATGAAG
AGGGCAAAGAAGAAGGAGATAACTCGCTGCTGGCCTTTTGTGACAATAGC
TCGGCCATCAAAGGCCCGCGTGTGCTCGACTTTCAACCCGCGACGCCCGA
CCGGCCATCCCGGTACACCTGTGTCGAGGGTGTGCGACATCTGAGTCTGA
CTGCCGAGACACACAATTTTCCTTGCAGCATCGCACCATTTCCGGGGGCG
CAGACGGGCGTTGGAGGTCGCATTCGAGACATTTTGGCCACAGGCCGAGG
CGGCTATACGCTTGCAGGCCTGGCCGGATATGCGGTCGGGAGACTTGAGC
TTCAGGGCGCAGTGGACAGGAAAAAGGAGGATAATCCTCGACCACATCAT
CGCCCTCGGCACGTGGCCTCAGCCCTGGAAATCCTGTTGCGTGCGAGCGA
TGGGGCCTCCGACTATGCAAACAAGTATGGGGAGCCCTTGGTGGGGGGCT
TTGCGCGGGCCATGCCCGCATGGCAGGGCGCTGAATTTTTGAAGCCCATC
ATGTTCTCGGCAGGTGTGGGCAAACTCCCAGCCGAAGCTGTGCGCAAGAA
TGAGCCTGACGTAGGAATGGCAGTAGTGAAGCTGGGGGGACCAGCCTACC
GTGTTGGACTGGGTGGAGGTGCTGCATCGAGCAAAGTGGAGGGGGATCCT
GAATTAAACCTGCAGGCTGTACAACGTGGTGATCCCGAGATGGGCAACAA
GGTGGGCCGCGTGGTGCGGGCCTGTCTGGAGATGCACCATTCTTCACAGT
CACTGCTGTTGGAGAGTGTGCACGACCAGGGGGCGGGCGGCAATGCCAAT
GTGCTCAAGGAGCTGGTGGCCCCGGTGGGCGCGGACGTGTTCCTAAATCG
AATTCAGCGAGGCGACTCCAGCCTGACGCCACTGGAAGTTTGGGGGGCAG
AGTATCAGGAAAGTCAGGGCGTGCTTGTGCAAGGCTCGCAAGCATTGGCT
ACCCTGAGCCGGATATGTCACCGCGAGGCCACACCCATGGAGGTGGTGGG
ACATATAACGGGCCATGGCGCTATACGGGTGTACGAGAACGAGGAGGAAG
CAGTGCACGGATGGCCGCCCGCACGACCTTTGGTGGACTTGCCCCTAGAA
CCCTTGCTCTCACGCCGGCCCAAGCGCGTTATCCAAGCCATAGGGACGAG
GCCAGGAACGAGGGGCGGCATCGAGAAGGCTGCTTCTCCAATGGCGCTCC
GTGATGTATTGCGTGCCATCCTGAGCACAGTGAGTGTGGGCAGCAAGGCC
TTCCTGACCAACAAGGTAGACCGTAGCGTCACGGGTCTGGTAGCAGCACA
ACCTTGCGTAGGGCCCTTGCACTTGCCGGTGGGAGACGTGGGCGTGACGG
CCCTGTCTTTTTGGGGCAAGGAGGGCGTGGCCTCTGCTCTGGGGGAGTGT
CCGGGCATTGGACTGGCGGGCACAAAGACGAGCATCGCTGCCATGGTGCG
CATGGCCGTGGGGGAGGCACTCACAAATCTGGTCTCAGCCCCCATCACAC
GCTGGGCTGACATCAAGCTGCAGGCCAATTGGATGTGGCCAGGTCGGGAA
GGGGAGGCCGCGGGTCAGCTTTACGAGGGCGTGGAGGCGCTGCGAGAAGC
ATTACTGGATCTGGGTCTAGCCCTGGACGGAGGCAAAGATTCCCTGTCCA
TGGCCACGGATTGCGACGATGGTGTGCGGGTCCCTTGCCCTGCTACCGTG
GTAATCACGGCCTATGCACCTTGTGCTGACGTCTCAAGAGTTTTGACGCC
CGACCTGAAGGTGCCGAGCGCGGTTGAGGGCGAGGAAGGTGTCCTTATGC
TTCTGGAGCTAAGTGGGAAGCCTTCAGGGACCAGGCGTCTCGGCGGTTCG
GCCTGGGCGCAAATTATGGGTAAGGACGCGGAGTTGGAAACGCCTGACAT
GCTCGACCCATCCCTTATGAACCGCGCATTTGTGGCGGTGCAGAGGCTGG
CACAAGAGGCCCTGGTCCAAGCATGTCACGACGTGAGCTCCGGGGGCCTA
GTCACCACCGTTTTGGAGATGGCCATGAGCGGAGATGCCGGTGCAAGGCT
AACGTTGCCACCTCACGTCTTCTCCTTTTCGCCATCAGTCTCGGCAGAGG
CCGCCCTTTTCGCGGAAGAGCTGAGCTTGGTGTTGGAGCTCAAGCCGCAG
GACGTGAAGAGAGTTCGGGCCGCGCTGGAGAGCCAGCAGGTGCCGCACCT
CATCATTGGGACTTCCACACCGCAGCGTGAAATGGTAATTGTTGAGGCCA
ATGGACAGGAGGTTTTGCGGGAGGACATGGACGTCTTGCGTGCAGAGTGG
CAGGCGACCAGCTTCCAGCTCGAGAAGCTTCAAGCCAATCCAGCGTGCAT
CGAGGCTGAAGAAGCACTTCTCCCACGCCAGCGACGACCCGAGTGGATTT
TCCCTCCTCCTCGTGCAACAACTACTCCAGCCCTCTCCTTCTCTGCTCGA
CCTAAGCCTCGAGTGGGAGTGTTGCGGGAGCAAGGCAGCAACGGCGACCG
GGAGATGGCGGCGGCTTTGCACAGCGCGGGCTTCGACGTCTGGGACCTGA
CGGTGTATGACCTGCTACAGAACCAGGTGGACCTCCAATCTTTCCACGGC
CTAGCGTTTGTGGGAGGCTTTAGCTACGGCGATGTCTTGGGCTCAGCGCG
AGGCTGGAGAAGCGTGCTCGCAGGAAATCCCCGTGCGGAAGCTCAGATGC
GGCGATTCTTTGCTCGGCCAGAGACCTGGTCTCTAGGACTCTGCAATGGA
TGCCAATTACTTGTCGCCCTAGGCATCGTGCCTGGTCTTGACATGGATGT
AGACGGGAAACCGTCTGAGGCTTCCAACCGTGCGGCGGCATGGTTGGGGG
AGAATGAGAGCGGGCGTTTCGAAAGCCGCTTCGTAACCGTGCACGTCGGA
CCTAGTCCCGCCGTGCTACTTAAGGGCTTGGAAGGAGCGGTGATAGGCAT
ATGGAGTGCACATGGAGAGGGTAATCTGCGCCTTGAGAACGACGCTGTTC
TGGATACTGCTCTTCGCCAGGGTCTTGCGCCAGTGCGATATGTCGATCCG
TGCTGCCACGGTGGGGCATGGGAAGGGACAGAGGAATACCCTTTTAACCC
TAATGGGTCCCCAAGGGGCGTTGCAGCCCTCTGCTCCCCGGATGGCCGGC
ACTTGGCCATGATGCCACATCCAGAGCGCTCTTGGCTGTGCTGGCAAATG
CCTTGGATGCCTCCTGCGGGTGGGCAGGTGGGAGAAGGCGAGGCGGTTTA
CACGCCCTGGTTCCGCCTCTTTCAAAATGCCTTTGACTTTTCTATGAACA
GTATAATGACAGAGAATGCATGA
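
The header of the record above packs its metadata into `key=value` pairs separated by `|`. A minimal Python sketch, assuming that layout holds for every record exported from this page; `parse_fasta_header`, `read_fasta` and `declared_length` are hypothetical helper names, and the check only compares the declared length with the number of residues actually read:

```python
def parse_fasta_header(header):
    """Split '>name key=value|key=value' into a dict of fields."""
    text = header.lstrip(">").strip()
    name, _, rest = text.partition(" ")
    fields = {"name": name}
    for item in rest.split("|"):
        if "=" in item:
            key, value = item.split("=", 1)
            fields[key] = value
    return fields


def read_fasta(text):
    """Return (header_fields, sequence) for a single-record FASTA string."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    fields = parse_fasta_header(lines[0])
    sequence = "".join(lines[1:])  # rejoin the wrapped sequence lines
    return fields, sequence


def declared_length(fields):
    """Extract the integer part of a value such as '4623bp'."""
    return int("".join(ch for ch in fields["length"] if ch.isdigit()))


# Toy record in the same layout (not real data from this page).
fields, seq = read_fasta(">X ID=X|Name=X|organism=Foo bar|type=gene|length=8bp\nACGT\nACGT\n")
print(fields["type"], declared_length(fields) == len(seq))
```

Applied to the gene record, the declared 4623 should equal the residue count: 92 full lines of 50 plus a final line of 23.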

protein sequence of NO20G01910.1

>NO20G01910.1-protein ID=NO20G01910.1-protein|Name=NO20G01910.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1541bp
MGSLKARRRRQRRKRCLGYDANGNERGVQAKMEKQTLKFVQARGRRVPPR
VLQIALAALLMLLSRPSYAARQCVQMFLPLFVSSPRSVPLRTHPLVVRRA
MTTPNPKQRRPAEDWTRDPTPPPPWHPSSIPSTRVFTVGPPMSGPREAIL
LRRLKALDAAIEAVQVQEFYFLSVLHEQKQEDNLIENISNVLREAATPWT
HALLPLDDQSTASHHSLRQHYRAYAPRPSFRSVWSSNVRAILEARGCLAQ
AGESAGDRLQELEMGRWYHIVSRSEALLPEAEALLYDRMTESPFAMPAAA
VWGEADKTWMLAPTDRLWTEREKEPSAIVLPLHEEGPEVLYTVSEKYGLG
LGVWEVAFLTDLFVSGIGRDPTEVEIFDVAQSLSEHCRHWTFDGQFVVDG
QAMPKSLMEMVREPWQRQQQQQQQQQQQQQQNEEGKEEGDNSLLAFCDNS
SAIKGPRVLDFQPATPDRPSRYTCVEGVRHLSLTAETHNFPCSIAPFPGA
QTGVGGRIRDILATGRGGYTLAGLAGYAVGRLELQGAVDRKKEDNPRPHH
RPRHVASALEILLRASDGASDYANKYGEPLVGGFARAMPAWQGAEFLKPI
MFSAGVGKLPAEAVRKNEPDVGMAVVKLGGPAYRVGLGGGAASSKVEGDP
ELNLQAVQRGDPEMGNKVGRVVRACLEMHHSSQSLLLESVHDQGAGGNAN
VLKELVAPVGADVFLNRIQRGDSSLTPLEVWGAEYQESQGVLVQGSQALA
TLSRICHREATPMEVVGHITGHGAIRVYENEEEAVHGWPPARPLVDLPLE
PLLSRRPKRVIQAIGTRPGTRGGIEKAASPMALRDVLRAILSTVSVGSKA
FLTNKVDRSVTGLVAAQPCVGPLHLPVGDVGVTALSFWGKEGVASALGEC
PGIGLAGTKTSIAAMVRMAVGEALTNLVSAPITRWADIKLQANWMWPGRE
GEAAGQLYEGVEALREALLDLGLALDGGKDSLSMATDCDDGVRVPCPATV
VITAYAPCADVSRVLTPDLKVPSAVEGEEGVLMLLELSGKPSGTRRLGGS
AWAQIMGKDAELETPDMLDPSLMNRAFVAVQRLAQEALVQACHDVSSGGL
VTTVLEMAMSGDAGARLTLPPHVFSFSPSVSAEAALFAEELSLVLELKPQ
DVKRVRAALESQQVPHLIIGTSTPQREMVIVEANGQEVLREDMDVLRAEW
QATSFQLEKLQANPACIEAEEALLPRQRRPEWIFPPPRATTTPALSFSAR
PKPRVGVLREQGSNGDREMAAALHSAGFDVWDLTVYDLLQNQVDLQSFHG
LAFVGGFSYGDVLGSARGWRSVLAGNPRAEAQMRRFFARPETWSLGLCNG
CQLLVALGIVPGLDMDVDGKPSEASNRAAAWLGENESGRFESRFVTVHVG
PSPAVLLKGLEGAVIGIWSAHGEGNLRLENDAVLDTALRQGLAPVRYVDP
CCHGGAWEGTEEYPFNPNGSPRGVAALCSPDGRHLAMMPHPERSWLCWQM
PWMPPAGGQVGEGEAVYTPWFRLFQNAFDFSMNSIMTENA*
Synonyms
Publications