EWM25407.1, cds5275 (CDS) Nannochloropsis gaditana

Overview
Name: EWM25407.1
Unique Name: cds5275
Type: CDS
Organism: Nannochloropsis gaditana (N. gaditana B-31)
Alignment location: CM002465.1:629255..629814 -
Alignment location: CM002465.1:628747..629146 -
Alignment location: CM002465.1:628362..628640 -
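As a quick consistency check, the three alignment locations listed above can be summed and compared with the 412-residue protein reported in the BLAST section. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS intervals include the stop codon; the variable and function names are illustrative only.

```python
# Exon intervals copied from the alignment locations on CM002465.1
# (minus strand). Assumed to be 1-based, inclusive coordinates.
segments = [(629255, 629814), (628747, 629146), (628362, 628640)]


def segment_length(start, end):
    """Length of an inclusive interval start..end."""
    return end - start + 1


lengths = [segment_length(s, e) for s, e in segments]
total_nt = sum(lengths)

# A complete CDS should be a whole number of codons.
codons, remainder = divmod(total_nt, 3)

print(lengths)    # [560, 400, 279]
print(total_nt)   # 1239
print(codons, remainder)  # 413 0
```

Under those assumptions, 413 codons would correspond to 412 amino acids plus one stop codon, which agrees with the query length of 412 in the BLAST alignments.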

Properties
Property Name       Value
Protein id          EWM25407.1
Product             papain family cysteine protease containing protein
Orig transcript id  gnl|cribi|Naga_100005g81.6701.mrna
Gbkey               CDS
Expression
No biomaterial libraries express this feature.
Alignments
The following features are aligned
Aligned Feature  Feature Type  Alignment Location
CM002465.1       supercontig   CM002465.1:629255..629814 -
CM002465.1       supercontig   CM002465.1:628747..629146 -
CM002465.1       supercontig   CM002465.1:628362..628640 -
Analyses
This CDS is derived from or has results from the following analyses
Analysis Name                           Date Performed
GO annotation for N. gaditana B31       2020-04-08
BLAST analysis for N. gaditana B-31     2020-04-07
InterPro analysis for N. gaditana B-31  2020-04-06
Gene prediction for N. gaditana B-31    2014-02-18
Annotated Terms
The following terms have been associated with this CDS:
Vocabulary: Molecular Function
Term        Definition
GO:0016787  hydrolase activity
GO:0008234  cysteine-type peptidase activity
Vocabulary: Biological Process
Term        Definition
GO:0019538  protein metabolic process
GO:0006508  proteolysis
Vocabulary: INTERPRO
Term       Definition
IPR025660  Pept_his_AS
IPR000169  Pept_cys_AS
IPR013128  Peptidase_C1A
IPR013201  Prot_inhib_I29
IPR000668  Peptidase_C1A_C
Homology
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|585107168|gb|EWM25407.1| (papain family cysteine protease containing protein [Nannochloropsis gaditana])

HSP 1 Score: 854.744 bits (2207), Expect = 0.000e+0
Identity = 412/412 (100.00%), Positives = 412/412 (100.00%), Query Frame = 0
Query:    1 MYASGLRIAAVAKVVIVLTNLAFALSGREGKAHLRHSSPASRLSLDLEPHHRYTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELRGLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYPYVSGNGSEPMCLNQPLHVGSIAGYVDLPPNDLAAHLLAVNSQPLAISLDASDFHNYHSGVLTFQDCGADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQECAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYPVGAFLFQD 412
            MYASGLRIAAVAKVVIVLTNLAFALSGREGKAHLRHSSPASRLSLDLEPHHRYTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELRGLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYPYVSGNGSEPMCLNQPLHVGSIAGYVDLPPNDLAAHLLAVNSQPLAISLDASDFHNYHSGVLTFQDCGADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQECAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYPVGAFLFQD
Sbjct:    1 MYASGLRIAAVAKVVIVLTNLAFALSGREGKAHLRHSSPASRLSLDLEPHHRYTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELRGLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYPYVSGNGSEPMCLNQPLHVGSIAGYVDLPPNDLAAHLLAVNSQPLAISLDASDFHNYHSGVLTFQDCGADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQECAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYPVGAFLFQD 412          
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|118378379|ref|XP_001022365.1| (papain family cysteine protease [Tetrahymena thermophila SB210] >gi|89304132|gb|EAS02120.1| papain family cysteine protease [Tetrahymena thermophila SB210])

HSP 1 Score: 271.552 bits (693), Expect = 2.297e-84
Identity = 156/365 (42.74%), Positives = 212/365 (58.08%), Query Frame = 0
Query:   51 HRYTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELRGLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYP---YVSG-NGSEPMCLNQPLHVGSIAGYVDLPPNDLAAHLLAV-NSQPLAISLDASDFHNYHSGVLTFQDC----GADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQE--CAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYP 404
             +YTF+++++DF K Y  GS+EY  R+ IF  +L    E  A N +   +++KGVN+++D TD+E      FK ++  + K    +           + S       E+ L  LP +VDWR+KG VT  K+QG CGSCW F+    IES   +++G+L TLS Q+ VSC  NP  CGGTGGC GA  ++ + YV  +G T E  YP   YVSG  G+      Q     ++ GYV L  ND  A L A+ N  PLA+++DAS + NY SGV  F  C      D+NH VVLVGYGTD E G D++L+RNSWG ++GE+GYIRLAR  +  C  D TPLDG  C G      VCG CG+ + ++YP
Sbjct:   24 KQYTFDQYIQDFNKGYQYGSSEYFMRKAIFEQKLA---EIIAFNEQTNQSYKKGVNRFTDLTDSE------FKQNSLGYSKNMSNVRAF-------RNLSVKNLEVTEQQLKELPVNVDWRQKGVVTPVKDQGHCGSCWAFASTATIESYAAINSGQLKTLSTQQLVSCVPNPYQCGGTGGCNGAISELAFNYVQLYGLTSEFKYPYQSYVSGVTGNCTFDSTQQTPEVALDGYVKLQANDYDALLYALANIGPLAVAVDASQWRNYQSGV--FNGCSYTDNIDVNHVVVLVGYGTDPELG-DYWLIRNSWGTKFGENGYIRLARESKVTCGTDYTPLDGQACAGQNVPTKVCGQCGVAYDAAYP 369          
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|118364806|ref|XP_001015624.1| (papain family cysteine protease [Tetrahymena thermophila SB210] >gi|89297391|gb|EAR95379.1| papain family cysteine protease [Tetrahymena thermophila SB210])

HSP 1 Score: 268.47 bits (685), Expect = 3.345e-83
Identity = 154/367 (41.96%), Positives = 211/367 (57.49%), Query Frame = 0
Query:   51 HRYTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELRGLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQGACGSCWTFSGVEAIESALWLSTG--KLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYPYVSGNGSEPMCLNQPLHVGSIA---GYVDLPPNDLAAHLLAVNSQ-PLAISLDASDFHNYHSGVLTFQDC----GADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQ--ECAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYPV 405
              YTFE+++ DF KEY   S EY +R+  F   L  +   N     +  +++KGVN+ +D T  E +  LG K S +    R+  ++ L A   ++ S            LT+LP  VDWR+KG V+  K+QG CGSCW F+    +ESA  ++ G  +L TLS Q+ VSC  NP  CGGTGGC GA  ++ + Y   +G T E  Y Y S  G+   C   P    +     GY ++ PND  A L AV +  P+AIS+DAS++ +Y  GV  F  C      D+NHAVVLVGYGTD + G D++LVRNSWG ++GE GYIR+ R    +CA+DTTP DG GC G  + + VCG CG+L  S+YP+
Sbjct:   25 QNYTFEQYIVDFEKEYEVDSVEYNQRKQTFEKNLVEIIAFN----NKDHSYKKGVNRNTDLTTKEFQVQLGLKKSMK---NRKNPIQRLLAKNNTAAS------------LTDLPQSVDWRQKGVVSPVKDQGGCGSCWAFASAAVLESAAAIAAGPGQLKTLSTQQLVSCVPNPNQCGGTGGCSGAVAELAFSYTTLYGITSEYKYSYQSYFGTTYSCKYDPKTQSAEVINQGYANVTPNDQNALLEAVATVGPIAISVDASNWASYEEGV--FDGCDYSKNVDINHAVVLVGYGTDPKYG-DYWLVRNSWGTDYGEDGYIRVKRESVAQCAMDTTPTDGFGCAGDEEPIKVCGMCGILSDSAYPL 369          
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|669193579|gb|AII16495.1| (cathepsin K, partial [Paracyclopina nana])

HSP 1 Score: 260.381 bits (664), Expect = 4.728e-80
Identity = 157/375 (41.87%), Positives = 216/375 (57.60%), Query Frame = 0
Query:   49 PHHRYTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELRGLL--GFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLF-TLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYPYVSG-NGSEPMCLNQPLHVGSIA---GYVDLPPNDLAA---HLLAVNSQPLAISLDASDFHNYHSGVLTFQDC----GADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQE--CAVDTTPLDGTGC-EGGVDELTVCGTCGMLFSSSYPVG 406
            P ++  FE F + +GK Y   S E  +R +IF   LR ++EHN   S+ G +++K VN+++D T  E + +L  G+  +A+ H    P  +V +                    L +LP  VDWREKG +T  K+QG CGSCW F+  ++IES L +++GK    LS Q   SCT NP  CGGTGGC G+   + + Y    G T E DYPY SG  G+   C  +  ++ ++A   GY  LP N+  A   HL   N  PL++++DAS +  Y +GV  F DC      ++NHAV LVGYGTDE  G D++LVRNSWG  WG+ GYI+L R  E  C +D+TPL GTGC   G + LTVCG CG+LF + YP+G
Sbjct:   22 PSYKSQFEAFEQQYGKSY-KSSAERLKRYSIFVKNLRDIEEHN---SKSGKSWKKAVNKFADLTQEEFKSILSSGYVNAAKPH---GPVADVKAVN------------------LADLPESVDWREKGCITDVKDQGYCGSCWAFAAAQSIESYLQINSGKKAEELSAQHINSCTPNPLQCGGTGGCMGSIPQLAFTYTQLFGITREADYPYTSGTTGNTGNCKFEGSNMEAVATLRGYETLPRNNYEAVMNHL--ANVGPLSVAVDASSWSFYSTGV--FDDCNYSYNIEINHAVQLVGYGTDEFEG-DYWLVRNSWGGFWGDDGYIKLKRESETKCGIDSTPLMGTGCPNDGNEVLTVCGQCGILFDTCYPIG 366          
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|229595148|ref|XP_001019547.3| (papain family cysteine protease [Tetrahymena thermophila SB210] >gi|225566367|gb|EAR99302.3| papain family cysteine protease (macronuclear) [Tetrahymena thermophila SB210])

HSP 1 Score: 256.144 bits (653), Expect = 2.194e-78
Identity = 159/365 (43.56%), Positives = 215/365 (58.90%), Query Frame = 0
Query:   53 YTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELR-GLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYPYVSGNGSEPMC----LNQPLHVGSIAGYVDLPPNDLAAHLLAVNSQ-PLAISLDASDFHNYHSGVLTFQDC----GADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLAR--SQECAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYPV 405
            YTF+++V+DF K Y   S EY +R+ IF  +L+ ++  N+ NS  G  ++KG+NQ++D T  ELR   LG+  + +    +Q     L  + + +              + +LP  VDWR+ G VT  K+QG CGSCW F+    IES   ++TG+L TLS Q+ VSC  N   CGG GGC GA  ++ Y YV   G T E  Y Y S  G    C      QP+ V +I GY+ +P ND A+ + AV +Q PL IS+DAS+FH+Y SGV  F  C      D+NHAVVLVGYGTDE+ G D+++VRNSWG  +GE+GYIR+ R  +  C  D TPLDG GC G      VCG CG+L  S+YP+
Sbjct:   26 YTFDDYVKDFNKAYTKFSAEYNQRKRIFEQKLKEIKAFNS-NSENG--YKKGINQFTDRTAEELRETTLGYSKTVKNAANKQNMFRNLKTSDKIN--------------VKDLPKSVDWRDAGVVTPVKDQGHCGSCWAFATTAVIESYAAIATGQLKTLSTQQLVSCVQNSYQCGGQGGCNGAVSELAYNYVQLFGLTSEYKYSYSSYQGQTGNCTFDPTQQPIEV-TIDGYLKVPENDYASLMNAVATQGPLVISVDASNFHDYESGV--FHGCDGADNVDINHAVVLVGYGTDEKEG-DYWIVRNSWGTRFGENGYIRVKREATPTCKTDFTPLDGNGCVGFAKPQKVCGQCGILSDSAYPL 369          
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|955162080|emb|CUG93941.1| (cysteine proteinase, putative [Bodo saltans])

HSP 1 Score: 253.832 bits (647), Expect = 1.687e-77
Identity = 142/364 (39.01%), Positives = 210/364 (57.69%), Query Frame = 0
Query:   53 YTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELRGLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREK--GAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYPYVSGNGSEPMCLNQPLHVGSIAGYVDLPPN---DLAAHLLAVNSQPLAISLDASDFHNYHSGVLTFQDC----GADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLAR--SQECAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYPV 405
            YTFE++V DFGK+Y   + E+ RR+ +F  +L  V+ HNAA    G +++KG+N  +DWT+ E + L G +  A  H+  +        +++  H          ++ L  LP  VD+R      +T  K+QG CGSCW     E +E+   L+T +LF LS+Q+  +C  NP+ CGGTGGC G+T ++ ++YV + G T E +YPY + NG+  +C         + GYV L  N   D+   L  V   PLAI++DAS + +Y SG+  F  C       ++H V LVGYG D++  +D+++VRNSW   +GESG+IR+ R  + EC  D  P DG GC+GG  +L VCG CG+L  +SYPV
Sbjct:   27 YTFEKYVADFGKKY-NSAAEFARRKVLFNKKLVDVKAHNAA----GLSWKKGINHMADWTEEEFKRLNGGRSRAMGHLADK--------SLQLPHV--------PKKTLAQLPPAVDYRNSIPSVLTAVKDQGMCGSCWAHGSTEQMETFWALATNELFVLSQQQVTACAPNPDQCGGTGGCMGSTAELAFQYVASAGLTQEWEYPYTAYNGTTGVCDGTSDPKIKLTGYVKLTSNSQDDVLNTLATVG--PLAINVDASTWSDYESGI--FSGCNYANNISIDHVVQLVGYGHDDDLNLDYWIVRNSWSPIYGESGFIRVLRQATPECGWDVDPQDGYGCQGGPAQLWVCGMCGLLGDTSYPV 365          
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|118364816|ref|XP_001015629.1| (papain family cysteine protease [Tetrahymena thermophila SB210] >gi|89297396|gb|EAR95384.1| papain family cysteine protease [Tetrahymena thermophila SB210])

HSP 1 Score: 249.21 bits (635), Expect = 1.064e-75
Identity = 150/366 (40.98%), Positives = 209/366 (57.10%), Query Frame = 0
Query:   51 HRYTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELRG-LLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYPYVSGNGSEPMCLN-----QPLHVGSIAGYVDLPPNDLAAHLLAVNSQ-PLAISLDASDFHNYHSGVLTFQDCGADL--NHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQE--CAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYPV 405
            + YTF++++ DF K Y P S EY  R+TIF  +L+ +   N   S +GT ++KGVNQ++D ++ EL    LG+  S Q                  S S+   + +     L  LP+ VDWREK  VT  K+QG CGSCW F+    IES   +++GKL TLS Q+ VSC  N  +CGG GGC G+  ++ + YV   G T +  Y Y S  G E    +     Q + V  + GY+ L  N     ++A+ +  PLA+++DAS +H+Y  GV    D  A++  NHAVVLVGYGTDE  G D++LVRNSWG ++GE+GYIRL R  +  C VD TPL G  C+G      VCG C +L+ +SYP+
Sbjct:   28 NNYTFDQYITDFNKGYTPNSPEYHMRKTIFNKKLQAIISFN---SLQGTYYKKGVNQFTDQSEQELENQTLGYVSSGQ--------------KSPFSSSSRLLSSTLSNVTLQELPASVDWREKNVVTPVKDQGKCGSCWAFASAATIESHAAIASGKLKTLSTQQLVSCAQNSYNCGGVGGCHGSIAELAFSYVQLFGITSDYKYSYSSYQGVEEGSCSFNPDKQSVEV-MLDGYLKLTTNSYEDIMVALATVGPLAVAVDASKWHDYEGGVFDGCDYTANMNVNHAVVLVGYGTDEVEG-DYWLVRNSWGTKFGENGYIRLRRESQTKCGVDYTPLKGQACKGQEAPTRVCGQCAILYDASYPL 374          
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|514692042|ref|XP_004993594.1| (hypothetical protein PTSG_05727 [Salpingoeca rosetta] >gi|326428462|gb|EGD74032.1| hypothetical protein PTSG_05727 [Salpingoeca rosetta])

HSP 1 Score: 244.973 bits (624), Expect = 8.355e-74
Identity = 150/363 (41.32%), Positives = 206/363 (56.75%), Query Frame = 0
Query:   53 YTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAELRGLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAH-GFTLERDYPYVSGNGSEPMC-LNQPLHVGSIAGYVDLPPNDLAAHLLAV-NSQPLAISLDASDFHNYHSGVLTFQDC---GADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQ----ECAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYPV 405
            Y+FE F  ++GK Y+  S E+  R  +F   L  V+ HN+  ++   T+++G+N  SDWTD E + LLG+     + + R P      A ++                +  LP  VDWR K  VT  K+QG CGSCW+F   E +ES + + TG L  LSEQ  + CT NPE CGGTGGC G T +I YE++  H G   E  YPY+S  G    C   + + V ++ GYV LP N     + A+ N  P++IS++A  + NY SG+  F  C     D++HAV LVGYG D   G  ++LVRNSW   WGESGYIR+ R+      C +D TP DG+GC+GG D++ VCGTCG+LF + YP 
Sbjct:   55 YSFEHFKAEYGKRYL-SSEEHDFRRQVFERTLASVKAHNSDPTK---TWKQGINHMSDWTDGEFKRLLGYDKGIGYSLHR-PTPPGFKANVD----------------VNGLPDSVDWRTKHVVTAVKDQGQCGSCWSFGSAETLESHVAVQTGTLEVLSEQNILDCTPNPEECGGTGGCQGGTAEIAYEHMAKHGGLQTEWTYPYLSWYGDNYKCHFKEKMSVVNVTGYVKLPSNQYEPLMDAIANKGPISISVEAVAWKNYESGI--FDGCNQTNPDIDHAVQLVGYGDDNSQG--YWLVRNSWTPHWGESGYIRIRRTANEGGRCGMDITPQDGSGCKGGPDKVKVCGTCGILFDNVYPT 392          
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|1002456371|ref|XP_015662324.1| (putative cysteine proteinase [Leptomonas pyrrhocoris] >gi|928119429|gb|KPA83885.1| putative cysteine proteinase [Leptomonas pyrrhocoris])

HSP 1 Score: 243.817 bits (621), Expect = 9.735e-74
Identity = 147/367 (40.05%), Positives = 207/367 (56.40%), Query Frame = 0
Query:   54 TFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGT-TFRKGVNQYSDWTDAELRGLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREK--GAVTKPKNQGACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVL-AHGFTLERDYPYVSGNGS----EPMCLNQPLHVGSIAGYVDLPPNDLAAHLLAVNSQ-PLAISLDASDFHNYHSGVLTFQDCGADL--NHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQ--ECAVDTTPLDGTGCEGGVDELTVCGTCGMLFSSSYPVGA 407
            +F+ ++R +GK Y   + EY +R  IF +RLR ++E N    R G  ++R+G+N+ +DWT+ E+  L G +P                  M S +  SS       R    LP  VD+R      +T  K+QG+CGSCW  S VEA+ES   ++TG L  LS+Q+  +CT NP HCGGTGGC G+ + + Y+YV  A G   E  YPY +  G     E M  ++P  V S   YV+LP ND  A + AV  + P+A+S+DAS + +Y  G+    D   ++  NHAV LVGYG D ++G D++++RNSWG  WGE GYIRL R +  +C        G  C+G  DE+ VCG CG+L  S+YPV A
Sbjct:   28 SFDAYIRQYGKRY--SAVEYSKRLKIFTERLREIEEFN----RDGKHSYRRGLNKLTDWTEDEIGALNGARP------------------MMSRNLRSSVPKHIYNRSSHTLPRRVDYRTSVPPVLTSIKDQGSCGSCWAHSAVEAMESHWAIATGHLHVLSQQQVTACTPNPRHCGGTGGCDGSIEALAYDYVAGAGGIQEEWGYPYTAFYGETGKCEDMRTSRPAKVSS---YVELPANDQEALMDAVAFKGPIAVSVDASRWFSYRGGIFDGCDYSVNITQNHAVQLVGYGHDYDSGKDYWIIRNSWGPLWGEEGYIRLLREKTPQCGWAVDAHSGAACDGDPDEVWVCGMCGILSGSTYPVMA 367          
BLAST of EWM25407.1 vs. NCBI_GenBank
Match: gi|225718616|gb|ACO15154.1| (Cathepsin K precursor [Caligus clemensi])

HSP 1 Score: 240.35 bits (612), Expect = 2.651e-72
Identity = 153/378 (40.48%), Positives = 211/378 (55.82%), Query Frame = 0
Query:   44 SLDLEPHHRYTFEEFVRDFGKEYVPGSTEYQRRETIFADRLRVVQEHNAANSRRGTTFRKGVNQYSDWTDAEL-RGLLGFKPSAQWHVKRQPRLEVLSAAMESSHSNSSGAFSRQERLLTNLPSHVDWREKGAVTKPKNQG-ACGSCWTFSGVEAIESALWLSTGKLFTLSEQEFVSCTSNPEHCGGTGGCFGATQDILYEYVLAHGFTLERDYPYVSG--NGSEPMCLN--QPLHVGSIAGYVDLPPNDLAA---HLLAVNSQPLAISLDASDFHNYHSGVLT-FQ-DCGADLNHAVVLVGYGTDEETGIDFFLVRNSWGEEWGESGYIRLARSQE--CAVDTTPLDGTGC-EGGVDELTVCGTCGMLFSSSYPVGA 407
            SL+  P+    FEEF + FGK Y    T Y +R  IF   LRV+  HNA     G ++   VN+++D T+ E  +  LG++      V +  RL        SS  N++         + NLP  VDWR+KGAV   + QG  CGSCW  S    IES ++++ G L TLS Q+  SC  NP  CGG GGC G+   + + Y   +G T E +YPY+SG  N +E    N    + +  + GY  LP ND+ A   HL  V   PL++++D++ +H+Y  GV+  F  D   +LNH V L+GYG DE+ G  ++L++NSWG +WGE G+IR+ R  E  C  D TPL+GTGC   G D   VCG  G+LF SSYP+GA
Sbjct:   20 SLEFSPYEIQRFEEFQKTFGKVYDDRMT-YSKRLRIFIHNLRVINAHNA---NPGRSYDLAVNKFTDLTEKEFTQRFLGYQKVPG--VSKNRRL--------SSKGNATS--------MENLPEEVDWRKKGAVGIMRWQGLICGSCWAVSSTGIIESHVFINEGILPTLSIQQVTSCALNPYSCGGKGGCDGSISQVAFMYAQLYGLTSEEEYPYISGMTNQTETCKFNFTDSVALARVRGYETLPSNDMEAVMRHLAEVG--PLSVNVDSTLWHSYGGGVMDGFDFDKNINLNHIVQLIGYGLDEKQG-PYWLIKNSWGSDWGEEGFIRIKRYSETQCGFDATPLNGTGCVNDGNDVQHVCGNFGVLFDSSYPLGA 372          
The following BLAST results are available for this feature:
BLAST of EWM25407.1 vs. NCBI_GenBank
Analysis Date: 2020-04-07 (BLAST analysis for N. gaditana B-31)
Total hits: 10
Match Name                         E-value    Identity (%)  Description
gi|585107168|gb|EWM25407.1|        0.000e+0   100.00        papain family cysteine protease containing protein...
gi|118378379|ref|XP_001022365.1|   2.297e-84  42.74         papain family cysteine protease [Tetrahymena therm...
gi|118364806|ref|XP_001015624.1|   3.345e-83  41.96         papain family cysteine protease [Tetrahymena therm...
gi|669193579|gb|AII16495.1|        4.728e-80  41.87         cathepsin K, partial [Paracyclopina nana]
gi|229595148|ref|XP_001019547.3|   2.194e-78  43.56         papain family cysteine protease [Tetrahymena therm...
gi|955162080|emb|CUG93941.1|       1.687e-77  39.01         cysteine proteinase, putative [Bodo saltans]
gi|118364816|ref|XP_001015629.1|   1.064e-75  40.98         papain family cysteine protease [Tetrahymena therm...
gi|514692042|ref|XP_004993594.1|   8.355e-74  41.32         hypothetical protein PTSG_05727 [Salpingoeca roset...
gi|1002456371|ref|XP_015662324.1|  9.735e-74  40.05         putative cysteine proteinase [Leptomonas pyrrhocor...
gi|225718616|gb|ACO15154.1|        2.651e-72  40.48         Cathepsin K precursor [Caligus clemensi]
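The identity percentages in this summary can be recomputed from the match counts on each HSP statistics line (for example, 156/365 for the first Tetrahymena thermophila hit). The following is a minimal sketch, not part of the original analysis pipeline; `parse_hsp_stats` is a hypothetical helper, and the regular expression assumes the `Label = matches/length` layout seen in the HSP lines on this page.

```python
import re


def parse_hsp_stats(line):
    """Extract 'Label = matches/length' pairs from an HSP statistics line
    and recompute each percentage, rounded to two decimals.

    Fields without a matches/length fraction (e.g. 'Query Frame = 0')
    are ignored.
    """
    stats = {}
    for label, matches, length in re.findall(r"(\w+) = (\d+)/(\d+)", line):
        matches, length = int(matches), int(length)
        stats[label] = (matches, length, round(100 * matches / length, 2))
    return stats


# Statistics line for the hit gi|118378379|ref|XP_001022365.1|
example = "Identity = 156/365 (42.74%), Positives = 212/365 (58.08%), Query Frame = 0"
for label, (matches, length, percent) in parse_hsp_stats(example).items():
    print(f"{label}: {matches}/{length} = {percent}%")
```

Because the helper keys the result on whatever label appears before the equals sign, it does not depend on the exact spelling of the field names in the input.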
Relationships

This CDS is a part of the following mRNA feature(s):

Feature Name  Unique Name  Species                                      Type
rna5288       rna5288      Nannochloropsis gaditana (N. gaditana B-31)  mRNA

