NO04G02740, NO04G02740 (gene) Nannochloropsis oceanica

Overview
Name: NO04G02740
Unique Name: NO04G02740
Type: gene
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 3526
Alignment location: chr4:724996..728521 -
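
The alignment location and the sequence length above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though it is consistent with the stated length of 3526); the names `Location`, `parse_location`, and `span_length` are hypothetical helpers, not part of the database.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as shown on the page, e.g. 'chr4:724996..728521 -'."""
    seqid: str
    start: int
    end: int
    strand: str


def parse_location(text: str) -> Location:
    """Parse 'seqid:start..end strand' into its four parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)\s*([+-])", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end, strand = match.groups()
    return Location(seqid, int(start), int(end), strand)


def span_length(loc: Location) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return loc.end - loc.start + 1


loc = parse_location("chr4:724996..728521 -")
print(loc.seqid, loc.strand, span_length(loc))
```

Under that assumption the span is 728521 - 724996 + 1 = 3526, which agrees with the sequence length reported for this gene.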

Properties
Property Name | Value
Description | Pentatricopeptide repeat containing protein
Mutants
Expression

Alignments
The following features are aligned
Aligned Feature | Feature Type | Alignment Location
chr4 | genome | chr4:724996..728521 -
Analyses
This gene is derived from or has results from the following analyses
Analysis Name | Date Performed
GSE178672 | 2023-12-15
PRJNA933693 | 2023-10-26
PRJNA943449 | 2023-07-19
GSE149904 | 2020-10-10
NoIMET1_WT_HS | 2020-09-29
GSE139615 | 2020-03-11
PRJNA241382 | 2020-03-11
BLASTP analysis of N. oceanica IMET1 genes | 2019-07-11
InterPro analysis for N. oceanica IMET1 genes | 2019-07-11
PRJNA182180 | 2019-04-25
Gene prediction for N. oceanica IMET1 | 2017-10-24
Annotated Terms
The following terms have been associated with this gene:
Vocabulary: INTERPRO
Term | Definition
IPR002885 | Pentatricopeptide_repeat
IPR033443 | PPR_long
Homology
BLAST of NO04G02740 vs. NCBI_GenBank
Match: EWM28763.1 (Pentatricopeptide repeat containing protein, partial [Nannochloropsis gaditana])

HSP 1 Score: 216.9 bits (551), Expect = 3.200e-52
Identity = 310/657 (47.18%), Positives = 341/657 (51.90%), Query Frame = 0
Query:   82 PFTAIGSSLDAPRWPG------DGIAGASTGTNHNPGGVGLSRCMLSSPNPPSSPSPAGLHIRTWDPPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXVSTTLSHCKKSIPKQHEWSHGGLGNMDPHMSPLSNSGSPSLISPSPTVSNYSFGFQSPMPSDLESSFANLLLDRTGGSAVEGGACTPRGRAQALQXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLRHVHGIAVGLGLDDRPPLHNQLQGSPTSAGLRILTGRQFPSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXQIPASTSAFGSGPSSAPAAGMSNRSHLFLSGSDEVGLHARGTYSPRLQGQTSPRFLGS-XXGGIGXXMNTRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPRGMGYSGHGGEGPMDFETKRCSNAIKKLCSAGNVTEVFAVLEDMLGHGLVPDLKTVKTIMRTCVKKLAWRRAVQLLHTTGLPLDVVIFTMAINTCGKAGEWEQGLRVLREMDSEEAKAHGIVP 732
            PF  I  S   P W        +G + +S G  H     G S  +L+  +PP SP P  L +R  D                 XXXXXXXXXXXX                             H +K      +W  G L  M   MS     GSP   S SPT S+   G +SP+  +LE S + L+LDR G  A    A    GR+       XXXXXXXXXXXXXXXXXXXXXXXXXXX     LR  HG+  G   D+R   ++QL       G+R    R + S                                                   QIP  +SA GSGPSSAPAAGMSNRS+LF  G  E G + RG++SP L  Q SPR  GS   G +G  M  R                                             XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX                         G MD E KRCSNAIKKLCSAGNVTEVFAVLEDMLG GLVPDLKTVKTIMRTCVKKLAWRRAVQLLH  GLPLDVVIFTMAINTCGKAGEWEQGLRVLREMDS++AKA GI+P
Sbjct:   63 PFHNISPSFSTPPWSNEWGDSREGKSSSSAGMGH-----GNSASVLNHLHPPVSPPPCALKLRPRDSASSNCSMNEFYCRTEGXXXXXXXXXXXXVNRFVVPGTEACLTSGSPSDLSSPGPDPRHSRK-----QDWGPGAL-EMGSRMSS-PGGGSPLFRSSSPTASH---GGRSPV-VELECSLSGLVLDRGGERAGSESAWALPGRSPDNVKYQXXXXXXXXXXXXXXXXXXXXXXXXXXXPMSRQLRGAHGLG-GSAFDERSAFNSQLH----VGGVRGQGNRPYAS----------------------------------NQPQKPPAPQPHQALMIQIPTLSSALGSGPSSAPAAGMSNRSYLFAEGGSEGGRN-RGSFSPTLHNQASPRPPGSGNLGNMGMGMAARNSFHNQSPRQYHYQANQSPYMHGAAGRSGSLGRSGVSGAAYGGSPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG------------------------GAMDVEAKRCSNAIKKLCSAGNVTEVFAVLEDMLGRGLVPDLKTVKTIMRTCVKKLAWRRAVQLLHMEGLPLDVVIFTMAINTCGKAGEWEQGLRVLREMDSDDAKARGILP 639          
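
The identity and positives percentages in the HSP header follow directly from the match counts and the alignment length. A minimal sketch of that arithmetic, where `percent` is a hypothetical helper and the three numbers are taken from the HSP line above:

```python
def percent(matches: int, aligned_length: int) -> float:
    """Share of aligned columns that match, as a percentage rounded to 2 places."""
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    return round(100.0 * matches / aligned_length, 2)


ALIGNED_LENGTH = 657   # alignment columns, gaps included
IDENTICAL = 310        # columns with the same residue in query and subject
POSITIVES = 341        # identical columns plus conservative substitutions

identity_pct = percent(IDENTICAL, ALIGNED_LENGTH)
positives_pct = percent(POSITIVES, ALIGNED_LENGTH)
print(f"identity {identity_pct:.2f}%, positives {positives_pct:.2f}%")
```

Both values are computed over the full alignment length of 657 columns, not over the length of either protein, so they describe only the aligned region.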
The following BLAST results are available for this feature:
BLAST of NO04G02740 vs. NCBI_GenBank
Analysis Date: 2019-07-11 (BLASTP analysis of N. oceanica IMET1 genes)
Total hits: 1
Match Name | E-value | Identity | Description
EWM28763.1 | 3.200e-52 | 47.18 | Pentatricopeptide repeat containing protein, parti...
Relationships

This gene is a member of the following syntenic_region feature(s):

Feature Name | Unique Name | Species | Type
nonsL081 | nonsL081 | Nannochloropsis oceanica (N. oceanica IMET1) | syntenic_region
ncniR058 | ncniR058 | Nannochloropsis oceanica (N. oceanica IMET1) | syntenic_region
ngnoR111 | ngnoR111 | Nannochloropsis oceanica (N. oceanica IMET1) | syntenic_region


The following polypeptide feature(s) derive from this gene:

Feature Name | Unique Name | Species | Type
NO04G02740.1 | NO04G02740.1-protein | Nannochloropsis oceanica (N. oceanica IMET1) | polypeptide


The following gene feature(s) are orthologous to this gene:

Feature Name | Unique Name | Species | Type
Naga_100002g1 | gene1930 | Nannochloropsis gaditana (N. gaditana B-31) | gene


The following mRNA feature(s) are a part of this gene:

Feature Name | Unique Name | Species | Type
NO04G02740.1 | NO04G02740.1 | Nannochloropsis oceanica (N. oceanica IMET1) | mRNA


Sequences
The following sequences are available for this feature:

gene sequence

>NO04G02740 ID=NO04G02740|Name=NO04G02740|organism=Nannochloropsis oceanica|type=gene|length=3526bp
AAAGGACAGGCTAAACGAGGTGTTTTGCGACTTGAGACTCGCCCAACGGC
GATATATCCTAGCCTCCCAATTGCCCCACGTTGACCTCAGGGGACAAGAG
CAAAGGGACGAAATGCCCTACAGCTACCCAGGACGCCTTCATTTCCCGAT
ATCAGACCTCTGACATCACCTTCTTTCTCCTCGATAACATGCGTGTCCCC
AGCTTTAGACAGGAATGTCATATCCTCGTAGGACATAAGGGCGAAAGGTT
CAAGACCCAGCCAGGTGTCTGCTTGAGCGCAGCAACGCTTCTGGACGAAG
GATGAAGAAAAAATCCCACGGCACAGATATGAACGAACCGCCTACCTTGG
ATTGCTCTCGCGGCGATAACACATCCGGCTACAGTACGGAGAGCAGCAAC
AGCAGCAGCGGCCTGTGGTCTTTCGGCGGCAGCGGCAGGGGGGGCAATGG
AAGCACAAGCAGCAACAGGAGAGGGACCGTAAGCAGTGCAAGCAGCGCAA
GCAGTGGCAAGGCCCCCGACACTCCGTCGTCCCACGACTGTGTTCCTTTC
ACTGCAATTGGCTCCTCGCTCGATGCTCCCCGTTGGCCTGGTGATGGGAT
CGCAGGGGCCTCTACCGGCACCAACCACAATCCTGGCGGCGTGGGCTTGA
GTCGGTGTATGTTGTCGTCCCCGAACCCCCCGTCCTCCCCATCGCCCGCA
GGCCTCCACATCCGCACCTGGGATCCACCACCTCATTCCCCGACGACGTA
CGGCAGCCAATCAGAAGGTGCAGCAGCAGCAGCAGCGGCAGCTGCGGGGG
CGGCCATTCAACGTAATGGACTACCTAGCGGGAAGGAGCATCACCTTTAT
TCGGGTTCGTCGTCCGCGGCCTCGTCGCCTGTTTCTACGACGTTGAGCCA
TTGTAAGAAAAGTATACCAAAGCAGCACGAATGGAGCCACGGGGGATTAG
GGAATATGGATCCTCATATGTCTCCCTTGTCAAATAGTGGCTCGCCGTCC
TTGATCTCGCCGAGTCCCACCGTCAGCAACTACAGCTTTGGATTTCAGTC
ACCCATGCCATCTGATCTCGAATCGTCATTTGCAAACCTGCTGCTAGATC
GAACGGGGGGTAGTGCGGTGGAGGGAGGGGCATGCACGCCCCGAGGACGG
GCACAGGCCTTGCAGCAGTTTCAACAGCAGCAGCAGCAGCAGCAGCAGCA
TCATCATCATCATTACCATCATCATCAACAACAGCAGCAGCAGCACATTC
AGCAGCAGCAGCCACCGCAGCTTCGCCACGTGCATGGTATTGCAGTAGGA
CTGGGACTTGATGACCGTCCTCCCCTTCACAATCAACTTCAAGGTTCTCC
TACTTCCGCTGGACTACGCATTCTCACCGGTCGTCAGTTTCCCTCCCAGC
AGCATCAGCAGCAGCAGCAGGAGCAGCAGCAGCAGCAGGAGCAGCAGGAG
CAGCAGGAGCAGCCGCTGCCGCCGCAGCAGCAGCCACAGCCGCCACAGCA
GCAGCAGCACCTCCACCACCACCACCACCAGCAGCAGTCGCTCATAATCC
AGATCCCAGCGTCGACCTCTGCCTTTGGCTCGGGTCCGAGCAGTGCTCCC
GCCGCAGGGATGAGCAATCGAAGCCATCTCTTCTTGAGTGGTAGTGATGA
GGTCGGGCTCCACGCTCGGGGCACGTACTCTCCACGCCTGCAAGGGCAGA
CTTCACCTCGTTTCCTGGGCAGCACTACGGGAGGGATTGGGGGAGGGATG
AACACGAGGGGCATGGGTGGAGGGGGAGGGGGTGGGAACGGTTTTCACAA
CCGATCTCCTCGGCAGCATCATCCCCAGCAACAGCAGGGGGGCAATTCTC
TGTATATGCATAATGTAGGGGGAGGGCGGAGCAGCAGCTTAAGTAGTCGT
GGTAGTGGTGGTGGAGGAATGATGAATGGTGGGTATGGTGGAGGGTCTCC
GCACCACTTAATGGCTGGCGGAAGTGGTCGACGTGGAGGAGGAGGAGGAG
GAGGAGGAGGCGGAAGCCGTGGTGGTGGCGGAGGGTTTTATTCGCCGACG
GGATCGAGTAGTAGTGGGATGGGGAGTCCGATTGGGGGAGGGCGCCTGGA
AGTTTATTATGGTGGTGGATCACCACGAGGGATGGGGTATAGTGGGCATG
GAGGTGAAGGACCTATGGATTTTGAGACGAAGCGTTGCTCGAACGCAATT
AAGAAGCTGTGCAGCGCGGGGAATGTGACAGAGGTGTTCGCGGTGCTGGA
GGATATGCTGGGGCATGGGCTGGTGCCGGACCTGAAGACGGTGAAGACCA
TCATGCGGACATGCGTGAAGAAGCTGGCCTGGCGACGGGCGGTGCAGCTG
CTGCACACAACAGGCCTGCCGCTGGACGTGGTCATCTTCACAATGGCTAT
CAACACTTGCGGTAAGGCCGGAGAGTGGGAGCAGGGCCTGCGGGTGCTGC
GGGAGATGGACTCTGAGGAGGCCAAGGCCCACGGCATCGTGCCCAACGAG
GTGAGCTACGGGACGGCCATCTCGGCTTGCGGCAAGGCGGGCCGGTGGGA
GCTGGCGCTCTCGCTCTTAAACGAGGTCAAGGATCGCGGCCTGCTCCTCA
ATGATGTTTGTTACGGCGGGGCAATTGATGCCTGCGGGCGCGCGGGTCAG
TGGCAGGAGGCACTCAAGCTTCTGAATGAGATGACGCTGGACGGGGTGGC
GGCCAATGAGGTGTGCTACAACTCGGCCATCTCCGCCTGCGGCAAGGCCG
GGGAGTGGGAGAAGGCGGTGGCTTTGTTGCGCGAGATGCGGGAGCGGGGC
CTGACGCCCGACGAGCTGAACTACAACTCCGCTATCTCGGCCTGCGGCAA
GGCGGCCCAGTGGGCGCACGCCATTCGCCTGCTCCGCGAGATGGCCGTAC
AAGGGTTGTGCCCGGATGTGGTGTCCTACGGGGCGGCCATCGATGCCTGC
CGTAAGGCCGGCAAGTACGACCGAGGCCTGGCGCTGCTGGGCGAGATGCG
CCAGGTGGGCCTGGTGCCCAACGAGGTTTGCTACCACTCGGCGGTCACGG
CCTGCTCCGAGGCAGGCCGGTGGAATGAGGCCCTGTTTGTGCTCTCGGAG
ATGGTCAAGCAAGGCCTCAAGCCGGATGCCATCAGCTACAATGCGGCTGT
GGAGGCCTGCGCCAAGTCGGGAAAGTGGGACGCCCTGGTCATCCTGCTGC
AGGAGACGGCTCGGGACCCGGACACCACACTTTTGGATGCCGTCTACAAC
AATGCCGTCATAACCTGCGCCAAGTACGGCAACTGGGAGTTGGCGGCCAC
CGTGCTGCAGGAGATGGAGTATTTGGGTGGGAATGGTGGAAGTGGGGCGG
GGACGCCGGGCGGCAGCAATAGCAGCAGTCATCCGGGCTTGTCGCCCACC
TCGGCCTCCTACTCTGCGGTCATTGACGCTTGTGCCAAGGCCCTGGCCGC
CGCAGGAGCAGCAGGTGACGGGAGCCCCTCCTCTTCACCCCCCCCCCTTC
CGGCGGCCTGCCCCAGTGGGAGCTGA
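
The FASTA header of the gene sequence packs several fields into one line, separated by `|` and written as `key=value`. A minimal sketch of how such a header could be split into its parts, assuming this layout holds for other records as well; `parse_header` and `declared_length` are hypothetical helpers:

```python
import re


def parse_header(line: str) -> tuple[str, dict[str, str]]:
    """Split '>name key=value|key=value|...' into the name and a field dict."""
    if not line.startswith(">"):
        raise ValueError("a FASTA header must start with '>'")
    name, _, rest = line[1:].strip().partition(" ")
    fields = dict(item.split("=", 1) for item in rest.split("|") if "=" in item)
    return name, fields


def declared_length(fields: dict[str, str]) -> int:
    """Extract the leading integer from a length field such as '3526bp'."""
    match = re.match(r"\d+", fields.get("length", ""))
    if match is None:
        raise ValueError("header has no numeric length field")
    return int(match.group())


header = (">NO04G02740 ID=NO04G02740|Name=NO04G02740|"
          "organism=Nannochloropsis oceanica|type=gene|length=3526bp")
name, fields = parse_header(header)
print(name, fields["type"], declared_length(fields))
```

Comparing `declared_length(fields)` with the number of bases actually read from the record is a cheap way to detect a truncated copy of the sequence.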

protein sequence of NO04G02740.1

>NO04G02740.1-protein ID=NO04G02740.1-protein|Name=NO04G02740.1|organism=Nannochloropsis oceanica|type=polypeptide|length=1075bp
MKKKSHGTDMNEPPTLDCSRGDNTSGYSTESSNSSSGLWSFGGSGRGGNG
STSSNRRGTVSSASSASSGKAPDTPSSHDCVPFTAIGSSLDAPRWPGDGI
AGASTGTNHNPGGVGLSRCMLSSPNPPSSPSPAGLHIRTWDPPPHSPTTY
GSQSEGAAAAAAAAAGAAIQRNGLPSGKEHHLYSGSSSAASSPVSTTLSH
CKKSIPKQHEWSHGGLGNMDPHMSPLSNSGSPSLISPSPTVSNYSFGFQS
PMPSDLESSFANLLLDRTGGSAVEGGACTPRGRAQALQQFQQQQQQQQQH
HHHHYHHHQQQQQQHIQQQQPPQLRHVHGIAVGLGLDDRPPLHNQLQGSP
TSAGLRILTGRQFPSQQHQQQQQEQQQQQEQQEQQEQPLPPQQQPQPPQQ
QQHLHHHHHQQQSLIIQIPASTSAFGSGPSSAPAAGMSNRSHLFLSGSDE
VGLHARGTYSPRLQGQTSPRFLGSTTGGIGGGMNTRGMGGGGGGGNGFHN
RSPRQHHPQQQQGGNSLYMHNVGGGRSSSLSSRGSGGGGMMNGGYGGGSP
HHLMAGGSGRRGGGGGGGGGGSRGGGGGFYSPTGSSSSGMGSPIGGGRLE
VYYGGGSPRGMGYSGHGGEGPMDFETKRCSNAIKKLCSAGNVTEVFAVLE
DMLGHGLVPDLKTVKTIMRTCVKKLAWRRAVQLLHTTGLPLDVVIFTMAI
NTCGKAGEWEQGLRVLREMDSEEAKAHGIVPNEVSYGTAISACGKAGRWE
LALSLLNEVKDRGLLLNDVCYGGAIDACGRAGQWQEALKLLNEMTLDGVA
ANEVCYNSAISACGKAGEWEKAVALLREMRERGLTPDELNYNSAISACGK
AAQWAHAIRLLREMAVQGLCPDVVSYGAAIDACRKAGKYDRGLALLGEMR
QVGLVPNEVCYHSAVTACSEAGRWNEALFVLSEMVKQGLKPDAISYNAAV
EACAKSGKWDALVILLQETARDPDTTLLDAVYNNAVITCAKYGNWELAAT
VLQEMEYLGGNGGSGAGTPGGSNSSSHPGLSPTSASYSAVIDACAKALAA
AGAAGDGSPSSSPPPLPAACPSGS*
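
The protein sequence can be related back to the gene sequence by translating the open reading frame with the standard genetic code. The sketch below is illustrative only: it assumes the standard nuclear code applies, and it uses two short fragments copied from the gene sequence above (the first 48 bases starting at the `ATG` that opens the reading frame, and the last 24 bases) rather than the full record. `translate`, `ORF_START`, and `ORF_END` are hypothetical names.

```python
# Standard genetic code, laid out in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 0; unknown codons become 'X', stops become '*'."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3))


# Fragments copied from the gene sequence on this page (assumed in-frame).
ORF_START = "ATGAAGAAAAAATCCCACGGCACAGATATGAACGAACCGCCTACCTTG"
ORF_END = "GCGGCCTGCCCCAGTGGGAGCTGA"

print(translate(ORF_START))
print(translate(ORF_END))
```

The two fragments translate to the first 16 residues and the last 7 residues plus the stop symbol of NO04G02740.1. The `length=1075bp` value in the protein header appears to count the trailing `*`, which would leave 1074 amino acids.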
Synonyms
Publications