NO20G02540.1, NO20G02540.1 (mRNA) Nannochloropsis oceanica

Overview
Name: NO20G02540.1
Unique Name: NO20G02540.1
Type: mRNA
Organism: Nannochloropsis oceanica (N. oceanica IMET1)
Sequence length: 1814 bp
Alignment location: chr20:800589..803929 +
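
The alignment location and the sequence length above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the database record: the function names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive. Under that assumption the genomic span works out to 3341 bp, which agrees with the length given for the genomic sequence further down this page. The difference from the 1814 bp spliced transcript (1527 bp) would then presumably be intron sequence.

```python
import re

def parse_location(text):
    """Parse a location such as 'chr20:800589..803929 +'.

    Returns (seqid, start, end, strand); strand is None if absent.
    """
    m = re.fullmatch(r"\s*(\S+?):(\d+)\.\.(\d+)\s*([+-])?\s*", text)
    if m is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return m.group(1), int(m.group(2)), int(m.group(3)), m.group(4)

def span_length(start, end):
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1

seqid, start, end, strand = parse_location("chr20:800589..803929 +")
genomic = span_length(start, end)   # genomic span covered by the mRNA
spliced = 1814                      # sequence length from the overview
print(seqid, strand, genomic, genomic - spliced)
```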


Expression
No biomaterial libraries express this feature.
Alignments
The following features are aligned:
Aligned Feature  Feature Type  Alignment Location
chr20            genome        chr20:800589..803929 +
Analyses
This mRNA is derived from or has results from the following analyses:
Analysis Name                            Date Performed
Gene prediction for N. oceanica IMET1    2017-10-24
Homology annotation for N. oceanica IMET1  2017-10-25
InterPro analysis for N. oceanica IMET1  2017-10-25
Homology
BLAST of NO20G02540.1 vs.
Match: XP_009035509.1 (hypothetical protein AURANDRAFT_58891 [Aureococcus anophagefferens]; EGB09441.1 hypothetical protein AURANDRAFT_58891 [Aureococcus anophagefferens])

HSP 1 Score: 386.7 bits (992), Expect = 8.400e-104
Identity = 206/389 (52.96%), Positives = 259/389 (66.58%), Query Frame = 0
Query:    2 IDFTDSPQFIPHTQKALNYTPHDVKWVPSSARLVSMGATAGAKGILEIYALSGGELKKTAEAEHKHSFKCSTFGASTLEERNLATGDWSGGLSVWDLDHLSSGPSLSLSTAHAGIINTIDGVGGLEGRGHGAPELVTGSKDGCVRVWDLRVSNPVLSLQPSPGDAKRDCWAVAFGDSHNDAERTLAAGYDNGDVKLFDLRTSTMRHEENVGNGVTSLEFDRRDTPMNKLVITTLESTFHVWDLRSSLGEGEGGKEGGKEGGKEGGVGTERGFVSIKQKMHQNATVWLARHLPQNRDIFATTGAEGGLALWQYHYPVKRSSSSSSPSSPFSFPSSSSSSSPSSPGTLSLLNSRVLSSQPLVALDWHRNKAGLCAIASLDQCLRVHIITKL 391
            ++ T++PQ I H  ++LN+TP++ KWVP SAR V  G +  AKG+L+IY L GG+++  AE  H    KC TFGAS+L ER+LATGD+ GGL V+DL+ L + P+ S+  AH  IIN IDGVGGL G G GAPELVTGS+DGCVRVWD RV  PV+SL+P  G   RDCW  AFG+S  D ER +AAGYDNGDVKLFDLRT+ MR E N  NGVT +EFDR+D  MNKLV+TTLES F V+DLR+                       + GF    +K H+ ATVWLARHLPQNRD+F T G  GG  L+ YHYP KR++            +   ++    PGT+ LLNSRV+S+QP+V+ DW  +K GL  ++ LDQ LRV+I TKL
Sbjct:    1 METTEAPQVIEHQHQSLNFTPYETKWVPCSARFVCCGISPKAKGVLQIYELKGGKMEVVAEKTHDCGLKCGTFGASSLGERSLATGDYKGGLHVFDLERLDA-PTFSVPGAHKAIINGIDGVGGL-GIGGGAPELVTGSRDGCVRVWDPRVREPVVSLEPVEGQPVRDCWCAAFGNSVGD-ERVVAAGYDNGDVKLFDLRTNAMRWETNASNGVTCVEFDRKDIEMNKLVVTTLESRFRVFDLRTQ--------------------HADDGFAHCTEKAHK-ATVWLARHLPQNRDVFMTGGGNGGFNLYAYHYPKKRTA------------THKDNAPVGVPGTVELLNSRVISTQPIVSFDWSPDKEGLAVLSCLDQTLRVYICTKL 353          
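
The percentages reported for this HSP follow directly from the match counts over the alignment length. As a minimal sketch (the helper names `hsp_fractions` and `percent` are hypothetical, and the input string is a hand-copied summary of the two fractions rather than the output of any BLAST parser), the values can be recomputed like this:

```python
import re

# Fractions copied from the HSP summary: matches / alignment length.
HSP_SUMMARY = "Identity = 206/389, Positives = 259/389"

def hsp_fractions(line):
    """Extract 'Label = num/den' pairs into {label: (num, den)}."""
    return {
        label: (int(num), int(den))
        for label, num, den in re.findall(r"(\w+) = (\d+)/(\d+)", line)
    }

def percent(num, den):
    """Percentage rounded to two decimals, as shown on the page."""
    return round(100.0 * num / den, 2)

for label, (num, den) in hsp_fractions(HSP_SUMMARY).items():
    print(label, percent(num, den))
```

Both recomputed values agree with the reported 52.96% identity and 66.58% positives.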
The following BLAST results are available for this feature:
BLAST of NO20G02540.1 vs.
Analysis Date: 2017-10-25 (Homology annotation for N. oceanica IMET1)
Total hits: 1
Match Name      E-value     Identity  Description
XP_009035509.1  8.400e-104  52.96     hypothetical protein AURANDRAFT_58891 [Aureococcus...
Relationships

This mRNA is a part of the following gene feature(s):

Feature Name  Unique Name  Species                                       Type
NO20G02540    NO20G02540   Nannochloropsis oceanica (N. oceanica IMET1)  gene


The following polypeptide feature(s) derives from this mRNA:

Feature Name  Unique Name           Species                                       Type
NO20G02540.1  NO20G02540.1-protein  Nannochloropsis oceanica (N. oceanica IMET1)  polypeptide


The following CDS feature(s) are a part of this mRNA:

Feature Name        Unique Name         Species                                       Type
NO20G02540.1.cds1   NO20G02540.1.cds1   Nannochloropsis oceanica (N. oceanica IMET1)  CDS
NO20G02540.1.cds2   NO20G02540.1.cds2   Nannochloropsis oceanica (N. oceanica IMET1)  CDS
NO20G02540.1.cds3   NO20G02540.1.cds3   Nannochloropsis oceanica (N. oceanica IMET1)  CDS
NO20G02540.1.cds4   NO20G02540.1.cds4   Nannochloropsis oceanica (N. oceanica IMET1)  CDS
NO20G02540.1.cds5   NO20G02540.1.cds5   Nannochloropsis oceanica (N. oceanica IMET1)  CDS
NO20G02540.1.cds6   NO20G02540.1.cds6   Nannochloropsis oceanica (N. oceanica IMET1)  CDS
NO20G02540.1.cds7   NO20G02540.1.cds7   Nannochloropsis oceanica (N. oceanica IMET1)  CDS
NO20G02540.1.cds8   NO20G02540.1.cds8   Nannochloropsis oceanica (N. oceanica IMET1)  CDS
NO20G02540.1.cds9   NO20G02540.1.cds9   Nannochloropsis oceanica (N. oceanica IMET1)  CDS
NO20G02540.1.cds10  NO20G02540.1.cds10  Nannochloropsis oceanica (N. oceanica IMET1)  CDS


Sequences
The following sequences are available for this feature:

mRNA sequence

>NO20G02540.1 ID=NO20G02540.1|Name=NO20G02540.1|organism=Nannochloropsis oceanica|type=mRNA|length=1814bp
AAAATTCACCAACATAATGTAGGCAGTGACAAAAAGGCATGAAATAGGAG
ACGTGGGGTCTGGTAAGGTGCAGGCGCGAGCGGCAGCGGCTCCAAAGAGG
ATCGGCCGTCGTTATTCCATCTCGTTTCTTCATCGCTGACATGGCATCAT
CGCCATAGCCAGGCCGCCCCCCGTCGTGATTTGCTGCTTTTTGTGCTGCT
GTGTTGTATGTGCCTATCTCTCTACAGCAGGTCCCTGATGTAAAAGCAGA
TATAATGGTCTCTTTTTGAGACCAGAATCACGTATCCACCCCGCGCGCCG
CGCGCTGACTTGTGTTAAGGCAGCCTCACTCTGCCCTTCCCCTCTTACCC
TCCCATCTCATTTCCTCCCATCAGACCATGATAGATTTTACAGACAGCCC
ACAATTTATCCCGCATACGCAAAAAGCGCTAAATTACACCCCACATGATG
TAAAATGGGTCCCGTCCTCCGCCCGGCTGGTCTCGATGGGGGCCACGGCT
GGCGCCAAAGGCATTTTGGAGATTTATGCGCTCTCGGGCGGCGAGCTCAA
GAAGACGGCCGAGGCTGAACACAAGCACAGTTTCAAATGCTCGACATTTG
GGGCCAGCACGCTGGAAGAAAGGAATCTCGCCACAGGGGACTGGAGCGGG
GGGCTGTCGGTCTGGGACCTGGACCACCTCTCCTCCGGCCCCTCCCTCTC
CCTCTCCACCGCCCATGCAGGCATCATCAACACCATTGACGGCGTGGGAG
GGCTGGAGGGCAGAGGACACGGCGCACCTGAGCTCGTCACCGGCTCAAAG
GACGGCTGCGTCCGAGTTTGGGACTTGAGGGTGTCGAACCCCGTCCTCTC
CCTCCAACCCTCCCCGGGCGATGCCAAAAGAGACTGCTGGGCGGTGGCCT
TTGGCGATTCCCACAATGATGCCGAGAGGACGCTGGCCGCCGGCTACGAC
AACGGAGACGTCAAGCTCTTTGACCTCCGCACCTCCACCATGCGTCACGA
AGAGAATGTGGGCAACGGCGTTACCTCCCTTGAATTCGATCGAAGGGACA
CGCCAATGAACAAGCTTGTCATCACCACCCTCGAGTCTACGTTCCACGTG
TGGGACCTCCGCTCTTCCCTGGGCGAGGGGGAGGGAGGGAAGGAGGGAGG
GAAGGAGGGAGGGAAGGAGGGAGGGGTGGGGACGGAAAGGGGATTCGTGT
CGATCAAGCAGAAGATGCATCAAAATGCCACCGTGTGGCTCGCGCGACAC
TTGCCTCAGAACAGGGATATTTTCGCCACCACGGGAGCAGAGGGCGGGTT
GGCGCTTTGGCAGTACCACTACCCTGTCAAACGCTCCTCCTCCTCTTCTT
CTCCCTCATCCCCTTTCTCCTTCCCTTCCTCCTCCTCCTCCTCCTCTCCT
TCCTCCCCCGGCACTCTCTCCCTTCTCAACTCTCGCGTTCTCTCCTCGCA
GCCCTTGGTCGCTCTCGACTGGCACCGGAATAAGGCCGGTCTCTGCGCTA
TCGCCTCCCTAGACCAATGCCTGCGGGTCCATATCATCACAAAGCTGTAA
CCCAATAGAGGGGGAGGGAGGAAGAGGAGGGGGCGGGGAGGGAAGGAGGG
ACAAAGGCCAAACGATGTCAATAGCAACGCAGTGAGAGGGAGGAAAGGAG
GGAGGAAGGAGGGAGGCAAAGCGTGTCGAAAGTGTTTTGTCCAGTGTTTC
ATAATGAAGAATACATCAATAAAGAGCTCAACCATTCAAGCAGGGGATAC
AACGCTACAGCCTTTGCAAGGGCTGAGACACCTAACTGTTATTTTAAAAT
AAGATATAGGGGGA
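
The FASTA header of this record packs its metadata into `key=value` pairs separated by `|`. The following is a minimal sketch of how such a header could be split and how the declared length could be compared with the actual sequence; the function names are hypothetical, and the short record used in the check at the end is a made-up toy example, not data from this page.

```python
def parse_header(header):
    """Split '>id key=value|key=value|...' into (record_id, fields)."""
    record_id, _, description = header.lstrip(">").partition(" ")
    fields = {}
    for item in description.split("|"):
        key, sep, value = item.partition("=")
        if sep:  # skip anything that is not a key=value pair
            fields[key] = value
    return record_id, fields

def declared_length(fields):
    """Return the integer from a 'length=1814bp' style field."""
    value = fields["length"]
    return int(value[:-2] if value.endswith("bp") else value)

def check_record(fasta_text):
    """Return (record_id, actual_length, declared_length) for one record."""
    lines = fasta_text.strip().splitlines()
    record_id, fields = parse_header(lines[0])
    sequence = "".join(line.strip() for line in lines[1:])
    return record_id, len(sequence), declared_length(fields)

header = (">NO20G02540.1 ID=NO20G02540.1|Name=NO20G02540.1|"
          "organism=Nannochloropsis oceanica|type=mRNA|length=1814bp")
record_id, fields = parse_header(header)
print(record_id, fields["type"], declared_length(fields))

# Toy record (hypothetical) to show the length comparison.
print(check_record(">toy ID=toy|type=mRNA|length=8bp\nACGT\nACGT\n"))
```

Applied to the full record above, `check_record` should report matching lengths if the 1814 bp declared in the header is accurate.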

protein sequence of NO20G02540.1

>NO20G02540.1-protein ID=NO20G02540.1-protein|Name=NO20G02540.1|organism=Nannochloropsis oceanica|type=polypeptide|length=391bp
MIDFTDSPQFIPHTQKALNYTPHDVKWVPSSARLVSMGATAGAKGILEIY
ALSGGELKKTAEAEHKHSFKCSTFGASTLEERNLATGDWSGGLSVWDLDH
LSSGPSLSLSTAHAGIINTIDGVGGLEGRGHGAPELVTGSKDGCVRVWDL
RVSNPVLSLQPSPGDAKRDCWAVAFGDSHNDAERTLAAGYDNGDVKLFDL
RTSTMRHEENVGNGVTSLEFDRRDTPMNKLVITTLESTFHVWDLRSSLGE
GEGGKEGGKEGGKEGGVGTERGFVSIKQKMHQNATVWLARHLPQNRDIFA
TTGAEGGLALWQYHYPVKRSSSSSSPSSPFSFPSSSSSSSPSSPGTLSLL
NSRVLSSQPLVALDWHRNKAGLCAIASLDQCLRVHIITKL*

mRNA from alignment at chr20:800589..803929+

>NO20G02540.1 ID=NO20G02540.1|Name=NO20G02540.1|organism=Nannochloropsis oceanica|type=mRNA|length=3341bp|location=Sequence derived from alignment at chr20:800589..803929+ (Nannochloropsis oceanica)
AAAATTCACCAACATAATGTAGGCAGTGACAAAAAGGCATGAAATAGGAG ACGTGGGGTCTGGTAAGGTGCAGGCGCGAGCGGCAGCGGCTCCAAAGAGG ATCGGCCGTCGTTATTCCATCTCGTTTCTTCATCGCTGACATGGCATCAT CGCCATAGCCAGGCCGCCCCCCGTCGTGATTTGCTGCTTTTTGTGCTGCT GTGTTGTATGTGCCTATCTCTCTACAGCAGGTCCCTGATGTAAAAGCAGA TATAATGGTCTCTTTTTGAGACCAGAATCACGTATCCACCCCGCGCGCCG CGCGCTGACTTGTGTTAAGGCAGCCTCACTCTGCCCTTCCCCTCTTACCC TCCCATCTCATTTCCTCCCATCAGACCATGATAGATTTTACAGACAGCCC ACAATTTATCCCGCATACGCAAAAAGCGCTAAATTACACCCCACATGATG TAAAATGGGTCCCGTCCTCCGCCCGGCTGGTCTCGATGGGGGCCACGGCT GGCGCCAAAGGCATTTTGGAGATTTATGCGCTCTCGGGCGGCGAGCTCAA GAAGACGGCCGAGGCTGGTAAGGAAAGACAGAACAGAGGGTGTTTGGAAG AAAGAGGATATTTCCTCATGATATTTAAGGGAGCGAATTGGATCCTTTCG CTCTGTGTCCTTTCCGCCTTTTTCTTTTTTTTAGCCCCAGACTGCAAACC CTCCCACTATTCTTACCTTCCCCTTTACTGCCTCCCCCTCCCTTCTCCTC CTCTTTCGTACTCTCCCGTAGAACACAAGCACAGTTTCAAATGCTCGACA TTTGGGGCCAGCACGCTGGAAGAAAGGAATCTCGCCACAGGGGACTGGAG CGGGGGGCTGTCGGTCTGGTATGTCCTATGTGTGTGTGTGTGTGTGTGTA TGTGTGTGTGTATGTGTGTGTGTGTATGTACCACACCCACCCTCTCTCCC TCCCTCCCTCCCCTCTTCCTTTCTTTCCTTCCATTCTATGATATCGTGAC ATTCTACTAACAGCATGTGCTGCCATGATTCATTTTTCCGCGGATCTTTG TCCAACACTGCCAATACATCCACATCTCCACTCCTTCCCTTCCCCCCCTC CCTCCCTGCCCTCCCTCCTTCCCTCATTCCCTCCCTTTGCAGGGACCTGG ACCACCTCTCCTCCGGCCCCTCCCTCTCCCTCTCCACCGCCCATGCAGGC ATCATCAACACCATTGACGGCGTGGGAGGGCTGGAGGGCAGAGGACACGG CGCACCTGAGCTCGTCACCGGCTCAAAGGACGGTATGCTCCCGCTCGCTC CCTCCCTCCCTCTCTCCCTCCCTCCTATTCTCCCCTTACTAATTTCAGGG GAGATCCAAGTTGCCATTCTGATTCCCACATCCACCCTCCTCCCTTCCTC CTTCCCTCAGGCTGCGTCCGAGTTTGGGACTTGAGGGTGTCGAACCCCGT CCTCTCCCTCCAACCCTCCCCGGGCGATGCCAAAAGAGACTGCTGGGCGG TGGCCTTTGGTGTGTGTGTCACCCTCCCCCCTTCCCTCTTTCCTTCCCCT CTCCTCTTGTCCTCGTTTTCCTTTCATCTTTCTTGGAAGCTCAACCCTTT CGTGTGCCTCAGTCCCTCCCTCCCTCCCTCCCTCCCTCCCTCCCTCCCTC TTTCCCTCCCTCCACCCGTAGGCGATTCCCACAATGATGCCGAGAGGACG CTGGCCGCCGGCTACGACAACGGAGACGTCAAGCTCTTTGGTACGTCCTC CCTCCCTCCTCCCCTCTCTCCCTCCCTCCCTCCTTCCTTCCTTCCCTCCT TCCCTCCCTCCTTCCATCCATCGATCCTTCCTTTTGGCCCTTCTTCCCCT TCCGCCACCCGCGATTGGATGCATCCACACCCACTGGTCTCCCTCCCTCC CTCCCTCTCTCCCTCCCTCCCTCCCTCCCTCCCCCCCTCCCTCCCCCCAG 
ACCTCCGCACCTCCACCATGCGTCACGAAGAGAATGTGGGCAACGGCGTT ACCTCCCTTGAATTCGATCGAAGGGTAAGACCCCTCCCTCCCTGCCTCCC TCCCTCCCTCCCTCCATCCTTCCCTCCCTCCAGTCCATCGCTCACACACA CCCTCCCTCTCCCTATCCACCCACAGGACACGCCAATGAACAAGCTTGTC ATCACCACCCTCGAGTCTACGTTCCACGTGTGGGACCTCCGCTCTTCCCT GGGCGAGGGGGAGGGAGGGAAGGAGGGAGGGAAGGAGGGAGGGAAGGAGG GAGGGGTGGGGACGGAAAGGGGATTCGTGTCGATCAAGCAGAAGATGCAT CAAAATGTAAGGTGTTGTGCTGGGGAGACCGTCACTCCTCTTTTATCCTC CTTCCTTCCTGCCCTGCCTTTCCTTCCTTCTTGCCTCCTCGGGCTCCTCC GTCCCCTTACCCCTCCCTCCCTCCCTCTCTCCCTCCCTCCCTCCATAGGC CACCGTGTGGCTCGCGCGACACTTGCCTCAGAACAGGGATATTTTCGCCA CCACGGGAGCAGAGGGCGGGTTGGCGCTTTGGCAGTAGTAAGGAGGGAGG GATGGAGGGAGGGAGGGAAGGAGGGGCAGACGAATGATATAATTGCTTGT TTCGCTCAGTCTTTCTCGCTCACTCGTTAATGTTTCCCTTCTTTCCTCCC TCCCTCCTACCTTTCCTTCCCAGCCACTACCCTGTCAAACGCTCCTCCTC CTCTTCTTCTCCCTCATCCCCTTTCTCCTTCCCTTCCTCCTCCTCCTCCT CCTCTCCTTCCTCCCCCGGCACTCTCTCCCTTCTCAACTCTCGCGTTCTC TCCTCGCAGCCCTTGGTCGCTCTCGACTGGCACCGGAATAAGGCCGGTCT CTGCGCTATCGCCTCCCTAGGTCGATCCATCCTCCCTCCCTCCCTCCCTC CCTCCCTCCTTCCCTCCCTCCTTCCCTCCCTCTCTCCTTCTCCCCCTCTC GCACTCTCTCCCTCCTTCCCTCTCTCCCTAATCACCGTCTCACTCGGCCT TTGTTTTCTCCATTCCTCCCTCCCTCCCTCCTTGTCCAGACCAATGCCTG CGGGTCCATATCATCACAAAGCTGTAACCCAATAGAGGGGGAGGGAGGAA GAGGAGGGGGCGGGGAGGGAAGGAGGGACAAAGGCCAAACGATGTCAATA GCAACGCAGTGAGAGGGAGGAAAGGAGGGAGGAAGGAGGGAGGCAAAGCG TGTCGAAAGTGTTTTGTCCAGTGTTTCATAATGAAGAATACATCAATAAA GAGCTCAACCATTCAAGCAGGGGATACAACGCTACAGCCTTTGCAAGGGC TGAGACACCTAACTGTTATTTTAAAATAAGATATAGGGGGA

Coding sequence (CDS) from alignment at chr20:800589..803929+

>NO20G02540.1 ID=NO20G02540.1|Name=NO20G02540.1|organism=Nannochloropsis oceanica|type=CDS|length=1173bp|location=Sequence derived from alignment at chr20:800589..803929+ (Nannochloropsis oceanica)
ATGATAGATTTTACAGACAGCCCACAATTTATCCCGCATACGCAAAAAGC
GCTAAATTACACCCCACATGATGTAAAATGGGTCCCGTCCTCCGCCCGGC
TGGTCTCGATGGGGGCCACGGCTGGCGCCAAAGGCATTTTGGAGATTTAT
GCGCTCTCGGGCGGCGAGCTCAAGAAGACGGCCGAGGCTGAACACAAGCA
CAGTTTCAAATGCTCGACATTTGGGGCCAGCACGCTGGAAGAAAGGAATC
TCGCCACAGGGGACTGGAGCGGGGGGCTGTCGGTCTGGGACCTGGACCAC
CTCTCCTCCGGCCCCTCCCTCTCCCTCTCCACCGCCCATGCAGGCATCAT
CAACACCATTGACGGCGTGGGAGGGCTGGAGGGCAGAGGACACGGCGCAC
CTGAGCTCGTCACCGGCTCAAAGGACGGCTGCGTCCGAGTTTGGGACTTG
AGGGTGTCGAACCCCGTCCTCTCCCTCCAACCCTCCCCGGGCGATGCCAA
AAGAGACTGCTGGGCGGTGGCCTTTGGCGATTCCCACAATGATGCCGAGA
GGACGCTGGCCGCCGGCTACGACAACGGAGACGTCAAGCTCTTTGACCTC
CGCACCTCCACCATGCGTCACGAAGAGAATGTGGGCAACGGCGTTACCTC
CCTTGAATTCGATCGAAGGGACACGCCAATGAACAAGCTTGTCATCACCA
CCCTCGAGTCTACGTTCCACGTGTGGGACCTCCGCTCTTCCCTGGGCGAG
GGGGAGGGAGGGAAGGAGGGAGGGAAGGAGGGAGGGAAGGAGGGAGGGGT
GGGGACGGAAAGGGGATTCGTGTCGATCAAGCAGAAGATGCATCAAAATG
CCACCGTGTGGCTCGCGCGACACTTGCCTCAGAACAGGGATATTTTCGCC
ACCACGGGAGCAGAGGGCGGGTTGGCGCTTTGGCAGTACCACTACCCTGT
CAAACGCTCCTCCTCCTCTTCTTCTCCCTCATCCCCTTTCTCCTTCCCTT
CCTCCTCCTCCTCCTCCTCTCCTTCCTCCCCCGGCACTCTCTCCCTTCTC
AACTCTCGCGTTCTCTCCTCGCAGCCCTTGGTCGCTCTCGACTGGCACCG
GAATAAGGCCGGTCTCTGCGCTATCGCCTCCCTAGACCAATGCCTGCGGG
TCCATATCATCACAAAGCTGTAA
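
The relationship between this coding sequence and the protein sequence listed earlier can be illustrated with a small translation routine. This is a sketch only, assuming the standard nuclear genetic code applies to this transcript; `translate` and `CODON_TABLE` are names introduced here for illustration. The 1173 bp CDS corresponds to 391 codons, that is, 390 residues plus the terminal stop shown as `*`.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumption:
# the standard nuclear code is appropriate for this transcript).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame coding sequence; unknown codons become 'X'."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )

# First 48 and last 21 in-frame nucleotides of the CDS above.
print(translate("ATGATAGATTTTACAGACAGCCCACAATTTATCCCGCATACGCAAAAA"))
print(translate("CATATCATCACAAAGCTGTAA"))
```

The two fragments translate to `MIDFTDSPQFIPHTQK` and `HIITKL*`, matching the start and end of the protein sequence given for NO20G02540.1.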