Monarch geneset OGS2.0

DPOGS204696
TranscriptDPOGS204696-TA1542 bp
ProteinDPOGS204696-PA513 aa
Genomic positionDPSCF300170 + 470039-474572
RNAseq coverage132x (Rank: top 56%)
Annotation
HeliconiusHMEL0082471e-5244.74% 
BombyxBGIBMGA007475-TA1e-17862.34% 
DrosophilaCG4749-PA5e-11343.78% 
EBI UniRef50UniRef50_B4KEU51e-11545.69%GI17979 n=3 Tax=Drosophila RepID=B4KEU5_DROMO
NCBI RefSeqXP_002003553.13e-11645.69%GI17979 [Drosophila mojavensis]
NCBI nr blastpgi|1951180505e-11545.69%GI17979 [Drosophila mojavensis]
NCBI nr blastxgi|1951180502e-11245.16%GI17979 [Drosophila mojavensis]
Group
KEGG pathwaydre:5535343e-31 
 K00599 (E2.1.1.-)maps-> Naphthalene and anthracene degradation
    Tyrosine metabolism
    Histidine metabolism
    Selenoamino acid metabolism
InterPro domain[284-453] IPR0016786.2e-16Bacterial Fmu (Sun)/eukaryotic nucleolar NOL1/Nop2p
[303-313] IPR0232672.3e-11RNA (C5-cytosine) methyltransferase
Orthology groupMCL13724 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS204696-TA
ATGTTCTCCTTTTGTAATAGCATCAAAAGTTCACAGCTAAATACATATTTAGTTTTACAAGCTCGTTCAAAATCGAAAACTCATTGGGCGAAACTGAAAAAGAAAACTGGTCCAAAACACAAAGCTATGAACCACTTCGACGAATTTTACGGTTCAGTTTTCGGCGATAAATGGGAACCTATGAGAGAGGCGTTACAGCGACGATCTAAATATGTAGCCGTTGTCAATAATTATGGTGATGCCGAAGAAACTATGCAATACTTAGCTAGTAGAGGTGCACATTGCCTAAAAACACTCATGAATGTCCAACAGGACTTCAACAACCAGTATCTACCACAAGAACCGCTGGAAATAAACAATGAAAATAAGAGTAACTTTCAAAACTATATCAATCAGCTACAGAGCGATGAAATTGCCAAAATTTATCCCCAAGGCGAGATCACACCTGAGAGATTGGAAATCACCGAGGAAAAAATTTTAAACAGTTCTAATGAGGAACTAATTAGCGATAAAGAATTAAACACGAGCTCAAACCTTAATGAAGCTATAGATCAGGCTGAGATAGATGAGTCGAGATTAATCCTACCTTCAATGGGACTCTCATCGGATGCCTTGTACCAATATGTACCGGCCACTAAAATCAAAGGACTCGATGAATGGGTGCCAGAGTCTCTACATTATTCATTCTATAATAATAATAACACAGATTTTCCGTTATTGATAGAGCCGGAGACGGAGTTTGTGTTTCCGGAACATTTGAAAGTTATGACATATGAAAAAGATAGTGAGGCATATAAGTTCCCAGAACCGAAAAGGTGTAAAACAGGTGTATTCAACTATTATCCGTTGGACTGCGGCAGTGTGGTGTCAGTGTTGGCGTTGCTGCTTCGAGCTGGTGACCGAGTGCTGGAACTGTGTGCAGCACCTGGGGGGAAGGCTCTTACTACGCTGCAGACGTTACTACCACATGTTCTGGTAGCCAACGACGCCTCCATATCCAGGTCTAACAGGTTGGTGAGGGTTTTCCGGGACTATCTGCTGGATTACGAGACGAACAGCTCTTGGAGCGAACGCGTACGCGTTGTGAGAACAGACGGTCGGAACTACACAGATGACCAGGGATTCGATAAGGTGTTAGTAGATGTGCCGTGTACAACAGACAGACATTCCGTCATGGAAGATGACAACAATATATTTAGACCAGACAGAGTAAAAGAGAGACTGAGGATACCAGAACTGCAGTCACAATTATTGGTTAACGCCCTCAGGCTGGTGAAACCTGGCGGCGCCGCGGTATACAGCACGTGCTCCTTAAGCCCCGTACAGAACGACGGTGTCATCCACGCGGCCCTAACACAGGCGTTTAGAAACCACGGTATCATTGCTGCCGTCAAAGATCTGTCAGTGCCGTTCAGAGCTTTGAACAGCACTCTATGTCTCGCTGAGGGTTCCGTCAAACCAAAGTACGGTCAACTCATTATACCGGACATATCAGCTAATTTCGGGCCCACTTATGTGTCTAGACTGGTTAGACTTAAATAA

Protein sequence:

>DPOGS204696-PA
MFSFCNSIKSSQLNTYLVLQARSKSKTHWAKLKKKTGPKHKAMNHFDEFYGSVFGDKWEPMREALQRRSKYVAVVNNYGDAEETMQYLASRGAHCLKTLMNVQQDFNNQYLPQEPLEINNENKSNFQNYINQLQSDEIAKIYPQGEITPERLEITEEKILNSSNEELISDKELNTSSNLNEAIDQAEIDESRLILPSMGLSSDALYQYVPATKIKGLDEWVPESLHYSFYNNNNTDFPLLIEPETEFVFPEHLKVMTYEKDSEAYKFPEPKRCKTGVFNYYPLDCGSVVSVLALLLRAGDRVLELCAAPGGKALTTLQTLLPHVLVANDASISRSNRLVRVFRDYLLDYETNSSWSERVRVVRTDGRNYTDDQGFDKVLVDVPCTTDRHSVMEDDNNIFRPDRVKERLRIPELQSQLLVNALRLVKPGGAAVYSTCSLSPVQNDGVIHAALTQAFRNHGIIAAVKDLSVPFRALNSTLCLAEGSVKPKYGQLIIPDISANFGPTYVSRLVRLK-