Monarch geneset OGS2.0

DPOGS207723
TranscriptDPOGS207723-TA1818 bp
ProteinDPOGS207723-PA605 aa
Genomic positionDPSCF300042 - 1126589-1132660
RNAseq coverage333x (Rank: top 35%)
Annotation
HeliconiusHMEL0153100.076.46% 
BombyxBGIBMGA005286-TA0.068.18% 
DrosophilaCG3808-PA8e-13744.05% 
EBI UniRef50UniRef50_Q17NM72e-17051.27%RNA m5u methyltransferase n=6 Tax=Culicidae RepID=Q17NM7_AEDAE
NCBI RefSeqXP_001649776.14e-17151.27%RNA m5u methyltransferase [Aedes aegypti]
NCBI nr blastpgi|1571074348e-17051.27%RNA m5u methyltransferase [Aedes aegypti]
NCBI nr blastxgi|1571074347e-16650.00%RNA m5u methyltransferase [Aedes aegypti]
Group
Gene OntologyGO:00063965.3e-19RNA processing
GO:00081735.3e-19RNA methyltransferase activity
GO:00001662e-05nucleotide binding
KEGG pathwaylac:LBA05393e-46 
 K00599 (E2.1.1.-)maps-> Naphthalene and anthracene degradation
    Tyrosine metabolism
    Histidine metabolism
    Selenoamino acid metabolism
InterPro domain[378-565] IPR0102805.3e-19(Uracil-5)-methyltransferase
Orthology groupMCL13532 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207723-TA
ATGTCAGAAACTGAACTGTTGAAAGACAATAATGAGGTTAACTTAGAAACTGATAATATAAAAGAGGAATATGCGTATCTGGACCGTGGTGGGTTTTCGTCTGAAAAGTTCAAAGTCGAAATTAGAGGCCTCCCCAAATTCTATGGAATAGTGGAACTGAAGAAGCTAATGAATCAGAAACTGGGTCTCAATGCCAGTAAAATCAAAAGACCCAAGAATGGCAGTCATTGGTTGTTTGCATGTTTCCAAAACGATGAGGACAGATCCAAGGCAATAAGCGCTCTTAATGGATATTCCTGGAAAGGTAAGACATTGATAGCCGAAGAAGCAAAACCAGCACCAGATCCATTGGTGAAGAAGAGAAAAAGAGATGAAGATTCAGTAAACATTAAAAAAAAGAAAGAAGACGAGAACAAGAGTCAAGAAGAAAGATTAAAAGACGCTGTTACGCCATATTGGAATATCCCTTACGAAGAACAGCTGGCACTAAAAGAGAAAGAAGTGAAGAACCTATTAATGAAATTTGACAATGAAGTATGGAAAATTGATAAGAACAAACGTAATTCAATAGAATCGAAACGTAAACTGTATAATGGACTAAGTTTCGAACTGAAGCCCATACAAAGGTCGCCGATCACGGCCGGGTATAGGAATAAATGTGAATTCACTGTGGGGATCGATGACGACACGGGGAAACCAACGGTCGGCTTCAGACTGGGGAGCTACGTTACCGGGACAGTCGGCGTGGCGCCAGTCGGCGGCTTAACTCATATATCAGACAAAATGAAACAAGCGGTTTTGCTGTTCCAGAATTTTGTCAGGAAGTCCAATATGTCACCGTTCTCGCCCGCGGACTACTCGGGTTACTGGCGTTACCTCACTGTCAGAGAATCCACTCATAATGGAGACATTATGCTGATTGTGGCCATGTATCCACAGGATTTGACAAAAGATAAATTGGAAGAACTGAAACGAGAATTGATTGAACATTTCAGCTCCGAAGAGGCTCAGGCGTGTGGAGTGAAGTCTGTGTACTTTGAAGAAATCACTAAAAAGAGGTCAGGTGAGGACGGCTCGAAGCCGATACATCTAATGGGCGCAACTCATATAGTGGACACAATACTCGGTCTACAGTTCAGAATATCTCCAGAGGCGTTCTTTCAGATCAACACGGCAGGTGCTAACATATTATATCAGAGCGCTATAGACCTCAGCAACGTCAATGAGAAGTCCACGTTGATCGACATCTGCTGCGGGACCGGTACCATCGGCCTCTGTTTTGCTAAGCATGTAGGGAAAGTCCTTGGTGTAGAGCTCGTGTCGGAAGCCGTTAAGGATGCGAGGTACAATGCTGAGCTGAACTCTATTGATAACTGTTCGTTCTTTGCTGGTCGTGCTGAAGATGTGCTGCCCTCTGTACTGGCGAGGGCGACCTCTGATGACGTCATCGCGGTCGTCGACCCGCCCAGGGCTGGGCTACACATGCGGGCCGTGACTCAACTCCGCAATACTAAGAAGGTGTCCCGCTTGATATACATATCGTGTTCCCCGGCGTCGGCTATAAAGAACTTCGTGGATCTTTCCCGACCGTCATCCAAGACGCTGCGAGGGGCGCCCTTCGTCCCCGTGCAGGCCGTGCCCGTCGACATGTTCCCATACACTAAACACGTTGAATTAGCTGTTTTATTTGAAAGAGAGGTTCGACCCAGTGATGGTAATGAGGGCGACGTTGACGAGAAAGAAGTCAAAACAGAAGAAAACGCTGACATTAAGAAAGAAGAAAATATTGTTAGTTTGGAAGAAAATACACAGACGTGA

Protein sequence:

>DPOGS207723-PA
MSETELLKDNNEVNLETDNIKEEYAYLDRGGFSSEKFKVEIRGLPKFYGIVELKKLMNQKLGLNASKIKRPKNGSHWLFACFQNDEDRSKAISALNGYSWKGKTLIAEEAKPAPDPLVKKRKRDEDSVNIKKKKEDENKSQEERLKDAVTPYWNIPYEEQLALKEKEVKNLLMKFDNEVWKIDKNKRNSIESKRKLYNGLSFELKPIQRSPITAGYRNKCEFTVGIDDDTGKPTVGFRLGSYVTGTVGVAPVGGLTHISDKMKQAVLLFQNFVRKSNMSPFSPADYSGYWRYLTVRESTHNGDIMLIVAMYPQDLTKDKLEELKRELIEHFSSEEAQACGVKSVYFEEITKKRSGEDGSKPIHLMGATHIVDTILGLQFRISPEAFFQINTAGANILYQSAIDLSNVNEKSTLIDICCGTGTIGLCFAKHVGKVLGVELVSEAVKDARYNAELNSIDNCSFFAGRAEDVLPSVLARATSDDVIAVVDPPRAGLHMRAVTQLRNTKKVSRLIYISCSPASAIKNFVDLSRPSSKTLRGAPFVPVQAVPVDMFPYTKHVELAVLFEREVRPSDGNEGDVDEKEVKTEENADIKKEENIVSLEENTQT-