Monarch geneset OGS2.0

DPOGS200862
TranscriptDPOGS200862-TA2520 bp
ProteinDPOGS200862-PA839 aa
Genomic positionDPSCF300071 + 356888-367731
RNAseq coverage240x (Rank: top 43%)
Annotation
HeliconiusHMEL0126410.079.83% 
BombyxBGIBMGA009850-TA0.072.13% 
DrosophilaHPS4-PA4e-12833.37% 
EBI UniRef50UniRef50_UPI00021A685C7e-14235.82%UPI00021A685C related cluster n=3 Tax=unknown RepID=UPI00021A685C
NCBI RefSeqXP_001600954.11e-14034.67%PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3454923261e-14134.96%PREDICTED: hypothetical protein LOC100116464 isoform 2 [Nasonia vitripennis]
NCBI nr blastxgi|3838570046e-14035.84%PREDICTED: uncharacterized protein LOC100876281 [Megachile rotundata]
Group
KEGG pathway 
Orthology groupMCL15665 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200862-TA
ATGGCAAAGGAAATGATGATAGTATTCGTGTATGACACACAATGCTGTCTCTCCGAGGAGGACGATCCCGCTGATGCTGTACTGTACTTTCATCCTGGCTGGGTGTCTGACACACAGAGACTGGCGCTGGCTGGACAGGTCGTTGGTGTGGCTCACTGTACCAAATCACTGTTCTCTCAGCCGCTGGCCATAACGCTGCAGAGCGGCAAGTTCATCATCAGAGAATACGGCAGATACATACTGGCGATAGGCAGCGATAGGAACATCCCCGACTGGGTGTTGAAGAACCGCGCCAGTCTGCTGACGTCCATGATAAGAGTGTACCACGGAGACCTGCAAGCGCTGGCCGACGCTATGGAAGACAGCCGCCGCCTGGCGGAGAAGATGTACCAGATCTTTGAGACCTACTTACCCGTCTTACAATACGGATGTCATATATTCCAGCGAGTGCCCATGCTGAGCCTTCCTAAGAGCGCGACATCGGTTTATATGGAATCCATGCAGATATTGGAACACTGCCGACGGAGCAGGGGCGTGCTGGGAGGCGTCATACTATATAATAATAAAATAATTTCCACGCAACTCCCTCCAAGTCTGACCTCATACTTGACGGTAGTGGACCCGTATAGGATTAAATCCCCAGCTGAAAACCTGAAGACGAAGGTTCCGCTACCGCTCGGCGCTCAGCTGCTAGTGGTTTATGTCGGCAAGAAAACTTATTACAATTTGAAAAAACAAGCCGACAAACTGCAAGAGTTTTACCAGAAGGGAGAGGAAATGATCGCCCGATTCAAGAAGATGCAAGAGGCGGAAAGAGAGAGAGTCCGCGATCATCCTCAGTCGGGCATGAAGCGGGATAAATCGTTACTGTTCACGGCTGTACCGGAGGAAGACCACACCATGATATCGCCGCCGAACAGAGAAGAAGGTTCCGTACCACGGAAACCCAGCATGCCGGATGTCGTTCCGTTCACAAACAAACCACGGCCGAGACCTAACAAACTATCGCTGACCTTCAAAACTCAGAAGTCTTTGGACGAGGACGTTAAAGAGAACGAAGTCAGCGAGAAAGTGTTCACGGGACAGACGAGCGTCGTGTCGACACCCATGGTCGATTACAAGCGGCTCCACGGGAACATGCTGTCCATATGTCAGAACCCGGACGCTGATGAGAAACTGGAACTAGAGCCGGACGTCGTCAAAACAGACAGCAGCATGACAACGGCCGACAGGAGGGACAGAGTAATCCTAGACGGCAACAGCATCGGGGAACATTTCATTAACAAGCCGGAACCGATGAGGAAGTTGGCGAGCGTTACGGACCTGCAGGAGACCTTTAGGAAACTGTCGACCAGAGCTTCCTCTAATATGAAGCTCAAGAGATCCAAAATGGAAGACGAGTCGACACCCTCGCCGGAAACGAAGGGGCAGAATACTATGACCATCAACGATCCACTGTTTCCGGTTTTCAGAAACGACGGCGTTGCCATATCAGAGTCGCTCTTCAACCAGTACCTGGAGCAGTATTATTCTGGCATTAAACATTCCAAGGAAGATAGCGTGTTCAATTTTAATCATAAGTTAGGCGAGTTAGACAAGTTTAAAGATTTCGATTCGGAACTAATGAAGTCTCCGCAAAGAACCCCGAAAAGGGTGGCGAAGGATTCGTCGAATACGGTGCAAACGGACCAGTCGAGGCGGAAAACACTGTCGCTGCCTTTAAAGTCGCTCTCGGAAAATTCAGAAACGCAGATAGACACGGGCGCAGACACATTGAGTTTTAAAAAGAAGCTATCGGGAGTCCAGTTGACGCCGCTCATGGAGAAGCTGAGCCACTTAGCGTTCTCAGACAAGTCCAGCGGATACAGCAGCAGGGTCATGACGCCCTTGGAGCTGAGGGAGTTACTGACACCGGCTCTAGAGAAACAAGTCACATTCACTGAAAGGAAGAGGTCCCGCCTCGAGGACAGTTCGGAGTCGGAGTCGGACGGCGAGTGCGACGTGGACTCCCTGCCTTCCTACAGCTCTCAGTCCGTGAGGTGCGCGCTGTTCGTCAGCGGACTACACAACATGGCGCTGCTGGAACTGCTGGACCTGGACGCCGCCGACGACACACAGACCATCAACACGCTCTGGGAGGCTTCTCTTAACGCTCTGGGTCCGATCGAGCAGAAGTGTATGGAGCCGCCGACCACGGAGCCCGACTCGACCGACTACAGTTACCTGTTGCTGGACCCCGACTGGGGCACCGTCAAGAAGGGCGGCCCATGGGCGGCGTTAGACATCGCTACCATGGGCTATATACATAACGAGTTCGAGGAACAACCCGATTTGACAGAGTTCATATTGAGGAGTGAAGACAGCGTGGTGTTGGGTGCCAGCTGCGGCCGAGCGCAGGTGTTCTACCAGGAGAGCGGAGCGCGCGCCCCCGGGCCTCCGCCCCCGTCCGACCTGCTGGCCGCCGCCCCGCTCAGGGCGCGCCGCCGCCTCCACCGAGACCACTCCACACTCATACTGTAG

Protein sequence:

>DPOGS200862-PA
MAKEMMIVFVYDTQCCLSEEDDPADAVLYFHPGWVSDTQRLALAGQVVGVAHCTKSLFSQPLAITLQSGKFIIREYGRYILAIGSDRNIPDWVLKNRASLLTSMIRVYHGDLQALADAMEDSRRLAEKMYQIFETYLPVLQYGCHIFQRVPMLSLPKSATSVYMESMQILEHCRRSRGVLGGVILYNNKIISTQLPPSLTSYLTVVDPYRIKSPAENLKTKVPLPLGAQLLVVYVGKKTYYNLKKQADKLQEFYQKGEEMIARFKKMQEAERERVRDHPQSGMKRDKSLLFTAVPEEDHTMISPPNREEGSVPRKPSMPDVVPFTNKPRPRPNKLSLTFKTQKSLDEDVKENEVSEKVFTGQTSVVSTPMVDYKRLHGNMLSICQNPDADEKLELEPDVVKTDSSMTTADRRDRVILDGNSIGEHFINKPEPMRKLASVTDLQETFRKLSTRASSNMKLKRSKMEDESTPSPETKGQNTMTINDPLFPVFRNDGVAISESLFNQYLEQYYSGIKHSKEDSVFNFNHKLGELDKFKDFDSELMKSPQRTPKRVAKDSSNTVQTDQSRRKTLSLPLKSLSENSETQIDTGADTLSFKKKLSGVQLTPLMEKLSHLAFSDKSSGYSSRVMTPLELRELLTPALEKQVTFTERKRSRLEDSSESESDGECDVDSLPSYSSQSVRCALFVSGLHNMALLELLDLDAADDTQTINTLWEASLNALGPIEQKCMEPPTTEPDSTDYSYLLLDPDWGTVKKGGPWAALDIATMGYIHNEFEEQPDLTEFILRSEDSVVLGASCGRAQVFYQESGARAPGPPPPSDLLAAAPLRARRRLHRDHSTLIL-