Monarch geneset OGS2.0

DPOGS203491
TranscriptDPOGS203491-TA2472 bp
ProteinDPOGS203491-PA823 aa
Genomic positionDPSCF300055 - 814195-823173
RNAseq coverage11x (Rank: top 83%)
Annotation
HeliconiusHMEL0142953e-5540.39% 
BombyxBGIBMGA008297-TA3e-8944.68% 
Drosophila% 
EBI UniRef50UniRef50_E2A5X54e-1326.57%Glutamine-rich protein 2 n=1 Tax=Camponotus floridanus RepID=E2A5X5_CAMFO
NCBI RefSeqXP_002428746.15e-1232.43%hypothetical protein Phum_PHUM399540 [Pediculus humanus corporis]
NCBI nr blastpgi|3838607301e-1226.05%PREDICTED: uncharacterized protein LOC100883897 [Megachile rotundata]
NCBI nr blastxgi|2700113502e-1719.83%hypothetical protein TcasGA2_TC005359 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL25461 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203491-TA
ATGGCTAAGATTGCGGACGAACCGAATATGAGTGACTCCACTTTATTGGTGACCGTAGAAGATCTTATAAACCAAGCCATGGGACCACCTGGTGGGAATGTAGTGGACTTCAAATTGATTCAAGTGGTTCTCCAAATTCTCGCACGTCAACAGCGAATGCTGCAACAAAAGGTAGAAATCCAGGTTTCAGAATTTATTGAAGTAACGCCTATTAAGAAGTCTAAGGGAAAGTCTTCGGAAGAATCAACTGAAAGTTCATCCTCTAGAAGTCCTCGTCCTCTTGCGAAGCCTCACATGGCGAAGATAAAGGAAGAACAAAAAATGGACGATGATAAGCGAACTGAACAAGAAGAACAAAGAGAAGACGATCAGGAAAAAGTTGATAAGAAAAGTAAACAGGAACAATCTGCTAAAGAAAAAGCTAAATCAAAGAAAGACGGAGACACAACTGAAAAAGAGTTGGGCAAAACTAAGTCACAAAGTGAGTTTGAAAAAGTTCAAAAGGAGGAAGAAGACAAAAGCAAACCACAGAAAGAATCTGGAAAAGGCCAAAAGGGAACAGCAAAAGGCAAATCTCAGAAGGATATGGGAAAAACTCAAAAGAAACAAGGAAAGAAACCTGATGACTCAGCTGCTACCAGTTTAACTGTCACGGATTCCCATGGCCGGACAAACATCGATGTGGTAACTCAGTCACAATTCGCAATTCTGGAAGCGGCGATAAAAGACCTCATGGACGTTGCTGCTCCACAACCCCTCTCGATGCCAAAAAATGAGAAGTTGAGGAAAGATCTCGCCAAAGGCACTGCGACTTTGCCTGATGCTATGGAGGCTATGCAGGTAGTAGCTCGTATGAAAGCAGCAGAAGCCGCTATCCAGCGCATGTCAGGCCTGATTACACATCTGGCAGGTGCAAGTGACCTCGCCGATGTTGGCGATGTCTCAGATGTGACAGATGAGAGGGAAGAGAAACTCCCTGAAGAGACCATCAAATCTCGAGTTTCTGTCGCTCCGAGGAAGTCTGTAATGATCGATCCGAAGGTTTCTCAAGTGTCACATATAAGTACGAAGCCGTCAGTAGCCTCCTATGTAGACACCGGCCCTTCGTCAGCCTCGTCAGTGGCCGCTCCCAGACCCTCACAAGTTTCGGTCAAACCGTCGGTGTTGTCCAAAGCTTCTTCCGTGACTATGGGCCCGAGTGTCACCCAAGAAGAAATGGAGTCAGCCCTGAGAGGTTTGCATGACGAAATATCCAAGTCTCTTAACGCGGCCGTGAGTCGTGCCGCGACTGCTGCGGAGACGGCCCTCCACACTGCTGTCAATGTCGCAAATAAACTAGACGTAGCTCTAAAACTGGATGGTCGTATATCAGCGCTGTACGCCATCGTCGGTGACTACTCAGATCAGTTGAGTGGATTCGACGCCGGGCTCACGACACAAATGCAAGGTTTCAAAGATCAAATCGCCCAAATGCGTTCGGATCTCAAGAAAGGACTTCAACAGTTGGACAATGTTAACAATAACGCCGAAACAGCTGCTGTGATGGAGCTGACGGAGCGCTACACTGAGCTGGTCGTAGACCTGGACACCACTATGACAGCGCACACGGCGCTGCAGCAGCTACAGTCAAAGCTGGCTGGGGAGATGCATGAGGAGACAAATAAACATCAGCAAAATTTTCAGGTTTGGGTAGTGTTTGTCGTTTGCCGACCACCGCTGAGCTTGGTGGAGTGTGTGGAAATGCTGCGCGAACAGAAATGCGACAGAGATGAAGTCTTGGATGGACTCCGGGATAAGGCCGACATATCACGTCTGGCGGGTCTGCTGTCAGAGGTACAGTTCGCGACGGCGCGGACTGACTTCGAGCGGCGGCTAGACCTCTGTCACGACAAATTCAACAGACAGGATGCAATGTGGACGTCGGCAGTCATGGACCTGTCCCGTCTGACGGATCAGAAGGCGGAACTGATCGAGTTGCTATCGTTACGAGACACCACACAGAAACAACTGCAAGAGTTACAAGACAGGCTGCACACGATGGCCGTCGTACTGGGAGAGCCAAAGGCGGCGCTACTTACTCGCCAACTAGCTCGTGGTGCAGTGTGCGGCGCCTGCGGAGCCTCCGCGCTCATGGAGCCGAGGGACTCCCACGCGGGTGCTCCGCCTCGCCTGCCGCCGCTCCGAGCGGAACCCGAGCCGGAGCCCTGCAATCGATGGATCGTCGCTGAGCCTCCGCTTGAGAGACACGTGTGTCACCGGTGGGCGGGAGGGTCCCACACGCTGTTGAGTGCGACCACACACGAGCGAGCACCGAGTCTGGACCTCAGTGAGATCCGCACCATGAAGTATACAGGCCACGGCACGGACGGACGGCTGTACATGTTGGAAGAGGATCTCAAGCCGTGTGTTGAATGCAACATGCTCACCACGGACGTCCCTCCAGAAGGAGCGCAGGCCAGCGACACGCACTGA

Protein sequence:

>DPOGS203491-PA
MAKIADEPNMSDSTLLVTVEDLINQAMGPPGGNVVDFKLIQVVLQILARQQRMLQQKVEIQVSEFIEVTPIKKSKGKSSEESTESSSSRSPRPLAKPHMAKIKEEQKMDDDKRTEQEEQREDDQEKVDKKSKQEQSAKEKAKSKKDGDTTEKELGKTKSQSEFEKVQKEEEDKSKPQKESGKGQKGTAKGKSQKDMGKTQKKQGKKPDDSAATSLTVTDSHGRTNIDVVTQSQFAILEAAIKDLMDVAAPQPLSMPKNEKLRKDLAKGTATLPDAMEAMQVVARMKAAEAAIQRMSGLITHLAGASDLADVGDVSDVTDEREEKLPEETIKSRVSVAPRKSVMIDPKVSQVSHISTKPSVASYVDTGPSSASSVAAPRPSQVSVKPSVLSKASSVTMGPSVTQEEMESALRGLHDEISKSLNAAVSRAATAAETALHTAVNVANKLDVALKLDGRISALYAIVGDYSDQLSGFDAGLTTQMQGFKDQIAQMRSDLKKGLQQLDNVNNNAETAAVMELTERYTELVVDLDTTMTAHTALQQLQSKLAGEMHEETNKHQQNFQVWVVFVVCRPPLSLVECVEMLREQKCDRDEVLDGLRDKADISRLAGLLSEVQFATARTDFERRLDLCHDKFNRQDAMWTSAVMDLSRLTDQKAELIELLSLRDTTQKQLQELQDRLHTMAVVLGEPKAALLTRQLARGAVCGACGASALMEPRDSHAGAPPRLPPLRAEPEPEPCNRWIVAEPPLERHVCHRWAGGSHTLLSATTHERAPSLDLSEIRTMKYTGHGTDGRLYMLEEDLKPCVECNMLTTDVPPEGAQASDTH-