Monarch geneset OGS2.0

DPOGS212208
TranscriptDPOGS212208-TA4563 bp
ProteinDPOGS212208-PA1520 aa
Genomic positionDPSCF300323 + 96789-112308
RNAseq coverage383x (Rank: top 31%)
Annotation
HeliconiusHMEL0166910.081.54% 
BombyxBGIBMGA000976-TA0.074.35% 
DrosophilaCG17233-PC2e-7941.35% 
EBI UniRef50UniRef50_E2A1N24e-17141.22%Glutamine and serine-rich protein 1 n=8 Tax=Endopterygota RepID=E2A1N2_CAMFO
NCBI RefSeqXP_624473.23e-17144.02%PREDICTED: similar to CG17233-PA, isoform A isoform 2 [Apis mellifera]
NCBI nr blastpgi|3800263337e-17844.61%PREDICTED: uncharacterized protein LOC100866350 [Apis florea]
NCBI nr blastxgi|3800263330.041.98%PREDICTED: uncharacterized protein LOC100866350 [Apis florea]
Group
KEGG pathway 
Orthology groupMCL15359 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212208-TA
ATGGATCCAGTAGGACCATGGTCAGCATATGCTTCGTACAATCGGCTAGCAGGAGTACAAGCTGGTGCTGCAAGTGGAGATTTTCATCACCATCTGGCAAGCGGAGGAACTGGATTAGGCAGCCAATCTGTGCCTTCAACTACTAGCCAAATATTACTACAGGCAGCTCATACTACGGCATCATTAGCAGGACAACTTGGTTCATCCACTAGCTCTCCTTTCAACCCTGGTGGTTTCCTTTCTCCACCCACTGTGGGGTATGATGCAGTTTTCTCACCTTTATTTCATCATGCCAACCCAAAACCAGCACATTATAGTTCATCCCTACAGGCGCAGCACCGTCAAGTAATTGCCCAAGCGCAGGCAGCGGTCGCCTCTAAACAATCTTCAGTAGAAAGTGAAATTTCCTCATTAAGGGAAAATTATTCCCATCAACCGCTAGCTGCACAAGGAACATCATTTTTTGATCAACCAACAACACCTGGCAGTACGGCAGGTTTGAGTTGGCAAGGAAACAATCAACTTCCCAGTCCATTTGGAATTCTACCTCATGAAAGTGTTGTGCCCTCATCGCCAAGTCCTGCCACAACAAAAGCATCAGCGACATATGAAAACTTTAATGCTCACTTTGCTGCCGCTCAAACTCTCAACAATCACCTCAACTCTCAAATTTCCAGTGCCGGTAAACAAACAAACAGGTCGGGATCACCTGCAACAGCAACTAAACAACCAGCTTCTTCAACATCATCTTCGACTTTTTTTCAATCTCCTTCATCTTTTGGAAACCAGTCTGATAATTCGTACAGTTCAAGTGCAAAAAGTGGCCAACTCCCATCACAGCAGGATTATGCTGGAAGTAAGTCATATTCAAGTTCTGCAAGTAATGCTGTTCACTCCCAACAATCTTGTATAGTGTCAACTCCATCAGTAACATCCTCTCCCCCACATCCCAGCAAAGATTACCGTTCACCCCCTTCCAATTCTACCAGACCGTCAGCGACTATATACAATTCTTCACCTAAGAATTCTTCTAGTGAAAAAAGCCCCCACACATCATCTAGTAGTAGTAGCGGTTTTGTATCTCCCACGTCCAAACCTCCACAAATACAAACGAAAGCACAAAGTAAAATATACCCCGAGTTAAGTTCAGAGCAAAGAAAAATCTGTGACACGAATGACAAACTCCAACCCCAGTCGTCCCCTATAAGTTACTCTATTATGGATAGTCCTGGTAGATTAAATTTTACTTCGGGTACTGGAACCTCTTCTAAGACTAATAGAATTGGCAGTTCTCAGTATAATTCGACACAAAGTTCAAGTTTTAGGCATTATCAAAGTGGAAATAATGTAGAGTCAGAGTATCATGTTAGAGCTAAAAGTAGTTCTAGTACTGACACTGGATATTCCAGCAGTAGTTCTCAAAATGGACCTGATTGTGGGGTTGTAGTCTCTAGAAGGCATAGTCCTTTGCAGGCGGCACCTCAGACCTCGCCCTTGGGACATGGTTCTAGTCCAGCCTACCCTTTATACCATAGTCCCATGAATTCTATCAATTCCCCTCAACAGCTAGGGGATCATTACAGCAAAGTGAATAATGCTGCACCAAGGTCACCCTTAGATGCTTCTGTCTCACGACCACCCTCACAAAATAGCCAAGTCGCTTACCCTTCCGTTATAACAAGAGCCTTAGGAATTGAACAAACTAAATCTTATAGTGAAAATAGATACGAACGTAACCAAAATCAGCCAGCAACTCAAAGTTGTTGGGAAACAGAGAGGCAATCCAACAGAAAATTCAGCAGTATTGGTATGAGTAGGAGTAACAGTTCTAGTAGCTTCAACGGCTTAACTGAAAATAATACTCATCCTGAAAAATCCTCATCTCAGTCCCAGAGTTCTAATGAAAAAATATTGAGTTTGTCTGAAAGACATCATAATTACGTCGAGGGCAATAGTGTAGCTTTACAAGATCTCTCTAGTTGTCGCGGTGATCCTATGAGTATAGTTAAGAATCTTCAAAGTTTACAACAAAGTTGTCAAATCCAAGATAGTAAAAGTACTAAAAGTCTAACCCCAATGTCGAACTTGCCACCCGTAAGTAAATCAATATCCAGGCGAAAAAGTACTGAAAAAGTAGTACCTCATACTAATATGAATGACATTTCCAATGCCGTAATGGCAGATTATTTAGCTAATAGAATACCACCACCCGCGCATAGCTCCACGAGCCAGCAACAAAACGGTAGCTATTTTGATTTCGAGAGATGGAATCTTCCGCCCCCTCCGCCTAAAATGTTTCCCGGAACTTCAGCTTTCGGTTCCCAAGCACCACTACATGCCACTAACTTCAATCAACACCAAGCTCTGGCAATGCAGCACGGTCACACATTAACCTATTTTTCTCCCTTTCACCTAGGCCACCATCCCGATTTCCAATCCTCTGTGGAGTTAACACCTCTATCTTCATTTAGTGAAACTCCACCTTCGGCTTCCTCGTCATCTTTTTCCACACCAGAAACTCGTGAAGAGGAACAGCCCAAGGTGGTAGTACCTAATATAGAAGAAGAACTTGGTTTCCTTGCAGAACAACGCGCAAATACTGCATCGACGGTAGCACCCACTTCACAGCAGCAGAACATTAACAGTACTTCACAAGACGCTACTTCAAAAATAATGGAAAAAAAGTTCAATGTTCCCGTTACGGGCCCCGGTTCGGGATTCATGGCTTCTTATTTGAAATTTTTACAAGGAGAGAGAGACACCTCTCCTCCGCCGGCCGGTCGAGGCGCCAGGAAATCTACATGGTCGAGGAGCAACACCAACAGTAACACGAACAATAAAACATATCCCAACGACCATAACAAAAGTCAGTGCGAGACGAATTCGTCGCAGAGCGCCAACGGCTCCATGGCGACCATTAACTCCGGCATGACGCTCGGCAACCCCGCCATGAGCACGGCGCTCGCCAACCAGCCGCACCCCTCGTCCACCCTCCTCCCGCACGCCAAGGCCGCGGAGCAGGACGAGTCGCGCTACTACAGTCTGAACAAAGACAGAAAGAGAAAGTACGACGGCACGGAGGAGGCCGTCTACGACGCGGACGAGGAGGCGAGGCGGCTGAACAAGCCCGTGCTGAACGTGCCCAGCACGCCGCTCAGCGACAAGGGCAAGAAGGGGCGAGCGGCGGCGATGAGCAAGGCGCCGCCCTCGGCGTTGGTGGCGCCTCAGCCGGCCGCGCCCAAGAAGCCGCGGGCGCCCTCGCACCCGGCGCCTCCGCTGCCTCCCCCGCAGCAGCAGTACTATTACCAACCGCAGCCCGAGGAAGTGCCGTCGCACACAGTCCTGGGTTACGGGGTGTACGGCGACGGAGACGCGAACTCCAACAGGAAGTTGCATCATATAAAGCACCAACAAGTGTCGAGCAGTGCGCAGATTGAGAACAGACCTATCGAAGAAATGCCTTACCAGTCCGGCGAGTTTGTTGCAATAAAGAGCGAACTGAACGAGATGTGGCCGGCGATATGGAGAGTGGACGGCAAGACGTTACTGCAGAAGTATGAACCGTTTGAAGAAAATGGGAAAGTACTGTATAGAAATATATCAACGTATGCAGCTTGGAATCCTGAGAATAAAAAACTCTACACGCAAGTCCCAGTGAAAGTTCGGTCGCAGTCCCATTTAGAAACAATAGTAGAATTAGTGAGAAGCGAGTTGCAAGGGGATGAATGTAACTTTATAGAAAAAAGGATGTTGGAAACTCAAATGTACCAGGAAAACTTTGAAGTTTATATACAGACGTTAATATCGCACGCGTTGGATCCCAATTTCCTGACGGAGATTTTCCAGGAGCAAGACGAATACTTCCTGTCTAACGTCAAAACTGTGGACGAGGTGACTGAGAGTATGCGTCAACGCGTGTCGTGCAGTAACGCTCGGTCGCTGGACGCGGCCGTGGACGTGTGGCCGGGGCTGAGCGTTGCGGCGGGCGGTTCGGGGGCCTGTCGAGCCTGCACCAGACCCGCTGTGACTCGTCTGCTGCTGTACGGACAACCCTACAACCCGGCCACGCTCGAACCCGTGCAACCCAACGCCAGGCTGGCATACGAGAAGGAGTTCCTGTTATGCACAACTTGTTGCGGTCGAGTGCAGTTGTACTCGAGGATATCACATCAAAAGTACCTCATGTATGCCGAGTGCAGTAAACGTGTGGCTGAGAAACGAATGCAGAATCCAAGTAAAGACACCACAGACATACTCAACGAATTGCTAGCGGATGAAGTTTGGCTGTCACAGGAATTCCTGTTATGCACAACTTGTTGCGGTCGAGTGCAGTTGTACTCGAGGATATCACATCAAAAGTACCTCATGTATGCCGAGTGCAGTAAACGTGTGGCTGAGAAACGAATGCAGAATCCAAGTAAAGACACCACAGACATACTCAACGAATTGCTAGCAGATGAAGTTTGGCTGTCACAGCTGTTCCGAGACGTTCGACATTCGTGGGCCGAAGCGGAATCCTGGGAGAGGAAAATGAGACAGGCCATGACACGACAGATGATTTAA

Protein sequence:

>DPOGS212208-PA
MDPVGPWSAYASYNRLAGVQAGAASGDFHHHLASGGTGLGSQSVPSTTSQILLQAAHTTASLAGQLGSSTSSPFNPGGFLSPPTVGYDAVFSPLFHHANPKPAHYSSSLQAQHRQVIAQAQAAVASKQSSVESEISSLRENYSHQPLAAQGTSFFDQPTTPGSTAGLSWQGNNQLPSPFGILPHESVVPSSPSPATTKASATYENFNAHFAAAQTLNNHLNSQISSAGKQTNRSGSPATATKQPASSTSSSTFFQSPSSFGNQSDNSYSSSAKSGQLPSQQDYAGSKSYSSSASNAVHSQQSCIVSTPSVTSSPPHPSKDYRSPPSNSTRPSATIYNSSPKNSSSEKSPHTSSSSSSGFVSPTSKPPQIQTKAQSKIYPELSSEQRKICDTNDKLQPQSSPISYSIMDSPGRLNFTSGTGTSSKTNRIGSSQYNSTQSSSFRHYQSGNNVESEYHVRAKSSSSTDTGYSSSSSQNGPDCGVVVSRRHSPLQAAPQTSPLGHGSSPAYPLYHSPMNSINSPQQLGDHYSKVNNAAPRSPLDASVSRPPSQNSQVAYPSVITRALGIEQTKSYSENRYERNQNQPATQSCWETERQSNRKFSSIGMSRSNSSSSFNGLTENNTHPEKSSSQSQSSNEKILSLSERHHNYVEGNSVALQDLSSCRGDPMSIVKNLQSLQQSCQIQDSKSTKSLTPMSNLPPVSKSISRRKSTEKVVPHTNMNDISNAVMADYLANRIPPPAHSSTSQQQNGSYFDFERWNLPPPPPKMFPGTSAFGSQAPLHATNFNQHQALAMQHGHTLTYFSPFHLGHHPDFQSSVELTPLSSFSETPPSASSSSFSTPETREEEQPKVVVPNIEEELGFLAEQRANTASTVAPTSQQQNINSTSQDATSKIMEKKFNVPVTGPGSGFMASYLKFLQGERDTSPPPAGRGARKSTWSRSNTNSNTNNKTYPNDHNKSQCETNSSQSANGSMATINSGMTLGNPAMSTALANQPHPSSTLLPHAKAAEQDESRYYSLNKDRKRKYDGTEEAVYDADEEARRLNKPVLNVPSTPLSDKGKKGRAAAMSKAPPSALVAPQPAAPKKPRAPSHPAPPLPPPQQQYYYQPQPEEVPSHTVLGYGVYGDGDANSNRKLHHIKHQQVSSSAQIENRPIEEMPYQSGEFVAIKSELNEMWPAIWRVDGKTLLQKYEPFEENGKVLYRNISTYAAWNPENKKLYTQVPVKVRSQSHLETIVELVRSELQGDECNFIEKRMLETQMYQENFEVYIQTLISHALDPNFLTEIFQEQDEYFLSNVKTVDEVTESMRQRVSCSNARSLDAAVDVWPGLSVAAGGSGACRACTRPAVTRLLLYGQPYNPATLEPVQPNARLAYEKEFLLCTTCCGRVQLYSRISHQKYLMYAECSKRVAEKRMQNPSKDTTDILNELLADEVWLSQEFLLCTTCCGRVQLYSRISHQKYLMYAECSKRVAEKRMQNPSKDTTDILNELLADEVWLSQLFRDVRHSWAEAESWERKMRQAMTRQMI-