Monarch geneset OGS2.0

DPOGS201641
Transcript: DPOGS201641-TA (1956 bp)
Protein: DPOGS201641-PA (651 aa)
Genomic position: DPSCF300254 - 116817-118772
RNAseq coverage: 306x (Rank: top 37%)
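The three lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes the genomic coordinates are 1-based and inclusive, which the record does not state, and that the coding sequence ends with a single stop codon. Under those assumptions the genomic span equals the transcript length, which would be consistent with an unspliced gene model.

```python
# Values copied from the record above.
transcript_bp = 1956
protein_aa = 651
start, end = 116817, 118772

# Assumption: coordinates are 1-based and inclusive.
span_bp = end - start + 1

# Assumption: the CDS is the protein's codons plus one stop codon.
coding_bp = (protein_aa + 1) * 3

print(span_bp == transcript_bp)    # True
print(coding_bp == transcript_bp)  # True
```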
Annotation
Database        Hit                 E-value   Identity   Description
Heliconius      HMEL015663          0.0       87.37%
Bombyx          BGIBMGA008201-TA    0.0       83.90%
Drosophila      lin-PA              3e-154    49.39%
EBI UniRef50    UniRef50_B3ME97     6e-153    50.30%     GF11922 n=4 Tax=Drosophila RepID=B3ME97_DROAN
NCBI RefSeq     XP_001959681.1      1e-153    50.30%     GF11922 [Drosophila ananassae]
NCBI nr blastp  gi|194754797        2e-152    50.30%     GF11922 [Drosophila ananassae]
NCBI nr blastx  gi|194754797        2e-170    50.30%     GF11922 [Drosophila ananassae]
Group
KEGG pathway 
Orthology group: MCL15886, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201641-TA
ATGCAATTAATGTTTGATATTTATCTGAAACAAAACAATGTTGGCACTATATGCTCCAGATTAATGTATGCATGTGATATTTTTGTCAGAAACAGACATGACTGGATAACAGAGATTGTAGAATTAGCAGACCATAAGAGCAAATTTATAACCTTCGTCGCATGTAGAGTATTAGCCAGTTTCCTCATTGTATCTAAAGACACCGTCGATGAAAACTGGTTGCAGCAGATCACCGAAAATGTTTACTTGTTTGATAGAATTAATAGAATTACTGTGCAAAAGATAAATTTCAGCTTAGATATAATCAAGAGGATAGTTGAATGGAAGGATGTGGAGCAACATCCCTTGGATGAGACTAGTTATGCTAATGCACCCGGCACCATCCAGGTTCAGGAGGATAATCCATTTAGAGGCAGCTCGAGCTCGCAATCGTCAGCGGGCTCTAGTTCACGTAGTGATAATTTACATAGTGCATTTTCTAATTTGCAAACCCACAGCTCCCCTTCAACATCTTCCGATAGTAAGACAAGTAAACCAACATCCACATTCAAACTTAATGAGCCTGTCTTTAAATTTCCTCACGACCAATCCGACAGCAGAATGGAACCGGCTGATTCTCCAGAACCTTCTCAATCAAGGATAGAAAGAGTTAACGAGCATGGATGCATAACTGTGATATTAACAGACTCCGAGTCGTTTGATACATCTCATATAAAATGTTTAACTATAAAGACTTTAGAGCATCACTGGCCAATTCTAGTAAAAAACATGAAGCTGCTTCTGTTAAGGTACCTGAACTTATCTAATGCTGAAAATTGTATATTAACTTTCTTCTCCCTGTGGGAGAATATTATAAGTGTCAAAGCGAATCTCTCCGTCATTGATACCAAGCCATTTTATGCAGACTTGCAGGGTTTTGTAGATTTATTAAGGAATACAATGCTTCCGGGCTTAATATATGCACATTTACTTAGTTTATTTAATGAAGTCCTATGTTATGGTTCAACGTTGGCCCTTCAGGATATACTGCCCGAGGAAATATGTTGCTTAGCCCATTCTATAGTTAGGTATGTGAAAGACTTTAGATTGTTAAGTGAAGTTAGGGTTCAAAGTAGTAGAAGTGGGTTTGGGTTTTTGGAACATGACTGTAGAGTGATACATGATTATTCTTTGGGACCTGATATCGGGCCGTTATCATCATCAATACAATTGGTCGATCAGAGTTATGGTGAAGATGATAACGAAGATTCCACACAAAGCCGGACTGAAGTAGACAAAACCATGCTCCAACGAATGTCACTGCTGGTCCTCAAATCCGTAGCAGTCACCGTTAAAGAGATGCGATGTGACTCATCGGACAGTTCAATAGATTCGTCAGATTACAACGCTATACAGGACATGCAAATAGTTGAAAGATCGATACGGGATGTGCTTAAGAAATTGGATGTGTTTATAAGGAATCGCTTAGAGTTTCACCCGGAGACTCCGTTCACCAAGATGTTGATACATCTGTTCAGCGAGCAGGATGATTATCTCATTGAATCTATGGTGTGTACATTAGATATAACCGTGGGTATAGTGTATAGGAACTCTATGTATCCCGACTTAATACCAATGCTGAATCCCATAATGTCCTTCATAGAATTCCTCAGGGTTGTCGCACATGATAGTGATGTATTGTTAGATTATCTTGTCAGCAATGAAACCTGTTTTCTTTTATACTTGTTGAGATTCTTAAAATATGTGAGACGTAACTGGCCAAAATTCTTAGACACCTGCCAGCAAATGGATCCAGGTACAACGAGAGGCCTGGATGATACTATGAGGGTTTTGATAAGGTTGCGTTTACAAATCAGTAGGCTAGTATCGAAATCACTGTTCCCATACAACATCAGTCCAGTTCTGAGACTGCTCGAGGTCTGCGAAAGTCTCTATGAAGGAAATGAATTTAGCTGA

Protein sequence:

>DPOGS201641-PA
MQLMFDIYLKQNNVGTICSRLMYACDIFVRNRHDWITEIVELADHKSKFITFVACRVLASFLIVSKDTVDENWLQQITENVYLFDRINRITVQKINFSLDIIKRIVEWKDVEQHPLDETSYANAPGTIQVQEDNPFRGSSSSQSSAGSSSRSDNLHSAFSNLQTHSSPSTSSDSKTSKPTSTFKLNEPVFKFPHDQSDSRMEPADSPEPSQSRIERVNEHGCITVILTDSESFDTSHIKCLTIKTLEHHWPILVKNMKLLLLRYLNLSNAENCILTFFSLWENIISVKANLSVIDTKPFYADLQGFVDLLRNTMLPGLIYAHLLSLFNEVLCYGSTLALQDILPEEICCLAHSIVRYVKDFRLLSEVRVQSSRSGFGFLEHDCRVIHDYSLGPDIGPLSSSIQLVDQSYGEDDNEDSTQSRTEVDKTMLQRMSLLVLKSVAVTVKEMRCDSSDSSIDSSDYNAIQDMQIVERSIRDVLKKLDVFIRNRLEFHPETPFTKMLIHLFSEQDDYLIESMVCTLDITVGIVYRNSMYPDLIPMLNPIMSFIEFLRVVAHDSDVLLDYLVSNETCFLLYLLRFLKYVRRNWPKFLDTCQQMDPGTTRGLDDTMRVLIRLRLQISRLVSKSLFPYNISPVLRLLEVCESLYEGNEFS-
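
The protein sequence above corresponds to a frame-1 translation of the nucleotide sequence. As a minimal sketch, assuming the standard genetic code (NCBI translation table 1), the Python below translates a coding sequence and writes the stop codon as "-", matching the trailing character in this record. The function name `translate` and its `stop_symbol` parameter are illustrative and not part of the database.

```python
# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 30 nt and last 12 nt of DPOGS201641-TA, copied from the record.
print(translate("ATGCAATTAATGTTTGATATTTATCTGAAA"))  # MQLMFDIYLK
print(translate("GAATTTAGCTGA"))                    # EFS-
```

Passing the full DPOGS201641-TA sequence to `translate` and comparing the result with DPOGS201641-PA is a quick way to confirm that the two FASTA entries agree.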