Monarch geneset OGS2.0

DPOGS213579
TranscriptDPOGS213579-TA1872 bp
ProteinDPOGS213579-PA623 aa
Genomic positionDPSCF300033 + 273541-277996
RNAseq coverage19x (Rank: top 80%)
Annotation
HeliconiusHMEL0179110.080.48% 
BombyxBGIBMGA011649-TA2e-7887.26% 
DrosophilaCG34109-PC8e-16355.78% 
EBI UniRef50UniRef50_F4W4575e-16451.30%Uncharacterized protein n=5 Tax=Formicidae RepID=F4W457_ACREC
NCBI RefSeqXP_393729.21e-16951.94%PREDICTED: similar to CG7384-PA [Apis mellifera]
NCBI nr blastpgi|3838482113e-17053.20%PREDICTED: uncharacterized protein LOC100882610 [Megachile rotundata]
NCBI nr blastxgi|3800288347e-16652.43%PREDICTED: uncharacterized protein LOC100865483 [Apis florea]
Group
KEGG pathway 
InterPro domain[225-406] IPR0052402.6e-38Protein of unknown function DUF389
Orthology groupMCL17018 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213579-TA
ATGCCTTCCGGTGTGATATTCATGATTTATATCCCGTCAAGTAATTTCGAAAACATCTTGAAGTTCGGTAAACGTGGTGATTTGATATCTATTAATGATGCTGGAGAGACACCGCAATTGGAGGTTGAATTGCGTTATAAAAAAGCAAAGAATATACCAATACAGTTCACCCAGAACCACTACGATGTGTTCAGCGCAACCCTGGAAAATATAACGGAGGCAGACGAATTTTTTACCATGGAAAGCGAGTTGCTAAGAAAGCTCGATGTGGAGCAAGCGATATGGGTCAGCGATAAACCAGGCAACTACTACCAGGTCGCGTTCCCTCTACCGACAGGAGACCAATGCGAAACCATGTTGCATTGTCTCACGCAATTGGGCATTGGAGTTCGTAATAAATCTATCGTTAACGTACTGCCGTGTAATGTAGTGCACATCGCTTCAGATCACGAAATTGAAGACGATGAGTCATTGCTAAGGAAAAATGAAGAAGCAAAACGTTGGCGCAACTTCGTCGAATCAATCAGATCTAAATTGACAGTTAAACAGGTAGTGGACGGAGTCAGGGGAGGCGGCGAATTGTCATTTGACTATCTGACGTTGATTATAACTGCCGATTCCTTGGCTGCTTTGGGTCTTGTGGAAAACAATGCATCAAACATAGTAGCCGCCATGCTGGTTTCCCCGTTAATGGGCCCCGTCATGTCAATTACGTTTGGTACAATTATCGCTGATCGAAATCTTGTGAGAAGCGGCTTCGAGAGTCTTATATTGGGCATGTTCCTATCTCTATTATTCGGTTTTATATTTGGGCTCATTCTCGGAACTACGGAAATGCCCTGGGGTTTTGGTGATTGGCCCACTGAAGAAATGAAGTCAAGAGGTAATGTGCGGTCATTATGGATGGGCGTCCTTTGGGCTTTGACCTCCGGGACAGGTGTAGCCCTCGCCCTACTTCAAGGCAGCGCTGGCCCACTGATAGGCATCGCCATATCTGCGTCTTTGTTACCACCAGTTGTTAATTGTGGCTTATTCTGGGCATTGGCATGTATATGGTTGTTGTACCCGGGCGTCAAAATACCACACATCAAAGGGGAGCCTTATTCTGGTAATTCGTCTTACGTCCCACTTTATCATGACTACTTGCCGATAGAATTTGCTATAAATGGTATTGTAAGCTGTTGTCTTACAATAGTAAACGTCATCTGCATATTCATAACCGCGATAATATTTCTAAAAATAAAAGAAGTGGCAGCCCCGTACACATCGACGCCGGACTTGAGACGTTTCTGGGAACAGGACATCAAGGTGGCGAGGGAAGCCAACCGTGCAAATTTACAGCAGGCTGAGGACGATGAAAGGACTGAATTGGTGTTAGAAGACATGAATCAGACCGATGAGGGGGTAAAAGAAAAACTAGAAGCGGCCGTCCAAGAAGCACTCGACGACGAAACTTATAGAAAAGTCAAAAGAATGAGCTATCAAAGTCACAATGCTGACGAGGTAATCCGAACGATCGGTCTTCACCCTCGTTCTCCGCCATCGAACCGCTCCAGTGCGATTGGTTCGGGGGTAGCAACCCCTCATAATAAGGGCTCCAATGTCAATGACATCGTAACACTGGACAAACTACTCACATCGTTGCTAGGACTACAGAATAAAAGGCCGAGATCGTTCAGGTCCCATTTTGGATCTCCAAGAGCGAGCAGGCTGCCTACGTTGCAGGAATTTGGACAAAATAGACGCATAGAATCCTGGCCAGAGAAAATATTTGAGGATAGCGTAGTCAGAAACGTTATTAGTAATTTGAGAGCCAGTAAGAGAAATTCAAAAATTTCGGCAACTGACGAAACATTTTTAACGCCGAAGTAA

Protein sequence:

>DPOGS213579-PA
MPSGVIFMIYIPSSNFENILKFGKRGDLISINDAGETPQLEVELRYKKAKNIPIQFTQNHYDVFSATLENITEADEFFTMESELLRKLDVEQAIWVSDKPGNYYQVAFPLPTGDQCETMLHCLTQLGIGVRNKSIVNVLPCNVVHIASDHEIEDDESLLRKNEEAKRWRNFVESIRSKLTVKQVVDGVRGGGELSFDYLTLIITADSLAALGLVENNASNIVAAMLVSPLMGPVMSITFGTIIADRNLVRSGFESLILGMFLSLLFGFIFGLILGTTEMPWGFGDWPTEEMKSRGNVRSLWMGVLWALTSGTGVALALLQGSAGPLIGIAISASLLPPVVNCGLFWALACIWLLYPGVKIPHIKGEPYSGNSSYVPLYHDYLPIEFAINGIVSCCLTIVNVICIFITAIIFLKIKEVAAPYTSTPDLRRFWEQDIKVAREANRANLQQAEDDERTELVLEDMNQTDEGVKEKLEAAVQEALDDETYRKVKRMSYQSHNADEVIRTIGLHPRSPPSNRSSAIGSGVATPHNKGSNVNDIVTLDKLLTSLLGLQNKRPRSFRSHFGSPRASRLPTLQEFGQNRRIESWPEKIFEDSVVRNVISNLRASKRNSKISATDETFLTPK-