Monarch geneset OGS2.0

DPOGS211742
TranscriptDPOGS211742-TA2742 bp
ProteinDPOGS211742-PA913 aa
Genomic positionDPSCF300364 - 86570-95759
RNAseq coverage339x (Rank: top 34%)
Annotation
HeliconiusHMEL0168170.055.98% 
BombyxBGIBMGA004382-TA2e-15359.65% 
DrosophilaCG12702-PA8e-2022.27% 
EBI UniRef50UniRef50_E2BP192e-5528.28%Protein KIAA1524 n=9 Tax=Formicidae RepID=E2BP19_HARSA
NCBI RefSeqXP_393467.23e-4928.43%PREDICTED: similar to CG12702-PA [Apis mellifera]
NCBI nr blastpgi|3071892311e-5828.73%Protein KIAA1524-like protein [Camponotus floridanus]
NCBI nr blastxgi|3320170891e-6324.54%Protein CIP2A [Acromyrmex echinatior]
Group
KEGG pathway 
Orthology groupMCL14552 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211742-TA
ATGATGGAGGTTGACGGTGTGGAGAGTTGGAACAATCCTATGGAAGGAGCGAAATTCGCAAATTTAAAAGCTTTTGTAAATGCTGCTCGCGAATTTGAAGCTACTCACAGTGAATCTGCAATAAATATGATGACTCGCTATTTAGGATTGATAGCTTCATCATGTGATCTTACAATTTTTTCTCCCGGTCGGAGTGAGGTGTGCGCGTTTTTTAGTTCCCTTTGGAGGGTAATGTGTGATGCGAGAGGTCCTCACTGGGCAGGAGTAGCAGTACTCGCCCGGGCCGCTATAGAATCCAGTACTAGACATGCCTTAACACATACTTACAAGTTTATGCCTATTTTGTCAAGACTTCTCTCAGACAATATTTCAAATGATAAAAAAATTAAACTTCTATCTGTGATGCAGGACATCTCATATGGTATAAAAATAAGCTGGCAAGAGTCCTATCTCACTGGACTCATGAAGCAGCTCACCGATTGGATAACGCAGCCCATGACTGAACCACAGCAGCGTGCTATTGGTCACAAATCATTGACTGTACTTGTTAATGTGTGCTATGGAAACCTCCCGGCCATTTATGCTTTAATGAGGACCGTTGATACCAAGGAGTTTGTTTTACATTTGATAAGTTTAAAGGATGGTGCATACGGTGGTGTTGAAGTGTGCCGTCTACTACTCTGCCTCTCAAGCGCCACCCGCGGCGCTGCTTCTACGAGACAACCAGACGTCCACAGCTACCTATGTTGCACTATGCGGACGTTCTCTAAGGCTATAGTTGAAAAGGATTCAACTCAGTTACTTCATGCATACACATTCATTAATGACCTTTGCTCAGACAGCGGTTTAAGAAATTATGTTCTGACATACACAAAATTTAATAACTCGCTTCTGGATTCATTGAATAACATTGAAGGACTGTGTAAAACGTCTCCGGATGATATGGGCGAATCCGAGAACTGTACGAACTGTTTGTGCAATGTGCTGAAATTTCTCACGGTTTTAGTAAATTTAGATATATATTCGCTGAGAAGCTTCCACAGTCAGCTGGTGTGTCTGTGTATGAAATCGTCTCGCATTTGTCTGCCGGAGTCTTTAGAGCTTTTCGCTGCAATTGTTTCCGTATATAAAGACGAGGGTGCCCTTCCAAGGGAATTGATAACAGTGATAAATGATGGATTACCAGCTCTATTGGTGCCACCGGCCTTGGAGCCGGGGAAAGCTGGCTTACAATGGTTACAAGTAGTTGGAGTGTTATGCGAAATGAGTGAAACCCAAGAGCGGGTCTTGCAGGAAGTAACCCCTGACGCGTTCGAGGACACCCTGTATAGCGTGTTACAATATACATCACAGAACGGTCCGGTGGGTAACGAGACGGCCCAGCAGTGCGTGGTGGTGGCTTGTCGCGGCGGGTTGTGTCTGGCGCCGCTGCACACGCACTGGGAGGCCGCGTTCAACAGGATGCTCGCACACCACCAGGTCCGTAAGTTGTTAAGCGCTGGTCTGACAAGCGGTAGCGGTCCGCGTCGCAGACAGATCTTACAACTAATCAAACATCACTATTTTCCATCTGAACATATGAATCAGATATTCGGTGATAATCTTCAAAATGTATCAGACATCAGTGTGGAAAGTGTGTCGCCTCGTGAGGAACTCGATAGTGTATGGTCAGACAGACTCACACCAGCCCAGGAGAGAGCTATTGATGAGCTCATCAACATTATGAGGGAATCTCTAGTCAGCGGGAAGGTCCGTAAGTTGTTAAGCGCTGGTCTGACAAGCGGTAGCGGTCCGCGTCGCAGACAGATCTTACAACTGATCAAACATCACTATTTTCCATCTGAACATATGAATCAGATATTCGGTGATAATCTTCAAAATGTATCAGACATCAGTGTGGAAAGTGTGTCGCCTCGTGAGGAACTTGATAGTGTATGGTCGGACAGACTCACACCAGCTCAGGAGAGAGCTATCGATGAGCTCATCAACATTATGAGGGAGTCTCTAGTCAGCGGGAAGATTAATGACATAGCTACATCAAGTGTAATGGAACTCTACGGATACAAGATGACTTGCTTAGAACAGAGGTTGCATTCACATTCCTTAGCGTTACAGGGAGCCACAGAGCATATGGCGTCATTGCAACATGCGCTTGCATTGTTACAAGCAACGAATACATCACAGCAGGATGTATTATACACTACACAGATGCAAAACGAAAAACATAAAAAAGTAATAGAGGATCTTCACAAGCAATTAGAAGACGCTGAAACAACAGTACGTGGATACCGAGCGAAGTTGGCCGCTGAGAGATTGGATAAAGAAAATCAAAAAGAGCACTTACAGAAAGAGTTGCGAGCCCAAATAGTTACCATAGAAAATGAAATGAAAGTACGCGAAAGAGAATTAGAAGAGCGTTTGAAGCAACAGGAAGCGGATAATAGAACATTACAAAAGAAATTGGAACAACAGTCAAACAAGAACAACGAGCTGGCCGGAGTGTTGATAAAGTTCGAGGAAAGAGTTAAACAGCGCGACAAGAAGTTGGAAGAAGCGGCCGCCGCTGACACCGCGCTCAGGAAGGAAATAGAACAGAAAGAGAATACTATAAAACAATTAGAGAAAACGGTAGTGGAGCGAGAAAACAGATTGTTCCAAGTGACGTCACAGCTGGAAGAAATGAAACGAGTCCAGGAGATGGTTGCTAAGCTTATGAGCAAAAGCGCGTCTACTGCCAGCTAG

Protein sequence:

>DPOGS211742-PA
MMEVDGVESWNNPMEGAKFANLKAFVNAAREFEATHSESAINMMTRYLGLIASSCDLTIFSPGRSEVCAFFSSLWRVMCDARGPHWAGVAVLARAAIESSTRHALTHTYKFMPILSRLLSDNISNDKKIKLLSVMQDISYGIKISWQESYLTGLMKQLTDWITQPMTEPQQRAIGHKSLTVLVNVCYGNLPAIYALMRTVDTKEFVLHLISLKDGAYGGVEVCRLLLCLSSATRGAASTRQPDVHSYLCCTMRTFSKAIVEKDSTQLLHAYTFINDLCSDSGLRNYVLTYTKFNNSLLDSLNNIEGLCKTSPDDMGESENCTNCLCNVLKFLTVLVNLDIYSLRSFHSQLVCLCMKSSRICLPESLELFAAIVSVYKDEGALPRELITVINDGLPALLVPPALEPGKAGLQWLQVVGVLCEMSETQERVLQEVTPDAFEDTLYSVLQYTSQNGPVGNETAQQCVVVACRGGLCLAPLHTHWEAAFNRMLAHHQVRKLLSAGLTSGSGPRRRQILQLIKHHYFPSEHMNQIFGDNLQNVSDISVESVSPREELDSVWSDRLTPAQERAIDELINIMRESLVSGKVRKLLSAGLTSGSGPRRRQILQLIKHHYFPSEHMNQIFGDNLQNVSDISVESVSPREELDSVWSDRLTPAQERAIDELINIMRESLVSGKINDIATSSVMELYGYKMTCLEQRLHSHSLALQGATEHMASLQHALALLQATNTSQQDVLYTTQMQNEKHKKVIEDLHKQLEDAETTVRGYRAKLAAERLDKENQKEHLQKELRAQIVTIENEMKVRERELEERLKQQEADNRTLQKKLEQQSNKNNELAGVLIKFEERVKQRDKKLEEAAAADTALRKEIEQKENTIKQLEKTVVERENRLFQVTSQLEEMKRVQEMVAKLMSKSASTAS-