Monarch geneset OGS2.0

DPOGS204466
Transcript: DPOGS204466-TA, 3141 bp
Protein: DPOGS204466-PA, 1046 aa
Genomic position: DPSCF300002 + 511558-518000
RNAseq coverage: 252x (Rank: top 41%)
Annotation
Heliconius: HMEL006253 (E-value 0.0, identity 77.40%)
Bombyx: BGIBMGA007806-TA (E-value 0.0, identity 69.86%)
Drosophila: CG5859-PA (E-value 3e-146, identity 31.93%)
EBI UniRef50: UniRef50_E2BWB0 (E-value 3e-175, identity 35.80%) Integrator complex subunit 8 n=7 Tax=Formicidae RepID=E2BWB0_HARSA
NCBI RefSeq: XP_395965.2 (E-value 4e-175, identity 35.32%) PREDICTED: similar to CG5859-PA [Apis mellifera]
NCBI nr blastp: gi|350419591 (E-value 0.0, identity 36.23%) PREDICTED: integrator complex subunit 8-like [Bombus impatiens]
NCBI nr blastx: gi|383847585 (E-value 2e-177, identity 36.36%) PREDICTED: integrator complex subunit 8-like [Megachile rotundata]
Group
KEGG pathway 
Orthology group: MCL13763, Single-copy universal gene

Nucleotide sequence:

>DPOGS204466-TA
ATGGATGTCGATTTACTGCGACCGGGCACTGTGCCCATATCGCCGGACACAGTGTTGTGGTTTGAATTTCTATTAGACCCCAATTTGCTTAAAAGCCATTTAAGTAAACCCAACCCAGAACCTTCAGCTTGCGAATTAATAGAAGAATTTATAAATGTGGACACAAAACAGGCAACTTCTAAACCTACTAGTGAGAGGATAGTTGATGTAGATGCTCCTCCGAGTCCACCAACACCAACACCTGGACCTCTTCAATTCACTAGAAAACAACTGGCCTTAAAGATTTTAGCCTTAAAAGTTGCTGCTACATTGCACTGGAATCTAGACATATTTGAAACAAAGCTGCCACCACAAACTCAACAGCACCTCATGCAGGATCTGGTATACATGGCTACCGATACCAACTTTGCCATTCCCCCTCAGGATGTGCCGGTGGAAATGTTTCACAAACCGCAAGCCCAGTTTGCTCTCACACTATATCACAGATGGGTATTAAGGTTTCCCGTTAAAACTGCATTGTATTCAAAATCCAATAAATTGCCATTTGTGCATGTACCGGGTTTACAGGGAGATACTGCTTTTAGTCCTTTAAATCAAAACTTTGAAAAGATCTTACGTATGTGTGAAGCATCACGAATACAGAGTATCAGATATTTAGACAGTGTACTATCATACTATGAGACTACACTTGGTAGAAAAGATGCAAGAAAGATAAAAGTTCCTGTTAGGGAAGCATTTGTTCATCTCACAGAAGATATTAATGATATGAATCATAATTGGAACGCCGGTGATACAGTTATAAGCCAATATGAATTAGCAATGCAAATACATTTTGATCTCTGCTACAATTACTTTTTTTACGGCCAACACGATCTTGCGAAACAGCACATTTTAGGTTGCAGAGAAAATTCTAACTTACTTGAGAAAGAGGTTGCAAGTTACGGCTATGGGCCACATAAGACGGTACCATGGGGTGAATTTTATTACGCTAGTATGACCAAGGACGACATTTTGGGTTATATAAGAGCTTTGAACTTAGGTTATGAGGTATTGAACGAAGAGCCTTCATTGCTACAAAAACTCCAGGAATCTATAGCAAACCATTATACTGGTATCATAGGTATATTGCAAGCAGACAACCTAGCAAGAGTTATTCCAATGGTTCATAGAGATGTGGTCGAGCTCGACATTCAAGGTTCGGCTTCCAGTGGAGCGTTTACTGTGGCTAGGGATCTTTTGAATCGTGTTGCCGCTTTGAATGCTGTAAGATACTCCCTAGAAGGCGGAATACCATCTACACATCCAGATTTTCTAAACAAACTTAAGACGTCAGGTATTAAATTCTTCGATTTACTATTATGGGCCATGGCTCCGGTACTAATGTCTAACTTATCAGAGAAGGATTGGGAGAATCTGCGTGTATTTTTCCTTCATCTCGCTACATCGCAATACAAGTTACCAATCGACAGGTTAGACGAATATCTGAAGAAATACGTAGGGGACGCCAGCGAGTCAATCAGGAGAAAACTAATACCAGACGAACATCTCAAAGAAATATTGAACGACCCAAATAATATTGACGATGAGAACATTGATATCCCAAAAGAACTGTTGACAGATGACTGGGAAACACCAGACTTTGACTTTAAATCCGTACCTGAACTGGAAATGGGTAGATTGAAGAAGCGTCTTATAGAGGCGTCTACCGCTGACGATGTACGGATGTGTCTTGTTAAACTGGCAATGATGTCTCCGACTTCACCGTTATGGAAGCTGAGCCCATCTTGGAAACCTCCCTCGAGCTTGGTTAATGCATTAATGGCCTTGCCTCGAGGCTTTCTTCAAGATTTTGGGTATATTGTGTCAGGTACAGCAAGGGCTCGTCTAGAAGCTGGCTATGCAAAAACTGCTTTATCTCTTTTGTCTGCGTTAGAAGGGGAAGCGAGAAGTCAATTGGGGGGAGGATCTGACCCCAATTTATATAGACTTTGTCGGCAATT
GTCTTGGGAAGTTTTATTGCTACAAGTTAATGTAATGTTAAGCGAATGGCCCCACCATCACATCAATCTAACGGTCCTGGCAAATAAATGTAAAGCATGCATCGCAGTAGCTACGTCAGGCGACAACGTTGTACCGCGACCTCAAGTGTTGGAGGCCTGTTGGACGTGCCTGTTAAACGCATGCGAGTGGGAAGTGACAGGTGTCAGCGGGGGTGCGGGCGAGACATCGGCAGCTTTATGTGCAGCATGCTTCGAGCTGCAGCGGGGGAAGGGGGCGAGGAAATTCCCCCGCGCTTTGTGGGACTATGCCTTATCAGTATACAGTAATGGTTCGAACGTACCGGTGAAGCGCACAGCTGCTGGTATGCCGGCTCATTCACGCGACGCTCCGAACGCGGCGGCCGAAGCTAGGAACGCCTTCAACTCGTTCCTCACAACCCTCAGAGAGCCTCTCGCCATCAGCGTCATGATGTCCCTCCTAGCCAGGATACATAACCTCATAATAAACGATAACTCATTGGAACTGAACGTGGAATACACGAATCTCTGGCCGCCGAACATCTCCAATATAAACAATTATAACTTAAAACATGTCTTGGAATCCCTCACTGAACTCCTTGAACGAAGTCTGAGGCTATACCCATACAATACCTCGTGGCTCCGTCTGTATGGGGACGTGGAAATGGCGGGCGGTAGATGGGCGGCGGCGTTGCGTCGTTATTTATGCTCGTTAGCGGCGAACACGTGGCATTTCACCAAGCGTGCACCGGACGAGGGCGGCATAGCTCGGCGCGCGGCTCGCTGCTGTCAGGCGTTGTGCGCGCCCACTCAGGCCGCGGCCTTATGTCAGCTACCCGACGAACCGGATTACACCATCGCCTTCAAGTGTCTCGCAGAAAAGACTGGTAACGCAGCGGACGCTATGGACGGTTATTATGGTTGTCTATGGGACGGGACTCTGCTGGAGGTCGGCGTGGCGCTCCACGCTCGTAGAGGCGAGGGCGGCAGACGAGCTCGGGCCGTCAAAGCGGCCGGAGCCCTGGAACTCAACGCCAACAACAGCGAAGACATCCAGAGAGAGGCCGCCGCCATACGAAGAGCTAGATTACTAAGAGCACTCACCAACCAATATGTCGTGTGA

Protein sequence:

>DPOGS204466-PA
MDVDLLRPGTVPISPDTVLWFEFLLDPNLLKSHLSKPNPEPSACELIEEFINVDTKQATSKPTSERIVDVDAPPSPPTPTPGPLQFTRKQLALKILALKVAATLHWNLDIFETKLPPQTQQHLMQDLVYMATDTNFAIPPQDVPVEMFHKPQAQFALTLYHRWVLRFPVKTALYSKSNKLPFVHVPGLQGDTAFSPLNQNFEKILRMCEASRIQSIRYLDSVLSYYETTLGRKDARKIKVPVREAFVHLTEDINDMNHNWNAGDTVISQYELAMQIHFDLCYNYFFYGQHDLAKQHILGCRENSNLLEKEVASYGYGPHKTVPWGEFYYASMTKDDILGYIRALNLGYEVLNEEPSLLQKLQESIANHYTGIIGILQADNLARVIPMVHRDVVELDIQGSASSGAFTVARDLLNRVAALNAVRYSLEGGIPSTHPDFLNKLKTSGIKFFDLLLWAMAPVLMSNLSEKDWENLRVFFLHLATSQYKLPIDRLDEYLKKYVGDASESIRRKLIPDEHLKEILNDPNNIDDENIDIPKELLTDDWETPDFDFKSVPELEMGRLKKRLIEASTADDVRMCLVKLAMMSPTSPLWKLSPSWKPPSSLVNALMALPRGFLQDFGYIVSGTARARLEAGYAKTALSLLSALEGEARSQLGGGSDPNLYRLCRQLSWEVLLLQVNVMLSEWPHHHINLTVLANKCKACIAVATSGDNVVPRPQVLEACWTCLLNACEWEVTGVSGGAGETSAALCAACFELQRGKGARKFPRALWDYALSVYSNGSNVPVKRTAAGMPAHSRDAPNAAAEARNAFNSFLTTLREPLAISVMMSLLARIHNLIINDNSLELNVEYTNLWPPNISNINNYNLKHVLESLTELLERSLRLYPYNTSWLRLYGDVEMAGGRWAAALRRYLCSLAANTWHFTKRAPDEGGIARRAARCCQALCAPTQAAALCQLPDEPDYTIAFKCLAEKTGNAADAMDGYYGCLWDGTLLEVGVALHARRGEGGRRARAVKAAGALELNANNSEDIQREAAAIRRARLLRALTNQYVV-
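
The transcript and protein lengths in this record are consistent with each other: 3141 bp is 1047 codons, which is 1046 amino acids plus one stop codon. The sketch below shows one way to check that relationship and to translate a coding sequence. It assumes the standard genetic code (the record does not state which translation table was used), and the names `CODON_TABLE` and `translate` are illustrative, not part of any geneset tooling. The stop codon is written as `-` to match the protein sequence above.

```python
from itertools import product

# Standard genetic code (assumption: NCBI translation table 1),
# listed in TCAG order for the first, second, and third codon positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence whose length is a multiple of three.

    Stop codons are emitted as `stop_symbol`; the record above uses '-'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length check using the figures stated in the record.
transcript_bp = 3141
protein_aa = 1046
lengths_agree = transcript_bp == 3 * (protein_aa + 1)  # +1 for the stop codon

# First seven codons and last five codons of DPOGS204466-TA, copied from above.
start_peptide = translate("ATGGATGTCGATTTACTGCGA")
end_peptide = translate("CAATATGTCGTGTGA")
```

Here `start_peptide` and `end_peptide` can be compared against the beginning (`MDVDLLR`) and end (`QYVV-`) of DPOGS204466-PA. Translating the full transcript requires joining the wrapped sequence lines into one string first.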