Monarch geneset OGS2.0

DPOGS208159
Transcript: DPOGS208159-TA, 2664 bp
Protein: DPOGS208159-PA, 887 aa
Genomic position: DPSCF300058 + 158809-168771
RNAseq coverage: 188x (Rank: top 48%)
Annotation
  Heliconius      HMEL011091         3e-70    95.38%
  Bombyx          BGIBMGA014067-TA   2e-145   76.19%
  Drosophila      CG15160-PA         1e-76    46.67%
  EBI UniRef50    UniRef50_E2C8S0    7e-85    54.36%   Uncharacterized protein KIAA0460 n=9 Tax=Endopterygota RepID=E2C8S0_HARSA
  NCBI RefSeq     XP_001603086.1     4e-88    50.14%   PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis]
  NCBI nr blastp  gi|383855278       1e-86    55.12%   PREDICTED: uncharacterized protein LOC100883214 [Megachile rotundata]
  NCBI nr blastx  gi|307192421       1e-89    34.10%   Uncharacterized protein KIAA0460 [Harpegnathos saltator]
Group:
KEGG pathway:
InterPro domain
  [11-130]  IPR006569  1.9e-43  RNA polymerase II, large subunit, CTD
  [1-138]   IPR008942  1.3e-22  ENTH/VHS
  [59-118]  IPR006903  5.5e-16  Domain of unknown function DUF618
Orthology group: MCL16030 Insect specific

Nucleotide sequence:

>DPOGS208159-TA
ATGGGTGAAACCGAAGAATTCAATACTTTAGCGTTCGAAAAGAAACTCACACAGCTGAAGGATACACAGGAAAGTATCCAGTCTCTATCTAGCTGGTGTTTGAAACAAAGAACTCACCATAAAAAGATAGTTTCCAGTTGGTTGAATGTGTTGAAGAGAGTGAAGGTGGAACAGCGACTTGTGTTGTTTTATTTAGCTAACGATGTTATTCAATATAGCAAAAGGAAAAATTACGAGTTTGTTGAAAGCTGGGGTCTTAATTTGCAAAAAGCTACACCACTAGTCAGAGATGAGAAAGTTAGACCGAAGATATTGAGGATATTCAAAATCTGGGAACAAAGATCAGTTTATGATGATGAGTTCTTATCAGATCTCACAGGTCTACTGAGTGCGGGGGCTGTTAAAAAGACTGATGACGATCCATTGGATTTTCAAGCAACCTCAACAGTTGGTCAACAAGATAAGACAGTGTACTGTGCTAGAAGCGGATACTCAGGTGAAATTGAAATTCCATTAGAATTGTCGGATACGGATGCATTATGTTCAAATTTAAAAGAGAGAAGTAGCAAGGATGACGTAGAAAAGGAACTCAATGAAGGCATACAATGTGTGGAACGATACACTCAAGCATTACAGAGGGAAATAGTTGCTAGGGAGGCATTACTAGCATTGTTGAGTTCAGCAAACCAATACTACTCTACGCAGAGGGGAGAAGTGAAAGTTGTGGCATATGCATACAAAAACTTCGGTTCCCGAGTGCGCGCTCTCAAACGCAAGTTAGATGAATTGTTGCCAACACTACCGAGTGCACCGTCGCCGCCGACTAGAGATGAAGACGTACCCTCACCTGGACCGGACGAGGATTTGGAACTACCCACCAATGAAAATGAAGTATCATACAACATCGACCAAACCTTCAATACATCGGTGTCAGCTGATGGGTCCTTGTATAACTTGGGACTATCGTCGTTCCTCAACGAAAACTCCATGGCTATATTCAATGAAAGCCAAGCGGATTTAAATATTGTTAATAGCAGCATACAGCCAGATACGCTTCCGGGATTGGACCTCCTTAAGGAATCCAACCCTCCACCACCGACATCATTCTACGGGACCATGGAAACTATCACTAAGAGAAGTAGCAAGGATGACGTAGAAAAGGAACTCAATGAAGGCATACAATGTGTGGAACGATACACTCAAGCATTACAGAGGGAAATAGTTGCTAGGGAGGCATTACTAGCATTGTTGAGTTCAGCAAACCAATACTACTCTACGCAGAGGGGAGAAGTGAAAGTTGTGGCATATGCATACAAAAACTTCGGTTCCCGAGTGCGCGCTCTCAAACGCAAGTTAGATGAATTGTTGCCAACACTACCGAGTGCACCGTCGCCGCCGACTAGAGATGAAGACGTACCCTCACCTGGACCGGACGAGGATTTGGAACTACCCACCAATGAAAATGAAGTATCATACAACATCGACCAAACCTTCAATACATCGGTGTCAGCTGATGGGTCCTTGTATAACTTGGGACTATCGTCGTTCCTCAACGAAAACTCCATGGCTATATTCAATGAAAGCCAAGCGGATTTAAATATTGTTAATAGCAGCATACAGCCAGATACGCTTCCGGGATTGGACCTCCTTAAGGAATCCAACCCTCCACCACCGACATCATTCTACGGGACCATGGAAACTATCACTATTCCTGATGAACCCGATCAACCTTATCTACCAGAAGCTGTGGTCACAAACAGTCAATGGGCAAATAATACTTGGAATGTGCCTCTTCCAGTGGCACGTAACGTGTTCGCGGAGCCCCCAGCCTCACCGCCCGTGCCGATACGAACAGACACACAGGTGGATATTTCCGCGTCTGATCATGAGCTTCGCAGCCGCCTGCCACCACCGCCACCGCCTCCTGTACTGCCAGGTCTTACACATATCGAGGATGTAGATCACAGGCTACTGCCAAGTCTACCTCCGACGCCCGTACC
TCCGCCTATACGTCATTCACATCAAGATGTGGATCACAGGAATCTTATATCACTGACACAACTACCTCCCAGACATGTTAATGTAGATCAAGACTACCGCCTACCTCCACTGTCTCAGCCCCTGGGTGTGCCTCTCCTGCCACCTCCGCCTTCAGACATTGTTGAGAGTGTCGATATGGACCTATCAGAGGACGAAGAGCAAGGTATGTACCAGACACAGAATCAAAGCGATCACAGACACAACAGCTTTAACAACAATAAAGTACTGGTCGGTGGTGAGAAGAAGGACAACAGTAATCTCATACAGATAAACGCTAATATAGACATCGAGGCCCCCCACGGAACCCCCGTGGCCCCCATGACGAACCCCTTCGACAACATGCCCCCCCAACTACGATACAACTTCAATACCAACTATCAAAAGAATCCAGATAACGCCGAATATAATGATGATATAAGAAATCGTAACCTCGATAGAAGAAATCAATCGCCTGAGTACGAAGATTATAAGAGTAACGATTTCCAGGCCCCCCGGCCTTATATGAACCGGTTCCCGAGGAATTGGGGGCCTCGCAATAACTTCAGGGCCCCATATAATCAGTTCAATCAGCGCAACGGCGGCCCTCGACAACGATGGGGCGGGCCCAGGCAAAGGTTTTGGTGA

Protein sequence:

>DPOGS208159-PA
MGETEEFNTLAFEKKLTQLKDTQESIQSLSSWCLKQRTHHKKIVSSWLNVLKRVKVEQRLVLFYLANDVIQYSKRKNYEFVESWGLNLQKATPLVRDEKVRPKILRIFKIWEQRSVYDDEFLSDLTGLLSAGAVKKTDDDPLDFQATSTVGQQDKTVYCARSGYSGEIEIPLELSDTDALCSNLKERSSKDDVEKELNEGIQCVERYTQALQREIVAREALLALLSSANQYYSTQRGEVKVVAYAYKNFGSRVRALKRKLDELLPTLPSAPSPPTRDEDVPSPGPDEDLELPTNENEVSYNIDQTFNTSVSADGSLYNLGLSSFLNENSMAIFNESQADLNIVNSSIQPDTLPGLDLLKESNPPPPTSFYGTMETITKRSSKDDVEKELNEGIQCVERYTQALQREIVAREALLALLSSANQYYSTQRGEVKVVAYAYKNFGSRVRALKRKLDELLPTLPSAPSPPTRDEDVPSPGPDEDLELPTNENEVSYNIDQTFNTSVSADGSLYNLGLSSFLNENSMAIFNESQADLNIVNSSIQPDTLPGLDLLKESNPPPPTSFYGTMETITIPDEPDQPYLPEAVVTNSQWANNTWNVPLPVARNVFAEPPASPPVPIRTDTQVDISASDHELRSRLPPPPPPPVLPGLTHIEDVDHRLLPSLPPTPVPPPIRHSHQDVDHRNLISLTQLPPRHVNVDQDYRLPPLSQPLGVPLLPPPPSDIVESVDMDLSEDEEQGMYQTQNQSDHRHNSFNNNKVLVGGEKKDNSNLIQINANIDIEAPHGTPVAPMTNPFDNMPPQLRYNFNTNYQKNPDNAEYNDDIRNRNLDRRNQSPEYEDYKSNDFQAPRPYMNRFPRNWGPRNNFRAPYNQFNQRNGGPRQRWGGPRQRFW-