Monarch geneset OGS2.0

DPOGS206053
Transcript        DPOGS206053-TA  1593 bp
Protein           DPOGS206053-PA  530 aa
Genomic position  DPSCF300028 - 949253-962042
RNAseq coverage   620x (Rank: top 21%)
Annotation
Heliconius      HMEL017948        2e-67  65.05%
Bombyx          BGIBMGA003900-TA  1e-69  57.42%
Drosophila      angel-PA          5e-40  37.31%
EBI UniRef50    UniRef50_D2A3E6   1e-81  43.51%  Putative uncharacterized protein GLEAN_07970 n=2 Tax=Tribolium castaneum RepID=D2A3E6_TRICA
NCBI RefSeq     XP_975263.1       1e-82  43.51%  PREDICTED: similar to carbon catabolite repressor protein [Tribolium castaneum]
NCBI nr blastp  gi|91080695       2e-81  43.51%  PREDICTED: similar to carbon catabolite repressor protein [Tribolium castaneum]
NCBI nr blastx  gi|270005856      2e-79  40.92%  hypothetical protein TcasGA2_TC007970 [Tribolium castaneum]
Group
KEGG pathway     tps:THAPS_10811  2e-28
                 K12603 (CNOT6, CCR4) maps-> RNA degradation
InterPro domain  [168-527] IPR005135  6.4e-41  Endonuclease/exonuclease/phosphatase
Orthology group  MCL12441  Single-copy universal gene

Nucleotide sequence:

>DPOGS206053-TA
ATGGTTTTTGCTCTTTGTGTAAAAAAATACGCTATGGCTTTATTTACATGCATTAATGCTGCTGCAGCTATGCTGAAATTGACTCACAGTTTTTGCAGATATACAAGACTCACCTGTGAGGACATAAATCATAGTAATTTTTTTGTGAAAAACTACTCACAGTTTAGTAAAGGAAAGATATGTGTTACATGGGACAAACAACAGACACAGCTAGCTTCGCAAGTTTATTTTGGTGTTTGGTCTCAACGTTTGGTGAACATAAAAAAGGGATTCAGTCTCAAAAAAACATTTATAAACTCAAATGTAACAAGGAGACGTAAACCAATGAGTGATTCATACGAACCCTATCAACATGCCAAATATCAAATTCCCGCAGATGAATGCACAAGTCAATCTAACCAATTTAGAAAACAACCAAGTGAAAAGACGAAGAGACCTCTTAAAATACCTAACGACTTTCGGCTTTGGGAGCCAGTGGGAAAGAAAAATTCTAACAATGGAGGTAATTTCAGGTTCCGTGTGGTCTCATACAATGTCCTAGCCCAGTATCTACTAGAATACCATCCATACTTATACACAGACTGCACTCCAGGAAATCTTAAATGGAAAGTACGAGCTGCAAAATTATATGACGAAATACTCAGTCTATCACCTGATATTATTTGCCTTCAAGAAGTGCAAGTGTCTCATTTAAAAAGTTTTTATTCAAAATTCGAGGACATGGGGTATTTTGGTATATTCAAACAGAAAACTGGTCATCGTCAAGACGGGTGTGCTATTTACTTCAAACATAGCCTATTTGATTTACAAGATCACAACAGTGTGGAGTACTATCAGCCTGAGATGCCAATATTGAATCGTGATAACATCGGCCTGATGGTTAAACTCGCTCCAAAATCCTCTTCAAATACTCCAATAGTAGTGGCCACGACACACCTCTTGTACAACCCGAAACGAACGGACGTTAGACTGGCACAGATGCAGGTCCTGCTGGCGGAGATAGACAGATTCGCATATACAAAGAATGGTTTAGGGGAGGGCTATTTACCTATAATAATTACAGGAGACTTTAACTCAACGCCAGATAGCGCTGTAGTGCAGTTACTGGACAGAGGACATGTTAGTGTATCATCGTTGAGAGACAATTCGGACTGGGAGAGAATCGGCGTCACTGATAACTGCCAGCATTTGGCGGTTTATTTGAACAGGCAGAAGGGAGTTAGCACAGATTTCAGTATGGTTAAGATACACAATTCGGACTACAAGAATAGTGCTCAAAACATACAGCACGAGTCCAAATACCGTGAGATGTTCAACAGTGACGACGTCTGCCATCCCCTCCGGCTGGCCTCCGTATACGACACCATGAAGAACGGTCTCAGCTACGAGGCCACCACTTACCAAGACCTGTGGATTACTGTTGATTACATTTACTTTAGTTACTGCAGTTCTCTCCGGCTAGTGGAACGTCTTCGTTTGCCGACTGAGGCTGAATGTGAGGTCCTCGGCCGTTTGCCAAACGATAAGTACGGCTCCGACCACCTCGTGTTGGCTGCGACCTTCGAATTGAAGACCTCCAAGTCCTCCCTATGA

Protein sequence:

>DPOGS206053-PA
MVFALCVKKYAMALFTCINAAAAMLKLTHSFCRYTRLTCEDINHSNFFVKNYSQFSKGKICVTWDKQQTQLASQVYFGVWSQRLVNIKKGFSLKKTFINSNVTRRRKPMSDSYEPYQHAKYQIPADECTSQSNQFRKQPSEKTKRPLKIPNDFRLWEPVGKKNSNNGGNFRFRVVSYNVLAQYLLEYHPYLYTDCTPGNLKWKVRAAKLYDEILSLSPDIICLQEVQVSHLKSFYSKFEDMGYFGIFKQKTGHRQDGCAIYFKHSLFDLQDHNSVEYYQPEMPILNRDNIGLMVKLAPKSSSNTPIVVATTHLLYNPKRTDVRLAQMQVLLAEIDRFAYTKNGLGEGYLPIIITGDFNSTPDSAVVQLLDRGHVSVSSLRDNSDWERIGVTDNCQHLAVYLNRQKGVSTDFSMVKIHNSDYKNSAQNIQHESKYREMFNSDDVCHPLRLASVYDTMKNGLSYEATTYQDLWITVDYIYFSYCSSLRLVERLRLPTEAECEVLGRLPNDKYGSDHLVLAATFELKTSKSSL-
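
The lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names `expected_protein_length` and `genomic_span` are hypothetical, and it assumes that the transcript is a complete coding sequence (it starts with ATG and ends with the stop codon TGA, with no UTRs) and that the genomic coordinates are 1-based and inclusive.

```python
def expected_protein_length(cds_length_bp: int) -> int:
    """Return the number of amino acids encoded by a complete CDS.

    Assumes the CDS length includes exactly one terminal stop codon,
    which does not contribute an amino acid.
    """
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Return the length of a locus, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values taken from the record: a 1593 bp transcript and the
# scaffold interval 949253-962042 on DPSCF300028.
print(expected_protein_length(1593))   # 530
print(genomic_span(949253, 962042))    # 12790
```

Under these assumptions, the 1593 bp transcript corresponds to 531 codons, that is 530 amino acids plus the stop codon, which matches the 530 aa listed for DPOGS206053-PA. The locus spans 12790 bp of the scaffold, considerably more than the transcript length.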