Monarch geneset OGS2.0

DPOGS213859
TranscriptDPOGS213859-TA1374 bp
ProteinDPOGS213859-PA457 aa
Genomic positionDPSCF300361 + 19810-25845
RNAseq coverage361x (Rank: top 33%)
Annotation
HeliconiusHMEL0071305e-15467.58% 
BombyxBGIBMGA009773-TA9e-11357.85% 
Drosophiladgt1-PA1e-5235.22% 
EBI UniRef50UniRef50_E2C3299e-6840.43%Uncharacterized protein C12orf41-like protein n=6 Tax=Formicidae RepID=E2C329_HARSA
NCBI RefSeqXP_001649909.11e-7338.93%hypothetical protein AaeL_AAEL004832 [Aedes aegypti]
NCBI nr blastpgi|1571077232e-7238.93%hypothetical protein AaeL_AAEL004832 [Aedes aegypti]
NCBI nr blastxgi|1571077232e-7339.72%hypothetical protein AaeL_AAEL004832 [Aedes aegypti]
Group
KEGG pathway 
Orthology groupMCL12813 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213859-TA
ATGTCCCAACAACATAAGATCATCCACCTGCCAAAGGTCAGGATGCTAAACCGAGGGCGTTCAGCAATAAGGATCACAAATGTTAAGTCTGTTAAACCGGATCCTGAATTCGTTAAACGACAGGAAGAGGAGAGATTGAGGGCTCAGTTACAACAAGAGATAGTATCTCGTTCCCGTACTTGTTCCTACCACGCGTATGAGTGCACCCTTCCTGTAGTGGCGGGTCGTATGTACTGCGCACGACATATACTCAGTGACCCCACCGCTCCTTACAAGCAATGTGCTCATGTCTCCGCATCTGGCAACAGATGTACACAGCCCGCACCCATCGACAGAGACCCCGGGGTATGCTTCGACCACGCCCGTTCGTCTCTCTGCCGCCGGATGCGTGCCGCAGCGCCGCCGCCCGCCGTCGACACCACGGAGACGCTGCTGCATCGTTTGCAGCACTACGTTAGACCTGAACGTACTCGGACGACCTCGTGCGCCTCCTCAGTGTCTGTGGTCAGCGAGCCTTCGGAACAGGAAGTCGCTACACATGCTGTGGATCCATTCAAGGAAATAGACGCGGTGTCAGTGAACGCGTCCGTATCAACGGCGCTGATGGAGTGCGCCAGCGCCAGTGACAGCGACTGTGACAGTGTAGTGATCACCACAGAGAAGGAACCCTCCGACACTGAGGACGCACCCTGCGAGGACGGACCGCTCTGGAAGGCTGGTGTATACACAGCAGAGGAGGCAGTCAGTGAAGCGAACAATGTTCTCAAATCATTGCAGTCCTTATATATTAAACAGATGGGTCGCCTGAGAACTCAGTTAGAAACGGCCAGGCTGAAATATGTCAAGGCTCTACGGACAGAAAAGGAACATTACTGTAGTATAAACAGCCAGTCCCGGTCCGGACCTCAGTCGGTGCGCGAGCGGCGACAGCTGAGGAAGTTGAAGGCGTACGCTGGATATCACCGGAAACACGGAATGGATGCCGTGCTCTCGAGGAAACTACATCATAAGAGAGCTATGGCCAAGGATCCACCGTCGAACCGCATACCTTCTCAGGGTCGGTGTGTGTTCACGGAGGGCGGCGTCCGGTGTTCCTCACACGTGCTGCCCGCGGCTAAGCATTGTCTCAAACACATACTCAATGATACACAACAGGTATTATTCGTCCCTTGCGGTGATGTCCGCGGGCCGGTGTCTTGCCTCGAGCCTGCCCCAAGGTCGCCGCTGCCGAGATCGTGTCGCTATCACACAGACGCGCCGCCGCACGCGTGCTTCACGCTTAAAAAGCAGGAATCTGAATGTGGTTCGGAGTGGACGTGTTCTGACCAGTCCCCTGCTGCCCCACAGCTGGTGGACACCCTACAGTACGACTGA

Protein sequence:

>DPOGS213859-PA
MSQQHKIIHLPKVRMLNRGRSAIRITNVKSVKPDPEFVKRQEEERLRAQLQQEIVSRSRTCSYHAYECTLPVVAGRMYCARHILSDPTAPYKQCAHVSASGNRCTQPAPIDRDPGVCFDHARSSLCRRMRAAAPPPAVDTTETLLHRLQHYVRPERTRTTSCASSVSVVSEPSEQEVATHAVDPFKEIDAVSVNASVSTALMECASASDSDCDSVVITTEKEPSDTEDAPCEDGPLWKAGVYTAEEAVSEANNVLKSLQSLYIKQMGRLRTQLETARLKYVKALRTEKEHYCSINSQSRSGPQSVRERRQLRKLKAYAGYHRKHGMDAVLSRKLHHKRAMAKDPPSNRIPSQGRCVFTEGGVRCSSHVLPAAKHCLKHILNDTQQVLFVPCGDVRGPVSCLEPAPRSPLPRSCRYHTDAPPHACFTLKKQESECGSEWTCSDQSPAAPQLVDTLQYD-