Monarch geneset OGS2.0

DPOGS207138
TranscriptDPOGS207138-TA1218 bp
ProteinDPOGS207138-PA405 aa
Genomic positionDPSCF300001 + 3823760-3826878
RNAseq coverage827x (Rank: top 16%)
Annotation
HeliconiusHMEL0176071e-10368.54% 
BombyxBGIBMGA012757-TA2e-4265.85% 
Drosophilacrol-PE4e-1425.33% 
EBI UniRef50UniRef50_E9GB689e-1729.81%Putative uncharacterized protein n=2 Tax=Daphnia pulex RepID=E9GB68_DAPPU
NCBI RefSeqXP_974808.24e-1526.46%PREDICTED: similar to AGAP003111-PA [Tribolium castaneum]
NCBI nr blastpgi|2608246631e-1630.81%hypothetical protein BRAFLDRAFT_125176 [Branchiostoma floridae]
NCBI nr blastxgi|2608246634e-2030.84%hypothetical protein BRAFLDRAFT_125176 [Branchiostoma floridae]
Group
KEGG pathway 
Orthology groupMCL26007 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207138-TA
ATGAATACAGCGGTATTGAAAGTGGAAGCAGAAGCGGAAACATGTTCGGTGTGCGGTGGTGGCGGAGATCTATTCACGCCAGAAGAACACGATGCCGGCGCTCCGCCAATGCAAGTGTCTTTACGAACGATGCTATTGCAAATTAATAATTATAAGGTTGTTCCCGAAGGGAGGCTTTGTATTAGTTGCATACGTCGCGCCATTGAAGCGTATGAGTTTAGTTCTGCCTTAGGATCCAGAACAGCTCCTCCATTAAGTGAGAAAATCCGTACTTTGAGGAGAAGACTACATGATTTAACGCAGAAGGTAGATGTTTTCATCGTGGTCGGTGGACCGGGTGTGAACTCTGGAGGTGCATATAGTGAGGACGATATTATAATGGTAGAACGAGACGCCTTAGCTGCAGCTGCTGCTGCTGATGCTGATGATGAGGATCTTGAAAAAGCTAGAAATGCCTGTGGGGACTCGGTGTACCAATGCTCTATATGTCCCATGTCGTTCCAGCATGCAGCTGAGTATCGATCGCATGTAGCAAGTCACCCAGGCGGTGCTCGACACTCGTGCTGGACTTGTGGCGCACAGTTTGCACAAAAAGAGGCACTTCGAGACCATGCAGCTGAACATTCTACTCCTGGACTCATCTGCCAACTCTGTCGGAACAGATTCCAGAATGCCTCCGAGTTGCGTCGTCACGAGTCGGATGAGTCGTGCCCGTCTCGCTGTCCGATCCCAGGGTGTGGTGTGCGGTGTATGTCGCGCAGCTCGCTATCATCACACGCCTCGGCATCCCACTCGAGGGACCCGCCCTTGCTCTGCTCGCAATGTTTCGTGCAATGCTCCACTCGAGCTCAGATGGCGGCCCATGCTCTGTCTCACCGCTGTGCCGAGCGGTTCGTCTGCGGTTATGATGCTTGCATATTGCGCTTCGCAAATAGAGGTGACTTGTTGTCTCACATCCGCAAGCAGCATGCGGGTTCCGTGCCTGACCCAACCTCGGAACAGCCACCATCCACCACTTGTCACTGTGGACGCATTTTCGGGTCAGTGGCGGCTCTAAAGCGTCATGCTCGTGTTCATCGCCGCGAAACGCAGCAGGAGGAACCAGAATGGACCATCACAATGGAGACTGGGGAGGGGGCGGAGGAAGGGGAGGGAGCGCTCGACGGAGACGTGGAGTACCTCGAGCTGGAGGCGCTCGACGAATACGACGAAAATTAG

Protein sequence:

>DPOGS207138-PA
MNTAVLKVEAEAETCSVCGGGGDLFTPEEHDAGAPPMQVSLRTMLLQINNYKVVPEGRLCISCIRRAIEAYEFSSALGSRTAPPLSEKIRTLRRRLHDLTQKVDVFIVVGGPGVNSGGAYSEDDIIMVERDALAAAAAADADDEDLEKARNACGDSVYQCSICPMSFQHAAEYRSHVASHPGGARHSCWTCGAQFAQKEALRDHAAEHSTPGLICQLCRNRFQNASELRRHESDESCPSRCPIPGCGVRCMSRSSLSSHASASHSRDPPLLCSQCFVQCSTRAQMAAHALSHRCAERFVCGYDACILRFANRGDLLSHIRKQHAGSVPDPTSEQPPSTTCHCGRIFGSVAALKRHARVHRRETQQEEPEWTITMETGEGAEEGEGALDGDVEYLELEALDEYDEN-