Monarch geneset OGS2.0

DPOGS208402
Transcript: DPOGS208402-TA (1905 bp)
Protein: DPOGS208402-PA (634 aa)
Genomic position: DPSCF300241 - 56630-72708
RNAseq coverage: 208x (Rank: top 46%)
Annotation
Heliconius: HMEL003389, E-value 5e-99, 81.93%
Bombyx: BGIBMGA004050-TA, E-value 3e-51, 74.51%
Drosophila: TORC-PA, E-value 4e-18, 41.21%
EBI UniRef50: UniRef50_E0VAI9, E-value 5e-37, 42.49%, Putative uncharacterized protein n=2 Tax=Neoptera RepID=E0VAI9_PEDHC
NCBI RefSeq: XP_002423133.1, E-value 1e-37, 42.49%, conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|242004524, E-value 2e-36, 42.49%, conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|383851866, E-value 6e-48, 30.60%, PREDICTED: CREB-regulated transcription coactivator 1-like [Megachile rotundata]
Group
KEGG pathway 
Orthology group: MCL24973 Lepidoptera specific

Nucleotide sequence:

>DPOGS208402-TA
ATGGCGAATCCTAGAAAATTTAGTGAAAAGATTGCCCTTCACAATCAAAAGCAGGCTGAGGAAACGGCCGCTTTTGAGAAAATTATGCGTGAAGTTTCCGACGCAACCAATAAGGTGAACTCTAGCAATCGCCGTTCGAAAATGAAAGCACCGCCAGAACCGGAGATTGTTGAACAAATAGTCACAACACAAAGCTTGGGCACATACAGGAGCGGTTCATTGCCAAATGTTGCTGCTCCTGAACCCCCACCAAGTCTGGAGACTACTAAGCCCGAAGAAACGACTCTAGCCACTCAGTACGTGCTTGGTAGGAGCGGGGGAGCCTCGCCAACCCCTCGATCGCCCGGTCGGGGTCGGGCATCCTCGTCTGTGGGGCCAATGAGACGGCCGGCAGACCGGAAACACGACACCAGCCCTTACGGAAGCACTGTATATTTAAGCCCGCCGCCCGACAGCAACTGGCGGCGGACTAACTCTGACTCCGCCCTCCACCAGTCGTCGCCGCTCGTCGTCTGCCGGCGGCCGCCGCTGTCACCGCACCACCCTCTACACTCACACGCACACCCTCACCTGCACCCACACCAGAACCATCGGAGAGCTGGCAACATTAATCTAGATGTATTAGCGACGTTAGGCATGAGCCCCAATAATCGTCCGCGGTCGTCGTGTGAAATACCTAGGATACCTAACAATAACAATGTGTACGAGAGCGGTGTCGATGCTAGCGGTGCGTTGTCCTGCGGAGAGCTGTCCGTGCCTGGCGGGTCACTACCGGACCTCACCTCAGTGCATTACCCGCCACCCCACTACCTCACGAGAGCTTCACCCGACTACCAACCGAGATATAGCCCTACGTCACCGGGCGCGGTGTCCCCCGGAGGTGGCAGCGTGTCCCCCGCCACAGGGGGCCTGTCCCCCCAGTCCGGCTCCCCGGTAGGAGCCACTGTGCCGCTAGCGACGATGCACGAACACCCTATATATGAAACACAAAGCGGTGTCGATGCTAGCGGTGCGTTGTCCTGCGGAGAGCTGTCCGTGCCTGGCGGGTCACTACCGGACCTCACCTCAGTGCATTACCCGCCACCCCACTACCTCACGAGAGCTTCACCCGACTACCAACCGAGATATAGCCCTACGTCACCGGGCGCGGTGTCCCCCGGAGGTGGCAGCGTGTCCCCCGCCACAGGGGGCCTGTCCCCCCAGTCCGGCTCCCCGGTAGGAGCCACTGTGCCGCTAGCGACGATGCACGAACACCCTATATATGAAACACAAAACATGTCTCCGTTGAACCACAGCTGGATGACGTCAAACTATCAGTCGCACTGTAGTCCGCCGTCACAATATTCCTCGACCTCAAATATAAACATACCCTCACCTACAATGGTACACAGTCCTGGCAGTCCTGGCGAGTCACCTCAGACGGACTACACCAATCTCCACCAGGCGCTGTTGCAGCCCTTTGAACAGATCACCATGCTCGACGCGCCGACGTCGAACTACAACACGACGTACATCAACCACAGCTCACACTCGCAGCAGACGACGAATTCAACGCACACATATACACAGTCGTCTCAGCCGTGTCACAGCGGTCACTCGTCGCCGTCCGTGGAGCCGAGAGCACGCGCCCTGGGACACCGCCCCCCCGCGTACCCTCTGCAGCCGCTGCAGCCCCTCGCCTCCCCGCAACAACCACCCACACCCGCCACCCCCGCCTCCATACCGGACATCATACTCACAGATTACTCGGGCGAGTTAGACCCGGGTATATTCGGCGGCGAGGAGGCTCAGCTGAGGGCTGGGTTGGATCTGGACGACTTGACGTTGCTGGAGGAACCGAGCTCGTTACTCCCGGACTCGTCCGTGGAACACGAGTTCCGCCTCGACCGCCTCGACCGTTTGTAG

Protein sequence:

>DPOGS208402-PA
MANPRKFSEKIALHNQKQAEETAAFEKIMREVSDATNKVNSSNRRSKMKAPPEPEIVEQIVTTQSLGTYRSGSLPNVAAPEPPPSLETTKPEETTLATQYVLGRSGGASPTPRSPGRGRASSSVGPMRRPADRKHDTSPYGSTVYLSPPPDSNWRRTNSDSALHQSSPLVVCRRPPLSPHHPLHSHAHPHLHPHQNHRRAGNINLDVLATLGMSPNNRPRSSCEIPRIPNNNNVYESGVDASGALSCGELSVPGGSLPDLTSVHYPPPHYLTRASPDYQPRYSPTSPGAVSPGGGSVSPATGGLSPQSGSPVGATVPLATMHEHPIYETQSGVDASGALSCGELSVPGGSLPDLTSVHYPPPHYLTRASPDYQPRYSPTSPGAVSPGGGSVSPATGGLSPQSGSPVGATVPLATMHEHPIYETQNMSPLNHSWMTSNYQSHCSPPSQYSSTSNINIPSPTMVHSPGSPGESPQTDYTNLHQALLQPFEQITMLDAPTSNYNTTYINHSSHSQQTTNSTHTYTQSSQPCHSGHSSPSVEPRARALGHRPPAYPLQPLQPLASPQQPPTPATPASIPDIILTDYSGELDPGIFGGEEAQLRAGLDLDDLTLLEEPSSLLPDSSVEHEFRLDRLDRL-
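
The lengths listed for this record can be cross-checked with simple arithmetic. The sketch below assumes the transcript is coding sequence only (no UTRs), that the trailing "-" in the protein sequence stands for the stop codon, and that the genomic coordinates are 1-based and inclusive. None of these assumptions is stated in the record itself, and the function names are hypothetical helpers introduced only for illustration.

```python
def expected_cds_length(protein_aa: int, has_stop_codon: bool = True) -> int:
    """Length in bp of a coding sequence for a protein of `protein_aa` residues.

    Each residue takes one codon (3 bp); the stop codon adds one more codon.
    """
    codons = protein_aa + (1 if has_stop_codon else 0)
    return 3 * codons


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1905
protein_aa = 634
genomic_start, genomic_end = 56630, 72708

# True when the transcript length matches 634 codons plus one stop codon.
lengths_agree = expected_cds_length(protein_aa) == transcript_bp
print("transcript length consistent with protein length:", lengths_agree)

# Genomic span covered by the gene model under the coordinate assumption.
print("genomic span (bp):", span_length(genomic_start, genomic_end))
```

Under these assumptions the genomic span is much longer than the 1905 bp transcript, which would be expected for a gene model containing introns.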