Monarch geneset OGS2.0

DPOGS212481
TranscriptDPOGS212481-TA1464 bp
ProteinDPOGS212481-PA487 aa
Genomic positionDPSCF300222 - 309015-325849
RNAseq coverage895x (Rank: top 14%)
Annotation
HeliconiusHMEL0133261e-2330.70% 
BombyxBGIBMGA009653-TA3e-12974.01% 
Drosophilagem-PD2e-8656.25% 
EBI UniRef50UniRef50_D6WQH12e-9653.65%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6WQH1_TRICA
NCBI RefSeqXP_974530.11e-9654.21%PREDICTED: similar to transcription factor CP2B, putative [Tribolium castaneum]
NCBI nr blastpgi|2700105247e-9653.65%hypothetical protein TcasGA2_TC009932 [Tribolium castaneum]
NCBI nr blastxgi|2700105246e-9353.56%hypothetical protein TcasGA2_TC009932 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[259-446] IPR0076042e-62CP2 transcription factor
Orthology groupMCL10673 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212481-TA
ATGGCTAACCTCGATTTGGGGGGTAGCGGCACCAGCTCCTCCTCGTCACACCTTTCCCCGGGATGGCAGGTCAATGACCTGGATCTTGATTTACCCGGGGAACTTTCCATGAATGAGGCCCTACTGTCGTTGCCGTCACTGGCCGTGTTCAAGCAAGAGGCGCCTTCTCCAACTGGGAACGCACTGTCTCCGCCGCGCAGGACCTGGCCCGTGAGGCGTACTGATGACAGACAGATAACCAATATGGTTGTGGACAACCGGGATGCCATGGACGAGGGCTGCCAGCAACACGCTGGAGTCATGAACACGCAATCCAATAGCCCGGAAAGCATGCAGTGCCAAACCATGCCAGTCATAATGCCAATTAATGGCTACCATTCTCCTAGTGGACAAGAAAATAAAAGTAATGAGGCCCTACTGTCGTTGCCGTCACTGGCCGTGTTCAAGCAAGAGGCGCCTTCTCCAACTGGGAACGCACTGTCTCCGCCGCGCAGGACCTGGCCCGTGAGGCGTACTGATGACAGACAGATAACCAATATGGTTGTGGACAACCGGGATGCCATGGACGAGGGCTGCCAGCAACACGCTGGAGTCATGAACACGCAATCCAATAGCCCGGAAAGCATGCAGTGCCAAACCATGCCAGTCATAATGCCAATTAATGGCTACCATTCTCCTAGTGGACAAGAAAATAAAAACGCTGGTCTTCTGATGTGTTCTCCGGCCAGCTCTCTGGATGGCTTCCTGCACTCTCCGAGGCCCGACTCCGGCTTCAAAGACGACAACAGATTCCAATACGTCCTAGCGGCGGCTACGTCCATAGCGACCAAACAGAACGAAGAGACGTTGACTTACTTGAACCAGGGACAGCCTTACGAGGTCAAGCTGAAGAAGCTCGGAGACCTCGCGCACTACAAGGGGAAGATACTGAAGAGTATAATAAAGATCTGCTTCCACGAGCGCCGTCTGCAGTACATGGAGAGAGAACAAATAGCACAGTGGCACAACGACAGGCCGGGAGAGAGGATATTAGAGGTGGACGTACCGCTATCGTACGGTGTATCACGCGTGGAGCAACCAGCTGCCCTCAATGAACTACATGTACACTGGGACCCGACCAAAGATGTCGGGGTGTACGTTAAAGTGAACTGCATATCAACAGAATTCACAGCCAAGAAACATGGCGGTGAAAAGGGAGTGCCGTTCCGTATCCAGGTGGAGACGATGTATGAGGACAGGCGACTGCATACAGCTGCCTGCCAGATTAAGGTCTTCAAGTTGAAGGGCGCTGATCGTAAACACAAGCAGGACCGGGAACGAGTACTGAGAAGGCCCAGGTCCGAGGTGGAGAGGTATCAACCTGGGTGTGACGCAACTGTCCTGACGACGCTATCTAACGACGCTCTGATGCCACCGCCTTCCCTTGTGACAACATCACCCCCATACTCCCCAGAGATGTGGTAA

Protein sequence:

>DPOGS212481-PA
MANLDLGGSGTSSSSSHLSPGWQVNDLDLDLPGELSMNEALLSLPSLAVFKQEAPSPTGNALSPPRRTWPVRRTDDRQITNMVVDNRDAMDEGCQQHAGVMNTQSNSPESMQCQTMPVIMPINGYHSPSGQENKSNEALLSLPSLAVFKQEAPSPTGNALSPPRRTWPVRRTDDRQITNMVVDNRDAMDEGCQQHAGVMNTQSNSPESMQCQTMPVIMPINGYHSPSGQENKNAGLLMCSPASSLDGFLHSPRPDSGFKDDNRFQYVLAAATSIATKQNEETLTYLNQGQPYEVKLKKLGDLAHYKGKILKSIIKICFHERRLQYMEREQIAQWHNDRPGERILEVDVPLSYGVSRVEQPAALNELHVHWDPTKDVGVYVKVNCISTEFTAKKHGGEKGVPFRIQVETMYEDRRLHTAACQIKVFKLKGADRKHKQDRERVLRRPRSEVERYQPGCDATVLTTLSNDALMPPPSLVTTSPPYSPEMW-