Monarch geneset OGS2.0

DPOGS211842
Transcript: DPOGS211842-TA (1497 bp)
Protein: DPOGS211842-PA (498 aa)
Genomic position: DPSCF300031 + 1087892-1091504
RNAseq coverage: 75x (Rank: top 65%)
Annotation
Heliconius: HMEL006322  6e-124  55.98%
Bombyx: BGIBMGA006026-TA  3e-106  54.69%
Drosophila: l(2)gd1-PB  8e-69  38.95%
EBI UniRef50: UniRef50_D6WKC8  4e-97  47.34%  Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WKC8_TRICA
NCBI RefSeq: XP_971461.2  3e-97  46.97%  PREDICTED: similar to CG4713 CG4713-PA [Tribolium castaneum]
NCBI nr blastp: gi|270007151  1e-96  47.34%  hypothetical protein TcasGA2_TC013686 [Tribolium castaneum]
NCBI nr blastx: gi|270007151  8e-109  47.53%  hypothetical protein TcasGA2_TC013686 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain: [129-187]  IPR006608  5.2e-18  Domain of unknown function DM14
Orthology group: MCL11838  Multiple-copy universal gene

Nucleotide sequence:

>DPOGS211842-TA
ATGTATAAAAAACCTACGTCCGGGAAAAAGACAAACTTGTCTCAGTTTGGTCTGATAGACATCCCAGATTTGGATGACGAAGAAGCAATGGATCTGTCTGATGATGATGTGGATTTAGAAGCAGAACTGGCAGCACTTAGCGGAGGAAAGAAGCAACAAAGACAACGCAAACCAGCTCCTGTACCAGCCAGTGACCTTGACGCTATGATCGCTGCGAGCTTGAAAGATATACCTTCAGATGAAGACGTGTCAGGTGATGAGGATGATCCAGATCTTCTAGACGAGCTTCAAGCCCTAGCGATAGATGACCAGCCTCAATTAGAACCTCCACGACCGAGGACAAGTCGACCGGCACCACCTCCACCGTCCGCTGAGAATAGTATTGTCAGCCTTCTACAAGAAAGAATCTCCAATTACGCTATCGCTGAGAAGAATGCTAAGGAAAGCGGGGAAAGTGGAAGAGCTAGACGGTTTGGTAGAGCCCTTAAGACACTCAACGACCTACTGAAGCAAGCTAAATCTGGAAAATCAATAAACAATGAGGACATTCCTCCACCAGTTAGTGTGGGGAAACCTAAATCGGATGTACCAAGAGAGATGCCATCGAATGACCCACAGCCAGAACCCACTAACACACTTCCTGAACCGCTGACACCAACACAAGTCCCCTTACCCCCGCCAAGAACCACTTCCATACCACCAAACCCTGAAGAACCTAGATCTCCTACTCCACCAGAACCCAAAGAACCTCCACCACTTCTATCAGATGTGGATCCTGCCAGGGCTCAAGGACTGCAATTGATACTGAATAGGAAGGCGGAATTCAAAGCAGCCGCCCTATCCAGCAAACACGCGGGAGACAAAACCTTAGCGCTGGAGTACCTCAAGGTTGTCAAACAGTTTGATATAGTGGTGGAGGCGTACAAATCTGGTCAAGAGATGGATCTCAGCGAACTGCCAACTCCTGAGGGCATCGCGGCCGCCGTCAAGGGACAGAAGGAAGAAGAACAAGTCCAGAACGCCGCAGAACCAGACCCTGAAGTCCCTCCGGAGCCAGTAGGTTTAATCACTGCCTCCTCTGTAGACGAGGCTCTGAGACAACGACTCGCGCATTTTCAGGAACAAGAAAGTAAGGCGAAGGACGAGGGGAACACGTCGAAGGCCCGTCGTATGGGGCGGATAGTGAAACAGTACCAGGACGCCATCAAGATGCATAAGGCCGGGCGTCCCATACCCACCGACGAGCTTCCCACGCCCAATGGATACGCACCCATACCCACTGGCGAGTCGCCCTCCCCGCGGCCCGCTCCGTCTGCTTCCCCCCGCCGTCCCGCCCCTACGTCTGCCCCGTCTGTCCCGTCCCCGTCCCCGTCTCCGTCCCCGTCCCGTGCCCCGTCCCGTTACGACAAACAGATTGCTCTATTGCTACACAAACAGAAGCAATTTAAAGAGGCAGCGCTGCAAGCTAAGAAGGACGGTCAGTTGAACTTATATTAA

Protein sequence:

>DPOGS211842-PA
MYKKPTSGKKTNLSQFGLIDIPDLDDEEAMDLSDDDVDLEAELAALSGGKKQQRQRKPAPVPASDLDAMIAASLKDIPSDEDVSGDEDDPDLLDELQALAIDDQPQLEPPRPRTSRPAPPPPSAENSIVSLLQERISNYAIAEKNAKESGESGRARRFGRALKTLNDLLKQAKSGKSINNEDIPPPVSVGKPKSDVPREMPSNDPQPEPTNTLPEPLTPTQVPLPPPRTTSIPPNPEEPRSPTPPEPKEPPPLLSDVDPARAQGLQLILNRKAEFKAAALSSKHAGDKTLALEYLKVVKQFDIVVEAYKSGQEMDLSELPTPEGIAAAVKGQKEEEQVQNAAEPDPEVPPEPVGLITASSVDEALRQRLAHFQEQESKAKDEGNTSKARRMGRIVKQYQDAIKMHKAGRPIPTDELPTPNGYAPIPTGESPSPRPAPSASPRRPAPTSAPSVPSPSPSPSPSRAPSRYDKQIALLLHKQKQFKEAALQAKKDGQLNLY-
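
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code and that the trailing "-" in the protein sequence marks the stop codon. The names `translate`, `expected_cds_length`, and `CODON_TABLE` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` (assumed here to be "-",
    matching the last character of the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = (cds[i:i + 3] for i in range(0, len(cds), 3))
    return "".join(CODON_TABLE[c].replace("*", stop_symbol) for c in codons)


def expected_cds_length(protein_length):
    """Nucleotides needed for `protein_length` residues plus one stop codon."""
    return 3 * (protein_length + 1)


# First seven and last six codons copied from the transcript above.
print(translate("ATGTATAAAAAACCTACGTCC"))
print(translate("CAGTTGAACTTATATTAA"))
print(expected_cds_length(498))
```

Passing the full DPOGS211842-TA sequence to `translate` and comparing the result with the DPOGS211842-PA record would extend the same check to the whole transcript.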