Monarch geneset OGS2.0

DPOGS209382
Transcript: DPOGS209382-TA, 1164 bp
Protein: DPOGS209382-PA, 387 aa
Genomic position: DPSCF300118 + 205512-208581
RNAseq coverage: 22x (Rank: top 79%)
Annotation
Heliconius: HMEL013367, 2e-109, 71.16%
Bombyx: BGIBMGA013691-TA, 2e-63, 75.33%
Drosophila: CG17599-PA, 3e-35, 33.46%
EBI UniRef50: UniRef50_D2A571, 3e-60, 37.19%, Putative uncharacterized protein GLEAN_15235 n=1 Tax=Tribolium castaneum RepID=D2A571_TRICA
NCBI RefSeq: XP_971969.1, 5e-61, 37.19%, PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastp: gi|91084495, 1e-59, 37.19%, PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastx: gi|91084495, 3e-61, 37.76%, PREDICTED: similar to predicted protein [Tribolium castaneum]
Group
KEGG pathway:
InterPro domain: [2-266] IPR019366, 6.7e-77, Clusterin-associated protein-1
Orthology group: MCL11222, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209382-TA
ATGCGGTCGCTTGGCTTTCCTCAGCTGATTTCTCTGGAGAGTTTCCGCAGTCCAAACTGGCCGCTAGTGGAATCCTGCAGTCGCTGGCTGGCGGCGCGAGTGGAGCCCGACGCCTTACTGGCGGGCGGCGCCGACACTCTCGAACAGAGGGTGGCGCTCGTCACTCACGCCACCGAACTGTTTCACTCTCGAGCGAATTTAAAGTTGAATGGGAAAAGAATCTATGGCGCGGACGGTTGGGCGGTGCGGGAGTTGTTGAAGGTGGCCAACCTGCTGCGATCAGCTCTGGACACGCCGCGCCATGACGACCAGCTCGCTGACACCGACACGCTCGTGTATGACGTCAGCAATAGAATCGGGGAAATCAAACAGGCGCGAACTCTGGCAACGGAAATAACTGCGCAAGGAGCATTTTTATACGATCTGTTGGCAAAAGAACCCGAAAATAAGGATCACAGAAACAAAGCTCTGTCTCGTCAAATGGACCTGTCGTCTCTGGAGACGGCGTTGTCCCGCGCGGTGGAGGCGATGTCGTCTCGAGTGGACGCCGTGAGGGAACAGACGGCCAGCGTGGCGGCCAGCGAAGCCGCGCTCGACGCCAAGATAGAGAGGCGCAGGGCTGAGCTGCAGAGAGCCGAGAAGAGACTGCTCACTCTACAGAAGATCAAACCTGCCTACCAGAGTGAGTTGACATCTCTAGAGACGGAGATAGCGAGTCTGTGGGATCATTACGTGTTGAGGTATCGCTGTGTGGAGGCTCTGAAACACAGGCTCAGCGTGCTGGAGACTGCTCAGGCTGAGGCAGCCGAGGAACAACAAGCCGCCATAATGCAGCTCATTCACAAGTATGAAGCCGAAGATGTGCTCGGGAAGCTCAGCGACTCCGATGAGCTGGATTCCAGCGACGAAGCCAAAGAGAGCAAGCAGCCGCGCCCGCCCACCAGACCCAAGACAAGGCTTCGCATTAAAACCGCAGGTGGAGGCTGGCGCCCCCGACCTGCAACACGCACTCTCGGAGCCCCCGGCGACTACGAGGATATTGTGCAAGGTGTCGAGGAACCCGTCCAGGTGCTGTCGGAAGGATCGGAGGGTTCCTTGGAGAGTGAACTGAGACTCACGGAGCGAGTGGGCAGGAACTCCGCTCTCAGTGATAATGAATTCTAA

Protein sequence:

>DPOGS209382-PA
MRSLGFPQLISLESFRSPNWPLVESCSRWLAARVEPDALLAGGADTLEQRVALVTHATELFHSRANLKLNGKRIYGADGWAVRELLKVANLLRSALDTPRHDDQLADTDTLVYDVSNRIGEIKQARTLATEITAQGAFLYDLLAKEPENKDHRNKALSRQMDLSSLETALSRAVEAMSSRVDAVREQTASVAASEAALDAKIERRRAELQRAEKRLLTLQKIKPAYQSELTSLETEIASLWDHYVLRYRCVEALKHRLSVLETAQAEAAEEQQAAIMQLIHKYEAEDVLGKLSDSDELDSSDEAKESKQPRPPTRPKTRLRIKTAGGGWRPRPATRTLGAPGDYEDIVQGVEEPVQVLSEGSEGSLESELRLTERVGRNSALSDNEF-
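
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record. It assumes the standard nuclear genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The function name `translate` is hypothetical.

```python
# Build the standard genetic code table (assumed here; NCBI translation table 1).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` to match the convention
    the record above appears to use (an assumption).
    """
    seq = nucleotides.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First five and last five codons of DPOGS209382-TA, copied from the record.
print(translate("ATGCGGTCGCTTGGC"))
print(translate("GATAATGAATTCTAA"))

# Length check: 387 residues plus one stop codon, three bases each.
print(3 * (387 + 1))
```

Passing the full DPOGS209382-TA sequence to `translate` and comparing the result with DPOGS209382-PA would confirm the two records agree over their whole length.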