Monarch geneset OGS2.0

DPOGS211159
TranscriptDPOGS211159-TA1173 bp
ProteinDPOGS211159-PA390 aa
Genomic positionDPSCF300007 + 83332-86385
RNAseq coverage65x (Rank: top 67%)
Annotation
HeliconiusHMEL0172018e-8574.88% 
BombyxBGIBMGA003143-TA5e-16080.30% 
DrosophilaCG12125-PB1e-11752.14% 
EBI UniRef50UniRef50_Q9W3F72e-11552.14%CG12125, isoform A n=17 Tax=Endopterygota RepID=Q9W3F7_DROME
NCBI RefSeqXP_970012.16e-12555.50%PREDICTED: similar to CG12125 CG12125-PA [Tribolium castaneum]
NCBI nr blastpgi|910925881e-12355.50%PREDICTED: similar to CG12125 CG12125-PA [Tribolium castaneum]
NCBI nr blastxgi|910925889e-12055.50%PREDICTED: similar to CG12125 CG12125-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[1-385] IPR0193925.9e-144Protein of unknown function DUF2217
Orthology groupMCL13313 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211159-TA
ATGGAGGCGTTAGAGACGAGTATAAATTATTGGGAGGACGCGTTGGCTGCGTTCTCGATGGGAGCCCGCGGTGGAGTGTCGGGGGCGCTGGCGCTCACCTCGCCCGAGGAGGCTGAGTTCTGTAGGGAGATACAGGAGTTACTCCAGAACGCGTATATGCTGCAAGAGCGCTGTGAGCTGCTGTTCCTCGATCAGCGGTCGGTGCTGTTTCGTTCGGAGAGCGGCGGCGGGGTGGAGGGTTCAGGGCGGCGCACCAGACTACTGTCCTCACATACATCAGAAACCCGCAAGGAACATCACTCCAGCGCCGAGTCATTCGCCTCCGCCGAAGACCAGGTTGCCGACTTGCGGGAGTTCGACGACTTGGCCGAGTCATTCCCAGAGATAGAGAAGCTGGAACTGTACCAAGCGGCCGTTAAACAGTTGGAACATGGCATTCCTTGCAGGACGGTTCGGTCGGAGGTGTTGCAGTGCGGGTCGGAGGCGGAGTTCCTCTGTAAGCTGCACTGCGTCCGGTCCGCGCTGGACGCCGCGCTGGGCGGAACGGGCGGCGGGGCGGGACGCGCCTGGCTGGTGGACGCCGGCCGACAGGTCCTCACAGACCTGCTTCTGTATGCTGATAAGGAGCCCAAGGAGTTCCTGGTGGCTTACGAGGAGATGGTGTCGTGGTCGAGCGAGCCGTCCAACTGGCCGGTGGTGGAGGCGGAGCTGAGCGCCAAGGGCGTGCGGGTCCTGTCGTTCTACGACGTGGTGCTGGACTACATCCTGTTAGACGCGTTCGAAGACCTCGCCGCGCCGCCCGCCTCCGTGCTGGCCGTCGTCAGGAACCGATGGCTCTCGGACGGGTTCAAGGAGAGCGCGTTGACAACAGCTGTGTGGTCCGTCATCAAAGCTAAGAGGCGTTACCTCCAGTACCCCGACGGGTTTATGGCGCACTTCTACTCCATATCGGAGCATCTCCTTCCTGTACTGGTCTGGGGTTTCCTGGGACCCCGGGAACGGTTGAAGGACGTCTGCGAAACCTTCCAGGCGGAGATAGTAGGCTTCATAACAGATATATTCAGTTTCGAAAAATCCAGATACACGACGGTGGAGGACCTCGCCGAGGACATCATGCAGCACGCGCGCACACGAGCCGCTAACCTTACCGACAAACTGGCCGAGAGAACATAA

Protein sequence:

>DPOGS211159-PA
MEALETSINYWEDALAAFSMGARGGVSGALALTSPEEAEFCREIQELLQNAYMLQERCELLFLDQRSVLFRSESGGGVEGSGRRTRLLSSHTSETRKEHHSSAESFASAEDQVADLREFDDLAESFPEIEKLELYQAAVKQLEHGIPCRTVRSEVLQCGSEAEFLCKLHCVRSALDAALGGTGGGAGRAWLVDAGRQVLTDLLLYADKEPKEFLVAYEEMVSWSSEPSNWPVVEAELSAKGVRVLSFYDVVLDYILLDAFEDLAAPPASVLAVVRNRWLSDGFKESALTTAVWSVIKAKRRYLQYPDGFMAHFYSISEHLLPVLVWGFLGPRERLKDVCETFQAEIVGFITDIFSFEKSRYTTVEDLAEDIMQHARTRAANLTDKLAERT-