Monarch geneset OGS2.0

DPOGS200609
Transcript: DPOGS200609-TA, 1254 bp
Protein: DPOGS200609-PA, 417 aa
Genomic position: DPSCF300076 - 233582-235378
RNAseq coverage: 16x (Rank: top 81%)
Annotation
Heliconius      HMEL003141        2e-95  63.10%
Bombyx          BGIBMGA008968-TA  2e-64  38.72%
Drosophila      CG3227-PA         2e-22  47.32%
EBI UniRef50    UniRef50_D6WUW0   1e-25  53.28%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WUW0_TRICA
NCBI RefSeq     XP_001654203.1    2e-22  45.97%  hypothetical protein AaeL_AAEL001891 [Aedes aegypti]
NCBI nr blastp  gi|270011337      4e-25  53.28%  hypothetical protein TcasGA2_TC005343 [Tribolium castaneum]
NCBI nr blastx  gi|270011337      3e-25  53.28%  hypothetical protein TcasGA2_TC005343 [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain  [279-400]  IPR018380  4.2e-28  Uncharacterised protein family CpipJ
                 [314-387]  IPR018379  9.1e-12  BEN domain
Orthology group: MCL25557, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200609-TA
ATGAAAATTGAGAAATGGCTGTTAGCGGATTGGAGTACTAGTAGACGAATGCAAGAAGTAATCAAAGTGAGTGAAGATATGAAGGTAGTTGTGACACCGAACCAGGAAGACGACAAAGTCACTATAACGGAATTTCAGCAACATAATATTTCGGATCAGAATAGGCAGTCTAATAATGAGAGCGGTTCAGTGCCAGTGAAGAGTGTTCCCGAAGATGAGGTCGAGGAGGATTATTCGCCAAGTGGTTCGTCTTGGGAACCAACGGACAAAAGTTCTTCGTCATCGGAAGATTCTGATAAAGAGGAAATACAATCGAAGACTATGAAGATGGAACAGTATATCCGTCGTTATTCTAGGAAAGGAAATAGTAAAAAGGCTTTAGTACCTGTACATAGACCGAATGCGGTTGTAAATGTTTTGAAAGAACGTCAGATAAATGAAAACAATAGGAGCCATAACTCGAGTCGTAAGAGAAAGTACAAAGAGTTTGTACACAATAAATCCTTACCTGATTCCACAAGAAAAAACCCCTCCCCATCTAAAGAAGTTAAAACGGAAACCCTTAGTATAAATGTAAAAGATCTCTTAGATATTAAAGCAAGTTTTGAAGAACTATTTAGATTGTTGAATGATTTGAAACCCGAAACAGGAAATAATACAGAAATAAACAGTTTCAAAGATATCACAACGCACGAAAAATCCAATATTCCTTCATATGAAAGCGAACACAGCGAAGATAATCAAAGTGAGGATTTAAATCGTTCGGATGATAACGTCCTAATTACAAATAAATACAGCAAGGCTCGAGTCATATCAGACAAATCTCGAGAAAAACAGTCGCCGAATAATGAAAACGAATGGATCCCAATAGGCAGTGGTAAAACATTAATACATAAAGACAAATACAGAAAGGTGAATTGGAAATCTTACACCATAGCCACAAGAACCTTGTTGCTAGCGACATTTCCAAGAAGGATACTGGCAACACACTCATTGACGGGAAAACGGTCTCCGGCCTTTCAAAATAAACCCGCAAAAATGTGTCTGGACCCAAAAATTGTATCCGACATTATTCTTGAAATAACATCGAAGTTTAAAGTTAAAGAAAACTTGGTTAGGAGCATTATAACAACAAAATGCGCGGATGAATGTAAAATGTACAAATCAAGGACAAAGAACAAAAAAATTAAAGATCAAGAAAATCTACCACCAGCTATAAATGCAAGAGAAGAATCTCACAAGGAGGTTTCTTAG

Protein sequence:

>DPOGS200609-PA
MKIEKWLLADWSTSRRMQEVIKVSEDMKVVVTPNQEDDKVTITEFQQHNISDQNRQSNNESGSVPVKSVPEDEVEEDYSPSGSSWEPTDKSSSSSEDSDKEEIQSKTMKMEQYIRRYSRKGNSKKALVPVHRPNAVVNVLKERQINENNRSHNSSRKRKYKEFVHNKSLPDSTRKNPSPSKEVKTETLSINVKDLLDIKASFEELFRLLNDLKPETGNNTEINSFKDITTHEKSNIPSYESEHSEDNQSEDLNRSDDNVLITNKYSKARVISDKSREKQSPNNENEWIPIGSGKTLIHKDKYRKVNWKSYTIATRTLLLATFPRRILATHSLTGKRSPAFQNKPAKMCLDPKIVSDIILEITSKFKVKENLVRSIITTKCADECKMYKSRTKNKKIKDQENLPPAINAREESHKEVS-