Monarch geneset OGS2.0

DPOGS207398
TranscriptDPOGS207398-TA2631 bp
ProteinDPOGS207398-PA876 aa
Genomic positionDPSCF300087 - 567440-577363
RNAseq coverage676x (Rank: top 19%)
Annotation
HeliconiusHMEL0156222e-10351.23% 
BombyxBGIBMGA009367-TA1e-7869.48% 
Drosophilayemalpha-PA3e-2948.87% 
EBI UniRef50UniRef50_E9H1Q25e-3346.24%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9H1Q2_DAPPU
NCBI RefSeqXP_002070137.12e-3153.03%GK11190 [Drosophila willistoni]
NCBI nr blastpgi|3214631982e-3246.24%hypothetical protein DAPPUDRAFT_307367 [Daphnia pulex]
NCBI nr blastxgi|3071696981e-5126.09%Ubinuclein [Camponotus floridanus]
Group
KEGG pathway 
Orthology groupMCL25606 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207398-TA
ATGTCGGACCCTAAACGTGCTAGTCTTATAACAGCGGGTGCTCCAAAAAGTGCAAAGAATAATGTAAATAAAACATTGAGATTGTCTATAAATTTAGACGTAAGTGACGAAACAAAATATCCAGAGCTGAATTACAAGGAACTATACGCAGCAGCATTGAAAAAGAAGCGTGGTGAGAACTGTAAGACATCAGGTCTTGATCCTTTCTCCGATAATGATGATGATATTAAAAGGGCTACAAGGAAATTTGAACAGAAATATGGTGGAAAAAGTACATATGGAAAAAAGGGAAGGTCGAAATATGATGATTTTGCTGACATTGGAGCTGGCTATGATGAGAATGACTCCTTCATTGACAACACAGACGGTTATGATGAGATAATGCCACCAGAATGTGACACAGAGTATGGAGGTTTCTATATCAACAGCGGGGATCTTGAGTTCAAGACTGTACCTGAAAACAATAAGCGCGGACTATCATCAAGCAGTAGTGAGGAGGAGTCGTCTGCGAGTTCTTCGGATTCTGATAGTCAAGAGGCGAAAGACACAAAGGTCATCACCAATGGGAACGTCGACTCACATAAGACTGAGCATCGGAAAAAGAAGAACAAGACTATTGACAAGCAGAAAGCAAAGAAGATCAGAAGAACAGATTCCTCAGCAGCTACATCAGGGAAACAGACATCGGATGAGAACGCCCCCGCGGATGGGTCGACCGGTGATTCAAGATCGTCAGCTTCCGATTCCCTCACCTCCAAACCAGCTCCTAGCTCGGAGAACAACGCCACATCTGATAGTTCGAGAGATGCTGAAACAAAAGAAATCAAACTGCCGGCCTCCGTGAACCAAATACTGGAAGAGCTGGAGACGCTGAGCAGGTTCAAAGAATATCTCGGGAAGGATGATAGCAAGACTGAAGCCTGCATCGTTAGATTGGATAAAGCTCTCAAGGAGGTTGTGGATAATCGTCTGTGTGAACACGCCTGGTCCAGAGCTGCTAAGACGCTGGGAGTTGGACCGGATGTCATCACGCACAGGGCGAGAGAACGAGAGAAACTTGACAGTGTCCCGGTACAAAACTCACAACCTCAGCCCATAGCGGGTACTAAGAGGAAACTGGACGAGCTGTTAGATACAAGCACCATGACGCCCGAAGAGAAAGAAGCAAACATAGAGGATGTTTTACAAAGGTTAAAGACATTAATAAAAGAACGTGAACCAGGAATGATGGCTACGTACAAAGCTGAATGTGAACGAGTAGAGGAAGAGAGGAAGAAGCTGTCGATCGGTTCAGTTTCCGGTGGTAGTACAGGGTCTGAGCGCCGTCGTCCCAAACGTCGTTTCCCGTGGTGTGCTCGTGCCCGGGCCCTGCTGGCCCGTCTTGCAGCGCTAGGGGGAGCCCCGGAGCACCCGGCCGCAGCCGCAGCCCTTCTCACTCAGAGAGCTTTCCCCCTGTTCCCAGACGGCTTTGTACGTCTGCCCACACTACTCAAGCAGGCCGATCTCAACAAAGACATCAAGGTTTCAGACTTGAAGAAGCAAAGAGTCAGTTCAATATCATCTCAGGCGGCGCCACAATCTCAGCCTTCACACGCAACTCAACCAGTACAACCTATGATGACCTCTACACAGTTCACTGAACCCATACAGTTCCCGAGCTCGCTCACCGTCACAACCTCCGTCAAAAGTATAGACAGGAGCGAAGACGACAAATACAAAGTGAATCCAGCCATCGGCGCGCTGATAAACTCGTACACGCTCAACAAAGACCTAGTTATAGAGAAACCGGAAGACAGGAAGACCGACAGGAGCAAAGAGGAGTACATCCCCGCGAACAGTATAGGCAGCATCACGATAACACCCGTCGTCGCTAAGGACAAGAAGGCGGACAAGGCCAAGGAACCCCTGCTGAGAGTCAAGTCACCGGCCGCCTTGAACGAGATGATACACAAGAAAGATAAGCCCAAGAAACCAGAGAAGAACGTCACCCTGTATAGAGATAAGAGATTGGAGTCCCCTCTGCACGTGGATATCAGTGGGAACACCAAGAAAGATTCCCAGAGCCCCAAACTCAAGGAGGAGATAAGTCAGAAAGTCATCGAGAACATGCGCTCAGTAGAGAACAACATACCCAGGCCGGCCTTGATATCGGTGCACCACTCACCCACCTTCGTCAAACCTGACAAGCGGCCGCCGGAAGTCAAGAAGAAGAAGGAACCGTTAATAATATCGGACGAAGACCCGCTCAGTGACGTGCGCGAGACGAACGAGCTCGATGATAATAGTGATGTGGTGTTCATGGGGGAAGTGAGGGACAAGTGTGAGAATAGTGATGTGCCTTCGGACGTTAAGAGTGACAAATGTGAGGACCTGACTGACGAGACGGCCAATGAGGTTATGAAGAATTTAAGAGAAATGGCGCACTCACAGGAACCCGCGAGCCAAAACAGCAGTTCGTACGGCGGGGTCATAACAAGTGGCAAGATGTCTTCCAAGAGCACAGGGCGGGCGGAGTCGCGGAGTAATCTAGAAAGCTTCTTTGAAGATGGAGGATGGGCTGACTCGGACACGAAGGAATCATATGATGATTCACCAGTGGTCGTCCGCTGTCGGGGACGCTTAACACTGTAA

Protein sequence:

>DPOGS207398-PA
MSDPKRASLITAGAPKSAKNNVNKTLRLSINLDVSDETKYPELNYKELYAAALKKKRGENCKTSGLDPFSDNDDDIKRATRKFEQKYGGKSTYGKKGRSKYDDFADIGAGYDENDSFIDNTDGYDEIMPPECDTEYGGFYINSGDLEFKTVPENNKRGLSSSSSEEESSASSSDSDSQEAKDTKVITNGNVDSHKTEHRKKKNKTIDKQKAKKIRRTDSSAATSGKQTSDENAPADGSTGDSRSSASDSLTSKPAPSSENNATSDSSRDAETKEIKLPASVNQILEELETLSRFKEYLGKDDSKTEACIVRLDKALKEVVDNRLCEHAWSRAAKTLGVGPDVITHRAREREKLDSVPVQNSQPQPIAGTKRKLDELLDTSTMTPEEKEANIEDVLQRLKTLIKEREPGMMATYKAECERVEEERKKLSIGSVSGGSTGSERRRPKRRFPWCARARALLARLAALGGAPEHPAAAAALLTQRAFPLFPDGFVRLPTLLKQADLNKDIKVSDLKKQRVSSISSQAAPQSQPSHATQPVQPMMTSTQFTEPIQFPSSLTVTTSVKSIDRSEDDKYKVNPAIGALINSYTLNKDLVIEKPEDRKTDRSKEEYIPANSIGSITITPVVAKDKKADKAKEPLLRVKSPAALNEMIHKKDKPKKPEKNVTLYRDKRLESPLHVDISGNTKKDSQSPKLKEEISQKVIENMRSVENNIPRPALISVHHSPTFVKPDKRPPEVKKKKEPLIISDEDPLSDVRETNELDDNSDVVFMGEVRDKCENSDVPSDVKSDKCEDLTDETANEVMKNLREMAHSQEPASQNSSSYGGVITSGKMSSKSTGRAESRSNLESFFEDGGWADSDTKESYDDSPVVVRCRGRLTL-