Monarch geneset OGS2.0

DPOGS210057
TranscriptDPOGS210057-TA1797 bp
ProteinDPOGS210057-PA598 aa
Genomic positionDPSCF300017 - 926690-937060
RNAseq coverage33x (Rank: top 75%)
Annotation
HeliconiusHMEL0133601e-11276.81% 
BombyxBGIBMGA012697-TA6e-10161.69% 
Drosophilaspz5-PA4e-3238.02% 
EBI UniRef50UniRef50_D6WMB71e-3562.93%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WMB7_TRICA
NCBI RefSeqXP_970793.12e-3662.93%PREDICTED: similar to GA22158-PA [Tribolium castaneum]
NCBI nr blastpgi|2700081414e-3562.93%hypothetical protein TcasGA2_TC013304 [Tribolium castaneum]
NCBI nr blastxgi|2700081411e-3462.93%hypothetical protein TcasGA2_TC013304 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL25999 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210057-TA
ATGATTTTCAAACAAAGCGAGTGCTATATACTGTGGGCTGCCACGAAAGGCGCGGGTTACAGTTGTCCCTACGGCCAGAACTGTTTGTATGAGCCGGCGCCCCCTGGACGCGCACCCGCCTGCGCTCAACCCGGCCTCACCTACTGCCTGCATCCGGATCCATATCCAGAGAAAGTGATCAGGAAACTGGTCGAAGCCGGTCAGTACGACATCCGGACGTTGCTATCCGACGAGAGTCGCGATAACTTCCAAGACAATAAGAAGACCACTTACCCGTACGGATACGGCCCCAACTCCTTACCACACGTCGACCAAATATCACTCGTCGACGATGGACACAAAGATTATAACACTAAGTACCACAAGTACAACCAAGGTGAGAAAGTGATCCGGAAACTGGTCGAAGCCGGTCAGTACGACATCCGGACGTTGCTATCCGACGAGAGTCGCGATAACTTCCAAGACAATAAGAAGACTACTTACCCGTACGGATACGGCCCCAACTCCTTACCACACGTCGACCAAATATCACTCGTCGACGATGGACACAAAGATTATAACACTAAGTACCACAAGTACAACCAAGATATATTTTCTGAGAAGGCTCCTCTCCAGCCTCCCTCGGCCGCAGATCCAACGCCGTATGATATCAAGGCCTATCAGGCCGCCAATTATTCCAAGTTCGGCTTCCAAGGATACACATCACCAACTTTTTGGGATCCCTCCATTAACCAGTTCCAATATGAAAACAGACGACTGAGAGAGAACGAAAATGTGAACCTTTACAATCCGAATTATTTCAATGGCCCGCTCTATCAAAACTATGAGTCGAATTGGTTGAGAGCGTCTAGCGGGGTCACGCAGTACAACCCGAGCGAATGGTGGAAATATATATCACCGTCTCGGTCAAACGCCGAGGTGTCCATCCAGCGCTCGATCACCTTCCCGACACGAACCACCACGAGCAGACACGCGACACACGCGCGCCGGAAACGGAACACGGAACTTGTGGAGGCTGCGGCGAGACGCACGTTAACAGGGGCTGAGGCGCTCAGGATCGCGCTGGGATTGGCTAACGAGGACTCGTCAGCGGAGCGTCCTCGCCGCCAGGCGTCCACGGGTGAGGAGTTATGCCGAGTCCGCACTCAGTTTATAAATCCTCGAGCCGCTCTCAATAACAAAGGCAGCTGGCGCTACGTCGTCAACATGCCGGATAACATGACACAACTAGTGCGAGCTGAGATATGCGCGTCAACAGAATGCAGTGGGTTATGCACCATACCTCTTGGCTACACTTCTAGATGCGAGCAAAAGTATATACAAAAACGTTTGGTAGCGTTGGAGTCCAGTGGACAGAACTTGTACACCGACGTGTTCTGGATCCCGAGCTGCTGCCAATTAAAAGAGGAAGCGGGGAAAAAATCAATGGTTGCGTTTTATGTTCTGTGCGTGAGACAGATCGGTTGTCATGACGACTACGCGGGGGACACGAGTGGCGGGAAAAAAAGGACTTATGACGCGGCCAAGAAAAGAAAATGGCGTCTCCGCTGCCCGCGGAAAGTTGACATTGCCGCTGCGATAGATTTTAAGCTTTTATTGTTCCGAGGATGTTTTGTGCGTCGTGCGAGAGAGAGGCAACATGGGGTGATCGACGAGATAACGAGGAGGGCGATGGAACTAGGTGAAGGGGATCAGCCTTCAGGTACTGTGGACCACGGGACGCGGGAAGCGGGACTCAGAACATGGGACACGGGACATGGGACGTGGGGTGCCGGCGAGGGGACCAGCCGTATATAG

Protein sequence:

>DPOGS210057-PA
MIFKQSECYILWAATKGAGYSCPYGQNCLYEPAPPGRAPACAQPGLTYCLHPDPYPEKVIRKLVEAGQYDIRTLLSDESRDNFQDNKKTTYPYGYGPNSLPHVDQISLVDDGHKDYNTKYHKYNQGEKVIRKLVEAGQYDIRTLLSDESRDNFQDNKKTTYPYGYGPNSLPHVDQISLVDDGHKDYNTKYHKYNQDIFSEKAPLQPPSAADPTPYDIKAYQAANYSKFGFQGYTSPTFWDPSINQFQYENRRLRENENVNLYNPNYFNGPLYQNYESNWLRASSGVTQYNPSEWWKYISPSRSNAEVSIQRSITFPTRTTTSRHATHARRKRNTELVEAAARRTLTGAEALRIALGLANEDSSAERPRRQASTGEELCRVRTQFINPRAALNNKGSWRYVVNMPDNMTQLVRAEICASTECSGLCTIPLGYTSRCEQKYIQKRLVALESSGQNLYTDVFWIPSCCQLKEEAGKKSMVAFYVLCVRQIGCHDDYAGDTSGGKKRTYDAAKKRKWRLRCPRKVDIAAAIDFKLLLFRGCFVRRARERQHGVIDEITRRAMELGEGDQPSGTVDHGTREAGLRTWDTGHGTWGAGEGTSRI-