Monarch geneset OGS2.0

DPOGS207535
Transcript: DPOGS207535-TA, 3636 bp
Protein: DPOGS207535-PA, 1211 aa
Genomic position: DPSCF300177 + 622985-631693
RNAseq coverage: 60x (Rank: top 68%)
Annotation
Heliconius | HMEL005495 | 0.0 | 69.15% |
Bombyx | BGIBMGA001899-TA | 0.0 | 53.21% |
Drosophila | sha-PB | 6e-68 | 30.07% |
EBI UniRef50 | UniRef50_UPI0000D56B09 | 2e-88 | 36.79% | UPI0000D56B09 related cluster n=1 Tax=unknown RepID=UPI0000D56B09
NCBI RefSeq | XP_966739.1 | 3e-89 | 36.79% | PREDICTED: similar to shavenoid CG13209-PA [Tribolium castaneum]
NCBI nr blastp | gi|91087787 | 6e-88 | 36.79% | PREDICTED: similar to shavenoid CG13209-PA [Tribolium castaneum]
NCBI nr blastx | gi|91087787 | 2e-89 | 36.41% | PREDICTED: similar to shavenoid CG13209-PA [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL22641 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207535-TA
ATGGCGATATTACTTCTTCTTATGTTCACTATACCGAACGCTTTAACTTTTGAAGGAGAGATGGCGTTAGCCAACGTCACACGATTCGCGAATGGAGATATTTTTTCTTTGGCTGGTCCTCATTGTGGTTTAACTAGATGCATGGAGGTCTCTCGTGGTACCGCCTTAGCGTCTTTGGGAAGCGGCAATAGCTGTGTTTGTCAATGTCGCCGTGAAACGCCAGCCTTTAGGGAGGATCAGAGGACATGTGTCAACCATATTGATGAATGTCCAATGGCAAGTTTCGGACGTGGAACGAACAAACCCCAAATACCTTTTGTGTTTCTGCCTTTAAAAGGACAGATTATTTATCCTTCGAAAGAAATTATATTTACAGATGTCGAAGAAGCTATCTGTGCTGTAACAAGCGCTCAATACCTGTCTCCGACTGGTTGGGTGACGCTGCGGGATGTGATTGACAACGACGTACCATTCGGTCTTTACAGAGATGAGGGCAGCACTTACTTACAATGGCGTGGTTCTCCAGCACTCCATAGCCGTCTTGAAGGGAGGCTGGTTGCTGCTCACGTGCTTTGCTCTGCACCAAAACCGACTCACCTGGCTACATCCTGCGCAGCATTCAGGGTAGCTGGCGTGACTCATAACACTGTACTAGATGTTAGGTCAATACCATTCCACGCTGGTGAAACTGTGACCACGGAGCCGGCTGCACAAAGTCAAGGTCTAAGCGTTTTAGAATCACTCGCTATAGGAGTGTGCGTGCTGATGTTGGTTTTCGTGTATGCCACTGGGATCATATTCTACATTCATTACAAACAGAAACAAAGAAAAAAGACCAAAGATCCAGAATTAAATAACACACTTTCCACTGAGAACGCCTCCACCATAGATTCACGAATAAGCATTAATAATATGAGAATCAAAAATAATCCATTAATGAACTTGGACAGTGAGGCTGGTCTATCCGATGTCAGCGAACGAACCGAGGACACATTTGATTCTTCATCAATCAATACTCAGAAAATGAGTCAAAATTCAAACGTTGTGTCCGCGGTTGTCCATTCAAGAAGGAAAAAACCGTCGAGACCAACAATACGAGCGACTTCGACACCCGAACGCCTCAACGAAAGATTACAAAGACGTTCCGCCTCACCAGAGCTGGAAAAGGCACCGCATTCAGATCTATCGATACTTCATTGCAGTATGGATAATACCATGCCAGAGCTCCCACCACCACAGTCGACTGGAGAACCAATAATAAGGAAAAAGTTATACTTCAACCCAGTGTTCTTCGAAGTAGAACATCTGAAGAATCCACCGCCAGCTGCTATCGACTTTCTGTGCAAATTAAGGGAAGTCATGTCAGTTGCTAAGGAAAAGATGACTTCGAAGAGATTTATTCCGATATTATCCGACATACCTGAAGAAGAGCTATATCATACTATCGATCTAGGATGGGACATACATTGCGCTAGACGAGGACGAAGATTTAGCGCGATAAGTTTAAAGCGTGAAAACAGTCGGAGGTCCATACATTGCGGTGGCTGTCCCGGTTGTGACAATAATGGTAAAAACCATAAAGCTACAAACCTCATTAGGTCTAACTCCTGCAAAACCTGTGTCAGTGACGACTACAAACAGAGAATTGTTCAGAAATGGTTAGACGAAGCACCATCGCCGTCTCCGACACTTGTACGACCGGTGAAATCCGTCGCAAAAGTAAGTGGAACACCAAGAGTCGTAGATCCAAAAGGTAAAGAAGAATTAAGAGTCAGTCCAAACAGTCCAATAGAAGCACCTAGAAACGTTGATTCAGAAACTATTAAAATCACTGAAACTATCGTGAAAGAATATTTAAGGAATATAACTTCAGTGAACGAAAACGTTAATCTAGCGAAGAGGGAGGAAAATATTAACCTAACCGTCACAAACAAAGAGATAAATGACAAATCCACAGCTATACCAATAAGTGCAAATACTTCGGAAACGAAAAGTCAACC
CAATTATGGCACGCGGCGAGTAAGGAAGAAGCTTCCACCGCCGCCTCCGCCACCAATGAATGTGGTCGAAGTTAACGTTAGCGCGGACGTCGAACCCGTTTTGCCGGAAGTCAAAAAGAAAATGGAAGCTGTAATACGTGAACTCAATAAGTGTAAACGCATTGAACCCGTTTCACCCAACAAATCTCCAATGACGGGACCGAAATTAGTCATCCCTGTTACAGCTGAACCCCACTACTTTAGCGATGACAACCTACTTTCCGACAATCTCAAAAAGGATCTCGAAAATGACAAGTCAGTCGAGTGTGACAGTTTAGAGAGAAGATTATCTAAAAAGAGGCGATTCTCTTTGGATCATTCACCCGAATCTCAAAGGAATATGTCCATCATAAAAAATACTTCAATGAAACGCGACAGACTGACAAGAAGCTGGAGAGATTTTAAAGTCGCACCTAACAATAGTGATGTGTCTCCTCAAATAATTAGTTTTGAATCTAAAAGTAGTACAAATGACATTTTTATTAACGGTCCTACCGAGCCCATTTACAATAATGTCGAACAATTGGGACCCTTAACCATACAGGTGCGTGGCTCTCCGGTGGAAAATAGACAAAAAGATAACGAAGACTTCGATCCAGACACATTAGACAGAAAACCTAAACGAATAGTCGAAGAAACGAAAACCAAAAAATGTGTGGAGAAAATTCTTCTCAAATCCGGCGGTAGCTTTAAGCACAAACTTACCGCACCTGCCTGTCAAAAAGAAAAGAGCCCACCAGAAGCAGCCTTCACAAGGAAAATAGGAAGTCTGAGACAAATATATGAAGCGAAAAATAAGACGAATCAAAGTGACGTACGTCCGTTTTATGAACGGCGAGGAAGCGTTCCCTATGGAACTCAAGAATTAGCAGCGTATATGAAAAATTTAGAGCAACCAAGAAGTTTAGATAGCAAAGAGTCCAAACCTCCAGTACCGCCAAAGAACAGAGGATCAGAAACACATTCTAAAGGCAGTACACCCAGGAACAGTCCACCGAGTGACCGGTATCGATCATCGGAAGAAAGAGATAGGTTCCCGCCTTACTCGAGAAACGATAATCTAAACGCGAGGCGCTCTGGCAGAAGATCGTCAAGAACGAGACCGAGACGAGTAGATATAAAGAAATTCTGTCGCACCGAAGACTCGGGATATCTGAGTACCGATTCAAACGAATCAAAAAGAAGGGCGAGATACTTAATGCATCTCAAACCTAAGCTGATACCACCTGAAGTGATAGCAAAGAATAAAATCACCAATTTACACATAGATACGGACACAGACGAATTGGAGTCGTTGTGCGACGGTCACAGTGAATCGGGCGGGGAGAGTGTTGAGACTGATTCGGTGTTCTTCAGGAACTTTGACGATCCAAGCAAAACTTTCCAAGGGTTAGTTTTGAATGAATTCGAAACCCAAAATAGACAGGAGCAAATAGATTCTGGTTTCCTTGGTGAGACCAATATAATTTTAAGCGGCGACAGTGACTCGGAACACAGGAGTGTCATATCAATAGTCACTGGCCGGGACGGTCGAGCTTCAGCCTCCTCTATAACAAATCTAGAAGATTCCTACGTTCAGTCGGTTGAATGCTCTTAA

Protein sequence:

>DPOGS207535-PA
MAILLLLMFTIPNALTFEGEMALANVTRFANGDIFSLAGPHCGLTRCMEVSRGTALASLGSGNSCVCQCRRETPAFREDQRTCVNHIDECPMASFGRGTNKPQIPFVFLPLKGQIIYPSKEIIFTDVEEAICAVTSAQYLSPTGWVTLRDVIDNDVPFGLYRDEGSTYLQWRGSPALHSRLEGRLVAAHVLCSAPKPTHLATSCAAFRVAGVTHNTVLDVRSIPFHAGETVTTEPAAQSQGLSVLESLAIGVCVLMLVFVYATGIIFYIHYKQKQRKKTKDPELNNTLSTENASTIDSRISINNMRIKNNPLMNLDSEAGLSDVSERTEDTFDSSSINTQKMSQNSNVVSAVVHSRRKKPSRPTIRATSTPERLNERLQRRSASPELEKAPHSDLSILHCSMDNTMPELPPPQSTGEPIIRKKLYFNPVFFEVEHLKNPPPAAIDFLCKLREVMSVAKEKMTSKRFIPILSDIPEEELYHTIDLGWDIHCARRGRRFSAISLKRENSRRSIHCGGCPGCDNNGKNHKATNLIRSNSCKTCVSDDYKQRIVQKWLDEAPSPSPTLVRPVKSVAKVSGTPRVVDPKGKEELRVSPNSPIEAPRNVDSETIKITETIVKEYLRNITSVNENVNLAKREENINLTVTNKEINDKSTAIPISANTSETKSQPNYGTRRVRKKLPPPPPPPMNVVEVNVSADVEPVLPEVKKKMEAVIRELNKCKRIEPVSPNKSPMTGPKLVIPVTAEPHYFSDDNLLSDNLKKDLENDKSVECDSLERRLSKKRRFSLDHSPESQRNMSIIKNTSMKRDRLTRSWRDFKVAPNNSDVSPQIISFESKSSTNDIFINGPTEPIYNNVEQLGPLTIQVRGSPVENRQKDNEDFDPDTLDRKPKRIVEETKTKKCVEKILLKSGGSFKHKLTAPACQKEKSPPEAAFTRKIGSLRQIYEAKNKTNQSDVRPFYERRGSVPYGTQELAAYMKNLEQPRSLDSKESKPPVPPKNRGSETHSKGSTPRNSPPSDRYRSSEERDRFPPYSRNDNLNARRSGRRSSRTRPRRVDIKKFCRTEDSGYLSTDSNESKRRARYLMHLKPKLIPPEVIAKNKITNLHIDTDTDELESLCDGHSESGGESVETDSVFFRNFDDPSKTFQGLVLNEFETQNRQEQIDSGFLGETNIILSGDSDSEHRSVISIVTGRDGRASASSITNLEDSYVQSVECS-
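
The lengths listed in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that the 3636 bp transcript is entirely coding sequence ending in a stop codon (the nucleotide sequence above starts with ATG and ends with TAA) and that the genomic coordinates are 1-based and inclusive.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of cds_bp bases.

    Assumes the CDS is in frame; the terminal stop codon, if present,
    does not contribute an amino acid.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


def genomic_span(start: int, end: int) -> int:
    """Return the span in bp of a 1-based, inclusive coordinate range (assumed convention)."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values copied from the record above.
protein_aa = expected_protein_length(3636)   # transcript length in bp
span_bp = genomic_span(622985, 631693)       # position on DPSCF300177

print(protein_aa)  # 1211, matching the listed protein length
print(span_bp)     # 8709
```

Under these assumptions the genomic span (8709 bp) is longer than the transcript (3636 bp); the difference would correspond to intronic sequence if the gene model is spliced.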