Monarch geneset OGS2.0

DPOGS213498
TranscriptDPOGS213498-TA5226 bp
ProteinDPOGS213498-PA1741 aa
Genomic positionDPSCF300100 + 434310-451600
RNAseq coverage119x (Rank: top 58%)
Annotation
HeliconiusHMEL0168280.087.15% 
BombyxBGIBMGA004379-TA0.082.65% 
Drosophilapds5-PB0.049.74% 
EBI UniRef50UniRef50_A1Z8S60.049.74%Pds5 n=36 Tax=Pancrustacea RepID=A1Z8S6_DROME
NCBI RefSeqXP_623860.10.064.30%PREDICTED: similar to CG17509-PA [Apis mellifera]
NCBI nr blastpgi|3071734840.064.67%Androgen-induced proliferation inhibitor [Camponotus floridanus]
NCBI nr blastxgi|3071734840.064.67%Androgen-induced proliferation inhibitor [Camponotus floridanus]
Group
Gene OntologyGO:00054888e-33binding
KEGG pathway 
InterPro domain[641-1544] IPR0160248e-33Armadillo-type fold
[1478-1552] IPR0119891.3e-12Armadillo-like helical
Orthology groupMCL11375 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213498-TA
ATGGCGGAAATAGTGTACCCGCAAGGCTGCCGTCCTATTACCGATGACCTAGGACCAGACGAGCTTGTGAGAAGGCTTAAGGCTCTAGCGCATACCCTTCAAGGTCTCGGCCAAGATGAGGGCATGTACCAGCAGTACATACCTCTGGCCCTGCACCTCGCGGACGAGTTCTTCCTCACTCATCCGTCCCGTGACGTCCAACTCCTTATAGCCTGCTGTATAGCTGATGTTCTCAGGGTATATGCGCCGGAGGCTCCATACAAAGACCAGGAACAGGTGAAAACCATATTCCTGTTCCTAATAAACCAGCTGCAGGGTCTCCGTGACCCCAAGGACCCGGCTTTCAAACGTTACTTCTACCTGCTGGAGAATCTGGCTTACGTGAAATCATTCAACATGTGCTTTGAACTTGAAGACTGCCAGGAAATATTCTGCGCTTTGTTCTCACTTATGTTTAAAATTGTCAATACGGAACACTCTTCGAAAGTGAAATCCTTCATGCTTGACGTCCTCTGCCCTCTAATCACCGAGTCCGACGTGGTCTCCAACGAACTGCTCAACGTGATACTATTGAATCTGGTGGAGCCAAACAAAAGGGAACACAAGCACGCTTACACGCTAGCCAAGGAACTCATTATTAAGACAAGCGAGACCTTGGAGCCGTATATACAAGCATTCTTTAACCACGTCCTGATTTTGGGCAAAGAAGAGAAAAATCTTCTCATATTTTCCAAAGTGTACGAATTAATATACGAGTTAAATCAGTGTTGCCCGTCTGTTTTGCTCTCAGTGTTGCCACAGTTGGAATGTAAATTAAAATCAGCACAATTTCACGAGCGTTTATCGGCCGTGGCTCTGCTCGCTAGAATGTTTTCCGAGCCGGGTTCGGAGCTCGCCAAGCAATATCCGGCTTTATGGCGAGCATTTTTAGGTAGATTCAACGATATATCAGATCAGATAAGGATAAAATGCGTCCAGTACTGCATGCATTTCCTGGTGCACCATCCGGACCTGAGGAAAGACATAACGGACACATTGAAGATGAGACAGCACGATGCACAGGAGCAGGTCCGCTATGAGGTCGTCATGGCTATCATAGCGACAGCTCAGAGAGATTTCAAAGCGGTCGCAGCATCCGAAGATCTGCTGCATTTCGTCCGCGAGAGGACCTTAGATAAGAAGTTCAAGATCCGCAAAGAAGCTATGTCCGGCCTGGCCATGATATACAAGAAGTTTTTAACAGAGGAATCTGTGCCGCCCGCCACCGAGAAAGCTGTGCAGTGGATTAAGGATAAAATATTACACGGATACTACATGACAGCTCTAGAAGACAGGTTGCTAGTTGAGAGATTACTGAACACCTCACTCGTCCCATACACCTTGCCGCCGACGGTCAGAATGAAGAAATTATACTATCTGATGTCGAACGTGGACGACAACGCCACCAAGGCGTTCATAGAGCTACAGAAACATCAGCTCGCTGTGAGGCGCACGGTGGCCGAGTGGGTGGACTTACACAGGAAGCCGCCCACACCGGCGGTACAGAAGGAAATGATCTCTAAAGTGTTACACATAAGCTCCAAATTCCTGCCAGAGTCCGTCAAGGCTCAGGAGTTCTTGAATAAATTCTCGAATCATATGAAAAAAGCGCCGGAGTTACTGCAAGGGATGGAGACGATATTAAATCCTAATGTCAGTTGCGAAGTTTGTGTTCGCACCACTTCGAGTGTCCTAAAGAAGTTGGGTCAGCCGGTGATGACCAACCTTTACTATAACACGGTCAAAATGTTGCTGGAGCGAGTTAGCTCAGTGATGGTGGACCACGAGTCCCTGCTCATACTGGTTGGCTACGTAGAGGGCGCCGTGAGGGGCAACGACCCCTCCATAGCGGAGGAGTGCGGTCTCGGCCAAGATGAGGGCATGTACCAGCAGTACATACCTCTGGCCCTGCACCTCGCGGACGAGTTCTTCCTCACTCATCCGTCCCGTGACGTCCAACTCCTTATAGCCTGCTGTATAGCTGATGTTCTCAGGGTATATGCGCCGGAGGCTCCATACAAAGACCAGGAACAGGTGAAAACCATATTCCTGTTCCTAATAAACCAGCTGCAGGGTCTCCGTGACCCCAAGGACCCGGCTTTCAAACGTTACTTCTACCTGCTGGAGAATCTGGCTTACGTGAAATCATTCAACATGTGCTTTGAACTTGAAGACTGCCAGGAAATATTCTGCGCTTTGTTCTCACTTATGTTTAAAATTGTCAATACGGAACATTCTTCGAAAGTGAAATCCTTCATGCTTGACGTCCTCTGCCCTCTAATCACCGAGTCCGACGTGGTCTCCAACGAACTGCTCAACGTGATACTATTGAATCTGGTGGAGCCAAACAAAAGGGAACACAAGCACGCTTACACGCTAGCTAAGGAACTCATTATTAAGACAAGCGAGACCCTGGAGCCGTATATACAAGCATTCTTTAACCACGTCCTGATTTTGGGCAAAGAAGAGAAAAATCTTCTCATATTTTCCAAAGTGTACGAATTAATATACGAGTTAAATCAGTGTTGCCCGTCTGTTTTGCTCTCAGTGTTGCCACAGTTGGAATGTAAATTAAAATCAGCACAATTTCACGAGCGTTTATCGGCCGTGGCTCTGCTCGCTAGAATGTTTTCCGAGCCGGGTTCGGAGCTCGCCAAGCAATATCCGGCTTTATGGCGAGCATTTTTAGGTAGATTCAACGATATATCAGATCAGATAAGGATAAAATGTGTCCAGTACTGCATGCATTTCCTGGTGCACCATCCGGACCTGAGGAAAGACATAACGGACACATTGAAGATGAGACAGCACGATGCACAGGAGCAGGTCCGCTATGAGGTCGTCATGGCTATCATAGCGACAGCTCAGAGAGATTTCAAAGCGGTCGCAGCATCCGAAGATCTGCTGCATTTCGTCCGCGAGAGGACCTTAGATAAGAAGTTCAAGATCCGCAAAGAAGCTATGTCCGGCCTGGCCATGATATACAAGAAGTTTTTAACAGAGGAATCTGTGCCGCCCGCCACCGAGAAGGCTGTGCAGTGGATTAAGGATAAAATATTACACGGTTACTACATGACAGCTCTAGAAGACAGGTTGCTAGTTGAGAGATTACTGAACACCTCACTCGTCCCATACACCTTGCCGCCGACGGTCAGAATGAAGAAATTATACTATCTGATGTCGAACGTGGACGACAACGCCACCAAGGCGTTCATAGAGCTACAGAAACATCAGCTCGCTGTGAGGCGCACGGTGGCCGAGTGGGTGGACTTACACAGGAAGCCGCCCACACCGGCGGTACAGAAGGAAATGATCTCTAAAGTGTTACACATAAGCTCCAAATTCCTGCCAGAGTCCGTCAAGGCTCAGGAGTTCTTGAATAAATTCTCGAATCATATGAAAAAAGCGCCGGAGTTACTGCAAGGGATGGAGACGATATTAAATCCTAATGTCAGTTGCGAAGTTTGTGCACCACTAATTATACCCAAAAAATACTACAAAATACAATATTCATCGAGTGTCCTAAAGAAGTTGGGTCAGCCGGTGATGACCAACCTTTACTATAACACGGTCAAAATGTTGCTGGAACGAGTTAGCTCAGTGATGGTGGACCACGAGTCCCTGCTCATACTAGTTGGCTACGTAGAGGGCGCCGTGAGGGGCAACGACCCCTCTATAGCGGAGGAGTGCGGTATCGATTTAAAGAAAGCGGCCGAGCGCGGTCTGAAGCTGCTGGTGATGTTGTCGTTCATGTTCCCCGCTCACTTCCTGCACGAGGACGTGCTGCATCGGCTGACGGGGCTGCTCGAGCTGGACGAGGAGAACGTGGCGCCGCATGTGCTCGCCGCGCTCACCTTCCTCGGCAAATATAGACCCTTGAGTGAGGCGTGTCCAGCGTTGTTCCCGAAACTTATAACACTATGCAAAGCCTATGCGGAGGTCGGTACGCCGAAACAAGCCAAAAATGCAGTGAGATGCCTTTTCGTCAACGTCCCCGATCAGAGATCCCAAATTTTCACGGATATACTGGAAACATTGAAAACTACTTTAAGTCCCCATTCGGAACATTACCGCACGGCCATCGTTACACTCGGACACATAGCGCACAACTTACCTGATAACTTCCCTGTGCTTATTAAAAATATTGTATCTAGGAAGATAGTAAAAGAGCTGTTAGTGCGGGAAGGTGGCGGTGGACCCAACGCTCCTGAAGGGGACTGGTGTCCCGAAGAAGATCTGCCAGAGGAAACTCGCTGCAAGCTGGAGGGTCTGAAGTGTATGGCGCGCTGGCTGCTGGGTCTGAAGAGGGACGAGCTGTCGGCGCAGAAGACGTTCAGGATGCTGAACGCCTTCATAGTACATAAGGGAGATTTGTTACAGCAGAAGCAGTTGTCCGGAGCTGAGATGGCTCACCTAAGGCTGGCGGCCGGTGCTGCCATGTTAAAGATATGCGAACAGAAGGGGGTCGGAGACCAGTTTACTGCGGACCAGTTCTATAACCTGTCACATCTCATGGTGGATAGCGTACCACAAGTCAGAGAAGCATTTGCAGCTAAACTTCACAAAGGATTGTCGAAACCGGACCGCCGGGTGCGCGGGCTGGTGAGGCAGTACATGCTGGCGGACGTGGTGAGACGCAGAGAGTACGTCAGGAACATCACCGTCGGGACCAAGGGAGAAAGGAGTGAGACGAGCGTTATTGTTTGTAGCGTTTATCCCGTTGGAAAAACGTCCGCGAAACGCCGCTGGGAATGCGAGGAGGCTCTGACAGTGGTGAAGCAATGCCTGTGGTTCATACTGGAACCTCTCATAACACGCAATGACTTCTACTGCTACGGATTCTACAAGAGCCTGGTGGAAAGGATGAAGAGTCACAAGGACGCTCTCAACGAGACCGATGACTCGGTTAACTATAAACTGTGGGCCACGTGTGACCTGGCCATGTCCGTAATCTGGGCGCGGTCGAGTAGTTTCGAGTTGCGGGACTTCCCCTCCGACGCTCGCATACCGACCATGTACTTCGCCCCGCAACCTGATTTCTTCGTCAACACCAGGGTCTTCCTACCGCCGGAGCTACAGTTCCAACCGAAACGCCAGGGTACAACGGAAACAAATACAAAGGCAAAGAAACGTCCCAGACAAGACAAGGATTCGGAGAATACTAATGATGTAGAGGTGACTATATAG

Protein sequence:

>DPOGS213498-PA
MAEIVYPQGCRPITDDLGPDELVRRLKALAHTLQGLGQDEGMYQQYIPLALHLADEFFLTHPSRDVQLLIACCIADVLRVYAPEAPYKDQEQVKTIFLFLINQLQGLRDPKDPAFKRYFYLLENLAYVKSFNMCFELEDCQEIFCALFSLMFKIVNTEHSSKVKSFMLDVLCPLITESDVVSNELLNVILLNLVEPNKREHKHAYTLAKELIIKTSETLEPYIQAFFNHVLILGKEEKNLLIFSKVYELIYELNQCCPSVLLSVLPQLECKLKSAQFHERLSAVALLARMFSEPGSELAKQYPALWRAFLGRFNDISDQIRIKCVQYCMHFLVHHPDLRKDITDTLKMRQHDAQEQVRYEVVMAIIATAQRDFKAVAASEDLLHFVRERTLDKKFKIRKEAMSGLAMIYKKFLTEESVPPATEKAVQWIKDKILHGYYMTALEDRLLVERLLNTSLVPYTLPPTVRMKKLYYLMSNVDDNATKAFIELQKHQLAVRRTVAEWVDLHRKPPTPAVQKEMISKVLHISSKFLPESVKAQEFLNKFSNHMKKAPELLQGMETILNPNVSCEVCVRTTSSVLKKLGQPVMTNLYYNTVKMLLERVSSVMVDHESLLILVGYVEGAVRGNDPSIAEECGLGQDEGMYQQYIPLALHLADEFFLTHPSRDVQLLIACCIADVLRVYAPEAPYKDQEQVKTIFLFLINQLQGLRDPKDPAFKRYFYLLENLAYVKSFNMCFELEDCQEIFCALFSLMFKIVNTEHSSKVKSFMLDVLCPLITESDVVSNELLNVILLNLVEPNKREHKHAYTLAKELIIKTSETLEPYIQAFFNHVLILGKEEKNLLIFSKVYELIYELNQCCPSVLLSVLPQLECKLKSAQFHERLSAVALLARMFSEPGSELAKQYPALWRAFLGRFNDISDQIRIKCVQYCMHFLVHHPDLRKDITDTLKMRQHDAQEQVRYEVVMAIIATAQRDFKAVAASEDLLHFVRERTLDKKFKIRKEAMSGLAMIYKKFLTEESVPPATEKAVQWIKDKILHGYYMTALEDRLLVERLLNTSLVPYTLPPTVRMKKLYYLMSNVDDNATKAFIELQKHQLAVRRTVAEWVDLHRKPPTPAVQKEMISKVLHISSKFLPESVKAQEFLNKFSNHMKKAPELLQGMETILNPNVSCEVCAPLIIPKKYYKIQYSSSVLKKLGQPVMTNLYYNTVKMLLERVSSVMVDHESLLILVGYVEGAVRGNDPSIAEECGIDLKKAAERGLKLLVMLSFMFPAHFLHEDVLHRLTGLLELDEENVAPHVLAALTFLGKYRPLSEACPALFPKLITLCKAYAEVGTPKQAKNAVRCLFVNVPDQRSQIFTDILETLKTTLSPHSEHYRTAIVTLGHIAHNLPDNFPVLIKNIVSRKIVKELLVREGGGGPNAPEGDWCPEEDLPEETRCKLEGLKCMARWLLGLKRDELSAQKTFRMLNAFIVHKGDLLQQKQLSGAEMAHLRLAAGAAMLKICEQKGVGDQFTADQFYNLSHLMVDSVPQVREAFAAKLHKGLSKPDRRVRGLVRQYMLADVVRRREYVRNITVGTKGERSETSVIVCSVYPVGKTSAKRRWECEEALTVVKQCLWFILEPLITRNDFYCYGFYKSLVERMKSHKDALNETDDSVNYKLWATCDLAMSVIWARSSSFELRDFPSDARIPTMYFAPQPDFFVNTRVFLPPELQFQPKRQGTTETNTKAKKRPRQDKDSENTNDVEVTI-