Monarch geneset OGS2.0

DPOGS206842
Transcript: DPOGS206842-TA  5817 bp
Protein: DPOGS206842-PA  1938 aa
Genomic position: DPSCF300001 - 3095596-3170184
RNAseq coverage: 1421x (Rank: top 9%)

Annotation
Heliconius: HMEL013281  0.0  66.37%
Bombyx: BGIBMGA012794-TA  0.0  72.62%
Drosophila: Smr-PC  1e-89  45.54%
EBI UniRef50: UniRef50_E0VK13  2e-118  54.42%  Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VK13_PEDHC
NCBI RefSeq: XP_002426457.1  3e-119  54.42%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|242011439  6e-118  54.42%  conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|328781510  0.0  31.34%  PREDICTED: hypothetical protein LOC724535 [Apis mellifera]
Group:
Gene Ontology: GO:0005515  8.4e-15  protein binding
               GO:0003677  8.6e-08  DNA binding
KEGG pathway:
InterPro domain: [495-562] IPR009057  8.4e-15  Homeodomain-like
                 [510-558] IPR001005  8.6e-08  SANT domain, DNA binding
Orthology group: MCL16304  Insect specific

Nucleotide sequence:

>DPOGS206842-TA
ATGCCGCCTCGTCAGGACATCGTGGTGCAGGTGGAGCGCGTGCAGGTGCAGGGCTCCCGGCCTCCGCCACCCCCAGCGCCGGGTCGATTGCAGCGCCCCCCCCACGACCATCCCAACAACTACATAAATCGATCTAACGCTCGTCAAACTAACAACAACACGCAGGTTCAAGCTCGAACTGAAGCTAAGTTATGGGAACAAGAGAAAAAATCTACACGTGTTCAAATAGTTTATATTATTATCTGTGACAATGGTGACCTGCAGCGGCAGTCGGCGGCTCTGTCGCAGGTAGACGAGGCTGAGGAGAAGCAGATGTATTATAACATAACATTACACAGCTCCTTGCAAGCCCGGACAGACAGAGCACGGGCACGGGGCACTGGCGTACGGCTCGTGGCGCCGGCGCGTCCCGCCGCGCCCTACGCGCCGCCCTACCCGCCCGCTCAGCTCCGGGTGCACCAGCCCGGATATGGATCCGCCGCTTTATCATACAGAGACAGTTACAGCCGCGGCGGCGGCGGCGCGGTGGGAGAGTACCGCGCGGGCGCGCGCATGTCGCTCCTGGGGTCTGCGTACCTGCCGCCGCCCGCGCCCGCCCACCATCCGGACCCGCCGCCCTTCAAGAAGATCAGGCTGTCGGCTGAGCGACCCGCCACGCACCAGCCGCTCCGGGTCGACACCAGGGAGCCAGTAATGAACACGTACAACAACAGCGTGGAGGTTCTGTCTCCCAACCCTCCCTCCGAGCCCACCGTCGAAGACCAGAGCTTCCGCACCACCAAGGATGATCTGCTGCAGCAAATATCCAAGGTGGACAGGGAAATGGCGCTATCGGAGCAGACGCTGTTGAAACTAAAGAAGAAGCAGGAGGAACTGGAACAGAGCGCCAGCAAACCGGCCCGCGCCGAGGAGCCCGAGGAGGCGCCGCCCAGGCACAGGAGCCTGGCACAGTGCATCTACGCTGATAATAGGAAGAGGGCCGCATCAGCGCACGCTGTGCTCTCTCACCTGGGTCCGTCCGTGAGTTACCCGCTGTATAACCAGCCTCAGGACACACAGGTCTATCACGACAATATAAGAAGACACCGCTCCTTCAGGAAACGACTCGCAGACCATCTCAGGAAGATCAGGCTCGAGACAGAGAAGCGTGAAGATGCTCTGACAGAAGAATACAGTCGACGAGCTGCAGACTGGCTGAGGAGAGTGGAGCGGCTGGAGATGGGACAGAAGAGAAAGGCCAAGGACGCCAGGAACAGAGAATTCTTTGAAAAGGTGTTTCCGGAGCTCCGTAAACAGCGCGAAGAACGTGAGAGATTCAACCGGCTCGGGGCCCGAGTGAAGTCTGAAGCAGAGTTGGAGGAGATCGCTGATGGGCTCCATGAACAGGAACACGAAGACAAGAAGATGCGATCCTTGACGGTCGTACCACCGCTGTTGAGAACACGCGACCATCATCACCACCACCACCACCATCTCGACACCAACAAGCGCTGTATGGACATGGAAGCTGAACATAAAGGTCTCCAGCTACGGAATGTATGGACTCAGGCTGAGAGAGAACTGTTCCGGGAGAAGTACCTGCAGCACCCCAAGAACTTCGGACAGATAGCATCCTTTCTGCCAAGGAAGAGCGTTCGTGATTGTGTCAGGTTCTACTACTTGTCAAAGAAGGCCGAGAACTACAAACAGTTACTCCGCAAGCCTCGTCCTCGTCGTCACGCCCGCCCGAGACCTGCACCAGAGCCCGAGCTGCCGGCTGGTGTCACCACACGCTTGCAACGAAGTCAAGGATCTCGTGGCGCTGAGAAGGAGTCGTGCCTGGAGGAGATGGCCGAGCCCGTCAATTGCATCCCCGTGCCGCCCGCCGCCCCCGCGACCGCCAGTGAATCAGTTGATATGATTAAATCCAAAATGCTTTTGGGATTGTACAAGTTCCTATTCGGTGAAGGATTCTTAAAGATGATGAGAGCAGAAGGGGCTGAGATGGCGAGGACACCTCC
TCTGCCCCCGTCCCCTCCCCCGGCCTCGTCCGCTCCTCCAGCTCCCGTCACCAACAACACCAGCAGCTCGAACCCCGCGGGCGTCACCTGTTCGAGTTCGGTCAGCTCTAGCTCGAGCTCATGCATCACAAGCTCGAGCACTATCACTGTGACCAGCCCGCAAAGCACCAGCGGTGTGAGTGTGGCCACCGTGAGCGTGTCGAGTACTGTGACCGTTGCGAGCAACGTGTCCGCCAGCGGGACGCCCAGCCCGGCCAGTCAGCCCAGCTCTAGTCAGCCGTCCACGGAGACGCCACCAGCTCAGCCCGCGCAGCCTGAACTACCGACGGCCACTGTCACCACCAGCACCACTACAGTCATTACTACGACGGCGGTGTCGAGCGGCGCTTCCGTCAGCTGTGCCACCGGTGTGATGGTGAGCGGAGTGACCCTCACCGTGGTGAGTGGACCGCCAGGAGGGAGCAGCGCCCCCACCATCACACAGACGACATCCTCGGCGGCACCGGAGGCCAACAGCACGCCTTCTTCTACCATCGCGAGCACCACTGCCACCCTGACGCCTGTAAGCAGTACATGCACGGTATGCTCGGTGGCGGCGGCAACCAGGGCGGTGGCCCCGTCACAAGCATCTCAATACGGCTTGGACCCGTCCGTTGTGGGGGCCCGGGTGTGCGAGGGTTGTCACTGCCGATGTGTGAGAGCCCGCCGGAGTAGGTGCACGGTGCCATCCTGTGCTGGGCCGAGAGTCAAACGGCTACGACACCTGCCGCCACGCTGGCACGATCTCCCGCAAGATCTAAAGAGACCGATCATGGATGAATTCCAGATACCGAGCGACCTCAGCAAGTGCTGCCTGACCTGTTTCAAAAGGATAACACGTCGTTTGGACACGATAGGAGAAGGTGGATCCGCCCCGGAACCGACCGAGGATGAAGCAGCAAGGTTCAGAGCTCTGCTCAGAGAGCACGGCACGGCCTGGGAGAGGATGGCCACCGCCAGCGGTAGGACGCCGGCCAGCCTGAAGGCCTTCTACTTCACTTACAGGAAGAAACTGCAACTCGATACTTTGATAAGTGAGAGGAATCCGACCAGAGGCAGCGACACGGACGACAGTATCCTTAGTTCGGGAGACACGGACACCGCGAGTGCGGAGTCTCCGCGACCCCCGCCGCTCCCGGGTCCGCGGACTGATGCCGTGGCGCCCCGGAGACACAGGAGGGACGAATATGATTCATCCGCCACCGAGACGGCCGACGAGGAAAACGATGCACCCAGTACTAAGACTGTGAACGCAGTATCAGTAACCACGGGGGTGACGACGGTGCCGCCGGGGGTGACGGTGACAGGAGTATCGGGCACTACGAGCGGGGCGGGGGTCACCGGGGTCAGCGGGGTCTCCGCGGCAGGAGCGGGTAGTAACGGCTCCGGGCCTCTGACGGTTTGCGACGTGGTTCTCAACATGATCGAGGTCAGCCTCATGAAGAACTCCCGCCCGCCTCCGCTGCCGCACACACATTCTATGCACCAGGCGCCCAAGGTTAATGTGTCTTCGGGCCACGAGTCCCTGGCCACTCTGACAGTAGTGAGCGGTGCTGGACACGAGCACTCCCCGCGGTCTCCACAGCGTGCCACTATCACGCCGCTAGCGGACAAGGACCTCATGGTTCTTCACATAGAACCACGCGCGCCTGAGTCGTTGCTCGACCTCAGCGTCAAGAGACCACGCCACGATCCGCCGAAGCCGCAACAAGCTCACCGGAACACTGAATACTCGCCGTACCGAGCCCAGGAGCGTGAATCTCCGACGCAGTACAATTCCAAACCGAAGCCGGTCGCATCCCCCAGACAGACTCTCAAGTTGCCAACCAATCCGAAAGGTTCTATCACACTGGGAACCCCAGTCGAGTCACGTTTCGAGACGCAGCGTATGCAGTCTGACCCCAAGACTGGTTCCATAACTGCCGGGACCCCGGTCCATGCGCCACACCATCTTCCGGAGA
AGAGGAACTTCGATTACTACAAGAGGAGGAGCCCAGGGGGAGCATACGCATACTCGGCCGCCCCCAATCAGCCTCGGCCACAGTCACCGTCGTTCTCAAACCAGCCGAGTACGGGCGCAGTATCTCGAGGTCCATACGGTCACGAGCGGCGGCAGATCATGCTGACGGACTTTATCACGTCGCAGCAGATGCACGGTGGAGCCAGACGCAGCGAGCAGCCCCATGCTCAGCACGGACAGCACGCGCAGCACGCGCAGCATGGACAACTCGTCCATCGCCGGGACCGGGATAGCGTGTCCGTCATACAGAGACATACACACACGTACTCGCATCCGCCGCCCGGTCACGAGGCGTTCACGTCTCTAGTGAACGCGGCGTCAGCGGCTCCCGCCCTCCCCGTCCCCCGGAGGGAGGAACCTGCGTCGCCGCCCGTACACCACCAACCGCACCTCCAGCACCACCAGCACCAGCAGCATCATCAACAACACGAACAGATCCGGGGACCCCGGGAATTTGCTATGGCGCAAAAATACGCACAGTACTCAACGGATAGACGAGCTCCGCCTAGAGACAGTCGGGAGCGCATGGTGCCTATCCAAGAACGCCATTACGTGATCGAACAGCATAATCCACACAACCCTCACGGGCAGCACAACCAGCAGCAGCAGCAACAGCAGCATCACCAACATCAACAGCAACAGCAACAGATGATGATTGACAGACACCACTTAAACCATGGCCAGCAGAACGTATCCTCGAGCGAGGAGCGTCGGTCGAGTAGTGGCAATTATAATACGACTAACAAGAGGCAGATACATCCCATGGAACGGAAAGAATGTCGGCCAACTGTGAGTGTTACAAGCGTCCCCATAATGAGCACGGTCGTCAGCACACCCACCAGCGTATGCGTGAGTTCTGGCGTGGGCGCCGGTGTCGGCGTGAGTGCGGGTGTGGGCACAGGCGGGGCCTCAGCGGACCGCGCACGGCCCGGAGACGGCACGCTCACGGCCGCCTCGCTCATCAACGCCATTATCACGCATCAGATAAACCAATCCAGTGATCAACGGTTTCCACCCAAGATAATACGTGAAAATGAAACTTCTCAGCAGCCAGAGCTCCCGAGAGACCTGCCAAGAGACATGCCCCGTGAGCTGCCTCGCGAACTGTCCCGGGACATGCCTCGGGACGGAGAGAGGGAGGAGCCGCCGGGACACACCACTAGCATCAAACTCGGCGATCTCGCGTCCAACATCATAGTCAGAGACTTCAGCAGCCCGAACACCACTTCACTCATGCACCATAATACCAATAATCGTTTTGCCGTGTCGAGTGCCGACTCGTACACCAGCGGCGGAGCTGGTGGAGGAGCGGCCGCCGGTGGAGGACAAGCAGAGGAGTGGAGGAGGGAGGCTCCCAAACACGCTCCCTATCTGGAGCCAGTGTCGCCTCCAGACAATCATCACTCAGGTCGTAGCTCGGCCGGTGGCGGGACTGGCGCCGGCGCAGGCGGTCGTCGGTACAGCGCAGTCGGCGTGGGCGTGGGTGGTGGAGTTCTCACGGCCTACGACTACGTGAAGACTCGCATCGTGGAGGTGATGCGGAGTGACTCGGACGAGCGCGCCAAGCCGCTCACCTTCCCCACGGCCTACGCGTACCCGTACTCGGCTCTGAACGTCGCGACGCCCACGGCTCCTCCGCCGGCCCAGGCCTCGGCTCAGTTGGACCCGCCGGCCGTGGCAGTGTCCGTGGGCGGCGGGCAGCCCGAGCCGGCTCCGCTCATGTCTGCTCAGTACGAGCCGTTGTCGGACGAGGACTGA

Protein sequence:

>DPOGS206842-PA
MPPRQDIVVQVERVQVQGSRPPPPPAPGRLQRPPHDHPNNYINRSNARQTNNNTQVQARTEAKLWEQEKKSTRVQIVYIIICDNGDLQRQSAALSQVDEAEEKQMYYNITLHSSLQARTDRARARGTGVRLVAPARPAAPYAPPYPPAQLRVHQPGYGSAALSYRDSYSRGGGGAVGEYRAGARMSLLGSAYLPPPAPAHHPDPPPFKKIRLSAERPATHQPLRVDTREPVMNTYNNSVEVLSPNPPSEPTVEDQSFRTTKDDLLQQISKVDREMALSEQTLLKLKKKQEELEQSASKPARAEEPEEAPPRHRSLAQCIYADNRKRAASAHAVLSHLGPSVSYPLYNQPQDTQVYHDNIRRHRSFRKRLADHLRKIRLETEKREDALTEEYSRRAADWLRRVERLEMGQKRKAKDARNREFFEKVFPELRKQREERERFNRLGARVKSEAELEEIADGLHEQEHEDKKMRSLTVVPPLLRTRDHHHHHHHHLDTNKRCMDMEAEHKGLQLRNVWTQAERELFREKYLQHPKNFGQIASFLPRKSVRDCVRFYYLSKKAENYKQLLRKPRPRRHARPRPAPEPELPAGVTTRLQRSQGSRGAEKESCLEEMAEPVNCIPVPPAAPATASESVDMIKSKMLLGLYKFLFGEGFLKMMRAEGAEMARTPPLPPSPPPASSAPPAPVTNNTSSSNPAGVTCSSSVSSSSSSCITSSSTITVTSPQSTSGVSVATVSVSSTVTVASNVSASGTPSPASQPSSSQPSTETPPAQPAQPELPTATVTTSTTTVITTTAVSSGASVSCATGVMVSGVTLTVVSGPPGGSSAPTITQTTSSAAPEANSTPSSTIASTTATLTPVSSTCTVCSVAAATRAVAPSQASQYGLDPSVVGARVCEGCHCRCVRARRSRCTVPSCAGPRVKRLRHLPPRWHDLPQDLKRPIMDEFQIPSDLSKCCLTCFKRITRRLDTIGEGGSAPEPTEDEAARFRALLREHGTAWERMATASGRTPASLKAFYFTYRKKLQLDTLISERNPTRGSDTDDSILSSGDTDTASAESPRPPPLPGPRTDAVAPRRHRRDEYDSSATETADEENDAPSTKTVNAVSVTTGVTTVPPGVTVTGVSGTTSGAGVTGVSGVSAAGAGSNGSGPLTVCDVVLNMIEVSLMKNSRPPPLPHTHSMHQAPKVNVSSGHESLATLTVVSGAGHEHSPRSPQRATITPLADKDLMVLHIEPRAPESLLDLSVKRPRHDPPKPQQAHRNTEYSPYRAQERESPTQYNSKPKPVASPRQTLKLPTNPKGSITLGTPVESRFETQRMQSDPKTGSITAGTPVHAPHHLPEKRNFDYYKRRSPGGAYAYSAAPNQPRPQSPSFSNQPSTGAVSRGPYGHERRQIMLTDFITSQQMHGGARRSEQPHAQHGQHAQHAQHGQLVHRRDRDSVSVIQRHTHTYSHPPPGHEAFTSLVNAASAAPALPVPRREEPASPPVHHQPHLQHHQHQQHHQQHEQIRGPREFAMAQKYAQYSTDRRAPPRDSRERMVPIQERHYVIEQHNPHNPHGQHNQQQQQQQHHQHQQQQQQMMIDRHHLNHGQQNVSSSEERRSSSGNYNTTNKRQIHPMERKECRPTVSVTSVPIMSTVVSTPTSVCVSSGVGAGVGVSAGVGTGGASADRARPGDGTLTAASLINAIITHQINQSSDQRFPPKIIRENETSQQPELPRDLPRDMPRELPRELSRDMPRDGEREEPPGHTTSIKLGDLASNIIVRDFSSPNTTSLMHHNTNNRFAVSSADSYTSGGAGGGAAAGGGQAEEWRREAPKHAPYLEPVSPPDNHHSGRSSAGGGTGAGAGGRRYSAVGVGVGGGVLTAYDYVKTRIVEVMRSDSDERAKPLTFPTAYAYPYSALNVATPTAPPPAQASAQLDPPAVAVSVGGGQPEPAPLMSAQYEPLSDED-
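
The transcript and protein lengths listed above are consistent with each other: 1938 codons plus one stop codon give 3 x (1938 + 1) = 5817 bp. The following is a minimal sketch of that check, assuming the standard genetic code; the helper name `translate` and the variable names are illustrative and not part of the record. It translates only the first 30 and last 15 nucleotides of DPOGS206842-TA as copied from the sequence above. The sketch writes the stop codon as `*`, whereas the record writes it as `-`.

```python
from itertools import product

# Standard genetic code (assumed here), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"  # first base T
    "LLLLPPPPHHQQRRRR"  # first base C
    "IIIMTTTTNNKKSSRR"  # first base A
    "VVVVAAAADDEEGGGG"  # first base G
)
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate a nucleotide string in frame 0; trailing partial codons are ignored."""
    usable = len(nt) - len(nt) % 3
    return "".join(CODON_TABLE[nt[i:i + 3]] for i in range(0, usable, 3))


# First 30 and last 15 nucleotides of DPOGS206842-TA, copied from the record.
first_30 = "ATGCCGCCTCGTCAGGACATCGTGGTGCAG"
last_15 = "TCGGACGAGGACTGA"

print(translate(first_30))  # MPPRQDIVVQ
print(translate(last_15))   # SDED*

# Length check: 1938 residues plus one stop codon, three bases each.
transcript_bp = 5817
protein_aa = 1938
print(3 * (protein_aa + 1) == transcript_bp)  # True
```

Both fragments agree with the start (`MPPRQDIVVQ`) and end (`SDED-`) of the protein sequence DPOGS206842-PA.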