Monarch geneset OGS2.0

DPOGS206641
Transcript        DPOGS206641-TA  3033 bp
Protein           DPOGS206641-PA  1010 aa
Genomic position  DPSCF300048 - 401895-412277
RNAseq coverage   476x (Rank: top 26%)
Annotation
Heliconius       HMEL011145           0.0      70.85%
Bombyx           BGIBMGA003928-TA     1e-09    27.13%
Drosophila       CG32529-PA           5e-65    49.37%
EBI UniRef50     UniRef50_E3WXH8      2e-63    57.07%  Putative uncharacterized protein n=2 Tax=Pancrustacea RepID=E3WXH8_ANODA
NCBI RefSeq      XP_001816051.1       4e-64    45.65%  PREDICTED: similar to AGAP004446-PA [Tribolium castaneum]
NCBI nr blastp   gi|383863458         2e-63    58.97%  PREDICTED: uncharacterized protein LOC100880619 [Megachile rotundata]
NCBI nr blastx   gi|383863458         2e-74    29.16%  PREDICTED: uncharacterized protein LOC100880619 [Megachile rotundata]
Group
Gene Ontology    GO:0003677           1.6e-17  DNA binding
KEGG pathway
InterPro domain  [856-950] IPR001025  1.6e-17  Bromo adjacent homology (BAH) domain
Orthology group  MCL34668             Lepidoptera specific

Nucleotide sequence:

>DPOGS206641-TA
ATGAGACCGCCTGCGGACTGGTTAGTTCTGGGATATTACCAGCCGGCTGGGCCCTTGATAAAGGAGACGAGCGCGGACGCGGAACGAGCGCTAGCTAGATACCATCCCCCGCCTCACTACCATCCACCACCTGTACTACAACAATATGGTCCATACTACCCGGCTGAACTTTGTTATGGAAGGTACTACCGACCCCCAGGACCATATCACATGCCCCCACAGCCGGATGCACCGATGGTGGGAGTAAGTGAGCCGTATCCTCCCCCTCAATACTACGGCCCGACACCATGCTACCCCCACTCACCACAAAGAATACCATACGTAGAATACCGAGGATGCCCGTGCCCATTAAACGCGTGTCCAAAAAACGTCCTTATTGGGCCCGCTACTGGTAAAGGCCCCACCGGTGGCGGCCCAGCGGACGTTCCCTCGGCCTTAGCGCCGCCCGGGGCCGCCTTCAGACAGAAGGTGCGGGTCGCGCTCGACTCTAACGCAGATACTCACGGAAGCGGAAAGGATCCTCCAATAGAACCGGAGCCACCAGAGCCTCCAACCGCAGCAGACGCCCCTCATCCATGTCCTCTGCCAGCTATTGGTGGTGCATCACCAAAGCGTGAGATGTTTGACGTTCGAGATAACCCTCGCCTGAACGCTGATATCCCGGGACCAACAGAGGCTATTGGACCCCCCTCCCCTGCAAGAGGAGTCTGTGCTCCACAACCACCGCTAAGTGCCCCAAGACGGGCTCGACTTGGCAAAGCCATGGCCAGGCGGTCATTAGCTGGTGACGACACGGCACTTCTCATATCAACACCACAACCGATAGTTGATAATCACAATGAGGCAGATGATGACGTTCTCGCCATATCACCGCCCACGTGTGCGCCCATCGATTTATCGTCCTCTGATGAACGATCAACACCATTAGTGGACGTCAAAAAAGAGAACGAGGGACCCACTTACGCAACGTCAAGAAAGACTGATGCGCAAGCTCTTAGGGAACATAATTGTACAACCGCCTTACAGACCTCGGATGTCGCGGATCCTGCTAAAGCGGTCAAGCGGAGATATTCCAAATCTAAGCTTGAGATAGAAATGAAAGATGTTCAACTCGAGGACGCATGTAAAACAAATGGCATCGGGGAACATGTTAAACTGCAGAAACGAAGGAAGGTTTCTGGCGAACTGTCACATATACCGCGTACAGAAACAATTACTAAGGAACCAAAAAAGAAACCAACGCAGAAAAGGACAACAAAAACTTCAATCAATGATAAGAAAGGACATAAAAGGAAATCAAGTACTGTCGTAGAGGAACCCAAACCTAAGACCATGTTACTTGACATCATTACCAATGAAGATCCTCGTCTGACAAATGTTGCTGCTGTTTCAGAGGATACTATGAGTATGAAACCCAAAACTAAAAGCGCACTTCTGCTAGATAATTTAATAAAGAAAAATAGCGTAGACAGGCCTGAAGCTAAAGGATATAATCCTGATACAAAACTAAACCCAGCGGAACTCTTTTTACCACCGAGCGAAAATGTCACTCAAAACAGTGCGAAGAAAACTTTAAATATTAATAATAACATAGTAGATACAAATTCCCCCATAAACGGGGTGCGATCAAAAACTATTGCGGCTGTGAGTCAGCTTATTGAAAAGAAGCTGGCGTATAAGAATGCTTGCAAAGTAGACAAAGTAGATTCCAAGGTTACTCACAGCCTTAAATCAGTATTATCTTCACCCACCGAAGCTCTGGACGACAAAGAGGATATAAAACAAGGAGTTTTACTAAACGGTGAAATAAATATTGACAAAAAAGTCTGCCTTGAAAGTGATATAGGAAAAATTGCAGATATCAAAGAAAAATGTGAATACAAAGACATTAAAAATACAATGAATATTGAAAACAATATTTTGCCAAATGAAAAGTCAACGAAAGTAAACCTTAAGGCCTGTCCAAACAAAACTAAAGAAACCAATAACATCGATCATGT
CGAAGAATCTCAAAAGACAGAAAAAGCTTTAAAAGGCGAAGTAACTAATTCGTGTAAAGCTAAACACGTCGAAAAATTAGTGAAGCTGCTGGCTTCGAAAAAGCAAGCCTCGAAATTATCTGAATCGGAAGCAGTAATAGAAGACGAAGCGGATGTAGACAGCAATCAAGATATAGATAGAGATAAGAGTACAAAACGATGTAAACTAGTCGGGGAAAAGGTAAATGGAGAGAAATGTAAGACAGAGAAAGTTGCGAAGCTGCTCGTGTCCAAAAAGACAGTTATAAAGACTGATGAAGCATCGTTAGTCGTGGAAGACGCTTGTTCATCACCAGTGACGAATACAAGGAAGACTAGAAGAAGACGTAGCGGAGGAAGAAGGATGAGAAGAGCTGTAGCTCCTCTAGCCCCACCAGACTTACAACCAAAAGCCACGCCACGATGGAGCAATGGATGGAACTGGGACGGCGAACACTATTTATCAAAAGTCTACCTCAATAACGACTCGCGCCTTGACCGCACTGCTCGTTGCTGGGCTCGTATGACCCACGCGAGCGGGGATCGCGTCTCGCGAGGAGATTGCGTCTTACTCAGAGCATCCCAGGCCAGGGCTCAACCATTTGTGGCCAGGATCGCCAGCCTGTGGGAGAATCCTGATGACGGTGAGATGATGGTGTCCCTCGTATGGTACTATCGTCCGGAACATACCGAACGCGGTCGTCAGTCAACAGACGCTCCCGACGAAGTGTTCGCTTCCAGACATCGAGACGCTAACTCTGTCGCCTGTATAGAAGACAAGTGCTACGTACTCACGTTCAATGAGTACTGTAGATACAAGAAGCGCCTGAAGGCGTTGGAAGAGGGCGTCGTGATCACTCCATCAATAGTACCGTCGTTGCCGGCCAGCGAAGTGACGCCGGCTCTAGCTCCAAACGATACAAAACTTCCGCCTTCTGTATCACCAGAGTTAGTGCTGTTCTGCCGAAAAATATATGATTTCAGGTCCAAAAAAATTCATGTACCCAACAAATGA

Protein sequence:

>DPOGS206641-PA
MRPPADWLVLGYYQPAGPLIKETSADAERALARYHPPPHYHPPPVLQQYGPYYPAELCYGRYYRPPGPYHMPPQPDAPMVGVSEPYPPPQYYGPTPCYPHSPQRIPYVEYRGCPCPLNACPKNVLIGPATGKGPTGGGPADVPSALAPPGAAFRQKVRVALDSNADTHGSGKDPPIEPEPPEPPTAADAPHPCPLPAIGGASPKREMFDVRDNPRLNADIPGPTEAIGPPSPARGVCAPQPPLSAPRRARLGKAMARRSLAGDDTALLISTPQPIVDNHNEADDDVLAISPPTCAPIDLSSSDERSTPLVDVKKENEGPTYATSRKTDAQALREHNCTTALQTSDVADPAKAVKRRYSKSKLEIEMKDVQLEDACKTNGIGEHVKLQKRRKVSGELSHIPRTETITKEPKKKPTQKRTTKTSINDKKGHKRKSSTVVEEPKPKTMLLDIITNEDPRLTNVAAVSEDTMSMKPKTKSALLLDNLIKKNSVDRPEAKGYNPDTKLNPAELFLPPSENVTQNSAKKTLNINNNIVDTNSPINGVRSKTIAAVSQLIEKKLAYKNACKVDKVDSKVTHSLKSVLSSPTEALDDKEDIKQGVLLNGEINIDKKVCLESDIGKIADIKEKCEYKDIKNTMNIENNILPNEKSTKVNLKACPNKTKETNNIDHVEESQKTEKALKGEVTNSCKAKHVEKLVKLLASKKQASKLSESEAVIEDEADVDSNQDIDRDKSTKRCKLVGEKVNGEKCKTEKVAKLLVSKKTVIKTDEASLVVEDACSSPVTNTRKTRRRRSGGRRMRRAVAPLAPPDLQPKATPRWSNGWNWDGEHYLSKVYLNNDSRLDRTARCWARMTHASGDRVSRGDCVLLRASQARAQPFVARIASLWENPDDGEMMVSLVWYYRPEHTERGRQSTDAPDEVFASRHRDANSVACIEDKCYVLTFNEYCRYKKRLKALEEGVVITPSIVPSLPASEVTPALAPNDTKLPPSVSPELVLFCRKIYDFRSKKIHVPNK-