Monarch geneset OGS2.0

DPOGS215741
Transcript: DPOGS215741-TA (3312 bp)
Protein: DPOGS215741-PA (1103 aa)
Genomic position: DPSCF300041 + 727129-732714
RNAseq coverage: 22x (Rank: top 79%)
Annotation
Heliconius: HMEL004077; E-value 0.0; 50.96%
Bombyx: BGIBMGA003617-TA; E-value 3e-56; 36.41%
Drosophila: (no entry)
EBI UniRef50: UniRef50_Q8C0W1; E-value 6e-38; 24.23%; Ankyrin repeat and MYND domain-containing protein 1 n=6 Tax=Eutheria RepID=ANMY1_MOUSE
NCBI RefSeq: XP_001182622.1; E-value 2e-38; 24.97%; PREDICTED: similar to zinc finger protein, partial [Strongylocentrotus purpuratus]
NCBI nr blastp: gi|327267201; E-value 3e-38; 23.17%; PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like [Anolis carolinensis]
NCBI nr blastx: gi|327267201; E-value 4e-39; 22.71%; PREDICTED: ankyrin repeat and MYND domain-containing protein 1-like [Anolis carolinensis]
Group
Gene Ontology: GO:0005515; 1.1e-05; protein binding
KEGG pathway: (no entry)
InterPro domain: [619-815] IPR020683; 1.2e-21; Ankyrin repeat-containing domain
Orthology group: MCL24924; Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215741-TA
ATGATAAATATTTGTGAGCCCATTGCTACAGTGAAAGAAAAAACATGGAAATATCAACAATACTATATAGGAGAAAGAGATGAGGAAGACAGACGAAGTGGGGAAGGAGAAAATACTTGGACGGGAGCTAAGTCCTTAGAGTGGTATGCAGGAAGGTTTATTCGTAATACCATGCACGGTGTTGGAGATTACCACTGGCGCTTCCTTGGGCCTGAGAACACGTTCGTTACATACGAGGGACATTTCTATTGTAACAGCATGCATGGCTACGGTATTATGTCATACCCTGACGGAAAGACATTCAGTGGCTTATTCTCCAATAACGTTCGATTTGGTCCCGGCGTGGAATCCCATTCTGATGTTACCGAAAACGTTGGGTTATGGCGAGGAAATAAACTGATAAGACTTGCTTGGCGACCAGAGGCACCGTGTGTCACTCCTGACTTCCTTACAAACCCCAACGGACAAATAGCTGTTGAACAATTCAGAACTGTACTCACTACCGAAATTGAAACTGTCGGCGAAGTAAACAATGCTCTAGAATTGCTTAAACAGAAAGGAGCAGATCCCCGTGTTGCTATGGAGAAATGGATGAAATTGTTTCCAAAAAATTGCACTGACCTTGCCAGCAAACTTTGCCAGATAGATATATTTGATAGAGATTATTATAAAGGAAAAATATATGCATTACAAGAAGTCGAAAAAGTACCAGAAAGAGAAGAAACGCTAAAAATAGACCAAAACTTGAATGAAAATACTCCGCAAGACGTTTGTACATTCTACGCTTGGAATAATAGTAGTATAATGATAAATTTAATGAAACATTGCTATAAGCATGAAAGACAGAGGAACATCACAAGAATAAACCTGAAGTCTATCCTATCTGGACCAAGAAGCAATTTTAAACCGACTGGAAATCACGAAGTCAACTGCAGGTCATTACTTATGGCTAGTTATTTAGGGTATATAACAGAAGTAGCAGAACTCATTAACACAGACAAAGTGCTTCCCGATGTGTCAGACATTCAAGGAAATACTGCTGTTATGTATGCTGCTGTCGGAGATCAAATTGAAGTAATTCATTTTCTCGTGGAAGCTGGTGCTAACATAAATTATTATAATGATTGTTGTTGCACACCACTAGGAGTTCTTTTGATGCGATATGCTTGTACTCAGAGAGATATTTCACACAACGCGATGGTTCAAGCTTTGTTACCTTCAACGACGTTTGCACCTCCAGCAATAGAACCCAATATTGTTGAATGGAATATAGTGCGTGAATTGACATGTCAAAGTCCTGGAAATTTACTAACTAAGAGTCCCAGTAAGATCACAAGGAACTTGAGTTCAAAGAAAGTTAAATCTCTTATGTCAATAAAAGATCAACCAACTTCGAAAAGAAAGCAGACCGACGCTGGATTACCGAAAGTTTCTGAACCAGATTATGAAACCGAAGAAGCCTTGATGAAACAGATGAAAAATGAAACAGATGAAAAAATATTATATAACAATCTTAACCGAGAGTATTGTATTAAAATAACAGATGACTTTACTTTGCCGTTTGGTACTAACAACAATCAATATATCTTTGACATAAGCGACATGGTTAAGGAGATTGATTCATTCGATGAAGAACTGAAAAAACCAGTAGAAAAAAATCCAAAGAAAGTTATATCAAAAGTAATTAAAGATACTATGAAAATTAGTAAAGATTTGATGTGGCAAAATGACGAAATACAATCAATAGACAGTGAACAAAAACTTAAAAACGATAAACTGTCCCGAATTATGCAAACTATTACACAGCTACTCGCAGACGGTGCTGATCCAAAGTTAGTTAAATGTCCACAGCCAGCCTTATTCATAGCTGTTATGTCTAACAGCTCGGATTTAATACGAAGTTTAATGAAGTCTGGTGCTGACATCAATGAAATCTATCCGCAGGTTTACCACTATACACCATTAGATATCGCCATATCTCGGCCATACAACACTGAAAA
TTTGGAAATAGTAGCAACGTTGTTGGAATGCGGAGCAGACACACGGCATTTATTAAAATACCAGCAAGAAGACAAGGAACCAGATGTTCCAGAAATTTTTACTCCTGGACCTACTCTATTTCATGCTGTACTTGCCAGAAAAGTTGACAACGTAGGAGAAGAAGATATACGTCGTCAGCTTGTGGAGCTATTACTACAATATAACTGTGACCCAATAGCCCAGTTTAAAGGCAGTTCGGCAATAGACGTGGCCATGAACCATAATTTCGATCTCCTCGATGTATTTATAAAGAACCCTAATACGAATCTTAATGCGAAAATAAATAGCTTTAATCAAAGCATCCTTATCAAAATGTTTAGTAATCCATTCTTTAGAAACGCTCCAAGTGGAAGTCGTCTGCAAACGTTCACGAATCTTTTATCTTACGGAGCTGATCCACTTATTGAGTGTCAAAATGGTGAACAAATTTACAGCAATATATTTGTTTTCGCAAAGAAAACTCTTTCCGAAATGGAAAATGCTCCGGGAAAAATATCTCCTACAGTCGCAAAACAAGATTCCAAAGTGAAGAAAGCTGACAAACCAAAAAAGGAAGACAGACTAAGCACTAAATCTATTGCAAAAATGGCGGTAGACGATATTGAAGATTACAGACAAGCAATCGAACAAGTAACAGACTGTGCTAGACTTATTTACATAAGATGGCTGCAGTCCAGGCTTGTCAAAGAGATGGTTAAAGTAATAGACAGATACAAACATAGACAGTGGAATATGATTTTTAAAGAATTTAAAGACACAAGAGGTCACGGACTTTGGCTTACACCTCAACGATCTTTGGAAATATGGAGTATATTATCTAAAACTAGAAAGAAGGCCTATAATGATGAGCGAAATTTGAGACATTTACTTTGCATTACCATTTATGTGTCTTGGAAAAGTTTTGAGCCTTTAAAAAATATTAAGTTATCTGTCACACCATTAACGGCTTCTCTTAAAAGCGTTATAGAAACTGATGTGACACGAATGTTGCGACAGTACAGAAGAAATATTAAATCATCGGACATTAAACCATGGGAATATTCCTGTGTAAAACCAGAATTGATGAAAGACATCAAAAAATTCAATATTTGTTTTGAATGCGCTCTGCCATTTGAACAAGATAAAATCGTATGTTCTTGGTGTAAGCTGGTCTCGTTTTGCTCATACGAATGCATTAAAATGAATATTGAAAGAGCCAATTGTCACCCGTGTAGTGACTTCTTGAAGTTAAAATATTTTCCTTCACCCGATGTTAGTTCTACCTTTATTTAA

Protein sequence:

>DPOGS215741-PA
MINICEPIATVKEKTWKYQQYYIGERDEEDRRSGEGENTWTGAKSLEWYAGRFIRNTMHGVGDYHWRFLGPENTFVTYEGHFYCNSMHGYGIMSYPDGKTFSGLFSNNVRFGPGVESHSDVTENVGLWRGNKLIRLAWRPEAPCVTPDFLTNPNGQIAVEQFRTVLTTEIETVGEVNNALELLKQKGADPRVAMEKWMKLFPKNCTDLASKLCQIDIFDRDYYKGKIYALQEVEKVPEREETLKIDQNLNENTPQDVCTFYAWNNSSIMINLMKHCYKHERQRNITRINLKSILSGPRSNFKPTGNHEVNCRSLLMASYLGYITEVAELINTDKVLPDVSDIQGNTAVMYAAVGDQIEVIHFLVEAGANINYYNDCCCTPLGVLLMRYACTQRDISHNAMVQALLPSTTFAPPAIEPNIVEWNIVRELTCQSPGNLLTKSPSKITRNLSSKKVKSLMSIKDQPTSKRKQTDAGLPKVSEPDYETEEALMKQMKNETDEKILYNNLNREYCIKITDDFTLPFGTNNNQYIFDISDMVKEIDSFDEELKKPVEKNPKKVISKVIKDTMKISKDLMWQNDEIQSIDSEQKLKNDKLSRIMQTITQLLADGADPKLVKCPQPALFIAVMSNSSDLIRSLMKSGADINEIYPQVYHYTPLDIAISRPYNTENLEIVATLLECGADTRHLLKYQQEDKEPDVPEIFTPGPTLFHAVLARKVDNVGEEDIRRQLVELLLQYNCDPIAQFKGSSAIDVAMNHNFDLLDVFIKNPNTNLNAKINSFNQSILIKMFSNPFFRNAPSGSRLQTFTNLLSYGADPLIECQNGEQIYSNIFVFAKKTLSEMENAPGKISPTVAKQDSKVKKADKPKKEDRLSTKSIAKMAVDDIEDYRQAIEQVTDCARLIYIRWLQSRLVKEMVKVIDRYKHRQWNMIFKEFKDTRGHGLWLTPQRSLEIWSILSKTRKKAYNDERNLRHLLCITIYVSWKSFEPLKNIKLSVTPLTASLKSVIETDVTRMLRQYRRNIKSSDIKPWEYSCVKPELMKDIKKFNICFECALPFEQDKIVCSWCKLVSFCSYECIKMNIERANCHPCSDFLKLKYFPSPDVSSTFI-