Monarch geneset OGS2.0

DPOGS210891
TranscriptDPOGS210891-TA4545 bp
ProteinDPOGS210891-PA1514 aa
Genomic positionDPSCF300045 - 807114-816819
RNAseq coverage16x (Rank: top 81%)
Annotation
HeliconiusHMEL0039571e-11659.03% 
BombyxBGIBMGA003769-TA0.073.78% 
DrosophilaCG17687-PA1e-6024.09% 
EBI UniRef50UniRef50_D6X4G18e-15029.78%Putative uncharacterized protein n=3 Tax=Tribolium castaneum RepID=D6X4G1_TRICA
NCBI RefSeqXP_966875.21e-17128.37%PREDICTED: similar to AGAP003166-PA [Tribolium castaneum]
NCBI nr blastpgi|1892416622e-17028.37%PREDICTED: similar to AGAP003166-PA [Tribolium castaneum]
NCBI nr blastxgi|1892416626e-17828.15%PREDICTED: similar to AGAP003166-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055154.4e-09protein binding
KEGG pathway 
InterPro domain[129-524] IPR0110464.4e-09WD40 repeat-like-containing domain
[483-520] IPR0159431e-05WD40/YVTN repeat-like-containing domain
Orthology groupMCL10829 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210891-TA
ATGGTTCAAGAGAAATCCTCAGAGAATGAGACTGGTCATGAAGATCCTCACGGATCCTACAAGCCTCATGTTAGATGGGCTATACCACAACAGTTGACTGCCATGACATTTATTGGCCGAGATGTTGTCGCTGTAGCACACGATATTTATATAATATTCGTTAATTTGAAGACTAACGCCGAGCTTGTGTACGCGGCCAACAAAGACGAATGTGGAGATGGAGTCGACGTCATTTCTGATGCTGACGTGACTCGCTACAAAGGTCTGTGTATGATGGAGTCAGACCTGGTCGCCGGTTTCAGTGGTTTTCCAAATTACCTCATATCAGTATGGAGCTGGCGGACCAACCAGCGACTTATTTCTGTACCCACGGACGTTGTACGCCGCAACCAGATTTACACTGCCAGTCACTCTAATATGCTTTTATGTGAGTGTTGGGGCGAGGGCTTAACTGTGTGGGAGGTAGCCCAATGCTATAAACGCTGTCTCATAATGAAGCGAGTGAAAAAGGAAGTCAAAGGCTGGGAGGTGTCCGATCCTAAGCTGGTTGCTGTGTGTTGGAGCAGCGATGGTCAACTTTATGCTCTCGATGGCAGTGCCAATCTATACAGCTTGATGTCGGATGGTATAGGTATGGTGACTAACCTGGATTGGTCTGAAAATCTGGAAGGATCTCTGGAACCTTGTATGTGTTCATTCGGCAACGGTATATTAATATATGGTCCTGATAATTGTTTACGGTCTCTTAAAAAGGGGGAGAAGATGTGGGTGGAAGCTTGGAAATACATCCCTGAAGATAAAGTGCTGTGTCTCGTTTCCAACTTCACTTCAGATGTGGCCGTCATGTGGACTGAGAAAGGCTTTATCTACAAACTGACTGGTGAGTCGGAGGAAAACATCGACGTCAAGTTGAAAATTGTTAAATATGTAGAGGGTGAGGACATTTCATTCGAGGCCAGCCCGACCGATCCCTTATTAATTATATTTGGGGAGGTAGGACAGAATTACGGAATTGCCTTGTTTACGTTTACCTCTGATGGTCTGGAGAAGGTGGGCACAACGTGTCTGACGCATCAGATTGTATCTCGAGTGGTTTTTTCACCAACTGGACGTGAAATGATCGCCGCTGCGATGTCGGCCGGACATATATTTATTTTTAAGATATCCGAGGACTACAAGCTTAGTTTAGTGAGATACACAGAACTCGGCCGAGGGTTGGCCGACTGCTTCCTTATGAAAGTCGGCGATTCCATGCGGGCGTTCAGTCTGGTGTTGTTTTCCGATAAGTATGCTATAGGTAAGGATAACAAGTTAGCCGGGAAGATGGCGGGTCCTTATGGTCAACTGGTGCCGCTCCTGGGGGCAGCGGGCAGCGCCGCACTCGCCGCGCCACACATTTCTTCACGCATGCATGTGTTAAGAATGTCGGGAGAAAAAGGTGTAACCGTATCCGTTAAAATGGGTCCGATGTTAGAAACTGGACACAGTTTGAAACATTTCATCTTTTTCATAAATGGAAATGCTCTCCTTACCTTCGGTGCTGATGGAACCGTTGTACTAAGGCAGCCGAACGAAAGCGACGAATGGCAATTGCAGTTGATCGTCTCGCATCGATACCAAACGGGAGTTAACCAGGCCGTCATCGATGCAGGAAACAAATACATCGCACACCTCGCAAATAACAAAACCATGGCAGTACACTACTTGTACCGGGATAAAAATAGCACTGAACCACAACATGCACCCCCGGACGACAGCTTGTTTACTGAAAGCATAAACACGATTACTATTATCAATAAAAATGATAAGAACTATCTAGACCTGCAAGAAGACAAGAAAGTGCGGGAGGAGACAATGGACTACAAGCATCAGCGAGAAGACGTTATAAAGGCACTCGGTGGTGTGCGGGAAAGACTCGTGAAGCTGTTAGAGGAAAACCTGGCTGAGAGACCGCTGCACCAGCTGTCGCTGTCGGAGTTCAACCTCCACCTGGAAAACAAGAAGGAGCGGATAAAACAGGCTGAGAAAGAGCGCGAACAGATCCGACTAGAAACCGAAGCCCGTATCCGAGCCCAGGACAAAGTAACTGAATGGATCAAGCACACGTGCTGGGATACCATGTCATCACCGAGGGTCAAGCTGTTTGCCATATTTAGCCACTATCAGGTCGAGAACTATGCGGTTCTCCCGACACAGCGTGACAATTGGCCGGAGTTGAATCAAATTGAAGCGCTGCGTACTATAGAAATGGAAAATAATAAAGACGTTTTCCGTCCTTGGGAGGAACCGAAGGAGGAAAGACTCAGTTTGGATGAAACGACAACTTTACAGTTATCCAGACTGGGCCCGGAATCCATGGTGTCGATGCGTGACATACGCAAAAGTATGGAGAGTCAGATGTTGTCGGAGGTGGTGGAGGTGGAGGAGTCGGTGGAGAGCGAGCCGTACGTGCTGTCCGGCTCCAACGCTCACCGCTTCATACAGATACCGAGCTTCATGATACCACAGACCATGGCCTACTCGTTCCTGCACATGGACTGGCTACAGCATATTGTAAAGTTAAACGTTCAAAACATGCGGCTTTGGTTTAACAAACAGTTTGATGATTTAATGAGCCAAAAGAAGCGCGAGGTCGGTCTCGTGAATGACAGGAACAATCGTCTTAGATTTATTCTCGACGAACTAAACAAGCTATCCGATTTGCGAGGAAGTTTCCATCACTTAACTAATCTTATTCGGGATCCGGAGTGGGCGCAGGAGGAACAGCCATACAGACTCATCAAGGTGGAACCTGAGGAGTGCAGTATTGAGCCCTACGTGAGTCCGAGCCAGGTTGTGATCCCACCGCCGGAGCCCAAACCAAAGGACGACTTCCGCGAACGAGCCCTCATGTACATGATGGACGGCGTGCTAGAGAAGCTGTGGCACGAGGAGATCAAGAAACCCATACCGATGCCGCAGTGCATGTTGGAAAAAGAACCCGAACACTTCAATGAAGATGATCTCAAACTCGTGTTTGATTATGAAGCCAAAGTTGCGTTTAGAAACGAGGAGAGAGACAAGTACAGAAAGATGCTTCACGCCGAATATGCCAAGTTATCACAAATACTTAACGAGGGTATTGTCAAGTTTAACCAGAAAGTTAAAGAAACTTGGCTAACTAAACTGAGAGTAGACTCAGTCATTGGTCAAGAAAACTTAAATCTCATGCGCTTGAGAAGAACCAATTTGGATCGCGTCGAAGTTTCTGAAAAACTGGAAGATATCAGATTCGAAATAAAAATCACCGAAGACGAGTTGGAGGTGCTACAACATGAAATGCATGCAATTCAGGAACAGAGCGAAGAATGCCAAACGTCGTACGAAAACCTCCTCCAAAAAGACAAATACAGCGACCGAACATTTAAGAATCACTTTGCGGATCTCTCCCCGATTATTATCGAACAATGTTATAAATTCTTTAAAAAAAGGCCTAAATGGCATCAACGTGCGACGATGATACCGGTGGTACTTTATGAATTAGCCACCGCCGTTTTGACCGGGGTCCGCCCAGCACTCCTACATGCTGATTGCGTTGAATACTTTAAGGGGGTCGAGCAATTGGATCAAATAAGCAATATGCCACCAGTTATGGATGAAGGACTTTGGGCAACCATGTGTAGGCTGCGACGTGCAAAAATTGAAAACGAAATAAGAATGCGTGCCGTGTGCATTGAAATGTCTCTAGTGGAGAGCGCTGCGAACACGTGGCGGGCTGGGGTGACGGCGCGGCGACTGAAGCTAGCTCATGCCGCTGACAGTATTACCTGTCTGCGGAGGGACTACGAGCTGGCAGCGAGGAACCACACCGTACAGATAGTGTTGCCAGCTGGTCAAGTAGAAATAGTCAGCACAGGCCATTTTGAGGACTTCGAGGACGCAGCTCTTATACCCAAAGATGAGGTTGAGAAAGTTAACAATGTTATTTTGCAAGTTGGAGAATGGAAACTGAAAATGATGAGGAAGCAAATAGAATTTCGAAAGGGCATCCTGTCTAAAGAATGGGAACACGCGCAGATGAAAATGAAATTACGCCACATGGAACAAGAACTATATTCCTACCAGCGGCTGAAGGTGCCGAAGGAACTTCAAAGTTACTTAAAAAATAAGGAGTTAGGTTATACGGATGAGCAGGATTACGTTAGGATGGAAAAAGAGATGGAAGCATCCAAAACGTCGGTCAACAAAATACTCAACGAGGAAATACGAAAAGTTGAAGAAATTGAGCTGAAAATAGACGCGGTAGAGGCTCAAGCTCAGGAGCTGGAAAAACTAATAACTTCACTTAACGTCAAGGTGTCGGAGAAGAGACTGAACGAGGACCCTCTGGAACCGATTCGCATCCGTCGTGTGTTTAAGAGACGTATGGAGACGCTGGTGTGTCGCAGCCAGCTGGTCCGCACGGTGCAGGCGCAGCACTCCCGCCTTATGCTGCTGCAAACTGAGCTAGAGCTGCTGAGGCTTAGGACTTATCCCACGCTCGCCTCCTTCCGAACTATGGACTAG

Protein sequence:

>DPOGS210891-PA
MVQEKSSENETGHEDPHGSYKPHVRWAIPQQLTAMTFIGRDVVAVAHDIYIIFVNLKTNAELVYAANKDECGDGVDVISDADVTRYKGLCMMESDLVAGFSGFPNYLISVWSWRTNQRLISVPTDVVRRNQIYTASHSNMLLCECWGEGLTVWEVAQCYKRCLIMKRVKKEVKGWEVSDPKLVAVCWSSDGQLYALDGSANLYSLMSDGIGMVTNLDWSENLEGSLEPCMCSFGNGILIYGPDNCLRSLKKGEKMWVEAWKYIPEDKVLCLVSNFTSDVAVMWTEKGFIYKLTGESEENIDVKLKIVKYVEGEDISFEASPTDPLLIIFGEVGQNYGIALFTFTSDGLEKVGTTCLTHQIVSRVVFSPTGREMIAAAMSAGHIFIFKISEDYKLSLVRYTELGRGLADCFLMKVGDSMRAFSLVLFSDKYAIGKDNKLAGKMAGPYGQLVPLLGAAGSAALAAPHISSRMHVLRMSGEKGVTVSVKMGPMLETGHSLKHFIFFINGNALLTFGADGTVVLRQPNESDEWQLQLIVSHRYQTGVNQAVIDAGNKYIAHLANNKTMAVHYLYRDKNSTEPQHAPPDDSLFTESINTITIINKNDKNYLDLQEDKKVREETMDYKHQREDVIKALGGVRERLVKLLEENLAERPLHQLSLSEFNLHLENKKERIKQAEKEREQIRLETEARIRAQDKVTEWIKHTCWDTMSSPRVKLFAIFSHYQVENYAVLPTQRDNWPELNQIEALRTIEMENNKDVFRPWEEPKEERLSLDETTTLQLSRLGPESMVSMRDIRKSMESQMLSEVVEVEESVESEPYVLSGSNAHRFIQIPSFMIPQTMAYSFLHMDWLQHIVKLNVQNMRLWFNKQFDDLMSQKKREVGLVNDRNNRLRFILDELNKLSDLRGSFHHLTNLIRDPEWAQEEQPYRLIKVEPEECSIEPYVSPSQVVIPPPEPKPKDDFRERALMYMMDGVLEKLWHEEIKKPIPMPQCMLEKEPEHFNEDDLKLVFDYEAKVAFRNEERDKYRKMLHAEYAKLSQILNEGIVKFNQKVKETWLTKLRVDSVIGQENLNLMRLRRTNLDRVEVSEKLEDIRFEIKITEDELEVLQHEMHAIQEQSEECQTSYENLLQKDKYSDRTFKNHFADLSPIIIEQCYKFFKKRPKWHQRATMIPVVLYELATAVLTGVRPALLHADCVEYFKGVEQLDQISNMPPVMDEGLWATMCRLRRAKIENEIRMRAVCIEMSLVESAANTWRAGVTARRLKLAHAADSITCLRRDYELAARNHTVQIVLPAGQVEIVSTGHFEDFEDAALIPKDEVEKVNNVILQVGEWKLKMMRKQIEFRKGILSKEWEHAQMKMKLRHMEQELYSYQRLKVPKELQSYLKNKELGYTDEQDYVRMEKEMEASKTSVNKILNEEIRKVEEIELKIDAVEAQAQELEKLITSLNVKVSEKRLNEDPLEPIRIRRVFKRRMETLVCRSQLVRTVQAQHSRLMLLQTELELLRLRTYPTLASFRTMD-