Monarch geneset OGS2.0

DPOGS203796
TranscriptDPOGS203796-TA3852 bp
ProteinDPOGS203796-PA1283 aa
Genomic positionDPSCF300010 + 1619080-1626041
RNAseq coverage31x (Rank: top 75%)
Annotation
HeliconiusHMEL0125040.088.53% 
BombyxBGIBMGA003703-TA0.084.67% 
DrosophilaCG4329-PA3e-15729.98% 
EBI UniRef50UniRef50_D6WVH50.042.13%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WVH5_TRICA
NCBI RefSeqXP_971414.10.042.13%PREDICTED: similar to CG4329 CG4329-PA [Tribolium castaneum]
NCBI nr blastpgi|910881490.042.13%PREDICTED: similar to CG4329 CG4329-PA [Tribolium castaneum]
NCBI nr blastxgi|910881490.042.13%PREDICTED: similar to CG4329 CG4329-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055151.3e-32protein binding
KEGG pathway 
InterPro domain[390-685] IPR0110461.3e-32WD40 repeat-like-containing domain
[446-672] IPR0159431e-25WD40/YVTN repeat-like-containing domain
[505-544] IPR0016801.3e-06WD40 repeat
Orthology groupMCL13430 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203796-TA
ATGGCAACTAACACCCCACCACATATATCAGCACACATATTCTACGGCTTGAGAACAGATATTCAATATAATGCCCATTACATGACTGATTCCGAAATTATTTATCCTGCCGGTGGAGTTATTGTGATCCATAACCATATACAAAAGAAACAGAAGTTTATAAGACTTCAGGACAAACACAAACCAATCAAATCGTTGGTGTTGGCACCTAATAGACGTTGGTTAGCGTTAAACGAGATTGCCGAAGAGGGTCAAAAACCAATTATAACCATTTACGATCTAACTACATATAAAAGGCGTAAAATTCTGACTGTCCCTTTCGAGAATTCAACTGCTCGTGAATTTGCTTGTGTACAATTTACGTATGATTCAAAATATTTGGTAGCGATAACTGGCGAGCCAGATTGGTATTTATATTATTATAATTGGGATAAAGGCAAGGTCGAAAGTCACGCTAAGGCACAAAATCCGAGCGGACAAGGCACTGTTGAAAGCGTCCAATGTAATCCGTCGGATGCAACCCTTGTGGTTATAACAGGGCCCTACACATTTCGTATCATGAATGTCTCTGAGACTGTTTGGAGACAGTGGGGTTGGTGCAAGGCTGAAAATATCAACATAACCAGCTGTATGTGGTTGACATCCGATCGCATTATGTTCGGGACTGATGCTGGGGTCATCATGGTAGTGGAGAATGGTGAACTGAGACAGAACTGCATATTCCGTGCCACTGAAGTAACTGAGATGTCATTGAAGAAAGTTGACATCGAGGCGACAGAAAGCGAAAAAACCAGTACAGCCAGCGCTGAAGCCACCCCCACTGAATCAGGACTCGTGGACTCTGACAGCTGTCCAGTGATGTGCTTCATTAATTTCAGCAAAGGATTCGCTTATGCTTGTGGCCAAGGATATGTTCATATGTTTGAGAAAGAGTCACCGCACCATTGGCGGAAAAGAAATTTGTTTAGAATATCAAAGAAATCTTATAAACATACCCGAGAGCATCCATTGTGGTCGCCACTTGACGCTATTCAACATATAACAATTGATCCGAATCAAGAAACTCTTCTCATCACAACTCTGAGAAAGCAGCTCTATTACGTCAAATTGTTCGGACAACATATGCTTCAAAATCCAGAAATATCATTTACGGAACTCGGTCCGGCTATGCATTACGGAAGAATAAATTCTCTCTCTATGTGCGCGTGGAAACCAATTTTCATGACATCCGGTGAACTGGACAAGAGTATACGAATTTGGAACTACATGACTGATGACGTTGAATTGATTAAACTATATCAAGAGGAGATACATTGTCTGTCTTTACATCCTTCTGGGCTATTTGCCATAGTAGGGTTTTCAGACAAGTTACGATTTATGGTAGTACTAATTGATGACTTTGAAGTCATGCGAGAATTTCCTATACGTAACTGCCCTCGTGCGAAATTTAGCACCAACGGACACCTTTTCGCAGCTGTCAATGGTCAAGTTGTGCAAGTTTTTTCTTCGGTTTCATTTCAAAATGTTTATAATTTGAAGGGTCATAACGGAAAAATAACGTGTCTGGCGTGGTCAGCCAATGATTTAACACTAGTGTCCTGTGGCACTGAAGGCGCTGTGTACGAGTGGAACATGGCGTCAGGCCAACGAGTCGGCGAAGTTATATTAAAAACGAACCAATTTAAAGCCTGTGCCGTGAATAATAACGGTAAAACAACTTACGGCGTTGGAAGTGACGGCGAAATAAAAGAAATCGGCTCAAATACGATTCGTAGAAATCTTGGTTTGATCGGATGCGGTCTCGACACTATAGTCCTATCCCGTTCCGATTTAATGCTTTTCATTACCGGCGGCGAGGGTGGTGTCACTGCTGTGCAGTTACCATTACTAGACAAGGCCATTTACAACGAATTTCATATGCACAATAAAAATGTAACCGCCATTGCCCTGTCATACGATGATCAAACATTAGTTTCTGTGGCTGAAGACTCGTCTATTTGCTTATGGAGATTAACTAATGCTGACGGAAGAGCGATAGCTTTAGATAAAGACTTCGCATATTCCAAAGAGATTCTGATCAGTAAAAAAGATCTTCAAGAGAAAATTAACAGTATTAATTTACTCAGTACTAGGATGAGTGAATTGGAAACTGAACATACATACCAGCTACGCCAGGCTGAAGCAGCTCAGGCTGAGAAATTAAAAGAGGTTCATGAGGGATATTGTGCCGCTATAGAGGAACTAAAAGAGAAGAACGAGCAAATGGAAAATGAACATACCCACGAAATTGGCATGATACAACAAGATATCGCAAAGCTGCGTTCGGGTCATGAAAGAACCTTACAAGCTTATGAAGCAGACTTTAATATTCGCCTAATTAGCGAATATGACAGATACCAGAGTCTAGAAGACAAAACAGCTCGCATGAGGAAAGATTATGAACAACGATTAGATGATTTGGCGGAGAGCAAGCGGCAGGCTTTAAGAGAATTGAATAAAGCTTTTGAAGCGAGATTAGAAGAAAAAGATCTCATGCTTCAGGAGTTACAAGAACAAGCTGATATGGAGAAGAAGGAACATGAAACTATTAAGGCGTCTATTGAAGAAGATGCTGATCGCGAGATAATAGAGATAAGAACGGCCTACGAAGTTCAACTGAAAGAAGAAAAAGACGCAAATGTCAGGCTGAAAGGTGAAACCGGTCTGATGAAGAAAAAACTTATATCCGCTAATAAAGAGATCGATGAATTCAAACATCAAGTTTCACAACTTAAAGCCGAACACAAACAGTTTCAAAAAGTAATATCGACTTTGGAACGAGACGTCGCTGATCTTAAGAAAGAAATATCGGAGAGAGACGGCACAATACAAGATAAGGAAAAACGGATATATGAATTGAAACGCAAGAAGCAGGAACTAGAAAAATACAAATTCGTTTTGAATTTTAAGATAATTGAATTGAAAAATCAGATTGAACCCAAAGAAAAAGAAATTCGGGAGTTAAAAGTTCAGATTGATGACATGGAAAACGAAGAGCTAAAATTATTGAATACTAAACATGATCTTGCTGTATTGATTTCCGTTTACATCCAATCACAGATAGAGCCATTAGAGCGCGAGATCAAAGAAAGAAAAGTGCGTATTCTTGAACTGGAAGTGTCGTTGGAAACTCTTCTAAAGCAAAAAGATTTTCGCGAACTTAAGATCAATCAGTTGAATGAAAAGTTAGCATCGGCCAAGAAAGATTTCTTCAGTGAGGCGAATCGTAATTTGACTCTTAAGAACACTTTAAAGAAAATAAAAATTGATCTTCACAATATGACGGCCAATTTCCAAGATCCGACTCAGCTTAAACTGAGCGTTAAGGCGCTATTTCAAAAATATGTAGAGGACATTGACTTTGTACGGAGTCGCATGGCTGAGGATGAGGCGATAAGAGAATTCAATAGACAAAGAGATCACCTTGAAAAGCAGGTTGCAGGTCTTAAAATGCAACTATCGAAATCACTGGATGGGTCCAAGAGTGACATTGGAAAGATTATGGATGAGAATTGCACTCTGTTAGGGGAAATTAATAATCTTCGAAGCGAGTTGAAAGCTACCCGTACAAGGTGTTTTCAAATGGAGTCTATATTGGGTCTGTCAGCGCGTTACATCCCGCCCGCAACTGCGCGCGCTAAACTCAAACACGTCACAGAGGAGCGGGAAAAGCTTGATGAGAAATTTAAACAGAAAATCGAAGAGAGAGAGGAAATCATTGTCGCTTTAAAGGAAGAAAATGATCGTCTCCTTGGAAAAGTAAGATGTCCTGATGAAACAGAACCTTCTGAAAATGACACTGAAGAACAATAA

Protein sequence:

>DPOGS203796-PA
MATNTPPHISAHIFYGLRTDIQYNAHYMTDSEIIYPAGGVIVIHNHIQKKQKFIRLQDKHKPIKSLVLAPNRRWLALNEIAEEGQKPIITIYDLTTYKRRKILTVPFENSTAREFACVQFTYDSKYLVAITGEPDWYLYYYNWDKGKVESHAKAQNPSGQGTVESVQCNPSDATLVVITGPYTFRIMNVSETVWRQWGWCKAENINITSCMWLTSDRIMFGTDAGVIMVVENGELRQNCIFRATEVTEMSLKKVDIEATESEKTSTASAEATPTESGLVDSDSCPVMCFINFSKGFAYACGQGYVHMFEKESPHHWRKRNLFRISKKSYKHTREHPLWSPLDAIQHITIDPNQETLLITTLRKQLYYVKLFGQHMLQNPEISFTELGPAMHYGRINSLSMCAWKPIFMTSGELDKSIRIWNYMTDDVELIKLYQEEIHCLSLHPSGLFAIVGFSDKLRFMVVLIDDFEVMREFPIRNCPRAKFSTNGHLFAAVNGQVVQVFSSVSFQNVYNLKGHNGKITCLAWSANDLTLVSCGTEGAVYEWNMASGQRVGEVILKTNQFKACAVNNNGKTTYGVGSDGEIKEIGSNTIRRNLGLIGCGLDTIVLSRSDLMLFITGGEGGVTAVQLPLLDKAIYNEFHMHNKNVTAIALSYDDQTLVSVAEDSSICLWRLTNADGRAIALDKDFAYSKEILISKKDLQEKINSINLLSTRMSELETEHTYQLRQAEAAQAEKLKEVHEGYCAAIEELKEKNEQMENEHTHEIGMIQQDIAKLRSGHERTLQAYEADFNIRLISEYDRYQSLEDKTARMRKDYEQRLDDLAESKRQALRELNKAFEARLEEKDLMLQELQEQADMEKKEHETIKASIEEDADREIIEIRTAYEVQLKEEKDANVRLKGETGLMKKKLISANKEIDEFKHQVSQLKAEHKQFQKVISTLERDVADLKKEISERDGTIQDKEKRIYELKRKKQELEKYKFVLNFKIIELKNQIEPKEKEIRELKVQIDDMENEELKLLNTKHDLAVLISVYIQSQIEPLEREIKERKVRILELEVSLETLLKQKDFRELKINQLNEKLASAKKDFFSEANRNLTLKNTLKKIKIDLHNMTANFQDPTQLKLSVKALFQKYVEDIDFVRSRMAEDEAIREFNRQRDHLEKQVAGLKMQLSKSLDGSKSDIGKIMDENCTLLGEINNLRSELKATRTRCFQMESILGLSARYIPPATARAKLKHVTEEREKLDEKFKQKIEEREEIIVALKEENDRLLGKVRCPDETEPSENDTEEQ-