Monarch geneset OGS2.0

DPOGS210728
Transcript: DPOGS210728-TA, 3609 bp
Protein: DPOGS210728-PA, 1202 aa
Genomic position: DPSCF300013 + 65847-76968
RNAseq coverage: 174x (Rank: top 50%)
Annotation
Heliconius: HMEL002418, 0.0, 62.24%
Bombyx: BGIBMGA006258-TA, 2e-156, 47.47%
Drosophila: CG16908-PA, 6e-48, 21.06%
EBI UniRef50: UniRef50_D6WAQ7, 1e-82, 28.94%, Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WAQ7_TRICA
NCBI RefSeq: XP_966792.1, 4e-79, 28.70%, PREDICTED: similar to Uncharacterized protein KIAA0406 [Tribolium castaneum]
NCBI nr blastp: gi|383859034, 4e-90, 26.47%, PREDICTED: TELO2-interacting protein 1 homolog [Megachile rotundata]
NCBI nr blastx: gi|383859034, 2e-90, 27.02%, PREDICTED: TELO2-interacting protein 1 homolog [Megachile rotundata]
Group:
Gene Ontology: GO:0005488, 1.2e-09, binding
KEGG pathway:
InterPro domain: [379-1059] IPR016024, 1.2e-09, Armadillo-type fold
Orthology group: MCL12419, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210728-TA
ATGAACGCACACTTGAAAGAGGCATTTACACGCATAAAGCCAATATGCGACATGGTGATGGTAAATCCATCGCCGGAGCATATAACAGCGTTTGCAGCACTGGTAGGAGAGCTAAAGATGGAAATAATCCAAGAGCTGCAGCAGTATATGCTGTTTCCGTTTATCACACACTTACAGTCCAAAGAAATGGAAACAAAATATGAAATGCAAGGACTGTTAATAGACAGCATGCGGGAAGTGCTACAGAGGGTGTGTGTAACCAGTTTTGAGATGTGCATGAAGATTGAAATGGGCCTTTTGAGTCTCGTGTTTGAAAAACCAAAGCCGGGAATGATTGCGGATGTTCCCGAGGAACTCAAACTAAGTGTGATGCAGTGCTTGACGGTGTTGATGTTGCATATCGATCAGCCGACGAGGGTGAAGATGCTGGAGAACCAAGTGCCATTGTTGGCTCAAGCTGTGTTTGTGTCGGTGCACCTCGCAAAGTTAGAGAAACTAAGGAGCTTGAGGCTAGCTGCCATAACGTGTCTGTGTGCGCACACGGGCTGCCACCCTCAGCAGACGGACGCGCGTGGTCTAGTTCCAGACCCCGCTATAGAGACTGCTGTCTTAGGGCTGCTAGCATGCATCCTGCCGGGCGTGCTGGCTGCGCTGCAGGACGTTGCCATGAGTGACAACCCTGGACATGCAGTTGTTGTTGCCGCACTGAATGCAACCCATCGCGTGCTGTGCCTCACGATGCACAACAAACACTTGGCGACCAAAGAAAACATCACGGCCGATGACTTCGTGGCCATGTTAGCCGAAAAGGCCAAGCCCTCCAATGATGTAAGCAAGGAAAGCATCAGAGATAGTCGGCTAGGTGCGATCATAGCGGCAAGAGATTATCACTCGCTAGCCGAGTTCATACGATCTGGTAGGGACCAAACGGATCCTCCCAGGAGGGAGAGAAAGATTCCCAAACAAAAAAAGAATAGAAATCAGAATGCCAACCAAAAAAACGTTAATTACATGGAAAGGCTCGGCAGCTATGAAAGAACGCCGAATACTTCACAGAATGCAGCGAAGGCACCCCGCGATATTCCCAAGCGGACATCCGAATGGTACACGATGGCCGGGGACAAACTGGCCATAGTGATCAAGAGTCTCATACCGCTGGTCTCTCACGAACACTTCAAAGTGAGGAAGGAGCTCGCCATACTCTGCTATAGGATCATATCAGAGTGTAGTGCCACCATGCAGCCTTCTCTGCCGATGTCTTTGGATGTACTGATCTCCCTCTGCCATGACTCCTACCAGGAGGTCTCGGACTACTGTGATGCAGCACTGAAGGCACAGTTTTCGAACCCCGAGAGAGAAACTATGGACAGTCTCTGTGAGAACTTCTTCGCCACCATCAACTGCTTGCCGAGAATTATGAACAATATTGACGAGAACCGCAAGCTGTCCGCCCTGAACCTGATAGCCGGCTACCTCACCATCCTGTGTGACGGCGGCCGCCCCCAGCGCCTGACGAGTCTCTTGACAGCTAGCGACGGCTTCGACCGCGTGTGCGACGCGCTAATCGCTACGGCTGACATGTACACCGACATGTCTCTCCTCGCCAGACCGGCCGGAGGCGACATCACAGGGATGTCCTCGTGTGAGTTGTCCGGTCCGTGTCCCTGGCGCCGTCTCCGTCACCTGTCTCCGGCTGGAAGTCAACAGCTGCAAGGCCTGCTGGCGTCGCTGGGCGCGGCGGAGTGCGCGGAGCTACTGCTGGACAGATTTCTGGAACTCTTCCACGAGCGACGCTCCTGCGACCTCATATACATCATCAACTATCTTGGATCAGGTCCAGATTCAAACCCGGATCTAGCTCGTCGCATCATCAGCGTGTATATAACCGAGGACGTGTGGTACCAGCCGCTGGAGGTGCAGAGCGGGGAGAAGCCGCTCTCAGCGGACGAGACCCTCGACGAGAGCATTTACAACCCACGGTCCTGGACCAGGGACACCGT
ACCCGGTCTGTTCGAGGGCGCCATAGAGACTCGCTTCACCGACATCAGCTCGACCCTACCGCGGGTCCGCCTGGAGCCCAACACGTGCGTGACCCTGGGACACGCGCGGAGGAACCTCACCAGGAGCTGCCTCCTCACTGAGGGGCTCGGCCTCATGGCGCTGCGCCTGGGCAGGGACTACCAGCAGTACCTACTGAAGACACTGTGTCTCTTGTTGGAGAGAGTCGGCAGCCGGTACCCTCCGCTCCGCTCATCGGGTCTCATCGCCCTGCAGCAAGTGGCCGCGGCCACCGAGGCCAGTGACGTCACAGACCTCATCGGCAGGAACGCCGACTACTTCACCAGCCAGGTCACCGGCCGGCTTAAGAAGGCCTGGAACACGCAGTCGGCCCTACAGATACTGTCAGTGGTTATGGAGTACAGTGACGTCACTATACTGGACTACTTGTACGGGATAGTGGAGGACGCCTGGAACACGCAGTCGGCCCTACAGATACTGTCAGTGGTTATGGAGTACAGTGACGTCACTATACTGGACTACTTGTACGGGATAGTGGAGGACGTGTTAGTGCAGAGCTGCGACCAGTACTACGAGAAGAATTTGTATTCATATTTACAAGTGTTCCTGACATTCATAAACTGCATACGGAAATGGTTTCTGATCCAAGACACGGTCAAGAGGAAGGAGAGTCACGGGCTGGAGATAGACGTGCTGAAGGACGTCATAGAGTTCGCCAACAACAAGGAGGAAGTAGAGAGGTTGTTGAACACGAAGGAGTTCGAGGAGGAGACGGGCAGGAGTGTGGAGGAGATGTACCAGGAGGACCTGCAGCGGAAGGAGGAGGACCTCCTGGACTACGACGACACCGTGACGCAGGAGGCCGTACCTCTGCCACAACACGTGCGGGTCACGATCACTATACTGAAGCGCTGCGTACACTTCGTGTCGTACAAGAGTCGAGACGAGGCGCTGGTGGCGATGGAGGCGCTGTGGCGAGGCCTGGAGCTGCTGCGAGGACACGACGACGAGCTTCTGCCGCTGGTGCACGCGCTGTGGGAGCCGTTGGCCGCCCGCCTGCAGGCCGAGCCCGTGCTGGCGCGAGCGGCGCTGCGGGTGTTGGCGCTGGTGGCCGACCTGGCCGGGGACTTCGTCCGGGAGAGAGTCGTCAAGGACGTGCTCCCCCGCGTGTGTTCGGTGCTCCGCTCGTGTTCTCGCCGCAGCGTGTTGGCGGACGCGGGGTCGTCGTACCGCCTCACGGCGTCGTACTCCCTTCAGCGCTCCGCCCTCGAAGCGCTGGGTCCGCTCGCCGCTAACGTGGGGCTCCGGGGCATCGCGCTCCTCGACGCCCTGAAGGCCGGGGCGCTCTACCTTCACAAAAACCAACCCAAACCCCTACAGCTGTTGGCGGTGAAGTTCTTCAAGGACATGCTGGAGTACGACTACGGCAGCTCGTGGCAGTTCCTCCGCCGCCTCGCCAACAACAAGCAGCCGCTGACGCCGCCCGCCAACCGGTTCCTGCACCTGGAGCCCGTCGTGGGCTCGCCCTACGAGTGTACGGACCCGCACTACGACAACAACATCAAGCTCATCTTTTACGTACACAAGTGA

Protein sequence:

>DPOGS210728-PA
MNAHLKEAFTRIKPICDMVMVNPSPEHITAFAALVGELKMEIIQELQQYMLFPFITHLQSKEMETKYEMQGLLIDSMREVLQRVCVTSFEMCMKIEMGLLSLVFEKPKPGMIADVPEELKLSVMQCLTVLMLHIDQPTRVKMLENQVPLLAQAVFVSVHLAKLEKLRSLRLAAITCLCAHTGCHPQQTDARGLVPDPAIETAVLGLLACILPGVLAALQDVAMSDNPGHAVVVAALNATHRVLCLTMHNKHLATKENITADDFVAMLAEKAKPSNDVSKESIRDSRLGAIIAARDYHSLAEFIRSGRDQTDPPRRERKIPKQKKNRNQNANQKNVNYMERLGSYERTPNTSQNAAKAPRDIPKRTSEWYTMAGDKLAIVIKSLIPLVSHEHFKVRKELAILCYRIISECSATMQPSLPMSLDVLISLCHDSYQEVSDYCDAALKAQFSNPERETMDSLCENFFATINCLPRIMNNIDENRKLSALNLIAGYLTILCDGGRPQRLTSLLTASDGFDRVCDALIATADMYTDMSLLARPAGGDITGMSSCELSGPCPWRRLRHLSPAGSQQLQGLLASLGAAECAELLLDRFLELFHERRSCDLIYIINYLGSGPDSNPDLARRIISVYITEDVWYQPLEVQSGEKPLSADETLDESIYNPRSWTRDTVPGLFEGAIETRFTDISSTLPRVRLEPNTCVTLGHARRNLTRSCLLTEGLGLMALRLGRDYQQYLLKTLCLLLERVGSRYPPLRSSGLIALQQVAAATEASDVTDLIGRNADYFTSQVTGRLKKAWNTQSALQILSVVMEYSDVTILDYLYGIVEDAWNTQSALQILSVVMEYSDVTILDYLYGIVEDVLVQSCDQYYEKNLYSYLQVFLTFINCIRKWFLIQDTVKRKESHGLEIDVLKDVIEFANNKEEVERLLNTKEFEEETGRSVEEMYQEDLQRKEEDLLDYDDTVTQEAVPLPQHVRVTITILKRCVHFVSYKSRDEALVAMEALWRGLELLRGHDDELLPLVHALWEPLAARLQAEPVLARAALRVLALVADLAGDFVRERVVKDVLPRVCSVLRSCSRRSVLADAGSSYRLTASYSLQRSALEALGPLAANVGLRGIALLDALKAGALYLHKNQPKPLQLLAVKFFKDMLEYDYGSSWQFLRRLANNKQPLTPPANRFLHLEPVVGSPYECTDPHYDNNIKLIFYVHK-
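
The transcript and protein records above can be checked against each other. The following is a minimal sketch, not part of the original record. It assumes the transcript is a complete coding sequence translated with the standard genetic code, and that the trailing "-" in the protein record marks the stop codon. The helper names `parse_fasta` and `translate` are hypothetical and introduced only for illustration.

```python
from itertools import product

# Standard genetic code (assumed here), listed in T, C, A, G order
# with the third codon position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def parse_fasta(text):
    """Return {record_id: sequence}, joining sequences split over several lines."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = []
        elif name is not None:
            records[name].append(line)
    return {key: "".join(parts) for key, parts in records.items()}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol (the record above uses "-");
    codons containing unexpected characters become "X".
    Trailing bases that do not fill a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    protein = []
    for i in range(0, usable, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)
```

With the two FASTA records saved to a file, `translate(records["DPOGS210728-TA"])` can be compared with `records["DPOGS210728-PA"]`. The stated lengths are at least consistent with this reading: 3609 bp is 1203 codons, that is, 1202 amino acids plus one stop codon.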