Monarch geneset OGS2.0

DPOGS202022
Transcript        DPOGS202022-TA    4752 bp
Protein           DPOGS202022-PA    1583 aa
Genomic position  DPSCF300053 - 838655-847044
RNAseq coverage   472x (Rank: top 26%)
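
The lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the transcript is coding sequence without UTRs (the nucleotide sequence in this record starts with ATG and ends with TAA, which fits that reading) and that the genomic coordinates are 1-based and inclusive, which the record does not state. All variable names are hypothetical.

```python
# Values copied from the record.
transcript_bp = 4752
protein_aa = 1583
genomic_start, genomic_end = 838655, 847044

# Assumption: the transcript is pure coding sequence ending in a stop codon.
assert transcript_bp % 3 == 0
codons = transcript_bp // 3      # total codons, including the stop
coding_codons = codons - 1       # codons that encode an amino acid

# Assumption: coordinates are 1-based and inclusive.
genomic_span_bp = genomic_end - genomic_start + 1

print(codons, coding_codons, genomic_span_bp)
```

Under these assumptions the codon count minus the stop codon equals the listed protein length. The genomic span is longer than the transcript, and the difference would correspond to introns.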
Annotation
Heliconius        HMEL016773         0.0     79.00%
Bombyx            BGIBMGA012531-TA   0.0     75.73%
Drosophila        Bub1-PA            2e-88   44.71%
EBI UniRef50      UniRef50_E2C8G3    3e-104  50.75%  Mitotic checkpoint serine/threonine-protein kinase BUB1 n=1 Tax=Harpegnathos saltator RepID=E2C8G3_HARSA
NCBI RefSeq       XP_001864028.1     5e-97   40.67%  Bub1 [Culex quinquefasciatus]
NCBI nr blastp    gi|307192505       1e-103  50.75%  Mitotic checkpoint serine/threonine-protein kinase BUB1 [Harpegnathos saltator]
NCBI nr blastx    gi|322803058       2e-178  29.53%  hypothetical protein SINV_01108 [Solenopsis invicta]
Group
Gene Ontology     GO:0016772  2.9e-37  transferase activity, transferring phosphorus-containing groups
                  GO:0005524  2.2e-21  ATP binding
                  GO:0004672  2.2e-21  protein kinase activity
                  GO:0006468  2.2e-21  protein phosphorylation
                  GO:0004674  6.7e-11  protein serine/threonine kinase activity
KEGG pathway      cqu:CpipJ_CPIJ013695  1e-96
                  K02178 (BUB1) maps-> Meiosis - yeast
                                       Cell cycle - yeast
                                       Progesterone-mediated oocyte maturation
                                       Cell cycle
                                       Oocyte meiosis
InterPro domain   [1-1557]     IPR015661  2.6e-108  Mitotic checkpoint serine/threonine protein kinase, Bub1
                  [38-162]     IPR013212  6e-42     Mad3/BUB1 homology region 1
                  [1295-1576]  IPR011009  2.9e-37   Protein kinase-like domain
                  [1300-1509]  IPR017442  2.2e-21   Serine/threonine-protein kinase-like domain
                  [1300-1563]  IPR002290  6.7e-11   Serine/threonine-protein kinase domain
Orthology group   MCL17036  Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202022-TA
ATGGATATAGATGTTAGTAAGGAAAATATCCAACCTCTAAGAGGTGGCAGAAACCTTATGCAGCTGGGCACTGCTTTGCAAGCTCAGTCAGACAGTGATGTTCAAAGACAACTACTATTGCAAAAAGACGAGCATGAAGAAGCTATAAGACAGTACCAAGGCCCTGATCCCTTGGAGCCATGGTTTAATTATATCCAATGGGTGGAACAATCTTTTCCTAAGCATGGCCACGAAGGGAATATTGATAAGCTAATCAAAGACTGCCTACAGTTGTTTGAAAAAGATGAGAGGTACTTCCAAGATAGGCGTCTTGTTAAATTATGGATCAAATATGTGGACTGTCTCTCCAATCCTCTTGAGATATATCAAAGGTTATATAACACTGGCATTGGTGTTGAGTGTTCAGAGTTTTACCGAGCCTGGGCCTGTTATTGCGAGGAGTCTGGAGATTATAAGAAAGCAAACCAAGTGTACATGTTAGGTCTGCAAGCTAAGGCTCAGCCATTGGATGAATTGGAACAGGCTCATATGAACTTCCAGTTATTTTTTGCCCAGCGTATGTTACATGATGATTCACCAACTAAACGTAAGGCAGCTTCAGCGCTTGCCGAAACCAGAATGGCTCTAACATCGCTTAAATCATTCAAGAGGAGAAACATTGCTAATGTCCCAGTACAAAGAGTCGGTGACAGTGTTAAGAGCAGTCTACCAGGTGTTGTAAGACAACAGGCCGTTGATAACAGATTACCCAACTCTAATGTAATGGTTAATGTGTATGAGGATGCTCCGTCAACATCGAGAGGAGTTGTGACAGCTGAAGATCCCGGCCCGGCATCACTGGTCCAGGCCTGCAATGTGGAAAATGAAAAGGAGGTTGGGATATGGACCAACCCCAAAACGAAGATGGTTCACACCAATGTCGTACCGCACCAACCTTTGCCTTTTACTCCATATGAAGATATGGACGATGATTTAAAACTTTCTTCCAAGCATATGCCTTATTGCTTAGATGACTTAACATTTAATGTGCCACTGTGTGTACCTGATCCAGCCGATCCAACGAAAATTCCCTGTTATAATAAAGCCCAGGTATATTTTAACGATAAAGAATACAGTTTAGAAGAAATAAGAGCTAGAAAATATAATGTACAAAAAGCCGTTAAAACAGAAAATGTGCAAGTACAGGAAACCTATAAGAATATAAACGAAACGTTAGCACAGTGCAGTGCATTAGAAACTTTAGCCAATTGTGCTTTGGACTCTGAACAAGATCATATGGGACAATTAATGCCACTAACAATGCCAACACTACAAAATGTCACCAAAGTAACAAATATGCATTCACCGGGAGAGCCTAAAGTATTGTGTAATTTAGACTCGAAAGAAGTGAGCGAGCTTGCCGCTAAGAAAAACGACGAAAATGAAATCCCATCGAGTTCATTAAGCGACAAGGAAAATCAAGTTGCTAATCACTTCAACAACGAGAACGTTGGTAATTTTGGTAAAAATAATTTAATGGAAGAATTCAATAGAAGCTTAATGGGGAACCTTTTGGGTGATTCAGTGACTGTTAACACAAAGGAAGCGCGCTGGGAGTTGCGTAACATATTTAACGACAATGCGGAGCCATCTATAGTTCAGCCGGTTGTTCAACAATTTGACGTTCCCAAATTTGATATACACGAAGACAGATCTATGACTATGGCGATTAATGCGAAGAAAAATTTTGATATTCAGGATGCCAGAAATATGCCAGAGGATAAAGAAAATGCTAATAAATTTAACGCTATTCAGCCAAACGTTACCTCGCAAGCACCAAACGTGTTTAATTTCAACATTAAAAACGCTAGTACTCCAAACATGTCCCAATTTAAGAAACCATCAAATATTGATCAGTCCAGTAAATTCGCTGTACCGAAATTTAGCATTGACGAAAGTGTCATTGAACAGCCTGATCAAATTAGTACTAAGAGAGACGATTGTTCTAGACAAACAAACGA
CCAACACGCAACTGTTGACAATCAAGGTGGCGGATTATCTGTTATTATGGAAGCTACAAGGGAATACAATAGCAAATCTGGTTCTAGTTCATCGGGCCAGTCAATAAGGACTAATTTTACAGGCTACACTACCAACTTTGACTCCGTTTATAATAACCACAATGATCCTAATCATCCGAGTACAACATCAAAAAGGAATTCATTATCAGCTCAAGCTAGACTGCCGAATGGTCAATTCGCTCGAGCGTACCCACAGAAGAGGGAAGCGGAAAATAAGATGGCAGTTACACCGTCCACCTCAGTGCCTTATCATTCTCACGATCACCAATACCAAAAACCTATGCCGCCAAACTTCAGTGGTTATTCGCCACAACGCTCGATACATTATCAGCAGAACTATAACTATCAACAAAACTTTCCCAACAATCATATGATGAACCAACAGCAACAAGGTTACGGGAGTCCTAACCCGAGTTCCTATCAAAGTCCTCAGCATCCAAGCATGCAAAATTCCCACGAAATGAATCAAGTACCAACTGGATTCCAAAGTCCAACTTATTCTAATCAAATGGCGTACCATCAGTCGCCGGTGAATAATCCTGGAAATGTAATGATGAGTCCCCAACAAGGTTTCACCGGACGACAGGATTTCCACTACAGTAATCATACGGACCGCCATGTTTACGTAAACCAACAACACCAAGTCTTCCAAAGTCCTCCACATCAGACTCAGTTTCAAAACTCTGTTTATTACCAAAGATCAAGTCAGCCACAGACACAAAATTCGTACAATCAACAAAACTATCACCATATCCAAAGTCAATATAATAATCCTAACATGTATTCGAGTAGTGGTCAAAATTCTAATAATCCGAATTTCAATACAGTACAAAGTCCATACAGGCAAGTGCCGAAACAAATTGAGAACCAAGCTTCTTACGGAATGTCAAACAACCAACAATTCCAAATATATCAAAGCCCTCAAGCTTCCCAGAACAGTAACCTCCGCAATACATCAATGCATGACGCAAACATGGTTCAAGGTTATCAAGAAACACAGGGACGGCAAGAAATGGTTGATTCGTTGGGGAAACACGAACGAGATGTTGAAAACACAACTCTCATTAATAACCCGCAGTCCAAAAATTCATCTTCACATAAAAGTCCAAACAATTCAGTATTAAGAAACGTAAGATACGACCAGCCTAACGTTAAAATTGGACAGAAATCACCAGAGGTAGGATTTTCTAATCAATTCTTGAACTTCATATCTAATAGAAACGAACCCAAGGACAATGCGAATACACCGAAATTTACGAATAGCCCAAGTATATCTCAGAAAATGCATAAAAACTTATACGTGTCGAGTCCTGAGCAGGCTCAGATTCCTCCTTCGTCAGGTCTTTCGGACACTGACAGCAAAGATGGTATGACTGCCCAAACAGCTACACCGATACAATCTGCCAAAATTCTCACTATCGATAAACAAAAGGACATTTCAAAAAGGCAGTTAGATTTCGAGCACAGAGATAGAGTCGAGATTCAATCGGAAGATAGCAGGGACTCGGTCAGTAAGGACAGTAGGATATCATCAGTGTATTCGCGACAATCCGATTTCCAGTCCGACGGCTACGGTATGGATATTGACAGTGAAAACTCTATGGAGTGTGGCGCCTTCAAATCTTCGCATTCGATCTCCATGATTGAAACCAGTGATATTCCACGACCCGCTGATATAGACTTCCCCAAGGTCATCGATCCATTCAACAAGAAGATGTTAACATCGTTATTGGAATATGTAAAATTTCCCAACAAGACGCACGCCGAGGGATACAACGAAGTCAGATCTATACCTAAATTGCAAACAGCAACGGTCATATCCGTCGGCAATAGTAAATACTCCATAGAAAAACAACTGGGCAAAGGAAACTATGGTGCTGTGTTTCTGTGTCTAGATTTCCACAGCAATAAGTCTGTTGCAGTGAAATATCAGAAACCAA
GTCGTCCATGGGAATTCTACATTTGTCAGGAAATAAAGGCTAGGATAAAAGATCCCTTCATGCTACCTGGTTACATGGAAATATCGACAGCTTTCCTTGGCGAAAACGCCAGTTTGTTCGTATCAGAATACTCACGTTATGGGTCCTTGTTGGACGTAGCCAACAAAGTTAGGATTGCTACGTCAAAGTGCATCAATGAACTCATAGTTATTTTGTTAACATCAGAAATGCTGTCTATAGTACATTATCTCCATAAGGCGCAAATAATTCACGCCGATATTAAACCTGACAACTTCCTATTAATGAAGATTCCCACTCAAGAATGGAGGACTCCTTCCTTACAGTTAATTGATTTAGGATGTGCTATAGACATGTCACTGTTCCCAGAAGGAACTACATTCAGAGAATTAATCGCTACTGAGGGATTTACTTGCACTGAAATGAGAGAAGGGAAGCCTTGGACCTATCAGACTGATTTATATTGTCTCGCTGGTACAATACACGTCATTTTGATGGGTAGCTACATGAAAGTAGCTAATCGATTAGGACAATGGAATATAGATAAGAAATTACCAAGGTATATGAAGAATAGCCTTTGGGATAAAATATTTACGACTCTTCTAAATGTGCCTGATTGTAACAATATACCGGATCTAATGGAACTGAAGAACGATGTTGACAACGTACTAAATCAAATTGACTGTCTCGGTTCACAGCTCCGTAACTTTGCTAATGTGCTTAAATCTAGGTAA

Protein sequence:

>DPOGS202022-PA
MDIDVSKENIQPLRGGRNLMQLGTALQAQSDSDVQRQLLLQKDEHEEAIRQYQGPDPLEPWFNYIQWVEQSFPKHGHEGNIDKLIKDCLQLFEKDERYFQDRRLVKLWIKYVDCLSNPLEIYQRLYNTGIGVECSEFYRAWACYCEESGDYKKANQVYMLGLQAKAQPLDELEQAHMNFQLFFAQRMLHDDSPTKRKAASALAETRMALTSLKSFKRRNIANVPVQRVGDSVKSSLPGVVRQQAVDNRLPNSNVMVNVYEDAPSTSRGVVTAEDPGPASLVQACNVENEKEVGIWTNPKTKMVHTNVVPHQPLPFTPYEDMDDDLKLSSKHMPYCLDDLTFNVPLCVPDPADPTKIPCYNKAQVYFNDKEYSLEEIRARKYNVQKAVKTENVQVQETYKNINETLAQCSALETLANCALDSEQDHMGQLMPLTMPTLQNVTKVTNMHSPGEPKVLCNLDSKEVSELAAKKNDENEIPSSSLSDKENQVANHFNNENVGNFGKNNLMEEFNRSLMGNLLGDSVTVNTKEARWELRNIFNDNAEPSIVQPVVQQFDVPKFDIHEDRSMTMAINAKKNFDIQDARNMPEDKENANKFNAIQPNVTSQAPNVFNFNIKNASTPNMSQFKKPSNIDQSSKFAVPKFSIDESVIEQPDQISTKRDDCSRQTNDQHATVDNQGGGLSVIMEATREYNSKSGSSSSGQSIRTNFTGYTTNFDSVYNNHNDPNHPSTTSKRNSLSAQARLPNGQFARAYPQKREAENKMAVTPSTSVPYHSHDHQYQKPMPPNFSGYSPQRSIHYQQNYNYQQNFPNNHMMNQQQQGYGSPNPSSYQSPQHPSMQNSHEMNQVPTGFQSPTYSNQMAYHQSPVNNPGNVMMSPQQGFTGRQDFHYSNHTDRHVYVNQQHQVFQSPPHQTQFQNSVYYQRSSQPQTQNSYNQQNYHHIQSQYNNPNMYSSSGQNSNNPNFNTVQSPYRQVPKQIENQASYGMSNNQQFQIYQSPQASQNSNLRNTSMHDANMVQGYQETQGRQEMVDSLGKHERDVENTTLINNPQSKNSSSHKSPNNSVLRNVRYDQPNVKIGQKSPEVGFSNQFLNFISNRNEPKDNANTPKFTNSPSISQKMHKNLYVSSPEQAQIPPSSGLSDTDSKDGMTAQTATPIQSAKILTIDKQKDISKRQLDFEHRDRVEIQSEDSRDSVSKDSRISSVYSRQSDFQSDGYGMDIDSENSMECGAFKSSHSISMIETSDIPRPADIDFPKVIDPFNKKMLTSLLEYVKFPNKTHAEGYNEVRSIPKLQTATVISVGNSKYSIEKQLGKGNYGAVFLCLDFHSNKSVAVKYQKPSRPWEFYICQEIKARIKDPFMLPGYMEISTAFLGENASLFVSEYSRYGSLLDVANKVRIATSKCINELIVILLTSEMLSIVHYLHKAQIIHADIKPDNFLLMKIPTQEWRTPSLQLIDLGCAIDMSLFPEGTTFRELIATEGFTCTEMREGKPWTYQTDLYCLAGTIHVILMGSYMKVANRLGQWNIDKKLPRYMKNSLWDKIFTTLLNVPDCNNIPDLMELKNDVDNVLNQIDCLGSQLRNFANVLKSR-
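
The two sequences above are in FASTA format: a header line starting with ">" followed by the sequence, which may be wrapped over several lines. The following is a minimal sketch of reading such records. It assumes the trailing "-" on the protein sequence marks the stop codon, which the record does not say explicitly. The function names and the toy input are hypothetical and stand in for the full sequences.

```python
def read_fasta(text):
    """Return {header: sequence} from FASTA text, joining wrapped lines."""
    records = {}
    header = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = []
        elif header is not None:
            records[header].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


def strip_stop(protein):
    """Remove trailing stop markers ('-' or '*') from a protein sequence."""
    return protein.rstrip("-*")


# Toy input in the same layout as the record (not the real sequences).
example = """>toy-TA
ATGGAT
ATATAA

>toy-PA
MDI-
"""

seqs = read_fasta(example)
print(seqs["toy-TA"], strip_stop(seqs["toy-PA"]))
```

Applied to the real record, the same approach would give a transcript string whose length can be compared with the listed 4752 bp, and a protein string whose length after stripping the stop marker can be compared with the listed 1583 aa.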