Monarch geneset OGS2.0

DPOGS207899
Transcript: DPOGS207899-TA (3315 bp)
Protein: DPOGS207899-PA (1104 aa)
Genomic position: DPSCF300101 + 426364-446682
RNAseq coverage: 376x (Rank: top 32%)
Annotation
Heliconius:      HMEL006404        0.0      83.51%
Bombyx:          BGIBMGA008490-TA  0.0      72.14%
Drosophila:      ASPP-PB           1e-111   48.57%
EBI UniRef50:    UniRef50_D2A1K0   1e-131   55.35%   Putative uncharacterized protein GLEAN_07089 n=2 Tax=Tribolium castaneum RepID=D2A1K0_TRICA
NCBI RefSeq:     XP_968601.2       2e-132   55.35%   PREDICTED: similar to apoptosis stimulating of P53 [Tribolium castaneum]
NCBI nr blastp:  gi|270005081      4e-131   55.35%   hypothetical protein TcasGA2_TC007089 [Tribolium castaneum]
NCBI nr blastx:  gi|383862741      9e-144   35.79%   PREDICTED: uncharacterized protein LOC100877563 [Megachile rotundata]
Group
Gene Ontology: GO:0005515   7.8e-14   protein binding
KEGG pathway: dpo:Dpse_GA16721   4e-18
  K06270 (PPP1R12A, MYPT1) maps to:
    Regulation of actin cytoskeleton
    Vascular smooth muscle contraction
    Focal adhesion
    Long-term potentiation
InterPro domain:
  [905-1033]   IPR020683   1.5e-28   Ankyrin repeat-containing domain
  [1038-1097]  IPR001452   7.8e-14   Src homology-3 domain
  [969-998]    IPR002110   7.7e-06   Ankyrin repeat
Orthology group: MCL15891 (Insect specific)

Nucleotide sequence:

>DPOGS207899-TA
ATGCTTATACCTCGCGAGCAATGTCGCGTGCAGCCTCTGAGTGAGAGGTCCGCATCTTCTCTGCCCCACGCTCGTCTAGCCAGGACACCTTCCATATCGTCCCAGAGCAGCCTCGACAGCGGTACATCAAGAGCGACTAAAGAAAACAACGTCCGTATCAAACACGAAGATAAACTGAGATATTTGTCATTTGCAAGTGACCTCGAGTCGATTCGGGCGTTATTCAATGAGAAGGAGAAGGAACTGTCGGTGGCAGTGGCGAAGGTGGAAGAGCTGACGCGGCAGCTTGAGGAGTTAAGGCGAGGAAGAGTCCAAACCGCACCGCCTTCAGCCCACGAATTGGATAAACTAAGACGAGAACTTATGAATGAGATGTTAAATGAAAATATATTAAATAGTCAACAGTCTTATTATCGGATAACTTTATCAATTTCTTTGACGATCGTTCTTTATGCGAAACGGCAAAATATGGAATCGGTCTCGATGTTGAGTAGAAAATTCGTTGACACGGAAAAAGTCATTCGTATGGAAGTGATTCAACCCGAAAATGGATTGGTGTACCGGAATAAGCTGAACGAGCAACAAAATGGAAGACTATCAGCACAGAGAGCAGCATTGGGAGCACGGCAGGAGGAAATGCGTAGTATTGATCGTCGAGTTTCTGAATTACAAGCAAGATTGTTGAGAAAACGAGCTTTGAATAGACAACTTGCTGCAGCACACCGTCAACCACAACAAAGGACAAACAATCCAACGCCGATGCAACAATTGCCGAACTATCCGCCTAACACAAACCAACCAAAGAATCAGCCGGCGAGAGGAAACGTGGCAGCGGTAGAACCTTACAATCATGTCCCACACGCGCAGAGCTTGCATAATAATAACTTCCAAGCAATGAAGCAGAATGTGATACAAAATAATGTGCCATTAAAACAGCTGACTCAAACTGATAATATTCAAGGTCACTTAACACAACAAGAAGCTCATTTCTTGCAACAAAAGCAAATGATGAACCCTCTGTACAATGGTCACTACCCACATATGCCTAATGTACAAGATAGTCAATATCAAGGGCAATATAATAATAAACAAGATGTTAACTCGTTTGCTCATGTACAACAAGGCATCACAAACTCTTACGATCAAAAAGCAATGTTTGATCATATGAATAAGTACAACGAATATCCAAAACAACCACAGTATAGTCCAAACAGTACGAGCAGTTCTAACAGCAATCAAACCGGTAAGGAATTAAAAATAAATGAGCAAGAATTCCCCCCAGAGTTTGCTGCAAGTAAATCCGACCCCAAATATCAAACATTGCCGTATAATACAAAGTTTCCTCAAAATGTCAACGGTAAATCGAAACAGCCCGATGTAAACGGCAAAAGCAATGAACAAGATAACAAAAACCAAAGCGCCCCGAATATAAATCATATGACGGTTCATTCTACGCCACTTTCAGTTGTCAATAAGAGTTTGGCGACTCCTCTAAATCAGAATGAAGCAGCATATCAACAGGAAAATACATATCAATTGCAAAAATCTGATTCATCGAGCAGATGTAATCAGGAAGGCAAAGAAAATCATGCATATCCACAAAACTCTAGATTAAGTCAAAATAATAGCCAAAATACTTCAGAAGGTTCACGGAATTCCGGAGGAAAGAATCAACAAAGCAGTGCCCTGAAAGGCAGTTCCCCAAGTTTGCTGTTGAATACTACAAGTGCTAGTAGTTCTATCGGATTTGGGAAACCCGTATCGAGTGTAGCACCTACATCGGTTCAGGTCTCGAGCGGCAAACCATCACCCATATATCAGACATCCTCAACAAAGATCCAACCAGTTCAACCACAAACCGTTCAGACCCAAAGCATTTCTGTACCACCATCAAATCATGTCACTAGTAATATGGCAAATTCCCAACCCCAGATATTAAGGAACACCGCATCAGGGCTATCAGCCAGCACAACTAGCACAAATTTCAGCGGTCCAAACTC
ATCATCACAAAGTATTTTGCTATCTCCACCTCAAAGTGCTAGTACTCCTTTATCGACGCCAGATGTTAGCGGGACTGACAAATCGCCTAAACCAGCTTTACCTCCTAAACCCACTATTAAGACTCCACCGAGGCAAACTATAAATCATGATGCAGGATTCAACCAATCAAATGAAACTATTAACATGCCAACAATAACAATGGCTGATTCATGTAATGATAACGAACAGCAGAATCACAGGGAGCTCAGCAATGGAAGCAGCAATAACGAAATGATTATAAAAGCTCGACCTTTGACGATAAGAAAACCGCCGCTCAGTGAACAACCCAAACTAAGAAACGTTAGCAATGTCAAAAATGGCATCAGCGTCAGCATGAACCGACGAATAGAAATGCCACCAGCATTCCTATTCCCAGAAATGGATCATTTGAATCTTGATTCCGTACCAAGTGATGTGCAAAACGGACAGAAGGACAAGTCGAAGCAGAAAGATGAAGTCGACAGATCGCTGAATAATAACATATCTTTAGTTTCCAATGATAGGGAAGATAGTAAAGATACTGTTATTACTGATATAACGGAGCAAGTTAGCTCGGTGGATTTGAACGGTCAGGATTCTCAGAGGTCTGAGAATATACTGAGGCGGTCAAAGAAAGGAAATTTGAAACAGGGCGGGAAAACCGCCCTCGCCAGGCGAGTCAGCTTCGACCCTCTCGCATTGCTTTTGGATGCGAGTTTAGAAGGTGAATTGGAGCTTGTCAAGAAAACTGCAACTCAGGTGCAAAACGCGAGCGCTGCAAATGACGAAGGGATCACAGCTCTCCACAACGCCATATGTGCTGGTCATTTCGAAATTGTTAAGTTTTTGGTGGAGTTAGGTTGTGATGTGAACGCTCAAGATTCGGACGGCTGGACGCCGCTCCATTGTGCCGCTTCGTGTAACAACCTGCCTATGGTCAGATTCCTTGTTGATAATGGCGCTTGCATATTTGCAACAACTCTGTCTGATCACGAGACGGCTGCTGAGAAGTGTGAAGAAGACGAGGAAGGTTTCGATGGTTGTTCCGAATACTTGTACAGTGTTCAAGAGAAGCTGGGTATCATGAATGGCGGTACGGTCTACGCGGTTTTCTCATATACAGCTTGCCGGCCGGACGAGCTGTCGTTCGATAGCGGCGCCCGGCTCCAGGTGCTGAGGAAAGGAGACGACAATGAACGGGAGTGGTGGTGGTGCCGCGACCACACCTCCGCCGAGGGCTACGTACCGAGGAACTTGCTTGGGTTATATCCAAGAGTAACCCCACAGCAGGAATAA

Protein sequence:

>DPOGS207899-PA
MLIPREQCRVQPLSERSASSLPHARLARTPSISSQSSLDSGTSRATKENNVRIKHEDKLRYLSFASDLESIRALFNEKEKELSVAVAKVEELTRQLEELRRGRVQTAPPSAHELDKLRRELMNEMLNENILNSQQSYYRITLSISLTIVLYAKRQNMESVSMLSRKFVDTEKVIRMEVIQPENGLVYRNKLNEQQNGRLSAQRAALGARQEEMRSIDRRVSELQARLLRKRALNRQLAAAHRQPQQRTNNPTPMQQLPNYPPNTNQPKNQPARGNVAAVEPYNHVPHAQSLHNNNFQAMKQNVIQNNVPLKQLTQTDNIQGHLTQQEAHFLQQKQMMNPLYNGHYPHMPNVQDSQYQGQYNNKQDVNSFAHVQQGITNSYDQKAMFDHMNKYNEYPKQPQYSPNSTSSSNSNQTGKELKINEQEFPPEFAASKSDPKYQTLPYNTKFPQNVNGKSKQPDVNGKSNEQDNKNQSAPNINHMTVHSTPLSVVNKSLATPLNQNEAAYQQENTYQLQKSDSSSRCNQEGKENHAYPQNSRLSQNNSQNTSEGSRNSGGKNQQSSALKGSSPSLLLNTTSASSSIGFGKPVSSVAPTSVQVSSGKPSPIYQTSSTKIQPVQPQTVQTQSISVPPSNHVTSNMANSQPQILRNTASGLSASTTSTNFSGPNSSSQSILLSPPQSASTPLSTPDVSGTDKSPKPALPPKPTIKTPPRQTINHDAGFNQSNETINMPTITMADSCNDNEQQNHRELSNGSSNNEMIIKARPLTIRKPPLSEQPKLRNVSNVKNGISVSMNRRIEMPPAFLFPEMDHLNLDSVPSDVQNGQKDKSKQKDEVDRSLNNNISLVSNDREDSKDTVITDITEQVSSVDLNGQDSQRSENILRRSKKGNLKQGGKTALARRVSFDPLALLLDASLEGELELVKKTATQVQNASAANDEGITALHNAICAGHFEIVKFLVELGCDVNAQDSDGWTPLHCAASCNNLPMVRFLVDNGACIFATTLSDHETAAEKCEEDEEGFDGCSEYLYSVQEKLGIMNGGTVYAVFSYTACRPDELSFDSGARLQVLRKGDDNEREWWWCRDHTSAEGYVPRNLLGLYPRVTPQQE-
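
The reported lengths agree with each other: 1104 codons plus one stop codon give 3315 nucleotides. A minimal Python sketch of that check follows. The helper names `expected_cds_length` and `read_fasta` are hypothetical and not part of the geneset distribution. The sketch assumes that the transcript is a complete coding sequence with no untranslated regions and that the trailing `-` in the protein record marks the stop codon.

```python
def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode protein_aa residues (3 per codon)."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


def read_fasta(text: str) -> dict[str, str]:
    """Parse FASTA text into {record id: sequence}, joining wrapped lines."""
    records: dict[str, list[str]] = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            header = line[1:].split()
            current = header[0] if header else ""
            records[current] = []
        elif current is not None:
            records[current].append(line)
    return {name: "".join(parts) for name, parts in records.items()}


if __name__ == "__main__":
    # Lengths as reported in the record header: 1104 aa and 3315 bp.
    print(expected_cds_length(1104))  # 3315
```

To check the sequences themselves, pass both FASTA records through `read_fasta`, strip the trailing `-` from the protein, and compare `len(transcript)` with `expected_cds_length(len(protein))`.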