Monarch geneset OGS2.0

DPOGS213551
TranscriptDPOGS213551-TA1089 bp
ProteinDPOGS213551-PA362 aa
Genomic positionDPSCF300033 - 224441-230869
RNAseq coverage913x (Rank: top 14%)
Annotation
HeliconiusHMEL0138678e-4135.32% 
BombyxBGIBMGA011833-TA9e-18088.40% 
DrosophilaMYPT-75D-PA7e-11356.15% 
EBI UniRef50UniRef50_E2B8S42e-13371.18%Protein phosphatase 1 regulatory subunit 16A n=12 Tax=Neoptera RepID=E2B8S4_HARSA
NCBI RefSeqXP_395019.24e-13370.29%PREDICTED: similar to MYPT-75D CG6896-PA [Apis mellifera]
NCBI nr blastpgi|3320244575e-13369.71%Protein phosphatase 1 regulatory subunit 16A [Acromyrmex echinatior]
NCBI nr blastxgi|3072120263e-12971.18%Protein phosphatase 1 regulatory subunit 16A [Harpegnathos saltator]
Group
Gene OntologyGO:00055151.4e-05protein binding
KEGG pathwaymdo:1000117972e-51 
 K12329 (PPP1R12B)maps-> Vascular smooth muscle contraction
InterPro domain[29-280] IPR0206835.5e-57Ankyrin repeat-containing domain
Orthology groupMCL12545 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213551-TA
ATGCAGCAGTTGAAGCTGTGGCAGCAGAGGGAGAAGGAATGGGCGAGAACGAGAACGAAACGAGAGAAAAGTAACAAACGCAACATATACTTCAATGATAGTGTTATGTTACTTGAGGCGGCCGCCAGAAATGACATCGATGAAGTCCGACGATTGCTCGCTCGTGGAGTGACACCGGACGCGACAAACGAGGATGGTTTGACAGCCCTGCATCAGTGCTGCATTGACAACAATGAGGCCATGATGAGGTTATTACTTGACCATGGAGCGAATGTCAATGCTGAAGACAGCGAGAAATGGACACCCCTACACGCAGCAGCGACCTGCGGCAACCTCAACCTGGTCAGGATATTAATACAGTGTGGTGCGAACCTTCTAGCAGTGAACGGTGACGGGAATATGCCCTACGATATATGCGAGGAAGAGAGGACCCTGGACGCCATAGAAAGTGAAATGGCAGCTAGGGGTGTCACGCAAAGACTGATCGATGAAACCAGGGCTGCCACTGAAATGCAAATGCTGATGGATGTGGCCGATATGGTGAAGAAGGGAATGGATTTGGACGAGCCCAGAGATAATCAGGGCGCTACCTTGTTGCATATAGCGTCAGCGAACGGTTACCTCAAAGTGGTTGAGTTTCTGTTAGAGCATCGTGCGTCAACGGATGTAGTGGATCACGACATGTGGCAGCCGGTCCACGCGGCCGCCTGTTGGGGACATTTGGAAGTGTTAGAGCTTTTAGTGCAATACGGAGCCGATCTGAACGTGAGGAACAAACACGATGAAACACCAGCAGATATCTGCGAGGAGGGTGAGATGAGGGGTCGCATTCTTCGGTTGGCGGTGGAGCAGGAGGAGGTGAGGCGGCGAGCGGCCAGCGTGGCCGGGGACAGGCTGCGGGCGGCCAGGAGGTCCTCCTCGACAGCCTCCGCAAGCAGGGTGCGTTCCGTCCGTCGCACGTCGCTCCGTGAGAAGCAGTTGGCGGCCAAGGCGGATGCTCGAGGCGAAGCCAGGCTCAGGGAGACCTTCGACAGTACCGGAGACGACGACTATCAGGTGAGAATAGACGGGTACAAACATATGTATTAA

Protein sequence:

>DPOGS213551-PA
MQQLKLWQQREKEWARTRTKREKSNKRNIYFNDSVMLLEAAARNDIDEVRRLLARGVTPDATNEDGLTALHQCCIDNNEAMMRLLLDHGANVNAEDSEKWTPLHAAATCGNLNLVRILIQCGANLLAVNGDGNMPYDICEEERTLDAIESEMAARGVTQRLIDETRAATEMQMLMDVADMVKKGMDLDEPRDNQGATLLHIASANGYLKVVEFLLEHRASTDVVDHDMWQPVHAAACWGHLEVLELLVQYGADLNVRNKHDETPADICEEGEMRGRILRLAVEQEEVRRRAASVAGDRLRAARRSSSTASASRVRSVRRTSLREKQLAAKADARGEARLRETFDSTGDDDYQVRIDGYKHMY-