Monarch geneset OGS2.0

DPOGS213469
TranscriptDPOGS213469-TA1269 bp
ProteinDPOGS213469-PA422 aa
Genomic positionDPSCF300100 - 296151-300584
RNAseq coverage25x (Rank: top 77%)
Annotation
HeliconiusHMEL0168411e-13365.03% 
BombyxBGIBMGA005728-TA5e-6143.51% 
Drosophilacdc14-PA7e-7950.35% 
EBI UniRef50UniRef50_UPI00022461DB1e-8448.18%UPI00022461DB related cluster n=1 Tax=unknown RepID=UPI00022461DB
NCBI RefSeqXP_001603728.11e-8352.71%PREDICTED: similar to Dual specificity protein phosphatase CDC14A (CDC14 cell division cycle 14 homolog A) [Nasonia vitripennis]
NCBI nr blastpgi|3454791894e-8448.18%PREDICTED: dual specificity protein phosphatase CDC14A-like [Nasonia vitripennis]
NCBI nr blastxgi|3838590752e-8250.81%PREDICTED: dual specificity protein phosphatase CDC14A-like [Megachile rotundata]
Group
Gene OntologyGO:00081385.4e-12protein tyrosine/serine/threonine phosphatase activity
GO:00064705.4e-12protein dephosphorylation
KEGG pathwaynvi:1001200463e-83 
 K06639 (CDC14)maps-> Meiosis - yeast
    Cell cycle - yeast
    Cell cycle
InterPro domain[170-233] IPR0003405.4e-12Dual specificity phosphatase, catalytic domain
Orthology groupMCL22701 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213469-TA
ATGGATCCAAAGAAAAAGGCGAACGCGGCATTCCTTATGGGAAGTTACGGTGTTTTGTATTTGAGTTTACCGCCGAAAGATGCATTGGAACCGCTCCTGGTTCACGGACAATGTTACAGGCCATTTCAAGACGCTACACAGGGTGACTCAAATTATACAATAACTATAATGGATTGCTTGCAAAGCTTATCTCGAGCAAGAGATTTGGGTTTTTTTGATTTTCAAGATTTCAACTATCAAGAATATGAAAGATTGGATAAAATACAGGGAGACCTTAATTGGATTGTACCCGATAAATTTTTAGCTTTCATTGGTCCAGTTGATTACAACCACGTGTCATCCTTATACCATCCTCCTGAGATATACGTGGATTACTTTAAAGAAAATAATGTCCAAATAGTTATGAGGCTAAATAAAAAGCTTTATGACAGTAACGTCTTTATTAACAGCGGTATAATGCATTACAACTTATTCTTCCCGGACGGATCATGTCCACCTCGACATATATTGTTGAAATTCTTGCAAATAAGTGAGGAATGTGACGGAGCTATAGCAGTCCACTGCAAGGCAGGTTTAGGTCGCACTGGTTCACTAATTGGCTGTTATTTGATAAAGCATTATCGTATGACAGCACACGAGGCTATAGCCTGGATGAGAATCTGTCGACCTGGCTCTGTCATAGGTCATCAGCAGAGTTGGCTGGAAGAGCTTGAACCGTGGCTTATAAAACAAGGAAATCTTTATAGGAGACGTATGTATCAAGATATGAACAGGCTTCCTGTACACGATTATGGAATCTATTCGATGACTGAGAAGATTAATAGACAACGACCTGTCGTTTTGAGTAAATCCCCCTCGCCACCGCCTCCCTTTCAGAGACAAAATAGAAACGATGTTTCGACTTCCTCTCGACCGCAGGCGTCAAAAATAAGAGAAGACGCAGCCATAACCGTTAAACCACAAATTTCAAACAAACCTTCTATAATTCAAAGGCCAGTTCCCGGCACTGTCACTAAAATTGCACCTACAAATTTGAGCCAAATTACTAGGGGTTCAATACAACCAAGTAATCGCCGTCGTCTGGGTCGTAGTCCTAGTCCACCACAGATGAGAATAGAACATAGTCGCGCTACAAATACACCACCGACTGAGTGCTTTTCAACTCGTGATTCTAAGGTTGCTTTAACAGAAGCGTTATCGAGACTCAAATGTAAGTTTGTGTATATCTCGTTTCTATGTGTCGTTTCCAAGCTCATTATTTTAAAGTAG

Protein sequence:

>DPOGS213469-PA
MDPKKKANAAFLMGSYGVLYLSLPPKDALEPLLVHGQCYRPFQDATQGDSNYTITIMDCLQSLSRARDLGFFDFQDFNYQEYERLDKIQGDLNWIVPDKFLAFIGPVDYNHVSSLYHPPEIYVDYFKENNVQIVMRLNKKLYDSNVFINSGIMHYNLFFPDGSCPPRHILLKFLQISEECDGAIAVHCKAGLGRTGSLIGCYLIKHYRMTAHEAIAWMRICRPGSVIGHQQSWLEELEPWLIKQGNLYRRRMYQDMNRLPVHDYGIYSMTEKINRQRPVVLSKSPSPPPPFQRQNRNDVSTSSRPQASKIREDAAITVKPQISNKPSIIQRPVPGTVTKIAPTNLSQITRGSIQPSNRRRLGRSPSPPQMRIEHSRATNTPPTECFSTRDSKVALTEALSRLKCKFVYISFLCVVSKLIILK-