Monarch geneset OGS2.0

DPOGS214303
TranscriptDPOGS214303-TA2001 bp
ProteinDPOGS214303-PA666 aa
Genomic positionDPSCF300020 - 1099915-1104102
RNAseq coverage508x (Rank: top 25%)
Annotation
HeliconiusHMEL0206461e-17356.29% 
BombyxBGIBMGA004108-TA0.069.91% 
DrosophilaFcp1-PA8e-9762.16% 
EBI UniRef50UniRef50_D6WSR30.049.10%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WSR3_TRICA
NCBI RefSeqXP_971974.10.049.10%PREDICTED: similar to RNA polymerase II subunit A C-terminal domain phosphatase [Tribolium castaneum]
NCBI nr blastpgi|910875890.049.10%PREDICTED: similar to RNA polymerase II subunit A C-terminal domain phosphatase [Tribolium castaneum]
NCBI nr blastxgi|3800221331e-16343.82%PREDICTED: LOW QUALITY PROTEIN: RNA polymerase II subunit A C-terminal domain phosphatase-like [Apis florea]
Group
Gene OntologyGO:00055156.9e-52protein binding
GO:00056343.6e-48nucleus
GO:00047213.6e-48phosphoprotein phosphatase activity
GO:00056221.7e-19intracellular
KEGG pathway 
InterPro domain[126-274] IPR0042746.9e-52NLI interacting factor
[122-270] IPR0119473.6e-48FCP1-like phosphatase, phosphatase domain
[368-389] IPR0232144.9e-37HAD-like domain
[410-510] IPR0013571.7e-19BRCT
[1-67] IPR0110536.1e-07Single hybrid motif
Orthology groupMCL12445 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214303-TA
ATGGCTGACAAAACTATGCCTATTTCTGTTCCATCCGAAAAGCCTTTAAAAGTTATAAAATGGAAAGTTAAGGAAGGCAATAAAAGTGAAGTTAAGAAATTTAAAGCATTGCGCTCTGGTACTATTGTGTCCATCAAGGTGAAAGAAGGAGACATCGTAGAGCCTGGGGGTTGTATAGCTGATTTAGAACAATGCCGCCATCCCACTGTCATGAAAGAAATGTGTGCGGAATGTGGAGCCGATTTACGTTCCGGAGAATCACAAAAAAGAGATGTAGCTGTGGTCCCCATGGTTCACTCTGTACCCGAGTTAAAGGTATCTGAAGAATTGGCACAAAAATTAGGTCGTGAGGACGCCGATCGCTTACTTAAAGATCGTAAACTTGTTTTGCTTGTTGATCTTGATCAAACGTTAGTGCACACCACCAATGACAATATACCTCCTAATATAAAAGATGTACTCCACTTCTTTCTTCGAGGTCCTGGCAATCAAGGCAGGTGGTGTCACACTAGATTAAGACCTAAAACCCATGAGTTCTTAGAATCTGCAGCCAAGAATTATGAGCTACATGTATGTACATTCGGTGCGAGGCAGTATGCACATGCAATAACTGAATTATTGGATCCACAAAAAAAATTCTTCTCTCACAGAATTCTATCAAGAGATGAATGCTTCGATGCTAGGACCAAGTCAGCAAATTTGAAAGCACTATTCCCTTGTGGCGACAACATGGTGTGTATTATTGATGATCGTGAAGATGTATGGCGTCATGCCAGCAACTTAATCCAAGTGAGACCTTACTCATTCTTTCAGTCCACAGGTGATATAAATGCTCCACCGCCATTGCCTGAAGAAAAGACGAAACTTTTAAGCGGCAAAAATGGTTCCCAAGTATCCAAAGATAATCAAATGCCAACACTGGATGCTGAGCCGGAGAAAGAAAATAAAGAGATCATAGAGAAAGTTAATTCAGATAAGAAAGATAGTGAAAACGGTATAATAAAAGATAAAAAAGATGATAAAATGGAAAATGATGCTAATGAAAAAGTTGAAACACCAGTGTGGGTTGAATCATCCGAAGGGCAGATAGAAGTTGATGATCCCGATGACTATTTAATATATCTAGACGACATATTAAAAAGAATACACAACCACTTCTATGATATATATGATAAAATGGAGAATAGTGAAAATGAGAAAAGTATCCCAGATTTGAAATATATAATACCTGAAGTTAAAAGTCAAGTGCTGGCTGGTTCCAGTCTTGTGTTTAGTGGTTTGGTGCCTACACACCAGAGGTTAGAGACATCAAGAGCATATCAAGTTGCAAAAACATTAGGGGCTGAGGTCACACAAGATTTCACAGATAAAACTACACATTTAGTTGCTATGAGAGCAGGTACAGCGAAAGTAAATGCAAGTAAAAAGCTGGGCGAAGATAAATCAAAGATACATGTCGTTACACCCGAATGGCTGTGGACTTGCGCCGAGCGTTGGGAGCGTGTTGAAGAGAAATTGTACCCTTTACAAAGAGTAGGGCAGAGCAGTTTACGCCGCCCGCCCGCGCATTGCAATAGCCCTCCACCAGCACCTGCGGTAAGGAAAAGGACTCCGTCCGGCCGATTCATGGACACTATCAATCCTCTGCTGTCTTTTTCAAGCGATGATATTGCTGATATGGATAGAGAGGTAGAAGACATTTTTAATGAATCTGATGAGAGTTCATCGGACGACGAGGAGAAGGTGCTCGGTGATAATGATGAAGAGAATATTACTGAAGACAGACTGCTGAGTCTGGAGTCAGGAAATAGTGCTCAGGAAAGGTTACAAGAAAAACTCAATGAAGATTCTAATGATTCCAACACAGAAGATGGGGAGAGAGCTCTTAAAAGGCCACGACCATCCACACCCTCGGATGATGAGGGTCCGCCTGATGATGACGATACTTCGTGGAACCTCATGGGCGCAGCCCTAGAGAGGGAATTCCTCGCTCAGGATTAA

Protein sequence:

>DPOGS214303-PA
MADKTMPISVPSEKPLKVIKWKVKEGNKSEVKKFKALRSGTIVSIKVKEGDIVEPGGCIADLEQCRHPTVMKEMCAECGADLRSGESQKRDVAVVPMVHSVPELKVSEELAQKLGREDADRLLKDRKLVLLVDLDQTLVHTTNDNIPPNIKDVLHFFLRGPGNQGRWCHTRLRPKTHEFLESAAKNYELHVCTFGARQYAHAITELLDPQKKFFSHRILSRDECFDARTKSANLKALFPCGDNMVCIIDDREDVWRHASNLIQVRPYSFFQSTGDINAPPPLPEEKTKLLSGKNGSQVSKDNQMPTLDAEPEKENKEIIEKVNSDKKDSENGIIKDKKDDKMENDANEKVETPVWVESSEGQIEVDDPDDYLIYLDDILKRIHNHFYDIYDKMENSENEKSIPDLKYIIPEVKSQVLAGSSLVFSGLVPTHQRLETSRAYQVAKTLGAEVTQDFTDKTTHLVAMRAGTAKVNASKKLGEDKSKIHVVTPEWLWTCAERWERVEEKLYPLQRVGQSSLRRPPAHCNSPPPAPAVRKRTPSGRFMDTINPLLSFSSDDIADMDREVEDIFNESDESSSDDEEKVLGDNDEENITEDRLLSLESGNSAQERLQEKLNEDSNDSNTEDGERALKRPRPSTPSDDEGPPDDDDTSWNLMGAALEREFLAQD-