Monarch geneset OGS2.0

DPOGS211254
TranscriptDPOGS211254-TA1578 bp
ProteinDPOGS211254-PA525 aa
Genomic positionDPSCF300425 - 9511-12150
RNAseq coverage32x (Rank: top 75%)
Annotation
HeliconiusHMEL0126890.076.38% 
BombyxBGIBMGA005374-TA0.068.46% 
DrosophilaCG9601-PA3e-10438.81% 
EBI UniRef50UniRef50_E2C2028e-12742.40%Bifunctional polynucleotide phosphatase/kinase n=1 Tax=Harpegnathos saltator RepID=E2C202_HARSA
NCBI RefSeqXP_001606989.13e-12044.55%PREDICTED: similar to polynucleotide kinase- 3-phosphatase [Nasonia vitripennis]
NCBI nr blastpgi|3071964873e-12642.40%Bifunctional polynucleotide phosphatase/kinase [Harpegnathos saltator]
NCBI nr blastxgi|3071964875e-12542.40%Bifunctional polynucleotide phosphatase/kinase [Harpegnathos saltator]
Group
Gene OntologyGO:00055151.4e-23protein binding
KEGG pathway 
InterPro domain[1-525] IPR0156361.6e-170Polynucleotide kinase 3-phosphatase
[159-331] IPR0065512.9e-56Polynucleotide 3'-phosphatase
[171-331] IPR0139549.1e-56Polynucleotide kinase 3 phosphatase, central region
[149-341] IPR0232149.6e-43HAD-like domain
[170-324] IPR0065497.1e-29HAD-superfamily hydrolase, subfamily IIIA
[1-106] IPR0089841.4e-23SMAD/FHA domain
Orthology groupMCL12420 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211254-TA
ATGATACGACAATGCTTCCTACGATGTTTACTAGATTCGCATTCTCCTATTAAATTACCACATAATGTAGATGTAATAGTTGGACGCAGTAAAATTACTAAAATAAAGGACCAGTCCTGTTCTCGTCAACAGATTACTTTAAAAGCCGATTGCGAAGAATGCTCTGTCGAATTAAAATCGATTGGTATTAATCCATCAGGTTTGGATGGTTTCGCTCTAGAGAGAAATAGCTTGTATAAATTACAACATGGGAGCAGAGTTGAGATATTATTAAACAATTATATTCATGTAATTGAGTTTGAACCACCACCTGATAATCACAATGAACAAAAACAGAATAAAAGGAAACTGGAAGAAGACATTGTGGATTCAGCTCCACGTAAGAAATCAAAAACTGAAGCTGAGTTAATCAAAGTGAGTACTAAAGAGGCTGGTAAAGATATGTGGGAAGAAATTGATAAAGGTGAAGTGTATATGTTTACCGCTAAGGGAGTAAAATCTAGCAGTAGAATAGCAGCTTTTGACATGGATGGAACATTGATAAAGACCAAGTCTGGGAAGGTTCATCCTGTTGATGTCAATGATTGGCAAATTGCCATGCCGCAAGTCCCACAGAAGCTGTCGGACAAATTTGAAGAAGGTTATAAGATTGTGATTCTTAGTAACCAATCACCAATTGGAAGTGGCAGAGTTAGAATTGACGATTTTAAGAAAAAAATTGAGGGTCTAGTGCAGAAATTAAATGTCCCAGTACAAGTCTACTTAGCTACAGGTAAAGGAATTTACAGAAAACCTATGACAGGCATGTGGAAAATTTTATCTGAAAAGTATAATGATGATATACTAATTGATATGGATAATAGTTTTTACTGTGGAGATGCGGCAGGCAGAGCAGCTAATTGGGCTCCAGGAAGGAAGAAAGATCACTCAATGGCTGATATACTTCTAGCCGAGAACCTTGGACTAAAATTTTATACACCAGAACAGTTTTTCTTAGGACACTCAATTGCAAATGTTCCGATGAGCAAACCGGAATTCATACCAAAAGAAGTAACAGCAGAGCCTTTTAATGAGGATTTAATTAGTGATGAAAAGGAGCTTCTTGTTCTGGTAGGCTATCCTGGTAGTGGTAAATCATTCGTAGCAAAATTGATTGAACAGAAATCAGGAAGCAGATATGTTACAGTGTGTAGAGATGTTCTTGGTACTTGGCAAAAATGTGCCTCGGAAGCATCTAAGTTACTGCAGCAAGGCAAGAGTGTGATTGTAGATAGCACAAACCCAGATACAGAATCCCGGTCTCGTTGGACGTCCATAGCCAAAAATTTAAATGTACAATGCCGTTGTGCAAGGATGATGACCACCAAAGCACATTCATTACACAATAATAAGTTTAGAGAGATTATGAAGTTTAAACATGTGCCTGTCAATGAAATAGTATTCCATAGTTACAAGAATAAATTTGTTCCACCGTCACTAACGGAGGGATTTAAAGAAATAATAGAAGTCAAATTTAACCCTACTTTCAAAGACGACGAAGCCGAAAAAACATATAGAATGTATTTATTGGAAAAATAA

Protein sequence:

>DPOGS211254-PA
MIRQCFLRCLLDSHSPIKLPHNVDVIVGRSKITKIKDQSCSRQQITLKADCEECSVELKSIGINPSGLDGFALERNSLYKLQHGSRVEILLNNYIHVIEFEPPPDNHNEQKQNKRKLEEDIVDSAPRKKSKTEAELIKVSTKEAGKDMWEEIDKGEVYMFTAKGVKSSSRIAAFDMDGTLIKTKSGKVHPVDVNDWQIAMPQVPQKLSDKFEEGYKIVILSNQSPIGSGRVRIDDFKKKIEGLVQKLNVPVQVYLATGKGIYRKPMTGMWKILSEKYNDDILIDMDNSFYCGDAAGRAANWAPGRKKDHSMADILLAENLGLKFYTPEQFFLGHSIANVPMSKPEFIPKEVTAEPFNEDLISDEKELLVLVGYPGSGKSFVAKLIEQKSGSRYVTVCRDVLGTWQKCASEASKLLQQGKSVIVDSTNPDTESRSRWTSIAKNLNVQCRCARMMTTKAHSLHNNKFREIMKFKHVPVNEIVFHSYKNKFVPPSLTEGFKEIIEVKFNPTFKDDEAEKTYRMYLLEK-