Monarch geneset OGS2.0

DPOGS203430
TranscriptDPOGS203430-TA1308 bp
ProteinDPOGS203430-PA435 aa
Genomic positionDPSCF300548 + 1709-4034
RNAseq coverage0x (Rank: top 96%)
Annotation
HeliconiusHMEL0126893e-11873.91% 
BombyxBGIBMGA005374-TA1e-8662.18% 
DrosophilaCG9601-PA2e-5439.64% 
EBI UniRef50UniRef50_E2C2028e-5737.66%Bifunctional polynucleotide phosphatase/kinase n=1 Tax=Harpegnathos saltator RepID=E2C202_HARSA
NCBI RefSeqXP_002424893.12e-5641.03%polynucleotide kinase- 3'-phosphatase, putative [Pediculus humanus corporis]
NCBI nr blastpgi|3071964873e-5637.66%Bifunctional polynucleotide phosphatase/kinase [Harpegnathos saltator]
NCBI nr blastxgi|3504078873e-5638.05%PREDICTED: bifunctional polynucleotide phosphatase/kinase-like [Bombus impatiens]
Group
Gene OntologyGO:00055151.4e-23protein binding
KEGG pathway 
InterPro domain[1-435] IPR0156363.2e-107Polynucleotide kinase 3-phosphatase
[159-276] IPR0065512.1e-36Polynucleotide 3'-phosphatase
[171-277] IPR0139543.2e-34Polynucleotide kinase 3 phosphatase, central region
[149-277] IPR0232144.8e-26HAD-like domain
[1-106] IPR0089841.4e-23SMAD/FHA domain
[170-276] IPR0065496.3e-22HAD-superfamily hydrolase, subfamily IIIA
Orthology groupMCL12420 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203430-TA
ATGATACGACAATGCTTCCTACGATGTTTACTAGATTCGCATTCTCCTATTAAATTACCACATAATGTAGATGTTATAGTTGGACGCAGTAAAATTACTAAAATAAAGGACCAGTCCTGTTCTCGTCAACAGATTACTTTAAAAGCCGATTGCGAAGAATGCTCTGTCGAATTAAAATCGATTGGTATTAATCCATCAGGTTTGGATGGTTTCGCTCTAGAGAGAAATAGCTTGTATAAATTACAACATGGGAGCAGAGTTGAGATATTATTAAACAATTATATTCATGTAATTGAGTTTGAACCACCACCTGATAATCACAATGAACAAAAACAGAATAAAAGGAAACTGGAAGAAGACATTGTGGATTCAGCTCCACGTAAGAAATCAAAAACTGAAGCTGAGTTAATCAAAGTGAGTACTAAAGAGGCTGGTAAAGATATGTGGGAAGAAATTGATAAAGGTGAAGTGTATATGTTTACCGCTAAGGGAGTAAAATCTAGCAGTAGAATAGCAGCTTTTGACATGGATGGAACATTGATAAAGACCAAGTCTGGGAAGGTTCATCCTGTTGATGTCAATGATTGGCAAATTGCCATGCCGCAAGTCCCACAGAAGCTGTCGGACAAATTTGAAGAAGGTTATAAGATTGTGATTCTTAGTAACCAATCACCAATTGGAAGTGGCCGAGTTAGAATTGACGATTTTAAGAAAAAAATTGAGGGTCTAGTCCAGAAATTAAATGTCCCAGTACAAGTCTACTTAGCTACAGGTAAAGGAATTTACAGAAAACCTATGACTGGCATGTGGAAAATCTTATCTGAAAAGCTTCTTGTTCTGGTAGGCTATCCTGGTAGTGGTAAATCATTCGTAGCAAAATTGATTGAACAGAAATCAGGAAGCAGATATGTTACAGTGTGTAGAGATGTTCTTGGTACTTGGCAAAAATGTGCCTCGGAAGCATCTAAGTTACTGCAGCAAGGCAAGAGTGTGATTGTAGATAGCACAAACCCAGATACAGAATCCCGGTCCCGTTGGACGTCCATAGCCAAAAATTTAAATGTACAGTGCCGTTGTGCAAGAATGATGACCACCAAAGCACATTCATTACACAATAATAAGTTTAGAGAGATTATGAAGTTTAAACATGTGCCTGTCAATGAAATAGTATTCCATAGTTACAAGAATAAATTTGTTCCACCGTCACTAACGGAGGGATTTAAAGAAATAATAGAAGTCAAATTTAACCCTACTTTCAAAGACGACGAAGCCGAAAAAACATATAGAATGTATTTATTGGAAAAATAA

Protein sequence:

>DPOGS203430-PA
MIRQCFLRCLLDSHSPIKLPHNVDVIVGRSKITKIKDQSCSRQQITLKADCEECSVELKSIGINPSGLDGFALERNSLYKLQHGSRVEILLNNYIHVIEFEPPPDNHNEQKQNKRKLEEDIVDSAPRKKSKTEAELIKVSTKEAGKDMWEEIDKGEVYMFTAKGVKSSSRIAAFDMDGTLIKTKSGKVHPVDVNDWQIAMPQVPQKLSDKFEEGYKIVILSNQSPIGSGRVRIDDFKKKIEGLVQKLNVPVQVYLATGKGIYRKPMTGMWKILSEKLLVLVGYPGSGKSFVAKLIEQKSGSRYVTVCRDVLGTWQKCASEASKLLQQGKSVIVDSTNPDTESRSRWTSIAKNLNVQCRCARMMTTKAHSLHNNKFREIMKFKHVPVNEIVFHSYKNKFVPPSLTEGFKEIIEVKFNPTFKDDEAEKTYRMYLLEK-