Monarch geneset OGS2.0

DPOGS212076
TranscriptDPOGS212076-TA2022 bp
ProteinDPOGS212076-PA673 aa
Genomic positionDPSCF300317 + 105083-110376
RNAseq coverage238x (Rank: top 43%)
Annotation
HeliconiusHMEL0093480.076.64% 
BombyxBGIBMGA009640-TA0.060.96% 
DrosophilaCG5830-PA9e-3948.45% 
EBI UniRef50UniRef50_D6WPY51e-10076.92%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WPY5_TRICA
NCBI RefSeqXP_001660632.12e-10358.86%hypothetical protein AaeL_AAEL010078 [Aedes aegypti]
NCBI nr blastpgi|1571251243e-10258.86%hypothetical protein AaeL_AAEL010078 [Aedes aegypti]
NCBI nr blastxgi|1571251244e-9737.55%hypothetical protein AaeL_AAEL010078 [Aedes aegypti]
Group
Gene OntologyGO:00167914.3e-62phosphatase activity
GO:00055154.4e-58protein binding
KEGG pathway 
InterPro domain[486-661] IPR0232141.7e-64HAD-like domain
[497-659] IPR0119484.3e-62Dullard phosphatase domain, eukaryotic
[494-638] IPR0042744.4e-58NLI interacting factor
Orthology groupMCL15681 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212076-TA
ATGCGGTTAAGAAGTAGGAAAAGGGAGCGACCGCTCTCAGCGCGAAATTCACAAAGGCAAACGACTTTGAAATCAAAATTAAGAAGACCGCTCATCCAAGCGAAAAATACTTGGAGAAAAATATTGAAGGCGGACAATGGTATCTCGGTACCAAGTCCTTGCATCACTAGGTCGCAGAACAATCTAAATCTAAAATCCAGAACTCGAGCGAAAATTGCCAAAAGTATGGAAAAACCACTGGAATTGCCATTAAAGAAAAACAAGCCGAAAGAAGAGGCTAAGAAGCCTATTTTCCAGACTGTCGTTCGAAACTCCGCTAAAGATATGGTGACATCAACAACCAAGCCCAAAACACAGAACGATAAAGTAACAACAACCACAAAGACAAAGTTAAAGGATGTGCGAACAACCAAAAGTGCTCAGCTCCGACAGAGCTCTTTGAACAGAGAGCTTCGTAAGACAGAGTCTATATCGTCAGCTCTCAGCTCACCTCGCAAAACACGTAGGATGGATAAACCAGAGAAATCTGCCTCATTAAACAACTCACCCATTCGTAAAGTCCGTCAACTAGGACTGCCATCGTTGAAGCTGAACAATATAGCTGTCAGTTCAACTAACGAGAGTCCCAAGAAGAGAATTAAACCCAAAGCTCCAACTGTTCCGGAGCAGAATAACGATTTCAACGATGATAACGATGTAACCATGTATGAACCGCACACAACTACTTTCGATGACTTCCCGATTCCAAAGACCGTTGAATCGGATGTCGATGGGCAAGTCGGATCATCAGAGCTCTGTCTTCCCAACGACTGTTTCAACGACTTGATAGCTATTGCTGAATGTGCCAGGATAATAAGCAACAATCTATCAACAGACGAAGATATTAACTTACTAGCAGATAAAGCGGCCAAAGTGATGACCACCGACAAAGATGATAAAGTCAAGAGATGTACGGACAAGAAAATTAAACAACGGTCAAGCGGGGATTTAGAAATGTACCAGCCGACATCGACCACGGATGATTTGGATGTATGCACCGATTTCCTGAACCACGGAGATAAATATGTTCAGCAGCCGGACCTAGTATCTCTGTTGGAGCAGGAGTATGTCCGAGATGAGTGTGTCATATCATCGACCATGGCTATGGAGAGTCTGGAGGCTTTGTCTGCCCGAGGACCCAGCAGCGGGTTCCTGGCCGAGATAACACACAGCCTGTCAGCCGGAGACGATACCTGGAGCTCAACGGATGTTGTTGTAGAAGAAAACGCTTTGAACTGTCACGACACAACACCTGGGATTGAGGAGTCCACATCAGTTATATCATCATGCAACACCAGGGCCAGCGGTGAACAGATGACGTCATGGACGGACGCCTTCGATCCTTATCTGTTTATTAAACAGCTGCCACCGCTGGAAACCGTATCAGCTGGAGGACTCAGAACCAGGTGTCCAGCGCTACCCCTCAAAACTCGCACCAGTCCAGATTTTAGTCTTGTGTTGGATCTAGACGAGACATTGGTTCACTGTTCTCTCCAGGAGTTACCGGATGCTAGCTTCCACTTCCCCGTACTATTCCAGGATTGCAGATATACGGTGTTTGTCCGTACTCGTCCCCACTTTGCCGAGTTCCTCTCTAAAGTGTCACGTCTGTATGAAGTGATTCTTTTCACGGCTAGCAAGAGGGTGTACGCTGATAGACTACTGAACCTCCTGGACCCGGCCAGACGATGGATTAAATATAGGTTGTTCCGAGAACACTGTCTACTAGTTAATGGTAACTATGTGAAGGATTTGTCGATACTGGGACGGGATCTCAGGAGAACTGTCATCGTGGACAATAGCCCACAGGCGTTCGGCTACCAGCTGGAGAATGGTATACCTATAGACAGCTGGTTCGTAGACCGCAGTGACAATGAACTGCTCAAACTGCTGCCGTTCCTGGAACATCTGGCCACGAAAGACGATGTCCGGCCATACATCAGGGACAAGTACAAGCTGTTCAGTTACTTGCCACCGGATTAA

Protein sequence:

>DPOGS212076-PA
MRLRSRKRERPLSARNSQRQTTLKSKLRRPLIQAKNTWRKILKADNGISVPSPCITRSQNNLNLKSRTRAKIAKSMEKPLELPLKKNKPKEEAKKPIFQTVVRNSAKDMVTSTTKPKTQNDKVTTTTKTKLKDVRTTKSAQLRQSSLNRELRKTESISSALSSPRKTRRMDKPEKSASLNNSPIRKVRQLGLPSLKLNNIAVSSTNESPKKRIKPKAPTVPEQNNDFNDDNDVTMYEPHTTTFDDFPIPKTVESDVDGQVGSSELCLPNDCFNDLIAIAECARIISNNLSTDEDINLLADKAAKVMTTDKDDKVKRCTDKKIKQRSSGDLEMYQPTSTTDDLDVCTDFLNHGDKYVQQPDLVSLLEQEYVRDECVISSTMAMESLEALSARGPSSGFLAEITHSLSAGDDTWSSTDVVVEENALNCHDTTPGIEESTSVISSCNTRASGEQMTSWTDAFDPYLFIKQLPPLETVSAGGLRTRCPALPLKTRTSPDFSLVLDLDETLVHCSLQELPDASFHFPVLFQDCRYTVFVRTRPHFAEFLSKVSRLYEVILFTASKRVYADRLLNLLDPARRWIKYRLFREHCLLVNGNYVKDLSILGRDLRRTVIVDNSPQAFGYQLENGIPIDSWFVDRSDNELLKLLPFLEHLATKDDVRPYIRDKYKLFSYLPPD-