Monarch geneset OGS2.0

DPOGS210279
Transcript	DPOGS210279-TA	930 bp
Protein	DPOGS210279-PA	309 aa
Genomic position	DPSCF300216 + 177972-180802
RNAseq coverage	381x (Rank: top 31%)
Annotation
Heliconius	HMEL016979	2e-162	90.94%
Bombyx	BGIBMGA014179-TA	1e-155	90.29%
Drosophila	CG6697-PA	3e-111	66.21%
EBI UniRef50	UniRef50_E2BYM6	9e-98	66.32%	Ubiquitin-like domain-containing CTD phosphatase 1 n=10 Tax=Coelomata RepID=E2BYM6_HARSA
NCBI RefSeq	XP_974317.1	1e-119	73.61%	PREDICTED: similar to CG6697 CG6697-PA [Tribolium castaneum]
NCBI nr blastp	gi|91090662	3e-118	73.61%	PREDICTED: similar to CG6697 CG6697-PA [Tribolium castaneum]
NCBI nr blastx	gi|91090662	6e-121	73.61%	PREDICTED: similar to CG6697 CG6697-PA [Tribolium castaneum]
Group
Gene Ontology	GO:0005634	3.1e-98	nucleus
	GO:0004721	3.1e-98	phosphoprotein phosphatase activity
	GO:0005515	4.4e-26	protein binding
KEGG pathway 
InterPro domain	[91-285]	IPR011943	3.1e-98	HAD-superfamily hydrolase, subfamily IIID
	[97-280]	IPR023214	7e-28	HAD-like domain
	[112-274]	IPR004274	4.4e-26	NLI interacting factor
	[2-51]	IPR000626	4.7e-08	Ubiquitin
Orthology group	MCL13067	Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210279-TA
ATGTTAAAAATTGCCATCGAAAATGCTACTGGCGTGCGACCGGAAAGACAAAAACTTCTCAATGTAAAATTTCAGGGTAAAGTCGCGACCGATAATTGTACACTCTCCGATCTAAACCTCAAACCGAATCTGAAGATTATGATGGTGGGTTCTCTTGAAGAGGCAATAGAAGGTGCTAGAACCAAACCAGACGTCGGAGATGATGTTGTCAATGATCTTGACATAGAGGAAGAAGAAGTCGATGTTGAAAACCAAGAGATTTACTTAGCTAAAATAAATAAACGAATAAGGGATTACAAGATAAATGTACTGAATGAACCGCGACCGGGCAAGAAACTATTAGTTCTAGATATAGATTACACACTTTTTGATCACAGATCTGTTGCTGAGACGGGTTACGAGTTAATGCGTCCCTTCCTGCATGAATTTCTAACGTCTTCATATACGCATTACGATATAGTCATCTGGTCGGCCACCGGTATGAAGTGGATTGAAGAGAAGATGAGATTGCTCGGAGTATCCACACACCAGGACTACAAGATTATGTTCTATTTGGACTATCTAGCTATGATTACGGTCCATACAACCAAGTATGGGACTATAGATGTCAAACCTCTGGGGGTTATTTGGGGTAAATATCCCCAGTACAGCTCCAAGAACACGATAATGTTTGATGACATCCGGAGGAATTTCATCATGAATCCTAAGAGCGGTCTCAAGATACGTCCGTTCAGGCAAGCCCATTTGAACCGGGATAAGGACAGGGAACTACTGCATCTAACTACTTACTTAAGGGACATAGCGCAGTACTGCGACGATTTCGACACCCTGAACCACAAGAAGTGGGAGAAATACAAGCCGGACAGGATAACACAGCTGGCCGGCAGCAAGAGGAAGGCGGAGGACAGCGTGTCCAAGCGCAAGGAATGA

Protein sequence:

>DPOGS210279-PA
MLKIAIENATGVRPERQKLLNVKFQGKVATDNCTLSDLNLKPNLKIMMVGSLEEAIEGARTKPDVGDDVVNDLDIEEEEVDVENQEIYLAKINKRIRDYKINVLNEPRPGKKLLVLDIDYTLFDHRSVAETGYELMRPFLHEFLTSSYTHYDIVIWSATGMKWIEEKMRLLGVSTHQDYKIMFYLDYLAMITVHTTKYGTIDVKPLGVIWGKYPQYSSKNTIMFDDIRRNFIMNPKSGLKIRPFRQAHLNRDKDRELLHLTTYLRDIAQYCDDFDTLNHKKWEKYKPDRITQLAGSKRKAEDSVSKRKE-
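
The transcript and protein lengths listed for this record (930 bp and 309 aa) are consistent with one another if the coding sequence spans the whole transcript and ends in a single stop codon, since 3 x (309 + 1) = 930. A minimal sketch of that check in Python is given below. It is illustrative only, not part of the geneset tooling: the function names `parse_fasta` and `cds_matches_protein` are hypothetical, and the code assumes that a trailing `-` (as in the protein record above) or `*` marks the stop codon.

```python
def parse_fasta(text):
    """Return {record_id: sequence} for a FASTA-formatted string."""
    records = {}
    name = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Use the first whitespace-delimited token of the header as the ID.
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line
    return records


def cds_matches_protein(cds, protein):
    """True if len(cds) == 3 * (number of residues + 1 stop codon).

    Assumption: a trailing '-' or '*' in the protein string is the stop
    symbol and is not counted as a residue.
    """
    residues = protein.rstrip("-*")
    return len(cds) % 3 == 0 and len(cds) == 3 * (len(residues) + 1)
```

To apply it to this record, paste the two FASTA blocks above into strings, parse them with `parse_fasta`, and pass the `DPOGS210279-TA` and `DPOGS210279-PA` sequences to `cds_matches_protein`. The check compares lengths only; it does not translate the codons.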