Monarch geneset OGS2.0

DPOGS208057
TranscriptDPOGS208057-TA1692 bp
ProteinDPOGS208057-PA563 aa
Genomic positionDPSCF300203 + 398872-404104
RNAseq coverage315x (Rank: top 36%)
Annotation
HeliconiusHMEL0121310.064.27% 
BombyxBGIBMGA001484-TA3e-9685.26% 
DrosophilaUbpy-PA1e-4332.27% 
EBI UniRef50UniRef50_E0VJF62e-14951.06%Ubiquitin carboxyl-terminal hydrolase n=4 Tax=Neoptera RepID=E0VJF6_PEDHC
NCBI RefSeqXP_314931.44e-15451.93%AGAP008805-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582935889e-15351.93%AGAP008805-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1571082463e-15649.76%hypothetical protein AaeL_AAEL004980 [Aedes aegypti]
Group
Gene OntologyGO:00065112.1e-62ubiquitin-dependent protein catabolic process
GO:00042212.1e-62ubiquitin thiolesterase activity
GO:00082707.4e-20zinc ion binding
KEGG pathway 
InterPro domain[222-556] IPR0013942.1e-62Peptidase C19, ubiquitin carboxyl-terminal hydrolase 2
[2-105] IPR0130833.1e-21Zinc finger, RING/FYVE/PHD-type
[29-89] IPR0016077.4e-20Zinc finger, UBP-type
Orthology groupMCL17124 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208057-TA
ATGGAATGTACCCATTTGTTAGATAACGTCAAGTTGGATAGCGATCTTTTCGGTGAGGATATTACGATCACAAAGAATTTCAACTGTTCAGAATGTCATATCAAGGAACAGAATTGGCTTTGCCTGCAATGCGGGATCGTCAACTGTGGCCGGTACGCCAACGGACACGCAAAACTACACGCAGAATCATCTGACCACCAGTTGTGCATGAGCTGCGACGTATTTTCCGTTTACTGTTACAAATGTGATGACTATGTTTCAAATGATGTAGAACATCTAACAATAGACAAAATAAGGCAGCATATAATGCAAGCAACGGATGGTAATGTTAATGAAAATGACAAGGAAATAACAAAAGATTGTGCGGAGCAAGATGGTCATCTACAAGAGGATACAAGTGAAAAGTCATTGTCACCTATTGCTGAAGAAAACTCTAATCCAGAAATCCAGCTGGTTACATGTGAGATAACAACTGAACCAAGTGCCTCGGAAAATGTTCAGAGCAAAAAATCAATTAAGAATGCAGATGAACCAGTTAGAAGTTTGCGACCAAGGTCTCGTAAAAGATCTCACTCTGAGGATAGTTTTGCTGAAAATTCATCTGTACCAACCAAAAGCAGACAGAAGGGTTTACCGTTAAATGGGAAGAGTCAGAAGGATAAAAAAATTGTAGGTTTAAAAAATTTAGGCAACACATGCTTCATGAATGCTGTGCTGCAAAGTCTTAATAATATACAAGAATTTAGTTACTACTTCAGTCAACTGCCCTCCTTGGAAATGAAGACCAATGGACGGAAAGTGTATCACTCGAGGAGTTACACAAGACAGGAGATGCACGATGTTGTTATGGCAGAGGAACTCAGGAAGGTGTTAATAAACCTCAACACGGGAGGCTGTGGTTCCAAAGCTGCTATATCTCCTGAATGTCTGTTCCTGGTGATATGGAAGGTAGTTCCGCGGTTCCGTGGCTACCAGCAACAAGACGCACACGAATTCCTTCGCTACATGCTTGACAGACTGCACACAGAACTCCAGCAGTTGGTACCCGATCGTCCTGAAGGTGTTAAGGCACCGGCATCTATTGTCACCGCTGTGTTCGGTGGAACATTACAGAGCGAGGTCCGTTGTCTGGCATGTGGTACTGAGAGTAAGAAGTTTGATCCATTCTTGGATCTGTCCCTGGAGTTACCAGAGAGTGGGCGACATGACGCGCCCGTCTCACTAACTGACTGCTTGGCCAGTTTTGTACAGATCGAAGAGCTGGCTGACACAGAGAGATATTTCTGTAGCAGCTGTAAATGCAAACAGAAATCAACAAAACAGTTCTGGATCAGACGGTTACCGAACGTACTGTGCCTACATCTCAAGAGATTTAGATGGCACAACTATTTCAGAACTAAAGTGGACACTTCCATCTCCTTCCCCCTGTTGTCCCTGGACATGTCCGGGTTTGTGCTACCCAACGTGCCCGACACCAGGCGCTCCGGCCGCGGCAGTCTGTTGTATGACCTAGCAGCTGTGATCGTTCATCACGGCTCGGGCGCCGGGTCCGGTCACTACACGGCGTTCGCTATCAACGAGGAGCAATGGTTCCACTTTAACGATCAAACTGTCCGTGCGACCGACTCCGCGGCGGTGGCGTCCTGCAAGCCCTACATACTGTTCTACATCAGGAGAGAATTCGCTTCGTGA

Protein sequence:

>DPOGS208057-PA
MECTHLLDNVKLDSDLFGEDITITKNFNCSECHIKEQNWLCLQCGIVNCGRYANGHAKLHAESSDHQLCMSCDVFSVYCYKCDDYVSNDVEHLTIDKIRQHIMQATDGNVNENDKEITKDCAEQDGHLQEDTSEKSLSPIAEENSNPEIQLVTCEITTEPSASENVQSKKSIKNADEPVRSLRPRSRKRSHSEDSFAENSSVPTKSRQKGLPLNGKSQKDKKIVGLKNLGNTCFMNAVLQSLNNIQEFSYYFSQLPSLEMKTNGRKVYHSRSYTRQEMHDVVMAEELRKVLINLNTGGCGSKAAISPECLFLVIWKVVPRFRGYQQQDAHEFLRYMLDRLHTELQQLVPDRPEGVKAPASIVTAVFGGTLQSEVRCLACGTESKKFDPFLDLSLELPESGRHDAPVSLTDCLASFVQIEELADTERYFCSSCKCKQKSTKQFWIRRLPNVLCLHLKRFRWHNYFRTKVDTSISFPLLSLDMSGFVLPNVPDTRRSGRGSLLYDLAAVIVHHGSGAGSGHYTAFAINEEQWFHFNDQTVRATDSAAVASCKPYILFYIRREFAS-