Monarch geneset OGS2.0

DPOGS205952
Transcript: DPOGS205952-TA (1491 bp)
Protein: DPOGS205952-PA (496 aa)
Genomic position: DPSCF300156 + 273985-277000
RNAseq coverage: 497x (Rank: top 25%)
Annotation
Heliconius: HMEL006451  0.0  79.16%
Bombyx: BGIBMGA002837-TA  0.0  88.13%
Drosophila: Prp19-PA  0.0  78.02%
EBI UniRef50: UniRef50_Q9UMS4  0.0  64.29%  Pre-mRNA-processing factor 19 n=101 Tax=Metazoa RepID=PRP19_HUMAN
NCBI RefSeq: XP_001661732.1  0.0  76.44%  wd-repeat protein [Aedes aegypti]
NCBI nr blastp: gi|157110016  0.0  76.44%  wd-repeat protein [Aedes aegypti]
NCBI nr blastx: gi|194881103  0.0  78.22%  GG20970 [Drosophila erecta]
Group
Gene Ontology: GO:0005515  2e-67  protein binding
    GO:0000151  5.7e-20  ubiquitin ligase complex
    GO:0016567  5.7e-20  protein ubiquitination
    GO:0004842  5.7e-20  ubiquitin-protein ligase activity
KEGG pathway: aag:AaeL_AAEL015199  0.0
    K10599 (PRPF19, PRP19) maps-> Ubiquitin mediated proteolysis
                                  Spliceosome
InterPro domain: [211-493]  IPR015943  2e-67  WD40/YVTN repeat-like-containing domain
    [216-494]  IPR011046  1.6e-66  WD40 repeat-like-containing domain
    [67-134]  IPR013915  3.2e-30  Pre-mRNA-splicing factor 19
    [2-68]  IPR003613  5.7e-20  U box domain
    [3-56]  IPR013083  4.9e-19  Zinc finger, RING/FYVE/PHD-type
    [245-284]  IPR001680  4e-10  WD40 repeat
    [376-411]  IPR019781  2e-09  WD40 repeat, subgroup
Orthology group: MCL12086  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205952-TA
ATGGCTCTTTATTGTGCAATATCTAATGAAGTGCCAGAGGTTCCCGTCGTCTCCCCTTCGTCAGGATCAGTCTTCGAAAAGAGAATAATAGAGAAGTATATTATTGAAAATGGGGTAGACCCTATTAGCGGTAAAGAGCTGAGAGTGGAAGACTTGATTGAAATAAAAACTCCAGCCATAGTGAAACCTAAGCCTCCAAGTGCTACATCTATTCCCGCTACACTCAAGAGTATGCAGGACGAGTGGGATGCACTCATGCTGCACACCTTCACTCAGAGACAACAACTACAGACGGCCCGACAGGAGTTGAGTCACGCCCTGTACCAGCACGACGCCGCCTGCCGCGTCATCGCCCGCCTCACCAAGGAGGTGACGGCGGCCCGCGAGGCGCTCGCCACGCTCAAACCTCAGGCCGGCCTCGCCGCGCCGCTCGGCCGATCAGTGGAGGCTGCGGCAGCGGGCGCGGCGGCTGTGGGTATGTCTCCGGAAGTGGTGTCCCGTCTCCAGGACCGCGCCACCGCCCTCACACAGGAACGCAAGCGCCGCGGCCGCACGGTTCCTGAGGGACTCGTCGGACCCGATACCATACGGTCCTTCGTCACGCTCGCCTCACATCCCGGTCTCCACTCGGCGAGCGTGCCCGGGATCCTGGCCCTGGACATCAACCCGTCGGACCACAGCCGCATACTGACGGGAGGCAACGACCGCAACGCCACCGTCTTCAACAAGGACACGGAGCAGGTGGTGGCCATCCTCAAGGGACACACCAAGAAGGTGACGCGCGTCATCTACCACCCCGACGAGGACACCGTCATCACCGCCTCGCCCGACCACACCATCCGAGTGTGGAACGTGCCCACGTCGCAGACGACGGTGCTGCTGCGGTCTCACGAGGGTCCAGTCACAGGCCTGTCTCTTCACCCCACCGGGGACTACGTGCTGTCCACCTCCACCGACCAGCACTGGGCCTTCTCCGACATACGCACTGGCCAGCTGCTCACTAAGGTGAGCGACGCATCCGGAGTGAGTCTCACGACGGCGCAGTTCCACCCCGACGGTCTGATCTTCGGCACCGGCACGGAGAACTCGCAGGTGAAGATCTGGGACCTCAAGGAGCAGAGCAACGTGGCCAACTTCCCTGGGCACGTGGGACCCGTCACATCCATCTCCTTCTCCGAGAACGGCTACTACCTGGCCACGGCGGCCGAGGACGCGTGCGTCAAGCTGTGGGACCTGCGCAAGCTGAAGAACTTCAAGAGCATCCAGCTGGACGAGGGCTACGTGATCCGCGAGCTGCGCTTCGACCAGAGCGGCACGTACCTGGGCGTGGCGGGCTCGGACGTGCGCGTGTTCCTGTGCCGCCAGTGGCAGGAGCTGCGCGTGCTGGCCGACCACACGGCCGCCGCCACCGGCCTGCGCTTCGGCCGCGACGCCGCCTACCTCGCCTCCACCTCCATGGACCGCACGCTCAAGATATACGGCCTGCAGTGA

Protein sequence:

>DPOGS205952-PA
MALYCAISNEVPEVPVVSPSSGSVFEKRIIEKYIIENGVDPISGKELRVEDLIEIKTPAIVKPKPPSATSIPATLKSMQDEWDALMLHTFTQRQQLQTARQELSHALYQHDAACRVIARLTKEVTAAREALATLKPQAGLAAPLGRSVEAAAAGAAAVGMSPEVVSRLQDRATALTQERKRRGRTVPEGLVGPDTIRSFVTLASHPGLHSASVPGILALDINPSDHSRILTGGNDRNATVFNKDTEQVVAILKGHTKKVTRVIYHPDEDTVITASPDHTIRVWNVPTSQTTVLLRSHEGPVTGLSLHPTGDYVLSTSTDQHWAFSDIRTGQLLTKVSDASGVSLTTAQFHPDGLIFGTGTENSQVKIWDLKEQSNVANFPGHVGPVTSISFSENGYYLATAAEDACVKLWDLRKLKNFKSIQLDEGYVIRELRFDQSGTYLGVAGSDVRVFLCRQWQELRVLADHTAAATGLRFGRDAAYLASTSMDRTLKIYGLQ-
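
Length check (illustrative sketch, not part of the original record): the transcript and protein lengths listed for this gene, 1491 bp and 496 aa, agree with each other if the transcript is a complete coding sequence running from the ATG start codon through the terminal TGA stop codon, which is an assumption here. The function name below is hypothetical.

```python
def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Return the number of amino acids encoded by a coding sequence.

    Assumes cds_bp is the length of a complete, in-frame CDS.
    If has_stop_codon is True, the final codon is a stop and
    contributes no amino acid.
    """
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


# 1491 bp -> 497 codons -> 496 amino acids plus one stop codon
print(expected_protein_length(1491))  # prints 496
```

The trailing "-" at the end of the protein sequence above marks that stop codon and is not counted in the 496 aa.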