Monarch geneset OGS2.0

DPOGS209575
TranscriptDPOGS209575-TA3354 bp
ProteinDPOGS209575-PA1117 aa
Genomic positionDPSCF300015 - 1070278-1101657
RNAseq coverage175x (Rank: top 50%)
Annotation
HeliconiusHMEL0170440.083.27% 
BombyxBGIBMGA006636-TA0.069.22% 
DrosophilaCG42797-PD6e-5740.26% 
EBI UniRef50UniRef50_D6WZM41e-9344.40%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZM4_TRICA
NCBI RefSeqXP_001647739.12e-9845.27%hect type E3 ubiquitin ligase [Aedes aegypti]
NCBI nr blastpgi|1571416733e-9745.27%hect type E3 ubiquitin ligase [Aedes aegypti]
NCBI nr blastxgi|1571416732e-11936.68%hect type E3 ubiquitin ligase [Aedes aegypti]
Group
Gene OntologyGO:00055152.2e-16protein binding
KEGG pathway 
InterPro domain[819-851] IPR0012022.2e-16WW/Rsp5/WWP
Orthology groupMCL19139 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209575-TA
ATGTCAGAGGAACATGATGGTGATGTCGTTGATAGTGAAAGAACCCTGGACAAAGAGTGTCAGACGTGCGATGTGACTTGTGATGAAGGGAGTGCCAATAACAGTGCCACTAGCAAAGAGAGTGAGACGAGCGAGGTGGAAGAGAAGGGTGACCTGTCCGAGAAAGGAAACGAACCCAGGGGCAGCATCATATGGACGGATGAGAAGACGCACCTTCGAGTGTCATGGAACTTGAAGGAAGGAACGGCCACAGACAAAGATTATGTGGCTCTTTGTTATACAGAGACCACAAGCATAGCCGGTATCGCCAGGCTCGTGCCGGCGACCGGATGCGACACTGGACATATAATGTGGCTGCTTGATGAGCCCAATCAGCCTTATGAAGATTGTGAACAACTGCTCTGTTTCCGTTACTACAATGGCGAAGAAGACGAATGCGTTGCAGAGTCTTCTACCCTACCACCACGATTTAAAATTGATCTTAAAAAGCTGCCACATCGGCTTAAAGAGGGAATCTCGAGGAAAAGAAGCGGTGATCAAGTGTCACCCTTCTCATATAACAATGAGAGTTTTGAAATGAGTTCAGAAAACACACCTCGAGTTGAATTAGTGAGTACGAGCCAGGACATCAAGTATCTTAGTGAAAATGTCAATAAATGCTCAATTAGCACCTTAGGAAATAATGGGACCTTAAATCTCGAACAAATAAATTGTGATCATAGTAAATCTCTAGTTAGTAGTAGTACTGAATCTCCTAGTGAATGTCCCAATATTTTGGACTGCAAACCAACGTCATCTGGTGTCAAAAAGAAATGTCCTCCTCCTGTCGATACTGGTGGTGGGCAAGTTTTGAATGGTGTCAAGAAAAAAACTAATATGAATATCGGGTTAGAATCTCCGACATCCCCCGATGGAACTGAATATTATAAGTTATGGTCACCGCAGAATCCTTGCAAAATTATGAAGTTTGTTTATGATGTGGAAGGTGCGAATGTGCCAAATGATGCTCAAAAATCTCCAGTTCCGCCCTTACCACCTAGACAATCACATAAACCTCTAGAAAGAATGCACGCTTTACCTCCGATTGTTCAGAGGCATAGGAAACCTAAAAAGTTAACAAAGCCAGAAGATGCGTTTACTTTTGAGCTGATAGACACTGATGAACAATTTTTTACGGATAATAACATTGCCAATTCCGCAGCTTTGCACGGAGATATTAATGATTTTAAACCTGGAGAATTTTATTCAGGCGTTCAAGCTCCCATTTCTTCAACCACAACTTTTTTAAGAGGATGTCAAAATGTTGCACCTCTAGCAACTCCTGAGACTGAAGGTAGCTTATGTGACTCAACTATATCTTCCATTAAATTATCTAAATTTGTAGAGGATAAGAAGTCTGTCGATGTACCTGATAGCACGCAAAATGTTGGTAAAAGCAACATCGTGTTTATAAAAGATATAGGTCCTGACAATTATATTACAACAACCTTCGCCAGAAAAGATAAAAATGTTGAAGGTGATTTTCATTTGACGCCTAATCACGCAAAATCAGATGAGAAGGCACAACATACATTCTTAAATGAAATATGTAATCGTGTAGATTTCGAAGACGATAATAGACATCAGGTCGAAGAAAAGGAAATTTCTGTAGTAAAAGGTCACAAGCCCCTCACAAGACAAAATTCGGGGCGAGCTACGCCGACCATGATGACAGCATTTTTGAACTCGTCTCATATTGATGGGAGCAGTCGGGAAGGAGAGCAAGAAATCCACACCGCACCCAGTTCCTGTAATTCATCCCCTTCTAGCTCTCCCATAGAAGAAGGCGGTAAGACGATACCAAATGTGGGGACATTGCTAATGAATCGAAAATATTCGTGTGACAGTTCCACTCCTAAACATTCAAGTTTACCACGTCATTTGTTGAAAAATCTCGGATGTGAAAACAATACCAAACATTCCCCTCACAAATCGGTACCATCGAAGGATGTCCCAGACAGTCCTATGAGGCCTCACCCGAGAGTCTTATCTAGAGTGGCTGCGCTAGCAAATACGACCGTCCCACAATGTCCACCGACACCGACACATCACGCGAGAAGAACGCGGCCGTTGCCGCCACCAGATCTACACCCACCACCTATACAGAACTGTGAAAACTTTGAAATGGTCGAATTCACAAATGAATTAAGAACATCGGAGATTAGATCGCCTAACTTAGAGTTCGTTAATAGCAACCATTTCACTATCGTTCACGCTGGACCACCTGATGATGTGGTTTTGCGGAGACCGCAGACCGTGGACGACACTGATGATAATGAAAACGCGGTAGCATCGCCGACGCCGTTGCGGCATATGGCTGGGATAAGGCTACCCTCCATACCGGAGAGAGCGTCCAGGCAAATGGCCTTGACAGGAGATTTCCCGGCCAACATGATTGGAGGAATCATTGAATGCGAAGAACCTTTGCCACCTGGTTGGGAGGCTCGTATGGACAGCCACGGACGTGTCTTCTATATTGATCATATAAATCGCACAACAACGTGGCAGAGGCCGGCTGCCAACGGCGCAGCGCGTTCGCCTGAACCAGAAGTACAGAGACGCCAGTTAGATAGGAGATATCAATCTATTCGTCGCACGTTGACGAGGGCGCCGCCTGAGGAGGAAGAGCCCCCCGCAGGGACGTCCAACGCGCCCGCCGCGCCTCACGCCCCACATCCCGCCGCCGAGTTCCTCGCACGACCTGACTTCTACTCCATATTGCATATGAACCTGGAAGCGTCATCGCTATACAACTGTAACTCGACCCTCAAACACATGATATCTAAGATCCGTCGGGACACCAGCTCCTTCGAGCGTTATCAACACAACCGCGATCTCGTAGCTCTGGTCAACATGTTCAGTGAAACCGACAGGGAGTTGCCCTTAGGATGGGACTCCAAGCTCGACAGGAACGGGAAGCGTTTCTTCGTGGACCACGTGATGCGTCGCACCACATTCGTGGACCCCCGCCTGCCTCGGGCTCCGACCGCGGGACCGTTCTCTCCGCTGCTGCCTCCGAGGCGGAGGCCCATTATGACCGACCAGGTGACGGTCGATGCTCCCACTCCGCCGCCGAGACCTCCGATATCCACAGCCGACTCGTACTTACAGAACTCTCAGCAAGAGATACCCATAGCATACAATGATAAGGTGGTAGCATTCCTCCGTCAGCCCAACATCCTGTCCATCCTGAAGGAGCGCTGTTCAGGCTGCGGGACCGCCTTGAGGGACAAAGTTAACGCGGTTAGGGTTGAAGGAGCGTCCGCACTCGCGCGGTACCAAAACGACGTCCAGCTGACATGCCTGTTGAGGTGA

Protein sequence:

>DPOGS209575-PA
MSEEHDGDVVDSERTLDKECQTCDVTCDEGSANNSATSKESETSEVEEKGDLSEKGNEPRGSIIWTDEKTHLRVSWNLKEGTATDKDYVALCYTETTSIAGIARLVPATGCDTGHIMWLLDEPNQPYEDCEQLLCFRYYNGEEDECVAESSTLPPRFKIDLKKLPHRLKEGISRKRSGDQVSPFSYNNESFEMSSENTPRVELVSTSQDIKYLSENVNKCSISTLGNNGTLNLEQINCDHSKSLVSSSTESPSECPNILDCKPTSSGVKKKCPPPVDTGGGQVLNGVKKKTNMNIGLESPTSPDGTEYYKLWSPQNPCKIMKFVYDVEGANVPNDAQKSPVPPLPPRQSHKPLERMHALPPIVQRHRKPKKLTKPEDAFTFELIDTDEQFFTDNNIANSAALHGDINDFKPGEFYSGVQAPISSTTTFLRGCQNVAPLATPETEGSLCDSTISSIKLSKFVEDKKSVDVPDSTQNVGKSNIVFIKDIGPDNYITTTFARKDKNVEGDFHLTPNHAKSDEKAQHTFLNEICNRVDFEDDNRHQVEEKEISVVKGHKPLTRQNSGRATPTMMTAFLNSSHIDGSSREGEQEIHTAPSSCNSSPSSSPIEEGGKTIPNVGTLLMNRKYSCDSSTPKHSSLPRHLLKNLGCENNTKHSPHKSVPSKDVPDSPMRPHPRVLSRVAALANTTVPQCPPTPTHHARRTRPLPPPDLHPPPIQNCENFEMVEFTNELRTSEIRSPNLEFVNSNHFTIVHAGPPDDVVLRRPQTVDDTDDNENAVASPTPLRHMAGIRLPSIPERASRQMALTGDFPANMIGGIIECEEPLPPGWEARMDSHGRVFYIDHINRTTTWQRPAANGAARSPEPEVQRRQLDRRYQSIRRTLTRAPPEEEEPPAGTSNAPAAPHAPHPAAEFLARPDFYSILHMNLEASSLYNCNSTLKHMISKIRRDTSSFERYQHNRDLVALVNMFSETDRELPLGWDSKLDRNGKRFFVDHVMRRTTFVDPRLPRAPTAGPFSPLLPPRRRPIMTDQVTVDAPTPPPRPPISTADSYLQNSQQEIPIAYNDKVVAFLRQPNILSILKERCSGCGTALRDKVNAVRVEGASALARYQNDVQLTCLLR-