Monarch geneset OGS2.0

DPOGS201188
Transcript: DPOGS201188-TA, 3894 bp
Protein: DPOGS201188-PA, 1297 aa
Genomic position: DPSCF300262 + 63303-70683
RNAseq coverage: 494x (Rank: top 25%)
Annotation
Database        Best hit          E-value  Identity  Description
Heliconius      HMEL017137        0.0      72.35%
Bombyx          BGIBMGA014233-TA  2e-95    89.56%
Drosophila      CG42593-PA        1e-101   40.25%
EBI UniRef50    UniRef50_E1ZZC0   0.0      39.42%    E3 ubiquitin-protein ligase UBR3 n=6 Tax=Formicidae RepID=E1ZZC0_CAMFO
NCBI RefSeq     XP_001605178.1    0.0      41.21%    PREDICTED: similar to CG1531-PB [Nasonia vitripennis]
NCBI nr blastp  gi|345481823      0.0      41.74%    PREDICTED: E3 ubiquitin-protein ligase UBR3-like [Nasonia vitripennis]
NCBI nr blastx  gi|345481823      0.0      41.38%    PREDICTED: E3 ubiquitin-protein ligase UBR3-like [Nasonia vitripennis]
Group
KEGG pathway:
Orthology group: MCL17055, Insect specific

Nucleotide sequence:

>DPOGS201188-TA
ATGACGTACATCCAGGCCAACTTCTGCAACTCCATGGTAGACATGGACATCTACTGGCTGCAGATCTGCGCCGCTCACCTGCCGGCCGACCAGTTCCTCGACATGTGTATTGATGTTTTCGGTGTCCGCGAGTGGCTGAGCATGACGCCCATGTCAGCGAGTCAGGCCTCCGAGCAGGACGCCATGGTGGAGGGACTGCTGACCTTCCTGGCGATACTGGTGTCCTCGCGGACCAACCTCGGTAACGACGAGCTGACCCAGTCCCGCCTGGAGGTGGCGACGCTCCTGGCCGCCGGCGACAAGACGCACTCGCAGCTGCTGGAGCTGATGCCGGAGAGGTCCGGGAACGCTCACACCAGGAACTTCGAGACCGTTCTCAAAGAGCTGTCAGTGTACCGCGCGCCCCCCAAGTGTTCGGAAAGCCTGGAGCAGGGTCTGTTCGTGCCGCGGGCCGTGGTGTGGACACGCTACTACGACCCGCTCCACGTGCTGAGGAGGGCCGTGCACAGGAGGGACTTCCACGCCTCCATGGACAGATTCACAGCCTTCGTCAGGGAGAAGCAGAAGTCTGAAGGGAGTCCCGGCACCTGCACGGGGACGCTCTGGCCGCCGCTGAGACCCGCCCTGCCCCCGCCCGAGGCCGCTGGGGACCCCAGGGAACTCTGCGCCTCTGGTGTGTTACACGGAGCCCTGCTGGCGGTGCTCCACCGCGGGGTCCGCCGTCGGGCGGCGGAGGGCGCCTCCGAGGCGGAGGGGGGCGCCGGTCCCCCCGCCCCGCCAGCCCCGCCTGACCACGTGCTCGCCCTGGCCGTGTACCTGCTGGCCGTGGCCGCCGACCTCGCCGCTGAGAGGCAGAAGACACATGACCGTGACGTGTGTGTGGCGGCGTGCGGGCGGGGAGCGGGCGGCCGCGTGTCGGGCGTGCCTCTGCTGCGGGCCTTCGCCGGCGGCTCGGTGTGTGACAACGCACGGACCGTGGTGGTCCGCGTGCCCCCCACCCCGGCCGCTGCACCCCCCGCCACCACCCTACATAGGCACACCGCCCCACAACACAATCACCACTCCGACAGTGAGACCGAGTGGGAGGGCGGGTCGGGGTCGGGCGAGGCGGGCCCGGGGTCTCACAAGACGGCCTCGCTCCAGGCCGGGGCCGGCACCGCGCTCGCCGTGCCACAGTCCACACAGATGATGCGCGGCCGGTCCGCGTCCGACGACCAGTCCTCGGACGCCGGAGGAGACACGCCGGAGATAAGAGCGCTGGAGCCGCTGTACCCTCAGACGGCGGGCGGCAGCACGGCTCTGGCGCTGCCCTCGCTCCCCGCCCTGCACGACGTGAACATGGGAGACGCGCTGGCTCTCACCACACATGCCCACGAAGACATCGACCCCGATGTAGACGTGCAGTACGAGCTGGAGGCAGCAGCGTGGCAAGACACGAGGCCGGCCCTGCCCGCGCCGCCCTCGCCCATGGTCAGCACGCCGCCCTCCACTGACACGCAGCTCCAGCTGCACACTCCTACCGACGAGCAGATCCCCGTCAACGAGTCCATCATATCCCTGCTGCTGAAGCTTCACTCGCAGCTGTCGGGGCGACTCGACTCCTTCTCGCTGGAGGAGCCGGCTATAGCCTGCGACGAACCTATCGGTGACGGGCCCCACTTCATCGGTCTCCTGGTGCACAAGCTGGCCGCCCTGGACGCGCGGTGTGCCGCCGCCGTGCAGCAAGTGAGGCGCGCGCTCTGGCCCCATCAGAGAGAGAGACAGGCGGAGCAGAGAGCGAGGGAGAGGAGGGAAAAGGAAGAGCGCTCCAGGAGGGCCAGGGACAGGCAGCAGCAGCTCATGAGGGAGTTCGCCAGGAGACAACAGCAGTTTATGTCGGCGATGGAGAGCATGGAGGGCGCACCCATGGAGTGGGAGGAGGAGGTGCAGAGGGACTACGACTGTGTTATCTGCAACACCACCGCGCCCACCACCCCACACGACCCTATAGGGCTCGTTGT
ACTGCTGCAGTCGACGTCGGTGCTGGGTCATCGTCGTCGTCGCGGCAGTGGCGGCGCTCGCCTGGCGCTGTGCGAGGCGGAGCGCGCCCGTCTCGCCCAGCAGCAGCACGCCACGGCGGCCGCTCACCACTACCGCCTGCACGACGAGCTCCACCAGCACTTCGACCAGGACTCGTGGGTGCTGTCGGTGTCGGTGGGCTGGGAGGGTGGCGTGGCGGCGCAGTCCTGCGGCCATCACCTCCACCTGCGCTGCCTCCGCTCCTATCTGCGCTCGCTGGCGGCTCCTCAGCGGCCTCACAACCTCCACGTGGAGCGCGGCGAGTTCCTGTGTCCTCTGTGTCGCCAGTTAGCTAACAGCGTGTTGCCACTCGCGCCCCCCGCCGTGCCCAGAAACCCCCCGCCCGCCCCCCTCCACCACCAGCTGGCGGCACAGGTGCTCGACATGTTGGAGAGAGACCACCCGGCCCCGAGTCCGAGTCGCCTGTCGGAGGCGATGGGGAAAGCGATGGAGGACATGACGGCGACGGCGGGCGGCAAGCTGAAGCAGCGCTACGGCTCCTCGCCCGCCGCCATCTTCACGTTCGTGGCGTCCCTGGTGAGGACCAACCTGGAGTGTGAGCTGGTGCAGCGAGGCGGCTCGCTCGTCCATCAACCCGCGCCGCGGTACAAGCCTCGCGACGACTGCATAGTCCCCCTCATCGCGGTGGCGGGTGCTCACGCCGTGGCGCTGTCCGCGGCCGGCGCGCGGCTCGGCGTGTCGGAGACGTGGCGCGCGCTGGTGCCCGCGGCCTGCAGCGGCGGCGATGGCCCAGCTGGGAGCGGCTCACCGGCCACGCTGCACGGCGCAGCCTCCCGCCCGGTACCGCTGCTGCTGAGAGACCCCACCGCACTGCTCATGCACTTCCTGCTGCTCGCCCCTGCCTCTCCGCCTCACATCGATATCCAACACTTCACGTGTATCGTCCGCGTGCTGTACACGCTCACGTACTATCAAGTGGTGAACCAGCTGTGTGCGTGCGGCACACTGAGTGCGGCGCTCCAAACGGCAAGCGGACGGGACGCCGGCAGCGGACTGGCGGAGGCGGCGCGGCTGCTGCTGGGGGCTCTGGACGGACATCACCTGCTGGACGACGAGGGAACCGCCGCAGACACCGCCCACCACCTGGACGCCGTGGAGCTGGAGGTCCAGGAGCTAGTGGTTCCGTTCCTCCGTATAGCGGCGCTCCTCCGTAAGCACATGTACGGGGCGGAGCTGCCTCACATCGCCAGAGAAGAGGACGAGTTCGTGGCGCTGTTGCGGTTCCTGGATCTGGCGGAGGCGGGCCCGGCCGACGCCGCGCTGCCCTCGGACGCGGCGAGCGCGGCGCGCGGCTGGGCGCGGCAGCTGGGTGCGGCCGCCGCCGGCGGGCAGCTGGGCGTGCAGCGCGTGGTGCGGTCCCTGCACGCGGACTGGGCGCCGCCGTCGCTGCTGGCCCTGCCGCGGGACTACGACCGCCTGTTCACGTACTACCACGAGCGCGTGTGCCTGCAGTGCGGCGCCGTGCCCAAGGAGGCGTCCGTGTGTCTGCTGTGCGGGACCCTGGTGTGCCTCAAGCAGCCGTGCTGCCGCCACCGCGCCGTGGCCGAAGCGGTGCAGCACGCGACCGACTGCGGCGGCGGCACCGGCATCTTCCTGGTGGTCACGTCCACGTACATCATAGTGATCCGCGGCCGGCGGGCCTGCCTGTGGGGCTCGCTCTACCTCGACGACTACGACGAGGAGGACCGCGACCTGAAGCGCGGCAAGCCGCTGTACCTGTCGCAGGACCGGCTGGAGCTGCTGCAGGCGCAGTGGCTCGCTCACCGCTTCGACCACACCAAGCGGACCTGGGTGTGGCACCGCGACTCGCTCTGA

Protein sequence:

>DPOGS201188-PA
MTYIQANFCNSMVDMDIYWLQICAAHLPADQFLDMCIDVFGVREWLSMTPMSASQASEQDAMVEGLLTFLAILVSSRTNLGNDELTQSRLEVATLLAAGDKTHSQLLELMPERSGNAHTRNFETVLKELSVYRAPPKCSESLEQGLFVPRAVVWTRYYDPLHVLRRAVHRRDFHASMDRFTAFVREKQKSEGSPGTCTGTLWPPLRPALPPPEAAGDPRELCASGVLHGALLAVLHRGVRRRAAEGASEAEGGAGPPAPPAPPDHVLALAVYLLAVAADLAAERQKTHDRDVCVAACGRGAGGRVSGVPLLRAFAGGSVCDNARTVVVRVPPTPAAAPPATTLHRHTAPQHNHHSDSETEWEGGSGSGEAGPGSHKTASLQAGAGTALAVPQSTQMMRGRSASDDQSSDAGGDTPEIRALEPLYPQTAGGSTALALPSLPALHDVNMGDALALTTHAHEDIDPDVDVQYELEAAAWQDTRPALPAPPSPMVSTPPSTDTQLQLHTPTDEQIPVNESIISLLLKLHSQLSGRLDSFSLEEPAIACDEPIGDGPHFIGLLVHKLAALDARCAAAVQQVRRALWPHQRERQAEQRARERREKEERSRRARDRQQQLMREFARRQQQFMSAMESMEGAPMEWEEEVQRDYDCVICNTTAPTTPHDPIGLVVLLQSTSVLGHRRRRGSGGARLALCEAERARLAQQQHATAAAHHYRLHDELHQHFDQDSWVLSVSVGWEGGVAAQSCGHHLHLRCLRSYLRSLAAPQRPHNLHVERGEFLCPLCRQLANSVLPLAPPAVPRNPPPAPLHHQLAAQVLDMLERDHPAPSPSRLSEAMGKAMEDMTATAGGKLKQRYGSSPAAIFTFVASLVRTNLECELVQRGGSLVHQPAPRYKPRDDCIVPLIAVAGAHAVALSAAGARLGVSETWRALVPAACSGGDGPAGSGSPATLHGAASRPVPLLLRDPTALLMHFLLLAPASPPHIDIQHFTCIVRVLYTLTYYQVVNQLCACGTLSAALQTASGRDAGSGLAEAARLLLGALDGHHLLDDEGTAADTAHHLDAVELEVQELVVPFLRIAALLRKHMYGAELPHIAREEDEFVALLRFLDLAEAGPADAALPSDAASAARGWARQLGAAAAGGQLGVQRVVRSLHADWAPPSLLALPRDYDRLFTYYHERVCLQCGAVPKEASVCLLCGTLVCLKQPCCRHRAVAEAVQHATDCGGGTGIFLVVTSTYIIVIRGRRACLWGSLYLDDYDEEDRDLKRGKPLYLSQDRLELLQAQWLAHRFDHTKRTWVWHRDSL-
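
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not part of the original record, assuming the standard genetic code and assuming that the trailing "-" in the protein sequence marks the stop codon. The function name translate and the short excerpts are illustrative; the excerpts are the first 30 and last 12 nucleotides of DPOGS201188-TA as listed above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Excerpts copied from the DPOGS201188-TA record (start and end of the CDS).
first_codons = "ATGACGTACATCCAGGCCAACTTCTGCAAC"
last_codons = "GACTCGCTCTGA"

print(translate(first_codons))  # MTYIQANFCN
print(translate(last_codons))   # DSL-

# 3894 bp is 1298 codons; dropping the stop codon leaves 1297 residues.
print(3894 // 3 - 1)  # 1297
```

Both excerpts agree with the start (MTYIQANFCN) and end (DSL-) of DPOGS201188-PA, and the length arithmetic matches the listed 1297 aa. Translating the full sequence would require joining the nucleotide lines into a single string first.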