Monarch geneset OGS2.0

DPOGS206462
TranscriptDPOGS206462-TA3081 bp
ProteinDPOGS206462-PA1026 aa
Genomic positionDPSCF300070 + 48329-59090
RNAseq coverage625x (Rank: top 21%)
Annotation
HeliconiusHMEL0129350.076.19% 
BombyxBGIBMGA005338-TA0.080.54% 
DrosophilaCG11070-PA0.039.08% 
EBI UniRef50UniRef50_D6WFJ10.046.34%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WFJ1_TRICA
NCBI RefSeqXP_966451.10.046.34%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastpgi|910796600.046.34%PREDICTED: similar to predicted protein [Tribolium castaneum]
NCBI nr blastxgi|910796600.045.56%PREDICTED: similar to predicted protein [Tribolium castaneum]
Group
Gene OntologyGO:00344501.8e-152ubiquitin-ubiquitin ligase activity
GO:00065111.8e-152ubiquitin-dependent protein catabolic process
GO:00001511.8e-152ubiquitin ligase complex
GO:00165671.8e-152protein ubiquitination
GO:00048421.2e-30ubiquitin-protein ligase activity
KEGG pathwaytca:6549160.0 
 K10596 (UBE4A)maps-> Ubiquitin mediated proteolysis
InterPro domain[281-928] IPR0194741.8e-152Ubiquitin conjugation factor E4, core
[945-1017] IPR0036131.2e-30U box domain
[946-1006] IPR0130835.5e-22Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL13536 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206462-TA
ATGTCTGAACCAAATAATCCGTTTGTGGGTCTGCTGGGTGTCGCAGAAGCTAAGTCAACAGAGGGATCTAATTTTGATTCGACCCAAATATCAATAGACGTACATAATGTACAAGGTGCCGAACAGCTTCTTGTAATTAACAATATAATTGAGAATGTTTTCTTTTTCACTGTGAACCCTAATGCTGCGGATAAAAGTTCTGAAAGGCAATTGGTATACCTCGAAGAACTTGCCCAGGCTATGAACCCCAGAATACATATTGATTTAGAAGCTCTCGAGCAAGCCCTTTTTGAAAGGCTATTACTACAAGATGTAGAGCAGCAAGTTATACCCAAAGGAAGTACAGTATTTAAAGAGCATGTTATCCAGAAACAAGTATTTCCATATTTATTTAGCTCGATGGAAAACATCAACAGTTACAGTCATATTAACACACCTTATGTCCAAGATGCTTTAAATAGGATGAAGGAATTGATATTCCGGAATGCAGTCACAGCATTAAAACAGCCTGCCCTTTTTGAGGGTCAGGATTTTGCTGAGCAACTTGTAGAAATATTACGACAGATTGTTCAACAAAGTCCGACATTTTTCATAGACTTGGTAAAATCATTTGTAGCTGAAGGAGATCGTGACTGTAAAAAACAGCTGAAAGACACTATGATTCCAGTGTTAAGGAAAATTTACATAGATGTCAATAAATCAAATTTAATTAACCTGCCCATTTATGTCCTACCATCGGTACAACTATTTGCAAGCGATCCAAATTTAGCTCCAATATTAATGGATGCTTGTGACCCAAAAAACGAAATGAGCGGAAGGTTCTTTCAAGATAATATTATGGGAGGTCTTTTGGCACTATCAGTACTACCCCGCTCCAACAGCGGCCTGCCGGATTATTTTGATAACCCTATGGACCAGGCTGCGACATCACTAATCGAGTCATCACTGTGGAATGCAACATCACATTTAACTAATTATATGCACAAAATATTTTTGAGCTTACTAAAAGGCGGACCGGAGTTGAAAAATCGTTTGTTGACATGGATCGGAAAATGTTTAAAATATAATTCTCCTAGAGGGAAACTTTGGAATGTTCAGACAAGTGACATAGGTTTGACAAATTGTGTCTCCGATGGATTCATGCTGAATCTCGGTGCTGTTTTACTACATCTGTGTCAACCGTTCTGCAACACAGCTGACGATCTCAAGGCACTCAAGATTGATCCAACGTATGGTGCTGTGTCTCCGGAGGAGGCTGCCTCCAAGTCTGTCCACCTAAGTCTTCACAACGAGACCTGCCTACTACCAGCTCGGGAAACCGACGACGGAACGCCGATCAAACGCCCAACAGCTGAGACATACAACTTCGTCACGGAATGCTTCTTCATGACGCAGAAGTGTATCGATTTAGGTGTCCGTGTGTGTGCTGAGAAAATGTGGCGTATAGGTCAGGAAGTGGGTCGAGCTCAGCGCGCGATGTCTGATGCTGGACCGGCTAGGATAATGGAATCGCTGAGACAGAGAGCAACATACCTTATGACGAAATTCGTGACATTCCGCTGTGGTCTCCTGGAGAAGAAGATGCTAGCCAACCTACACCGTCTGCAGGCCACCACCTGCACGTGGCTGGTGCAGGTAGCGGCACGTGCTACCACCGTCGGCAACTACGCTCCAAACACCATGATGCAAATAGACATGCCAATAACCACCCCACCGCCAGATACGTTGAGATGCATTCCAGAGTTTGTATTGGAGAATGTTGTTGTGTTGATAACAATGTCCCGCCGTACTGTGGGCGCTATCACCGACGACGCGGACATGGCCGGGTTATTACAACCCGCCCTCACACTTGTATTGACATTCATGGGTGATTCGACACGGACTTACAACCCGCATCTGAGGGCACGTTTGGCTGAATGTCTGGAAGCAATGTTGCCAAACCATCCTGATGATCAGCAACCACTCAGCAACATCGCCTCCTTCTACAGGGAACAGTTGTTTAAGGAACATCCACACAGACTTCAGCTGGTGACATGTCTTCTGGATGTTTTCGTCGGCATTGAGATGACCGGGCAGAGCGTTCAGTTCGAGCAGAAGTTTAACTATCGCCGACCGATGTACCTCGTCATGGACTTCCTGTGGGGCATAGAGGAGCATAGAGAGGCCTTCACACGTCTCGCAAGAGAAGCTGAGGCGAATATGGAAGCTGTTCATCCACCGATTTTCCTTCGTTTCGTGAATCTATTAATGAACGACGCTATCTTCCTACTGGACGAGGCCCTTGGAAATATGGCTCAGTTAAGGAACATGCAAACGGCACAAGAAACCGGCAGATGGCTGAACCTATCAAGCGCTGAACGTGAGCAGAATTTAGCCAACATGTCCCATACTGGTATGCTAGCGAGGTTCGATAACATACTTGGGCGGGACACTGTACGCACGCTCGTCAAACTAACATCACACGCCCCATACGTGTTCTGTCATCCGACGCTGGTTGAACGCATCGCATCCATGCTGAACTACTTCCTCCTACATCTAGTCGGACCAAATAAGAAGAACTTTAAGGTGAAAGACATGAAGGACTACGAGTTCGAACCTGCGAACACTGTGCTGGACATCTGTCGTATGTACGTGCAGCTCGGCAGTAACGAGAGGTTCTGTGCCGCCGTGTCAGATGATGGAAGGTCATACTCGCCTCAGCTGTTCAAATTGGCTGAAGATGTTCTAGTACGCATCGGCGGTGGAGGTCTCATAGCATCTCTCCAGGAAGTGGCATCCCATGTAAGCATACTAGCTGAACAACGTCAACGTGATGAAGAGATCCTCGCCAATGCTCCCGAGGAATTCCTGGATCCCATAATGAGCACCATAATGAGGGATCCCGTCATCTTGCCCAGCTCAAGGACAACAGTCGACAGAACCACCATAGCCAGGCATCTCCTAAGCGATCAAACAGACCCTTTCAATCGATCTCCGCTGTCAATGGATCAAGTGAAATCTAACACGGAACTTAAGGAGCGTATAGAAGCGTGGATTGCAGAACAGAAACAAAATATTACGAAACCTGATACTGCTATGTAA

Protein sequence:

>DPOGS206462-PA
MSEPNNPFVGLLGVAEAKSTEGSNFDSTQISIDVHNVQGAEQLLVINNIIENVFFFTVNPNAADKSSERQLVYLEELAQAMNPRIHIDLEALEQALFERLLLQDVEQQVIPKGSTVFKEHVIQKQVFPYLFSSMENINSYSHINTPYVQDALNRMKELIFRNAVTALKQPALFEGQDFAEQLVEILRQIVQQSPTFFIDLVKSFVAEGDRDCKKQLKDTMIPVLRKIYIDVNKSNLINLPIYVLPSVQLFASDPNLAPILMDACDPKNEMSGRFFQDNIMGGLLALSVLPRSNSGLPDYFDNPMDQAATSLIESSLWNATSHLTNYMHKIFLSLLKGGPELKNRLLTWIGKCLKYNSPRGKLWNVQTSDIGLTNCVSDGFMLNLGAVLLHLCQPFCNTADDLKALKIDPTYGAVSPEEAASKSVHLSLHNETCLLPARETDDGTPIKRPTAETYNFVTECFFMTQKCIDLGVRVCAEKMWRIGQEVGRAQRAMSDAGPARIMESLRQRATYLMTKFVTFRCGLLEKKMLANLHRLQATTCTWLVQVAARATTVGNYAPNTMMQIDMPITTPPPDTLRCIPEFVLENVVVLITMSRRTVGAITDDADMAGLLQPALTLVLTFMGDSTRTYNPHLRARLAECLEAMLPNHPDDQQPLSNIASFYREQLFKEHPHRLQLVTCLLDVFVGIEMTGQSVQFEQKFNYRRPMYLVMDFLWGIEEHREAFTRLAREAEANMEAVHPPIFLRFVNLLMNDAIFLLDEALGNMAQLRNMQTAQETGRWLNLSSAEREQNLANMSHTGMLARFDNILGRDTVRTLVKLTSHAPYVFCHPTLVERIASMLNYFLLHLVGPNKKNFKVKDMKDYEFEPANTVLDICRMYVQLGSNERFCAAVSDDGRSYSPQLFKLAEDVLVRIGGGGLIASLQEVASHVSILAEQRQRDEEILANAPEEFLDPIMSTIMRDPVILPSSRTTVDRTTIARHLLSDQTDPFNRSPLSMDQVKSNTELKERIEAWIAEQKQNITKPDTAM-