Monarch geneset OGS2.0

DPOGS204456
Transcript        DPOGS204456-TA   1422 bp
Protein           DPOGS204456-PA   473 aa
Genomic position  DPSCF300002 + 358251-362109
RNAseq coverage   1788x (Rank: top 7%)
Annotation
Heliconius      HMEL006240               6e-164   74.81%
Bombyx          BGIBMGA007800-TA         7e-112   58.71%
Drosophila      sina-PA                  6e-09    40.32%
EBI UniRef50    UniRef50_UPI00015B4D54   3e-24    29.82%   UPI00015B4D54 related cluster n=1 Tax=unknown RepID=UPI00015B4D54
NCBI RefSeq     XP_001605807.1           6e-25    29.82%   PREDICTED: similar to seven in absentia, putative [Nasonia vitripennis]
NCBI nr blastp  gi|156543152             1e-23    29.82%   PREDICTED: E3 ubiquitin-protein ligase sina-like [Nasonia vitripennis]
NCBI nr blastx  gi|156543152             3e-24    30.00%   PREDICTED: E3 ubiquitin-protein ligase sina-like [Nasonia vitripennis]
Group
Gene Ontology    GO:0005634   8.4e-13   nucleus
                 GO:0006511   8.4e-13   ubiquitin-dependent protein catabolic process
                 GO:0007275   8.4e-13   multicellular organismal development
KEGG pathway     dwi:Dwil_GK20883   1e-07
                 K04506 (SIAH1) maps-> Ubiquitin mediated proteolysis
                                       Wnt signaling pathway
                                       p53 signaling pathway
InterPro domain  [18-73] IPR004162   8.4e-13   Seven-in-absentia protein, sina
Orthology group  MCL18524   Insect specific

Nucleotide sequence:

>DPOGS204456-TA
ATGGTGGAAAAGGCTGAGGTGAAGCCGGAATCACTTCTCAATCTAGATGATTTGTTACAATGTCCAGTGTGCTATGAAATACCATCAGGACAAATATTTCAGTGCAATGAGGGTCATCATGTATGTGGACGTTGCAAGATGCGCCTTGATGTGTGCCCAGTATGTCGGGCACTCTTTTTTGGCACAAGAAATTATGCTATGGAAGAATTGATTGCCAATTTTAGAAAATTACGTGCTTTTAAACTTGGTGCTAAACCTACAAACGGCTCGGGTTCTTCAGAAAGCAGTACACCAGCTAAAGACACAACCACAGGGGAATGTGAAAATGATGTTAATGAGGAAGATGAAGAAAATAATCTCAATTTACCTAATCAAGCTCCACTGAGACCTCCTCCGGCATGCAAAGGGCTCTTCCGTTGCCTTTGTTGCAAAAGTGGACAAGGGGAAAGGTTACCAGCCGCCCGCCTATTGAACCATCTCCGTTATTTTCATGCTCCAGATCTGCTTGAGGGTCGGACTGAGAATGGCGAGTATTTGCAAGCATGGCAATTTTCTACAGTCCCTGGCAAACTTGTTACTGCCGTTAGAGTAGCTGATATGGGCATATTCTTTTTGACAATAGAAATAAGTAGTGATTCAGTGTGTGCATGGTTAGCAATGGCCTCCTCACCTTGGGTAGCTCACAATTTTAATTATACAGTTACCATCTGTGGTAACGATCGTGAAGCTATATTTTCCGATTGTGTCTGGTCCGTAAGGTCTTGCGAAGGGTCGTTGAAGAAGCGAGGTCACTGCCTTATAGTTCGTGATTTAGACGCGCGTGCTCTCGTCGCTCCTAGTACTATTAATGGCAAGCTAAGCATTCGTCGCACGCCGGCAGACCAGCTCGCCAATCAGTCACAACCGCGCGCCGTACTTCGCATAGCGAACAGAGTCAACCAGAACGCGACGAACAATCTGGAACCCTTCTTACAAGACCTGCAAAACGACGTGGCCCGCTTATCCCGCGCTTTCGCCACTTTGGGCAGGGAAGCGAACGCGTTGGTACGCGAGGCGGAAATGGGAGCGAGGATCGAAGACTTGCCAGCGCCGACAGCCGAACAGAACTCCGACGATCGTGCCTCGGAATCTTCCGAGATTGGATCTCAACCGGCGCCAGGCGAGGCCGTCAATGCCGTGCAGCACCTGTCTCGTAACGCTCGCAGGCGTATGCGTCAGCGGTTACGACTGGCTCTTAACGGGCCGATCCAACCGCTGCCGTCACCTCCAGCTCGCGTCACGGGTCAGATGAACCACCTGCCGAACGGTTACACGAACAACAGCATGTTCTTGACGCCCTCCGACTCGCCTCCGTCCTCCTCGAGCGGCCCGTCCCAGCCTCCCGCCTCTCGCAATAAGAGGAAACGCCGCCATCGCAGATAA

Protein sequence:

>DPOGS204456-PA
MVEKAEVKPESLLNLDDLLQCPVCYEIPSGQIFQCNEGHHVCGRCKMRLDVCPVCRALFFGTRNYAMEELIANFRKLRAFKLGAKPTNGSGSSESSTPAKDTTTGECENDVNEEDEENNLNLPNQAPLRPPPACKGLFRCLCCKSGQGERLPAARLLNHLRYFHAPDLLEGRTENGEYLQAWQFSTVPGKLVTAVRVADMGIFFLTIEISSDSVCAWLAMASSPWVAHNFNYTVTICGNDREAIFSDCVWSVRSCEGSLKKRGHCLIVRDLDARALVAPSTINGKLSIRRTPADQLANQSQPRAVLRIANRVNQNATNNLEPFLQDLQNDVARLSRAFATLGREANALVREAEMGARIEDLPAPTAEQNSDDRASESSEIGSQPAPGEAVNAVQHLSRNARRRMRQRLRLALNGPIQPLPSPPARVTGQMNHLPNGYTNNSMFLTPSDSPPSSSSGPSQPPASRNKRKRRHRR-
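
Consistency check (illustrative):

The lengths listed for this record can be cross-checked with a little arithmetic. The sketch below is a minimal example, not part of the geneset itself. It assumes that the transcript is a complete coding sequence whose final codon is a stop codon (the nucleotide sequence above starts with ATG and ends with TAA), and that the genomic coordinates are 1-based and inclusive. The function names are hypothetical, chosen only for this illustration.

```python
def expected_protein_length(cds_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming the last codon is a stop."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return cds_bp // 3 - 1


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1422
protein_aa = 473
gene_start, gene_end = 358251, 362109

# True when the listed transcript and protein lengths agree.
print(expected_protein_length(transcript_bp) == protein_aa)
# Genomic span covered by the gene model on scaffold DPSCF300002.
print(span_length(gene_start, gene_end))
```

If the genomic span is longer than the transcript, the difference would correspond to intronic sequence, provided the coordinate assumption above holds.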