Monarch geneset OGS2.0

DPOGS207560
TranscriptDPOGS207560-TA1584 bp
ProteinDPOGS207560-PA527 aa
Genomic positionDPSCF300072 - 796565-802150
RNAseq coverage483x (Rank: top 26%)
Annotation
HeliconiusHMEL0171402e-10069.32% 
BombyxBGIBMGA004710-TA3e-11072.65% 
Drosophilasinah-PC5e-2229.91% 
EBI UniRef50UniRef50_E9GJ223e-2829.97%Putative uncharacterized protein n=1 Tax=Daphnia pulex RepID=E9GJ22_DAPPU
NCBI RefSeqXP_002065935.12e-2229.34%GK20883 [Drosophila willistoni]
NCBI nr blastpgi|3214695801e-2729.97%hypothetical protein DAPPUDRAFT_304089 [Daphnia pulex]
NCBI nr blastxgi|3214695803e-2928.05%hypothetical protein DAPPUDRAFT_304089 [Daphnia pulex]
Group
Gene OntologyGO:00056342.5e-23nucleus
GO:00065112.5e-23ubiquitin-dependent protein catabolic process
GO:00072752.5e-23multicellular organismal development
KEGG pathwaydwi:Dwil_GK208835e-22 
 K04506 (SIAH1)maps-> Ubiquitin mediated proteolysis
    Wnt signaling pathway
    p53 signaling pathway
InterPro domain[195-492] IPR0041622.5e-23Seven-in-absentia protein, sina
[314-493] IPR0181212.7e-12Seven-in-absentia protein, TRAF-like domain
[254-336] IPR0130835.4e-07Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL25057 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207560-TA
ATGAGTAACAGAAAAGAAAACATTAAAAAATCACTAGGACTTACTGAATATCCCTGGAGTTCTGATGATGAAGAGACTCTACCTCCTTGTCAGATTGCAACAATTAGACCACCTGAAATTGTTGAGCCTAGAAGTACAACAACACTTGGGACCAGTGGGGCTGGTGTAATCCCTTCTTTTATCCCTGTAAGAAGAAACCAGGCAAGGTTGGAGCGGGAAGTCTTTTGGTCTTTGCTAAACAATCGTAGGACTATGATACCAACAGAATCCTCAAGACCCAGCACAAGCAGCACAAGTACTTGTGAGAATCCACGACCGACTGAGTCCATTGACCTCAATAGTCCAGTTTCCAACTTGCAACGAATCTTTGCCATCAAAAGACTAACGCGTACTCTTAATCCAAATCGTTGCCCTAGAGGCAACTACCTGAACGCATCCACGATATCTCCTTTTTCACCAGGTGTATTGTCTCCGAGGGTGACAGGTCGCTTGAGAAGGAGGATCCGACCGCCCTCAGACAGGACCAGTGTACCAGCCGTTATGGATGAAGAAAATCCTCCAATACAGGACGAGGGAGACGAGTCACCTGAGATTCCTTTGATGCAAACCATTCAAGATAACAATGACAATTCGGGTGAACAAGAAAATAGTCCTTCGAACACAACAGCTGTTGGTATTCCTGAGGTCGAGGAGATAGTGGATATAGACGCGGAGGCAGAAGCTAACTCGCAGGAGGGAACTGGCGATGACAACCAAGAACAGGCCAGTGATGACGAAGAACAAAATGATGCGATAAACCTCCTCCGCCTGCTGGAGTGTCCAGTGTGCCTGGAGTGGATGGAGCCGCCCATGTGTCAGTGCCGGCGCGGGCACCTGGTGTGCGGGCGGTGCCGGGCGCGTCTCGCCGCCTGCCCCGTCTGCAGGACCACCTTCTCATCAGTCCGCAACCGGGCCATGGAAGCTGTGACGGAACTGCTCCGCTACCCGTGTCGCTACGGCTGCGGTCGTGAGACTCGCCTCCGTCGCCGCGGGGTGCACGAGGCGAGCTGTGCCGCGCGCCGCTACCGCTGCCCCGCGCCGCCCTGCGCCGACCGCCCGCATTCACAATATATGAATTTCAAAAAAATTCTCCAGACCAAACACCTGTCGATGCTGAAGGTGGGTCGCAAGCACAAGTTCTCTATGAAGGTGAACACGGAACAGCATGACCACTGGCTGGTGATGGCGGTGAGGGAACTGTTCCACCTGAGGGTTGACGTGGACATACGCACCTGGGGAGTGGATGTTTATGTTGCCTATATTGGGCCCAAGTGCAATGCTGCCAAATATACGTATGAGGTTACTGTGCTAGGTCAACACAACGATAGAAAGTTAGTGTACACACGGGCGACTCACAGCGACCTGGAGAGCTCGTCGCTGAACGTGAGTCGTCAGGACTGCTTCCATTTGACTCTGGATCAGGCTTTGAACTTCCTCCGTTTCAAGAACCGTCACTGCGAACCGGACAAGTTCCTAGACTTCGTGGTCGAAATCAACAAAAGCGAGGCCGCCGTGGAAAATGTTCGGGTGGAATCGGATTCCTGA

Protein sequence:

>DPOGS207560-PA
MSNRKENIKKSLGLTEYPWSSDDEETLPPCQIATIRPPEIVEPRSTTTLGTSGAGVIPSFIPVRRNQARLEREVFWSLLNNRRTMIPTESSRPSTSSTSTCENPRPTESIDLNSPVSNLQRIFAIKRLTRTLNPNRCPRGNYLNASTISPFSPGVLSPRVTGRLRRRIRPPSDRTSVPAVMDEENPPIQDEGDESPEIPLMQTIQDNNDNSGEQENSPSNTTAVGIPEVEEIVDIDAEAEANSQEGTGDDNQEQASDDEEQNDAINLLRLLECPVCLEWMEPPMCQCRRGHLVCGRCRARLAACPVCRTTFSSVRNRAMEAVTELLRYPCRYGCGRETRLRRRGVHEASCAARRYRCPAPPCADRPHSQYMNFKKILQTKHLSMLKVGRKHKFSMKVNTEQHDHWLVMAVRELFHLRVDVDIRTWGVDVYVAYIGPKCNAAKYTYEVTVLGQHNDRKLVYTRATHSDLESSSLNVSRQDCFHLTLDQALNFLRFKNRHCEPDKFLDFVVEINKSEAAVENVRVESDS-