Monarch geneset OGS2.0

DPOGS207271
TranscriptDPOGS207271-TA891 bp
ProteinDPOGS207271-PA296 aa
Genomic positionDPSCF300008 - 251278-255639
RNAseq coverage446x (Rank: top 27%)
Annotation
HeliconiusHMEL0174005e-9976.74% 
BombyxBGIBMGA012134-TA3e-14884.45% 
DrosophilaCG2681-PA2e-3530.56% 
EBI UniRef50UniRef50_B0X8882e-4536.79%Seven in absentia n=3 Tax=Culicinae RepID=B0X888_CULQU
NCBI RefSeqXP_001651488.19e-5037.29%hypothetical protein AaeL_AAEL005826 [Aedes aegypti]
NCBI nr blastpgi|1571113292e-4837.29%hypothetical protein AaeL_AAEL005826 [Aedes aegypti]
NCBI nr blastxgi|1571113297e-4937.29%hypothetical protein AaeL_AAEL005826 [Aedes aegypti]
Group
Gene OntologyGO:00056341.7e-34nucleus
GO:00065111.7e-34ubiquitin-dependent protein catabolic process
GO:00072751.7e-34multicellular organismal development
GO:00055153.1e-10protein binding
GO:00165672.1e-06protein ubiquitination
GO:00082702.1e-06zinc ion binding
GO:00048422.1e-06ubiquitin-protein ligase activity
KEGG pathwaytca:1001422821e-18 
 K04506 (SIAH1)maps-> Ubiquitin mediated proteolysis
    Wnt signaling pathway
    p53 signaling pathway
InterPro domain[18-271] IPR0041621.7e-34Seven-in-absentia protein, sina
[73-265] IPR0181212.4e-29Seven-in-absentia protein, TRAF-like domain
[84-270] IPR0089743.1e-10TRAF-like
[88-139] IPR0133232.1e-06Seven In Absentia Homolog-type
Orthology groupMCL18135 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207271-TA
ATGTCGTCAGCGTCGTCGACTACAGGAAGAAAGATGGTGTTAAGTACGCTACTAGAGACTGATCACTATGAGAGCATAATCAAAGAGCTGACTTGTACCAAATGTAAACAGTACATGAAACCTCCAATATACCTTTGCGTCGATGGCCATAGTATATGCTGTAAATGCTATGAAAAGAGTTATCAATGCCATATTTGTCTGAAAGAATTTGCTCTCATCCGTCCAGTCGTACTGGAATCTCTAGCCAATAAAGTGTTATTCCCTTGCACTAATGGGGGATGCCCAAAACATGCAACACTTCCAGTATTGGAAAAGCACACACCTCACTGCCAATTCCGCATAATTAACTGTTTCATGGCTAGAGTCTATGGTAATTGTGCATGGGAGGGTCGGGCCGGCGAATGGATGGACCATTGCTTTTTGGAGCACAAACAAAAGGTGACCGAGTTGCCTTTTATCACTATTAAAGATAAATGGGATGCTAAACGGACAGAGCCTGTTCTCAACTATTTTTTACTGAAATGCTTTGAAAAGATTTTTAACGTCTACCAGATATACGACAAAAGAGGTGGTAGAATGATGTGGACTGTTCTTGTGAATGATGAACATGCTGATAAATTTTATTTTGAAGTGGATATTTTTTTACCAAATCTTCCTTGCAAAAGGATTGTTTATAGACGGCCTTGTAAGTGCGAAAAAGATGCTGATTTTCTCGAGCACACTCAGAACGTCTACATTCCCGTTGAGAACGTGTTTTCAATGCTGGACGAGGATGAATCACTGAATTTCACTGTTAGAATTGGCGAGGTTGAAAACTTGCCCCTACTAGAAACGCCCACGGCGAGCGAAAGCTTGATACTACTGAACAACGATGAACATAAAGAGGATTAA

Protein sequence:

>DPOGS207271-PA
MSSASSTTGRKMVLSTLLETDHYESIIKELTCTKCKQYMKPPIYLCVDGHSICCKCYEKSYQCHICLKEFALIRPVVLESLANKVLFPCTNGGCPKHATLPVLEKHTPHCQFRIINCFMARVYGNCAWEGRAGEWMDHCFLEHKQKVTELPFITIKDKWDAKRTEPVLNYFLLKCFEKIFNVYQIYDKRGGRMMWTVLVNDEHADKFYFEVDIFLPNLPCKRIVYRRPCKCEKDADFLEHTQNVYIPVENVFSMLDEDESLNFTVRIGEVENLPLLETPTASESLILLNNDEHKED-