Monarch geneset OGS2.0

DPOGS206952
TranscriptDPOGS206952-TA1443 bp
ProteinDPOGS206952-PA480 aa
Genomic positionDPSCF300001 - 208129-212736
RNAseq coverage375x (Rank: top 32%)
Annotation
HeliconiusHMEL0108341e-3231.20% 
BombyxBGIBMGA002878-TA3e-1222.22% 
Drosophilasinah-PC1e-1122.87% 
EBI UniRef50UniRef50_D6WZ003e-1526.17%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WZ00_TRICA
NCBI RefSeqXP_320652.47e-1525.13%AGAP011871-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3583337634e-1524.41%E3 ubiquitin-protein ligase SIAH1 [Clonorchis sinensis]
NCBI nr blastxgi|2700125819e-1726.56%hypothetical protein TcasGA2_TC006740 [Tribolium castaneum]
Group
Gene OntologyGO:00056342.2e-24nucleus
GO:00065112.2e-24ubiquitin-dependent protein catabolic process
GO:00072752.2e-24multicellular organismal development
GO:00055154.3e-15protein binding
GO:00165671.3e-09protein ubiquitination
GO:00082701.3e-09zinc ion binding
GO:00048421.3e-09ubiquitin-protein ligase activity
KEGG pathwayptr:4735575e-14 
 K04506 (SIAH1)maps-> Ubiquitin mediated proteolysis
    Wnt signaling pathway
    p53 signaling pathway
InterPro domain[241-475] IPR0041622.2e-24Seven-in-absentia protein, sina
[290-471] IPR0181213.9e-15Seven-in-absentia protein, TRAF-like domain
[288-476] IPR0089744.3e-15TRAF-like
[291-345] IPR0133231.3e-09Seven In Absentia Homolog-type
Orthology groupMCL20898 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206952-TA
ATGGGAGAAAAGAATTCAAAAATTGAGTGTAACAAAGAAAATTTAAGTAACAGGAATCAAGAGAGCGGGAGGCAAAGTCATTGTACAGCAGAAGACGTCGTAAAAATTTTAGAAACTCAAAAACGGTTATATAACGAGAGACTATTGGCAGAACAAAGCAGAATATACCCCAATATACAATCAGAATTGACTTATTCACCAAGTACTTCAAATAGTTACAATATAACACCGACCGCTCCTGCAGAGATCCTTTTATATACAACTGATACCCAAACATCGGGAGCTGCTAGCCAAATTATCCCATTGCGTGAACAGCAACCTACTCAGAGACATGAATATCCAACTTCGGGCCCTTCAACATCCACTTTGGATTCACAAGAATCAAATAAACAAAGACACAATAGCAGACCAAATCAAAGAGCTGCGAGTTCCTTGCGACAATATGAATATCCAACTTCGGGCCCTTCAACATCCACTTTGGTTTCCCAAGAATCAAATAAACAAAGACGCAATAGCAGACCAAATCAAAGAGCTGCGAGTTCCTTGCAACAACATGAATATCCAACTCCGGGCCCTTCAACATCCACTTTGGTTTCCCAAGAATCAAATAAACAAAGACGCAATAGCAGACCAAATCAAAGAGCTGCGAGTTCCTTGCAACAGACAAATAGAAGAGCAATTGTTAATTGTGTTACTTGCAAGGAAAAGTTTGGTTTAAATATTTATCAGTGTCAAAATGGGCACAGCTCTTGCGAAGACTGCAAATCCAAAATGAAAAATTGCGGTACATGTTGTGAAATTATCACAAATATGAGAAACATCACACTCGAGGCAACTTTTGCCTCTAACATTGTAGATGACAAGCCAAAAAAGCCGTGTATATATAAGAGTCGTGGTTGCATATTGCATTTTCAAATGGATGATATGGAGGCTCATCTGACTGATTGTATATTCAGGGATCTACCCTGCCCTTTGACTAATTTAAATGATGCCTGTAACTGGAAAGGATGGATGAAGAATATTTTGGAACACCTCCATGACATGCATCCAGAAAAGTGTCAGGCTGAAGTTAACAAAGAAATGTCATTGCTGCTAAGCGGCTTGGATTACAAAGGTTTTCATTTAATCACCCTTGGAAATATCCCCTTTATACTACATATACAAATTGACATAACACTAAACAACATTTCCATGGCTGTACTGTGTCTCGGTACAAAAATGCAAGCTTCCAAATGGATCTATGAATTGCATGTATACCAAAAGAAAACTCCTCGTCGCAAATTCGAATACATCGACATTTGTCAGCCGTACGGGACACCAATCTGCGATATAATTACAGCCTGTAATTGTGCCATCATCAATATAGATTATGCAAAAACCTTCCTGGATGCTGGCAAGTTGACTTATAAGGTTTATATCAAGAAAAAGAATATTAATCAATAA

Protein sequence:

>DPOGS206952-PA
MGEKNSKIECNKENLSNRNQESGRQSHCTAEDVVKILETQKRLYNERLLAEQSRIYPNIQSELTYSPSTSNSYNITPTAPAEILLYTTDTQTSGAASQIIPLREQQPTQRHEYPTSGPSTSTLDSQESNKQRHNSRPNQRAASSLRQYEYPTSGPSTSTLVSQESNKQRRNSRPNQRAASSLQQHEYPTPGPSTSTLVSQESNKQRRNSRPNQRAASSLQQTNRRAIVNCVTCKEKFGLNIYQCQNGHSSCEDCKSKMKNCGTCCEIITNMRNITLEATFASNIVDDKPKKPCIYKSRGCILHFQMDDMEAHLTDCIFRDLPCPLTNLNDACNWKGWMKNILEHLHDMHPEKCQAEVNKEMSLLLSGLDYKGFHLITLGNIPFILHIQIDITLNNISMAVLCLGTKMQASKWIYELHVYQKKTPRRKFEYIDICQPYGTPICDIITACNCAIINIDYAKTFLDAGKLTYKVYIKKKNINQ-