Monarch geneset OGS2.0

DPOGS200509
TranscriptDPOGS200509-TA1323 bp
ProteinDPOGS200509-PA440 aa
Genomic positionDPSCF300450 + 12313-19950
RNAseq coverage74x (Rank: top 65%)
Annotation
HeliconiusHMEL0176293e-9849.05% 
BombyxBGIBMGA001721-TA1e-6841.46% 
DrosophilaCG6688-PA8e-2133.70% 
EBI UniRef50UniRef50_E2B2V33e-2336.87%Probable E3 ubiquitin-protein ligase sina-like CG13030 n=1 Tax=Harpegnathos saltator RepID=E2B2V3_HARSA
NCBI RefSeqXP_002069604.11e-2027.02%GK11484 [Drosophila willistoni]
NCBI nr blastpgi|3072151501e-2236.87%Probable E3 ubiquitin-protein ligase sina-like CG13030 [Harpegnathos saltator]
NCBI nr blastxgi|3072151508e-2236.87%Probable E3 ubiquitin-protein ligase sina-like CG13030 [Harpegnathos saltator]
Group
Gene OntologyGO:00055152.7e-16protein binding
KEGG pathway 
InterPro domain[7-136] IPR0014782.7e-16PDZ/DHR/GLGF
Orthology groupMCL26407 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200509-TA
ATGGCGTCATCAGAAAACAAAGAAATGGAGGAGTCGTCAGCTGGCGTACGAATGGTTTTGATGGCGTCATCAGAAAACAAAGAAATGGAGGAGTCGTCAGCTGGCGTACGAATGGTTTTGTTAAATCTACCAGAGGATGAAGGAGAGGGTCTTGGCTTCAAATTAACGCGCACACTTTGGGATCCGTATCCGTGGATCCGTGACGTCACTCCTGAGTCCAGAGCTGCTTTGGCCGGGCTTCGAACCGGCGACTGTCTCTTACAGGCCGACGGTAACGACCTTCTCGGACTACCGATCAGCAAGGTTGCTGGTATCATCCGTGGTGACGGCGAGGGTCGGGAGGTGTCTCTTGTTGTGTGGAGTTGTGGAGTGGACCCTGATGATGATCCAGAGTTGCTGTGGTCCGGGGGAGGCGCTGGCTCTGACCGTCCCCGGCGCGCCCTGGGTGGGGTGCTCCGTTCTCTGTCCTGTGCTGTGTGTGGGGCGACCGCGGCCGCCGCCCTCAGCTGTCGCCGCACGCACCTGTACTGCGACGGTGAGATTGAGCATTTAAGTAGCCCAATGTGTACAACTGAATCACAAGCTATGTTGCTGTGGTCCGGGGGAGGCGCTGGCTCTGACCGTCCCCGGCGCGCCCTGGGCGGGGTGCTCCGTTCTCTGTCCTGTGCTGTGTGTGGGGCGACCGCGGCCGCCGCCCTCAGCTGTCGCCGCACGCACCTGTACTGCGACGGCTGTTGGAGCAGATTGGAAAGATGCGCTCTGTGCCGAGAGATACTACCGCCTAAAGACTCGCCTTATTCCAGGAACTTAGTCGCTCAGCAGGTTTTCGAAGCAATAGCAAAAGAATACGATATAAAACGTACTGGTAACAAATCACAAATCACATCTCGATCCCCATCACGTTCACCAAAAATATCACCAACCACCAGCCGTAGAGGACAGTACCATTTATCTATGATGAATAGAAACAGATTGGGAGGTGAAAAATGTTCATCGGACCCGAATATCAATAGGACACCCAAAAATATGACATTAACATCGAATAATGGCAGCACTGAAGTGAAATGTCAGATGAATATGCCAAGAGAGGCGATGAAGTGTTCTTGCAGCTGTCAAAACTTACTCCACCAGACTCTGGTGTCGAGGTTACGGGAGACCAGCTCTCTGGCTGATTTAAAGTGTTCCGTACATAATTCACTATCAAAATCTTTGAACAACGTCAGCTTGGATGGACAAAGTAGCACTGAGGCCGACCCGAGCAGTTCAATGGAAAATTTAAAGAAATCAGGTACAATTATGAAGATTCGTCTTATTAAGAATTAA

Protein sequence:

>DPOGS200509-PA
MASSENKEMEESSAGVRMVLMASSENKEMEESSAGVRMVLLNLPEDEGEGLGFKLTRTLWDPYPWIRDVTPESRAALAGLRTGDCLLQADGNDLLGLPISKVAGIIRGDGEGREVSLVVWSCGVDPDDDPELLWSGGGAGSDRPRRALGGVLRSLSCAVCGATAAAALSCRRTHLYCDGEIEHLSSPMCTTESQAMLLWSGGGAGSDRPRRALGGVLRSLSCAVCGATAAAALSCRRTHLYCDGCWSRLERCALCREILPPKDSPYSRNLVAQQVFEAIAKEYDIKRTGNKSQITSRSPSRSPKISPTTSRRGQYHLSMMNRNRLGGEKCSSDPNINRTPKNMTLTSNNGSTEVKCQMNMPREAMKCSCSCQNLLHQTLVSRLRETSSLADLKCSVHNSLSKSLNNVSLDGQSSTEADPSSSMENLKKSGTIMKIRLIKN-