Monarch geneset OGS2.0

DPOGS216100
TranscriptDPOGS216100-TA1227 bp
ProteinDPOGS216100-PA408 aa
Genomic positionDPSCF300182 - 321131-338048
RNAseq coverage1261x (Rank: top 10%)
Annotation
HeliconiusHMEL0218111e-7471.00% 
BombyxBGIBMGA009253-TA3e-4577.23% 
Drosophilagol-PD1e-8649.21% 
EBI UniRef50UniRef50_UPI0002060CAF5e-10962.06%UPI0002060CAF related cluster n=2 Tax=unknown RepID=UPI0002060CAF
NCBI RefSeqXP_966546.12e-11456.57%PREDICTED: similar to goliath E3 ubiquitin ligase [Tribolium castaneum]
NCBI nr blastpgi|910816133e-11356.57%PREDICTED: similar to goliath E3 ubiquitin ligase [Tribolium castaneum]
NCBI nr blastxgi|910816138e-11556.57%PREDICTED: similar to goliath E3 ubiquitin ligase [Tribolium castaneum]
Group
Gene OntologyGO:00055151.1e-07protein binding
GO:00082701.1e-07zinc ion binding
KEGG pathway 
InterPro domain[271-319] IPR0130831.8e-15Zinc finger, RING/FYVE/PHD-type
[275-315] IPR0018411.1e-07Zinc finger, RING-type
[275-315] IPR0189571.3e-07Zinc finger, C3HC4 RING-type
[112-175] IPR0031371.6e-07Protease-associated domain, PA
Orthology groupMCL16520 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS216100-TA
ATGTGGTCTGTGTTCCTCCTGATCGGCTTCGTCGGAGGCCAGTCTCCGAGCAGCGAGTTGGAGGCTGTTAGATACGGACCCTCGGAGGACAGGATCGCCAGTGACACCTTCACCGTAGCTGACATTAATATAACGTATAAGGACGAGCACGGGGTGTACTACACGGAAACATCTGAGTCGGGTAAATACGGCGAGGGTTTCATCGGTTCATCCCGCGGGATGGCGGTGCATGTCCGCGCCAAGGGTCCGGAAGGCGAGCGTGATCACACAGGCTGTACGTGGCCGTTGTTGTCCGTGGCCGCGCCCACGGAGCCACTGCCCACGGAGCCCTGGATAGCCGTCATCCGCCGCGGCAACTGCAACTTTGAGATTAAGGTTCAGAATGCCTGGCGGGCGAACGCCTCGGCCGTGCTCATTTACAACGACAGAGAGACCACTGTGTTGGAAAAGATGAAACTATCCGTGAATAATGGACGCAACATTAGTGCGGTTTTCACATACAAATGGAAAGGCGAGGAGATAACCCGCCTGGTGGATAACGGAACACGCGTCGTGATAGCTATCATCAAGGGACGGACCCTAACCCACATCAACAGTAACATCAACAAGACATCAGTCCTATTCGTGTCGATATCCTTCATAGTGCTGATGGTTATATCTCTCGCTTGGCTCGTCTTCTACTACATACAGCGCTTCAGATACATACACGCCAAGGATAGACTGTCGAAGCGGCTCTGTTGTGCTGCTAAGAAAGCTCTCTCCAAAATACCCGTTAGAAATCTTAAGGTCGACGATAGGGAGGTTCAAGGTGACGGTGAATGCTGCGCGATCTGCATCGAACCTTACAAAGTATCGGAGACATTAAGATCGTTACCATGCAGACATGACTTCCACAAGAGCTGTATCGACCCCTGGCTGCTGGAACACCGCACCTGTCCCATGTGCAAGATGGATATACTAAAATATTACGGATTTGTGTTCACCGGAAGTCAGGAGAGTATCCTCCAGCTGGAGGCGGAGGACGCTCGCTCCACGCTCAGTCCCGCCACCAACAGACACCCGCTCACACTTTTACAGAACGAGTCGTCGTACGAGGAGGCGGGCAGCGAGCGCAGTAGCCGAGCGGCGTCCCCCGACAGGCTCGTCAGGAACGAGCTGAATGAAGACAGGTCGTGCGTGTCGTCCCCGTCTCACGGCGACCGCTCGCCCGTTGAGCGCTGCGACTAA

Protein sequence:

>DPOGS216100-PA
MWSVFLLIGFVGGQSPSSELEAVRYGPSEDRIASDTFTVADINITYKDEHGVYYTETSESGKYGEGFIGSSRGMAVHVRAKGPEGERDHTGCTWPLLSVAAPTEPLPTEPWIAVIRRGNCNFEIKVQNAWRANASAVLIYNDRETTVLEKMKLSVNNGRNISAVFTYKWKGEEITRLVDNGTRVVIAIIKGRTLTHINSNINKTSVLFVSISFIVLMVISLAWLVFYYIQRFRYIHAKDRLSKRLCCAAKKALSKIPVRNLKVDDREVQGDGECCAICIEPYKVSETLRSLPCRHDFHKSCIDPWLLEHRTCPMCKMDILKYYGFVFTGSQESILQLEAEDARSTLSPATNRHPLTLLQNESSYEEAGSERSSRAASPDRLVRNELNEDRSCVSSPSHGDRSPVERCD-