Monarch geneset OGS2.0

DPOGS211214
TranscriptDPOGS211214-TA1338 bp
ProteinDPOGS211214-PA445 aa
Genomic positionDPSCF300007 + 1092001-1094455
RNAseq coverage206x (Rank: top 46%)
Annotation
HeliconiusHMEL0124710.069.17% 
BombyxBGIBMGA003194-TA0.072.17% 
Drosophilamib1-PA2e-12749.32% 
EBI UniRef50UniRef50_Q86YT62e-12453.38%E3 ubiquitin-protein ligase MIB1 n=96 Tax=Eumetazoa RepID=MIB1_HUMAN
NCBI RefSeqXP_974870.11e-13655.58%PREDICTED: similar to mindbomb homolog 1 [Tribolium castaneum]
NCBI nr blastpgi|910833253e-13555.58%PREDICTED: similar to mindbomb homolog 1 [Tribolium castaneum]
NCBI nr blastxgi|3838647271e-13458.26%PREDICTED: E3 ubiquitin-protein ligase MIB1-like [Megachile rotundata]
Group
Gene OntologyGO:00055154.9e-09protein binding
KEGG pathway 
InterPro domain[11-155] IPR0206836e-36Ankyrin repeat-containing domain
[17-48] IPR0021104.9e-09Ankyrin repeat
[388-442] IPR0130836.9e-09Zinc finger, RING/FYVE/PHD-type
Orthology groupMCL18219 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211214-TA
ATGCTGTCCAAGTTGCCTCGCCCGTGGATAGTCGACGACGCTAAGGATGACGGTTACACTGCTCTACATCTGGCAGCATTCAACAACCACGCTGAGGCTGCCGAACTTCTGGTACGAGCTGGAGCACGCCTGGACCTTCAGAACGCTAACCTTCAAACGGCTCTGTACCTCGCCGTTGAGAGACAACACACACAAATAGTCCGCCTGCTGATAACTCATGGCGCTAATCCTAATATTTGTGATAAGGATGGTGACACTCCTCTTCATGAGGCGCTACGTCACCACACTCTACAGCAGTTACGCAGGCTACAGGACACTCGTGATACCTCCCTGTTGGGAGGCATCGCTCACTCGTACGATAAGAAGTCTTCCGCTTCCATCGCATGCTTTCTTGCAGCTAATGGCGCTGACCTGACAATCAAGAATAAGAAAGGACAGACTCCGTTGAATCTGTGTCCCGATCCGAATCTCCGCAAAACGCTAACAACATGCCGCAAAGAGAGCACTGGCGCGCCAAGTGAAAGCACACCAGAAAGCGAAGATTCTGCTACAACTGCAACATTTACTCCACAACCAGGACCTAGCACGGCTCCAACTACGGAAACTCCCACAGAAAATCCTCCAGCAGATCCCAGCACAGACGAATGTCTCGTTTGCTCTGACGCCAAACCCGATACGCTGTTCCGTCCGTGCGGGCATATTTGTTGCTGCAATGTCTGTGCAGCTAGAGTGAAGAAGTGCTTAGTGTGTCGTTCGTGCGTGTCATCAAGACAGCGTATCGGTGAATGTGTGGTTTGTTCGGAGGCTCCGGCTACTGTGATGTTCCGTCCTTGTGGCGATGTATGCGCCTGCGCACAATGCGCTCCTCTGATGCGGAAGTGCGTTGAATGCCGCACGCCGCTGCAGCCTCCTGCCGCTGCTTCTACTTCCGTCGCACCCGTAGCACCTCTCGCACCGGCTGTGCCAGCTCCGGTTGCCGCACCTGTACCCTCCCCACCTGCAACAGCCATCGTAGCCGCAGACGCCCAGCAAAATGAAGGTGGAGAAAGCAGCAATCTGGCTCAATTGCAGGTCAATAAAGGTCAACCAGCACCAGCCCCCGCCGCGCCGCATCACCTCAACAATGGCAGCCGCTCGCAACACGCACCAGCAGACGTACAGAAGTTGCAGCAGCAACTCCAGGACATTAAGGAACAGACTATGTGCCCAATATGCTTGGATCGTCTCAAGAACATGATCTTCCTCTGCGGGCACGGGATGTGCCAGATGTGCGGAGACCGTATCACCGTATGCCCCATATGCCGAAAACAAGTCGAGAAAAGAATACTGCTCTATTAA

Protein sequence:

>DPOGS211214-PA
MLSKLPRPWIVDDAKDDGYTALHLAAFNNHAEAAELLVRAGARLDLQNANLQTALYLAVERQHTQIVRLLITHGANPNICDKDGDTPLHEALRHHTLQQLRRLQDTRDTSLLGGIAHSYDKKSSASIACFLAANGADLTIKNKKGQTPLNLCPDPNLRKTLTTCRKESTGAPSESTPESEDSATTATFTPQPGPSTAPTTETPTENPPADPSTDECLVCSDAKPDTLFRPCGHICCCNVCAARVKKCLVCRSCVSSRQRIGECVVCSEAPATVMFRPCGDVCACAQCAPLMRKCVECRTPLQPPAAASTSVAPVAPLAPAVPAPVAAPVPSPPATAIVAADAQQNEGGESSNLAQLQVNKGQPAPAPAAPHHLNNGSRSQHAPADVQKLQQQLQDIKEQTMCPICLDRLKNMIFLCGHGMCQMCGDRITVCPICRKQVEKRILLY-