Monarch geneset OGS2.0

DPOGS214391
Transcript: DPOGS214391-TA | 1389 bp
Protein: DPOGS214391-PA | 462 aa
Genomic position: DPSCF300069 - 616128-619132
RNAseq coverage: 747x (Rank: top 17%)
Annotation
Heliconius: HMEL006484 | 4e-131 | 77.10%
Bombyx: BGIBMGA011252-TA | 0.0 | 74.09%
Drosophila: Ank2-PU | 2e-26 | 33.11%
EBI UniRef50: UniRef50_E2B8V5 | 2e-165 | 63.71% | Ankyrin repeat domain-containing protein 17 n=19 Tax=Pancrustacea RepID=E2B8V5_HARSA
NCBI RefSeq: XP_002422975.1 | 2e-173 | 64.94% | conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastp: gi|242004106 | 4e-172 | 64.94% | conserved hypothetical protein [Pediculus humanus corporis]
NCBI nr blastx: gi|242004106 | 2e-166 | 64.94% | conserved hypothetical protein [Pediculus humanus corporis]
Group
Gene Ontology: GO:0005515 | 3.9e-05 | protein binding
KEGG pathway:
InterPro domain: [25-334] IPR020683 | 9.6e-45 | Ankyrin repeat-containing domain
Orthology group: MCL17013 | Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214391-TA
ATGCCGTCGGAGGTGATAACTGACAAGTCTCTTCAGCGCGAGCTGGCCGACTCCATCATCAGGATGGTGCCTTTGGATGAAATACGTATCCTACTGGCGTGTGGAGCGAAGGTCAACGAGCCAGTCACGCAAGGTCTCCGCCCTCTGCATTACGCTATCTGGCAGCGCAACCTGGAGGCGACTCGTCTCCTCCTGGTCCGAGGCTGTGATATCAACGCCACAGACGACTGTGGTTACAGCGCATTACATCTTTCTGCCGAACATGGGTATACAGAATTAGTAAAACTTCTGCTGGAGTCAGGGGCAGCTGTCGACTACAGACCGGACACTGGCGAGGAGTTCCCCAGGACCACCTTATGTGACGAACCTCTGAGGTTGGCCATAAGGAACAAGCATTACGCCGTGGCTCGTCTTCTTCTTGAACACGGGGCTGACCCTAACAAACGTTACTTCTTCGGCTCCGAGATTAATCTCGTATCGGATCCCGAATATTTGGAGCTGTTGTTGACCTTCGGAGCCAACCCTGACTCCAGGGACAGAGCTGGTCTGACACCGCTGATGAAGGCTGCCAGGCAAAGAAAGGGTATAGAGTCAGTGCTGTTGTTGATCAGCTCTGGAGCTGATGTGAACGCGGCGACGGACGCTCGCAGTGACTACAGGACTGTCATGCACTATGCTGTGCTCGGGGGCTGTACGGACGTGGTTAATCTATTAATAAAGCAAGGAGCGAGGGTGAACTATGATCCTGACTACAACAAGCCCAGTCCCCTCGACCTCGCGATACTCAAGGGAGACGTCGACATGGTCAAGATGTTGATAGCGGCTGGAGCTAAGGTGAATTCCTCCAGCTCTGTGATCGGCACTCCGCTACACGTCGCCTGTTCTGACGGCATCTCGCAGAGAAAGGAGTTAGTCAGGATCCTCCTCGAGTCTGGTGCGGATCCCAACCTGAAGGTGTACAATGAGGATGACGGCGCTCAGCTGCGACCGGCGCTGGCGGAACTGCTCGCTGGTGACCTGCAACCCTGCACGGACACCGTCAGGCTGCTGATGAGATATGGAGCGAGGGTTATAATGAAAACTCAGTTCCGAGACCCGGATGGTATACTGAACCACCTCCAGAACGTGACTTCAGTGGAGTCCCAGCACATCTTCTATCTCCTTCTAGAAGCTGCCGAAGCCTTCGACTTGTGTATGATAAAAAGAAATAACATCCTGCAACCAAAACAGAAACAGGCGTTGATAGACCGCGCAAAAACACCCATCAGCCTGTTGGCTCAAGCCCGGATATTTTTCCGGAGATTCTTCGGTCCAACATTGATTCATGCGATCAAAACCTTCGAAATACCCAAAACTCTCCAAAGATATCTACTTTTCGAGTACAGTTAA

Protein sequence:

>DPOGS214391-PA
MPSEVITDKSLQRELADSIIRMVPLDEIRILLACGAKVNEPVTQGLRPLHYAIWQRNLEATRLLLVRGCDINATDDCGYSALHLSAEHGYTELVKLLLESGAAVDYRPDTGEEFPRTTLCDEPLRLAIRNKHYAVARLLLEHGADPNKRYFFGSEINLVSDPEYLELLLTFGANPDSRDRAGLTPLMKAARQRKGIESVLLLISSGADVNAATDARSDYRTVMHYAVLGGCTDVVNLLIKQGARVNYDPDYNKPSPLDLAILKGDVDMVKMLIAAGAKVNSSSSVIGTPLHVACSDGISQRKELVRILLESGADPNLKVYNEDDGAQLRPALAELLAGDLQPCTDTVRLLMRYGARVIMKTQFRDPDGILNHLQNVTSVESQHIFYLLLEAAEAFDLCMIKRNNILQPKQKQALIDRAKTPISLLAQARIFFRRFFGPTLIHAIKTFEIPKTLQRYLLFEYS-
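
The transcript and protein records can be checked against each other: a 462 aa protein plus one stop codon corresponds to (462 + 1) x 3 = 1389 bp, which matches the transcript length listed for this gene. The following is a minimal sketch of that check, assuming the standard genetic code and assuming that the trailing "-" in the protein record marks the stop codon. The function names `translate` and `expected_cds_length` are hypothetical helpers written for this illustration and are not part of any database tooling. Only the first and last few codons of the transcript are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
# "*" marks a stop codon in this table.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence in frame 1 (hypothetical helper).

    Stop codons are written as `stop_symbol`; "-" is used by default on the
    assumption that this is what the trailing "-" in the protein record means.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


def expected_cds_length(protein_length: int) -> int:
    """Length in bp of a CDS encoding `protein_length` residues plus one stop codon."""
    return (protein_length + 1) * 3


# First 30 nt and last 12 nt of DPOGS214391-TA, copied from the record above.
first_codons = "ATGCCGTCGGAGGTGATAACTGACAAGTCT"
last_codons = "GAGTACAGTTAA"

print(translate(first_codons))   # MPSEVITDKS
print(translate(last_codons))    # EYS-
print(expected_cds_length(462))  # 1389
```

Passing the full nucleotide sequence to `translate` and comparing the result with the DPOGS214391-PA record would extend the same check to the whole gene.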