Monarch geneset OGS2.0

DPOGS206791
Transcript: DPOGS206791-TA, 1692 bp
Protein: DPOGS206791-PA, 563 aa
Genomic position: DPSCF300001 - 4791613-4827851
RNAseq coverage: 55x (Rank: top 69%)
Annotation
Heliconius: HMEL008469, 7e-92, 71.95%
Bombyx: BGIBMGA000574-TA, 7e-76, 63.52%
Drosophila: sowah-PG, 5e-46, 33.26%
EBI UniRef50: UniRef50_UPI00020629E3, 3e-47, 45.60%, UPI00020629E3 related cluster n=1 Tax=unknown RepID=UPI00020629E3
NCBI RefSeq: XP_001604980.1, 1e-79, 34.80%, PREDICTED: similar to LD31582p [Nasonia vitripennis]
NCBI nr blastp: gi|328777922, 3e-92, 38.53%, PREDICTED: hypothetical protein LOC410299 [Apis mellifera]
NCBI nr blastx: gi|328777922, 4e-95, 38.26%, PREDICTED: hypothetical protein LOC410299 [Apis mellifera]
Group
KEGG pathway 
InterPro domain: [273-389] IPR020683, 3.8e-16, Ankyrin repeat-containing domain
Orthology group: MCL13311, Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206791-TA
ATGTCCCATCCAGCAGAATTAAGTTTTGACGAAATACTTAAGTTTATGCTAGCAAATAATGGAAAAGTTACAAACCATGAGTTAGTGAAACATTTCAAAGTGTTTTTAATGAATCCGGACATGAGAGATGAAGCCCGAAACACTTTTAAAAAGCATGTTAATGCTTTAGCCATAATTAAAAACCAAAACAATGAGAAGTGGTTAATTTTAAAAAAGAAATATTTAAATAATCCTGTCAAACAAAATGAGGAGGTTGTGGAATCAAAAATTACTGAATTGCCTGTGGTACCAAATGTATCTAACATGGAAACAGAATCTGTTCCTCAGTCAACATATAAGCATCCACCACCATTACAACTTAACCAAGATTTCAATATCTTAGCAAATATTATACAAGACTCATCAGCAGCTGCAACACCAACACAAGCATCAGAAATACCCTTAGAAATTCCAGTCAGTGAAAGCAAAGAGTCTTTGACAGCTGTGGAGGAAACCCCACCCAAAGTTCATCCACGAAGAAAATCTTCTGATAAAATTTTAGCAGAGAAAAGATCAAGTGTTGCTAGCCTAAATTTAGGGTCTCGTTCATCGATACCAAGTCAAGACCTCTCAGAATTAAGTGAGAAGTCCACCTTAACGTTATCATCTTCTAGAAGTGAAAGTATGTTAATTGACCATGAACAAAAAATATCTGTTAAAGAAAGGAAGCAAATGTTTAACAGAATGGCATCGGAGAGTGATGTTCTCAAGACGCAAAAATTGAGCTTTAATAATTCGAGTGTTGACGAAGAAGACAGAGCGTCGCTTGAACAAAAGGAAACGGATCCATTGGATTCCAAGCAGAAACAGTGGATCCTATGCGCTGCGAGGGGCGAGTACCATTCTCTTGCCAAAATGTGCAAAGAGAACGCCAAATTAGTTCGTACAAAGGTAAGTTACTGCTGTTATACCGCAATGCATTGGGCTTGTAAAAGAGGGGATGAAAATTTGGTGAAGCTGCTCGCTGGCATACATCGACACATAGTGAACGAACGTTCGGGCTACACACCATTACACATTGCGATGCAGTTCAGACATGAAAACGTCTATAGACTCCTGGTCGAAATGTACGATGCTGATCCAAATATGAGAGATTGGTCCGGTAAAAAGGCGCGACAATATCTTGTGCATATGGATACGTCCCTGTCACCAGGGTCTTATAGAAAACCGGATACGAATGTCGGTCGTAGTGTTACTTCGCAACCTTCTGTGAAGGTTCAGAAGAGTTATGTTCAACAATTGAGCAAGAACGAAAAGGAAGGTTTCTTACGCATCGGCTCCTTAAACGTGCGCGTCAAGAAGACTACAGAGGCATTCAGTAACTTCCTGGGTGTTGGCGCTACAAGGACGGCGTACGTTCATAAACGAGCTGATGTCGAGAGACGGTCAGATGACGGTGAACTACACAAATCATGGGGTTCCGCAGATAATATACAGAAGGATGATAAGTCTATGCCACCTCCGCTGAGCAGTAAAGTGCGTCGTCGGGGTGCCAGCGGCCGAAGAGGAGTTGCAAGTCACAGTAGAAGCACGCCGTCTACACCAGACCAGCCACGTGCGCAGATAGGTCTAAACGAAGAAGGTGACTCGGACTCCGACACTGCAGCTGGTTTCCATTCAGCCTGGAGGCAGCAGAGGTCGTCAAACCATTAG

Protein sequence:

>DPOGS206791-PA
MSHPAELSFDEILKFMLANNGKVTNHELVKHFKVFLMNPDMRDEARNTFKKHVNALAIIKNQNNEKWLILKKKYLNNPVKQNEEVVESKITELPVVPNVSNMETESVPQSTYKHPPPLQLNQDFNILANIIQDSSAAATPTQASEIPLEIPVSESKESLTAVEETPPKVHPRRKSSDKILAEKRSSVASLNLGSRSSIPSQDLSELSEKSTLTLSSSRSESMLIDHEQKISVKERKQMFNRMASESDVLKTQKLSFNNSSVDEEDRASLEQKETDPLDSKQKQWILCAARGEYHSLAKMCKENAKLVRTKVSYCCYTAMHWACKRGDENLVKLLAGIHRHIVNERSGYTPLHIAMQFRHENVYRLLVEMYDADPNMRDWSGKKARQYLVHMDTSLSPGSYRKPDTNVGRSVTSQPSVKVQKSYVQQLSKNEKEGFLRIGSLNVRVKKTTEAFSNFLGVGATRTAYVHKRADVERRSDDGELHKSWGSADNIQKDDKSMPPPLSSKVRRRGASGRRGVASHSRSTPSTPDQPRAQIGLNEEGDSDSDTAAGFHSAWRQQRSSNH-
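
The transcript and protein records above can be cross-checked against each other. The following is a minimal sketch, not the method used to build the gene set: it assumes the standard genetic code applies to this nuclear gene, and the function name `translate` and its `stop_symbol` parameter are illustrative. It follows the record's convention of writing the stop codon as `-`, and uses only the first ten codons and the last five codons of DPOGS206791-TA as input.

```python
from itertools import product

# Standard genetic code (assumed here), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence in frame 0; stop codons become stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 30 nt of DPOGS206791-TA: should match the start of DPOGS206791-PA.
print(translate("ATGTCCCATCCAGCAGAATTAAGTTTTGAC"))  # MSHPAELSFD

# Last 15 nt of DPOGS206791-TA: should match the end of DPOGS206791-PA.
print(translate("TCGTCAAACCATTAG"))  # SSNH-

# Length check: 563 residues plus one stop codon account for 1692 bp.
print(3 * (563 + 1) == 1692)  # True
```

Passing the full nucleotide sequence to `translate` should give the 563 residues listed above followed by the trailing `-`.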