Monarch geneset OGS2.0

DPOGS206360
TranscriptDPOGS206360-TA2022 bp
ProteinDPOGS206360-PA673 aa
Genomic positionDPSCF300082 + 1242858-1253586
RNAseq coverage386x (Rank: top 31%)
Annotation
HeliconiusHMEL0171220.068.65% 
BombyxBGIBMGA014573-TA1e-11175.63% 
DrosophilaCG15118-PC1e-14843.21% 
EBI UniRef50UniRef50_E2BSX31e-16050.00%Ankyrin repeat domain-containing protein 13B n=15 Tax=Coelomata RepID=E2BSX3_HARSA
NCBI RefSeqXP_967604.24e-16550.74%PREDICTED: similar to CG9699 CG9699-PA [Tribolium castaneum]
NCBI nr blastpgi|2700124737e-16450.74%hypothetical protein TcasGA2_TC006628 [Tribolium castaneum]
NCBI nr blastxgi|1892403922e-16351.06%PREDICTED: similar to CG9699 CG9699-PA [Tribolium castaneum]
Group
Gene OntologyGO:00055158.1e-05protein binding
KEGG pathway 
InterPro domain[161-469] IPR0218323.7e-79Protein of unknown function DUF3424
[13-100] IPR0206836.9e-16Ankyrin repeat-containing domain
Orthology groupMCL10845 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206360-TA
ATGACTCTGACGTCTGAAGAAGTATCCAACCGATATCCAATCCATTGGCTTGTTTGGAATAACAATCATAATGAGCTGAAAAATGCTTTGGAAGCCAATAAGTTTACGTCAGAAGATATAGAGAGAAAGGATCCTCGGGGCCGAACACCATTATTACTGGCCGTCACTCTAGGACACATTGAGTGTGTGCAGGCATTGATTAATGTTGACGCTGATGTCAACTGTGAGAAGGACGGATGGACAGCGGTCCAAGAAGCAACGGCAACTGGCAACCCTGAGCTTCTATCATTGGTGCTGGGACGACGGGACTATCAGAGGCATGTTGTCCGCTCTAGTGGCATCCCTGACCTACTGAATAAGTTGAGCCTCGCACCAGACTTCTATGTGGAGATGAAATGGGAGTTCACCAGCTGGGTGCCGCTCCACAGATCCTACACGGGAAAACTTACATACAAGGTGTACAAACGCGGTGCTAACGTGCGCGTCGACACGACGCTGGTGGGCTTCGAGAACAACAAGTGGCAGAGAGGGGATAGGACGTACATATTTAGGGGACAGGGTCGCAGCGCAAGTCTGGTGGAGCTCGATCACGAGGCTGGCACGTCGTGGTGTGAGTACCTGGAGGGAGGGGGCTCGCGCGTGGGAGGGGGACAGGTGCCCGCGCCGCACGTGGTGGAGCAGAGACTCGCCGTGCCCATAGCCATCAACTACCTCGATACAGACAAGATCAGCTTTGAGAGAAATAAAAGCGGTATATGGGGCTGGAGGCAGGACAAGAGCGAGACGGTGAACGGCTACGAGTGCAAGGTGTTCAGCGCGAACAACGTGGAGCTGGTGTCCAAGACCAGGGCGGAGCACCTGCCGCGGGGGGAGCGCCGCCCTCACGCCTCGCGGCCCCCGCTGGCAGGGCTGCTGGCGCTCGCCGACGACGACCACTCCACCACCACCAGCCCGCTACTCACACCTGATAACGAGGACACGCCCCGTCGGAGCAGGTCCCGCGAGGAGCTCCGCCCCGTCAGCTGGGAGGAGTACTTCGGCGAGGGAGACGAGAGGGAGCTGCGCCCCCGGGACGTCACCACCAAGGTGCAGAAGTTCAGGGCGACGCTCTGGCTCTGTGAGGACTATCCGTTACAGCTCCAGGAGCAGATAATGCCGATCCTGGACCTGATGGCTGCCATCTCCTCGCCTCACTTCGCCAAGCTGAAGGACTTTGTGCAGATGCAGCTGCCGGCCGGATTCCCGGTCAAGATCGAGATACCTCTGTTCCACGTTTTGAACGCTCGCATCACCTTCGGCAACATCTTCGCAACGGAGACTCCCGTGGAACACGTGGAGTGTATCCAGGAGGGGGCGCGCCTTTCGTGTGTGCTGGACGACAGGTGCTTCAGCGTGGGACGCGGGTACAGCGACGCCGCCGCCCCCGACGACCACCGAGACGACGAGGGCCTGCTGCGGTACGCCATACAACAGAGCCTCATGGACGCCGGCACCCACGACGAACAGGTGGATGTATGGGAGGCTCTCCGCGGCGCGCGCTGCACGTCCCCGGTGGCGCCCCGGCCTCGCTCGCCGCCCCCGCTACTGGCACACGAGGATCACTACTTACAGAGAGCGATAGAGGCGTCGCTGTCCGGACTCACTGTGCACAGTCCGGGGCTGGAGGACGGCGAGGAGGAAGACGGTGAGGAGCTGGACGAAGACCTGCGAGCGGCCTTACAGGTGTCCGCTCAGGAACACGCGCACAGGGACCAGCTGCTCAGGGACGAGCAGAGGCAGCTGGACGAGGCGCTCAGACTGTCGCTCCTGGACAAGTGCGTTAAGTTAGGTGCAGCCGTTCAGTGTAAACTCAGCTTTATCCTAAGAGTGAAAGAGAGATCGCAGTATTCGTCTCCCGCTCTCATTTTAGCCCGATTGTTAGGTTACTACGAGTTGACAACTCCTGGACCGTCATCAGAGCGAGTTGTAGTACTCGAGGCGTTCGATCGACTTCCAGAAGTTATCGACTACGACTGTCTATGA

Protein sequence:

>DPOGS206360-PA
MTLTSEEVSNRYPIHWLVWNNNHNELKNALEANKFTSEDIERKDPRGRTPLLLAVTLGHIECVQALINVDADVNCEKDGWTAVQEATATGNPELLSLVLGRRDYQRHVVRSSGIPDLLNKLSLAPDFYVEMKWEFTSWVPLHRSYTGKLTYKVYKRGANVRVDTTLVGFENNKWQRGDRTYIFRGQGRSASLVELDHEAGTSWCEYLEGGGSRVGGGQVPAPHVVEQRLAVPIAINYLDTDKISFERNKSGIWGWRQDKSETVNGYECKVFSANNVELVSKTRAEHLPRGERRPHASRPPLAGLLALADDDHSTTTSPLLTPDNEDTPRRSRSREELRPVSWEEYFGEGDERELRPRDVTTKVQKFRATLWLCEDYPLQLQEQIMPILDLMAAISSPHFAKLKDFVQMQLPAGFPVKIEIPLFHVLNARITFGNIFATETPVEHVECIQEGARLSCVLDDRCFSVGRGYSDAAAPDDHRDDEGLLRYAIQQSLMDAGTHDEQVDVWEALRGARCTSPVAPRPRSPPPLLAHEDHYLQRAIEASLSGLTVHSPGLEDGEEEDGEELDEDLRAALQVSAQEHAHRDQLLRDEQRQLDEALRLSLLDKCVKLGAAVQCKLSFILRVKERSQYSSPALILARLLGYYELTTPGPSSERVVVLEAFDRLPEVIDYDCL-