Monarch geneset OGS2.0

DPOGS215730
Transcript: DPOGS215730-TA (1569 bp)
Protein: DPOGS215730-PA (522 aa)
Genomic position: DPSCF300041 + 448617-459505
RNAseq coverage: 490x (Rank: top 25%)
Annotation
Heliconius: HMEL009658, 4e-170, 77.02%
Bombyx: BGIBMGA003601-TA, 0.0, 90.29%
Drosophila: krz-PB, 6e-177, 69.48%
EBI UniRef50: UniRef50_Q16ID9, 2e-160, 64.48%, Beta-arrestin 1, putative n=2 Tax=Culicinae RepID=Q16ID9_AEDAE
NCBI RefSeq: XP_972556.2, 0.0, 80.31%, PREDICTED: similar to beta-arrestin 1 [Tribolium castaneum]
NCBI nr blastp: gi|189233648, 0.0, 80.31%, PREDICTED: similar to beta-arrestin 1 [Tribolium castaneum]
NCBI nr blastx: gi|189233648, 0.0, 80.31%, PREDICTED: similar to beta-arrestin 1 [Tribolium castaneum]
Group
Gene Ontology: GO:0007165, 1.9e-267, signal transduction
KEGG pathway: tca:661293, 0.0
    K04439 (ARRB) maps-> Phototransduction
    Chemokine signaling pathway
    Endocytosis
    MAPK signaling pathway
InterPro domain:
    [94-519] IPR000698, 1.9e-267, Arrestin
    [94-264] IPR014753, 3.1e-85, Arrestin, N-terminal
    [93-263] IPR014756, 2.2e-79, Immunoglobulin E-set
    [500-520] IPR014752, 6.5e-47, Arrestin, C-terminal
    [106-261] IPR011021, 1.3e-32, Arrestin-like, N-terminal
    [282-463] IPR011022, 1.1e-26, Arrestin-like, C-terminal
Orthology group: MCL10905, Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215730-TA
ATGACGTCCGACACACATCACATTAGTACGTTCAGTGATGAAGTCCAGGCAGCGTCGATGAGAGGCCTTCGCGTTACAGGCCGGTGTTCCAATATCCTCGTTCCTCGATTCACAGTAACCATTACAAATTCTGATGTCTATGCGTCACGCAGCAAGCTGTCAGACCGCATGTTGTTTTCATGTCCATCTCACGATACACGTATTTATTACTCAGTTGAGTGGTGCGGGGGGCGAGGCGGAACTATGGACGACGGAGGCAGCAACAAGCAGCGTCAGGCCACCAGGGTCTTCAAAAAGAGCTCACCAAATGGAAAGATCACAGTGTATTTAGGGAAGAGAGACTTCGTCGATCACATCACACACGTAGATCCTATTGATGGCGTGGTGCTGATAGATCCGGAGTACGTGAAGGATCGGAAGGTGTTCGGCCATGTCTTGGCGGCTTTCCGCTACGGCAGAGAGGACCTGGACGTGCTGGGGCTCACCTTCAGGAAGGACCTGTACCTCGCCGCGGAACAGATATATCCGCCCACGAGCAGCCCGAAGCGTCCCCTGACCCGTCTTCAGGAGCGTCTGGTCCGCAAGCTGGGTCCCGCGGCACATCCGTTCTACTTCGAGCTGCCGCCTCACTGTCCCGCCTCGGTCACGCTCCAGCCGGCGCCCGGTGACACCGGCAAGCCATGCGGCGTGGACTACGAGCTGAAGGCCTTCGTGGCGGACTCCCAGGACGACAAGCCTCACAAAAGAAATTCAGTCCGCCTCGCCATCCGCAAGATAATGTACGCGCCGAGCAAGCAGGGCGAGCAGCCGTCTGTGGAGGTGTCCAAGGAGTTCATGATGAGCCCCAACAAACTGTACCTGGAAGCGTCGCTGGATAAGGAGCTGTATCATCACGGTGAGAACATAGCGGTGAACGTCCACATAGCGAACAACTCCAACCGCTCAGTGAAGAGGATCAAGGTCTCCGTCAGACAGTTCGCTGACATCTGTCTCTTCTCCACGGCTCAGTACAAATGCACCGTCGCTGAGGCGGAGAGCGAGGCAGTGGCTGTGTGGAAGAGAAACAAATTTCCATTTTTCGGATCATACAATAAGAAATTTAAAAAGTCGGTAGAAGGCTGTCCCGTGGGTCCAGGCTTCACCCTCAGCAAGGTGTTCACGCTGACTCCACTGCTCGCTAACAATAAGGACAAATGGGGCCTCGCTTTGGACGGACAACTCAAGCATGAGGACACCAACCTGGCGTCCAGCACACTAATCGCGGATCCTTCGCAGCGTGAAAATTTAGGCATCATAGTACAGTACAAAGTAAAGGTCAAACTGTGCCTCGGTCCACTCGGCGGCGAGCTGAGTGCGGAGCTGCCGTTCATCCTGATGCATCCCAAGCCGGAGGAGGAGCCTCGCGCCGCGCCCGACCCGCCGCGCGCCGACCCTGACCTCATACAACTAGACCCCAACCCTGACGAACACGCAGCTCACAGAGACGAGAACGGCCAGGAGCAAGACGACGATATAATATTCGAAGACTTCGCTCGACTGCGGCTGAAGGGAGCGGACGCGGACGCCTGA

Protein sequence:

>DPOGS215730-PA
MTSDTHHISTFSDEVQAASMRGLRVTGRCSNILVPRFTVTITNSDVYASRSKLSDRMLFSCPSHDTRIYYSVEWCGGRGGTMDDGGSNKQRQATRVFKKSSPNGKITVYLGKRDFVDHITHVDPIDGVVLIDPEYVKDRKVFGHVLAAFRYGREDLDVLGLTFRKDLYLAAEQIYPPTSSPKRPLTRLQERLVRKLGPAAHPFYFELPPHCPASVTLQPAPGDTGKPCGVDYELKAFVADSQDDKPHKRNSVRLAIRKIMYAPSKQGEQPSVEVSKEFMMSPNKLYLEASLDKELYHHGENIAVNVHIANNSNRSVKRIKVSVRQFADICLFSTAQYKCTVAEAESEAVAVWKRNKFPFFGSYNKKFKKSVEGCPVGPGFTLSKVFTLTPLLANNKDKWGLALDGQLKHEDTNLASSTLIADPSQRENLGIIVQYKVKVKLCLGPLGGELSAELPFILMHPKPEEEPRAAPDPPRADPDLIQLDPNPDEHAAHRDENGQEQDDDIIFEDFARLRLKGADADA-
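
Length consistency check:

The record lists the transcript at 1569 bp and the protein at 522 aa. A minimal sketch of how those two figures relate is given below. It assumes the transcript is entirely coding sequence (it begins with ATG and ends with the stop codon TGA, with the stop shown as "-" in the protein). The function names are hypothetical and are not part of any database tooling.

```python
# Stop codons of the standard genetic code.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def ends_with_stop(cds: str) -> bool:
    """Return True if the last three nucleotides of cds form a stop codon."""
    return cds[-3:].upper() in STOP_CODONS


def expected_protein_length(cds_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a coding sequence of cds_bp nucleotides.

    Assumes the whole sequence is in frame; the stop codon, if present,
    does not contribute an amino acid.
    """
    if cds_bp % 3 != 0:
        raise ValueError("coding sequence length is not a multiple of 3")
    codons = cds_bp // 3
    return codons - 1 if has_stop_codon else codons


# 1569 bp / 3 = 523 codons; one is the stop codon, leaving 522 residues.
print(expected_protein_length(1569))  # 522
print(ends_with_stop("GACGCCTGA"))    # True (the last 9 nt of DPOGS215730-TA)
```

Under these assumptions the two lengths in the record agree: 523 codons, of which 522 code for amino acids.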