Monarch geneset OGS2.0

DPOGS211416
TranscriptDPOGS211416-TA1224 bp
ProteinDPOGS211416-PA407 aa
Genomic positionDPSCF300115 + 356130-357737
RNAseq coverage112348x (Rank: top 0%)
Annotation
HeliconiusHMEL0080840.097.05% 
BombyxBGIBMGA010897-TA0.091.89% 
DrosophilaArr2-PA2e-17670.96% 
EBI UniRef50UniRef50_G9JLB40.095.09%Arrestin 2 n=17 Tax=Pancrustacea RepID=G9JLB4_9NEOP
NCBI RefSeqXP_316327.20.076.19%arrestin, Arr2-like (AGAP006263-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3597193300.095.09%arrestin 2 [Maruca vitrata]
NCBI nr blastxgi|3597193300.095.09%arrestin 2 [Maruca vitrata]
Group
Gene OntologyGO:00071653e-265signal transduction
KEGG pathwayaga:AgaP_AGAP0062630.0 
 K13805 (ARR2)maps-> Phototransduction - fly
InterPro domain[1-405] IPR0006983e-265Arrestin
[14-181] IPR0147532.9e-77Arrestin, N-terminal
[12-181] IPR0147561.4e-71Immunoglobulin E-set
[189-405] IPR0147521.3e-66Arrestin, C-terminal
[25-179] IPR0110212.5e-31Arrestin-like, N-terminal
[200-356] IPR0110226.6e-21Arrestin-like, C-terminal
Orthology groupMCL15644 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211416-TA
ATGCTTGACTTCCTATCCTTAGCTTTCGTGGTGGCGGTGAAGGTTTTCAAGAAAACCACACCCAATGGAAAGGTTACGGTTTACCTGGGCAAACGGGACTTCATTGATCACGTTGACTACTGCGATCCTGTCGATGGAGTCGTAGTGGTTGACACCGAGTACCTGAAAGGACGAAAAGTTTACAGCCAGCTGGTCACGACCTACCGATTCGGTCGCGAAGAAGATGAGGTCATGGGAGTCAAATTCTCGAAGGAACTGGTCATCGGCCAGGACCAAGTGGTACCAATGGTCAACGCGAAAATGGAACTGACACCTGTCCAAGAAAAGCTCCTGAAAAAGCTCGGCCCAAATGCCTTCCCATTCACATTTACCTTCCCCGAAATGTCGCCCAGCTCGGTCACTCTGCAACCATCTGATGAGGATCAGGGCAAACCCATGGGTGTGGACTACTGCGTGCGAACCTACGTAGCTGACAACGAGGATGACAAGGGCCACAAAAGGAGCTCCGTTACCCTTGCTATCAAGAAGCTGCAACATGCCCCAGCTTCTCGCGGACGACGCCTACCTAGCTCCCTTGTCAGCAAGGGCTTCACTTTCAGCAACGGCAAGATCAGTTTGGAAGTGACCCTCGACAAGGAAATCTACTATCATGGAGAGAAGGTTGCCGCCAACATCATCGTTTCCAACAACTCCAGGAAATCCGTTCGCAACATCCGCTGCATGGTTGTACAGCATGTTGAGATTACCATGATCAACTCTCAATTCAGCCGCCATGTTGCATCTCTGGAAAGCCGCGAGGGTTGCCCAGTAACACCCGGAGCTAGCCTGTCTAAGACCTTCTACTTGGTGCCTCTGGCTCGCAGCAACAAGGATATTCGAGGCGTCGCCCTGGACGGCCACCTTAAGGAGGATGACGTCAACCTCGCAAGCTCTACCCTGGTGTCGGAGGGCAAGTGCCCAGCTGATGCTATTGGTATCGTGGTATCTTACTCCGTACGAGTGAAGCTGAACTGCGGAACTCTGGGAGGCGAGCTTGTTACGGACGTGCCATTCAAACTGCTGCATCCTGCTGAGGGAAGCGTAGAACGCCAACGTTTCAACGCAATGAAGAAGATGCAATCCATTGAGCGTCACCGCTACGAAAATTCTCTGTATGCCAACGAGGAGGAAGACAACATCGTTTTTGAGGACTTCGCCCGCCTTAGGATGAACGAACCGGAATAA

Protein sequence:

>DPOGS211416-PA
MLDFLSLAFVVAVKVFKKTTPNGKVTVYLGKRDFIDHVDYCDPVDGVVVVDTEYLKGRKVYSQLVTTYRFGREEDEVMGVKFSKELVIGQDQVVPMVNAKMELTPVQEKLLKKLGPNAFPFTFTFPEMSPSSVTLQPSDEDQGKPMGVDYCVRTYVADNEDDKGHKRSSVTLAIKKLQHAPASRGRRLPSSLVSKGFTFSNGKISLEVTLDKEIYYHGEKVAANIIVSNNSRKSVRNIRCMVVQHVEITMINSQFSRHVASLESREGCPVTPGASLSKTFYLVPLARSNKDIRGVALDGHLKEDDVNLASSTLVSEGKCPADAIGIVVSYSVRVKLNCGTLGGELVTDVPFKLLHPAEGSVERQRFNAMKKMQSIERHRYENSLYANEEEDNIVFEDFARLRMNEPE-