Monarch geneset OGS2.0

DPOGS211041
TranscriptDPOGS211041-TA1107 bp
ProteinDPOGS211041-PA368 aa
Genomic positionDPSCF300202 - 251556-254885
RNAseq coverage2359x (Rank: top 5%)
Annotation
HeliconiusHMEL0043294e-14574.46% 
BombyxBGIBMGA003755-TA1e-16374.80% 
DrosophilaArr1-PA2e-12961.39% 
EBI UniRef50UniRef50_P153723e-12761.39%Phosrestin-2 n=26 Tax=Endopterygota RepID=ARRA_DROME
NCBI RefSeqXP_001663732.14e-15166.93%phosrestin ii (arrestin a) (arrestin 1) [Aedes aegypti]
NCBI nr blastpgi|17034155e-17379.27%arrestin homolog [Heliothis virescens, antennae, Peptide, 381 aa]
NCBI nr blastxgi|17034154e-17479.27%arrestin homolog [Heliothis virescens, antennae, Peptide, 381 aa]
Group
Gene OntologyGO:00071657.9e-204signal transduction
KEGG pathway 
InterPro domain[1-368] IPR0006987.9e-204Arrestin
[166-361] IPR0147522.7e-69Arrestin, C-terminal
[4-161] IPR0147566.9e-64Immunoglobulin E-set
[6-161] IPR0147531.4e-62Arrestin, N-terminal
[180-336] IPR0110221.1e-28Arrestin-like, C-terminal
[18-156] IPR0110216.1e-24Arrestin-like, N-terminal
Orthology groupMCL16437 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211041-TA
ATGGTGTATAATTTTAAAGTGTTCAAGAAATGTTCTCCCAACGGAAAGCTTACCTTGTATATGGCGAAGCGGGACTTCGTGGATCATATATCGTATGTAGAACCGATAGACGGCATCGTGGTCGTGGATGAGGAGTACGTTCGAGATCGTCGCGTGTTCGCTCAGGTGGTGTGCACCTTCAGGTACGGTCGCGAGGAGGACGAGGTGATGGGCTTGTCCTTCTATAAGGAACTGTATCTCGCCTCTGAACAAGTCTATCCACCGCTTCAGAAACGTCCTTACGAACTCACACGAACACAGGAGCGCTTAGTGAGGAAGCTGGGTCAGTGGGCGCTGCCATTCCGCTTAACCCTCCCGGCGGGTTCGCCTGGATCCGTAACGCTACAGCCAGGACTGGAGGAAGAAGGAGAACCCTGCGGAGTTCACTACTACGTAAAGTTGAGCACTGTGGCGCTCGGTATCCGCAAAGTTCAGTTTGCTCCGGATAAGCCCGGGCCACAGCCCTGCACCGTCGTCCGAAAGGACTTCGTGCTGTCCCCGGGGCAACTTGAATTGGAGCTTACTCTAGATAAACAGCTTTATATTCACGGGGAGACAGTTGCAGTGAACATAAGTATAAGGAACCACAGCAACAAAGTGGTGAAGAAGATTAAGGCGAGTATCCTGCAATCTGTAGATATCGTCCTGTTTCAGAATGGCCAGTATAGGAATGTTGTCACAGGAATTGAGACACAGGATGGTTGTCCACTGCAACCAGGAGCCAACATGCAGAAAGTAGTCCAGCTCCGTCCTACTCTGGGAGCTCTGCGTGACCGCCGTGGACTCGCGCTTGACGCTCAGCTCAAAAGACAGGAGACCACGCTCGCCTCCACCACGCTTCTGTTGGACCCCGAGCAGCGTGATGCGTTCGGGATCGTGGTCAGCTACAGCGTCAAGGTCAAACTGTACCTCGGAGCGCTCGGCGGAGAGCTCAGCGCCGAACTGCCCTTCATACTTATGCATCCGAAGGAAGGTCGTACTAAACTAATCCAGGCAGACAGCGAGGCAGATGTAGAAATGTTTAGACAGGACACAGTCATGCATCAGGAGAGTGTCGAGGTTTACTAA

Protein sequence:

>DPOGS211041-PA
MVYNFKVFKKCSPNGKLTLYMAKRDFVDHISYVEPIDGIVVVDEEYVRDRRVFAQVVCTFRYGREEDEVMGLSFYKELYLASEQVYPPLQKRPYELTRTQERLVRKLGQWALPFRLTLPAGSPGSVTLQPGLEEEGEPCGVHYYVKLSTVALGIRKVQFAPDKPGPQPCTVVRKDFVLSPGQLELELTLDKQLYIHGETVAVNISIRNHSNKVVKKIKASILQSVDIVLFQNGQYRNVVTGIETQDGCPLQPGANMQKVVQLRPTLGALRDRRGLALDAQLKRQETTLASTTLLLDPEQRDAFGIVVSYSVKVKLYLGALGGELSAELPFILMHPKEGRTKLIQADSEADVEMFRQDTVMHQESVEVY-