Monarch geneset OGS2.0

DPOGS210361
TranscriptDPOGS210361-TA1104 bp
ProteinDPOGS210361-PA367 aa
Genomic positionDPSCF300025 + 445379-447185
RNAseq coverage61x (Rank: top 68%)
Annotation
HeliconiusHMEL0138240.089.92% 
BombyxBGIBMGA011918-TA0.086.41% 
DrosophilaCG12796-PA6e-7042.76% 
EBI UniRef50UniRef50_Q16RJ59e-8446.15%G-protein coupled receptor n=1 Tax=Aedes aegypti RepID=Q16RJ5_AEDAE
NCBI RefSeqXP_001655046.12e-8446.15%g-protein coupled receptor [Aedes aegypti]
NCBI nr blastpgi|1571675683e-8346.15%g-protein coupled receptor [Aedes aegypti]
NCBI nr blastxgi|1571675681e-8247.97%g-protein coupled receptor [Aedes aegypti]
Group
Gene OntologyGO:00071863.9e-44G-protein coupled receptor protein signaling pathway
GO:00160213.9e-44integral to membrane
KEGG pathwaydre:1000016367e-26 
 K05840 (DRD5)maps-> Neuroactive ligand-receptor interaction
    Calcium signaling pathway
InterPro domain[7-252] IPR0002763.9e-44GPCR, rhodopsin-like, 7TM
Orthology groupMCL16341 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210361-TA
ATGGTGCTCATCATCAGCGGCAACACGCTCACCGTCCTCGCCGTCACCATGAGCCGCCGCCTCTCCTCGCTCGTCTCCAACCAGTTCGTGCTGAACCTGGCCATATCCGACCTGATGGTAGGCCTCACCCTGCCCTACCATCTGGTGTTCTACCTCGACGACGATTTCGGCAAGATCAAGTGGTCGTGCTTGATGAGGTTCATACTCATAATACTCGCGTGCCTCGCCTCCATCTACAACATCATCGCCATCGCTGTAGACAGATACATCGCCATCGTGCACCCCCTGCACTACAGCCGGTACATGACCAAGTTGGTGACTCGTCTCTTGATGAGCACCACCTGGACGGTCGCCGTATGCATCAGCTGTATCCCCATGTTTTGGAACGATTGGCACAGCGGCGTCAGTTGCGAAATGAATGTCGTGGTACCTAAGGAGTACACAACGAGCATCCTGGCACCCATGTTCTCTCTCATCTGGATGGTGATGTTCGTGCTGTACTGGAGGATATGGCGAGAGGCGACCTGTCACGCGAGGAGGATGAGAGCCAACACCTGCTGCCCCTCCAGCGCTAATGACTGGAAGAGCATTCAGGTTGTACTCCTGGTCCTCGGTTCCTTCTCTATCTGCTGGATGCCATTTGTCGTGGTGTCGTGCGCACAGACGTTACCCATCGTGGGGCTCCACAACCCAATAGTATACCGTCTGACTTCATCTCTCGCCATGTCTAACTCAGGAATCAATCCCATCATATACGCGTGGAAGAACGCCGGATTCCGAGCAGCATTTTCAAAGTTATTGCGTTGCAAGCGTCCTGACACCTCCGAATACAGGGGCTCCCCGGCCCCAGAAAGGAAGAGGGGTTCGGTGGCCCTCCGTGAGGGTTCGATCACTCGATCCACACCCGGCGGAGTGTCGACCACAGGGGGGGGAAGACCGGCTCGGCTGATTTATGTCGAAAGTGAGAGCGACACGGCTCGATGTCGCATCATCGAGAACGCCGGGTACATAGACTCGGAGTGCGGGGACAGCAACCCCTCATACGCACCGGACGGGCCCACGCCCGTCCACAAAGCGGGCGCACGGCCGCCGGACGTGGTGTAG

Protein sequence:

>DPOGS210361-PA
MVLIISGNTLTVLAVTMSRRLSSLVSNQFVLNLAISDLMVGLTLPYHLVFYLDDDFGKIKWSCLMRFILIILACLASIYNIIAIAVDRYIAIVHPLHYSRYMTKLVTRLLMSTTWTVAVCISCIPMFWNDWHSGVSCEMNVVVPKEYTTSILAPMFSLIWMVMFVLYWRIWREATCHARRMRANTCCPSSANDWKSIQVVLLVLGSFSICWMPFVVVSCAQTLPIVGLHNPIVYRLTSSLAMSNSGINPIIYAWKNAGFRAAFSKLLRCKRPDTSEYRGSPAPERKRGSVALREGSITRSTPGGVSTTGGGRPARLIYVESESDTARCRIIENAGYIDSECGDSNPSYAPDGPTPVHKAGARPPDVV-