Monarch geneset OGS2.0

DPOGS208824
TranscriptDPOGS208824-TA951 bp
ProteinDPOGS208824-PA316 aa
Genomic positionDPSCF300036 + 540926-542792
RNAseq coverage53x (Rank: top 70%)
Annotation
HeliconiusHMEL0064002e-4653.37% 
BombyxBGIBMGA007930-TA4e-10462.01% 
Drosophila5-HT7-PA9e-3328.81% 
EBI UniRef50UniRef50_C3Y3T36e-3732.75%Putative uncharacterized protein n=1 Tax=Branchiostoma floridae RepID=C3Y3T3_BRAFL
NCBI RefSeqXP_001866063.12e-3532.37%5-hydroxytryptamine receptor 1 [Culex quinquefasciatus]
NCBI nr blastpgi|2608279952e-3632.75%hypothetical protein BRAFLDRAFT_85476 [Branchiostoma floridae]
NCBI nr blastxgi|2608279955e-3832.55%hypothetical protein BRAFLDRAFT_85476 [Branchiostoma floridae]
Group
Gene OntologyGO:00071867.3e-51G-protein coupled receptor protein signaling pathway
GO:00160217.3e-51integral to membrane
KEGG pathwaybfo:BRAFLDRAFT_854763e-37 
 K04136 (ADRA1B)maps-> Salivary secretion
    Vascular smooth muscle contraction
    Neuroactive ligand-receptor interaction
    Calcium signaling pathway
InterPro domain[11-287] IPR0002767.3e-51GPCR, rhodopsin-like, 7TM
Orthology groupMCL25406 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208824-TA
ATGGTGATTATCACTGTCGCCACGGTGGTAGGAAACAGTGCCGTGATGGTAGCTTTATGGCGCGTGAGGCGAGCTCCTTCACATTACCCGTTAATAAGTTTGGTAACCGCTGATTTGCTCGTCGGCCTCCTCGTCCAACCGCTGGCTGCTCTACGAGAGCTTTATGTATTTAATCTGAGTGATTTTCCATCAGATCGTATTATCTGCTCATGTTGGAGTATAATGGATGTGCTATGCTGTACTGCTTCAATACTGTCCCTCTGCGTACTTGGCTGGGAGCGATGGTCGGGTATAACGAATCCTCTAGCAAGAGCCAGAAGAGCGAAAAGAGCGAAGTTGGTCGCAGGTCTCGTCTGGCCGCTCGCTGCTGTGATAGCAATACCTAATGCCCTTGTAAGATCTCATAAGCACTTCCATCCAGGAGAGTTAGAAAAGGCCTGCGAGGTTAACACTAATACTGGATATGTTTTCTTCAGTGTAACTCTATCGTTTTATTTGCCGGCGGTCGTGATGCTGATGATGTACTGTTTTATCTTACGCGCTCTGTCCGCTCCGCCACCCGTGCGTGCACATCGCGGCCGTCCTTCAAATAACAGCCGAGGAAAAGCCATGAGCTCAGAGAGCACCACTCAACCAGGACCCAGTACTGATAATAATCAAAACACTGTTGCTAGTTTCATCAGTCGCCAGCGTCGCGCTACACGCATCATCGTGATGTTGATGACGCTGTTCTTCGTCTGCTGGACGCCTTATTTCGTGATGCTGCCATTAGATTCCCTCTACGACTGTGTGTACGACAGCGGCTGGTTGTGGTGTACCTGGCTGGGATACATCAATTCCTCCCTGAACCCTCTAGTGTACGCGGTATCGTCTCCTAGCGTGAGGAGTGCGCTGCGCTCGTCACTAACAAGCAGTGGAAGGAACGACATGGGTTTAACAAGGCGGAATTAA

Protein sequence:

>DPOGS208824-PA
MVIITVATVVGNSAVMVALWRVRRAPSHYPLISLVTADLLVGLLVQPLAALRELYVFNLSDFPSDRIICSCWSIMDVLCCTASILSLCVLGWERWSGITNPLARARRAKRAKLVAGLVWPLAAVIAIPNALVRSHKHFHPGELEKACEVNTNTGYVFFSVTLSFYLPAVVMLMMYCFILRALSAPPPVRAHRGRPSNNSRGKAMSSESTTQPGPSTDNNQNTVASFISRQRRATRIIVMLMTLFFVCWTPYFVMLPLDSLYDCVYDSGWLWCTWLGYINSSLNPLVYAVSSPSVRSALRSSLTSSGRNDMGLTRRN-