Monarch geneset OGS2.0

DPOGS201383
Transcript: DPOGS201383-TA; 1164 bp
Protein: DPOGS201383-PA; 387 aa
Genomic position: DPSCF300083 + 163632-168375
RNAseq coverage: 20x (Rank: top 79%)
Annotation
Heliconius: HMEL025046; 0.0; 92.60%
Bombyx: BGIBMGA000697-TA; 0.0; 92.92%
Drosophila: DopEcR-PB; 4e-132; 70.74%
EBI UniRef50: UniRef50_Q8MR64; 1e-128; 69.77%; GH08370p n=30 Tax=Pancrustacea RepID=Q8MR64_DROME
NCBI RefSeq: XP_315694.1; 9e-133; 72.61%; putative GPCR class a orphan receptor 21 (AGAP005681-PA) [Anopheles gambiae str. PEST]
NCBI nr blastp: gi|31213501; 2e-131; 72.61%; AGAP005681-PA [Anopheles gambiae str. PEST]
NCBI nr blastx: gi|31213501; 2e-135; 72.61%; AGAP005681-PA [Anopheles gambiae str. PEST]
Group:
Gene Ontology: GO:0007186; 4.3e-44; G-protein coupled receptor protein signaling pathway
  GO:0016021; 4.3e-44; integral to membrane
KEGG pathway:
InterPro domain: [96-344] IPR000276; 4.3e-44; GPCR, rhodopsin-like, 7TM
Orthology group: MCL15308; Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201383-TA
ATGACAAAAGTAAACTGGAAATACAAGTTCACTGAAAGCGATGGGATTTATTATACAGTTTTCATCGCTTTAGTATTGGAAATTTTAATAAAACTCCAACGACTCAGTCTTGTTTTCAGGAAGTCATATCATATCTACAGGGCAAGTATGCCTTCTCGAGTCATCAGCGCCCGCGGAGCGGCAAGCGGTGGTGAGGTGAGCGTAGCGGATAGTCCGTTTGCTACCTTCGAGTCACTGACCCAGGCTGCAGTCATTGCCGTCATCGGAATCGCTATTGTTGTATCCAACCTTCTTATCATAGCGTCCTTTCTTAATTTTAAAGGACTCTCCAACGAAGTCATAAATTATTATCTACTATCTTTGGCTGTAGCTGATCTATTATGTGGACTATTCGTGGTACCGCTTTCTGTGTACCCAGCAATAACTGGTCGATGGATGTTTGGAGACCTGATGTGTCGCTTGGCGGGTTACGTGGAAGTCACACTGTGGTCCGTGTCCGTGTATACGTTCATGTGGATCTCAGTGGACCGGTACCTGGCCGTGCGGAAACCTCTCCGATATGAGACGGTACAAACTCGCACACGTAGCCAATGTTGGATGGTGTTCACCTGGATATCAGCTGCGATGCTCTGTTGTCCTCCGTTACTCGGATACAAAAAGGACGCTAATTTTGATAAGGAAACTCTGATATGCATGCTCGACTGGGGTACAACTTACGCTTATACCGCCACTTTGAGTATTTTGGTGTTGGGTCCGAGCGTTATATCTATAGTATACAACTACTTCTACATATTCTCAATGAAGAGGAAACTGCACAGCGGAGCGCCGATTCACGACAAGGAGTATGCAACTGCTTTGGCTGAAAACCTAGCCAATCCTAGCCATTGGATGAGTTTCATTCTCGTCTCTGTATTTTGGCTTAGCTGGGCTCCCTACGCCGGTGTGAGGTTTTATGAGTACTTCACCGATCAGGAGCTTAAAATTCCGATGTTACATTTTGGCGTAGTGTGGCTAGGCATTCTTAATTCGTTTTGGAAAATTATAATTCTGTTATCCTTAAGCCCTCAGTTTCGACTGGCTTTAAGAATATTATGCTTGACTGCATGCTGTCGTACCAAGGGACGCCTGCAAGCCGAGCTGATTGGCATGGACAATGATGATTAG

Protein sequence:

>DPOGS201383-PA
MTKVNWKYKFTESDGIYYTVFIALVLEILIKLQRLSLVFRKSYHIYRASMPSRVISARGAASGGEVSVADSPFATFESLTQAAVIAVIGIAIVVSNLLIIASFLNFKGLSNEVINYYLLSLAVADLLCGLFVVPLSVYPAITGRWMFGDLMCRLAGYVEVTLWSVSVYTFMWISVDRYLAVRKPLRYETVQTRTRSQCWMVFTWISAAMLCCPPLLGYKKDANFDKETLICMLDWGTTYAYTATLSILVLGPSVISIVYNYFYIFSMKRKLHSGAPIHDKEYATALAENLANPSHWMSFILVSVFWLSWAPYAGVRFYEYFTDQELKIPMLHFGVVWLGILNSFWKIIILLSLSPQFRLALRILCLTACCRTKGRLQAELIGMDNDD-
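
The relationship between the two sequences above can be checked with a short script. The following is a minimal sketch, not part of the original record. It assumes the standard genetic code (NCBI translation table 1) and assumes that the trailing "-" in the protein sequence stands for the stop codon. The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


# First 24 and last 18 nucleotides of DPOGS201383-TA, copied from the record.
print(translate("ATGACAAAAGTAAACTGGAAATAC"))  # MTKVNWKY
print(translate("ATGGACAATGATGATTAG"))        # MDNDD-

# Length check: 387 residues plus one stop codon should account for 1164 bp.
print(3 * (387 + 1) == 1164)                   # True
```

Both fragments agree with the start (MTKVNWKY) and end (MDNDD-) of DPOGS201383-PA, and the lengths are consistent with a 387-residue protein followed by a single stop codon.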