Monarch geneset OGS2.0

DPOGS208077
TranscriptDPOGS208077-TA1032 bp
ProteinDPOGS208077-PA343 aa
Genomic positionDPSCF300282 + 59300-60961
RNAseq coverage115655x (Rank: top 0%)
Annotation
HeliconiusHMEL0033446e-18087.46% 
BombyxBGIBMGA007786-TA3e-17382.85% 
DrosophilaRh6-PB5e-12159.18% 
EBI UniRef50UniRef50_Q172969e-15679.44%Rhodopsin n=520 Tax=Pancrustacea RepID=OPSD_CATBO
NCBI RefSeqNP_001036882.14e-16779.59%ceropsin [Bombyx mori]
NCBI nr blastpgi|515741120.0100.00%long wavelength opsin [Danaus plexippus]
NCBI nr blastxgi|515741120.0100.00%long wavelength opsin [Danaus plexippus]
Group
Gene OntologyGO:00071868.5e-53G-protein coupled receptor protein signaling pathway
GO:00160218.5e-53integral to membrane
GO:00076021.7e-27phototransduction
GO:00076015.2e-07visual perception
KEGG pathwaydvi:Dvir_GJ101833e-114 
 K13802 (NINAE)maps-> Phototransduction - fly
InterPro domain[37-299] IPR0002768.5e-53GPCR, rhodopsin-like, 7TM
[9-23] IPR0013911.7e-27Opsin, lateral eye type
[43-55] IPR0017605.2e-07Opsin
Orthology groupMCL10316 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208077-TA
ATGTTGCACCTCATCGATCCTCACTGGTACCAATTCCCGCCGATGAATCCCATGTGGCATGGACTACTTGGCTTTGTAATGGGCGTCCTTGGTTTCATATCGATCACTGGCAACGGAATGGTTGTCTACATCTTCACGACCACTAAGACTCTCAAAACTCCATCAAACATTCTCGTTGTAAATCTTGCATTCTCCGACTTCTGCATGATGGCCATTATGGCTCCGCCTATGCTGATTAACTGCTACAACGAAACTTGGGTGTTCGGACCCTTAGCCTGTCAACTGTACGCGTGCGCTGGATCATTGTTCGGATGTGGTTCCATCTGGACTATGACCATGATCGCTTTCGACCGCTACAACGTCATCGTAAAGGGCCTAGCTGCAAAACCAATGACCATCAATGGCGCTTTGCTGCGAGTGCTTGGTATCTGGGCCTTCTCCCTGGCATGGACTGTTGCTCCAATGTTCGGCTGGGGCCGATACGTACCAGAAGGCAACATGACTGCCTGTGGAACTGACTACTTCGACAAATCTTTCGCCAACCGCTCTTACATCGTAATCTACTCGGTCTTCTGCTACTTTGCACCACTGTTTCTCATTATCTACTCCTACTTCTTCATTATTCAGGCTGTAGCAGCTCATGAAAAAGCGATGAGGGAGCAGGCGAAGAAAATGAACGTTGCCTCTCTTAGGTCTTCCGACCAGGCTAACACCAGCGCTGAGTGCAAACTGGCTAAGGTAGCATTAATGACGATCTCACTGTGGTTCATGGCGTGGACACCTTATCTGGTTATCAACTTCTGCGGCATCTTCGACGGTGCCCCTATCAGCCCCCTCGCGACCATCTGGGGCTCTGTCTTCGCCAAGGCTAATGCCGTTTATAACCCAATTGTATATGGCATCAGCCACCCTAAATACCGCGCGGCGCTGTATGCGAGGTTCCCAGCTCTGTCATGCCAAGCTTCATCTGACGACAACGTGTCCGCAGCATCTGCGGCCACCGCCTGCACCGAAGAGAAACCATCTGCTTGA

Protein sequence:

>DPOGS208077-PA
MLHLIDPHWYQFPPMNPMWHGLLGFVMGVLGFISITGNGMVVYIFTTTKTLKTPSNILVVNLAFSDFCMMAIMAPPMLINCYNETWVFGPLACQLYACAGSLFGCGSIWTMTMIAFDRYNVIVKGLAAKPMTINGALLRVLGIWAFSLAWTVAPMFGWGRYVPEGNMTACGTDYFDKSFANRSYIVIYSVFCYFAPLFLIIYSYFFIIQAVAAHEKAMREQAKKMNVASLRSSDQANTSAECKLAKVALMTISLWFMAWTPYLVINFCGIFDGAPISPLATIWGSVFAKANAVYNPIVYGISHPKYRAALYARFPALSCQASSDDNVSAASAATACTEEKPSA-