Monarch geneset OGS2.0

DPOGS213510
TranscriptDPOGS213510-TA1134 bp
ProteinDPOGS213510-PA377 aa
Genomic positionDPSCF300033 - 1007350-1010301
RNAseq coverage5512x (Rank: top 2%)
Annotation
HeliconiusHMEL0109620.089.42% 
BombyxBGIBMGA012539-TA7e-6238.08% 
DrosophilaRh3-PA2e-13159.84% 
EBI UniRef50UniRef50_P049503e-12959.84%Opsin Rh3 n=127 Tax=Neoptera RepID=OPS3_DROME
NCBI RefSeqXP_001653866.16e-13662.88%ultraviolet-sensitive opsin [Aedes aegypti]
NCBI nr blastpgi|515741140.099.73%UV opsin [Danaus plexippus]
NCBI nr blastxgi|515741140.099.73%UV opsin [Danaus plexippus]
Group
Gene OntologyGO:00071862.2e-48G-protein coupled receptor protein signaling pathway
GO:00160212.2e-48integral to membrane
GO:00076023.7e-31phototransduction
GO:00076013.7e-31visual perception
KEGG pathwaydgr:Dgri_GH233124e-58 
 K13802 (NINAE)maps-> Phototransduction - fly
InterPro domain[71-334] IPR0002762.2e-48GPCR, rhodopsin-like, 7TM
[40-56] IPR0008563.7e-31Opsin RH3/RH4
Orthology groupMCL16575 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213510-TA
ATGGACAACGATACGGACAACATCAACGTCTATGGCGCGTACTTCGCACCGCTCAGGTCCAGCGAGGGTACGAAGATGCTGATAGACGGTCTCACAGGCGAGGACCTAGCTGCAGTACCGGAGCACTGGCACACCTACCCCTCTCCACCAGCCAGCGCCCACACGGCCCTAGCTCTCCTCTACTGCTTCTTCACAGCTGCAGCACTCATTGGAAATGGAATGGTCATATTCATTTTTCTAACAACGAAGAGTTTGCGGACATCTAGCAACTTACTGATTCTAAACCTCGCAATTAGCGATTTCATTATGATGGCCAAAGCTCCAATCTTCATATATAATAGTGCATTGCGCGGCTTTGCTGCGGGACCCGTCGGTTGTCAAATATTTTCCGTAATGGGTGCCTATAGCGGCATTGGAGCGAGTATGACCAACGCTTGTATTGCCTACGACAGACATTCTACGATCACTCGACCACTTGACGGGCGACTGTCGCAAGGGAAAGCTTTGTTGATGATAGCCTTTGTATGGATATATGCGACGCCCTGGTCACTTCTGCCACTGTTCAAAGTATGGGGCAGATTTGTACCAGAGGGCTACTTAACATCATGCACGTTCGATTACTTGTCCAATACTTTCGACACTAAATTGTTTGTGGCCTGTATATTCACTTGTAGCTACGTCTTCCCTATGACCATGATTATATATTTCTACAGTGGCATTGTCAAACAAGTGTTCGCACATGAAGCAGCATTGAGAGAACAAGCTAAGAAAATGAATGTTGAATCTCTGCGCTCGAATCAAAACGCCTCCGCGGAGTCCGCTGAGATACGAATTGCTAAAGCGGCTCTCACCGTCTGCTTCCTGTTTGTGGCGTCTTGGACGCCGTACGGGGTGATGGCGCTCATAGGAGCCTTCGGGGATCAACGACTTCTCACACCCGGGGTAACTATGATACCAGCCGTAGCGTGCAAAACTGTAGCCTGCATCGATCCTTGGGTTTATGCAATCAGTCATCCCAAGTACAGGCAAGAGCTTCAGCGTCGCATGCCTTGGCTTCAAATCAATGAGCCCGATGATAATGTTTCAAACACTACTAACGGCACTACCAATTCTACGGCTACGCCTACCGCCTAA

Protein sequence:

>DPOGS213510-PA
MDNDTDNINVYGAYFAPLRSSEGTKMLIDGLTGEDLAAVPEHWHTYPSPPASAHTALALLYCFFTAAALIGNGMVIFIFLTTKSLRTSSNLLILNLAISDFIMMAKAPIFIYNSALRGFAAGPVGCQIFSVMGAYSGIGASMTNACIAYDRHSTITRPLDGRLSQGKALLMIAFVWIYATPWSLLPLFKVWGRFVPEGYLTSCTFDYLSNTFDTKLFVACIFTCSYVFPMTMIIYFYSGIVKQVFAHEAALREQAKKMNVESLRSNQNASAESAEIRIAKAALTVCFLFVASWTPYGVMALIGAFGDQRLLTPGVTMIPAVACKTVACIDPWVYAISHPKYRQELQRRMPWLQINEPDDNVSNTTNGTTNSTATPTA-