Monarch geneset OGS2.0

DPOGS200228
TranscriptDPOGS200228-TA1656 bp
ProteinDPOGS200228-PA551 aa
Genomic positionDPSCF300414 + 84990-96528
RNAseq coverage1x (Rank: top 92%)
Annotation
HeliconiusHMEL0175350.074.37% 
BombyxBGIBMGA008437-TA7e-17661.57% 
DrosophilaninaE-PA4e-3527.33% 
EBI UniRef50UniRef50_Q17A902e-10565.16%Opsin n=4 Tax=Culicidae RepID=Q17A90_AEDAE
NCBI RefSeqXP_312502.23e-10754.93%putative GPCR receptor, opsin family (AGAP002444-PA) [Anopheles gambiae str. PEST]
NCBI nr blastpgi|583833756e-10654.93%AGAP002444-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|583833759e-11254.93%AGAP002444-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00071862.5e-49G-protein coupled receptor protein signaling pathway
GO:00160212.5e-49integral to membrane
KEGG pathway 
InterPro domain[46-292] IPR0002762.5e-49GPCR, rhodopsin-like, 7TM
Orthology groupMCL18337 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200228-TA
ATGTCTTACACCGTGGATTTTAACGACACAGTGGTGAGAGCATACGGCGAGGGATTCCCCTTGTTGATGCCTCGATGGGGATACGTCGTATCCGCCTTTGTTCTCTTCTTAATTGGCTTCTTCGGATTCTTCTTGAATCTCATGGTAATTCTCCTCATGTTTAAAGATCGACAGTTATGGACACCTCTGAACATAATATTGTTCAACTTGGTGTGTTCAGACTTTTCCGTTTCCCTATTGGGAAATCCCTTGACCCTTATTTCCGCACTATTCCATCGTTGGATTTTCGGCCACACTATGTGTGTCATTTACGGGTTCTTCATGGCTTTACTTGGTATAACATCTATTACGACACTTACCGTGATATCGTTCGAGCGTTATCTCATGGTAACAAGACCTCTCAGCTCACGTCATCTTAGTTGTAAAGGAGCGACTGTATCGGTGGTATTCATCTGGCTTTACTCACTCGCTCTCACAACCCCGCCATTATTGGGTTGGGGGAATTATGTCAATGAGGCTGCTAATATCAGCTGCTCAGTGAATTGGCACGAACAATCGAGGAATACTCTCACGTACATCTTATTCCTGTTCGCGATGGGACAGATTGTGCCGCTGGCTGTCATCACCTTCAGCTATGTCAACATTATCAGGACTATGAAGCGGAATTCTCAACGCCTGGGTCGTGTGAGCCGAGCAGAGGCGCGGGCCACAGCCATGGTGTTCATTATGATCATATCATTTACGGTCGCCTGGACTCCATATTCACTGTTTGCACTGATGGAGCAGTTTGCGACCGAGGGAATTGTATCGCCGGGGGCGGGAGTTATACCAGCACTAGTAGCGAAAAGTTCCATTTGTTACGATCCTTTAATTTACGTTGGGATGAACACACAGTTTCGAAAATCTATCAAAAGAATATTTGGAATACAAAAACGAAGGGGTTCTAGAATTGAAAAGTGCTATAATAACTCGATACTATCACCAACACATCGACGATCTGCTTACAATGATATTACCGTCCGTTATAATTCTTCAGACACAGTCATATCAACTCCGAAACGTTACTCTGATAAACGGATTTTCAGTAGTGAAGATTCTGAAAGTCATATGGCATCCACAGTAACGAATGAATCGAGTCGTCCGACAGGAAAAGTGTATGAATTATGCACTATCCAAGAGAACAAAACCGGATCGGATGCTGAGAGATCCGGAGACACTGTTTGCGATGATGATAACACAAGTAACAAAAACCTAGAACTTGTTTTTGGTGAATCTTGTAAAATATTTAAAGACTCCAATGTTGATAACGAACGATCCACAACTGGTGCAGTCAAAAACAAAATTCATATAACTGAAAGTAAGAACATTTTAGATGAAGCCGACTTGCGTCTAATTGAAAGTAATTTAAATCAAAGTTCTAAAAGATCTAAAATAATAAGACACAGCTATTCGTTGGATTTGGGAGGAGTTTCTAATGAAGGTACACGAAGAAAATTTTCTATTGAAACTAAACTAGAAACATTTAACCCTCCAAAACTATTCAGAAAAGATGGCGTAGTGAACAAACTCTTTGAATCGGATGATACAGATAATGTTAGTTATATATGTACAATAGATAATAGTGAAAATCATGAGGAACACTTGTCGTCTAAGTAA

Protein sequence:

>DPOGS200228-PA
MSYTVDFNDTVVRAYGEGFPLLMPRWGYVVSAFVLFLIGFFGFFLNLMVILLMFKDRQLWTPLNIILFNLVCSDFSVSLLGNPLTLISALFHRWIFGHTMCVIYGFFMALLGITSITTLTVISFERYLMVTRPLSSRHLSCKGATVSVVFIWLYSLALTTPPLLGWGNYVNEAANISCSVNWHEQSRNTLTYILFLFAMGQIVPLAVITFSYVNIIRTMKRNSQRLGRVSRAEARATAMVFIMIISFTVAWTPYSLFALMEQFATEGIVSPGAGVIPALVAKSSICYDPLIYVGMNTQFRKSIKRIFGIQKRRGSRIEKCYNNSILSPTHRRSAYNDITVRYNSSDTVISTPKRYSDKRIFSSEDSESHMASTVTNESSRPTGKVYELCTIQENKTGSDAERSGDTVCDDDNTSNKNLELVFGESCKIFKDSNVDNERSTTGAVKNKIHITESKNILDEADLRLIESNLNQSSKRSKIIRHSYSLDLGGVSNEGTRRKFSIETKLETFNPPKLFRKDGVVNKLFESDDTDNVSYICTIDNSENHEEHLSSK-