Monarch geneset OGS2.0

DPOGS208446
TranscriptDPOGS208446-TA2166 bp
ProteinDPOGS208446-PA721 aa
Genomic positionDPSCF300095 + 357609-372203
RNAseq coverage33x (Rank: top 75%)
Annotation
HeliconiusHMEL0078442e-7693.15% 
BombyxBGIBMGA008849-TA0.089.01% 
DrosophilaCG31760-PA0.053.58% 
EBI UniRef50UniRef50_Q9VKA40.053.58%Probable G-protein coupled receptor CG31760 n=15 Tax=Endopterygota RepID=Y1760_DROME
NCBI RefSeqXP_973772.10.060.48%PREDICTED: similar to CG31760 CG31760-PA [Tribolium castaneum]
NCBI nr blastpgi|910891610.060.48%PREDICTED: similar to CG31760 CG31760-PA [Tribolium castaneum]
NCBI nr blastxgi|910891610.060.42%PREDICTED: similar to CG31760 CG31760-PA [Tribolium castaneum]
Group
Gene OntologyGO:00071868.4e-32G-protein coupled receptor protein signaling pathway
GO:00160218.4e-32integral to membrane
GO:00049308.4e-32G-protein coupled receptor activity
KEGG pathway 
InterPro domain[381-614] IPR0179788.4e-32GPCR, family 3, C-terminal
Orthology groupMCL11322 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208446-TA
ATGCATAAAATAAGGTATATTTCATATTCATTGTTAAAGGTGAGCGCGTGTTCGCGCGAATGTTCGCCGGAGATGAGAGCGCCGCGGCGCGTGGCAAGTGCTCTGCTGCTGCCAATCACTCTATTATCATTACCACACGCCTATGCTACCAACTCGACCACTTACCGGGAGGAATTAGAGTCATCTCTGCGTTTGGTACACGCGGTGGCGACGGGTGCCACGGGTGCATTGTGCGTCACCCATCATTACGGAAGGCTGCGAGCGCCCATGCACACTTCACGCTGGGACGCAGCACGAGCCAGAGCAGACCTCACGGCCAATATTCTGCAACAACTGGGGTTAGAAACCGCTGAAGTGCTCATGGCGGTATCCCAAGGTTTAGTCATGGGAGCCAGCAATTCCGAGCGGGCAGTTGGGGCCAGAGCTCTAGCATTAAGATCTGACGGGGTCGTAAATAGCGCAGTAGCATGGGAGAGATATGGCGCTGCTGAGCCAGTGGCGGTCGTGTCCCCACAACTGGAGAAACCACCTGATTCTGATTATCCTTGGTACGTTTCTGCTGAAGACACCGAATCTTTAAGGTCGCCAAAGTTCATGCCCTCCCCTCCAGGTGCAGCTATGGAAGGATGGTGGACCTTCCCTTACTATAGTTGTGGAGCTAAACGATGGCTGCTATCTTATACTGTTCCTGTATCAACAATACGAGGATTAAAGGGCGTTGTTTCGTTAGACATTGATATCTCTAATCTCGAGATAAATCAGTGTGAAGTGGAGAATAATGAGGACAATGACAACCAAATTTATTCTTTTCATGGAACACATAAATGTCCGAATGAAACGACATATTGTGATTACAGACCAAATAGAGAGATGAGCGTACGTTTTGCTGGCTGGGCTCGCGGCTCCTACATTTGCAAATGCCGTCCTGGTTTTTATTCCATGCATCACCCCGATGGTTTTAATGGATCTCTTGTTGAAGTGGCATATCAAGAATACGAAGAAAACGGCTTAGAAAACTGGAATGAACCATACGAATGTCTAAAATGTCAACCTGGCTGTGAGACCTGCCGTGGGCCAGAGCCATGTCTAGCCACTTATAACTGGCCTTTTCGGATATCTCTGTTGGTGATATCTGTGAGCTGTGCGGTGATGACGGTCGTATTGGTGATCTACACACGTCATCACCGCCGTGTGAAGGTGTTCCGAGTAGCCAGCCCTGTATTTCTATCAATCACGTTACTTGGATGCGCCATCATGTATATGGAGATGGCTGCCATATTTCCCGTACTGGACCGATATTCATGCATCGCAACAAAATGGACTCGGCACATGGGCTTCTGCATCACCTACACTGCACTTCTAATGAAAACTTGGCGAGTATCTCTCACGTTTCGCGTGAAGTCGGCACACAAATTGAAACTAACGGACAAACAATTATTGCAATGGATGGCTCCCATTATACTAATCATGCTTGTGTATCTTGGCACTTGGACGCTCTCTGCGCCTCCAGACGCTGAAGTTATAACAGATAATAAGGGGCTGAAATTCAAACAGTGTACATATAATTGGTGGGACCACAGTTTAGCTATCGGAGAAATTCTATTTCTTCTATGGGGCGTTCGTGTATGTTATCGCGTCCGACACGCAGAAAGTCTCTACAATGAAGCGAGACTCATATCTATTGCCATTTACAACATATTCACCGTCAATTCACTAATGATAGCCTTCCATTTATTGATTTTACCAAGAGCTGGTCCTGATATAAAGTACTTGCTAGGCTTCATACGCACTCAACTCTCAACGTCAACCACAGTTTTACTAGTATTTCTACCTAAAGTACTACGCGTGGTTCGTGGCACGGGTGACACGTGGGACAGCAGGGCACGTGCTCGTGGCGTGCCAGCCTCCTCGTCACTCAACGGCATCGGACTCGTGCCGGATGATCCGCCAGATTTGTATCAGGAAAATGAGGAGCTTAAGGAAGAAGTTCAAAAGTTGGCAGCACAAATTGAATTTATGAAGATAGTCCAAATGGAGATGCACAACCGACATCTGCGACCGAGACCAGGTGGCTACTTCACAACTACTGGGGCACCTCAAAGCCCTATGCATTCTAAGGCTATTAATATTTCACAAGAGAATAATGATTGGTCAGGTCCAGTGTGA

Protein sequence:

>DPOGS208446-PA
MHKIRYISYSLLKVSACSRECSPEMRAPRRVASALLLPITLLSLPHAYATNSTTYREELESSLRLVHAVATGATGALCVTHHYGRLRAPMHTSRWDAARARADLTANILQQLGLETAEVLMAVSQGLVMGASNSERAVGARALALRSDGVVNSAVAWERYGAAEPVAVVSPQLEKPPDSDYPWYVSAEDTESLRSPKFMPSPPGAAMEGWWTFPYYSCGAKRWLLSYTVPVSTIRGLKGVVSLDIDISNLEINQCEVENNEDNDNQIYSFHGTHKCPNETTYCDYRPNREMSVRFAGWARGSYICKCRPGFYSMHHPDGFNGSLVEVAYQEYEENGLENWNEPYECLKCQPGCETCRGPEPCLATYNWPFRISLLVISVSCAVMTVVLVIYTRHHRRVKVFRVASPVFLSITLLGCAIMYMEMAAIFPVLDRYSCIATKWTRHMGFCITYTALLMKTWRVSLTFRVKSAHKLKLTDKQLLQWMAPIILIMLVYLGTWTLSAPPDAEVITDNKGLKFKQCTYNWWDHSLAIGEILFLLWGVRVCYRVRHAESLYNEARLISIAIYNIFTVNSLMIAFHLLILPRAGPDIKYLLGFIRTQLSTSTTVLLVFLPKVLRVVRGTGDTWDSRARARGVPASSSLNGIGLVPDDPPDLYQENEELKEEVQKLAAQIEFMKIVQMEMHNRHLRPRPGGYFTTTGAPQSPMHSKAINISQENNDWSGPV-