Monarch geneset OGS2.0

DPOGS200254
TranscriptDPOGS200254-TA1011 bp
ProteinDPOGS200254-PA336 aa
Genomic positionDPSCF300169 + 250134-258059
RNAseq coverage80x (Rank: top 64%)
Annotation
HeliconiusHMEL0106532e-11761.62% 
BombyxBGIBMGA000016-TA1e-12278.26% 
DrosophilaNPFR1-PA2e-7142.15% 
EBI UniRef50UniRef50_B3XXL65e-15778.98%Neuropeptide receptor A4 n=2 Tax=Obtectomera RepID=B3XXL6_BOMMO
NCBI RefSeqNP_001127739.11e-15778.98%neuropeptide receptor A4 [Bombyx mori]
NCBI nr blastpgi|1972099502e-15678.98%neuropeptide receptor A4 [Bombyx mori]
NCBI nr blastxgi|1972099503e-15978.98%neuropeptide receptor A4 [Bombyx mori]
Group
Gene OntologyGO:00071867.9e-35G-protein coupled receptor protein signaling pathway
GO:00160217.9e-35integral to membrane
KEGG pathwayaag:AaeL_AAEL0106264e-78 
 K04209 (NPYNR)maps-> Neuroactive ligand-receptor interaction
InterPro domain[42-66] IPR0002767.9e-35GPCR, rhodopsin-like, 7TM
Orthology groupMCL17260 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200254-TA
ATGAATCTTACCCTAAATGCCTCAGCAAGATGGTTAATCGAGGCATCCCGGGGGGTCCAGAATCCCAATATTCTACTACAGAAATTCTCACAAAACAGAAAAGTGGACGACCCAACACGATCCCTGCTCTTAACCTTTTACGGCATCTTGGTGGTCATTGGCGCTGTAGGGAATGCTTTAGTGGTAATATCTGTAGTGAGAAAACCGGTGATGCGTACAGCACGTAATATGTTCATAGTCAACTTAGCTGTATCCGATGCGCTAGTTTGTTGTGTCGGCACGCCACTTACATTAATGGAACTTTTGACCAAACATTGGCCATTACCAGATTGGCCCAGTCTTTGTAAAGCTTGCGGGGCAATACAGGCTATATCTATATTTGTTTCAACGATCTCCATAACAGCTATCGCATTAGATAGGTACCAGTTAATCGTCTATCCAACTCGGCCTGGATTACAAACAATGGGTGCACTTGTGACAATGTTTTTCATCTGGGTAACAGCGTTCACTCTGGCTTCACCCCTATATATATTTAGAAGTCTAAAACGGCACAAGGTTGGAATTTTAGTGCTGGTGGTTGTCATAGCCCATGTCCAAATTCATAGAAGGTTACGCGGCAGGAGACGGACCACCAGGAAGACCCCAGCCATTTTGATAGCCATAGCTGTTACTTATGTCATAAGTTGGTTACCCCTGAATGTTTTTAATTTGGTTGCTGACTTTAGCAAAGATGCCATTCTAGATGAAAAGTCGATGACAATAACTTATGCTATCTGTCATATGTTTGGTATGTCCAGCGCTGTGTCAAACCCATTACTCTATGGTTGGCTCAATGACAACTTTAGAAAGGAATTTGAAGAAATTTTATGCTGTTGTAGACAGAAGAGGCAAAAGAATAAGAACTCAAGGATCAACAATAGAAAAATGGATACCGAATTGACAGCGCTTGCGCAATTAGAGCACACTGTTACAGCAAATACGAAGACATCACAGTGTTCACAAATTTTCTGA

Protein sequence:

>DPOGS200254-PA
MNLTLNASARWLIEASRGVQNPNILLQKFSQNRKVDDPTRSLLLTFYGILVVIGAVGNALVVISVVRKPVMRTARNMFIVNLAVSDALVCCVGTPLTLMELLTKHWPLPDWPSLCKACGAIQAISIFVSTISITAIALDRYQLIVYPTRPGLQTMGALVTMFFIWVTAFTLASPLYIFRSLKRHKVGILVLVVVIAHVQIHRRLRGRRRTTRKTPAILIAIAVTYVISWLPLNVFNLVADFSKDAILDEKSMTITYAICHMFGMSSAVSNPLLYGWLNDNFRKEFEEILCCCRQKRQKNKNSRINNRKMDTELTALAQLEHTVTANTKTSQCSQIF-