Monarch geneset OGS2.0

DPOGS210272
TranscriptDPOGS210272-TA1461 bp
ProteinDPOGS210272-PA486 aa
Genomic positionDPSCF300216 + 71017-78869
RNAseq coverage30x (Rank: top 76%)
Annotation
HeliconiusHMEL0086130.073.08% 
BombyxBGIBMGA000025-TA8e-17463.22% 
DrosophilaCG15589-PA3e-6341.71% 
EBI UniRef50UniRef50_A7USY81e-6845.82%AGAP004124-PA n=3 Tax=Culicidae RepID=A7USY8_ANOGA
NCBI RefSeqXP_001848745.18e-7441.16%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700420312e-7241.16%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1892413872e-7240.14%PREDICTED: similar to g-protein coupled receptor [Tribolium castaneum]
Group
KEGG pathwaytca:6560413e-63 
 K04209 (NPYNR)maps-> Neuroactive ligand-receptor interaction
InterPro domain[116-191] IPR0124642.5e-08Protein of unknown function DUF1676
Orthology groupMCL16070 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210272-TA
ATGGATAAATCAATCCAAAAGATGTTTAAGAGAAATCAGTGGAGGTTCTTGGAAGAGGCTATCTTAGTCACATTTTATATAAACCTTATAATAGTCTCAGGAAATTGTCAGACTGTCCACAAAGAGATATCAAGTCAAGTAGATGGCTCAAACAAAACGGATCTAAATAGTGTACATAGTATAGATTCTAATGCCTGCCATAGCTGTCAAGATAGTGAAAATAATATTAATAAAGAAAATGAAAATCCAAGCTATCTTGCAGACTTTATAGATTCTGTGAGTAGATTTTCAGAAAAATACATATTCACTAATAACACTGAACAAAAATTACAAAATGTTTTCGAAACAGCATTAGATAATGTGCTTGATAAAGATAGATATGAATTATTTGATGGAGTTGAAATCAAAACAATTGACGGTCAAAATAAAACAGAGCAAAAAACTGAAAGGTCGGAAGAAGAGGAGAGTCGAGCATTATTCAGTACTTACACATATGAATACAGACTATTCCAAAAGATAAAAAACTTTGTCGATACTCATATACTATCAATTAACCTTCCAAAAGCTGCCAGGCTAATGGGCTTCCGATCATTCGGATTAAAAAACTTATTTTTGCCACTTCTTATTGGAGCACAAGTTTTTAAAAGTATTTTACTGGCGATGTTTCTTCCGAGCATCTTGGGAAGTTTTGGAAAAATACTCGGAAAAGGTATATCCCAAGTCTCGGCTGCATCGAGTCAAGCAAGCTACCCTCCAGCAAACACTGACGATCAGACCGCTTACAATAATGATCATATGGGCTATGAAACAAATCCTGCAGCGACTTACGCATATACCGATGGGATGTATGGAAACGACGCCAACGATATGAGCGATCTGTCTAATGTCGATATGTCAAGATTCGGCGCTGGAGGTCAGAAAGTAACTTATCTCCCGACTAAGAACGGATATTATAAGAACCAAATGTCTTCTGGCAACAACTACAAGATATTCCAGAAAATTCCAGCATCCTCTATCATACTGAGCAATTACGACCCGTTCTATTCGCCTTTGCTTTCTCGACTCGACGGCATCTTCGCCAGACTGGGATTAGCACCGTCAGATACCAAGACAGACGATGGAATGCAAATCGGAGGACAAACGCTTAGCGAGGTCAAATTGGAAGCTTGCAGGGAACAGCTGATATGTCTCATGTATGCTAGTCCCGCGAAATATGCGCCGTACAGTAATCTTGTATCAGCACAGTTAAGCAGAGAGCTAAACGAACTCCGCCGTCCTGTGTCAGACAACCCGGAGATTCTTCGTTTCTTCCGCTACATGAGAGCAGCACGTCGCGGTCAGGAAGGAACTGATTGTGTCAGCGAACACGCAGCCTGTGCCACGGCAGCACCCTCACACACCATGATATCAGCTTACCATGACATAAACAAACTCGTGACAGCGAGGAAGCTGCATAATTAG

Protein sequence:

>DPOGS210272-PA
MDKSIQKMFKRNQWRFLEEAILVTFYINLIIVSGNCQTVHKEISSQVDGSNKTDLNSVHSIDSNACHSCQDSENNINKENENPSYLADFIDSVSRFSEKYIFTNNTEQKLQNVFETALDNVLDKDRYELFDGVEIKTIDGQNKTEQKTERSEEEESRALFSTYTYEYRLFQKIKNFVDTHILSINLPKAARLMGFRSFGLKNLFLPLLIGAQVFKSILLAMFLPSILGSFGKILGKGISQVSAASSQASYPPANTDDQTAYNNDHMGYETNPAATYAYTDGMYGNDANDMSDLSNVDMSRFGAGGQKVTYLPTKNGYYKNQMSSGNNYKIFQKIPASSIILSNYDPFYSPLLSRLDGIFARLGLAPSDTKTDDGMQIGGQTLSEVKLEACREQLICLMYASPAKYAPYSNLVSAQLSRELNELRRPVSDNPEILRFFRYMRAARRGQEGTDCVSEHAACATAAPSHTMISAYHDINKLVTARKLHN-