Monarch geneset OGS2.0

DPOGS200456
TranscriptDPOGS200456-TA3492 bp
ProteinDPOGS200456-PA1163 aa
Genomic positionDPSCF300260 - 124917-137467
RNAseq coverage19x (Rank: top 80%)
Annotation
HeliconiusHMEL0130690.066.15% 
BombyxBGIBMGA011187-TA0.061.33% 
DrosophilaCG9304-PA6e-1423.81% 
EBI UniRef50UniRef50_D6WUH91e-13462.46%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WUH9_TRICA
NCBI RefSeqXP_971871.12e-13562.46%PREDICTED: similar to AGAP012225-PA [Tribolium castaneum]
NCBI nr blastpgi|910879354e-13462.46%PREDICTED: similar to AGAP012225-PA [Tribolium castaneum]
NCBI nr blastxgi|910879354e-13662.46%PREDICTED: similar to AGAP012225-PA [Tribolium castaneum]
Group
KEGG pathway 
InterPro domain[279-534] IPR0193362.3e-64Rhodopsin-like GPCR transmembrane domain
Orthology groupMCL16557 Patchy
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200456-TA
ATGCCCGGAGTGGACGCCAAGTGGGTCGAGGGCTGGGTCAATACCAAGGAGAACTGGGCGTTTCTGGCTCGTTTCTGTTTCCTGTCGCTGGAGGGGCAGTTCGAGTACCTCATAGAGTATGACAGAGATCTGGGAACACCGAACCTACTGCTGTATTATGACGAGGAGTCTCAGTGGCCCGCAGTCTATCATAGCAGTAAGACATGCAGAGAACGTGAGGCGGTGCTGAACAAGGGAGGACAGAACCAGATTGTGAAGCTATCTCACTGGTATCCAGACACAGAGTACTCTGGATGCATTCTGACACAGTCAAACAAAGAGTTGCCAGCTTCGAAAAGCACAACAGCAACAGTCCCAACGACAACAACAAAAAAGACAAATTCAACTAGGAATGACCTGGGCTTCCTGGGCGACATGATGCTTCAGTTTTTAAAAACTACCACACCAAAACCAAAAACTACCATAATAGATGCAAATTCTACCATCTGGTACGAATTGCTACCAGAGAACATAACGACAACAGTGTTGCCAGAGGACGAGTGGGAAAACATCGGTCCGCAAGAAAACGACACTCAAACGGAAAAACTAAAAGACGTGGAGTTGTTATTCGACAATACGAACAGCAGTGCGCTGAAAATCAAACGTTCGACGTTTGAACTTTACGAGAGGTTCCGCAAAAGGAGGTTGTATTCTGAATCGAATCAAAATGGAGAAAGTGGGTCCAATACTGACACTGTCAAATTTATAGTCACTTGTCGTAACTCGAGGAGGTTCCGGTCAGCGAGAGAGAGGTGGTGGTTCATCGCCATAAGTAACTGCGGCAGTCCTAAGGGCTTGGACGTTAGATATAAATTTTTGATGACGAATGGCCCGAGTGGAGATTTTTGGCATGAACATTTCTCAGCTGACGAATTCTATGTGCTACCAATACTCCTGGCATACACTTTCGCCTACGTCATAGTCATGGTGGCGGTAGTGATGTGCAGCGTGGAACTCAAGACGCGCCACTTGTTGCACTCGACCTACAAACTATTCCTGATATCGATAGTGTCGCAACACTTCGGTGTTATCTTGCAGAGTCTCGCCGGGATCAGATACGCCTATAACGGTATTGGTAACCCCGCTGCTAGAGTTCTGGGTCAGATTCTCTGCGGTGTCTCGGAGACGAGTTACCTCCTGCTCCTGATTCTCTTGGCCAAAGGTTACACTATAACCCGCGGGCGGCTGAAGGTCGCCTTCACCGTCAAACTGACAGTTTTCATGTGTTGCTACGTCATCACTTATATTGTCCTGTTCGTGTATCAGGCTAAGGCGTTCGATCCTGGTGAGGTGTTATATATATACGAGAGCCCGGCCGGCTACGGTCTCATCGTGTTGAGACTCAGCTCGTGGTTCGCTTTCGCGTACTCAACGTTTTTCACCGTCAGGAAATTTCCAGAGAAGAATTTATTCTACTGTCCGTTCTTCATCTGTGGGACTTTGTGGTTTTTCGCGGGACCTCTTTTCATCTTGACCGCCAACTCTTATATAGACAAATGGGTCCGCGAGAGCGTGGTGACAGCCGTGTTGCTACTCATCACGTTTTGCGGACACACGATGTTTTTGCTGTTAACACTACCGGTTTTCGCAAATAAAAATTTTCCCTACCACGTGAGGACCACTCAGATAGGTGTGATGGAGGTCAACGGCAACAATTTGGATAGGTTTGGACCGACGCCCTACCACCCCAGCGGCGGCACGGCGCAGACTGTTATAATACCTCTCACAAGACGCACAGAAGAACTCATCGGTAATATGTACAACCAGTACATGGCAAGCGCTCCGTCTTTGTCAATGGAAACACAACAACCAAAACCGAACGCGCCAAAAAGTTTAGGCTCAATAAGGTCCGGAAGTCTCGTCACGCAGAACAGTTCAGAGACGAATAGATCTGACGACATACAGCCATCGATTAAAGAAATATATACAGTCGAGGGGGAAATCAACAAAATGGACACCGAAACTGAGGCAGTGTTGCCAGATGAAAAGCTACCGAATGAAATGGTAGAAAATACAGAATTACCGCCCATTGTTAGGTCAAGGAGAAATATTTTGGAGCCCATAAAAAGAGATGTCCCCGATTGGTCGTTAGCTAAGGGAGCGTGTGTGGTCGCTATGCAGTTAAAGAAACTAAAGACTGCAGAAGACGAAGAAGTTTCCGAACTTCCACCGCTACAAATAAACGGCAAGAAAGTTTTAAGGAACGGGCAAACAAGTCTAGCGGAAATGGGGACCATACATGAAGGTCAGATTCTCTGCGGTGTCTCGGAGACGAGTTACCTCCTGCTCCTGATTCTCTTGGCCAAAGGCTACACTATAACCCGCGGGCGGCTGAAGGTCGCCTTCACTGTCAAACTGACAGTTTTCATGTGTTGCTACGTCATCACTTATATTGTCCTGTTCGTGTATCAGGCTAAGGCGTTCGATCCTGGTGAGGTGTTATATATATACGAGAGCCCGGCCGGCTACGGTCTCATCGTGTTGAGACTCAGCTCGTGGTTCGCTTTCGCGTACTCAACGTTTTTCACCGTCAGGAAATTTCCAGAGAAGAATTTGTTCTACTGTCCGTTCTTCATCTGTGGGACTTTGTGGTTTTTCGCGGGACCTCTTTTCATTTTGACCGCCAATTCTTATATAGACAAATGGGTCCGCGAGAGCGTGGTGACAGCAGTGTTGCTACTCATCACGTTTTGCGGACACACGATGTTTCTGCTGTTAACACTACCGGTTTTCGCAAATAAAAATTTTCCCTACCACGTGAGGACCACTCAGATAGGTGTGATGGAGGTCAACGGCAACAATTTGGATAGGTTTGGACCGACGCCCTACCACCCCAGCGGCGGCACGGCGCAGACTGTTATAATACCTCTCACAAGACGCACAGAAGAACTCATCGGTAATATGTACAACCAGTACATGGCAAGCGCTCCGTCTTTGTCAATGGAAACACAACAACCAAAACCGAACGCGCCAAAAAGTTTAGGCTCAATAAGGTCCGGAAGTCTCGTCACGCAGAACAGTTCAGAGACGAATAGATCTGACGACATACAGCCATCGATTAAAGAAATATATACAGTCGAGGGGGAAATCAACAAAATGGACACCGAAACTGAGGCAGTGTTGCCAGATGAAAAGCTACCGAATGAAATGGTAGAAAATACAGAATTACCGCCCATTGTTAGGTCAAGGAGAAATATTTTGGAGCCCATAAAAAGAGATGTCCCCGATTGGTCGCTAGCTAAGGGAGCGTGTGTGGTCGCTATGCAGTTAAAGAAACTAAAGACTGCAGAAGACGAAGAAGTTTCCGAACTTCCACCGCTACAAATAAACGGCAAGAAAGTTTTAAGGAACGGGCAAACAAGTCTAGCGGAAATGGGGACCATACATGAAGGTAGGGAGCAGACATACATACGCTCGCCCGCGGATATATTCACAGTTACAACTAGGAGTTAA

Protein sequence:

>DPOGS200456-PA
MPGVDAKWVEGWVNTKENWAFLARFCFLSLEGQFEYLIEYDRDLGTPNLLLYYDEESQWPAVYHSSKTCREREAVLNKGGQNQIVKLSHWYPDTEYSGCILTQSNKELPASKSTTATVPTTTTKKTNSTRNDLGFLGDMMLQFLKTTTPKPKTTIIDANSTIWYELLPENITTTVLPEDEWENIGPQENDTQTEKLKDVELLFDNTNSSALKIKRSTFELYERFRKRRLYSESNQNGESGSNTDTVKFIVTCRNSRRFRSARERWWFIAISNCGSPKGLDVRYKFLMTNGPSGDFWHEHFSADEFYVLPILLAYTFAYVIVMVAVVMCSVELKTRHLLHSTYKLFLISIVSQHFGVILQSLAGIRYAYNGIGNPAARVLGQILCGVSETSYLLLLILLAKGYTITRGRLKVAFTVKLTVFMCCYVITYIVLFVYQAKAFDPGEVLYIYESPAGYGLIVLRLSSWFAFAYSTFFTVRKFPEKNLFYCPFFICGTLWFFAGPLFILTANSYIDKWVRESVVTAVLLLITFCGHTMFLLLTLPVFANKNFPYHVRTTQIGVMEVNGNNLDRFGPTPYHPSGGTAQTVIIPLTRRTEELIGNMYNQYMASAPSLSMETQQPKPNAPKSLGSIRSGSLVTQNSSETNRSDDIQPSIKEIYTVEGEINKMDTETEAVLPDEKLPNEMVENTELPPIVRSRRNILEPIKRDVPDWSLAKGACVVAMQLKKLKTAEDEEVSELPPLQINGKKVLRNGQTSLAEMGTIHEGQILCGVSETSYLLLLILLAKGYTITRGRLKVAFTVKLTVFMCCYVITYIVLFVYQAKAFDPGEVLYIYESPAGYGLIVLRLSSWFAFAYSTFFTVRKFPEKNLFYCPFFICGTLWFFAGPLFILTANSYIDKWVRESVVTAVLLLITFCGHTMFLLLTLPVFANKNFPYHVRTTQIGVMEVNGNNLDRFGPTPYHPSGGTAQTVIIPLTRRTEELIGNMYNQYMASAPSLSMETQQPKPNAPKSLGSIRSGSLVTQNSSETNRSDDIQPSIKEIYTVEGEINKMDTETEAVLPDEKLPNEMVENTELPPIVRSRRNILEPIKRDVPDWSLAKGACVVAMQLKKLKTAEDEEVSELPPLQINGKKVLRNGQTSLAEMGTIHEGREQTYIRSPADIFTVTTRS-