Monarch geneset OGS2.0

DPOGS200640
Transcript: DPOGS200640-TA; 4974 bp
Protein: DPOGS200640-PA; 1657 aa
Genomic position: DPSCF300076 + 528311-536714
RNAseq coverage: 229x (Rank: top 44%)
Annotation
Heliconius: HMEL001017; 0.0; 78.90%
Bombyx: BGIBMGA011318-TA; 0.0; 69.11%
Drosophila: CG15744-PA; 1e-120; 28.91%
EBI UniRef50: UniRef50_C3PPF7; 0.0; 77.98%; DNA sequence from clone AEHM-28L23 n=90 Tax=Heliconius RepID=C3PPF7_9NEOP
NCBI RefSeq: XP_001607909.1; 0.0; 34.18%; PREDICTED: similar to CG15744-PA [Nasonia vitripennis]
NCBI nr blastp: gi|229487414; 0.0; 77.98%; unnamed protein product [Heliconius melpomene]
NCBI nr blastx: gi|229487414; 0.0; 77.98%; unnamed protein product [Heliconius melpomene]
Group
Gene Ontology: GO:0007186; 2.2e-08; G-protein coupled receptor protein signaling pathway
GO:0016021; 2.2e-08; integral to membrane
GO:0004930; 2.2e-08; G-protein coupled receptor activity
KEGG pathway 
InterPro domain: [773-961] IPR000832; 2.2e-08; GPCR, family 2, secretin-like
Orthology group: MCL11385; Multiple-copy universal gene

Nucleotide sequence:

>DPOGS200640-TA
ATGAAGAATTTCGCGATAATAGTTCTGTTTTTATTGATCGAGGGAGCGTTTGGTTTTTGTCCGTCGTTATGTTCTTGTAAATCGAATAAGTCCGCCGATGGGCCTCCCTCGGAGCCCCTTCCCGGCGATTTCTTGAGATTAAAATGTGGTGGAAGTCCTGCACAGATCACTGAGCTTAAAGAGATAGACCTGAGCAAACTGTGGACATTGGTCGTTAGTCTAAATCTATCAGGAAACGCCATATCTACATTATCAAGAGAGCTTCATTTGCCTAATTTACAAAAATTAGATCTCAGTAGAAATCAAATAACATTAATAGAGTCCGACGCCTTCTACAACATGACTTCATTACAAAGATTGGATTTATCTTACAACCAGATCAGTCATATTTACAAGGAAATGTTCAAGGGTATGGTGAACTTGGAGAGGCTTATGCTCACTCACAACCATATATCGGTGTTAGCGGCCGGAACTTTTGATTACCTCGTCGGACTTAAACAATTAGACATAGCTGACAATCCCTTGATATGCGACTGCGATCTGCTGTGGGTGGGAGACTGGTCACGGAACACGAGTGTTAAACTTGTCGGGAACCCAAAGTGTGCCTTCCCAGAGAACATGGTGAACAAAACTGTAAGGAAGCTGAAGATATTTTTGGATCTATCGGCCTGTGGTAACGTGCTGCCGTCCCATAATCTGCTGGTGAAGCCGAGTCACGATCAAGTCGTGTTTGAAGGTGACACGCTGGTTCTCTCGTGTAACGCTCCCTTTGCATCCGTTATGGCTAAATATGAGTTGAAATGGTACCATCCCATGTTGGAGATATGCGACGTAAACATAACGAACACGGACATGCAGGAAGAGGGTCTGGCGGAGACCAGTGTGTTCTTCCCGAATATAACGATTCATCATATGGGCAACTGGACATGCGTGTACGGAGACCAGAACCATCTGAGGCATAATCGCACGGTGCAAGTTTTAGTGATATCAAACCAAACTCAGTATTGCAAGACGGAGCACACTATCGACAATAAAGGCCTGTACTCATGGCCGCAATTGTTGCTGAACCATACAGCCAACGTGCCTTGCCGTAACGGTGAAGGTCTGGCCTATAGGAAGTGTTACGCCAATGCCACCTGGGGCCCGGCTAACACAACCGAGTGTTCATACATCAGTAATATAACGAAATTACTACAGCAGTTCGCCTTGTTGAATGTGAGTTTGGTGCAATACTCGGCCTTGAACGCGACGGAACGGCTGGCTATGTTGATACGGGATAAGACGTACCCGCTGGCAGAGATCACGGACCCCGATGATGTAATGTTCATTGCCAAAGCTATAAGGAATTATATGCAGCATATATCAGAAGAACGAGATTTGGGTTCCACACTCCTGGATGTGATCAGTTCGGTGATGAATATATCAAGCGGGGTGATGGTGAAGGCGGAAACCTTATACCAGTCGTGTACGGACATTGTGAAGGCTGCTGAAGAGATATCAGCTTACACCAGCAACGTTCAGGGGCATAAGGTCAACTTGGCAGTGGAAAGGTTTCCCGTCCGCGAAGGTTTCAGCGGCGTGACCTGTGTGTGGTACAGCGTTTCTCGGGGCCTACCGCCGCGCCTCCACTGTTCCACCACCAACAGGACAGTGGCGCCACTGCTGACGGTCAGCGACGCTCTCATACATGCCAGTGTTCAGTGCTCGCAAGTGTACTCGGGAGCCAACATACCTGTTATATCTAAAAATTTTGCACATCTATCGTCGCCTGCCAAGTTGAGTGCCGGTTACTACAGCACAGTAGAATCCAGTTCATACACTTATTACATCGGGACGACGGACTCTGGGGACGTTTTGCAGAGCAAACCTCAAGATCTGATTGTTTCTATGTATGAAGATGCTTCCCTGTTTCCTCTTCTGCCCGCCATGGACGACAGCCCCATGCCTGGACACCGCTCCAAGGATTACTATGGCAGGAGTACTAAGAGGGAGGATATGAA
CATGGAAATTACATCACCCGTGGTCGGATTCCAGCTGAGCGGGGGATCTTTACGTGGATCTTTATTGGAGCCGGTGATAGCTACCGTTCGCGCCAGAACAACTGGCGGAGAAGACGGACGTGCATCCACCTGGGACCAGAGAGACCGGATATGGATTGACAATACATCAGATTGCAAAGTGTCGCACGTGGTGTCCGAGATGATAATAATAAGTTGCACTCGTCTAACGTATGTTGGTCTGTTACAAAACGTCGAGACGGGCATCTACGGGGCCAGAACCGATGGTGCGAGGTTTAAGGTCTCCCACCCAGCGGTGTACGTCGGCAGTCTGATACTCATCGGATGCGTCAGCTGTTCCACGATAACCTACATCATGTGCTTCAAAGCGGTCCAGATGGCGAAAAAGACGAAGCACGCGCTCATTAACACTTGGATAGCCATAGGACTGCTATGCTTTCTGTACACGCTTGGCATATATCAGACGGAAGATGTGAAACTCTGCCAGATTTTAGGTCTTCTGATACACTACCTGTCTTTGTCGTGTTTGTTGTGGATGTGCGTTTCCGCCAGTAATATGTACAAGTGGGTCACCAAGACCCACAACCCTGTCAGGACACCGGAAGACGATTTACCACCGGATGTGCCGATACAGAAGCCTATATTGGGGCTGTATCTTGTAGGTTGGGGGATAGCTTTGATCGTCTGTGGTATTTCCGGTGCTGTCAATTTGAAGGATTACGCCGGATATTCCCAATGTTTCCTGAGCAATGCGCCGGCTTTGAGCGCTCTTTTCATACCGGGCGGTATCTTGCTAATGTTTTTAATGGTACTGTTCCTATTGATACGTTGTACTATACGTAATATGAACGTGCAGTTGTCCGAGGGTACCCAGGCCACTGAAAACGTTGACCTCGAGATGTGGGAACCGAACCATGCGAATGATGGAGAGAGGCGTAGCATTAAATCTGTAGTCGACTCTGAAGTAGATGATGTGGAGCATACACCCATAATACAGCTAAGGGCGCAAGTTATTGTCTTGTGCCTGTATTTGTCCGTGTGGACTTGCGGTGCGATCGCTGTGTATAGACCTCTACCCGCTTACTTACCCTACCAGGAAGACATTTGCAGCATAATCTACGCAGTCTGTGCCACGGTATTGGGAACTTTCATTCTGTTCTTCTACGGCATCGCCAGGAGCGACGTTCGAGCCCAGTGGTCATTAATGCACTGCTATTTTCAATCAAGCAAGCAATGCTGTAGGAACAGAAGCGTTTTCGATACAAATCAACAAAATTTACCGTCCGGTCAAGGCACCACCAACTCGCCCCAGCGTGTCATAACGAACGATAACAGATCGAGATCCGGTAGTAGAAGTTCCAATAGAACTAACACAAAAACTAATAACAGCGGCACCTACAAGGCGGCGGCTGAGCTCAACGGTCAAACGCATCCCAAAGATCTCAAGAACACTAACGGAAAAACTCCCAATATAAATTTGGTAGTGCTGCATCGACAGCAGTACAGGTCAAACAATTCAATGATGACGTACCCAGAAAGGGGTTCATTAGCGCCCGAAGTGTTCTACAATCCTAACCAGATGAATGTCGCTAAGAAGTTCTTTAAGAAACAAAGGCAGAATATGAAAAGAAATTGTCTCGAATTGCCCATCAGGAGGGATTTGGACGACGAGTCCCACGCCTCTCTACCGCTGCCGAACAAGGAAGCGTACAATGTCGCTAACTTCATAAACGGCGGTATCAAGTTTAATAACACCAACCAGCACGTGGAGAGGTCGGCTAACGGCATCAAGGAAACCAGGAACGCTGGCAATCCGAACCTTCTAGAAGACGATTCAAAAGACAATATTGGGAATTATACCAACGATCAAAAATCATGGGCCAAAAAGGACGATCCCAAACAGGGTAGAGCCCAGATAATGAATATCTACACGAACGTCCCGGAGACGAGGGTACCTCAGCATCAGGTTGTGAAAGCGAACG
TCAACAAGAACTTGAACGGTCAGAGGAACAGCGTGTCAGAGGAATGCTTGCAAGCGTGTGAACAACCGGAAATGCGGACCATTTCCCAACAGTGCAGCTTGGAATATAGCTCGGCGTCAGAAATAGCCATGCCTCACACGTGCTCCGACCAGACCTTGAACACTCATTCAGAAATGACATCGTTCCAAGAGAATAGTGCCTTATGTTCGACCAACGAAATAACCGACACCGAAAGCTTCGGGCAATACATAGACTACCTACAACCGAGTAAACCCCTCGACGGCAAGGAAGTCATCGATAAATCCTTGCAGGATATATCAGAGGAGTGCACCATGAACAGCGACGACGTGAACAGATCCGGCACGCCACAGAAAGCCGATAGCGGCTCCGGCGAATTGGACAATTTGCTAGAAAATCCTGAAATCGCCGGATCCAAAGACAAGATGGAGGAAGATAAAGACACATTCTTCATGGCGAAGACGACATACGACTTCGAGACTTGCAGCACGAACGCCAGCGAGCAGGGCTTCGAAAACGAAAGCGACATATACTGTCCGAACTACCAAATGTCTGAAGTGAGCATCCGGAGTCACGGCTTGTACGCGCCATCGCCGTCATCCATGTGCCCCAATGAGATAAGCTTCAGCAACGAGGACGTGTCAACGTTAGAGAGCCAGTACGTGAACTACGACCCCGGCTGGCGGAAAACGATAAAGAACAATCCGAAACTAGGCAAATACGTGCCCACCCCGGCTCTGAGCTGTCCGGAGGTCAACAGAGACAGTCCGATATCGTTCACGAGTGAATTAGACGAACTTTATACTCAAATAACTAGTAGGGATAAAGACAGGACGCGGAACAAGGAACATGGCTTCGACGTCACGGTCAACAGTGACGTGACCTTGAAACCTGACAGTGACAGTTGTGTCAGTGAAGCCGTGTCCGACGTCGACGTTAAGAAGGCGACGGTCTAA

Protein sequence:

>DPOGS200640-PA
MKNFAIIVLFLLIEGAFGFCPSLCSCKSNKSADGPPSEPLPGDFLRLKCGGSPAQITELKEIDLSKLWTLVVSLNLSGNAISTLSRELHLPNLQKLDLSRNQITLIESDAFYNMTSLQRLDLSYNQISHIYKEMFKGMVNLERLMLTHNHISVLAAGTFDYLVGLKQLDIADNPLICDCDLLWVGDWSRNTSVKLVGNPKCAFPENMVNKTVRKLKIFLDLSACGNVLPSHNLLVKPSHDQVVFEGDTLVLSCNAPFASVMAKYELKWYHPMLEICDVNITNTDMQEEGLAETSVFFPNITIHHMGNWTCVYGDQNHLRHNRTVQVLVISNQTQYCKTEHTIDNKGLYSWPQLLLNHTANVPCRNGEGLAYRKCYANATWGPANTTECSYISNITKLLQQFALLNVSLVQYSALNATERLAMLIRDKTYPLAEITDPDDVMFIAKAIRNYMQHISEERDLGSTLLDVISSVMNISSGVMVKAETLYQSCTDIVKAAEEISAYTSNVQGHKVNLAVERFPVREGFSGVTCVWYSVSRGLPPRLHCSTTNRTVAPLLTVSDALIHASVQCSQVYSGANIPVISKNFAHLSSPAKLSAGYYSTVESSSYTYYIGTTDSGDVLQSKPQDLIVSMYEDASLFPLLPAMDDSPMPGHRSKDYYGRSTKREDMNMEITSPVVGFQLSGGSLRGSLLEPVIATVRARTTGGEDGRASTWDQRDRIWIDNTSDCKVSHVVSEMIIISCTRLTYVGLLQNVETGIYGARTDGARFKVSHPAVYVGSLILIGCVSCSTITYIMCFKAVQMAKKTKHALINTWIAIGLLCFLYTLGIYQTEDVKLCQILGLLIHYLSLSCLLWMCVSASNMYKWVTKTHNPVRTPEDDLPPDVPIQKPILGLYLVGWGIALIVCGISGAVNLKDYAGYSQCFLSNAPALSALFIPGGILLMFLMVLFLLIRCTIRNMNVQLSEGTQATENVDLEMWEPNHANDGERRSIKSVVDSEVDDVEHTPIIQLRAQVIVLCLYLSVWTCGAIAVYRPLPAYLPYQEDICSIIYAVCATVLGTFILFFYGIARSDVRAQWSLMHCYFQSSKQCCRNRSVFDTNQQNLPSGQGTTNSPQRVITNDNRSRSGSRSSNRTNTKTNNSGTYKAAAELNGQTHPKDLKNTNGKTPNINLVVLHRQQYRSNNSMMTYPERGSLAPEVFYNPNQMNVAKKFFKKQRQNMKRNCLELPIRRDLDDESHASLPLPNKEAYNVANFINGGIKFNNTNQHVERSANGIKETRNAGNPNLLEDDSKDNIGNYTNDQKSWAKKDDPKQGRAQIMNIYTNVPETRVPQHQVVKANVNKNLNGQRNSVSEECLQACEQPEMRTISQQCSLEYSSASEIAMPHTCSDQTLNTHSEMTSFQENSALCSTNEITDTESFGQYIDYLQPSKPLDGKEVIDKSLQDISEECTMNSDDVNRSGTPQKADSGSGELDNLLENPEIAGSKDKMEEDKDTFFMAKTTYDFETCSTNASEQGFENESDIYCPNYQMSEVSIRSHGLYAPSPSSMCPNEISFSNEDVSTLESQYVNYDPGWRKTIKNNPKLGKYVPTPALSCPEVNRDSPISFTSELDELYTQITSRDKDRTRNKEHGFDVTVNSDVTLKPDSDSCVSEAVSDVDVKKATV-
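
The transcript and protein entries above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that assumes the transcript is a complete coding sequence (start codon through stop codon, no UTRs) and that the standard genetic code applies. The function names `translate` and `expected_protein_length` are hypothetical helpers introduced for illustration. The trailing `-` in the protein sequence is treated here as the stop-codon symbol.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol ("-" by default, to match
    the convention used in the protein sequence of this record).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = (CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    return "".join(stop_symbol if aa == "*" else aa for aa in residues)


def expected_protein_length(transcript_bp):
    """Protein length in aa, assuming the transcript is CDS only and
    ends with a single stop codon that is not counted as a residue."""
    if transcript_bp % 3 != 0:
        raise ValueError("transcript length is not a multiple of 3")
    return transcript_bp // 3 - 1


# First 30 nt and last 12 nt of DPOGS200640-TA, copied from the record.
print(translate("ATGAAGAATTTCGCGATAATAGTTCTGTTT"))  # MKNFAIIVLF
print(translate("GCGACGGTCTAA"))                    # ATV-
print(expected_protein_length(4974))                # 1657
```

Under these assumptions, the 4974 bp transcript corresponds to 1658 codons, one of which is the stop codon, which agrees with the stated protein length of 1657 aa. The translated ends also agree with the start (`MKNFAIIVLF`) and end (`ATV-`) of DPOGS200640-PA.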