Monarch geneset OGS2.0

DPOGS206730
Transcript: DPOGS206730-TA, 3756 bp
Protein: DPOGS206730-PA, 1251 aa
Genomic position: DPSCF300320 + 102889-108217
RNAseq coverage: 72x (Rank: top 66%)
Annotation
Heliconius | HMEL002329 | 0.0 | 55.40%
Bombyx | BGIBMGA002824-TA | 0.0 | 46.00%
Drosophila | CG11318-PA | 1e-44 | 27.08%
EBI UniRef50 | UniRef50_D6WS12 | 1e-52 | 32.99% | Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WS12_TRICA
NCBI RefSeq | XP_001861870.1 | 1e-52 | 31.49% | conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp | gi|270011025 | 4e-52 | 32.99% | hypothetical protein TcasGA2_TC009282 [Tribolium castaneum]
NCBI nr blastx | gi|270011025 | 4e-52 | 33.08% | hypothetical protein TcasGA2_TC009282 [Tribolium castaneum]
Group
Gene Ontology
GO:0007186 | 6.9e-34 | G-protein coupled receptor protein signaling pathway
GO:0016021 | 6.9e-34 | integral to membrane
GO:0004930 | 6.9e-34 | G-protein coupled receptor activity
GO:0016020 | 2.3e-10 | membrane
GO:0007218 | 2.3e-10 | neuropeptide signaling pathway
KEGG pathway 
InterPro domain
[943-1182] | IPR000832 | 6.9e-34 | GPCR, family 2, secretin-like
[885-927] | IPR000203 | 2.3e-10 | GPS domain
Orthology group: MCL26297, Lepidoptera specific

Nucleotide sequence:

>DPOGS206730-TA
ATGACTTCTTTATTTACGAAGTTTTTAAAACTCACTTTCCTTTTTTCGCTATGGATTTGTGCAAATTTCGTTTCTGGAAAAGAGGTTTATGGTCCGAATTGTCCGATTGGTTTTACGCTGCTGTTCAGGAAGGAAGGACCTCTTTGCTACAGACGGAAAGGTCCAGAATTATTTAACGAAAAATACAGAGAATGCGCTGGAAACTTGTACTCATCAGAGTTAATACATGAGACAAATTTTACGAAACCAGATCATGCAGTCTGGACGCAGTACAAGTCCATGTACCCGAGTGGTTTCTTATACGACACCAGCTTCTCCAAAAACTACGGAAAGGTGATTAGCTTTGATTTGCACGGTATATCCACAAGCATAGAGTTGTTAGATAAAGATGAAGAACTTTGTCTTATACTAGACCCTGTAAGTAACTACACATTGGTGAATTGTAATAAAAGATATTATAGCTACTGTGTCATAAGACCGTACAAATCTGAAGAGTCCCTTGTAGGCTGTCAGGAATTGAAAGATTCAGTGCATTTCTGGAGTCCTGATTCAACTTGTTTGACCTCTTTGATCGGAGTTGGAGGCGGTGCTATAAGAGCTACTTGGAACCAAGCTAAAGAACTTTGTTGGAAAAATGGCGCTTCGCTTTTACATCGGGGTTGGAGATATTCCAACCATCCGGTATTTCGCGGATCTTTTATGAATCATACATATCCGTTAGGAATCATTATGAGCAATGACCTTTCGTTGTTGAGATATGACGCTATACAGGACCATTCGGAGATACCCCAAAGTGATTGGAATTTCGACGATACTATTGAAAACTCTGATACTTTACTGGGTGGACTGCAAAATGATTTCTGGTCACTTGTCAACGGATCGTATATCTTTTATGAAGTTATCTGTGAGTTGCATTTAGGTGTTAGGAATGTCAGTCTACACTTGGCTATAGATGAGGATAATAGAATGACTCTCACTATAAACGCTTCATTGCATGATGATGATATTTCGTGCTACACGGATTCTGTTAAACCAATTATAACGAAAGTTAACAAACGGAGGATTGATGGTAAAAATATGTTCATAATCTCACCCAAACAGGACGGCTATTACTGGTGTATCCACACCAACACGAGGAACTTTAAGCCAGCAGCGTCCAATAAAGTACTATTTTTACGTGCCAAAGAGGCACTCATTAACACATACTCTATAAAATTTCAATTAGATCATTACGTACGATTAACTGGAAAGGACTGGGAGGTTTATTTGCACGAAGTCTGTGAGAACAAAATTAAAGAATACATATTCTATCGAACAAAGTACGAACAGATATTTGGGGAGTTGAATACAAATTTGACGGAGGACACTTTGAAAGTTTTTAAACATTCAAATCCTAGTTTGAAAGGAAAAGATAAAGCGATCTTGAACATAAAATTGAAGAGGCTGTATCCGGATGTTCGAAAAGTTTTAGTGCATGTAGAATTAAATCCAGATATGAAACCTGTACCTCCAGGGTATTGGGAAGGCTTGAAGATCTTCTATATGAAGTCTGTTTATTATTGCAGGGGGTTTGATACTGTTGCAGATGGGACTTTAGGAGAAACAAAATTGTCTGGATGCCATAATCATACTTGTATAGGCAATTTCAACGAAGGTGTTCAATGGGTGACCACGGCGAGGACTGACTGTAGGAATGCTGGTCATAGGGCCCTCAGCATTGATGACGTCACCATGGTTATGCCATCAGTAGTAATAACTTCAACCCAGTCATCAAGCTCTGGCAAGGAAGAAACAAGCGAGAGTTCAGAGGAGAGTGATAACTATTTCACTGGGAAGCAAACATCTCGCTTTATCCCAGACCGAAATACTCCTGAAGTTACCACAGATGGTACCGTTACCGCAGAGACTACGCTCACAGTTGACAATGTCAATAATTTCACCACTGAAAGTGTTCTAACAACCGATCTTACCTTCCCTGATACCACCGTAACTACCACCAC
TTACCCTCCTACTACCACCAGCGTTACCACAACTGAGGAAGCTGTTACTGTGCCGCCGGAGGTACTTCTGGATCGAGTGATAGAAGATCTGGATAATCTTGTCAACAACACAGAACCTGTTATGGTAAAGGACATTGACAACGTATTTAATCAGATAGACAATATTCTCGAATCACGGGGAAGTCTCGAAATACCAAGCCAATTCCTACATTTACTAGATACTTTAGGTACGACGGTGAATTTAAATGGATCTCTAACAGCTACTGCCGTCCGAAACAACATAGCTTTGGTACTAGCGGACGCTGAACCCAGTCATCCCGTTAGAGGCATGAAGATAGCTGCCAGAGACAGCGACATGTTCACTAATGACGCGTTCCAAATTTTTAATGGTGATCTCAATTCAACAGATCTTGAAACTGATCGCAACGAAGTAGTTGTTCACCTGCCTTCATCCTTGTCGGAGACGTCACGTCGAGTGTCTTTCGTGGTGTTTCGGAACGACCGTGCCTTTACATCGAACTCTAACGTGTACTCCGTTAATAGTAGAGTTCTAAGCGTGAAAATAGAAAATATAACGACGTTCGATAACGACGAGGTCATAGATATACACGTTAGTCCCATCACAATTGATGTAAACAGAAATGAGAGTCGTGCGTGTGCGTATTGGCAGTTCACGGGAAATGGGACCGGATATTGGTCACAGGATGGTTGTAAATTAATTCCTGCCACACAACCGGGAATGTTAAGTACATGTCGATGTACACACCTGACACATTTCGCAGAAGTTTTGTTCCCTCGAACAGTTTTTACTCTAAGAGATGAAGATATGTTAGAGCTTTTAACAATCATAGGATGTTGCTTGTCCATTTTCGGTTTAGCTTTTGTGGGGGTGACGGCAGCTATGTTCAGATCATGGCGTCGAGATTTCAGCAACAAAATATGGTTACAGTTATGCATTGCTATATTTATATCTGTTCTCAGTTTTTTGGTCGTAATTTTTGCAAAATTCCAAGAATACAACGTACCGTGTATGTTAATGGGCGTTGTTCTTCATTATTCTGTTTTATCTTCATTTTGCTGGATGTTAGTAGCAGCTATTCTCTCGTATCGGCGATTAGTCGTTGTTTTTACAAGAGATGCTTCTCACAAATTACTTAGAGCATCTGCTTTTGCCTGGGGTACACCCTGTGCAATAATTGGTATACTTCTATCAGTATCCCCGCAATCATATTCCAATCGTTTCGAAGAAATATCACCAAGTGGTTCATTCTGCTATCCTTCAGGTCTAGCTCTATGGATCTCAGTCTATGCTCCTATTGCTATTATACTTGTGATAAACTGGATACTATTCGCTTTAATAGTAAGATCCGTGTTTGCATCACGAGGTATCCAACGACACGGAGATTCAAACGAAGCGTTACGTTGCGCATCAGTGAGTTGTCTGTTGGTATTTCTGTTTGGTTTGCCTTGGATTTTCGGTCTCTTTGCATTTAACGTCGTATTTGCCTATCTGTTTACGCTAACGGCAACATATCAGGGCTTGATTTTGTTTCTATTTTTCGTTGTCGGTAACAAAAAGACGAGAGATTTGTGGTTAAATAAACTGAAGATCAAACAAACTCGTAAAGTGCCGGTTACATCATCGACTTACTCTAACAGAAGTTCGGGTTGGAGGGGAGGAACTCATCCCATGACTATCGAATCAAAAACTTCCAAACCTAAATCTCTCGAGCAAGATGACTCCAGATTTTCTTGA

Protein sequence:

>DPOGS206730-PA
MTSLFTKFLKLTFLFSLWICANFVSGKEVYGPNCPIGFTLLFRKEGPLCYRRKGPELFNEKYRECAGNLYSSELIHETNFTKPDHAVWTQYKSMYPSGFLYDTSFSKNYGKVISFDLHGISTSIELLDKDEELCLILDPVSNYTLVNCNKRYYSYCVIRPYKSEESLVGCQELKDSVHFWSPDSTCLTSLIGVGGGAIRATWNQAKELCWKNGASLLHRGWRYSNHPVFRGSFMNHTYPLGIIMSNDLSLLRYDAIQDHSEIPQSDWNFDDTIENSDTLLGGLQNDFWSLVNGSYIFYEVICELHLGVRNVSLHLAIDEDNRMTLTINASLHDDDISCYTDSVKPIITKVNKRRIDGKNMFIISPKQDGYYWCIHTNTRNFKPAASNKVLFLRAKEALINTYSIKFQLDHYVRLTGKDWEVYLHEVCENKIKEYIFYRTKYEQIFGELNTNLTEDTLKVFKHSNPSLKGKDKAILNIKLKRLYPDVRKVLVHVELNPDMKPVPPGYWEGLKIFYMKSVYYCRGFDTVADGTLGETKLSGCHNHTCIGNFNEGVQWVTTARTDCRNAGHRALSIDDVTMVMPSVVITSTQSSSSGKEETSESSEESDNYFTGKQTSRFIPDRNTPEVTTDGTVTAETTLTVDNVNNFTTESVLTTDLTFPDTTVTTTTYPPTTTSVTTTEEAVTVPPEVLLDRVIEDLDNLVNNTEPVMVKDIDNVFNQIDNILESRGSLEIPSQFLHLLDTLGTTVNLNGSLTATAVRNNIALVLADAEPSHPVRGMKIAARDSDMFTNDAFQIFNGDLNSTDLETDRNEVVVHLPSSLSETSRRVSFVVFRNDRAFTSNSNVYSVNSRVLSVKIENITTFDNDEVIDIHVSPITIDVNRNESRACAYWQFTGNGTGYWSQDGCKLIPATQPGMLSTCRCTHLTHFAEVLFPRTVFTLRDEDMLELLTIIGCCLSIFGLAFVGVTAAMFRSWRRDFSNKIWLQLCIAIFISVLSFLVVIFAKFQEYNVPCMLMGVVLHYSVLSSFCWMLVAAILSYRRLVVVFTRDASHKLLRASAFAWGTPCAIIGILLSVSPQSYSNRFEEISPSGSFCYPSGLALWISVYAPIAIILVINWILFALIVRSVFASRGIQRHGDSNEALRCASVSCLLVFLFGLPWIFGLFAFNVVFAYLFTLTATYQGLILFLFFVVGNKKTRDLWLNKLKIKQTRKVPVTSSTYSNRSSGWRGGTHPMTIESKTSKPKSLEQDDSRFS-
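
The lengths and sequences in this record can be cross-checked with a few lines of arithmetic. The following is a minimal sketch, not part of the original record. It assumes the transcript is a pure coding sequence (start codon through stop codon, no UTRs) translated with the standard genetic code. The constant and function names are hypothetical, and only the first 30 nucleotides and first 10 residues are copied in to keep the example short.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 3756
PROTEIN_AA = 1251
GENOMIC_START, GENOMIC_END = 102889, 108217

# First 30 nt of DPOGS206730-TA and first 10 aa of DPOGS206730-PA.
CDS_PREFIX = "ATGACTTCTTTATTTACGAAGTTTTTAAAA"
PROTEIN_PREFIX = "MTSLFTKFLK"

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon ('*' marks a stop)."""
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Assumption: CDS length = 3 nt per residue plus one 3-nt stop codon.
expected_cds_bp = 3 * (PROTEIN_AA + 1)

# Inclusive genomic span of the gene model on the scaffold.
genomic_span_bp = GENOMIC_END - GENOMIC_START + 1

print(expected_cds_bp == TRANSCRIPT_BP)
print(translate(CDS_PREFIX) == PROTEIN_PREFIX)
print(genomic_span_bp - TRANSCRIPT_BP)
```

Under these assumptions the transcript length equals three times the protein length plus one stop codon, and the genomic span is longer than the transcript, which would be consistent with introns in the gene model.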