Monarch geneset OGS2.0

DPOGS211139
TranscriptDPOGS211139-TA2064 bp
ProteinDPOGS211139-PA687 aa
Genomic positionDPSCF300007 - 207102-231095
RNAseq coverage101x (Rank: top 61%)
Annotation
HeliconiusHMEL0172120.078.04% 
BombyxBGIBMGA003012-TA0.078.74% 
DrosophilaCG34400-PC2e-13250.51% 
EBI UniRef50UniRef50_Q9VU983e-13050.51%CG34400, isoform C n=52 Tax=Coelomata RepID=Q9VU98_DROME
NCBI RefSeqNP_648668.35e-13150.51%CG34400, isoform C [Drosophila melanogaster]
NCBI nr blastpgi|2213311421e-12950.51%CG34400, isoform C [Drosophila melanogaster]
NCBI nr blastxgi|2213311422e-13851.01%CG34400, isoform C [Drosophila melanogaster]
Group
Gene OntologyGO:00055152.3e-27protein binding
KEGG pathwaybfo:BRAFLDRAFT_1311513e-12 
 K08018 (RAPGEF2, PDZGEF1)maps-> MAPK signaling pathway
InterPro domain[372-484] IPR0014782.3e-27PDZ/DHR/GLGF
Orthology groupMCL15343 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211139-TA
ATGAGGTCGGGGGAGGGCGAAGGCTCGGAGCTGGCGGAGTTCGGTGCTCGCTGGCCGCCGCCCTACACTCAGGGCTACGGCTGGCGCTCCGCCACGCCGCAGCATCCGCCTGCTCACCACCTCGCGAGACATTCGCGGTCACAGCAGGACAACCGGCACACTAGCTACGAGGCTGAGGACGCTCGCGCCAGAGCGCGCGGCGGCGTCTACTATTCCCCGCCGGGCACATCCTATACCATTGTCGAACGACCGGCCTCCTCACATGGCCACCACACTCCATACTCTCAACACCACTACTCGGGGAAACTATCGAGAACACCATATCTCGGTTCTACTACTATAGCAAACACTGCAGGGGCGACGAGTTTAAGAAACAGTACGAACCATTTAAGCGCCAACGGTAAGAAACGGCCAATATCTCCGGAGCAGGTGCTTCGTTTATTCGGCCAGGGCGGAGCGGCTGCGTTAGGCAAGGCTCTACCGCCGTCACATCCGCCTGCACCGCTACCCGCACCGCTGCCTGCGCCCTTGCCGGCGAGGAGGCACTCGCCCGCCAGCAGTCCCAGCAGTACAACACACCATGCAATATATCGTGGCGGAGAGAGGGAGCGTCCGTACTCGTCGGGCGGGGCGCCGCCGCCGCCGGAGCCGGCCACGAGGACGGTCACCATGAGCAGGGACCCCGCAGACTCGCACGGCTTCGGCATCTGCGTTAAGGGGGGAAAGGAAGCTGGTGTGGGTGTGTATATATCGAGAGTGGAGGAAGGCTCTGTAGCTGAGAGAGCAGGCTTACGACCGGGAGATTCTATTCTCCAAGTCAATGGGACGCCCTTCTCTGGAATATCACATGAGGATGCACTTAAGATGCTCAAGTCGTGTCGCCAGCTGACTATGGTGGTTAGAACGGCTGGTGCGTTAGTTGGTAGGGCTTCCTGTTCTTGGATGGATCGATACGGCAGGCCAGCCTCGCCACCACCACAACGACCCCTGAGATCTGCAAGCAAGGATCGCTCTATACGGAGGATGCTCAAGTCGTGTCGCCAGCTGACTATGGTGGTTAGAACGGCTGGTGCGTTAGTTGGTAGGGCTTCCTGTTCTTGGATGGATCGATACGGCAGGCCAGCCTCGCCACCACCACAACGACCCCTGAGATCTGCAAGCAAGGATCGCTCTATACGGAGGGTGGATTTGTGTATCGAGCCGGGGCAGTCACTCGGACTGATGATTCGCGGAGGTTTGGAGTACAACCTTGGAATATACATCACAGGGGTTGACAAGGATTCTGTAGCTGACCGGGCGGGACTTATGGTCGGCGACCAGATCTTGGAAGTGAATGGACAGTCATTCGTAGATGTGACTCACGACGAAGCTGTCGCCCAGCTGAAGTACCATAAACGAATGTCTTTGCTAGTGAGAGATGTTGGAAAAGTTCCTCACGCTTGTACTGCTTATGGAGAACGAGATGCTGCCCCTAGAATAAGCGGCTGGGGTAAAAGAAGAGGTGCGGCTGCAACAGCTGTCGAACAGAAGGCAAAGTCGTTGTTACCTCAAAGCGACTTGCCCGCGCTGGCATATTACATGGAGGAATACGCCAACAGAAGACTCACAGCTGACGCATTCCTCACAGTTTTAAGAGATTTACTCGACACACCCCAAAAATATTCCCTTCTGACGGAAATCCGGGAGTTTTTACTTCCTGAGGACCGGCCTCGTTTCGACGAGCTCGTGTATAGACGCCCCGAGGACGGTACAGAACACCATGTGAAGCGAAGTGGCGAACGACACATGCTGCCGTCGTCAACCATGCACGACCTCCACGACCCGGAGGCGCCGGCTGAAGTGCCCCTCGTTGTGGATCACCGCTCGCCCTCCGAGGACTCCGGCTTGGGCCTCCCGCCTCATGACCAGGCTTACAGGAGCGGGCGCGCGTGGTGCCCTGGGGACCCTGCGCCCCCACCGCCCAAGCCGCCGGACGAAGACCTGGAGCCTCCGCCCGAGGTGAATTTGCCAGAGTTACCTTCCACACTTAATCAAGTTCTACACGTTTACCTCTCTATGCCTTAA

Protein sequence:

>DPOGS211139-PA
MRSGEGEGSELAEFGARWPPPYTQGYGWRSATPQHPPAHHLARHSRSQQDNRHTSYEAEDARARARGGVYYSPPGTSYTIVERPASSHGHHTPYSQHHYSGKLSRTPYLGSTTIANTAGATSLRNSTNHLSANGKKRPISPEQVLRLFGQGGAAALGKALPPSHPPAPLPAPLPAPLPARRHSPASSPSSTTHHAIYRGGERERPYSSGGAPPPPEPATRTVTMSRDPADSHGFGICVKGGKEAGVGVYISRVEEGSVAERAGLRPGDSILQVNGTPFSGISHEDALKMLKSCRQLTMVVRTAGALVGRASCSWMDRYGRPASPPPQRPLRSASKDRSIRRMLKSCRQLTMVVRTAGALVGRASCSWMDRYGRPASPPPQRPLRSASKDRSIRRVDLCIEPGQSLGLMIRGGLEYNLGIYITGVDKDSVADRAGLMVGDQILEVNGQSFVDVTHDEAVAQLKYHKRMSLLVRDVGKVPHACTAYGERDAAPRISGWGKRRGAAATAVEQKAKSLLPQSDLPALAYYMEEYANRRLTADAFLTVLRDLLDTPQKYSLLTEIREFLLPEDRPRFDELVYRRPEDGTEHHVKRSGERHMLPSSTMHDLHDPEAPAEVPLVVDHRSPSEDSGLGLPPHDQAYRSGRAWCPGDPAPPPPKPPDEDLEPPPEVNLPELPSTLNQVLHVYLSMP-