Monarch geneset OGS2.0

DPOGS212031
TranscriptDPOGS212031-TA3159 bp
ProteinDPOGS212031-PA1052 aa
Genomic positionDPSCF300054 - 267713-289769
RNAseq coverage404x (Rank: top 30%)
Annotation
HeliconiusHMEL0037080.072.73% 
BombyxBGIBMGA005351-TA1e-2826.56% 
DrosophilaCG43102-PC0.044.25% 
EBI UniRef50UniRef50_Q7QFT40.046.73%AGAP003854-PA n=2 Tax=Culicidae RepID=Q7QFT4_ANOGA
NCBI RefSeqXP_001998429.10.045.16%GI23633 [Drosophila mojavensis]
NCBI nr blastpgi|3838617710.047.59%PREDICTED: uncharacterized protein LOC100877669 [Megachile rotundata]
NCBI nr blastxgi|3287921360.047.60%PREDICTED: hypothetical protein LOC413562 [Apis mellifera]
Group
Gene OntologyGO:00056225.9e-49intracellular
GO:00350235.9e-49regulation of Rho protein signal transduction
GO:00050895.9e-49Rho guanyl-nucleotide exchange factor activity
GO:00055153.1e-12protein binding
KEGG pathwayxtr:1001701862e-30 
 K04436 (MAPK8IP3, JIP3)maps-> MAPK signaling pathway
InterPro domain[47-273] IPR0002195.9e-49Dbl homology (DH) domain
[432-466] IPR0119933.1e-12Pleckstrin homology-type
[873-898] IPR0159433.8e-06WD40/YVTN repeat-like-containing domain
Orthology groupMCL11568 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212031-TA
ATGAGTCGTGAGTCTCTGAAGTTGAAGTTCAGCGATCAACGGGCGGCCCTTAGTCTGTCCTTCCACGGTTCTGAGTCTGATGATGAAATTACTGAGAGGAATATCAGGAGATCTAGAAGCGGCACCGGCACAATGAACGCTGCGGCCCTACCCTCGATATCCGCCGGCCTACCGTCGGCTTTACTTGCTGCTCAAGACACACGGACACACGTGGTCGTCGAACTTTACGAGACGGAAAAATCTTATGTGGAGGCGCTCGAAAACTTAGTCAAGAAATACCTCCAACCGCTTAAAAGTCCAGAGAACGCTGGACTCTTAGATGCCTATTTGGTGGATGAAATATTCTACCAAGTACCGGCTATCCTCAACGTGCACCAAGTATTCCTCGAACAACTGAGGCTAAGACTTGAGCAATGGGACCTCCAGCAAAAAGTTGGGGACGTGTTCCTCGAAGTGTTTACCTTTGGGTGGCATTACCTGTGGATCATGTCCTTTATCAACAACCTGAAAAAGGCCAAGGAAACGATAAAATCCGCAGCGGCATCGCGACCTGCTTTTGCAAAGTTTTTGGAAGCACGTGCAAGAGACCACAAGGGAAAGTTGTCTTTAGACAATCTCTTAATAAAGCCAGTACAGAAGTTTCCAAGCTATAAGCTTTTGATTCAAAGATTGATTAAACACACAGAGCAGTCGCATCCTGACCACAAACTATTGTTGGAAGCTCAGAGGGAGATTCACGATCTCTTGGAACTTATAAATTGTACTGAAAGAGAAAGTCTCGAACAAGAACAACAGCAACAGACTCTGAGGGAATTAGAACAATTGATAGAAGGTCTCTCTAATTTAGTGTCGGCTGATAGAACATTCATTAGGCACGAGATGGTTACAATGCCGTCAGCGCAAGGAGCTGTTAAAGACAGAGCTTTATTCCTCTTTAACGACACTCTTTTGATAACAAGTGTCAAGAAAAGGACTGGTACCATAAAGAAACCGATTCCGACATACCAATGCAGCATCGCGAGTCAAATGGAAGGGAACAAATATAAGCTTCTGATGAGGATATCTCTCGGAGACTTGGAAATCGTGAAAGGAAAAGACGAAAACATGAGACGCTTAATCCATGAAGTGGAAACACTTACAGAAGATGTCAATACATTGACTGTTATATCAGAACAAGTGGCCGCTCTACACACCCAGCATCTTCCTTTAGAAGAGCTGGTCAGAGAAATGTTACAATCGGCCAATAGACAGCTATCTGAGAGACAGAATTTTGATAATCAACTATGCTGTATGGAACTAACTTTGAATACCTCAAACGTCCCAGAGAATTTGACGGTGATATTTGCTAATTCGGAAAAGAGGAGCAATTGGGAGGAACTCATAAACGAGACGAAGCAGAAGCTATATATGTACGGCCCAGAGCGTCCGGCGCCCGAGTTTCTCTCCCCTGTCCCTATAAGGAAGACGAGAGCCGGCCTACAGTTCACATGCGCTGCCCCTACACTACCCCCGAAAGGACAACCGCCTGATGTTTGGGTTTGCAACAGCGACGGGTATGTCGGTCAAGTGTGTGTTCTGACTTTGAATCCTAAGCCGCAAGTGACATCTTGCAATGGCGTCTGTAATGCTAGGATCGTATGCGTTGCTTGCGTACCACCCGCACCGGCCCTCGTTCGCCAGCAGACATTAGACATACCGAGTACCAGCTCGTTGAACAGTTCCGGTAATAAGCCTGGTATAAGTATTTCTGATGCTGATGAAAGCTGCAAGAATATACGTCTTGACAGCTCATCATCTAGTGAAGACGAGGACGATGGCTCGTCCACCAGCGAGAATCAAGACGCCCAGTCCGAAAGAAGCCAGGACTCTGTTCGTCTGCACAGCCTCAGTGCGATCGGGGCCAGGGCGACGCTGACGCCTAACCATAGCAAGAGTCTGAGCACGCCTCACACCGGACAGAACATACCGATACACCCAGTCATGAAGTCCAGCTCGAATCCCGCTGTCGACAAACAGGCTATGGGAATCACATCCGGCACATTATCGTCACCAGCCAGTCGCCAGTCCTCGGAGGACAACGCTACGAATCAGCCAACAATGTGGCTCGGGACCGAAGACGGCTTCATTCACGTCTACAACTGCATGGACAACATACGCATCAAGAAGAACAAGATTAAGCTACAGCACAGCGCCTCTGTGATATCCATCAAATACGTCGAGGGTCAAGTGTTCGTGTCTCTTGGTAACGGTGAATTGGTCGTGTATAACAGAGATATTGATGGTACATGGTCAGAGCGCGCTACGCTGGTGGTAGGTGGTAGCTCTAACCCAATATCCGCCATGTTGGTGGTCGCTACGCGTCTCTGGTGCTCAACACAGTCCTACGTTAAAGTCATCAATCCGCATACGCTGCAGGAGGACGGTTCATTTCAAGTGCCCACTCACAGTCGTCAGATCAGTCACATGGCCATCTCCGGTAACACCATATGGCTGGTGCTCAACCCGACGCATAAACAGTCGCAGGAGGACGATACATTCCACGTGACCACAGACACTCGTCCCATCAGTCATATGGCTGTTGCCGGCACCTCTTTGTGGATGGCCCTCAACACGACTCCCCAGCTCCGATGCTATCAGACAAATTCCAAGGAACTGCTAGCTGAACTGAGCATCACAGCGCCCGTCACTAAAATGCTGCATGGATGCGACGACATTATCCGCCAACATAAGGCGGCTTGTCTGCGAGTGACGGCTTTATTGGCACATCGAGATACCTTGTGGGTTGGAACGTCTGCTGGCGTGTTGCTCACAGCGCCGCTACACAACTCGCCCAACACACGAACCGGACAGTTCACTGTGCCACAACTCACCGGGGTGACTTACGGTCACACTGGACATGTTAGATTTTTGACAATCGTCGAAAATCCAGTTCCGCATAAGCCAACAACGAAACCAAGCACGAGCTTGAAGACGAAGGCTCTGAGTCGACGATCGACGAACGCTGAGAAACTACAGAAGCAGACAGAAAGCAGCCCGAACAATAAGGAGACTTTGGTGATATCTGGAGGGGATGGCTACGAGGACTTCAGGACGTCCTCCATGTCCGAGGACGCGGGGCGAGAGGACTCCACAAATCACCTGCTATTTTGGAGGGTATGA

Protein sequence:

>DPOGS212031-PA
MSRESLKLKFSDQRAALSLSFHGSESDDEITERNIRRSRSGTGTMNAAALPSISAGLPSALLAAQDTRTHVVVELYETEKSYVEALENLVKKYLQPLKSPENAGLLDAYLVDEIFYQVPAILNVHQVFLEQLRLRLEQWDLQQKVGDVFLEVFTFGWHYLWIMSFINNLKKAKETIKSAAASRPAFAKFLEARARDHKGKLSLDNLLIKPVQKFPSYKLLIQRLIKHTEQSHPDHKLLLEAQREIHDLLELINCTERESLEQEQQQQTLRELEQLIEGLSNLVSADRTFIRHEMVTMPSAQGAVKDRALFLFNDTLLITSVKKRTGTIKKPIPTYQCSIASQMEGNKYKLLMRISLGDLEIVKGKDENMRRLIHEVETLTEDVNTLTVISEQVAALHTQHLPLEELVREMLQSANRQLSERQNFDNQLCCMELTLNTSNVPENLTVIFANSEKRSNWEELINETKQKLYMYGPERPAPEFLSPVPIRKTRAGLQFTCAAPTLPPKGQPPDVWVCNSDGYVGQVCVLTLNPKPQVTSCNGVCNARIVCVACVPPAPALVRQQTLDIPSTSSLNSSGNKPGISISDADESCKNIRLDSSSSSEDEDDGSSTSENQDAQSERSQDSVRLHSLSAIGARATLTPNHSKSLSTPHTGQNIPIHPVMKSSSNPAVDKQAMGITSGTLSSPASRQSSEDNATNQPTMWLGTEDGFIHVYNCMDNIRIKKNKIKLQHSASVISIKYVEGQVFVSLGNGELVVYNRDIDGTWSERATLVVGGSSNPISAMLVVATRLWCSTQSYVKVINPHTLQEDGSFQVPTHSRQISHMAISGNTIWLVLNPTHKQSQEDDTFHVTTDTRPISHMAVAGTSLWMALNTTPQLRCYQTNSKELLAELSITAPVTKMLHGCDDIIRQHKAACLRVTALLAHRDTLWVGTSAGVLLTAPLHNSPNTRTGQFTVPQLTGVTYGHTGHVRFLTIVENPVPHKPTTKPSTSLKTKALSRRSTNAEKLQKQTESSPNNKETLVISGGDGYEDFRTSSMSEDAGREDSTNHLLFWRV-