Monarch geneset OGS2.0

DPOGS211816
Transcript: DPOGS211816-TA (3225 bp)
Protein: DPOGS211816-PA (1074 aa)
Genomic position: DPSCF300031 - 236040-258275
RNAseq coverage: 393x (Rank: top 31%)
Annotation
Heliconius      HMEL022587        3e-110  54.02%
Bombyx          BGIBMGA008155-TA  9e-76   86.21%
Drosophila      RhoGEF2-PF        5e-50   33.00%
EBI UniRef50    UniRef50_G6DD66   2e-94   100.00%  Putative uncharacterized protein n=2 Tax=Obtectomera RepID=G6DD66_DANPL
NCBI RefSeq     XP_001945506.1    5e-55   32.21%   PREDICTED: similar to RhoGEF2 CG9635-PE [Acyrthosiphon pisum]
NCBI nr blastp  gi|308512779      2e-78   82.76%   guanine nucleotide exchange factor RhoGEF2 [Biston betularia]
NCBI nr blastx  gi|308512779      1e-74   82.76%   guanine nucleotide exchange factor RhoGEF2 [Biston betularia]
Group
Gene Ontology   GO:0005737  2.9e-24  cytoplasm
                GO:0005089  2.9e-24  Rho guanyl-nucleotide exchange factor activity
                GO:0005515  3.5e-19  protein binding
                GO:0035556  3e-07    intracellular signal transduction
KEGG pathway    mdo:100030904  3e-16
                K07532 (ARHGEF12, LARG) maps to:
                    Axon guidance
                    Regulation of actin cytoskeleton
                    Vascular smooth muscle contraction
InterPro domain  [592-748]  IPR015212  2.9e-24  Regulator of G protein signalling-like fold
                 [592-746]  IPR016137  8.6e-22  Regulator of G protein signalling superfamily
                 [14-60]    IPR015721  3.8e-20  Rho GTP exchange factor
                 [10-96]    IPR001478  3.5e-19  PDZ/DHR/GLGF
                 [880-929]  IPR002219  3e-07    Protein kinase C-like, phorbol ester/diacylglycerol binding
Orthology group  MCL10498  Multiple-copy universal gene
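
The InterPro rows above can be held in a small structure to sanity-check the coordinates against the protein length. The following is only a sketch: it assumes the bracketed ranges are 1-based, inclusive amino-acid positions on DPOGS211816-PA, and the names `Domain`, `span_length`, and `overlaps` are hypothetical helpers, not part of any database API.

```python
from typing import List, NamedTuple


class Domain(NamedTuple):
    start: int       # first residue of the hit (assumed 1-based, inclusive)
    end: int         # last residue of the hit (assumed inclusive)
    accession: str   # InterPro accession as listed in the record
    evalue: float
    name: str


PROTEIN_AA = 1074  # protein length reported in the record

# Values copied from the InterPro rows of the record.
DOMAINS: List[Domain] = [
    Domain(592, 748, "IPR015212", 2.9e-24, "Regulator of G protein signalling-like fold"),
    Domain(592, 746, "IPR016137", 8.6e-22, "Regulator of G protein signalling superfamily"),
    Domain(14, 60, "IPR015721", 3.8e-20, "Rho GTP exchange factor"),
    Domain(10, 96, "IPR001478", 3.5e-19, "PDZ/DHR/GLGF"),
    Domain(880, 929, "IPR002219", 3e-07,
           "Protein kinase C-like, phorbol ester/diacylglycerol binding"),
]


def span_length(d: Domain) -> int:
    """Number of residues covered, under the inclusive-coordinate assumption."""
    return d.end - d.start + 1


def overlaps(a: Domain, b: Domain) -> bool:
    """True if the two inclusive ranges share at least one residue."""
    return a.start <= b.end and b.start <= a.end


# List the hits from N-terminus to C-terminus with their spans.
for d in sorted(DOMAINS, key=lambda d: d.start):
    print(f"{d.accession}  {d.start:>4}-{d.end:<4}  {span_length(d):>3} aa  {d.name}")
```

Under these assumptions every hit falls inside the 1074-residue protein, and the two N-terminal hits (IPR015721 and IPR001478) share residues, as do the two RGS-related hits.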
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211816-TA
ATGCTGGGAGGGGCGGGGCCTGCGGGGGGCGACGACCTGCTCACCGTCACCGTCGTCAGGGACGAGCACGGATATGGAATGAAGGTCTCAGGTGACAATCCAGTGTACGTGCAGTCGGTGAAGGAGCACGGGGCCGCCTGGCGAGCTGGTCTCCGAGCCGGAGACCGCATCCTCAGGGTGGACTGCGTGCCCGTTCACAAGTACGCTCACCAGCAGGTCGTCCACATGATCAGAGCGAACCCCAGCACGGTGTTGACCGTGCAGCAGAACACGTCCCGTCTGAAGACACCCATCACAGCTCCCGTGCCGGTTAACGTGAGCACTTCCGAGAAGCAGCGGCAGTTGGAGGTGTCGAAGGCGCAGACTGTGAGGCTGATGTTGGAACAGGAACAGAAATACATCCGGGACTTGAGAAATGACCTGCTCAAGGTACCGGATGTGAGGAAACAGGCCCAGCTGGAGTCCGCGGAGCAGAGGTGCTCCATACTACAACACGAGATAGACACCATCATGGCCACACAGGTAAGACGCGAGCAGCGCCGGAGGATGAGCCGGATGCCGCGCCGCTTCCCACTACTCCGAAGCTATCCCGGAAGTACCCCCACCCCCACCCCCACCACCACCGCCAACAAGCCGGCCAAGTACAAGAAATATCCCGCCCAGCCCTATTCCGCGGACAAGTTCCGGAGATCCTCCCACTCGCTGGACGGGGAGCTGGACGGCCCCGAACCCAACTCGATAGCCCTACAACTATACTACCCGCTGGTGAACGCCCCGCCGCCGCCGCTGCTGGAGTCGTGTCCAGGTCTGTATGTAATGTCGATCCTCTCTCATCGCACAGAACGACAGCCGCGGGTGTGTACCGGTGCTGCACGTGAGGAGCCAGTCGTCTCCGGAGCACCTGGACCACGAGCCGCGGAGTTTGTATACATTCCGTTACATTCATTCTCCATAATAAGTATTACGCTATTATTTTCTGGCACCTGTTCCTCAGCGAGCGCCCTGTCTCCGCCCGATAGTGAGTGTTGGTCGGGCGGCTCTCCCCCGCTGGTGACTCCGCCCCCCGGGACCCCGCCGCCGCCCTATCATCACCCCACACAGGACAAATATCACGCGATCATTGCACTTAAGTTAATCTGGACTCGTGAGGCATTTTTGGTATCTGTTGGTAGTCCTGCTGGTCTCCGAGCCGGAGACCGCATCCTCAGGGTGGACTGCGTGCCCGTTCACAAGTACGCTCACCAGCAGGTCGTCCACATGATCAGAGCGAACCCCAGCACGGTGTTGACCGTGCAGCAGAACACGTCCCGTCTGAAGACACCCATCACAGCTCCCGTGCCGGTTAACTCCGAGAAGCAGCGGCAGTTGGAGGTGTCGAAGGCGCAGACTGTGAGGCTGATGTTGGAACAGGAACAGAAATACATCCGGGACTTGAGAAATGACCTGCTCAAGGTACCGGATGTGAGGAAACAGGCCCAGCTGGAGTCCGCGGAGCAGAGGTGCTCCATACTACAACACGAGATAGACACCATCATGGCCACACAGCCGGATGCCGCGCCGCTTCCCACTACTCCGAAGCTATCCCGGAAGGACCCCGACCGAGTCAATAGCGGCTTCCTGTCGGCTCTGCCGCGCTCGCTCAGCAATCTCACGGGACAGAGTCTCAGGAACCTGCGCCAGAAGAAGGACCAGCAGCTCCCGTGCGCCGACCCTCAGCGAGCCATCATATCCATGGAGGAAGACGAACCGCCCAGCTCGGAGTCGTGCGCCCCCGGGCCGTTCTCCAGCTTGCAGTCCGTGCTGGAGTCCCGCGCCCGGACCGCCGTCTTACTCAACTGGCTGCTAGCTGAGAGTCGGTGTGCTAGCGCCGCCCTGCTGCTCCTGCTAACGGACGCCTACCGAGCCGCGCCGCCGCCCGTAGCGGCTCTACGACGCTGGGCCTACGAGATCCACTCCACCTTCCTCATGCCGGGCGCCGTGCTACAACTCACCGGCGTGGA
CGAGAACATGGCCAACGAGATAGAACACGTGCTGGTTAACGAATTCGACAAGGAGGAGCTTCTGCGGAACGTGTTCCGTAAAGCGAGGAGACAGGCCAAGGAGCAGCTGGCGAGGCAGCTCAGCGAGTTCCAGGTGAAAAGACAGGTCGGCCTCGCCACGCTCTACGGACCAGACGATAGAGTGCTGCAGCGCTGCGAACAGGACAAGCAGCAGGAGCTGGCCGTGGTGGAGCGCATCATGTGCGGTCGTCTGCGGGCCATGTCCGCCAACGGGACGTCGTCGGTGGCGGGGGGCGGGGACGCGCAGTCCGCGCTGCTAGCGGCCCTGGTCACCGCCGCGCTGTGTCTACAGTGCAGAACGTCGCTGCCAGCCTTATACTATCAGCAAATGCGTTTCATGTTGACACGAGCTCGGGGAAGCCGTATAATGTACGATAAGAAGCTCGCCGCCAAGACTGTGGATGCCAAGGATCTGGATCACAGGAGATGGTGGCACAGGAGGAGTAGAAAGAAACCGGCGTCCGTGTTGGCAGCGGCGGGCGGCGCGCGAGCCTCCTTCCTGCAGCGTGAGAAGCCTTACAGACAGAAACTCAAACAGATCGCCAAACAACAACCGCAACCGTTCGTGGTCCGAGGTCACCACCTCCAACTCCAAGCCCTCACCTCCGTGGCTCACTGTCACCACTGCGACCTCGTCATCTGGGGTCTCGCTCCGCAGGCCTATGTCTGTACAGACTGTAAGCTACGCGTGCACCGCTCCTGCGCTCGCTCCGTGGAGGAAGGCTGCTGCCTGGACGGGGAACACTCCAACAACCGCATCTCCAGGTTCATGGAGAGGATACATCCGCTGCCCGGTCAGGACTCCAGCGAGAAGAAGTCGAGGAAGTCGTCGGGGACGGCGCACTTCCTGAACATGGAGCGCAGTTTCCGTAAGATGGAGGACGACCCGCCCTGGGACAGCTCGCTCAACGCCACGCAAACACAAGATGACGCGCTGTCCAATAGTGTTGTAGGAATGTCGGCCCGGCGCTCGACGGTCGCACGGACTATAGAACGGGGTTCGATACCCGGAAGTGACAGTCGTGCGTCCGATGATAACAGATCGCATGGACGGGGTGGTAGGGTGGTAGGTGGTAGGAGCGTGGTAGGTCGTCGTGAAAGACGGCTGAAGTTTCAGACTGATGGAGTATCGCTGGAGGCGGCGCGCGGTTCGGACGCGTTGTGA

Protein sequence:

>DPOGS211816-PA
MLGGAGPAGGDDLLTVTVVRDEHGYGMKVSGDNPVYVQSVKEHGAAWRAGLRAGDRILRVDCVPVHKYAHQQVVHMIRANPSTVLTVQQNTSRLKTPITAPVPVNVSTSEKQRQLEVSKAQTVRLMLEQEQKYIRDLRNDLLKVPDVRKQAQLESAEQRCSILQHEIDTIMATQVRREQRRRMSRMPRRFPLLRSYPGSTPTPTPTTTANKPAKYKKYPAQPYSADKFRRSSHSLDGELDGPEPNSIALQLYYPLVNAPPPPLLESCPGLYVMSILSHRTERQPRVCTGAAREEPVVSGAPGPRAAEFVYIPLHSFSIISITLLFSGTCSSASALSPPDSECWSGGSPPLVTPPPGTPPPPYHHPTQDKYHAIIALKLIWTREAFLVSVGSPAGLRAGDRILRVDCVPVHKYAHQQVVHMIRANPSTVLTVQQNTSRLKTPITAPVPVNSEKQRQLEVSKAQTVRLMLEQEQKYIRDLRNDLLKVPDVRKQAQLESAEQRCSILQHEIDTIMATQPDAAPLPTTPKLSRKDPDRVNSGFLSALPRSLSNLTGQSLRNLRQKKDQQLPCADPQRAIISMEEDEPPSSESCAPGPFSSLQSVLESRARTAVLLNWLLAESRCASAALLLLLTDAYRAAPPPVAALRRWAYEIHSTFLMPGAVLQLTGVDENMANEIEHVLVNEFDKEELLRNVFRKARRQAKEQLARQLSEFQVKRQVGLATLYGPDDRVLQRCEQDKQQELAVVERIMCGRLRAMSANGTSSVAGGGDAQSALLAALVTAALCLQCRTSLPALYYQQMRFMLTRARGSRIMYDKKLAAKTVDAKDLDHRRWWHRRSRKKPASVLAAAGGARASFLQREKPYRQKLKQIAKQQPQPFVVRGHHLQLQALTSVAHCHHCDLVIWGLAPQAYVCTDCKLRVHRSCARSVEEGCCLDGEHSNNRISRFMERIHPLPGQDSSEKKSRKSSGTAHFLNMERSFRKMEDDPPWDSSLNATQTQDDALSNSVVGMSARRSTVARTIERGSIPGSDSRASDDNRSHGRGGRVVGGRSVVGRRERRLKFQTDGVSLEAARGSDAL-
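
The reported lengths (3225 bp transcript, 1074 aa protein) can be checked against the two FASTA records with a few lines of Python. This is a minimal sketch, not a tool provided by the database: it assumes the transcript is coding sequence only, that it includes the stop codon, and that the trailing `-` in the protein record marks that stop. The function names are hypothetical, and the usage example runs on a toy record rather than the real sequences.

```python
from typing import Dict, Iterable, Optional


def read_fasta(lines: Iterable[str]) -> Dict[str, str]:
    """Parse FASTA text into {header: sequence}, joining wrapped sequence lines."""
    records: Dict[str, str] = {}
    header: Optional[str] = None
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            header = line[1:]
            records[header] = ""
        elif header is not None:
            records[header] += line
    return records


def residue_count(protein: str) -> int:
    """Count residues, ignoring a trailing stop marker ('-' or '*')."""
    return len(protein.rstrip("-*"))


def expected_cds_length(protein_aa: int, include_stop: bool = True) -> int:
    """Three bases per residue, plus one stop codon when include_stop is set."""
    return 3 * (protein_aa + (1 if include_stop else 0))


# Toy records (hypothetical, not the monarch sequences) to show the flow.
toy = read_fasta([">toy-TA", "ATGGCT", "TGA", "", ">toy-PA", "MA-"])
cds, protein = toy["toy-TA"], toy["toy-PA"]
print(len(cds) == expected_cds_length(residue_count(protein)))

# The lengths stated in the record satisfy the same relation.
print(expected_cds_length(1074) == 3225)
```

Because `read_fasta` concatenates every line that follows a header, a sequence that is wrapped over several lines, as the nucleotide record is here, is read back as one string.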