Monarch geneset OGS2.0

DPOGS207246
TranscriptDPOGS207246-TA5241 bp
ProteinDPOGS207246-PA1746 aa
Genomic positionDPSCF300008 - 1370640-1380780
RNAseq coverage155x (Rank: top 53%)
Annotation
HeliconiusHMEL0165030.075.30% 
BombyxBGIBMGA012060-TA0.063.50% 
DrosophilaGef64C-PA0.050.46% 
EBI UniRef50UniRef50_B4IZ660.051.65%GH15696 n=3 Tax=Drosophila RepID=B4IZ66_DROGR
NCBI RefSeqXP_001864486.10.055.61%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700574460.055.61%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700574460.047.18%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00056223.4e-47intracellular
GO:00350233.4e-47regulation of Rho protein signal transduction
GO:00050893.4e-47Rho guanyl-nucleotide exchange factor activity
KEGG pathwaydre:5710311e-12 
 K07532 (ARHGEF12, LARG)maps-> Axon guidance
    Regulation of actin cytoskeleton
    Vascular smooth muscle contraction
InterPro domain[1328-1531] IPR0002193.4e-47Dbl homology (DH) domain
Orthology groupMCL17868 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207246-TA
ATGCCAGAAACTTGTGGGGGTATCAAAAGGAATGTGAAGATATACGAAGCACCAGAAGCTACGAGATCTAAAAGATGGTCTTCAAAATTAAGGCTTAATACATCTGCAAGTTCTAAATCTTCTGAAATATCCCCTGAGAGTCCATATATGTATGGCACTATTAGTGGTCAAGGTGGTTCAGCATTATCGAAATCCATGAAGTATGCGGAAACTTGGTTGTACGGGTCTGTGAGAACCCAAACACCACCTGTGAGGCCAAGCGTCTTTTCAGCGTATCCTGACATTCCTGGACCCGTATTAATAAGCACTCCACAAAAACCAACCCCACACAACTACGCTGTTATATTATGTTCCTGTCCTGAATATCTAAATGGTACTAAAAAAACCACTTCAACCAAAGTCAGTATATGTAAGAAATGCAAAGGATCTCGTTTACCTCTTACAATAGCTGAGAGTCCCCGTTTATTAGTAGGTGGTACTGTACGAGGCCCTCAAGTAAATCGCGATGCAGGCTTACTAAGAGCTGGCACTGTAAGAGTTCCTAGTTCTAAATCACGTCCTAGCATTTTAAAGCCAAATGGCGACTCAGATCCATATGATTTAATGCGTAGAAGTAGACTAGCTCCACCTCACGAAACGAGAATAGGAAGTCAATTTTCTACAACAAGATCCCGAGCGAAAAGCATAAGTCCTTGTAGAGTGAAAACGAAGCAAACAAGTCCGGAATCATTAAACAAAAATCGCAGTAAGTCTGTCAGTAGAGTAAATGAATTATGGATAGATGAGGATGCAATTACAACTGACAAAAAAAAATCCATTCTCTCTTGTGATATAAACCCTTACGAATTAGTAAAAGCAGGAGGACATTCTAAATCTACGGTGGATATAGAGTTCGACGATGACTTTGTTGAAGGTTTCCAGGATCCAAGTAAGAGTAGTAGTCGGTCTTCAAGTAAGGGTAAAAATGATAGCAATGTTAAGGCAATAGGAGGACAAAGGATAAGAGTGTCAAATGATTCAAAATCTGGTAGTGTAGACGAAGTATCAGTATATGACCCTATAAAATACGATCTCAAAAATTATGACTTTCAATCAACTGATATGTCTACGGCACTATCTTTACCAAACATAAAAAATGAAAATAAAGTAGTTCCATTGAATAAAAATAGAAATAAATCTATATCTCCCTCAAACCGTTTGAAACAAAAGAGTGTGTCACCAAAAAGACCACCGAGAAGATTAAGATATAATAAAACAGAGGATGATAGTGAAAGTGATAATCAGCGATTACCCGATAAGAATTTTAGCACAAAAACAGCTCTAGATACTACTCGAAGTTCCAAATATAAGAATAATAACAAGGAAACTATAAAATCTATATTAAAAAAGCCTAAACGGTATGATCCAGATGATTCCAATGGAAAGTTTGAAAATAAAGAGGAATCATTTGAAAGTAAACGTCTTAATTCATCACAATTTTATCTTCCAAAACCGAAAGATAATAAAAGTTTGGTCTTAACTCAGAATATCCATCAAAGAAAACGAGTTCAATTTTTGGTTGAAAAAGAAGAAACTAAAGTAATATACACAGCAGACCTAAACTTACATGAAAATATTGTCACAGAGTCAAATACAATTATTGAAACAGAAGAGGAAATTGTTGCCAAGGGTGATGTCGTTAGTCAAAGTTTAGAGGCAAATGAAACTGTTACTACCGAGTGTAATGAACAAAATTTGATAAATGAAATGTTAAATGATTCTGGAGTTTATGGAGAAGTTGAAATGAATAGTGAGAGTGAAAATATAATACAGCAAAATGTTAATACTCAAGGAGGATATATCAAAAAAAGCGAAGAAAACAATGTAAAAGATAACCATGTGCCGAAAATTCCTGTTTTAAGGCGTTCAGAATCAGAACGTCTGGTTCCGACTTTATCAATATCTCCTCCAAAGTTTCTAGATACGATGGCTTTACGTAAAAACAAATATGGTTATAAAAGTCAAGTATTTCTAGAATTTGTAAATGAATTAGAATCAACAGTTCAGTCTGAGAAACATGAATCAAAAGACTCTGAAGACATGAGTGCTAACCGTGTACCTATTGGAAATGAAAATGAAGTAGATACAGGTAATTTAAGTGATAACAGTGAAATAACAACTAAAGTTTTAGAGAACTTAAGACGTGAAATTTCTTGTTCTCCTGAACCACCACCGCGTTTAAAATTAAAATCTAAGAAACACAATTACGTTAAAACTAGATTCGTTCGTAACAATTCTACAAGTTCCACAAGTGATGAATGGTCTGATTCCAATGATAGGGAACAAAAAACAATACTGAAAATTAAACAATTTGATTCTGATGATGGTGATGAACAAAACAGTAAAGATACTTGTAAAAATATAGAACTTGAGCCTAGAAAAACGTCTATTCAGATAAATGGCAATGAATGTTATTCAACTATGAATGTTAATAACGATACACCAATATATCTGTCTTCTGTTGTTGTTAACGATGATTATGGTAATACATGTAACACTTATCAAACCGGTACTACAGTCACAATTAGTGTTGGGACTCCACAAGAAGGTATGAAGAAATCTAAGAGTCAAATATATATAGGAGCTGTTTTTCCAGGACACAATAATAGTATCGAGGATACAAATACTTATGAAGATTATTATAATGATAGTCAGGGAGATCAATTTAATTTAGTGAGCAGTAATGAAATATCTTCTATTTTAAATGATCCGGTAGAAGCAGTCCGTCGGAATCTTATCCCCCATGTTTGTGGGAAAAAAGATGTTTTCACCCAGGAAGTAAATAATTTAAGTCAAAGAAGTAAATCAGAAATTAAAAATTCAGATAAAAGCGGTAATTTTGTAACGAAGCTTTTTGATGACCCATTTTTTGCTCATTTAGCAGAAGGACTTGATTCCAACTTAGTTAAGAAATTGATAGAAAATTCGTTAATCAAACTTCAAGAGACTAAATATCAAGAAGGATCAGAAAGCAATAAACAAACTATTGAGAAACTTATCGAAAATTCATTAATATCGTTAAAAGAAGAAGTAAAAAAAGAAAATAAAAGGAATGAAATTATACATATACCAAAAAATGAAGATCTGGAACCATCATCAAAATCAGAAATTCAAACATCCGATAATTTAGAAGATGATAAAGCCTGTTCAGCTCCTTATGAGAGCATGGAATATGAAAGTGGGACAGTGGGAGTTTTTTCAGACCAGGAACCAATGTCTGATTGTTATAACGCTTCCGCGAGTGAACTTTCTACAGAAGATGATACAAATTCTACAAGATCTAAGTTTTATCAAATGCTAGTGGATGCTGCCATTTGCGATATTGAAATTTCAAATAACACTGACGATGACCATCTTTATGAATCTATACGGTTAAATAGTAGTGACCCAATTTATGAAGAAATAGGTGACATGCCTCCTCCTTTGCCAACCAATCCACCGCCAAATTCTCTTTTATTATTAGATGATGAAAAACGAAGTGGGTCCCGTTCAATTTTTGAAGGTGCGTCTAAGTACGATATTTTGTCTTATTTAGTAGATGCTAAAGAAAGGGGCATAGATGATGAAGAGACCTATATAACTAATTACAATAATGACAATAATGAATCAAAGGATAAAACCAAGATAGTAAAAAAACACATAAGTAGTAATACCAGTCAAATATCAAATGCATCAGATTCAAGTGAAGATAATTCGTTGGTTATTAATCAGGAAAATATTGAAAAAGTTGTGGTTTGTAAGAAAACATCAGCAGAAATAGAGAGAAATGATTCTGGCGTTGGTTCGGAAACAAGTAAATCGTCTAGAAACCGGCTCCAGGGTAAAACAACAACAACTAACTCATTAAGCGATAAAGACACTCCTATTCATCTGTGTGAAGATTGTGATACTGCTGTAGAAACACAAGTGACAGAACAAGGATCAGTTTTTGCTCCACTGGTTTGCCGTAAATGTTCGAAGAAAAGAACAGAAAGAAAGGAAATAATAACGGAAATTGTGGAAACTGAAGAAAAATATGGACGTGATTTACAAATAATTCTTGAAGAATTCTATAAACCTATGTTAGTTGCTGGTCTTCTAACGCAAGAACAATTGAGCGCTATATTTTTAAATGTTGAAGAGTTAATTGATAACAACCAAGTTCTTTCGGAAAAACTAAGGGACGCTCTTGAAATTGCAGTGGAACAAGGAGATGAGGACTTACTGACTGTTAACGTTGGTAAGATTCTCTTGGAATGTTCAGGAATGTTGACAGCTTTTCAATCATATTGTGTAAAACAAGCAGGCGCCGCATTGCTTTTAGCCGGACTTGAGAAGGAGAAGGAACTTTTAAGGATATTCTTGCGCGTCTCGCAAATGGAAAATGCGGTTCTGAGGAGAATGAACTTAAACTCGTTCCTCATGGTACCTGTTCAACGCGTGACAAAATATCCGCTCCTTCTTTCGCGGCTTTACCGAGCCACACCAACTTGTGCGTCTGAGAGGGAAGACGTCAAGGGTGCTCAGCGTTGTGTTGAATCCAGACTTGAGGAGATCAACGCTGCCGCTGCAGCTGCGGCCGCCGCTGCCAGAGATGTGCCGCTATGGCGGAGACTCGCAGTGGCGCGGCGGACTGCTCATGATTTGCACGTCGCCGACATAAGACTCAGAAAAATGGCCGTAGATGTTTTGGATTGGAATCACGATGATGCTAGGTTCGCAATGGAAGGCAAGCTGCTGTTTACACAACCGAACGACAACAACTGGCGAAAAGGCCGAACAATCAAGTTGATGCCGATCAATGCACTTTTGGTAACTAATGGAAAGCCAACAATCGCTCATAAAACGAACGAAATACGGGAAACAAGAGAAGCAAGAGATCGAGAGGCGAGGGAAAGAGAAGGAGATGCTCTTTTCGCACGCAGTGGAGTAAGGGAGGCTGCTTTACTTCTTGTAAGAGAAAAAGCAGGAAGATACACCTTACAAAGAGAGCCACTCTTTTTAGACCGCTGCGTAGTCGCTGCCGATCATGAACCAGAACATTTCTTTGAAGTCCACGAAATAACAACTAAAGATTCATTCATTTTCAAGGCCGAAGAAAATACCCGCACTCGAACTTGGTATCGACAGTTGCAATATCATGCACAAGGAGCTGGCGCATGGCGTAAACGGCGAAATGCGCTGGCTAACATTATGATTAACCCGATGCTTACTAGAAACTAA

Protein sequence:

>DPOGS207246-PA
MPETCGGIKRNVKIYEAPEATRSKRWSSKLRLNTSASSKSSEISPESPYMYGTISGQGGSALSKSMKYAETWLYGSVRTQTPPVRPSVFSAYPDIPGPVLISTPQKPTPHNYAVILCSCPEYLNGTKKTTSTKVSICKKCKGSRLPLTIAESPRLLVGGTVRGPQVNRDAGLLRAGTVRVPSSKSRPSILKPNGDSDPYDLMRRSRLAPPHETRIGSQFSTTRSRAKSISPCRVKTKQTSPESLNKNRSKSVSRVNELWIDEDAITTDKKKSILSCDINPYELVKAGGHSKSTVDIEFDDDFVEGFQDPSKSSSRSSSKGKNDSNVKAIGGQRIRVSNDSKSGSVDEVSVYDPIKYDLKNYDFQSTDMSTALSLPNIKNENKVVPLNKNRNKSISPSNRLKQKSVSPKRPPRRLRYNKTEDDSESDNQRLPDKNFSTKTALDTTRSSKYKNNNKETIKSILKKPKRYDPDDSNGKFENKEESFESKRLNSSQFYLPKPKDNKSLVLTQNIHQRKRVQFLVEKEETKVIYTADLNLHENIVTESNTIIETEEEIVAKGDVVSQSLEANETVTTECNEQNLINEMLNDSGVYGEVEMNSESENIIQQNVNTQGGYIKKSEENNVKDNHVPKIPVLRRSESERLVPTLSISPPKFLDTMALRKNKYGYKSQVFLEFVNELESTVQSEKHESKDSEDMSANRVPIGNENEVDTGNLSDNSEITTKVLENLRREISCSPEPPPRLKLKSKKHNYVKTRFVRNNSTSSTSDEWSDSNDREQKTILKIKQFDSDDGDEQNSKDTCKNIELEPRKTSIQINGNECYSTMNVNNDTPIYLSSVVVNDDYGNTCNTYQTGTTVTISVGTPQEGMKKSKSQIYIGAVFPGHNNSIEDTNTYEDYYNDSQGDQFNLVSSNEISSILNDPVEAVRRNLIPHVCGKKDVFTQEVNNLSQRSKSEIKNSDKSGNFVTKLFDDPFFAHLAEGLDSNLVKKLIENSLIKLQETKYQEGSESNKQTIEKLIENSLISLKEEVKKENKRNEIIHIPKNEDLEPSSKSEIQTSDNLEDDKACSAPYESMEYESGTVGVFSDQEPMSDCYNASASELSTEDDTNSTRSKFYQMLVDAAICDIEISNNTDDDHLYESIRLNSSDPIYEEIGDMPPPLPTNPPPNSLLLLDDEKRSGSRSIFEGASKYDILSYLVDAKERGIDDEETYITNYNNDNNESKDKTKIVKKHISSNTSQISNASDSSEDNSLVINQENIEKVVVCKKTSAEIERNDSGVGSETSKSSRNRLQGKTTTTNSLSDKDTPIHLCEDCDTAVETQVTEQGSVFAPLVCRKCSKKRTERKEIITEIVETEEKYGRDLQIILEEFYKPMLVAGLLTQEQLSAIFLNVEELIDNNQVLSEKLRDALEIAVEQGDEDLLTVNVGKILLECSGMLTAFQSYCVKQAGAALLLAGLEKEKELLRIFLRVSQMENAVLRRMNLNSFLMVPVQRVTKYPLLLSRLYRATPTCASEREDVKGAQRCVESRLEEINAAAAAAAAAARDVPLWRRLAVARRTAHDLHVADIRLRKMAVDVLDWNHDDARFAMEGKLLFTQPNDNNWRKGRTIKLMPINALLVTNGKPTIAHKTNEIRETREARDREAREREGDALFARSGVREAALLLVREKAGRYTLQREPLFLDRCVVAADHEPEHFFEVHEITTKDSFIFKAEENTRTRTWYRQLQYHAQGAGAWRKRRNALANIMINPMLTRN-