Monarch geneset OGS2.0

DPOGS206544
TranscriptDPOGS206544-TA4107 bp
ProteinDPOGS206544-PA1368 aa
Genomic positionDPSCF300190 + 122941-130973
RNAseq coverage207x (Rank: top 46%)
Annotation
HeliconiusHMEL0116330.081.08% 
BombyxBGIBMGA005906-TA0.074.45% 
DrosophilartGEF-PC4e-15350.99% 
EBI UniRef50UniRef50_G6DMX40.097.57%Putative uncharacterized protein n=2 Tax=Coelomata RepID=G6DMX4_DANPL
NCBI RefSeqNP_001097184.11e-15150.99%rho-type guanine exchange factor, isoform C [Drosophila melanogaster]
NCBI nr blastpgi|2700080356e-15658.72%hypothetical protein TcasGA2_TC014788 [Tribolium castaneum]
NCBI nr blastxgi|2700080351e-16742.20%hypothetical protein TcasGA2_TC014788 [Tribolium castaneum]
Group
Gene OntologyGO:00056229.9e-42intracellular
GO:00350239.9e-42regulation of Rho protein signal transduction
GO:00050899.9e-42Rho guanyl-nucleotide exchange factor activity
GO:00055155.6e-24protein binding
KEGG pathwaybta:5243523e-102 
 K13710 (ARHGEF7, PIXB)maps-> Regulation of actin cytoskeleton
InterPro domain[61-272] IPR0002199.9e-42Dbl homology (DH) domain
[6-61] IPR0014525.6e-24Src homology-3 domain
[274-385] IPR0119932e-17Pleckstrin homology-type
[290-386] IPR0018493.2e-08Pleckstrin homology domain
Orthology groupMCL11378 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206544-TA
ATGTCTGTTGAGAATTGCTTAGTGAAGGCAATTTATTCATTCAAGGGTAAAAACAATGATGAATTATGTTTCAAGAAAGGTGACATTATTACAGTCACTCAGAAGGAGGAAGGTGGCTGGTGGGAGGGAACATTAGGTGAAACAACAGGATGGTTCCCTAGTAACTATGTTACTGAATATAAAGATACTTCTGGGTCACTGACTACATCACCAATAAGAGCAGCTTCAGAAATACAGGCTTTTAGAAATGTAGTTCTAAAAGACATAATTGATTCAGAAAAGGCTCATGTAGCAGAAATGCAGGGTCTTGTCAGCAATTTCTTGCAGCCGTTAGAAAAGAGTGATATATTGGTTAAAGATGAATTTAAGCAGCTGACCGGAAATATAAATGAGGTGCTGCAAGTGCATGAGCAGTTCCTCTCTCTTTTAGAGGAATGTGCAATGAAGACAGGGCCAGACCAAAGAGTGGGAGGGTTATTCTTACAGTGGGCGCCAAAGATCAAGACAGTACATCTGACATACTGTGGTGGACACCCAAAAGCAGTCTGCGTTCTTGATAAATACAAGGAAGAACTGAATACCTGGATGGAAAATGCCGGAGCAGTGTGTCCGGGGGTTCTTGTTCTTACTGCCGGTTTATCAAAACCATTCCGACGTCTCGGAAAATACCCTGCCATGTTGCAAGAACTGGCTCGCCATGTCCATGAAGCTCATCCAGACAGGGGTGACACTCATAGAGCATCTGTTGTTTACAAAGATATAGTTAGTGCCTGTGCAGCTTTGAGGCGTCAGAAGGAGTTGGAACTTCAAGTTGTGACGGGTGAAGTCCGTGGCTGGCCTGGTGGTGAGCTCACGTCACTCGGCGATGTTTTACACATGGGTAGTGTTGCAGTTGGACCGTCACATCAGGATAGATATCTCGTACTCTTTCCTTCGGCTTTGCTACTACTATCAGTTAGTAAACGCGTATCTGCGTTTGTTTACGAGGGATGTCTTCCATTAACTGGTATAAATGTATGCAAGCTAGAGGATTCAGATACAAGGAAAAATGCATTTGAAATAAGTGGTCCAATGATAGATACTATTGTTGCTGTTTGCCAAACCAGAGCTGAAGCTGATAATTGGGTCAGCTTGCTTCAGAAGCACTCGAACAACAGCAGCCCTTCACATGAGCCTTCCCAGCCGCAATCACTACCTCATTTGACCCGATCTCCTTCTGAAGGGGCTCTCTCTAGCATCAACTCTAGTAGACGCAGTTTATATCACGTGACCCTGCCACCGTCGCACTATCCGTCCGCATCCCCATATTATTCATTAACAAAATATTTTGCAAGGTTGGTCAAAAAAAAGGTCATCACTAGGCAAATGTTGCGTAAACTTCTACACGAAAAGCCTTGGGCTAAAGCGTTCGAACTCACCGGCCTGCCAGTGATGAGACGACATAAAAATCACATAAAGCTCAAAATAGATAATGATGGAACTATCGCTGAAACTGATTATTCTGATAGCGATAGGAATTCTGAAGACAGTGATAGGGAAGTGGAAACGACTGAAAGTTGTTCCACTTCCAGCTGCACTAACTCAGCATCAGCTGGCACGAATAATTCATCTAAAATGATGAGACAGGATGCAGTCGAAAGTATCGGACCTAGAACAATTTCTTCTTCCTGTAGCAGTTGGTGTAGCAATTTTGGTTATGTCAGATACTTTGATACGTCAACAGACATGATGCTCGATGCACCAAACCCAAAAGGTGACTGCACGAAAAATATATTTGCCTCTGTACCTAAAAGAGCAGATGCCAATGTAAATGAATATAATTTACAAAAATCTTCGAAATCCAATAATTTGAATGTTTGCATCCCTTACCAACATTCCGGTAGCGCAGAAAATTCTAGTTTAGAAATAAAAAATTATATGGCTTGTGAAGATTTAGTGAATATAGATCAGGATATGCAAATAAATAATTTTGTCGACGAGACAGATTATGTGCCGACGAGGCGAAGCTTTCCTAACTACAGCGTAAGAAACGATATCCCAAAGTTGTGGAAGTCCACCGAAGAAAGGTATACAAGAGAAGAAGAAATTAATAACAATCTTATGATATCTGATTTATTACAAGTTCTTGATCCTCCATCGCCTAAAAATCCACGAGATTCTCTTCTAGAATCACTTATAATAGATCCACCTCCGATGTTTCGTAACGAAAATGACGACCTTAAAATTCTAAATGTGAATCTAAATGCCAACGTTCCTTTCCGAAAGCATTCACTAAACTCGGACAAAAAGATTAGACGTTCGACGTCAAGGTCAATGGTGGAGACGGATAAGAAAAATAATAATGGACATTTGGAGAGAATGAGTTCACAGTCTGAAACGTCTAAAGTTAGGCGAAAATGTGAATGCTGCAATCGTTCTTTATGTCCAAGTCCCAGATCTTCAGACTCTGGGGTCGCTGGTAGCTGCAATTTAGCGTCACCTGATTTAAACATGCACGGCAATGATTCGGATAGCCATGAGAAAACATCAAATGACATGAAAGATTCTCTCAACAACCTATCTGATAGTAACTTTAATAATAGAAAGTCAACTTTATCTGAAATCGAAGCAGCGACATTTGAAGATCAATGTAGATGTACGTCTCCTTTCGGGTCGACTGCTAGAACCTCGTGTGTGACAAGTGTAACCTCCGAAATGAGTCTAGACGTGAAGGATTTATCTAAAGCAAACGTAACATCTACTTTCACCGCCCATCCACCAGTGCCCTCTCCCGAAATCAAACGAACATCGATCAGAAGGAACATCGACCTATACGTGCCGGAAATTAAAATTAAGCCAGCGGTTCCACCGCGTATTTATAGGAAGCCAAGCACCCATTTGGAGATACCAAAAAATGTGCGACATCACATGCCATGTCAATGGAGCACATTGAATATTACTGAACGAACAAAACCGAGCTTGCATTATCACATGCGGATATACCGGGAGAAACCGGAAGATAATATGCACAGACTATCAAGAAAAGATATGACAAGAAATTGCGATGTTTTCAATAAAGAACACAAACGGAAGGATGAAAAGACTTCGACATCCCAAAAGACGCGGTCACGTAGTGAGGATTTGGCCAAACTGCAGAACGGTTTAACGGAAGCTCAGACTGGTTTCGTGGTCTATCGTTCAGATCTGTACGCACACTGGTGGATGAAAGCCAAATTACCGATAACCGTGGTCACCGACTCAGGCAAGGATAATTTTTTTATGGCCTTGAGCGACAGCAATAACGAATTGTCCTCTGGTCTCACTAGTCACAAGAGATCCCATTCATTCAACAATCATCAAAGCTTCACACAGCACACGCAGCAAACACAAAAGAATAAGTCAAAAAATCAATCGGCGCGTTGGCTCCTAAACTCCTGTGACTCACTCGATCAATCGCCAGCTAAATCAAAAATACCTATAGGAAAGAAATTCCCATCCGTGGATTTCACCGACGCAGCAGACATAAGCTACATTAGTAAAGGGTGGAGCATAACATGTCTTCGTCCGGCGCCGCCTTTACGTCCGCAGTCATTTACTGTGGGTTCAGATGAGAACATCGGCCCCACACACCCGTACGCGCCCCACCAGAGGAAATCGAACTCCTACGAGGAAGATGCCCTCATACTGAAGGTCATAGAGGCTTATTGCACTTCCGCCCGGTGTCGCAACGCTGTTAGTTCGGTACAATATCATGCACCCGCTACTAACATGCCTCTTACGCACACTAAACTGTCGGTGACGTCAGACAGGAGTACCAAAGTGAAGCGGAACAAGAGCTCGGCCGACGCTTACATGTACCGGGCACCTGATAGGAGATTAGTGCCGGATAGGAAGTACAACCTGAATAGATCTAATCCAAATCTATGGGATTGCGCTGAGCGGAGGCGGACGAATCTGATTGGTGGGAACGAACGGAGGAGCGGCTCCCAGCCGAGTCTCGTGCCCCTTCGCGCTACCACTTCCCCCCATCCAATCCAGCCGTCACAACCGCCACGCTCAGCGCGTTCGTCTACATGGTGCTGTGGGACATTCGTCAAGCAGCTAACTAAATCCCATCATTTCGACTGA

Protein sequence:

>DPOGS206544-PA
MSVENCLVKAIYSFKGKNNDELCFKKGDIITVTQKEEGGWWEGTLGETTGWFPSNYVTEYKDTSGSLTTSPIRAASEIQAFRNVVLKDIIDSEKAHVAEMQGLVSNFLQPLEKSDILVKDEFKQLTGNINEVLQVHEQFLSLLEECAMKTGPDQRVGGLFLQWAPKIKTVHLTYCGGHPKAVCVLDKYKEELNTWMENAGAVCPGVLVLTAGLSKPFRRLGKYPAMLQELARHVHEAHPDRGDTHRASVVYKDIVSACAALRRQKELELQVVTGEVRGWPGGELTSLGDVLHMGSVAVGPSHQDRYLVLFPSALLLLSVSKRVSAFVYEGCLPLTGINVCKLEDSDTRKNAFEISGPMIDTIVAVCQTRAEADNWVSLLQKHSNNSSPSHEPSQPQSLPHLTRSPSEGALSSINSSRRSLYHVTLPPSHYPSASPYYSLTKYFARLVKKKVITRQMLRKLLHEKPWAKAFELTGLPVMRRHKNHIKLKIDNDGTIAETDYSDSDRNSEDSDREVETTESCSTSSCTNSASAGTNNSSKMMRQDAVESIGPRTISSSCSSWCSNFGYVRYFDTSTDMMLDAPNPKGDCTKNIFASVPKRADANVNEYNLQKSSKSNNLNVCIPYQHSGSAENSSLEIKNYMACEDLVNIDQDMQINNFVDETDYVPTRRSFPNYSVRNDIPKLWKSTEERYTREEEINNNLMISDLLQVLDPPSPKNPRDSLLESLIIDPPPMFRNENDDLKILNVNLNANVPFRKHSLNSDKKIRRSTSRSMVETDKKNNNGHLERMSSQSETSKVRRKCECCNRSLCPSPRSSDSGVAGSCNLASPDLNMHGNDSDSHEKTSNDMKDSLNNLSDSNFNNRKSTLSEIEAATFEDQCRCTSPFGSTARTSCVTSVTSEMSLDVKDLSKANVTSTFTAHPPVPSPEIKRTSIRRNIDLYVPEIKIKPAVPPRIYRKPSTHLEIPKNVRHHMPCQWSTLNITERTKPSLHYHMRIYREKPEDNMHRLSRKDMTRNCDVFNKEHKRKDEKTSTSQKTRSRSEDLAKLQNGLTEAQTGFVVYRSDLYAHWWMKAKLPITVVTDSGKDNFFMALSDSNNELSSGLTSHKRSHSFNNHQSFTQHTQQTQKNKSKNQSARWLLNSCDSLDQSPAKSKIPIGKKFPSVDFTDAADISYISKGWSITCLRPAPPLRPQSFTVGSDENIGPTHPYAPHQRKSNSYEEDALILKVIEAYCTSARCRNAVSSVQYHAPATNMPLTHTKLSVTSDRSTKVKRNKSSADAYMYRAPDRRLVPDRKYNLNRSNPNLWDCAERRRTNLIGGNERRSGSQPSLVPLRATTSPHPIQPSQPPRSARSSTWCCGTFVKQLTKSHHFD-