Monarch geneset OGS2.0

DPOGS208152
TranscriptDPOGS208152-TA3423 bp
ProteinDPOGS208152-PA1140 aa
Genomic positionDPSCF300058 - 52992-70746
RNAseq coverage288x (Rank: top 38%)
Annotation
HeliconiusHMEL0110820.072.66% 
BombyxBGIBMGA013771-TA0.077.10% 
DrosophilaCG10188-PA2e-15344.19% 
EBI UniRef50UniRef50_D6WQX00.043.61%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WQX0_TRICA
NCBI RefSeqXP_002423205.10.041.63%Rho/RAC guanine nucleotide exchange factor, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2700100700.043.61%hypothetical protein TcasGA2_TC009420 [Tribolium castaneum]
NCBI nr blastxgi|2700100700.043.20%hypothetical protein TcasGA2_TC009420 [Tribolium castaneum]
Group
Gene OntologyGO:00056222.7e-51intracellular
GO:00350232.7e-51regulation of Rho protein signal transduction
GO:00050892.7e-51Rho guanyl-nucleotide exchange factor activity
GO:00055151.2e-30protein binding
KEGG pathwaybta:5059406e-68 
 K12791 (ARHGEF2, GEF-H1)maps-> Pathogenic Escherichia coli infection
InterPro domain[351-746] IPR0157214.1e-75Rho GTP exchange factor
[339-566] IPR0002192.7e-51Dbl homology (DH) domain
[568-701] IPR0119931.2e-30Pleckstrin homology-type
Orthology groupMCL14212 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208152-TA
ATGGGGTTGCAGAGTGCGTTGAACGGTCACAGCGCTTCACTATCATCTATGGAGTCTGAGGGGGAGGTGTCACGGCCAGTACGACATTCAACTCACTCGTTGAATGACAACGATTTAGCTAAGGAGTTCGAGAAGGTAACTGCAGCATTACGTGCTGCCGCTGACACCAGCTCCAGCAGACTGCTGCCACGTCTGCCGCTGCAGAAGTCCGTCTCCACACCGAGCATCGTGGCCGCCGTACATCCGACACAACCACCCCCAGCCGCCAATATTCTGTCAGCGGGTGGTGCGAGCGAGACGGAGAGTGACGACGACACGTCCGCGCCGGTCGCCAACCAGCCGAGAAGGAATAACACTCAGGAGCACCTCGGCTTACATCGGCTGGCGTTCTTGGAACAAGCCGCCTTCGACCATCACGCTGAGAAACGCCGTAAGAGAGGAAGCTTGTTCTTTAGGAAGAAGAAGGACAAATCTCGCAAGGCCTCTCACGTGTGGTCGGCGGCCGCGGCTGGTGGTGCATGTGATTGGTGCGCGAGGGACCTGGCGGACAAACCGGCTCTGTACTGTGACAATTGCACTATGACGGTTCATCAGAACGTTTGTAAGGACTACATTGTTGAATGCAATAAACCAAAATCATCCAAGACGTCCGTGGGCAAGTCGCTAAGCGCCAGTAGCGGTAAAAGCAGCAAACGCAGTTCCGTTTCCGGTCAATCACAGAACTCTAACAGCAATCCAAATCCCCTACTACAAGTGAAGGGTATAGGCGTGTGTGTATTGTTATTGTTCAGTAAGAAGCAGTCTTCCGGGTGCTACTCTCCGTGGCGTCGCGTGGCCACCAAGCTCGGAGTACACACGAGTCATATTTATAATGACGATAAAGACGGATCCGATCACAAGCACGATCAGAGCAATGCATCAGACGACGCTTCCACTGAATGGCCGGAGGTTTATCTGACAGCGGAACAGTTAGGTGATGAAGCGATACAGCTGGGCTTGGGGGCTGTCGAACCTGACACCTGGGCAGCTGGAGCACCTAAAGGACTTGCTAAACAAATCGGCGACAGAGAGACCAAGCGCCAGGAGCACATATATGAACTAATATTAACCGAAAAACATCATTGCCTCACCGTGAGGCTCATGCAGAAGATGTTCGCCGATGGTCTATCTCGTCTTGGAGGCGTGTCTTCATCACAGGTCTCCCGTATGTTTCCTCGTCTGGATGAACTATGGTCGTTGCACGCGGCGTTGCTCGCACGACTCAGGGCTCGCCAGCGCGTTGGCCCACGAGTGGCCAGCATCGCCGATATATTAGCGGATACGTTCGCAGCGCCACATCACCAAAGGCTGAAGGCTGCATATGGTGAGTTCTGTTCGCGTCATCGTGACGCCGTCGAGGTGTTCAAGGATGTCTGCGCGAGGGAGACGAGGGTCGCACGGTTCATAAGGAAATGTCAACAGAATCCACTACTCAGGAAGAAGGGTGTTCCCGAATGCGTGTTGTTCGTGGCACAGAGACTCACAAAGTATCCCCTACTATTGGAACCGCTCCTCAAGACCGCGGGTGACGATGCACACGAACGTGAACTATTACAGAAAGCACTGTGCGGTGTGAAGGAAATTCTAGTCGACGTTGACAACCAAGTGGCGGCAAAAGAGAGAGAAGACAGGAAACTGGAGATATATCATCGTATCGATGCAAAGTCCTTCGCCAATTATCGCGGACGGAAGTTTAAGAAGAGCGACATCCTGCAAGGGAATAGGAGTCTTACATTCGAAGGTGTAGCGACTTTGATGCAAGGTAGGAGTAAAATGCAGACGCTCCTAGTGATAGTGTTGACGGACGTGTTGTTCTTCCTTCACGACAACAACAACAAATACACTTTCTTCACGCCTGACAATAAGACTGGTGTAGTATCTCTATGGAAGTTGTTAGTTCGCGAGAAGGCCGGAGCCGACGGCCGAGGCCTGTACCTCATATGTAGCGGGCCGCCTGGACCCGAAATGTTCGAACTAAGAGTTCATCGACCCAAAGACATCGCTCAATGGATACGGGCTATACGAGGGGCAGTTCAAAGCTGTCCCGAAGAAATAGAAGAATCTGAAGCTGGGAGTACCGTGACGTCAGCAGAAGAGAGACAGAAACAGTTGGAGGCGAGGCATGAGAATATAAGACTGATTACAGAGGCTTTGAGGGCAAAGGACAGAGAGCAGGCGCAATTGTTGGAAGAGAAAATGGTGTTGCATATGAGGATGGTCGGACACACCGGGAGTTCCGCCTTAGATGTACCCAGCACAGGTGTGCCCCCCTGTCCCGGGGGATTGTCGTTCCCGGAGTACGTCCGTCTATCAGGACCCACGCCAGACACGCACGCCTTATGGCAGGAAGTCTGCAGGGTCGTTCAGGATGCTCTGGAGGCGTCGTCCCTGGGCTGGTCGTCTCTCAGCGGCGTGTCTCTGGGGCGAAGCACGAGCTCGGCGGGTGAGAGGCACTCGGTCCACTACACCAGCCCCGCTCTGCCGAGGAGGGCGGACACCTTCGCCGGCTTCGACGCGCATAGAGGGGGTGTGTCCGTACGTCTGTCGGCGTCGGACGTTCCCAGCGAACCGGAAACGGAAGCCCAAATGCACGCTCGGATCAAGGACGAAGCGAACGCTGCGCTGAAACTACAACATGCCATATACACGCTCACCTGTATAGTGTGGCAGCAGCTGACCACCATACACAGCCTGGAGGCGCAGGTGTGTGCGTGGAGGGCGTGCGGGGGGGCGGGAGCGGCCGCGGTGGGCGGGCGGGCACACGACGCGCAGCTCGAGGAACTGAGGCACGCACAGGCGAGGCTCACGGCGGAGAGGGCGGCCTGGGAGGCGCAGAGGACGGCCGACAGGGACGCGCTGGAACATGATCGTAGACAACTACAGGCAGCGCGAAAGGAGCTGGAGGAACAACAGAAAGACGTTGAACAACAGAGAGAAAGGCTCTACAGGCGCCTGGAGAGATTACAACAACACGGTGGCGGATCTCAAGAGGAAATAGCTAGCGTTGGAACTCTGTCGCCTGATTCAAGTGTCAGCGATACCAACAGAAGGAAAGAACCAAAATGGAGAAATAACCGCGGTTCAACCGGCTCGGAGTCGTCGTTGAGCGCCTGCAGCGTCCGCGGCGCTGCACTGCCACCGCCACAGCTGCTGTCGGCGCACAACGAGACGAGAGCCACAGCGCGCGCTCCGGTACAGGTGATGCGAGACTGTTCTAGTATGAGAACAAACACCTACCCAAAGCTGCCGGACAAGTTCCGCGTGCGATCTCCGGATGCCGCGCCCCCCTCGCAGGCCCCGCCCACCTCGCAAGCCTCCCCCCTCCCGCCGGCGGCCCCCATCCCCCCGGCGGCCCCCTCGGAGGAGGAGGTCATCTACTTCTGA

Protein sequence:

>DPOGS208152-PA
MGLQSALNGHSASLSSMESEGEVSRPVRHSTHSLNDNDLAKEFEKVTAALRAAADTSSSRLLPRLPLQKSVSTPSIVAAVHPTQPPPAANILSAGGASETESDDDTSAPVANQPRRNNTQEHLGLHRLAFLEQAAFDHHAEKRRKRGSLFFRKKKDKSRKASHVWSAAAAGGACDWCARDLADKPALYCDNCTMTVHQNVCKDYIVECNKPKSSKTSVGKSLSASSGKSSKRSSVSGQSQNSNSNPNPLLQVKGIGVCVLLLFSKKQSSGCYSPWRRVATKLGVHTSHIYNDDKDGSDHKHDQSNASDDASTEWPEVYLTAEQLGDEAIQLGLGAVEPDTWAAGAPKGLAKQIGDRETKRQEHIYELILTEKHHCLTVRLMQKMFADGLSRLGGVSSSQVSRMFPRLDELWSLHAALLARLRARQRVGPRVASIADILADTFAAPHHQRLKAAYGEFCSRHRDAVEVFKDVCARETRVARFIRKCQQNPLLRKKGVPECVLFVAQRLTKYPLLLEPLLKTAGDDAHERELLQKALCGVKEILVDVDNQVAAKEREDRKLEIYHRIDAKSFANYRGRKFKKSDILQGNRSLTFEGVATLMQGRSKMQTLLVIVLTDVLFFLHDNNNKYTFFTPDNKTGVVSLWKLLVREKAGADGRGLYLICSGPPGPEMFELRVHRPKDIAQWIRAIRGAVQSCPEEIEESEAGSTVTSAEERQKQLEARHENIRLITEALRAKDREQAQLLEEKMVLHMRMVGHTGSSALDVPSTGVPPCPGGLSFPEYVRLSGPTPDTHALWQEVCRVVQDALEASSLGWSSLSGVSLGRSTSSAGERHSVHYTSPALPRRADTFAGFDAHRGGVSVRLSASDVPSEPETEAQMHARIKDEANAALKLQHAIYTLTCIVWQQLTTIHSLEAQVCAWRACGGAGAAAVGGRAHDAQLEELRHAQARLTAERAAWEAQRTADRDALEHDRRQLQAARKELEEQQKDVEQQRERLYRRLERLQQHGGGSQEEIASVGTLSPDSSVSDTNRRKEPKWRNNRGSTGSESSLSACSVRGAALPPPQLLSAHNETRATARAPVQVMRDCSSMRTNTYPKLPDKFRVRSPDAAPPSQAPPTSQASPLPPAAPIPPAAPSEEEVIYF-