Monarch geneset OGS2.0

DPOGS208832
Transcript         DPOGS208832-TA   4788 bp
Protein            DPOGS208832-PA   1595 aa
Genomic position   DPSCF300036 + 726323-740857
RNAseq coverage    442x (Rank: top 28%)
Annotation
Heliconius         HMEL004187         0.0      68.97%
Bombyx             BGIBMGA007937-TA   0.0      64.92%
Drosophila         CG1657-PA          2e-60    34.53%
EBI UniRef50       UniRef50_E2C1S5    5e-168   43.83%   GTPase-activating protein and VPS9 domain-containing protein 1 n=4 Tax=Arthropoda RepID=E2C1S5_HARSA
NCBI RefSeq        XP_395273.3        6e-170   43.44%   PREDICTED: similar to GTPase activating protein and VPS9 domains 1 isoform 1 [Apis mellifera]
NCBI nr blastp     gi|307170869       5e-170   43.71%   GTPase-activating protein and VPS9 domain-containing protein 1 [Camponotus floridanus]
NCBI nr blastx     gi|322787043       0.0      35.08%   hypothetical protein SINV_11752 [Solenopsis invicta]
Group
Gene Ontology      GO:0007165   7.4e-30   signal transduction
                   GO:0005622   7.4e-30   intracellular
                   GO:0051056   3e-27     regulation of small GTPase mediated signal transduction
                   GO:0005096   3e-27     GTPase activator activity
KEGG pathway       nve:NEMVE_v1g243068   6e-09
                   K05767 (IQGAP) maps-> Regulation of actin cytoskeleton
InterPro domain    [98-446]      IPR008936   7.4e-30   Rho GTPase activation protein
                   [154-364]     IPR001936   3e-27     Ras GTPase-activating protein
                   [1483-1581]   IPR003123   1e-20     Vacuolar sorting protein 9
                   [1480-1594]   IPR013995   2.9e-13   Vacuolar sorting protein 9, subgroup
Orthology group    MCL15875 Multiple-copy universal gene

Nucleotide sequence:

>DPOGS208832-TA
ATGATGCAGCCAACATCAGAAAGCTGTATGAATATAGCAGAACTAGCAAACCAATTAAGGAGAGAGAAAATCTTTATAAATTCCGAAAGGCAATTGATACAGAATCTCAATGAAGAGGTGGAGAAAACTGTCTTGGAACTGTTGCAAGCTGCCTGGATATGTTCACAACAGCGGCAGAATCTAAACAACCTGATCAGCTCAAGGAGTGAGGCAGAGTCGGTAGCAGCTTGTCAGAAAGCAAGTCTACTAGAAAACACCACTTTCATTGATGCACAAAAAGTTTTGAAATACAAAGAAGTACAAGCCTTAGGAGAGTTGCTAGGATGGCTTCGGAACACACCAAACCTCATGGCTCTGTGCCTCATACTGGGGGAGGATCACATGCCATTGAGTCTTCCTTCGTCTTTGGCCGCCGGGCTGTATGGGAGCTGTCGGTCTCCATCTGATCGAACACGCATGTTGGCCGTGCTGCGATCGCTTATCAAGTATCAAATGGCCCCATCTGCAGACCCGAGGAAGGTGATTCGTAGTAACAAGTCGTCTCTGGCGCTGCTGTACTCCGTGTTCCGCGACAACCACGCCCCCACGCGGCAGTTCCTCCGCACGGTGTTGCGCGCGCCCGTACTGGCCGTGCTCACTGAGGACGAGTTCTTCCTCGACGTAGACCCCGATAAGGCAATGGAGCGATTCACTCCCGCTGATCGTGCTAAGAAGTTTGGTCAACCGAACACCACCGAATACAACACTAAGGTGGCCCGCTATCGAAAGTGGACGGTGCAGTCGTTGTACAACCTCACCAACAAGTTCATCGCCTCCCTGAAGGAGAACTGGTCGACCTTCCCCTCCTCCATAGCCTGGATTGTACAGCAGGTCGTGCATCTGCTCAAACAGAATTCCAGGAGCACGGACCGCGACATCCACACTATATGCAGCGAGTTAGTCCTGGGTCAGCTGCTGTGTCCGGCGGTGGTGAACCCGGAGTCCCACGGGGTCCTCTCCGCCCCCGTCTCGTACATCGCCCGCTTCAACCTGCTGCAGATCGCTCAGATACTACAGATGCTCAGTCTGGCAAAGTACCAGGAGGTGGAGCCGAAGGTCCGCGACCTGTATATGAAGTTCGAGCGGTCGTGTGTGTGGTCGCTGGTGGACAGTGTGGGCGGCAGCGCCTTCGCCCCCACCACGCTGGTGCCTCCGCCAGAGACCCCACTGAGAACCGACGCTCCGCCCCCCGCCACGGTCGCGCTATACACGCCACAGGACGCACACCTACTGATAACATTCCTAAGAAAGATCTCGTCCAAACTAAACAACAACAACCAGAACCCCAGCGAGGAGAGAGCTAACGGAGCTGCGGACAACGCTGTCGACTCTGGGTTTAGTGAGCATGTTCGTGACCAGAGCAATGGTGGTGCGGGCTGCGGCCGCCTGGAGGCGCTGGTGGCCAACAACGACCCCAACTACCGCTCCAGGCTGCACGACCTCTTACGAAAGATGCCAGACTTCGCCGCACTACGGCCTTCGCCGGTCGAGTGTCGCGAGTGGTCGGAAGGCGGACGGCGAGGTCTGCTCGGACGACGGAGGAGAGAGGAGCTTCACAGGGACGAGCTCGAAGTACTCGTGGTGAAGATACACGAGATGAACGAGAACGAATATGTCGGATTGATTCCGGAGTCCAAAGTCTTAGAGAACTACTACGGCGAGACGGAGGGAGACGAGGCCGGGGCACTCGTCGCCGCCGCCGCACCCGTACAGAAACGAACGAGATTCTCAGTGTCCCATGACGAGGTATCGTTGGGGAACACTTCAGACAACCTGGAGGCCGTGTCTGAAGCGGCCTCCAACCACAGCGTCACCTCCAGCCTGGAGCTCGAGACAGAGGATCAGAACGACAACCTGTCCGACATGGTGTCGGCGAACGTGAGCGGTCGCGGCAGTCCTAACATATCTGGTCGAGAGACGCCTTCCTCGCAGGTGACGGACGGAGACCCGGCTGCCGACCA
GGCCCTTAGACGCATCCCAGCCAACGCGCCCGCCAGCCAGGCCGTGAGCGTCGCGCACGCCAAGATGCTGAAGCAGGCTCGGAACGACATAGAGGACAAGTTCTGCAAGTTCGAGATCAAGAAGCTGCTGGAGGGAGACGAGACCATCAGCATCATGTCGGACACGTGGAGCACGGACGTGCTCGCCTCCGACTCGGAGACCATCGGCGACTCGTGTGACACCACACAGGCGGCGGCCGGACAGGGATCCAGCGCCAACACCATCGCCCCGGACGTGTCCGAGACGGCCAGCGAGTCCGCCTGGAGCGTGGACGTCCTGGCCTCCGACAGCGACAGGAACACTGAGGTCGACACTGATGACTGCGTGTCTGTGGCGGCTCGGTCCGACACCTCCTGGCCCCGGCGAGCCTCGCACCACGACGACCTCACACGAAGACATCTAGCACAGAACGGTGTCATGAAGCGGCCGGACCTCAACAGCCCCCGACACTCCTCAGCGGCACAGCTGCCTAACAGGAGCGGGATATACCTGAGCGCGACCGCGGCGTTGTCCCGGGGCGGGGAGCTGGACCTGTCCGCGCCCTACGACCACAGGAAGAGCATCCTCGTTAACGGATATCCGGTGATGACCTGTTACGCGTCAGCCCCCGAGAGCCCCAGCACTGCGCCCCCTCTGACCAACGATGTGTTCGACTACCCTCAAAACAACGGCGGGAGTTCCTCGTCCACGATAGAGGGCGGGCCAGCGGAGGCGCCCGAGGCCGAGGTCTCCTCAGAGGCGGGCCGGTCCTCGTGCTCCACGGCGGGACCGATGGGGAGCGGAGCACCCGGCTCGGACGCCACCAGCCAGTGGGTGGACGACAGCTTCCCCGACAAGTATCTGCCCTCAACGTGTAAAGGATACGAGGATAATGAAATGGACAGGTCATTGAACTCGAGCGAGAGTTCATTTAACGCGATGTCAGCGTCTCAGTACGACAGGAGGTTCGAGCCGGGGCCGGAGGCCGAGCGGGCCGAGCGACGGGAACCGGAACCGGAGCCCGGGCGCTCCTTGACGGAGCTGGTGACGCGCATGAGCGCCGCCAGCGTCAGCTCCAACATCGCCAGTATAACGAGAGCCGCCAAGACCGAGATCCGGACCAGCATCGTCAACATCAGCTCCTCGAGGATCGACAAGAAGACCAGTGACGTGTCGCTGAGCACGTTGTCCGTCAACAGCGAGAACTCCACGTCGGACCGCAGCTCCAGCCAGGACGTGAACTCGGACAACGTAACCGAGCGCCGGGTCAGCCCGCCGCCCAAGACCTCCACCGGCGCCATTCCCAAGAGCATCTCCTTCGATGCCACAGCTGAGAAATCTCAACGTCGCCGCATGATCGGCGAGGAGGGTCTGGTGGGGAACCTGGACGAGCTCAAGAACAACGTGAAGCGCTCCGGGGGGAACCTGCTGCACAAGATTAAGATGTTCAGACACAAGGCCCGGACACACCATGGAGATATGAAGGTGTCTACTGAAGAGGTTTCCGTGGAAGAGAGGAGCGAGCGAGACGGGGAGAGCTCGGAGGAGATACTCGCCAAGTACAGGCGAGCGGCCGACAGCGCTCCGAGACGACCACGACACGACAACAACACGCAGGCCGTCACGACACACGACCTGGAGCGCTGTGAGGGCGCGGTGGACCTGAACGACCCCGAGGTGTTCAACAACATGAAGAGGCGGCTGCGAATAGTGCTCTCCAACATAGACCTGCACTGCGTGGACTACGTACCGAACCGCTGTACGAGCGCGTCCCTGTCGGAGTGGTGTCGCGCGGGCAGCAGTGCGGAGTGCGCGGCCATGAGCCGTGCTCTGCTGAGCGGGGCGGGCGGGGAGGAGGGCGCGGCGCGTATCGCTCGTCTCGCGGCCTCGCTCCGGGCGGACCTCGCCTCACGCCGCCCGTACGCCGCCTACCTCGCCTCCTGCCGCGCCTCGCTCGCGCACGCGCTATACATGCTGCAGA
CCGAGGTGAAGGCTATCCGCCGTGAGTGTGAACGGTGGCTGCGGGCAGCGGCCGCGGCCAGGGTCCGGGAGCAGCTGGAGCGAGGGTCTTCGCTGAGGCGGCTGGCGCATCACACCACACAACACGCACACGCTCTACTACACGGTGGGGAGTTGATCGACGAGAGAGCGATGCGGCTGACCGCCAGCATCAAGGCCATCACGGCCGAGATCAAAAGCGACCCGTCCTGGGAAGGCGCCTCCGGCGTCATGATGGAAGGTCTGGAGCGCACCGTGGAGAGAGCCGTCTACACCAGGCTCTACCTGCACGTGATGTTCCCCAACGGAGACGGAGACATCGCAAGGGATCAGGTGTTCAGCGAGCACATCCGCCGCGTGTTGTCTGTGAGCGGCGGTGCCGGTGGTGTGGGCGTGGCAGCTCGTCACCTGTGGGCGGCGCCCTTCCCCCACGCCCAGCAGCAGCTGCGAGCCCTGTCCGCTCACCGCACTCCGTCGGACAAGCTGCGCTGCGTGCTGCGCTGCGTGCGCTCTCTGCTGGCCACCCTGGCGCTGTCCACGCCCGACCCGCCCGCCGCCGACGACCTCACACCCGCCCTCGTATACGTCATACTCAAGGTGAACCCGCCGTCCCTGCTGTCGACCCTGGAGCTGGTGAACGCCCTGTCCGGTTCGTCACTTCAGGGCGAGTCTCTGTACTGGTGGACTCAGTTCTGCGCCGCCGTCGCCTACATCAAGACCATGGACTACCCGCAGACGACCGGCTCACACACACATGCGCACGACTCCTAG

Protein sequence:

>DPOGS208832-PA
MMQPTSESCMNIAELANQLRREKIFINSERQLIQNLNEEVEKTVLELLQAAWICSQQRQNLNNLISSRSEAESVAACQKASLLENTTFIDAQKVLKYKEVQALGELLGWLRNTPNLMALCLILGEDHMPLSLPSSLAAGLYGSCRSPSDRTRMLAVLRSLIKYQMAPSADPRKVIRSNKSSLALLYSVFRDNHAPTRQFLRTVLRAPVLAVLTEDEFFLDVDPDKAMERFTPADRAKKFGQPNTTEYNTKVARYRKWTVQSLYNLTNKFIASLKENWSTFPSSIAWIVQQVVHLLKQNSRSTDRDIHTICSELVLGQLLCPAVVNPESHGVLSAPVSYIARFNLLQIAQILQMLSLAKYQEVEPKVRDLYMKFERSCVWSLVDSVGGSAFAPTTLVPPPETPLRTDAPPPATVALYTPQDAHLLITFLRKISSKLNNNNQNPSEERANGAADNAVDSGFSEHVRDQSNGGAGCGRLEALVANNDPNYRSRLHDLLRKMPDFAALRPSPVECREWSEGGRRGLLGRRRREELHRDELEVLVVKIHEMNENEYVGLIPESKVLENYYGETEGDEAGALVAAAAPVQKRTRFSVSHDEVSLGNTSDNLEAVSEAASNHSVTSSLELETEDQNDNLSDMVSANVSGRGSPNISGRETPSSQVTDGDPAADQALRRIPANAPASQAVSVAHAKMLKQARNDIEDKFCKFEIKKLLEGDETISIMSDTWSTDVLASDSETIGDSCDTTQAAAGQGSSANTIAPDVSETASESAWSVDVLASDSDRNTEVDTDDCVSVAARSDTSWPRRASHHDDLTRRHLAQNGVMKRPDLNSPRHSSAAQLPNRSGIYLSATAALSRGGELDLSAPYDHRKSILVNGYPVMTCYASAPESPSTAPPLTNDVFDYPQNNGGSSSSTIEGGPAEAPEAEVSSEAGRSSCSTAGPMGSGAPGSDATSQWVDDSFPDKYLPSTCKGYEDNEMDRSLNSSESSFNAMSASQYDRRFEPGPEAERAERREPEPEPGRSLTELVTRMSAASVSSNIASITRAAKTEIRTSIVNISSSRIDKKTSDVSLSTLSVNSENSTSDRSSSQDVNSDNVTERRVSPPPKTSTGAIPKSISFDATAEKSQRRRMIGEEGLVGNLDELKNNVKRSGGNLLHKIKMFRHKARTHHGDMKVSTEEVSVEERSERDGESSEEILAKYRRAADSAPRRPRHDNNTQAVTTHDLERCEGAVDLNDPEVFNNMKRRLRIVLSNIDLHCVDYVPNRCTSASLSEWCRAGSSAECAAMSRALLSGAGGEEGAARIARLAASLRADLASRRPYAAYLASCRASLAHALYMLQTEVKAIRRECERWLRAAAAARVREQLERGSSLRRLAHHTTQHAHALLHGGELIDERAMRLTASIKAITAEIKSDPSWEGASGVMMEGLERTVERAVYTRLYLHVMFPNGDGDIARDQVFSEHIRRVLSVSGGAGGVGVAARHLWAAPFPHAQQQLRALSAHRTPSDKLRCVLRCVRSLLATLALSTPDPPAADDLTPALVYVILKVNPPSLLSTLELVNALSGSSLQGESLYWWTQFCAAVAYIKTMDYPQTTGSHTHAHDS-
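
The transcript and protein lengths listed in this record (4788 bp and 1595 aa) can be cross-checked against each other. The sketch below is a minimal illustration, assuming the transcript is a pure coding sequence with no UTRs and ends in a single stop codon, which matches the sequence shown here (it starts with ATG and ends with TAG). The function name `cds_matches_protein` is hypothetical and is not part of any database tooling.

```python
def cds_matches_protein(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a coding sequence length is consistent with a protein length.

    Assumes the transcript contains only coding codons (no UTRs) and,
    when has_stop is True, one terminal stop codon that is not counted
    in the protein length.
    """
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


# Values taken from the record above: 4788 bp transcript, 1595 aa protein.
print(cds_matches_protein(4788, 1595))
```

With the values from this record, 3 x (1595 + 1) = 4788, so the two lengths agree under the stated assumptions.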