Monarch geneset OGS2.0

DPOGS200177
Transcript: DPOGS200177-TA  5166 bp
Protein: DPOGS200177-PA  1721 aa
Genomic position: DPSCF300128 + 574017-626176
RNAseq coverage: 989x (Rank: top 13%)
Annotation
Heliconius: HMEL009150  0.0  75.39%
Bombyx: BGIBMGA002926-TA  0.0  67.92%
Drosophila: RhoGAP19D-PA  6e-46  37.37%
EBI UniRef50: UniRef50_E2BKC4  3e-90  44.32%  Rho GTPase-activating protein 21 n=9 Tax=Eumetazoa RepID=E2BKC4_HARSA
NCBI RefSeq: XP_391884.3  1e-86  43.30%  PREDICTED: similar to Rho GTPase activating protein 21 isoform 1 [Apis mellifera]
NCBI nr blastp: gi|332021331  2e-96  38.49%  Rho GTPase-activating protein 21 [Acromyrmex echinatior]
NCBI nr blastx: gi|270013936  6e-129  29.26%  hypothetical protein TcasGA2_TC012615 [Tribolium castaneum]
Group
Gene Ontology: GO:0007165  4.7e-55  signal transduction
 GO:0005622  4.7e-55  intracellular
 GO:0005515  1.2e-16  protein binding
KEGG pathway: mbr:MONBRDRAFT_30343  3e-22
 K12490 (ARAP)  maps-> Endocytosis
InterPro domain: [1132-1305]  IPR000198  4.7e-55  Rho GTPase-activating protein domain
 [1128-1313]  IPR008936  3.3e-51  Rho GTPase activation protein
 [54-197]  IPR001478  1.2e-16  PDZ/DHR/GLGF
 [976-1032]  IPR011993  3.9e-13  Pleckstrin homology-type
 [788-1026]  IPR001849  5.1e-07  Pleckstrin homology domain
Orthology group: MCL14487  Patchy

Nucleotide sequence:

>DPOGS200177-TA
ATGGCACATCAGTGTGGTACGGCTTTAATTAAGGCGTCGCGTCTCAAATGGTTGTTTCGCAATGTTAGCCAACAAAGATCTGAATCTAGTACTATATCGAAGCCCTCGCCTCCGTCGATCGTCCAGGCTCCAGCTGTTCTCTCCCCGCCGGTGTTCCCTTCACCCCCCGCTCCCCCTGGAACCCTCGCGCCCTCAGCTCCCATCTCATCCACTTCTCTGTCAACAAATTGCTCCGCCCCAAGGACCATCGTCATCCAGCGACCCGACGCCAGGCACGGCTTCGGTTTCACCCTCAGACATCTCGTTGTTTACCCACCAGAGTCGTTAAACGAGCTATATGAGGATGCGCGCCACCTCGCTGTGGGCGCGGTGGGCGCTCCGATGGACACAATCTTCGTAAGGTCTGTGCGTGCGGGCGGGCCGGCGGCGGCCGCGGGGCTGCGAGCCGGCGACCGTGTGCTGGCCGTCAACGGAGAGCCGGTAGCCTGCGGCCGTTCGCGAGCGCCCTACGCCAGGGTTGTGCAACTAGCGCAGAGGGAACCCAGGGTGTTGAGGCTCACCGTCGTGCCGAGGGATCACGATCTCTTACAGAGGTATTTCAGCGAAGCAGCGTACAATCCGGAGACTAACCAGCGACACAGCGAAATGCCGGCGGTGACGGAGCACGAGTCGCGCCTGTTCGACGCCTACCGCCGACCAACCACAGGAAGACACGACCGTCCGACCGCTGTCCGACACGACCGTCCTAGTGAGCCGGAGTACGCGACGGTGGCGCCACCACCCAGGATCGACGCTCCCCGACTATCAGCAACGCATTACATTATGCATGATGGGAGAGAATCCGATACTTCTTTGCCATCATCAGCTACAGAAAGTAAAGACAGTCTCGGCTCATTTGACTCCAACTCGACACTGACGGGCCGAGAACTGGACGATTCGATTATCCTCTCCCGAATACGAAAGAGTCTGGAACAGAAGGAGGCGTTTCTGAGACGTCCCAGCCAACCCACGGGATGGCTCACTCCAGATGCCCCACCGCCGATTAAACAAAAGGAATTCTACGCGAGGCCACAGAAATTGCAAAAGCAATCGTGGCCACCGCCGCCTTCTTCACCACCACTGGCCGCTTCTCCGCCGCTATCTCTGCTTCAACCGACAAACCAGGAACTCAAAGCAGACGAGTGTGCGACCAGTGTTGATAGCGGAACGTACAGTGGATATGGCGAGTTGAACGGTACTCACAACGACAAAAACAGGTTGAAGCAAGACAAGACCCAGTTCGTGGCAACTTTGTCCAAAATACAGGAAACATCGCCGAATATACACAATAGCACGAATATGGGACCGCTCAGCAACGGCTCGTCGGCTGAATTCAGGGATGAGAATATGGCTTCTAATAATCAGGACAACAGGTACCCAGTACCCGAGCTCCAACTCGTCACAGCTCGTACCCGTCAGTTGGAAGAAGCGCGTGTGGGTGAAAGGAGAACCGAAGTTGCTAGAAGTGAATTAGCGCGGCTGACGACAACCCGGCTAGAACCGAGCGTGGCGCTACGGAGGAAGGAGTTCGAAGCAAGAGCGGCCAGGAGAGAGGCGCGGTCGCTGGATGTGCAACCACAGCAAGACTTAGAGGAAGAATCGACACCCCTTGGCGGCAGCTTATCGGGCAACCAGCTAATGATTACTGGCAGCAAGCACCTCCACTGTACGCCTCCAGCAGGATTCGCTCCCGAAATAGAAAAGGTTCAAATGCGAACAAGGTCAAATAGTACCGGTAGTGAAGGATTACCCATTAGAAGACATCATGCCACAGATGGCAGTCACGCGAAGAGACGCATGGGGCTGAACAGGGAACAGATGGAAGCGATGAACCGCGTGTCGCTGCCGGGCGGCGCAGCGCGGCCAACTCGGCTCAGCCTGGCGCCGCCCTCGCCTACGCCGCCCACTTCCTCGGGCTCGGCGCACACGTCCGGCTCGCCAACGCCCACCACGCCCGCTGA
CTCAGGTTTTGCTGACGACTTGGTTACGAGAAGACACAAAAAGGATGCGCAAATGGGAGAAGAAGAACGTGCTATGAGAAGGGTGTCCTACCTTAAAGCAACGTGGGGAGACAGACTACACTTGGACAGCGATGTGGAACCAGAGGCTGAACAAAACGCTCTCAGGAGTGCTTACCGCAAATGGCGGCCACCACGATTCACAGAGGACATAACGCCGTTGCTACGGATATTCGACCCTCAAGCGTCGATTCCAACGCGTCCGGAGAAGAAGTCCCAAGAGAGCACAGAGAACTCCGAGGACGTGGAAGCTGACGTACAAGGGAACGTTCAGATAAAGATGGTGGTCAGCAATGGAAAGCGAGCCAGCGATCGCTCGTGGAAGCAAGTCTGGGCGATGCTGCAAGGCACCAAGCTGTATCTTTACAGACATCACCCCACACAGGTAAGAGACTTTGACTATTCAACAAACAAGCACTATGATTACAAATTTTATATAAATTTAACTATCGTCACGGTTTGTCACAAAACTAAAAAAGAAATAGCCAACAGTAACAGTAAAGAAAGTATCGGCGATTTGTCCCCGTTGTTCAAAAGCGACTCGTCCGGAAAATATTATCAATTTGGGAAGAGTAAAAACGTCGTTTTGGCAGACGCCGCGGAGTCTGCGCTAGAATACTTTCTCTGCTCAGACGCAAGTCTCGCCATAATCCTTCAGGCGACCGTCGGAGCCGCCCGTCAGCAGCGCGAGTCCCTCACTCTAATGCTGAGCTTTGACAATGACCAAGAGACGTCTCGTTTGGCGTTGATTCGGATGGACTCCAGCCAAGCCATGACGCTATGGTACACAGATCTACTGAGCCCCGGTGCTGTGAGCGGTAGCGGCGGATCGGCCTCCACTGTAACAGAGGGCGGCTCCCGAGATGCTCTCGATGTCCGCTACTCTCTCGCTGCTCTCGCAGACGACTATACGAAAAGAAAACACGTAGTAAGAGTCACTACAGCCGCTGGTGCAGAACTTCTGCTACAGGCTGAAGGCGCGGCAGATGCACAACGTTGGCTAGCAGCACTCAGAAGACATTCCGCAGAACCACCACCATCAGAGCCGGTGTCTATACCAACAGCGCCGGCCTCTGCGCCTGCGACAGCCACCAGCGCCCCCGCCACAGTGGAGTGTCCTTCACCTTTGCCACCACAAAGAAACAAGAAGATTGGCAGGAACAGGTCGCCTACCGGTCCTCCAACACCTACTCTGCCTTCGTCCCCCAAGAATAAGACTTGGAAGGGTCGTATGGCAAAACAGCTGCGTCGTATGCACGGCGGTGCCAGCGCTGCTCCGCCAGTCACTGGCTGGTTGGGTGCCCCACTTGACCGCTGTCCCAGCGACCCTGAACACCCGCTTGTGCCTAAAGCCGTCACCTTACCCGCTCATGCTGTAGAGGCGTACGGCCTCCGCACAGTAGGGGTGTACCGCGTGCCGGGTAACGCGGCCGGAGTGGCGGCGCTGGCTGCCGCCCTGGACCGCGGGGAGCCGCCCCCCGCCGACGACTCGCGCTGGGCGGACGTCCACGTGGCCTCCAGTCTGCTCAAGGCCTACCTGAGGCGACTGCCCGATCCCATTCTCACCGCACATCTATATCCAGCTTTTATTGCGGCTGATCGTTCCCCCGAGAGAGCTCGCGAACTGCGCAAGCTCGTCCACGCCCTGCCCGACGCTCACTACGAGACACTAAAGTACCTGATCCAACATTTGCGCAAAGTGGTCGCAAACTCCGCTTATAATAAAATGGAAGCGAGAAACTTGGCCATCGTTTTCGGGCCAACGTTGGTGCGCGCGGCCAGCGACGACATGCTGGCGATGGTCAACGACATGTCCAGTCAGTGTCGTATAATAGAATCCTTCTTAACACATTACGAATGGTACTTCGAAGAGGAAGAAGGCTGTCCTCCGTCGGAGCCCCCTCCGCCGGGAGACGCCCTCACCCCCGCGCCTGCCCCCTCCC
GCGACCTGCTCATACATAACGTCAAGAAGATAGAAGCAGGCAAAGACGTGTCCACACGTGACATAGTGTCGTCTATAATATCGGCCGCGAACCGTAAGATCCAGCGCAAGCCTCGCAAGACTGATAAGAAGTGGGAGGAGAAGGCTCGCTCGCCCGCGGGCTCCCCTCCCCGCTCCCCGCTCGCCTCCCCTCCGTCGCCCTCGCCCGCACAGGCACACGCACACACGCACGTCACCACGCTCGCACACTACGACGGTCTGCCGCACTACATGGAACAGCACCACCAAACTACCAATAAAATTGAAATGGAGTCAGTGACCTCGATGGGCCGCGCGAGACAGGAGATATCGCGCCACACACACGCCTTCAATAACTTCTCCATAGACGAAACCGGCGACCTGGTGTCTTCCTTAACGAGCACCTTCGACCAGAAGCTGCGCTCCCTTAACAACTCGTCCCCGCTCGACGACGGCAGCATCCCGTACGCCGACGAGTGCCAGGAAGACGCCGACGAGGGCAAGTCGAGCGCCTCGCCCTCCGAAGCGGGCTCCGTCCGGGGAACGTCCCCCGCGAGGCACAGAGCCGACCAGCTCAAAGCGGCCTGGCTGAGGTCCGGCTCCTCCTCCTCCGACGAACATGACGCGCGCTGGCCACCGCCCAGAACACTCCCACCCGGACCACCCGGCCCGCCTGGGCCCCTCGGACCGCTCGGGTCTCACGACGACACAGACGATTCCGAAAAAGAGAATTCCGGCCCCAAGAACGGCCCCGACGACGACAAAGATGTCGACACGCGGTCTCAGGCTTCCGCCGGGGACTCCCCCGACAACGATCGCAGCGCGGCCTCCGCGGGCGAGGGTCCACGTCGCTCCGAGTCACTCGGCCGCGTGCTCCGCTCCGAGAGCCTCAACTGCCGCAGCGACCGGCCGCAACGCTCTGAATCTCTCAGCAAGGCGGAGCGCCCGACCAAGTTGGAGAAAGGCGAGTGGAACGTGGTGCGACGCCGCGAGGCGGGCGCCTGGCGCTCCACCAAGCTCAAGCGGAAGAACGGTATGCCGGAGCGAGGCATCAAGCGCCGCCACACGGTCGGCGGCACCAAGGACTTCGACAAGGACGGCTGGGCGGCGCGCCACACGCGCACCTCGTCCCCGGACCTGTCGTTCTAA

Protein sequence:

>DPOGS200177-PA
MAHQCGTALIKASRLKWLFRNVSQQRSESSTISKPSPPSIVQAPAVLSPPVFPSPPAPPGTLAPSAPISSTSLSTNCSAPRTIVIQRPDARHGFGFTLRHLVVYPPESLNELYEDARHLAVGAVGAPMDTIFVRSVRAGGPAAAAGLRAGDRVLAVNGEPVACGRSRAPYARVVQLAQREPRVLRLTVVPRDHDLLQRYFSEAAYNPETNQRHSEMPAVTEHESRLFDAYRRPTTGRHDRPTAVRHDRPSEPEYATVAPPPRIDAPRLSATHYIMHDGRESDTSLPSSATESKDSLGSFDSNSTLTGRELDDSIILSRIRKSLEQKEAFLRRPSQPTGWLTPDAPPPIKQKEFYARPQKLQKQSWPPPPSSPPLAASPPLSLLQPTNQELKADECATSVDSGTYSGYGELNGTHNDKNRLKQDKTQFVATLSKIQETSPNIHNSTNMGPLSNGSSAEFRDENMASNNQDNRYPVPELQLVTARTRQLEEARVGERRTEVARSELARLTTTRLEPSVALRRKEFEARAARREARSLDVQPQQDLEEESTPLGGSLSGNQLMITGSKHLHCTPPAGFAPEIEKVQMRTRSNSTGSEGLPIRRHHATDGSHAKRRMGLNREQMEAMNRVSLPGGAARPTRLSLAPPSPTPPTSSGSAHTSGSPTPTTPADSGFADDLVTRRHKKDAQMGEEERAMRRVSYLKATWGDRLHLDSDVEPEAEQNALRSAYRKWRPPRFTEDITPLLRIFDPQASIPTRPEKKSQESTENSEDVEADVQGNVQIKMVVSNGKRASDRSWKQVWAMLQGTKLYLYRHHPTQVRDFDYSTNKHYDYKFYINLTIVTVCHKTKKEIANSNSKESIGDLSPLFKSDSSGKYYQFGKSKNVVLADAAESALEYFLCSDASLAIILQATVGAARQQRESLTLMLSFDNDQETSRLALIRMDSSQAMTLWYTDLLSPGAVSGSGGSASTVTEGGSRDALDVRYSLAALADDYTKRKHVVRVTTAAGAELLLQAEGAADAQRWLAALRRHSAEPPPSEPVSIPTAPASAPATATSAPATVECPSPLPPQRNKKIGRNRSPTGPPTPTLPSSPKNKTWKGRMAKQLRRMHGGASAAPPVTGWLGAPLDRCPSDPEHPLVPKAVTLPAHAVEAYGLRTVGVYRVPGNAAGVAALAAALDRGEPPPADDSRWADVHVASSLLKAYLRRLPDPILTAHLYPAFIAADRSPERARELRKLVHALPDAHYETLKYLIQHLRKVVANSAYNKMEARNLAIVFGPTLVRAASDDMLAMVNDMSSQCRIIESFLTHYEWYFEEEEGCPPSEPPPPGDALTPAPAPSRDLLIHNVKKIEAGKDVSTRDIVSSIISAANRKIQRKPRKTDKKWEEKARSPAGSPPRSPLASPPSPSPAQAHAHTHVTTLAHYDGLPHYMEQHHQTTNKIEMESVTSMGRARQEISRHTHAFNNFSIDETGDLVSSLTSTFDQKLRSLNNSSPLDDGSIPYADECQEDADEGKSSASPSEAGSVRGTSPARHRADQLKAAWLRSGSSSSDEHDARWPPPRTLPPGPPGPPGPLGPLGSHDDTDDSEKENSGPKNGPDDDKDVDTRSQASAGDSPDNDRSAASAGEGPRRSESLGRVLRSESLNCRSDRPQRSESLSKAERPTKLEKGEWNVVRRREAGAWRSTKLKRKNGMPERGIKRRHTVGGTKDFDKDGWAARHTRTSSPDLSF-