Monarch geneset OGS2.0

DPOGS207428
TranscriptDPOGS207428-TA3345 bp
ProteinDPOGS207428-PA1114 aa
Genomic positionDPSCF300087 + 406651-435826
RNAseq coverage374x (Rank: top 32%)
Annotation
HeliconiusHMEL0054470.065.05% 
BombyxBGIBMGA009325-TA1e-12866.51% 
DrosophilaRhoGAP102A-PE5e-8731.70% 
EBI UniRef50UniRef50_D2A1J40.041.29%Putative uncharacterized protein GLEAN_08395 n=3 Tax=Tribolium castaneum RepID=D2A1J4_TRICA
NCBI RefSeqXP_002423816.11e-18037.52%hypothetical protein Phum_PHUM087100 [Pediculus humanus corporis]
NCBI nr blastpgi|2700062260.041.29%hypothetical protein TcasGA2_TC008395 [Tribolium castaneum]
NCBI nr blastxgi|2700062260.041.07%hypothetical protein TcasGA2_TC008395 [Tribolium castaneum]
Group
Gene OntologyGO:00071654.1e-55signal transduction
GO:00056224.1e-55intracellular
KEGG pathwaymbr:MONBRDRAFT_303431e-18 
 K12490 (ARAP)maps-> Endocytosis
InterPro domain[651-841] IPR0001984.1e-55Rho GTPase-activating protein domain
[652-846] IPR0089363.1e-48Rho GTPase activation protein
Orthology groupMCL16522 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207428-TA
ATGTCCGATGCCGGTCGCCGGCTCATCCCAGACGGCGCCCAGAACCGCGCGGACGACGTGCCGCGGATCGAGGCCTACTTCCAGGAGGTGTGCCAGCGGGAGCCGCGGTTTTTGCTGTGGAGAAAGAGCTCTTACCCGGGAGTGCTGCCGAAACCCCGACGCAAAAAGAGAGCGCGCGGTGGTTCTGTGAGGGCACGTTCGCCTTCGGATGACGCTCCGCCTCCACCCACGAGGCCCACCGATCTCTATATTCCTCACACCTCGGGTGGCCGCTTTGATATCGCTAAGCTACGGCGAGATTTTTTCGCAGCGCCTCCTCCTTCTCCGTCCTCCACTGGATTTCCTTCAGTCAGTGCTACTTTTAATACTAATAATAGTGCTATTAAGAATGGATGCCCGCATCCGTCTATTTCTGAGGATGATGAAGGAATTTTAGTAGATTTACTACGTAAGTACCTGAAAGTTGAAGATGCTAGGGATCCACCTCGTGTGTCAGAGTCTGAGGAACTTATCCGAGCTCTGCGTGACTACCTCAAACGTCTGTCGGAACGTGAGCCTACAGATTCCGATACAGATCCCACGCGAAGAATTCTCCGTGAAAACCTCGGTAGGTACTACCTCCGTTCTTCCAATAGAGACAATACAGTGCAAGATTTATTGAACGATAAAAATTTGTTGAAAAACCTGTACCATGACCTTCGTAAACCGAAGCCTTATCGTGGTGGCAGGAGTGGCGGTGGTCCGAGTTCGCTCGGGTCTAGTAGTGGGTTTAGTGCATCTAGTTCACGTAGCGTGTTTTTTGGCCGAAATTTTGGTGGTTCAAAGTTTAACGATGAGAACTGTTCATCACCACCAATGTCTCCTCCACCTCTTATTGAAGTTTACGGCGAATCGATGGAGGATAAACTCGTCGACGTTGGAACGCAAACACTTCCTATACCTGAGGAGGTATTACAGGAGCAGGAACGAAGTTATAGAGAAAAGTTGGAAGCCGCTACGGCCCCAACGAGTCCAACGAGTCCACCTCCCGAGAGAAGACCTCGGCGACGTTCCAGCGTGGACCACGACGACGTCTCACAATCAGTCAGCGACACCATCAAGCGGTATTTGAGGATGGCGCGCAAAAAGAGCGTAGACGCTGAAAAGACTGATCGCTTCAAGCGTATAAACTATGATAAAAACTTACGCAACATAAAACCTCGTCAGCAGGGTGACGTTGACGACGACGGGCCTCACAAGGGATGTCAAACAGATGAGGCTTGGATATTAACGTACAGAGATTTACAATTCGCGTCTGTAGCATCGACCCCGACCTCTCCTCCGTCCCCGTCTCAGTCGCACTCTTTTCTGTCAACCCTTTTGGGTCGAAATGCATCAATGGCTCCTAATGCAGGTGGCATGCAAAAGTCTCGGTCTTCGAGCAGCGTGGTCCAGAGCGTCAGTAAGCGACTCTGGCGCACGAGGAGCCGGTCTTCGAGCCGCGTCGCAGCATCATGGACGCCTCAGGGCAGTTGTTGCTGGACAGACGGCGCCGGGCGCTGCGTGAAGCTGACGGACACCTCGCTGCTGTCTCTCACGGAGGTGGAGAGGAGGGCGCTACAGCAGGCGGCACTGGCCAGGCTGCAGCAGCTCAACCTTGGAACTACTATTAAGATACCTGAGGATAACACGTCGACAGTAGCGACTAAGCCGAAGCGCCGCGCCTACCTTCTGAAAAGGAAGGCGCTCACCACTGGCTTCTTCGACCAGCGACCCAAGGACGCTGAGAAGGAGAAAGAGTCCACGGGCAGCGTTTTCGGCGTGCCGTTGTCTCAATGTGTGGAAACAGAACGAGCTCTGAGGAGACAACATGGAGGTTCCAGGGCGTCTCTGGCTTCCATTGGAGGCTTGGAGAAAGGAGACGATAGTGAATCGTGTGACTCCGGTGAGTGGGGCTGGTCAGGGGTGGACGAGGGCAACGGAGGGCCGAAGGTCCCCGCGTTAGTATCTTCTTGTCTGTCCCACCTCCGAAGGCACGGTCTCGACACACTCGGGCTCTTCAGAGTGTCCGCCTCTAAGAAGAGAGTGAGACAGCTCCGCGAGGAGTGGGAGCGAGGTCAGGAGGCAGCTCTAGACGCGGCAGTATGCCCCCACGACGTAGCCACTTTATTGAAGGAATTCCTCAGGGATTTACCAGATCCATTGCTATGCAGGGATCTATATCCCGCATTTCTACAGACTCAAAAGATCCGTAACCGTCGTCTTCAGTGGGAGGCGCTCCGTCTGATCGTCCAGCTCTTGCCGGCGGCTCATCGCGACACTCTCAGCGCACTGCTCGCGTTCCTCTCGCAGCTGGCATCACACGCGGGGGACGAAGACACCCCCGGCAACAAGATGAATGCCGCTAACCTGGCCACCATCTTCGCACCCAATATACTGCATAAGAACAAACCCAACGAGACCGCGAGTGCGGAGCAGTTGTCGGAGAGAGCTGACGTCATCAACGTGGTCCGCACCTTGGTGGAGCGACAGAGTGAGCTTTGGTCGCTGCCGGCGGAACTGTTACATGAGGCCTACATTCATCTCGCACACCACGCACCGGCAGGGCTTGACGCACTGCTGCTTAGGAGGGCAGAAACAGCAGCAGAAAATGAAAAGGCCAATGCAGAGGGCGCCAAACGTCTTTGGTCTCGTGAAAGCTTCCTCCACGCGGCTGCCAACACTGTCCCCGCTGTCTCCAGGAGTAGTGTGACGGAGGGCGGGCGGGCGAGAGACTCGGACGCCTCCTCCGCATCACTGTCTTCTGCCGTTATGCTCATGACCAGGTTACGCAGCAGCGAGGAACGCGCCAGCAGCGGGGTGGTGAGCGGTCACAGGGACTCCGCCCCGGACTCCAACGACTCCAGCGACTACAACGAGGTGGGCGGTGTGTCGGAGGACGAGCTGGTGATCACGGCGTCGCTCCACATCCCGGCGCTCCGGCGTCGCTCCGTGTCCTCGTCCAAGCGGGACTCGGCGGTGGGCTCGTCGTCGTCAGCGGCGTCTCCGTCGTCGTGCTCGCCGCCTTCCTCTCCGCCGCCCCGACCGGCGCGGGACATCGACCGCCTCGTGGGCCTCTCGCGTGAACGGAACACTTCCGACTCCCGCGCGGCGGTCGTTGGCAGAACTCACGAAGAAGTGACAGTTTCAAGAAGCACACGCGTGTCCAGACAGGAGCACACCGTACGGCGAGAAGAAGTCATCAGGAGAGAAGACCAGAACACGAGGAAAGAAGATCACAACGCCAGAAGAGAAGACAAAAGAGAAGGGACGATGCTTTACAAAAGAGGCGAACTCATCTCAAGCGCGAGGACGCCGCCCGCATGA

Protein sequence:

>DPOGS207428-PA
MSDAGRRLIPDGAQNRADDVPRIEAYFQEVCQREPRFLLWRKSSYPGVLPKPRRKKRARGGSVRARSPSDDAPPPPTRPTDLYIPHTSGGRFDIAKLRRDFFAAPPPSPSSTGFPSVSATFNTNNSAIKNGCPHPSISEDDEGILVDLLRKYLKVEDARDPPRVSESEELIRALRDYLKRLSEREPTDSDTDPTRRILRENLGRYYLRSSNRDNTVQDLLNDKNLLKNLYHDLRKPKPYRGGRSGGGPSSLGSSSGFSASSSRSVFFGRNFGGSKFNDENCSSPPMSPPPLIEVYGESMEDKLVDVGTQTLPIPEEVLQEQERSYREKLEAATAPTSPTSPPPERRPRRRSSVDHDDVSQSVSDTIKRYLRMARKKSVDAEKTDRFKRINYDKNLRNIKPRQQGDVDDDGPHKGCQTDEAWILTYRDLQFASVASTPTSPPSPSQSHSFLSTLLGRNASMAPNAGGMQKSRSSSSVVQSVSKRLWRTRSRSSSRVAASWTPQGSCCWTDGAGRCVKLTDTSLLSLTEVERRALQQAALARLQQLNLGTTIKIPEDNTSTVATKPKRRAYLLKRKALTTGFFDQRPKDAEKEKESTGSVFGVPLSQCVETERALRRQHGGSRASLASIGGLEKGDDSESCDSGEWGWSGVDEGNGGPKVPALVSSCLSHLRRHGLDTLGLFRVSASKKRVRQLREEWERGQEAALDAAVCPHDVATLLKEFLRDLPDPLLCRDLYPAFLQTQKIRNRRLQWEALRLIVQLLPAAHRDTLSALLAFLSQLASHAGDEDTPGNKMNAANLATIFAPNILHKNKPNETASAEQLSERADVINVVRTLVERQSELWSLPAELLHEAYIHLAHHAPAGLDALLLRRAETAAENEKANAEGAKRLWSRESFLHAAANTVPAVSRSSVTEGGRARDSDASSASLSSAVMLMTRLRSSEERASSGVVSGHRDSAPDSNDSSDYNEVGGVSEDELVITASLHIPALRRRSVSSSKRDSAVGSSSSAASPSSCSPPSSPPPRPARDIDRLVGLSRERNTSDSRAAVVGRTHEEVTVSRSTRVSRQEHTVRREEVIRREDQNTRKEDHNARREDKREGTMLYKRGELISSARTPPA-