Monarch geneset OGS2.0

DPOGS203274
TranscriptDPOGS203274-TA3843 bp
ProteinDPOGS203274-PA1280 aa
Genomic positionDPSCF300229 + 268765-277132
RNAseq coverage41x (Rank: top 72%)
Annotation
HeliconiusHMEL0211260.074.28% 
BombyxBGIBMGA000454-TA0.085.45% 
DrosophilaRhoGAP100F-PD0.055.52% 
EBI UniRef50UniRef50_E2AF340.050.13%Rho GTPase-activating protein 100F n=8 Tax=Formicidae RepID=E2AF34_CAMFO
NCBI RefSeqXP_971877.20.054.78%PREDICTED: similar to rho-gtpase-activating protein [Tribolium castaneum]
NCBI nr blastpgi|1892411040.054.78%PREDICTED: similar to rho-gtpase-activating protein [Tribolium castaneum]
NCBI nr blastxgi|1892411040.054.66%PREDICTED: similar to rho-gtpase-activating protein [Tribolium castaneum]
Group
Gene OntologyGO:00071653.6e-43signal transduction
GO:00056223.6e-43intracellular
GO:00055152.4e-16protein binding
KEGG pathwaymmu:1102792e-32 
 K08878 (BCR1, BCR)maps-> Pathways in cancer
    Chronic myeloid leukemia
InterPro domain[880-1083] IPR0001983.6e-43Rho GTPase-activating protein domain
[890-1084] IPR0089369.6e-42Rho GTPase activation protein
[731-872] IPR0089732.4e-16C2 calcium/lipid-binding domain, CaLB
[92-200] IPR0014789.3e-16PDZ/DHR/GLGF
[737-851] IPR0000083.5e-06C2 calcium-dependent membrane targeting
Orthology groupMCL14588 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203274-TA
ATGATCATAAAAAAGGAAAATGGGCGCGCGGCGCCGGACTTGACGGCATCGCCGGGACGCGCGCCGCCCGGCCCGCCGCAGCCGCCCCGCGCCAACCACGCGCCGCCCTGCGTGCTGCAGCCAGACTTCCGCAAGGTATCGGGTGTCAGTAATGAAATATTCAGGCAAATTGAGATGGTGGAAAACGATCACGACACCACCACTGCAGCGGCGCTTGAGGCGGTGGAGCGACGCGGTGAGATGATTGTACGAATTCTTGAATTGAGACAAGTCGGAAGAAATAATGTTGAAGCCGCCAAAAAATTTTTTTCCTTACAAGATGCACGTCACATAGTTCAACTAGTGGAGATCGTGAAGCGCCCTGGTCAAACCTTGGGCCTTTATATACGAGAGGGAGACGGTGGTGCTAGGACTGATGGAGTATTCATATCACGAATAGCTCTCGAATCAGCGGTATATAACAGTGGATGTCTAAAGGTTGGGGATGAAATCCTGGCTGTCAACCTGGTGGATGTAAGACGCATGTCTTTAGACGATGTCGTGATTATTATGTCTATACCCCGCAGGCTTTTATTGTGCACTAGACAGAGAAAAGGAAAATCAGGCCCTGGTTCACCATCATTACCCAGATCTGAGCACAAACCACCTCCTGTTGTTGTTTTGAAAAGAGACTGCCGGGACGATGACGACCACGATAGAGTTGATGGACTTTATTCGCAACATGGTACCTTACGTTCAACGGGAGGTCCTGGAAGTTCTGTGAGACCAGTTGGGGATGGTAGAGAAGAAAGGAGTCGTTTACAACTTGGTGCATTATCACCAGATTCCACTCCACTTGATCTATATTACAATTCACGACCACCCTCCGATCATGCTACGTGGAGTTATCGGCCACCTCCGCCAGTTATTACGGAGCAACCTAAATCTACTGCAACACATTTTGTGCCATATGAAAGGTCTTATCCGAATACATTGGAAAGTCTAGCGGAAAAAGTTCATTCCTTTTACCCATCGGAGACTGGAGGCAGCAGTATGCGGTTTGGTGGAAGAGTTCCACGTTCTGGTTCTGAGCAACAGCTCCCAAGGGCCGAAACTCATTCTGACTTTGGCCGTCATTCACTGCTTCGGTCTAGTTTGAAAGCGTCAACTGCTGCAGGGCCAGGAGCTTCCAGTTTGACAAGATATGGACAACGTTACGGTGGAGTAGGAATGCCAGGTTCAGGATTACCTGGTAGTGGACTCACTGGCACGGGACTAGGCGGTACTGGTTTAAGTGGTTCAGGATTGCCTTCAGGGTTTTCTGCTACTGGCCTCAGTGGTTTGACTGGTTCCAGTTTAGTAGGGTCAGGTTTGGCAAGTTCTGGTCTTAGCAGTACGCTCACAGGAACTGGCCTAAGTTCAACACTTGGTGCAGGTTATGGCACTACAGGGCTAGGACTACCGTCATATGGAAGTAAATTTGGAACGACTAGAAGAAATCGAAGCTTGGATTACTCTTCTGATACTGAGGCAACAGCACCCACTAGAACTCCTTATTACTCAGGCTTGAGTGGTTACAGGAGTAGTACTTTAGGAAGGGACATAGGATCGAAATTTAATTCTTTACCAAGAGATGTCAGAGGAACTGGTCAAAGACTGGGACTAACTAGACGCACAGGGAGTGTACTTCAAGATGAACCTGAACCGTTGTCTTCACGATTAGACTTACGTTCCTCCAGAGGTCGCCTGCCTTCCTCTCCCTCAGTATTTACATCAGATGAATATCGCGCATGGTTATCTCGAGCTCCATCAACAAGTGCATTGTATGAAACACTTCGTCCTCGACTGCCTACACACTACTCAGCGGAAAATATACATGATGCGCTTAAGAATATGGAAAGTGGAAGTCGCTTTGGATCTTCTCTCGGCTTAGCTGGAAGGCGAATTGAAAGGCCTCGTCATTTACCAGCTAGATCGTTATCATCGCAACAGTTGGGACCGACAGCAAGCGGATCACCATCAGCTCGTCGTGTGAGACAGCTTCTTGAGTTGGGATCAAAATTTACATGCCCAAATCCAAGTCCTGTACCAACACCAGGTTCAAGACATCAACGACATCTGGATATAAACCCGAATGAGTTCCTAAAATATAAAGTGGATAAACCCGGTCAAGGAGGTTTATCATCGTCTATGACGGGTTTGTCTCGGCTCTCTGGAGGTGTCTCTGGTATGCTCTGGGTCCACCTTCTTGCTGGTAGAGGGTTGCGTCCAGCTCCTACTGGTTCCTCACCAGGCTCACCACCTTCAGGACCCCTTGCGCCACCACAACCTCCAGTAGCACCTCGTGATCTATATTGTGTCTTAGAATGCGATCGCGTACATAAAGCTCGTACAGTGGTTCGCACTGGTGAGTTGCAGTTTGATTGGGATGAGTCCTTTGAATTGGAGTTGGTTGATAACAGACAGCTAGATGTACTCGTGTATTCCTGGGATCCACAACACAGGCACAAACTTTGCTTTAGGGGAGCTGTGACTTTACCAGATTTATTAGCTCGTTCGCCATTTCATCAACTTGCTATAAAGATGGAACCGCGCGGGACGCTATACATGCGAGTCCGTCATACTGAACCACACGAGTTATTCCGTCGCCGCGTAGCGCCTTCTCGCATAGCTCCAGCACCGTTGTTTGGGGCTGAGTTAGAAGCAGTTGTGGCACGGGAATTGCGACCACCTCACGCACCACCTGTACCGTTGGTGGTAAGACGTTGTGTTGAAGAAATTGAGAGACGAGGCTTGGACATTATTGGACTCTATCGCTTATGTGGTTCTGCTAACAAAAAGCGTATACTTCGTGAGGCATTTGAGCGTAACGCACGTGGTGTTGAGTTGACACCGGACTCAGTTCCTGATATTAACGTGATTACTGGAGTATTAAAGGATTATTTGACCGAATTGCCACAACCTCTTATCAGCCGCTGTTTGTATCAAATGACGTTGGATGCTTTAGGCGTTTGTCTTCCCGATGATAAAGAAGGCAATGCTCGTCTCATGGCCTCTATAGTGGAATGTCTTCCACGAGCCGCAAGAGCTACTTTGGTCTTCCTGCTTGATCATTTAGCATTAGTTGTGGCTGCTCAAGATCGTAATAAAATGTCACCTCAGCATTTGGCTGTTGCTATGGCACCTCCATTAATGTTACATTCCCAGCCACCAGCGGAATTAGATTATCAGCGTCCAATACATGTACTCCAATGTCTTCTTCAAATTTGGCCTCCACCGAAACGTTCAGGCCGAGCTCCGCCCTCAGTCAGTCCATATCGGCATCAAGCGGCAGCATCAGCAGCATCTCCGCCGCCAGCTCTCTCGGGTCGTCCCGGAGTCGACCGCTCGCCGCCCGCACATGTTCTCTCATCAGTCAGGGGAGGCCAATATCGTCCGCAGTCGCCTCTTCGCGGCCCTCTGCCCGCTCCTCCTCGCTCCCGACAGGTGACGGTGTCATCTCCCGGCTCCCCCAGCAGTAGCTCCGGGAGCCACAGCCCAGCTGACACGATAAAGCATGGCGGCTCTGTGTCATCAATACTACGTCAACCGGAACGCGCGAGCTCTCCTCGCGTATCACCCAGACAATCACCTCGCGCATCTCCTCGGGATTCTCCGCGCGGGACACCAAGGGAGACTACGCCCCGGGAATCTCCTCGGCCTGCTATACCAGGAACGTCGGCCGGTCTGGCCGTGACTCTGAATTCCGAGCGGGGTTCGAGTCCACGCTACAGCTCCACCAATCCATTTCTGCAGCAGTATGATGCTGAGGAGGAGGCGGAGGCGTGGCGCGCTGCAGACATATTCTCATCCACGCACACATAG

Protein sequence:

>DPOGS203274-PA
MIIKKENGRAAPDLTASPGRAPPGPPQPPRANHAPPCVLQPDFRKVSGVSNEIFRQIEMVENDHDTTTAAALEAVERRGEMIVRILELRQVGRNNVEAAKKFFSLQDARHIVQLVEIVKRPGQTLGLYIREGDGGARTDGVFISRIALESAVYNSGCLKVGDEILAVNLVDVRRMSLDDVVIIMSIPRRLLLCTRQRKGKSGPGSPSLPRSEHKPPPVVVLKRDCRDDDDHDRVDGLYSQHGTLRSTGGPGSSVRPVGDGREERSRLQLGALSPDSTPLDLYYNSRPPSDHATWSYRPPPPVITEQPKSTATHFVPYERSYPNTLESLAEKVHSFYPSETGGSSMRFGGRVPRSGSEQQLPRAETHSDFGRHSLLRSSLKASTAAGPGASSLTRYGQRYGGVGMPGSGLPGSGLTGTGLGGTGLSGSGLPSGFSATGLSGLTGSSLVGSGLASSGLSSTLTGTGLSSTLGAGYGTTGLGLPSYGSKFGTTRRNRSLDYSSDTEATAPTRTPYYSGLSGYRSSTLGRDIGSKFNSLPRDVRGTGQRLGLTRRTGSVLQDEPEPLSSRLDLRSSRGRLPSSPSVFTSDEYRAWLSRAPSTSALYETLRPRLPTHYSAENIHDALKNMESGSRFGSSLGLAGRRIERPRHLPARSLSSQQLGPTASGSPSARRVRQLLELGSKFTCPNPSPVPTPGSRHQRHLDINPNEFLKYKVDKPGQGGLSSSMTGLSRLSGGVSGMLWVHLLAGRGLRPAPTGSSPGSPPSGPLAPPQPPVAPRDLYCVLECDRVHKARTVVRTGELQFDWDESFELELVDNRQLDVLVYSWDPQHRHKLCFRGAVTLPDLLARSPFHQLAIKMEPRGTLYMRVRHTEPHELFRRRVAPSRIAPAPLFGAELEAVVARELRPPHAPPVPLVVRRCVEEIERRGLDIIGLYRLCGSANKKRILREAFERNARGVELTPDSVPDINVITGVLKDYLTELPQPLISRCLYQMTLDALGVCLPDDKEGNARLMASIVECLPRAARATLVFLLDHLALVVAAQDRNKMSPQHLAVAMAPPLMLHSQPPAELDYQRPIHVLQCLLQIWPPPKRSGRAPPSVSPYRHQAAASAASPPPALSGRPGVDRSPPAHVLSSVRGGQYRPQSPLRGPLPAPPRSRQVTVSSPGSPSSSSGSHSPADTIKHGGSVSSILRQPERASSPRVSPRQSPRASPRDSPRGTPRETTPRESPRPAIPGTSAGLAVTLNSERGSSPRYSSTNPFLQQYDAEEEAEAWRAADIFSSTHT-