Monarch geneset OGS2.0

DPOGS202199
Transcript        DPOGS202199-TA   1788 bp
Protein           DPOGS202199-PA   595 aa
Genomic position  DPSCF300149 - 281539-292692
RNAseq coverage   630x (Rank: top 20%)
Annotation
Heliconius        HMEL009189         0.0      80.76%
Bombyx            BGIBMGA013521-TA   1e-132   62.67%
Drosophila        CG17082-PD         8e-76    34.36%
EBI UniRef50      UniRef50_Q8T0G4    1e-73    34.36%   CG17082, isoform A n=6 Tax=melanogaster subgroup RepID=Q8T0G4_DROME
NCBI RefSeq       XP_391837.2        7e-79    35.93%   PREDICTED: similar to CG17082-PA.3 isoform 1 [Apis mellifera]
NCBI nr blastp    gi|383862884       6e-79    36.49%   PREDICTED: rho GTPase-activating protein 28-like [Megachile rotundata]
NCBI nr blastx    gi|157107867       5e-85    38.05%   hypothetical protein AaeL_AAEL014901 [Aedes aegypti]
Group
Gene Ontology     GO:0007165   5.4e-30   signal transduction
                  GO:0005622   5.4e-30   intracellular
KEGG pathway      gga:416952   8e-08
                  K08878 (BCR1, BCR) maps-> Pathways in cancer
                                            Chronic myeloid leukemia
InterPro domain   [319-572]   IPR008936   5.4e-30   Rho GTPase activation protein
                  [386-578]   IPR000198   7.4e-30   Rho GTPase-activating protein domain
Orthology group   MCL16016   Insect specific

Nucleotide sequence:

>DPOGS202199-TA
ATGGTCGTGATATTGAAGTACGGGACCGCTGAATACGTTAACTTCTTCCATTGTGAGGGCGAATTAGAGACTGAATGGCTCCAATCAGCTGGGCTGGCGTCCCTGGCGGCGCCCTTCCAGGCCGGGCTGGAGGTGTCCGAGGCTCAGCTCGGAGAGGCCGTCCGACCCCTGCCGAGGGAACAGGCGGCGGCGGTGAGGCGAAGAGTGAGGAGGCTCAATAGAACAGTCAGACGAAGACGGGCTGCCTCCCGAGCCAAGAAACCGGATATACGAGATGTCTTCAGAGACTTGGAGAACTCTAGTGGTAGCGAGCCTCGTTCCCGTAGTGCCACCCCCGATTCTCTGGACTCGCTGCCCAGCGGTGGGTCCGGCTCACCCCCCACTGAGTGGGCAGACACCGCCACGCCTCCAGATTTCGTCGACAGTTACCCAGTGGGAACCCACGCTCCCCTGGCCCGCACGCCTTCAGCCCCGGCCCGTAGAATCCGCGTTTCTCCTCCTGGACCTAACGCTCACTCCCCGCCCAGACCATGCCACGAGCTGTTCAAACCCAATGACCTGCACTGGGCCGACATAGCATCTAATACCGAAGGTATCGAGCTTCTGGGATACCAGAGATATGGAACCGTCCAAGGTCCCAGGATTGGCAAGGAACGTATCAATGGCACCATACTGAAGAGCAACGATCCTTTCATAAAACATACGGCCGTGTCTCGGACCAAGAGTGCTGCGAGCGCTCAGCAGCCGCTCAGCTTCGAACATAAGTTCAATCTGGACCGACAGATGCTGGACCACGAGGAGGAGTGGCCGGAAGTGGACAGCACAGTCGACATAGAGACGGTCTCCGAGCCTCAGTTGAAGAAGCTGCAGCCGCTGCTGTGGCTGGAGCTGACCGCATTGTTCGACCGATACTCGCTGCCCTTCCACAAGAGGAAGCCGCCCAAGAAGAAGAGGAAAGAAGAAGGAGCCGTGTTCGGCGTGTCCCTGGAGACGTTACTGAGGAAGGATATGTTATTGTGGGAGGAGACGTGGAGCTCCGTGCCGGCCGTGCTGAGGGCGCTCGCCTCCGCGCTCAGGGACCGCGCCGCACAATCCGGCCTGCTCAGGGTGCCCGGCAACAAGCACAAGGATATGTTGTTGTGGGAGGAGACGTGGAGCTCCGTGCCGGCCGTGCTGAGGGCGCTCGCCTCCGCGCTCAGGGACCGCGCCGCACAATCCGGCCTGCTCAGGGTCCCCGGCAACAAGCACAAGATCGAGGCGCTCTGTCAACTGATCGAGCGTCAGTGGTACGAGGACCGCTCGTCCGTGGAGTCAGCTCTCCACCGCGCCACGGGTCACGACCTCGCCGCCGTCTTCAAGCGTCTCCTGCGCTCCCTGCCCCAGCCGCCGCTCACACAGGAGCTCATGAGACTCTTCTACCAGACATACACACTGTCAGGCGCCAGTCAAGGCCGCGCGCTGAACCTCCTCGTGCTGCTGCTGCCGGCCGAGCAGCGCGCCACACTCCGCGAGGTGCTGCGCCTGGTGCGGGAGATCGCCGCCATGGCCGACACCAACAAGATGAACGAACACAACGTAGCCATGATCATAGCGCCCGCCCTGTTCCCGCCCAGTCTCCTCATCAAGCAATCGGACAGCCTGGAGACTCAGCTGGCGACGGCGGCCAACAGTGTCCACGTGACGGAGGCTCTGATGCGATGGTGCGACCGTCTGTGGTGCGTACCTCCTTCACTGCTGGCAGCCTCACACCGCAAACCTGGACCCCACAGGAGGAACAATCACACTTGA

Protein sequence:

>DPOGS202199-PA
MVVILKYGTAEYVNFFHCEGELETEWLQSAGLASLAAPFQAGLEVSEAQLGEAVRPLPREQAAAVRRRVRRLNRTVRRRRAASRAKKPDIRDVFRDLENSSGSEPRSRSATPDSLDSLPSGGSGSPPTEWADTATPPDFVDSYPVGTHAPLARTPSAPARRIRVSPPGPNAHSPPRPCHELFKPNDLHWADIASNTEGIELLGYQRYGTVQGPRIGKERINGTILKSNDPFIKHTAVSRTKSAASAQQPLSFEHKFNLDRQMLDHEEEWPEVDSTVDIETVSEPQLKKLQPLLWLELTALFDRYSLPFHKRKPPKKKRKEEGAVFGVSLETLLRKDMLLWEETWSSVPAVLRALASALRDRAAQSGLLRVPGNKHKDMLLWEETWSSVPAVLRALASALRDRAAQSGLLRVPGNKHKIEALCQLIERQWYEDRSSVESALHRATGHDLAAVFKRLLRSLPQPPLTQELMRLFYQTYTLSGASQGRALNLLVLLLPAEQRATLREVLRLVREIAAMADTNKMNEHNVAMIIAPALFPPSLLIKQSDSLETQLATAANSVHVTEALMRWCDRLWCVPPSLLAASHRKPGPHRRNNHT-