Monarch geneset OGS2.0

DPOGS214434
TranscriptDPOGS214434-TA4377 bp
ProteinDPOGS214434-PA1458 aa
Genomic positionDPSCF300069 + 643819-652601
RNAseq coverage673x (Rank: top 19%)
Annotation
HeliconiusHMEL0106200.067.67% 
BombyxBGIBMGA011353-TA0.064.59% 
DrosophilaCdGAPr-PA5e-16142.15% 
EBI UniRef50UniRef50_D6WYR30.047.73%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WYR3_TRICA
NCBI RefSeqXP_968224.20.047.73%PREDICTED: similar to cdc42 gtpase-activating protein [Tribolium castaneum]
NCBI nr blastpgi|2700119560.047.73%hypothetical protein TcasGA2_TC006051 [Tribolium castaneum]
NCBI nr blastxgi|1892397810.036.38%PREDICTED: similar to cdc42 gtpase-activating protein [Tribolium castaneum]
Group
Gene OntologyGO:00071652e-59signal transduction
GO:00056222e-59intracellular
GO:00055153.3e-12protein binding
KEGG pathwaynve:NEMVE_v1g2392942e-25 
 K08878 (BCR1, BCR)maps-> Pathways in cancer
    Chronic myeloid leukemia
InterPro domain[348-554] IPR0001982e-59Rho GTPase-activating protein domain
[362-552] IPR0089361.5e-52Rho GTPase activation protein
[243-318] IPR0014523.3e-12Src homology-3 domain
[251-302] IPR0115112.7e-06Variant SH3
Orthology groupMCL15967 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214434-TA
ATGTCGTGCCAGTCGCTGGCGGCGCGCGCGTTCGCCGAGCCCGAGAAGGAGTCGCACTGCGCCCGCGCCGCCACACACAGAACCGTACGCTTCAGCTCGGACCCTCGCAGCCAGAACAGGACCACGAGCGAGTCAGAATGCAAGAGAGAAACACCCGGTTCGATGTCCCTGAGCACGTTGTCCATGTCTCCAGCCATGAGCCAGTCGGCGCCGAGCGCCGCGGCCGCCTTCCACATGGACGACCAGCCGCCCACATGTCGTTTCCCCAAGTTGGAGGAGTGCGCTCACTTCCACTACGAGCGAGTAAGTCTGGGGGGAATCAGCGTGGAGCTGGTGCCGTGTGACTTCAGCGGGGACTCCCCCGGGGAAGGTTGGGTGTGCCTGAAGGTCAGTAGCCACGTGAACTCCTGCGAGGAGGACGCAGCCGGCGCCGGAGACGACACCTGGACGCTCACCAGGAACAGGGAACACTTCCTACAGCTGGACGACATGCTGCACAGGTGTATATATGACAGGAAGGTGTCGGGCTTGCCGTGTCTCTCGTCAGCGCTGTCCTCGTCCCTGGACGAGCAGGCGTACTCCCGGCTGGTGGCGGACTACGTCCATCACCTGTCCATCATAGCCGACGACTCCATCAACTGCGGCCCGGCACTCAACTGGCTTCAGATGGACAACAAAGGTCACAAGCTGCTGGTGGCCAACGAGGACTCCAGCTCAATTAACACTCCAGCTGTAGCGGCGGCTTACTCCGTGAGGAAATATGTGTCCCAGGCTCGGGATGAGATCAGCTTCGACGTGGGTGACATGATCTCGATAATCGACATGCCAGGTCCCCAGGAGTCCCTTTGGTGGCGCGGCAAGCGTGGGTTCCGCGTGGGTTTCTTCCCTCACCACTGTGTGGCCGTCATCGGAGACAAGGTCCCGCGGCACATGACGCAGCCTCCGCCCATCGTGGGGTCTGTGGCTGTGGCACCCATAAAACCGGTCCTCCGCAAGCACGGGAAGCTGATATCCCTGTTCCGGAGCTTCATCCTGTCCAGACCTTCGCGGCGGAGTCTCAAGCAGCAGGGCATCCTGAGGGAGAGGGTGTTCGGCTGCGACCTGGGAGAGCACCTCACGAACTGCGGACACGACGTCCCTCAGGTGTTGGTGGAGTGTTCGCGGGCCATCGAGCAGCGCGGCGCGGTGGACGGCATCTACCGTCTGTCGGGGGGCGCGGCCCTGACCCAGCGGCTGAGGGCCGCCTTCGACGCGGGCCTCGCCGCGGACCTGCGGGCCCCACTCCAGAGAGACCCGCACGCGCTGGCCAGCCTCCTCAAAATGTACTTCAGGGAGCTGCCCAACCCTCTGTGCACGTACCAGCTGTACGACAGCTTCGTGTCGGCCGTCACCGCCCCGGAGCAGCTGCGACTGAAGGCGGTGAGGGACACCGTGGTCAAACTCCCGCCGCCCCACTACAGGACGTTGTCGTACCTCATGCGTCACTTGCGGCGCGTGTCCCTGCTGAGCGAGTCCACGGGCATGACGGCCAGGAACATGGCCATTGTGTGGGCGCCCAACCTGCTGAGGTCGCCGGCGCCGCAGCACGCGCTGCAGGGGGTCGCCGTGCAGGCGGTGGTGACCGAGTTCCTGATATGCTACGCCGAGGAGTTGTTCTCCAAGGAGCAGGGAGCTGACTCCGGACCGCCCTCCATAAGTCAGGAGAGTCAAGAATCTCAAATAGAACTGCGGGGCCTGGAGGCCGGCAGACGACCCAAGAGTCTACCTTTGAACCCGCCCACTAAGTTGCTGAGCCTGGAGGAGGCTCGGCGCCGCGGTCGACCTGCTCGGACCTCTCCCCCCGCCGACCACGTGCGCCTCTCCACACCCGACACCATGGCGCTCCGGCCCGCCTCACACCTCACGCCCAAATATATAGAGGTCGGCTCCGGCCCCAACAACCTGCCGCAGTACCACACCGTGCTGGAGCTGCCGGGGCCCGGGAACAAGAGACTCAAGCGGTCCCCGTCCGGCTGGAGGGGGCTGTTCACCAGGAAGAGGAGCTCGCGGCCTCCGCTCATACCGCACGTAATGGTACACATGACGGAGCCGTCAGCGACAGCGCTGAGGCCGGCCGTCTCGTGCGAGTCTCTCAGCGAGGGCGAGGCGGCTCCTCCCCCGCGACCTCCACACCACACCAGGTCATCGTCGTGTGACTCGTACTTCGAGCCGTGGCAGGCTGAGCTCGCCACCATGAGGCTCAGACTGTCCCCGCAGGAGAGGGACCGGCACATGTTCAGCGAGGAGGACGACGCGCACGCGCAGCACGGCGCCGCGGAGGACTCCCTCAACACCACGCCGGGGAACAGCGCGGTGGGCAGCGCCGCGGACACGCCGAGACGGACGAGAGACGACAACACGGAGCGGAAGAGGGTGTCCCTGGAACAGCTGGAGCGGCGCGTGGCCAGACTCGGCTACATCGACAGCGACGAGCCCGGGGACAACACGCGCGCCAAGAGACTGTGCAAGGACCGAGACAGTCCGCAGTCTTCAGGCCTAGAGCTGCCGGCCCTGACTAACGTTAAGATGAGAGAGAGAACCTCCCCGAGGAAACAGAAACCCTCGGCTCGATACAGCGGTCTCCAGCAGCCCGTCACCGCGGACAGCGAGCGACTCACCTGGCACGGCAAGGATCACTCCACGCTGATACGGATCGACTGGCCGGAGACGCCCACCCCCACAACCCCCACCACCCCCGCCCCCGCCACACCGCTGTACGACCCGCTGGAGAGTGACTCGGAGCCCAGCGACAAGCACACCATCACCATCAAGAACAGCTGCCGGAACTGCGACGAGGAGAAGTGTCTCAAGTGCGAGCTGAAAGCCGCGGACTACGAGAACGTGGCGGCCGCAGACCTCGCCAGCCTGGGCTCCACCGACATCAGCTACCACAACCTGAACCGCCTGTCCGCCGTCTCCAGCTCCTCCGCCTCGGAGCACACCTCGCACCGCAGGGACGACCTCATGAAGATCATGACGTCCTCTCACGAGAGCTCCTCCGGCTACTCCAACGTGAGCTACAAGCAGGACGAGCCGTGCGACCGCTACGACTTCGCCAGACCCGACTATGTCAACCTGTCCTCCAGCACGTCCAAGAGCTCCCCGGCCAGTCCTCTCAAGAGTCCACTGAAGTCCACCATCAGCATAACGTTCCGCTCCCCGGGCAGGGTCCAGACGCCGGACTACGAACCAATAGGCAACGAAGACACGCCCACCAATGAGCGGGACTCGGTCTACGAGGACGTAGACCTGGAGAAGAGTCTCTCCATAGTGGAGGAGGCCAGCGTGCCGCAGACCGACGCCAGTCTCCCGACACAGGAGGCCCGGCAGCCCGGGGACTACGGCCGTCTCGCCTCCGACCTCATCATTCTGGATACGCCCACCGACGACCGAGACCTGTACAGCCAGGTCAAGTTCTTCAAGAAGAGCATCGAAGAAGTCAACGCCATGATACTGGAGACACCGGACAAGGAGGCGGACTACGAGCGCGTGGACTTTAAGACGAACAACTACAGCTTAGACGAGACCGAGGAAGACGGACACACGGAAGACGGGGGCGGCTCGGAGAGGGAGAACGGAAACGTGATCACGACCAGCAACAACCTGAACGTGCGGGAGCTGGCGAACAGGTTCGAGAGTCCTACGGAGCAGAAGGGGGCGTTCACCTTCGACAAATATAAGACGGAAGACAAACATGCCGCGGCCAGGAGGGACGACCACCAGCCCCGGACCACCGCGAGGACCCTCGCCCTCGCCCGGGACACATACACGCTTACCAAGAACGTCACCGCGAGGTCGCTGGACGAGAACGCGTTCGTCAAAGAGTTCGGATCAGACAGAGACAGGAGGAAGAGTCTAGAGGTGAAGGACGGACAGAGACAGGCGAGGGGCGTACCCGACCTCAACCTGAACACGGAAGACCTGCAGACCAAGACGGATACGGCGGAGGACAGGGTGGCGCTGATACAGGGAGCGGGCAAGGAACACGAGAAGATGCTGAGCAGGCAGAGAATAGAGAAGTACAAAGAGGAGAGAAGGAACTTCCTCAGGGAGAAGTACAGCTCCCAGTCCTTCAGGAGCAGCCCGGAACAACTCACCAGGATCAAGCTGAAGAAGAGCGACCACGAGGCCAGGACGGACGAGCCGCACAAGTTCGAGAGGAGGAACACCGTGGACCTGGGACAGAGGATGAGGTTCACACTCGCCAGGAGCGCCAACGACCTCGACAGTATCCCCTCCCCCGGCAGCCAGGAAGACAGGTCAGAGAAGATGTCGCCCTCCTTCAACATACGGGACATGACGGCCATCTTCGAGCAGAAGTCGCAAGGCACCGGCTGA

Protein sequence:

>DPOGS214434-PA
MSCQSLAARAFAEPEKESHCARAATHRTVRFSSDPRSQNRTTSESECKRETPGSMSLSTLSMSPAMSQSAPSAAAAFHMDDQPPTCRFPKLEECAHFHYERVSLGGISVELVPCDFSGDSPGEGWVCLKVSSHVNSCEEDAAGAGDDTWTLTRNREHFLQLDDMLHRCIYDRKVSGLPCLSSALSSSLDEQAYSRLVADYVHHLSIIADDSINCGPALNWLQMDNKGHKLLVANEDSSSINTPAVAAAYSVRKYVSQARDEISFDVGDMISIIDMPGPQESLWWRGKRGFRVGFFPHHCVAVIGDKVPRHMTQPPPIVGSVAVAPIKPVLRKHGKLISLFRSFILSRPSRRSLKQQGILRERVFGCDLGEHLTNCGHDVPQVLVECSRAIEQRGAVDGIYRLSGGAALTQRLRAAFDAGLAADLRAPLQRDPHALASLLKMYFRELPNPLCTYQLYDSFVSAVTAPEQLRLKAVRDTVVKLPPPHYRTLSYLMRHLRRVSLLSESTGMTARNMAIVWAPNLLRSPAPQHALQGVAVQAVVTEFLICYAEELFSKEQGADSGPPSISQESQESQIELRGLEAGRRPKSLPLNPPTKLLSLEEARRRGRPARTSPPADHVRLSTPDTMALRPASHLTPKYIEVGSGPNNLPQYHTVLELPGPGNKRLKRSPSGWRGLFTRKRSSRPPLIPHVMVHMTEPSATALRPAVSCESLSEGEAAPPPRPPHHTRSSSCDSYFEPWQAELATMRLRLSPQERDRHMFSEEDDAHAQHGAAEDSLNTTPGNSAVGSAADTPRRTRDDNTERKRVSLEQLERRVARLGYIDSDEPGDNTRAKRLCKDRDSPQSSGLELPALTNVKMRERTSPRKQKPSARYSGLQQPVTADSERLTWHGKDHSTLIRIDWPETPTPTTPTTPAPATPLYDPLESDSEPSDKHTITIKNSCRNCDEEKCLKCELKAADYENVAAADLASLGSTDISYHNLNRLSAVSSSSASEHTSHRRDDLMKIMTSSHESSSGYSNVSYKQDEPCDRYDFARPDYVNLSSSTSKSSPASPLKSPLKSTISITFRSPGRVQTPDYEPIGNEDTPTNERDSVYEDVDLEKSLSIVEEASVPQTDASLPTQEARQPGDYGRLASDLIILDTPTDDRDLYSQVKFFKKSIEEVNAMILETPDKEADYERVDFKTNNYSLDETEEDGHTEDGGGSERENGNVITTSNNLNVRELANRFESPTEQKGAFTFDKYKTEDKHAAARRDDHQPRTTARTLALARDTYTLTKNVTARSLDENAFVKEFGSDRDRRKSLEVKDGQRQARGVPDLNLNTEDLQTKTDTAEDRVALIQGAGKEHEKMLSRQRIEKYKEERRNFLREKYSSQSFRSSPEQLTRIKLKKSDHEARTDEPHKFERRNTVDLGQRMRFTLARSANDLDSIPSPGSQEDRSEKMSPSFNIRDMTAIFEQKSQGTG-