Monarch geneset OGS2.0

DPOGS209666
TranscriptDPOGS209666-TA3084 bp
ProteinDPOGS209666-PA1027 aa
Genomic positionDPSCF300134 - 340133-346938
RNAseq coverage223x (Rank: top 45%)
Annotation
HeliconiusHMEL0084550.084.50% 
BombyxBGIBMGA000699-TA0.077.49% 
Drosophilapico-PA1e-10142.46% 
EBI UniRef50UniRef50_D6WJS62e-17652.23%Putative uncharacterized protein n=3 Tax=Pancrustacea RepID=D6WJS6_TRICA
NCBI RefSeqXP_001812179.13e-17752.82%PREDICTED: similar to growth factor receptor-bound protein [Tribolium castaneum]
NCBI nr blastpgi|1892379775e-17652.82%PREDICTED: similar to growth factor receptor-bound protein [Tribolium castaneum]
NCBI nr blastxgi|2700080440.049.43%hypothetical protein TcasGA2_TC014797 [Tribolium castaneum]
Group
Gene OntologyGO:00055159.5e-15protein binding
GO:00071656.4e-09signal transduction
KEGG pathway 
InterPro domain[351-464] IPR0119939.5e-15Pleckstrin homology-type
[352-464] IPR0018492.1e-12Pleckstrin homology domain
[215-298] IPR0001596.4e-09Ras-association
Orthology groupMCL15622 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209666-TA
ATGATGGACGTGCAACTAAGCGAGCCGACCAAGTGGCACCGTGGAGGCTTCCTCTCCACTCTCAACAGAAGTTTCCGTCTGGCCACCAAATCAAAAAGCGCTAACAATTCACCAATTGAAAACAAATCTTTTGAACAAAGTTTAAAAATGACTGACACGGGTTTAAATTCATCAAGCGATGCTACAGCAGCGCTGAGGCCGCTGGAAATTGTGGCGCCCCGCATCGATTCCTACAGATTTTCTATGGCAAATCTAGAAGAAACACAAGACGCTGATTTAGACGCGATATTAGGGGAGTTGTGTGCGCTCGATTCCGAATACGATGAAGAAATTTCCCGAGTATCCACTGATTATTCTCAATCCAAAGAAAGAGAAGAGGGCGAGAGTTCTCAGCGACAAGAGAACAAGGAGGGCGATGGTGCACCAACCATCGCAAGAACTGATTCACCTGACAATGACTCTGCCTTTAGTGACACGGTATCGATGCTCTCTAGCGAGTCGTCAGCCTCGAGTAGTGCAAGTTCTAAATGTAAACCGATGAAACTCAGTCTGCATCCAAATCAAAAGGATGCAATTTTTCAACAGAAAGCAGATAAAATTAAACTGGCATTGGAACGTATGAGAGAGGCGAATGTAAAGAAGCTCTTTATTAAAGCTTTCTCTATGGACGGATCATCAAAGAGTCTCCTAGTCGACGAAAAGATGACGTGCGGCTACGTCACACGGTTACTGGCTGACAAAAATCATGTCACTATGGAACCTAAATGGGCCATAGTTGAACACTTACCAGATTTGCATATGGAACGAGTGTACGAAGATCACGAGATGTTGGTGGACAACCTCATGTTGTGGACGCGAGAATCCAAAAACAAAATACTGTTTGCCGAAAGGCCAGACAAGATATCGCTCTTCCAAACACCCGAGAAATTTCTCCTAACTGAAGATGAAAGAGGAATTAGTGAATACGACGAGCATTTACGTCAAGTGGTCATAGAAGAATTCTTCGGTCAAAGCGGAGCCTCTACAATTCCCTCCGTCTCCGGTCATCAAGTCCCAGCCATGGAAGGACCTCTCTATCTTAAAAGTGACGCTAAAAAGGGCTGGAAGAAATACTACTTCGTCCTGAGACCTTCCGGACTGTACTATTTACCGAAAGATAAAGTGAAGACCTTAAAGGAGCTGGTGTGTTTGGCGACTTTCGACACTAACGAAGTATATTTAGGAGTCAATTGGAAGAAAAAGTACAAGTCTCCAACTGACTTTTGCTTTGCTATCAAGCATCCACGGCTTCAACAGCCGAAGAGCGTCAAGTTTATTAAATTTCTATGCGCCGATGATCAAAGGACTCTCGAGAGATGGGTCACCGCCATGCGTATAGCGAAGCACGGCAAACAATTACTAGAAAATCACCGCACCCTCGTCGAAGAGCTGACCCAGGAAGATTTGGACCATTTGGCCCACGCCCGCTCATGCTCCATAACATCGATCCCTACAAAGACTAATGGCACGGCGCCAGGCCTGCCCGCGCCCGTTTCCAGCAACGTCAGTGTTGCCAACTCTGACATCAGCAGCGGCAGACATTCTCGAGCATCATCTTCAAGTTCAAGCGGTTGTCTCTCAGATGGCGGGACCGCCTCGGAAAGTGCGTTCGATTGCGAATTTCCTATGGGTACAATAAAACGAAAACCATCCATGAAGCCAAATATTCCTTTGACCTGGATGACGCGGCAACTTAAAGAAATGGTAGAAAATGAAGGTGACGCAGAAGTAGGTGATTCTGGAACGCTCACAAGACGACCGCGCACTCGAGACGATTCAACTCTTAAACGCCACCACTCAACTGCTACAGGATCTTCGGAGCCCACTATTTACAGTACCAGCAGCATTACATCTAGCAGTCCAGTGAGAGATCCGTCATCTCCAACCTATGGCCACTATGAAACTATTACTCACGAACCATATAGAGCAAGCGTAGACACGGCATCCTCGCTATACGGGTACACGATTTATGACAGTTCACAATCACAATCAGAACCTACTGTTGAGGATCTTCCTCTTCCACCACCTCCAACTGATATTCCAGATGGCATGTTTAGTTCAACTCTCAGCCTAGATTCATTGCCACCGCCGCCACCACCTGTGGCCTACCCTATAGAGGATTTGAATGGATCCCAGCTGAGTCTACCACCACCTCCGCCTGAACACACTATTGAAACTCACACTGGACGAGTTCAAGATATAGTTAGCCAGCTAACCGCTCAGCAAATAGAGCAAACATCGAGAGCCGGCCAGAGAAGCAGTTTGAGAAGCTCTGAGAGCAACCGATCATTCCCCCGACAACCCTCGCTCGATAGTGTCAATTCGGAAGCTTCGAAGACGTCTTCTTTACAATCCGACAAAAGTATTTACGCTCATACGCAACAAAATGTTGCATATGGTGCTTGTCTTGTAGAGCTACAGAACAAAAAAATAAGCAACGGCAGTCCAGCCATACAGAAGAAGACAATGGAACCTGTAAAAGAAAGAGCTGGTTCCATTAAGAAAGTTAACTTTGCAGATGACCTTCCAAGTAATACTGACAAGAAAGCCAAAAAAATTTCTTTTAATTTGACGGACGCTCCACTTTCACCAAGAAAGCCTCCTCCGCCGAAACGCAATGAGAGCACTCGTCTCTCGTCCCCTAAAAAGCTAGCTGATTCAAACAGCAATCCTCCAAAAGACTTTCTAAAAGATCTTCAAAGAGTTATGAGGAAGAAATGGCAGGTCGCTCAAAAATGCAAACTTGAACCGGCAACTACGCCACATGAGGTACTTGGTTTCAGAGAATACCCTTTGTCAGATGACTACAAAGAGACTAGCGTGTCCATGTGGGTGCAAGAACATTATGGAGGAGGTTCAGGCGTAGAGGATCCCTTCTACGAGAATGTGTTCGGAAGAGAAGCTCAGCCACGGCGAGAAGAGCCCAAACCAATAAAGAAGCGTCCACCGCCTGCCCCTCCTCGCCGTAGCGACTCGACGCACTTGAGCACACTCCCCGGCATCCCCCCGCCCTCGCATCCATCGCCCGTTCAACCGACCGCTTGA

Protein sequence:

>DPOGS209666-PA
MMDVQLSEPTKWHRGGFLSTLNRSFRLATKSKSANNSPIENKSFEQSLKMTDTGLNSSSDATAALRPLEIVAPRIDSYRFSMANLEETQDADLDAILGELCALDSEYDEEISRVSTDYSQSKEREEGESSQRQENKEGDGAPTIARTDSPDNDSAFSDTVSMLSSESSASSSASSKCKPMKLSLHPNQKDAIFQQKADKIKLALERMREANVKKLFIKAFSMDGSSKSLLVDEKMTCGYVTRLLADKNHVTMEPKWAIVEHLPDLHMERVYEDHEMLVDNLMLWTRESKNKILFAERPDKISLFQTPEKFLLTEDERGISEYDEHLRQVVIEEFFGQSGASTIPSVSGHQVPAMEGPLYLKSDAKKGWKKYYFVLRPSGLYYLPKDKVKTLKELVCLATFDTNEVYLGVNWKKKYKSPTDFCFAIKHPRLQQPKSVKFIKFLCADDQRTLERWVTAMRIAKHGKQLLENHRTLVEELTQEDLDHLAHARSCSITSIPTKTNGTAPGLPAPVSSNVSVANSDISSGRHSRASSSSSSGCLSDGGTASESAFDCEFPMGTIKRKPSMKPNIPLTWMTRQLKEMVENEGDAEVGDSGTLTRRPRTRDDSTLKRHHSTATGSSEPTIYSTSSITSSSPVRDPSSPTYGHYETITHEPYRASVDTASSLYGYTIYDSSQSQSEPTVEDLPLPPPPTDIPDGMFSSTLSLDSLPPPPPPVAYPIEDLNGSQLSLPPPPPEHTIETHTGRVQDIVSQLTAQQIEQTSRAGQRSSLRSSESNRSFPRQPSLDSVNSEASKTSSLQSDKSIYAHTQQNVAYGACLVELQNKKISNGSPAIQKKTMEPVKERAGSIKKVNFADDLPSNTDKKAKKISFNLTDAPLSPRKPPPPKRNESTRLSSPKKLADSNSNPPKDFLKDLQRVMRKKWQVAQKCKLEPATTPHEVLGFREYPLSDDYKETSVSMWVQEHYGGGSGVEDPFYENVFGREAQPRREEPKPIKKRPPPAPPRRSDSTHLSTLPGIPPPSHPSPVQPTA-