Monarch geneset OGS2.0

DPOGS206145
Transcript: DPOGS206145-TA | 1533 bp
Protein: DPOGS206145-PA | 510 aa
Genomic position: DPSCF300028 + 1377367-1382477
RNAseq coverage: 84x (Rank: top 64%)
Annotation
Heliconius: HMEL015058 | 4e-81 | 47.38%
Bombyx: BGIBMGA000514-TA | 8e-23 | 50.91%
Drosophila: dos-PA | 5e-28 | 51.45%
EBI UniRef50: UniRef50_E0V9D2 | 9e-40 | 60.34% | GRB2-associated-binding protein, putative n=1 Tax=Pediculus humanus corporis RepID=E0V9D2_PEDHC
NCBI RefSeq: XP_002422726.1 | 2e-40 | 60.34% | GRB2-associated-binding protein, putative [Pediculus humanus corporis]
NCBI nr blastp: gi|380026273 | 3e-40 | 61.29% | PREDICTED: uncharacterized protein LOC100872582 [Apis florea]
NCBI nr blastx: gi|270005903 | 6e-41 | 33.92% | hypothetical protein TcasGA2_TC008021 [Tribolium castaneum]
Group
Gene Ontology: GO:0005515 | 7.8e-25 | protein binding
KEGG pathway: gga:422456 | 2e-23
    K09593 (GAB1) maps -> Bacterial invasion of epithelial cells
    Neurotrophin signaling pathway
    ErbB signaling pathway
    Renal cell carcinoma
InterPro domain: [3-120] IPR011993 | 7.8e-25 | Pleckstrin homology-type
    [5-123] IPR001849 | 1.4e-21 | Pleckstrin homology domain
Orthology group: MCL26630 | Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206145-TA
ATGTCCAAAAATATTGTTTATGAAGGATGGCTTACTAAATCGCCACCATCAAAGCGCATTTGGAGAACTAAATGGCGCAGACGGTGGTTTGCCTTGCGACAGTCAGGTGAACTACCAGGACAATACTTTCTCGACTACTATTCCGATCGACACTGTAGGAGGTTGAAGGGGTCTATCGATTTGGATTGCTGTGATCAGGTTGATGCTGGATTGCATATGGAAAGGGATAATAACGGATCATTAAATCGCAAACTCCGTGGTTGTGTGTTCACAATCCAAACTAACATTCGCACTTATCACTTGGAGGCTGATTCTGAGGAAGAAATGGAAAAGTGGGTTGATGCTATATGCAGAGCGTGTGGCCTCAGAGCTACGGACGAATCAACAAATGCTGTCGGTCTATATCAGAATATAACGCTAAATGAGAGAGAAAATATTGGAACAAAGGTAATAAATGATTCCAATGTCACAGGGGGTGCGAGAAAAAGGACTCACACCACCACATTTAGGAACAATAAGCCTTCGTCTACACGAGGCAAGTCCAAAGTGAAGACAAACACGAAGCAAACGCCTATAGAAAGACATGATCAGGGTACAATGACGGTTATGGATGATTATCCCTCCGGCGAAGGCACTGGCCCATACATACCAATATCTGAATGCATAACTGGAGTTCGGACACAGGACACCCAAACAGCGTTTACCTTTGACCCCAAGAACATCGTTATCAGCTCGAATAAGAAAGTCGGTAGCTCAAGAAGCTATACCTATTTGACGCAACCACAAATAAGAGTAAATAATGTTGAACTATCAGAAAATGAATCAAATCTCAGTGAAGACGAATGCCGGTCCCTCAGCGCTAGTCAATTCAATATGGGGGATTGGACTGTTGCAAAGTCTTTTAAAAGGCTATCTGTGCATCCTCAAAGCCAAGAAGGCTTCAACGCTGATGGTCCCCCCGTACCACCTCGGCCGCCGAAGACCTTCGCCATGAGCAAGGATCTCACGCGAGCGAAGGATTCCTTCCAAGGACAAAAATTTCAGGAAACTGTTGATGTTCATGAGTGTTCTTCTCCTTTCCCCTGGGTTCGCTTGCCACGTCGCATGTCACAAGGTGCGCCGACATCACCAGGAAGATCCGTGATCAGCCACGCCAGAACTGATGATGAAGATGACGTTTCAATGGGCCATTCGCTGCAGTATTGCAACTTGTCGTCTCTGCCGCCGGCCGTAGATCGCGCTTTGAAGCCACGGCACTCGACTCACAGCATAGGCAATATAACCGCTGGACATAAAACAGCGTGCAGGGCTAGCGATGAGATCAAGTCGGAGACTTTGCAGTACTTGGATCTCGATCTACCCGCCCCCAGCTCGCAATCTACTTTCAAGGAATCAGCAAGGAAGACGTCTATAGTCCACGGTAAGTCGTTGTCATCCGACGAGTGCGCGTATAAAACGGTCGACTTCTTGAAGACCGAGGCGTTCAATATTACTCGCCAGGACGCTGAAGCGTCTAGAAGTATCCAGCAATGA

Protein sequence:

>DPOGS206145-PA
MSKNIVYEGWLTKSPPSKRIWRTKWRRRWFALRQSGELPGQYFLDYYSDRHCRRLKGSIDLDCCDQVDAGLHMERDNNGSLNRKLRGCVFTIQTNIRTYHLEADSEEEMEKWVDAICRACGLRATDESTNAVGLYQNITLNERENIGTKVINDSNVTGGARKRTHTTTFRNNKPSSTRGKSKVKTNTKQTPIERHDQGTMTVMDDYPSGEGTGPYIPISECITGVRTQDTQTAFTFDPKNIVISSNKKVGSSRSYTYLTQPQIRVNNVELSENESNLSEDECRSLSASQFNMGDWTVAKSFKRLSVHPQSQEGFNADGPPVPPRPPKTFAMSKDLTRAKDSFQGQKFQETVDVHECSSPFPWVRLPRRMSQGAPTSPGRSVISHARTDDEDDVSMGHSLQYCNLSSLPPAVDRALKPRHSTHSIGNITAGHKTACRASDEIKSETLQYLDLDLPAPSSQSTFKESARKTSIVHGKSLSSDECAYKTVDFLKTEAFNITRQDAEASRSIQQ-
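
Consistency check (illustrative):

The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is not part of the database. The function names are hypothetical, and it assumes that the transcript is a complete coding sequence ending in one stop codon (the trailing "-" in the protein sequence) and that the genomic coordinates are 1-based and inclusive.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int, has_stop: bool = True) -> bool:
    """Return True if a coding sequence of transcript_bp bases can encode
    protein_aa residues, optionally followed by one stop codon."""
    codons = protein_aa + (1 if has_stop else 0)
    return transcript_bp == 3 * codons


def genomic_span(start: int, end: int) -> int:
    """Length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
print(cds_length_matches(1533, 510))    # 3 * (510 + 1) == 1533
print(genomic_span(1377367, 1382477))   # span of the gene on scaffold DPSCF300028
```

If the inclusive-coordinate assumption holds, the genomic span is longer than the 1533 bp transcript, which would be consistent with the gene containing introns.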