Monarch geneset OGS2.0

DPOGS202429
TranscriptDPOGS202429-TA546 bp
ProteinDPOGS202429-PA181 aa
Genomic positionDPSCF300085 + 504477-505022
RNAseq coverage273x (Rank: top 39%)
Annotation
HeliconiusHMEL0034005e-10295.58% 
BombyxBGIBMGA013312-TA3e-9993.92% 
DrosophilaCbl-PB1e-8076.54% 
EBI UniRef50UniRef50_Q9VSK22e-7876.54%Cbl long isoform n=22 Tax=Bilateria RepID=Q9VSK2_DROME
NCBI RefSeqXP_002428625.12e-8584.39%E3 ubiquitin-protein ligase CBL, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420159973e-8484.39%E3 ubiquitin-protein ligase CBL, putative [Pediculus humanus corporis]
NCBI nr blastxgi|2420159971e-8284.39%E3 ubiquitin-protein ligase CBL, putative [Pediculus humanus corporis]
Group
Gene OntologyGO:00230511.6e-121regulation of signaling
GO:00048421.6e-121ubiquitin-protein ligase activity
GO:00056343e-67nucleus
GO:00071663e-67cell surface receptor linked signaling pathway
GO:00048713e-67signal transducer activity
KEGG pathwayphu:Phum_PHUM3945506e-85 
 K04707 (CBL)maps-> Ubiquitin mediated proteolysis
    Bacterial invasion of epithelial cells
    Pathways in cancer
    Endocytosis
    T cell receptor signaling pathway
    Insulin signaling pathway
    ErbB signaling pathway
    Jak-STAT signaling pathway
    Chronic myeloid leukemia
InterPro domain[1-180] IPR0241621.6e-121Adaptor protein Cbl
[33-160] IPR0031533e-67Adaptor protein Cbl, N-terminal helical
Orthology groupMCL19613 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202429-TA
ATGGCGGCTCCCCGTCACAAATCACAACAGAAAAATATATCTTCGATATTTTCTAAGTTACATGGTGCTTTCTCTGATGCTATGTGTCAAACAAAGCTGACGACAGACAAACGGACTCTAGACAAAACTTGGAAGTTGATGGACAAAGTGGTAAAACTATGCCAGCATCACAAAATGAATTTAAAAAATAGTCCGCCGTTTATTCTCGATATTTTACCTGACACATATCAGAGGTTGCGTATTATTTACTCCAATTACGAAAACAACATGCAGGAGTTGAATAGCAACGAACATTTTAATATATTTATAATAAATTTAATAAGAAAATGTAAGCAAGCTATAAAATTGTTCAAGGAGGGCAAGGAAAAGATGTTCGACGAAAACTCACATTTCAGACGTAATTTGACAAAGTTGAGCCTTGTTTTTAGTCACATGTTGAGCGAGTTGAAAGCTATGTTCCCGAACGGTACCTTCGCTGGCGACCAGTTCCGTATAACGAAAAGCGATGCCGCCGAATTCTGGAGAGCGAATTTCGGTAACAGGTAA

Protein sequence:

>DPOGS202429-PA
MAAPRHKSQQKNISSIFSKLHGAFSDAMCQTKLTTDKRTLDKTWKLMDKVVKLCQHHKMNLKNSPPFILDILPDTYQRLRIIYSNYENNMQELNSNEHFNIFIINLIRKCKQAIKLFKEGKEKMFDENSHFRRNLTKLSLVFSHMLSELKAMFPNGTFAGDQFRITKSDAAEFWRANFGNR-