Monarch geneset OGS2.0

DPOGS203771
TranscriptDPOGS203771-TA765 bp
ProteinDPOGS203771-PA254 aa
Genomic positionDPSCF300010 + 629693-638059
RNAseq coverage812x (Rank: top 16%)
Annotation
HeliconiusHMEL0042254e-8873.58% 
BombyxBGIBMGA013349-TA3e-8057.03% 
DrosophilaCrk-PA5e-6647.45% 
EBI UniRef50UniRef50_Q9XYM06e-6447.45%Adapter molecule Crk n=7 Tax=Diptera RepID=CRK_DROME
NCBI RefSeqXP_001943269.11e-7054.29%PREDICTED: similar to Adapter molecule Crk [Acyrthosiphon pisum]
NCBI nr blastpgi|3320249992e-6947.12%Adapter molecule Crk [Acromyrmex echinatior]
NCBI nr blastxgi|3320249993e-6747.80%Adapter molecule Crk [Acromyrmex echinatior]
Group
Gene OntologyGO:00055151.5e-24protein binding
KEGG pathwayapi:1001646323e-70 
 K04438 (CRK, CRKII)maps-> Regulation of actin cytoskeleton
    MAPK signaling pathway
    Bacterial invasion of epithelial cells
    Fc gamma R-mediated phagocytosis
    Renal cell carcinoma
    Pathways in cancer
    Shigellosis
    Chemokine signaling pathway
    Neurotrophin signaling pathway
    Insulin signaling pathway
    Focal adhesion
    ErbB signaling pathway
    Chronic myeloid leukemia
InterPro domain[10-113] IPR0009801.5e-24SH2 motif
[130-186] IPR0014521.2e-17Src homology-3 domain
Orthology groupMCL11709 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203771-TA
ATGGCGAATCCTCCGGTCGGAGTTTCGTTCGACCAGAATGATATGTCCAGCTGGTATTTCAGCGGTCTCTCCCGCGCGGAGGCCACCAAACTGCTTCTCAATGAGACGGAGAGCGGTGTTTTCCTCGTCCGGGATTCCAAAACAATACACGGAGACTATGTACTGTGTGTCAGGGAGGACGACCGTGTGTCCCACTACATAATAAATCGCATGGTGTCCCCGGATGGCACAACTCGTTTCCGTATCGGGAACCAGTTGTTCGCGGACATGCCGGCATTGTTGGCGTTCTATCGTCTCCACTACCTGGACACGACGCCCCTGGTGAAGCCTCTGCCCCAGGCCAGTGTCCAAGCGACCGCGGCCCCTACTCACTCCTACCACGTGCTCGAAGTTGTCATAGCCAAATTCGATTTCGTCGGATGTGACGCTGATGATCTACCCTTCCGCCGTGGAGAGAGGTTGATGGTGATAAATCGTGATGAAGAGCAATGGTGGACAGCCAGGAACGCTCAAGGCAGGACGGGTTCCATACCCGTCCCGTACGTACAGAGGTTACTGGACCCGTCGGGCGTGCCGTACCCCGCGGGCGAGGCGATGTCCCAGTTGGACCCCCCCAGCCCGCCCGCCAACAAGACGCCACAGAAACCCACGAACATGCAGGTTACAAAAATGAACATAAACGGTCAATGGGAGGGCGAGCTGAACGGGAAGGTTGGACATTTCCCATTCACATACGTAGAGTTCCTTGATGACGTCACCAGCTAA

Protein sequence:

>DPOGS203771-PA
MANPPVGVSFDQNDMSSWYFSGLSRAEATKLLLNETESGVFLVRDSKTIHGDYVLCVREDDRVSHYIINRMVSPDGTTRFRIGNQLFADMPALLAFYRLHYLDTTPLVKPLPQASVQATAAPTHSYHVLEVVIAKFDFVGCDADDLPFRRGERLMVINRDEEQWWTARNAQGRTGSIPVPYVQRLLDPSGVPYPAGEAMSQLDPPSPPANKTPQKPTNMQVTKMNINGQWEGELNGKVGHFPFTYVEFLDDVTS-