Monarch geneset OGS2.0

DPOGS201490
TranscriptDPOGS201490-TA1236 bp
ProteinDPOGS201490-PA411 aa
Genomic positionDPSCF300006 + 374355-378893
RNAseq coverage768x (Rank: top 17%)
Annotation
HeliconiusHMEL0159594e-12875.19% 
BombyxBGIBMGA002678-TA0.082.25% 
DrosophilaShc-PA1e-9146.08% 
EBI UniRef50UniRef50_E1ZX941e-11248.25%SHC-transforming protein 1 n=8 Tax=Formicidae RepID=E1ZX94_CAMFO
NCBI RefSeqXP_001602298.12e-11549.45%PREDICTED: similar to shc transforming protein [Nasonia vitripennis]
NCBI nr blastpgi|1700403265e-10746.76%shc transforming protein [Culex quinquefasciatus]
NCBI nr blastxgi|3287223629e-10350.71%PREDICTED: SHC-transforming protein 1-like [Acyrthosiphon pisum]
Group
Gene OntologyGO:00055152.5e-55protein binding
GO:00355566.8e-33intracellular signal transduction
KEGG pathwaynvi:1001182916e-115 
 K06279 (SHC)maps-> Bacterial invasion of epithelial cells
    Glioma
    Chemokine signaling pathway
    Natural killer cell mediated cytotoxicity
    Neurotrophin signaling pathway
    Insulin signaling pathway
    Focal adhesion
    ErbB signaling pathway
    Chronic myeloid leukemia
InterPro domain[4-184] IPR0119932.5e-55Pleckstrin homology-type
[29-185] IPR0060205.4e-33Phosphotyrosine interaction domain
[24-39] IPR0060196.8e-33Phosphotyrosine interaction (PID/PI)
[314-393] IPR0009805.9e-21SH2 motif
Orthology groupMCL14349 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS201490-TA
ATGGGTGACAACGGCCCCTTTGTCGCGAAACCGGCTCGAGGTTGGCTACATCCAGATTCCGTATTAGCCAGCGATGGAATAACTTATGCAGTCAGATATATTGGTTGCATGGAGGTCCTTACATCTATGAAAAAATTAGACTTTGAGACAAGATCTCAAGTAGCAAAGGAATGTATTGCGAGAGTATGTGCTGCAGCGGGACTAAGAACAGCCGATAAAAAGCGTAGAGTTTGTCAAGCAGCTGCAAATGCTTTAGCGGCTAGACCTAGAATGTCACATTCCGGCTCCAATGTAGCCTTAACAATCTCATCGAGAGCTATAATACTGGCCGCTCTGGAAGGTGGTGAAACTATTGCACGACATGACATGCCACGGGTGTCATTTGCATCCGGAGGTGATCAGGATTCATTGGATTTTGTGGCGTATGTGGCTAAGTCTGCCCCACCAGCGGAATGGAGAGCATGTTATGTATTAGAGTGCGGTGGGAGATTGGCTCAAGATGTCATCGCGACCGTCGGACAGGCGTTTGAACTTCGTTTTAAAGAATTCTTGACGAAACCTACATCCTTGAACATCAATGGCTCGCGTCCAGTGTGCGGTTCTTCAGGGGAGGGTTTGGAAGAGCGCGAGTATTACAACGACATGCCCGACAAGATGCCGCCGGAGCCTACCAGCCACAGACACCCGCCTCCGCCGCCGCTCGCCACACTCGCACCTATATCATCATGTGAGGAGAGTGTACGTCACTATGTCAACCAAACCCCTCCCCCGCGGACCCCACCTACGGCGCTGCTGCCTAATCATCATACTGATATATTCGACATGCAGCCGTTCACAGTAGCCGCGGCGTCGTCGGCGGCGTCCACTTCTCCGTCGTCAGTAGCAGAGGAGCCGCCGCCTGCTCTGTCCGCGGCCGCTCAGTGCGCGCTGCTGGCCCGCGAGCCCTGGTACCACGGCCCTATATCGAGAACCGCGGCTGAAAGGCTGGTCGTGGAGGACGGCGAGTTCCTCGTCCGTCAGTCGGCCGCGTGTCCCGGGCAGTTCGTGTTGACCGGGGCACGCCGCGGGGCGCACAAACATCTACTGCTGGTCGACCCTAACGGCGTTGTGAGAACTAAAGACCGCGTGTTCGACAGCGTGCCTCATCTCATCAAATATCACTGTACTAATGAACTGCCAATAGTATCAGCTGATTCAGCGCTGCTCCTACGACTTCCGGTGCAGCGACCTTCCTGA

Protein sequence:

>DPOGS201490-PA
MGDNGPFVAKPARGWLHPDSVLASDGITYAVRYIGCMEVLTSMKKLDFETRSQVAKECIARVCAAAGLRTADKKRRVCQAAANALAARPRMSHSGSNVALTISSRAIILAALEGGETIARHDMPRVSFASGGDQDSLDFVAYVAKSAPPAEWRACYVLECGGRLAQDVIATVGQAFELRFKEFLTKPTSLNINGSRPVCGSSGEGLEEREYYNDMPDKMPPEPTSHRHPPPPPLATLAPISSCEESVRHYVNQTPPPRTPPTALLPNHHTDIFDMQPFTVAAASSAASTSPSSVAEEPPPALSAAAQCALLAREPWYHGPISRTAAERLVVEDGEFLVRQSAACPGQFVLTGARRGAHKHLLLVDPNGVVRTKDRVFDSVPHLIKYHCTNELPIVSADSALLLRLPVQRPS-