Monarch geneset OGS2.0

DPOGS212764
TranscriptDPOGS212764-TA1428 bp
ProteinDPOGS212764-PA475 aa
Genomic positionDPSCF300012 + 836379-841028
RNAseq coverage96x (Rank: top 62%)
Annotation
HeliconiusHMEL0155352e-17875.99% 
BombyxBGIBMGA013262-TA0.077.70% 
DrosophilaLnk-PA9e-8260.85% 
EBI UniRef50UniRef50_G6D5M70.0100.00%Signal transduction protein lnk-realted n=2 Tax=Endopterygota RepID=G6D5M7_DANPL
NCBI RefSeqXP_001648512.12e-15859.41%signal transduction protein lnk-realted [Aedes aegypti]
NCBI nr blastpgi|1571046635e-15759.41%signal transduction protein lnk-realted [Aedes aegypti]
NCBI nr blastxgi|1571046639e-15257.94%signal transduction protein lnk-realted [Aedes aegypti]
Group
Gene OntologyGO:00055155.9e-22protein binding
GO:00355561.5e-14intracellular signal transduction
GO:00048711.5e-14signal transducer activity
KEGG pathwaytgu:1002319679e-65 
 K07193 (APS, SH2B2)maps-> Insulin signaling pathway
    Neurotrophin signaling pathway
InterPro domain[307-407] IPR0009805.9e-22SH2 motif
[10-74] IPR0150121.5e-14Phenylalanine zipper
[161-253] IPR0119931.2e-09Pleckstrin homology-type
[148-256] IPR0018493.4e-07Pleckstrin homology domain
Orthology groupMCL14180 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212764-TA
ATGGCTGGCCCGTCCGACGGCGACTCCGACGGATGGGTCGAGATGTGCGAGCGTCAGGCCAAAACCTCCGCTCAGGACTTCGCCAAGGCTTGCCTGCAGTACATACAGACCAACAGCAACGAAGCCGCTGCCCGACCGCTCTCGCAAAAAGAGCTGCTAAAGAAATTTGTTGAATGTTTTTCCGAGCAATTTGATTTCGAATTCGGTAAATTGAAAGCACAGCACAAGCTACCGAACGGTATGCATACCGGTCACGACGAGAGTGATTATTCCGAGGACACGGACTCGCCTAAAACGCAACACAAACCATTCTTTCGAAGGTTATCTTTTAAAGGTCTGCGCCGCGGCAAGGGTTTATTTCACAAACAGCATTCAGACGAAGTCGAGTTACAATCAAATTTAAACAAACATAAAACGAAGTTAGCTAAAATTGTTGTCGAGTGCCGTAAGGAAGGGCTCGTAAATTATTTGACTCCAGAGAGCTTAGAGCAACCAGGAGGTCCACAGAAATGGGAGAAATGTAGACTGGCGTTAGTTAAGACTGTTGGCGGTTACATGTTAGAGTTCTATTCACCACCTAAATCACAGAAGCCTAGGAGTGGGGTTTTCTGTTTTCTTATATCCGAGGCTCGGGAAACAACAGCTCTTGAAATGCCTGACCATGAAAACACTTTTGTTCTGAAGGCCGACAACAACATGGAGTATGTGATAGAAGCAGCGGATGTGGACGACATGAAGTCCTGGCTGGCTACAATCAAGTACTGCATGCGTTCAGCGCCGACATCACAGCCCCCACCAGAGGCATTGGCCGGCCTCCCCGAGCCTGCACCCCCCGACCTGCCACCAAGAAGAGACCTACCCGCAAGCACCAGTAATGTTGACCTAGCTAATGATACACCGGACGCTGAATTAGGTTCAATAGCGGAGGAGGCTTGCGAGTCCCGCGTGTCGCTGGGCGAGTGGCCGTGGTTCCACGGTACACTGGCCAGGTCTGCGGCGGCGGCCTGCGTGCTGGCTGGCGGAGCACCCGCGCACGGCTGCTACCTCGTCAGGCAGAGCGAGACCAGGAGAGGGGAATACGTACTCACTTTCAACTTTCAGGGTCGTGCTAAGCACCTCCGCATGACGTTGAGCGAGGCTGGCCAGTGCCGCGTGCAGCACCTCTGGTTCCCGAACGTGCACGACATGCTGGAGCACTTCAGGGCGCACCCCATACCGCTGGAGTCCGGCGGTGCGGCTGACGTCACGCTCACCGAGTACGTCGTCTGCCAGGACAACCACCTGGGTGTGTCTCACGGCAGCGATGTCCGTATACGTCGTGCTGAACTTGAGGCGTTGTTGGTGGCGAGTGGGGCTCAACATGACGTTCGAGCCGTTGACAACCAGTATGTGTTGTGCAACTTGAGATCCTCACAGGGGCATGCTTGA

Protein sequence:

>DPOGS212764-PA
MAGPSDGDSDGWVEMCERQAKTSAQDFAKACLQYIQTNSNEAAARPLSQKELLKKFVECFSEQFDFEFGKLKAQHKLPNGMHTGHDESDYSEDTDSPKTQHKPFFRRLSFKGLRRGKGLFHKQHSDEVELQSNLNKHKTKLAKIVVECRKEGLVNYLTPESLEQPGGPQKWEKCRLALVKTVGGYMLEFYSPPKSQKPRSGVFCFLISEARETTALEMPDHENTFVLKADNNMEYVIEAADVDDMKSWLATIKYCMRSAPTSQPPPEALAGLPEPAPPDLPPRRDLPASTSNVDLANDTPDAELGSIAEEACESRVSLGEWPWFHGTLARSAAAACVLAGGAPAHGCYLVRQSETRRGEYVLTFNFQGRAKHLRMTLSEAGQCRVQHLWFPNVHDMLEHFRAHPIPLESGGAADVTLTEYVVCQDNHLGVSHGSDVRIRRAELEALLVASGAQHDVRAVDNQYVLCNLRSSQGHA-