Monarch geneset OGS2.0

DPOGS211052
TranscriptDPOGS211052-TA1227 bp
ProteinDPOGS211052-PA408 aa
Genomic positionDPSCF300202 + 298027-301715
RNAseq coverage845x (Rank: top 15%)
Annotation
HeliconiusHMEL0043345e-11869.07% 
BombyxBGIBMGA003753-TA2e-7457.31% 
DrosophilaStam-PA4e-10047.00% 
EBI UniRef50UniRef50_E0W2505e-10249.51%Signal transducing adapter molecule, putative n=3 Tax=Pediculus humanus corporis RepID=E0W250_PEDHC
NCBI RefSeqXP_623539.18e-10548.20%PREDICTED: similar to Signal transducing adaptor molecule CG6521-PA [Apis mellifera]
NCBI nr blastpgi|3838596882e-10548.67%PREDICTED: signal transducing adapter molecule 1-like [Megachile rotundata]
NCBI nr blastxgi|3838596882e-10147.74%PREDICTED: signal transducing adapter molecule 1-like [Megachile rotundata]
Group
Gene OntologyGO:00068868.5e-25intracellular protein transport
GO:00055157.5e-20protein binding
KEGG pathwayame:5511402e-104 
 K04705 (STAM)maps-> Endocytosis
    Jak-STAT signaling pathway
InterPro domain[5-140] IPR0089426.1e-32ENTH/VHS
[10-141] IPR0182051.4e-28VHS subgroup
[6-138] IPR0020148.5e-25VHS
[202-257] IPR0014527.5e-20Src homology-3 domain
Orthology groupMCL12041 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211052-TA
ATGGGTATCTTTGGAACCTCGTCTCCATTTGATCAGGATGTAGAGAGAGCGACCAGCGAGAACAACACCAGCGAGGAGTGGGGCCTCATCCTGGAGATCTGTGACCGAGCGGGCTCCGGGCCCGCCGCGGCCCGGGACTGTCTGCCCCGGCACGACGCCGCCGAGCCACACGCCGACCCGCACGTGCAGGTGCACGCCGCTACTCTCCTGGACGCGTGCGTCGCTAACTGTGGCCGTGTTTTCCACCTCGAAGTGGCGTCGCGGGACTTCGAGGCCGAGTTCCGTCGCCTGCTGTCTCGCGCCCAGCCTCCTGTCGCCGGCCGTCTCCGCGCTCTGCTGCGCAAATGGGCCGAAGGAGAGTTCCGCGACGATCCCCAGCTGGATCTCATCCCCTCCCTCCACGCCAAGCTAAGCGCGGAGTCCGGCGAGCGCGTGTCGTCGGCCGCCGCGCCCGCAGCCGACGCTCAGACCGTCCTAACGGCGGCAGAAAGGCGTGAACAAGAGGAGCTGGCTCGCGCCATCGCATTGTCGCTGCGCGATTCAAGTGGGTCCGGGGGGGCCGCGGGGGCCGGGGCACGCCGGGGGCTCGCTATATCCGCGCCGATGAAGGTGCGCGCCCTATACGATTTCGAGGCGGCCGAAGATAACGAGCTTACTTTCCTAGCGGGAGAAATCGTTCACGTGACAGACTCCAGCGATCCTAATTGGTGGAAAGGTCACAACGAGCGAGGAGAGGGTCTCTTCCCCGCCAACTTCGTCACGTCCGACCTCACCGAGCCCGCGCCCGAATCCGAGAATCGATCGAACTCGGGCAAGACGGTTCAGTTCGCGGAGAGCGCGGGCGGGGCCGAGCAGCCCGCGCGTATCGACGAGGCCGTCGTGGACGAGGCCCTGGCGCTGCTGCACGAGGCTGACCCCGCCGCCGACGACGCCACCGGGCCTCGCCTGGCCCGCGCCGAGGCCGCCGCGCACGCCATGGGTGCTCTAGTGGACGCCGCACTGGAACGCGCGGACCGCCGCCACGCTCGCCTCACGCAGCTCAGCGCCGACCTCGTGGACGCACTCAACCTCTACCACGACCTGATGCGCGCGCCGCCCACCTTCCTCCCCCCTATGCACTACGCCCCCGGCCCGGCCGCGGCCACCCTGCCCCCTCCTCCCCCCGGCGCCCTGCTCCCCGGCCCGCTGCCGTCCCTCGCCCCCCACCACCCGCAGCCGCCGCGGTGA

Protein sequence:

>DPOGS211052-PA
MGIFGTSSPFDQDVERATSENNTSEEWGLILEICDRAGSGPAAARDCLPRHDAAEPHADPHVQVHAATLLDACVANCGRVFHLEVASRDFEAEFRRLLSRAQPPVAGRLRALLRKWAEGEFRDDPQLDLIPSLHAKLSAESGERVSSAAAPAADAQTVLTAAERREQEELARAIALSLRDSSGSGGAAGAGARRGLAISAPMKVRALYDFEAAEDNELTFLAGEIVHVTDSSDPNWWKGHNERGEGLFPANFVTSDLTEPAPESENRSNSGKTVQFAESAGGAEQPARIDEAVVDEALALLHEADPAADDATGPRLARAEAAAHAMGALVDAALERADRRHARLTQLSADLVDALNLYHDLMRAPPTFLPPMHYAPGPAAATLPPPPPGALLPGPLPSLAPHHPQPPR-