Monarch geneset OGS2.0

DPOGS215242
TranscriptDPOGS215242-TA867 bp
ProteinDPOGS215242-PA288 aa
Genomic positionDPSCF300047 - 543343-547583
RNAseq coverage356x (Rank: top 33%)
Annotation
HeliconiusHMEL0127202e-6191.94% 
Bombyx% 
Drosophilachico-PB5e-4746.09% 
EBI UniRef50UniRef50_E0VU101e-5645.32%Insulin receptor substrate-1, putative n=1 Tax=Pediculus humanus corporis RepID=E0VU10_PEDHC
NCBI RefSeqXP_002429604.12e-5745.32%insulin receptor substrate-1, putative [Pediculus humanus corporis]
NCBI nr blastpgi|2420182754e-5645.32%insulin receptor substrate-1, putative [Pediculus humanus corporis]
NCBI nr blastxgi|3504226023e-6049.20%PREDICTED: hypothetical protein LOC100742603 [Bombus impatiens]
Group
Gene OntologyGO:00055153.9e-34protein binding
GO:00051588.9e-28insulin receptor binding
KEGG pathwayphu:Phum_PHUM4412906e-57 
 K07187 (IRS)maps-> Aldosterone-regulated sodium reabsorption
    Adipocytokine signaling pathway
    Insulin signaling pathway
    Neurotrophin signaling pathway
    Type II diabetes mellitus
InterPro domain[122-236] IPR0119933.9e-34Pleckstrin homology-type
[120-139] IPR0024048.9e-28Insulin receptor substrate-1, PTB
[9-111] IPR0018495.8e-12Pleckstrin homology domain
Orthology groupMCL17950 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215242-TA
ATGGCTGCAATGGTGGAGGGCGCCGTCGTGCGTCAAGGCCACCTTCGGAAGTTAAAGACGATGAAGAAGAAGTATTTCGTGCTTCGCGCGGAAACCTCGGAGTGTTCGGCGAGGCTCGAATACTACGAATCGGAGAAGAAATTTCGGTCTGGAGCTGCTCCTCGCCGCGTCCTGCTGCTGAAGAGCTGCTACAATATAACGCGGAGATTGGACTTGAAACAGAAACATGTGATAGCTTTGTTTACTAAAGAAGAACAATTGTGTATAGTGGCTGAGAATGAGCAGGATCTGCATGCCTGGCTCACTGCTATTCTTAAACAGTTCAGAACAGATGATGCCAGCGATGAGCTCTTGCATCCCATACAACATGTATGGCAAGTGAACGTCCAGAAGAAAGGTCTGGGTGCGTCCAAAAACATCCAGGGACTGTACAATCTGTGTTTAACGGACAAGACCCTGGCCTTGGTCAAGATTAAGAGTCTGAACAACGTGATCAGTGACCTGGGGATCCCGGAGAGGGTCGAGTACTCCTTGAAGAACATCAGACGATGTGGCGATTCCGAGTGTTTCTTCTACATGGAGGTGGGCCGGCAGACGGCCACCGGCGCCGGCGAGCTGTGGATGCACTCCGACGACTCCAACATAGCACAGAGCATGCACTCCACGATATATCACGCCATGAGGAACTGCGCCAAGGAGACGGAGAACGAAAAGGATCACATCGTGATATCCAACAAGAACCTGATGGAGGGCTCGCACCCGCTGCCCGCCCGGAGGCAGACCTACTCCGACGGCCGAGGGCGGGCGGGCTTCTATAACGGTAGGGAGGGTGACGACACAGACTGGGATGAGATGATGCGCGGATAG

Protein sequence:

>DPOGS215242-PA
MAAMVEGAVVRQGHLRKLKTMKKKYFVLRAETSECSARLEYYESEKKFRSGAAPRRVLLLKSCYNITRRLDLKQKHVIALFTKEEQLCIVAENEQDLHAWLTAILKQFRTDDASDELLHPIQHVWQVNVQKKGLGASKNIQGLYNLCLTDKTLALVKIKSLNNVISDLGIPERVEYSLKNIRRCGDSECFFYMEVGRQTATGAGELWMHSDDSNIAQSMHSTIYHAMRNCAKETENEKDHIVISNKNLMEGSHPLPARRQTYSDGRGRAGFYNGREGDDTDWDEMMRG-