Monarch geneset OGS2.0

DPOGS208557
Transcript         DPOGS208557-TA   1437 bp
Protein            DPOGS208557-PA   478 aa
Genomic position   DPSCF300064 + 1123963-1125577
RNAseq coverage    117x (Rank: top 58%)
Annotation
Heliconius         HMEL002362         0.0      68.31%
Bombyx             BGIBMGA010663-TA   1e-169   61.35%
Drosophila         CG13398-PA         1e-17    39.29%
EBI UniRef50       UniRef50_E3XFH1    7e-36    49.31%   Putative uncharacterized protein n=2 Tax=Pancrustacea RepID=E3XFH1_ANODA
NCBI RefSeq        XP_001122793.1     4e-43    30.69%   PREDICTED: similar to fibroblast growth factor receptor substrate 2 [Apis mellifera]
NCBI nr blastp     gi|328783128       8e-42    30.69%   PREDICTED: hypothetical protein LOC727079 [Apis mellifera]
NCBI nr blastx     gi|328783128       4e-51    31.23%   PREDICTED: hypothetical protein LOC727079 [Apis mellifera]
Group
Gene Ontology      GO:0005515   2e-25     protein binding
                   GO:0005158   3.9e-22   insulin receptor binding
KEGG pathway       xla:398273   3e-25
                   K12461 (FRS2) maps-> Neurotrophin signaling pathway
InterPro domain    [17-103] IPR011993   2e-25     Pleckstrin homology-type
                   [15-107] IPR002404   3.9e-22   Insulin receptor substrate-1, PTB
Orthology group    MCL25755   Lepidoptera specific

Nucleotide sequence:

>DPOGS208557-TA
ATGGGTTGCCTTCAGAGTAAAAAAGAAATATCTGATCTACATCCAAATGTTTTTCGTGTTGTGAATATTGATGAAAACGGCGTAGACTTGTGCTCTGGACAACTGGAAATCACGGAATCTGATATTATTTTATATAGAGAAGGCAGAGATTCTACTATCTGGCCTCTTCATTCACTTCGGAGATATGGCTTTGAAGGAGAAATATTCAGTTTTGAATCAGGTAGAAGATGTGAAACCGGTGAAGGCATTTATGGATTCAGATGTCGCCGAGCTTCCCTATTATTTCAAACTTTACAACAGCAAATACAGTTAAGAAATGTTGTTCACGATTCTGTGCCCTATCCAGTTTCTCGTTTAACACCATCCCCTCAAGGCAGACAAACATTACAAGCCACTGTTATCCATCGATCCTCTGTCGACAATGGGCAACCGGACAATACTGGAACAGGTTTCAATAATAACATCACCACCATAAACAGTACAGTTCCACCAGCAATTATACCATCTCCCCACTCTCCATCCAGTGCTGACATATTAGAAGTGATGCCACTGTATCCAAGATCACAGTCATCAAGCAACCATGTAACCAATGTTTATCAAGTGAGGGATTTTAAACGTGAACATAATAATAACCAGACCGATGTCAACAATGACACAAGACATGTTTATACAAATGATTTTAATAGGGATCTAGCTATGCTTCGGAAAACATTGAAACAAGAAATCGCTTTAAATACAATCAGGGATATAGAAGACGAAACAAGATTCTTAGAAAGGAGACACATCAATGAAATGAAAACAGATTCAGCAAACCACCCTTTGTCGCCTACAATAAGTTGCTCAAGTGAACATTATGCTCAACTTAGTACTGAACAACAAGAAACATCTAGAATGTATGTGAATATTGCGCCCAGTACTAACAACTCAGCTACAGACATTAAAGCTGACCCAGTCCCAACCACACCTCTTACACCAAAACAAGTAGAATATTGCAATTTGACTGTTGGTTTAAAACCAGAAATAAATACATATGCAAATGTGATGCTGGGCGATTTCTCAGATAGTGCAAAAGGTTCAAGAACGATTCACAATTCAGAACACAACCAAAAATTTTCAGAGTCGGATACCTTCACATCAATGTCGCCAGTTGAGGAAATGGAAGTCAATTACGCAGTTCTTGATATAGATACCAATAAAGAGAATATTAAAGTAGCAAGAGAGCTTGCTTCCCCAGAAAGTCAAACCACAGCCAATCCAATGAATCCACCATCTAGTATTGGTTATACAACAATTGATTTTGATAAAACTGTTGCACTGACATCGGTAGCTGCTGGTGCAGAGAGCAATATGGATGGGCCTAGAAAAAATCGGCACAACTCTTGCAGTATATTAAGTGGAAGCCCCGGCAGTAGTGAAAAATGTAGAGGGAATAACTGA

Protein sequence:

>DPOGS208557-PA
MGCLQSKKEISDLHPNVFRVVNIDENGVDLCSGQLEITESDIILYREGRDSTIWPLHSLRRYGFEGEIFSFESGRRCETGEGIYGFRCRRASLLFQTLQQQIQLRNVVHDSVPYPVSRLTPSPQGRQTLQATVIHRSSVDNGQPDNTGTGFNNNITTINSTVPPAIIPSPHSPSSADILEVMPLYPRSQSSSNHVTNVYQVRDFKREHNNNQTDVNNDTRHVYTNDFNRDLAMLRKTLKQEIALNTIRDIEDETRFLERRHINEMKTDSANHPLSPTISCSSEHYAQLSTEQQETSRMYVNIAPSTNNSATDIKADPVPTTPLTPKQVEYCNLTVGLKPEINTYANVMLGDFSDSAKGSRTIHNSEHNQKFSESDTFTSMSPVEEMEVNYAVLDIDTNKENIKVARELASPESQTTANPMNPPSSIGYTTIDFDKTVALTSVAAGAESNMDGPRKNRHNSCSILSGSPGSSEKCRGNN-
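
The lengths reported in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes the genomic coordinates are 1-based and inclusive, which the record does not state. It checks that the 1437 bp transcript corresponds to 478 codons plus one stop codon, and computes the length of the genomic span.

```python
def cds_length_matches(transcript_bp: int, protein_aa: int) -> bool:
    """Return True if the transcript is exactly protein_aa codons plus one stop codon."""
    return transcript_bp == 3 * (protein_aa + 1)


def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Values copied from the record above.
transcript_bp = 1437
protein_aa = 478
genomic_start, genomic_end = 1123963, 1125577

print(cds_length_matches(transcript_bp, protein_aa))
print(span_length(genomic_start, genomic_end))
print(span_length(genomic_start, genomic_end) - transcript_bp)
```

Under that coordinate assumption the genomic span is longer than the transcript. That would be consistent with non-coding sequence such as introns inside the gene model, although the record itself does not list the exon structure.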