Monarch geneset OGS2.0

DPOGS205303
Transcript: DPOGS205303-TA; 1641 bp
Protein: DPOGS205303-PA; 546 aa
Genomic position: DPSCF300021 + 1370724-1372659
RNAseq coverage: 229x (Rank: top 44%)
Annotation
Heliconius: HMEL016199; 0.0; 89.81%
Bombyx: BGIBMGA011043-TA; 6e-107; 79.91%
Drosophila: Dok-PA; 4e-59; 45.74%
EBI UniRef50: UniRef50_D6X437; 1e-81; 37.64%; Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6X437_TRICA
NCBI RefSeq: XP_971563.1; 2e-82; 37.64%; PREDICTED: similar to Downstream of kinase CG2079-PA [Tribolium castaneum]
NCBI nr blastp: gi|383860864; 7e-83; 37.01%; PREDICTED: uncharacterized protein LOC100881942 [Megachile rotundata]
NCBI nr blastx: gi|332021662; 3e-88; 38.00%; Docking protein 2 [Acromyrmex echinatior]
Group
Gene Ontology: GO:0005515; 4.5e-27; protein binding
Gene Ontology: GO:0005158; 3.2e-22; insulin receptor binding
KEGG pathway: dre:436931; 2e-07
 K12461 (FRS2) maps-> Neurotrophin signaling pathway
InterPro domain: [135-233] IPR011993; 4.5e-27; Pleckstrin homology-type
InterPro domain: [137-233] IPR002404; 3.2e-22; Insulin receptor substrate-1, PTB
InterPro domain: [5-115] IPR001849; 2.1e-07; Pleckstrin homology domain
Orthology group: MCL15959; Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS205303-TA
ATGGAGCAAGATGAAATTAAATCTGGCTGTTTATTATTTCCCCCCGCCGGAGGAAATGTGTTCATTAAATTCCCTAAAAAGAAAAATTGGCAACAAAAGCATTGTATACTTTTTAATAAGAGTATTCAAGGTATTGAACGTCTTGAAATCTATGATAATAAAGATGATGTTACTTCAGGTGCTATACCAAAAGTGCTGACACTTGAGAATTGTATAAAGATTACTCATTCTAGTCCAAATACTATAACTATACTTACTAAGTCACATACCCAACAAATCAGTGCTCAATCAGAGGCAGAAACTTTGGAATGGGTGACTGCCCTTCAATCGGTTGCCTTTAAGGGACGGGACTTGAGTCCTATAGGCTGTGATGAAGATAATGATCTTTATTGTTCCTCTGGTGAAGGTGTATTTTCTGTAGTCATGTGCACATCTGAAGCTTCTACCAGATGTGGCTTTGAACCTATTAGGTATCTCTTACTATTGACTGCTACAGCAATAGAACTAAGAGATGTGAATGACAATAAACTATTTGCAACATGGCCATATAGATATATAAGGAGATATGGATACAGACAGGGTAAATTTACATTCGAGGCTGGTAGAAAATGTGATACCGGTGAAGGCATATTCCATCTGGAACACACTAATCAATCAGAAATTTTTAAATGTTTGTCTCTAAAAATGAAAACCATGAAACAAATGATTGGTAATGACAATCTTAATAGCCCAGAGCACAGTGAATTCAATATTCAGCCAAATGCTAACCTATTTCCTGGTTCTAGAAGCCCATTGGTTGCAACCAAACCTGATGATTTGAATTTAAGTTCGCTATCTTTTAAAACTGCTTCAGTATCGAGTCATCAAACTTGTAGTACAGATAGGGAATCGGTTCCTTTACTTATGCCTAAACCAGCACTAAAACCGAAACCACAAAAACCTCCTCGGAAAATTATAAATATGCATAAAGGAATCACAAAAGATCTGACTTGCTCTACAGATGAAAGTATAGATTTTGGTAGATACAAAAAACTGGACAACTATGAACCAATTGAGCAAATAAAACGGGAAATTAGCCCTCCTTCAGTGCCATACGATAAAATTGAGTTAAGAAGTGAAGCTTGGAAGACATTAGGTATTGATGACCCAGATCATACTGAATTCACTCCAAATTTAGTAGATAACCCAAATTGTATGAAGCTTATATCACGATCTCAGGACAATTTGAACACTTCTGGACCAGATTCTTCAAGAATCATTTTACTACCTAGTCCGGTAGTCGACCCAGAGGATGAAAACTATGATAGGCTGCAGTACTTCGGATCAACAAGCAAATTAAACAAATCTTCCAGATACAAGAAGATTGAAGCAAGACCGACAACTTTAGCTCTTGGTGAACCGAAAAATAAGGATACTTGGAATGACTATGACGAAGTTGAAAATGTAATGCAGACAGCAAGATTGGCTGATGATTCTCACTTAGGTTATGGTATGATAAGAAAACCAAATACTCCTGGACCACAAGTGCCAACCGCTGCTCAAGCCCAAGCATTACAAAATCAAGTTGGCTTAGAAGAAATCAACCATAATATTTGTAATGGTACTGACTATGCTATTGTTAGTCGCCCCAAAAGAGTGTAA

Protein sequence:

>DPOGS205303-PA
MEQDEIKSGCLLFPPAGGNVFIKFPKKKNWQQKHCILFNKSIQGIERLEIYDNKDDVTSGAIPKVLTLENCIKITHSSPNTITILTKSHTQQISAQSEAETLEWVTALQSVAFKGRDLSPIGCDEDNDLYCSSGEGVFSVVMCTSEASTRCGFEPIRYLLLLTATAIELRDVNDNKLFATWPYRYIRRYGYRQGKFTFEAGRKCDTGEGIFHLEHTNQSEIFKCLSLKMKTMKQMIGNDNLNSPEHSEFNIQPNANLFPGSRSPLVATKPDDLNLSSLSFKTASVSSHQTCSTDRESVPLLMPKPALKPKPQKPPRKIINMHKGITKDLTCSTDESIDFGRYKKLDNYEPIEQIKREISPPSVPYDKIELRSEAWKTLGIDDPDHTEFTPNLVDNPNCMKLISRSQDNLNTSGPDSSRIILLPSPVVDPEDENYDRLQYFGSTSKLNKSSRYKKIEARPTTLALGEPKNKDTWNDYDEVENVMQTARLADDSHLGYGMIRKPNTPGPQVPTAAQAQALQNQVGLEEINHNICNGTDYAIVSRPKRV-