Monarch geneset OGS2.0

DPOGS215243
TranscriptDPOGS215243-TA1830 bp
ProteinDPOGS215243-PA609 aa
Genomic positionDPSCF300047 - 539149-541863
RNAseq coverage976x (Rank: top 13%)
Annotation
HeliconiusHMEL0127195e-3338.89% 
Bombyx% 
Drosophila% 
EBI UniRef50UniRef50_UPI0001EAE2F63e-1138.26%UPI0001EAE2F6 related cluster n=2 Tax=unknown RepID=UPI0001EAE2F6
NCBI RefSeqXP_001603609.12e-0936.94%PREDICTED: similar to insulin receptor substrate [Nasonia vitripennis]
NCBI nr blastpgi|3287042231e-1038.26%PREDICTED: hypothetical protein LOC100569479 isoform 1 [Acyrthosiphon pisum]
NCBI nr blastxgi|3287042251e-1138.26%PREDICTED: hypothetical protein LOC100569479 isoform 2 [Acyrthosiphon pisum]
Group
KEGG pathwaynvi:1001199084e-09 
 K07187 (IRS)maps-> Aldosterone-regulated sodium reabsorption
    Adipocytokine signaling pathway
    Insulin signaling pathway
    Neurotrophin signaling pathway
    Type II diabetes mellitus
Orthology group 
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS215243-TA
ATGAACGATAAAGGCGGCGGTACCGTCGTCTGGCTCCATGTAACAGTCGTGCGGGTGGTTGACGCGTCGCTTCCCATTGTGTCCCCAGCGGACACGGGCCAGTCCACCCACCACCGTACTGGGTCCTTGCCCTCGTGCCCGGTGCCGCCGTTCGAGGCGGGTGCCGAGGGCGGGTCGGAGCGACCCAGGGTGCACCACGGGCGGACGCACTCCGCGCCCCTCACCGGCGAGGAGCTGGACAAGGCGCTGTACTCCAGGACTAACGGTACGGACGGGCGGCGGGCGCCCTGGCTGCGCCGCGCCAGCACGGGCACCAGGCTGGCCAAACTGTCCCCGGCTCACACCACCAACTCCAGCTCGCTCCGTAAACGATGCGACACGATGCCGACCCGAGCCAGCAGCGGGCTCGACCTGGAGCGAGGAGGAACCCTCAGGGACGAACGGGACCCGCTACACTCCAGTACGGACACAACGAATACAACATTGTTATATATTATATCGAGATTACAACTGAGCACAACTGATGGACCTCAGGACCGACTGTCGCACTATACGATATATATATATGTTGAACATTCATTCCTTTCATTTATTTCTGAACGTAACAGTGAACCGTTTTATTCAGGTTTAGAATCATCACGAGACGACATCGACTCGATATCGGAGTGGTATCGGACGCCGCGGATACCGGAGGAACCCGACTGCTGCGATGTCATGTCCAAAGACTTCATGAGCATGACGACCCGCGGCAGTTCCATCGCTCACTCGCGGACCTCGTCGACATGCGTGGAGGAGGCGGAGGTGGGGTCGGGGTCGCCCGCGCTACCTGAGGGGTACATGCCCATGGGACCCATCCACGACCCTCTCCCACTGGTCTCCTCATCCGGCTCAGTCTGTAGCGGGACGCCCTCCACGGACCCTAGGTTCAGACAACGCTACGGTTCCCTTGCTAGCGAGTATCAACTGGAGCCAGCCACGGCTCACATCGCGGAGGAGCGGTCGACGCGCGCCTACAGCGTAGGCTCCCGGCCCGCGGCCAGAGCCGAGCCCGCCAGACTGCGCGCGTACAGTGCGGGCGCTAGACGCAAGCCCCTGCCGCCCGCCACGCGCCCGCAGCACGTCCCCCACACACACACACACTCGTACCCCCGGGCCTCCGCGGACGACCTCATGGAGCTCGACTTCTCCTCCAACTCGCCCGCGCCGAAGGTGTTCATTTATTTCATGCGAATACTACGATGCCGGCACTTCAAAGGAACCGACGGATTCACAAGTTTTTCTCCCGCAGGTTATAGTCGCTCGCACGCCGCCGGCGACCGGGTTCATGGAGTCCAGTCGTCGCAACGTGGACGAGTACGTGGACATGTCGCCGCGAAACGCCGGGTACGTGGAGATGAGGCCCGGGGAGCCGCCGGCCGCCTCCACCCCCGGGCCCGCCGCCCCCGCCAGGGCTCCGGGCCGGGGTCGCCGGACGAGCGCCGCCGCCGCCGCGAGCCACGCACCCCGCTCGGCTCCCAGACCCTGTTTCACATGTCGATGGAGTCGCCGTCCTCCCCGGGCGAGGCGGACGACGACGACGACGACCACCACCACCTCAGCACGGTCCGCGAGCTGGTGGAGCCGCGCGCCGCCAGCCCGGAGCCCTCCTCGCCGCAGTACGTGACGCTGGCCGCGAACCCCCGCCCGGACGAGCTGCGCAAGCTGGGCGGGGCGCTCCACTACGCGTCTCTGGACCTGGAGCCGCGCCGCACGCCGGCCGCGCCCGCACCGCGGCTCTACACACAGATAGACTTCATGAGGAGCGAGAAGTACGCGGCGGACGCCACGTAG

Protein sequence:

>DPOGS215243-PA
MNDKGGGTVVWLHVTVVRVVDASLPIVSPADTGQSTHHRTGSLPSCPVPPFEAGAEGGSERPRVHHGRTHSAPLTGEELDKALYSRTNGTDGRRAPWLRRASTGTRLAKLSPAHTTNSSSLRKRCDTMPTRASSGLDLERGGTLRDERDPLHSSTDTTNTTLLYIISRLQLSTTDGPQDRLSHYTIYIYVEHSFLSFISERNSEPFYSGLESSRDDIDSISEWYRTPRIPEEPDCCDVMSKDFMSMTTRGSSIAHSRTSSTCVEEAEVGSGSPALPEGYMPMGPIHDPLPLVSSSGSVCSGTPSTDPRFRQRYGSLASEYQLEPATAHIAEERSTRAYSVGSRPAARAEPARLRAYSAGARRKPLPPATRPQHVPHTHTHSYPRASADDLMELDFSSNSPAPKVFIYFMRILRCRHFKGTDGFTSFSPAGYSRSHAAGDRVHGVQSSQRGRVRGHVAAKRRVRGDEARGAAGRLHPRARRPRQGSGPGSPDERRRRREPRTPLGSQTLFHMSMESPSSPGEADDDDDDHHHLSTVRELVEPRAASPEPSSPQYVTLAANPRPDELRKLGGALHYASLDLEPRRTPAAPAPRLYTQIDFMRSEKYAADAT-