Monarch geneset OGS2.0

DPOGS213594
Transcript: DPOGS213594-TA, 1698 bp
Protein: DPOGS213594-PA, 565 aa
Genomic position: DPSCF300033 + 524991-536848
RNAseq coverage: 3107x (Rank: top 4%)
Annotation
Heliconius: HMEL007905, 0.0, 70.68%
Bombyx: BGIBMGA011660-TA, 0.0, 65.83%
Drosophila: CG9619-PA, 9e-78, 57.14%
EBI UniRef50: UniRef50_Q29DH3, 5e-78, 42.95%, GA21917 n=3 Tax=Drosophila RepID=Q29DH3_DROPS
NCBI RefSeq: XP_002047502.1, 8e-79, 50.32%, GJ11905 [Drosophila virilis]
NCBI nr blastp: gi|195377449, 2e-77, 50.32%, GJ11905 [Drosophila virilis]
NCBI nr blastx: gi|194751660, 3e-87, 36.59%, GF10772 [Drosophila ananassae]
Group
Gene Ontology: GO:0005515, 2.6e-30, protein binding
KEGG pathway: dvi:Dvir_GJ11905, 2e-78
 K07189 (PPP1R3) maps-> Insulin signaling pathway
InterPro domain: [432-537] IPR005036, 2.6e-30, Putative phosphatase regulatory subunit
Orthology group: MCL16410, Insect specific

Nucleotide sequence:

>DPOGS213594-TA
ATGAATTCTCCAAATCTGACTATGTCATTGGAACATCAAATGCCGGTGCTTATGAAAAAGACGCAGATGAGTGGAGATTGCGCGGGGTCACAGTGCGGTTTGACGTCACTGCTGCCGATGTCCTGTCGCGGGAGAGCCGCAGCGTTCGCGCGCGACTTGCATTCAAGACTGCGTAATTTGGGAGCATCACATGACGGTGAATTGGAAAACAGCTGGCTGGCCAGGGACAGCGCACACAGGCCTACCAACTCGAACCACTCACGTGACATTGACACCTTCTACGACTTCGAGCTCGAGTGTGAAAGTCCGTCTAGTCCTGTTGATGAATACGTTCAGTTCCCGGACAAACACAACGAAGATAAGGATCCGCCTTTTTATGATGTAGACTCAGATCCAGAACATAAGGAGCAAACTAAATCGATGCTCAAGCAGCCCGATAAGGGGGTTAATGGATTTAAATTTTCAACAGCATTTTATACGGAATCTCCAATAAACTTCCAACCATCGTCCAAAGAAAACGGACATATCGATAAACCACTCTATTCTCCTATCACTTTTGAAGGCTGTGCTCGGCAGAACAGTTTCGATGAGGTTGATTGCGCACAACCTTTGAGGCCTTACGACACAGAACGACTATTTTCACAGTTTAATTCAAACGACAGCGACTCAGAATTCGAATCGGCTAAGAGCGATCCCTCTGAAGGAACAGATGACGTTACACAGGATGATACCCTCTCACAAAATAACTTAGTGGATGCTATCGATGCATTTTCAATTACAGATACCGAAAATTTGCAAAAAGGAAATGTTCCTGAATATAGTGTGGAAGTTTCTTTAGCAATTTCGGAGTGTCAGACTGTCGATAAAACTGAAGCGGAAACAATTTTAAATAATGGCTTGACTGAAACAGTTACCAGTGAAGCTTTCAACGAGAACGATTCATTGTCAGTTCAAGAAAATGATTCCAGTTCACTCGATGCAGAATCTAAATCAGAGGACGATGGTGAAGATGATCGACCGCAACGAGTACGGAGATGTTCATCATTAAAAACTGGGAAAACGCCGCCCGGTACTCCCGGACGTAAAAAAATTGTTCGATTTGCTGACGTGCTTGGACTCGATCTGGCGGACGTGAAGACCTTCATGGATGAAATACCGGTAATACCAAAATCCGCTTACGATGATCTCACTGGTTGTGATGTACAAAATTCCCCTCCCACGAGACCACCGCCCCGTCTAGGGGCATTGACGTTAGTTCCTTTATTCCAAGTTCCTCGCGATGTAACGGAAAAACTAGAAAGGCAAAACGTGTGCTTAGAGAGTTCACGTGTATGTGATGGCGTCCATGTAACAATTTGTGGCTCTGTACGAGTACGTAATTTAGATTTTCACAAAACTGTACACATACGCTACACAATGAATCGTTGGAAGACCTACACGGATTTACAGGCGAACTATGTACAGGGCTCGTGCGACGGATACTCGGACCGCTTCCAATTCACGTTATACGCACCTTGTATTTCATCGGGCCAAAGGTTAGAAATCGCCGTCAGATTCCAATGTAAGGGGCAACAGTTTTGGGACAACAATAGCGGAGCTAACTACTGCTTCGATTGCTTGGCTCTGGGTAATATCCATGCTACATCTTCGCCGATGACGCTACATCCGACTGTTGACTGGCACCCATCCTTCTACTGA

Protein sequence:

>DPOGS213594-PA
MNSPNLTMSLEHQMPVLMKKTQMSGDCAGSQCGLTSLLPMSCRGRAAAFARDLHSRLRNLGASHDGELENSWLARDSAHRPTNSNHSRDIDTFYDFELECESPSSPVDEYVQFPDKHNEDKDPPFYDVDSDPEHKEQTKSMLKQPDKGVNGFKFSTAFYTESPINFQPSSKENGHIDKPLYSPITFEGCARQNSFDEVDCAQPLRPYDTERLFSQFNSNDSDSEFESAKSDPSEGTDDVTQDDTLSQNNLVDAIDAFSITDTENLQKGNVPEYSVEVSLAISECQTVDKTEAETILNNGLTETVTSEAFNENDSLSVQENDSSSLDAESKSEDDGEDDRPQRVRRCSSLKTGKTPPGTPGRKKIVRFADVLGLDLADVKTFMDEIPVIPKSAYDDLTGCDVQNSPPTRPPPRLGALTLVPLFQVPRDVTEKLERQNVCLESSRVCDGVHVTICGSVRVRNLDFHKTVHIRYTMNRWKTYTDLQANYVQGSCDGYSDRFQFTLYAPCISSGQRLEIAVRFQCKGQQFWDNNSGANYCFDCLALGNIHATSSPMTLHPTVDWHPSFY-