Monarch geneset OGS2.0

DPOGS200568
TranscriptDPOGS200568-TA2133 bp
ProteinDPOGS200568-PA710 aa
Genomic positionDPSCF300119 + 442626-450033
RNAseq coverage231x (Rank: top 44%)
Annotation
HeliconiusHMEL0054887e-15256.75% 
BombyxBGIBMGA009357-TA2e-14554.12% 
DrosophilaCep97-PC4e-12540.62% 
EBI UniRef50UniRef50_D2A2T53e-15545.77%Putative uncharacterized protein GLEAN_07010 n=1 Tax=Tribolium castaneum RepID=D2A2T5_TRICA
NCBI RefSeqXP_968521.16e-15645.77%PREDICTED: similar to phosphatasepp1 regulatory subunit [Tribolium castaneum]
NCBI nr blastpgi|910818731e-15445.77%PREDICTED: similar to phosphatasepp1 regulatory subunit [Tribolium castaneum]
NCBI nr blastxgi|910818733e-15246.10%PREDICTED: similar to phosphatasepp1 regulatory subunit [Tribolium castaneum]
Group
KEGG pathwaypgi:PG18642e-15 
 K13730 (inlA)maps-> Bacterial invasion of epithelial cells
Orthology groupMCL14580 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS200568-TA
ATGGACAAGTCGAACAATGAAACACTTGATTTATCTAAGCGTGGGCTTAAGAAAATAGAGAAAGCGCACGCTGAGGATGCTGATACTGTAATCAATTTGATTTTCGATCAAAATGAATTGCAGCGTATCGAAAACATTGATTCCTATCAACGAGTTGAAAACCTGTCAATATGCAATAATTTTCTGTTGCGTATGCATGGTGTATCTAGACTCACAAACATCAGGGTGCTGGCTCTCAACAATAATGGAATATTACAGATAGAGGGTCTCAAAGATTTGATATACTTGAAAGTATTGAAATTAGCTGGTAACAGCATCAAATCAATAGAACATCTTAATCACAATATACATTTGGAGCATCTTGATTTATCAAATAATCAGATATCGTATATCGCTGATATATCCTACTTGAAATGTTTGAAGCATCTCAATCTGGAACGTAACCGTATAATGGATCTCCGTCAGTGTGACCGTTACTTCCCCACCAACCTCCTAACTCTAGGGTTGGCACACAACAACATACAGGATCTCAATGAAGTGTCGCATTTGGTTCATCTATCGGAATTGGAGAGTTTCACCATACAGGGCAACCCCTGTGTGGCTATGGCGGGGAAGGATAAATTAGGTTTTGATTACCGTCCATTTGTTCTGAACTGGATCATGAGTGTCAAATCTATTGACGGATTTATAGTTAGTGCAATAGAGAGTTTGAAAGCAGAGTGGTTATACAGTCAGGGTAAGGGTCGAATATTTCAGCCAGGTCAACATAAGGAACTTTGTGAATATCTCGCGGCCACGTGTCCTCTTACGGCAGACGCGCTGGAGACGGAGGATCAGAGGAAATTGAGACTTATATTGAGCAAGGCTCAACATCACCAACAACAGCTGAAAGATCAAAATCATAGTCCCATCAAATCTCGGATACAGTCACCTAGAATGCAGTCTCGTATGTCGTCTCGCTACGTGGGTCGTTCTCCGGATAGGCTCACTTCCAGCTGTCATTCTCGTGTGATGAGTAACAGCTGTTCTGGAGACCTGCCGTCATCGTCGCCGCTACCGCCCACACTCCCCGCAGCTATGTCACAGTCCGTCACACACACACTGACGGACAGACGAGAAAAGACAGATCAACCAAAAGTAATGAACCAGCCATTGGAGGCTGCGTCCAAAATGGTTCCCGTGCCGGAGTCATTAATGAGTCCCGACTACCAGGACCCGATATACGACATACAGAAGACAATACCAAAGACAACCAACAACAACCTATCCAACACCTCCTCACAGTCCAGTAACAGTAGCGTTACGGAAGAGAAAAGACAGATATGCAAACTAAAGGGATCACCAAAGTTACCGAAAAGCAACACCTCGTCCCCCAAAATGAAACCCAAAATAGAAAGGAAAGGAAGTCTCACTAAGATCGAAGCTGACAAGAAATTAAACGGAGTGAAGGACAAAGAAGATATCGACCAAGATAAGCTGGAAGTGATAAAACTCGCGTCCAATCAGAGGCGGCAGAAAAAGATGAGTGTGGAAAACATGGCGGCCGTCACGATACAGAAGATATGGAGGGGATACAGAAGTAGGAATCTTAATAAAGATACGTTGAGGATTCTACATGCTATACAAGCTGCTAGAGCTAGGCAGCATATACAACGTCTCACATGTGATATGGAAGCGACTAAAGCTGCTCTGGAGAGTGAGAGGAAGATACAGCAGTTACAGATGCAGGCCATCAATGCACTGTGGAAGAAAGTCTCCACTCTACAGACTACTGATCCAAAGCATAGAGTTTCGGATTCAGAGGATAACTCGGAGGCTCTCAGACAGTTGACAGAAACCTGTGTGTCGCTGCAAGCTCAGGTAGTGGAGCTCCAAGGTTGTATGAGGGATGTCCTCCGGGCAGTAGGTCGGACTGGTGAGGCTCAAGTCGCTACACAGACGGACATAACAGCTGTTATTACACCGCAGGAGGAACGTTGCAGTTGGCTGAAAAGACCACAGTCGTTAGCCCTACCAGCTCATTGTGCGGATACCGTTGAGAAGTCTGAATCAGAAATCATTCCCATAGAGCGCACCGAGTCAATCATCGAGGAGAGTATAGAGAATCAAGAAAGGGAATTGTTGGCCGACTGA

Protein sequence:

>DPOGS200568-PA
MDKSNNETLDLSKRGLKKIEKAHAEDADTVINLIFDQNELQRIENIDSYQRVENLSICNNFLLRMHGVSRLTNIRVLALNNNGILQIEGLKDLIYLKVLKLAGNSIKSIEHLNHNIHLEHLDLSNNQISYIADISYLKCLKHLNLERNRIMDLRQCDRYFPTNLLTLGLAHNNIQDLNEVSHLVHLSELESFTIQGNPCVAMAGKDKLGFDYRPFVLNWIMSVKSIDGFIVSAIESLKAEWLYSQGKGRIFQPGQHKELCEYLAATCPLTADALETEDQRKLRLILSKAQHHQQQLKDQNHSPIKSRIQSPRMQSRMSSRYVGRSPDRLTSSCHSRVMSNSCSGDLPSSSPLPPTLPAAMSQSVTHTLTDRREKTDQPKVMNQPLEAASKMVPVPESLMSPDYQDPIYDIQKTIPKTTNNNLSNTSSQSSNSSVTEEKRQICKLKGSPKLPKSNTSSPKMKPKIERKGSLTKIEADKKLNGVKDKEDIDQDKLEVIKLASNQRRQKKMSVENMAAVTIQKIWRGYRSRNLNKDTLRILHAIQAARARQHIQRLTCDMEATKAALESERKIQQLQMQAINALWKKVSTLQTTDPKHRVSDSEDNSEALRQLTETCVSLQAQVVELQGCMRDVLRAVGRTGEAQVATQTDITAVITPQEERCSWLKRPQSLALPAHCADTVEKSESEIIPIERTESIIEESIENQERELLAD-