Monarch geneset OGS2.0

DPOGS210436
Transcript: DPOGS210436-TA (2835 bp)
Protein: DPOGS210436-PA (944 aa)
Genomic position: DPSCF300062 - 245570-250564
RNAseq coverage: 236x (Rank: top 43%)
Annotation
Heliconius      HMEL015109              4e-160  65.17%
Bombyx          BGIBMGA001956-TA        0.0     57.20%
Drosophila      mi-PA                   6e-33   32.13%
EBI UniRef50    UniRef50_UPI0002246877  6e-47   38.18%  UPI0002246877 related cluster n=1 Tax=unknown RepID=UPI0002246877
NCBI RefSeq     XP_001809235.1          2e-42   29.81%  PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp  gi|345484708            2e-46   38.18%  PREDICTED: hypothetical protein LOC100679267 [Nasonia vitripennis]
NCBI nr blastx  gi|345484708            5e-49   28.06%  PREDICTED: hypothetical protein LOC100679267 [Nasonia vitripennis]
Group
KEGG pathway:
Orthology group: MCL22664 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210436-TA
ATGGTGGTGATGGAATCCTTCGAAGACTCAAATTGTCCTTCGTTGAGCGATAGTTTATCTTTGTATGAGCGGTATATGAGAAGTTTTAACGTTGTTAATAGCTTTAACGATAACAATAAAGAAATCAATGAACCATCTAGTAACTGTGAAAGTTCTAATAGTGGTGGTAATATAGATCTTTCAACTGTAATACAATCAAATTTATCAAATCATATTGATGTAAATTCAGTAAAACCAGAATATAGTCCAATACAGACTATTTTGGACGATGTGGATAAGGTATCTGATCCAGACATAGTTAAAGTTGGGTCGCCTTGCAATTCACTTGCCGCGTCTCTGACGGAAACTTCTTCCGCAACAAACCCTTGTTTGAATGATAAGTCTGACAATACAGACGTTCCTGTATCCTCTGAAGAGTCCAAATTTGAAGGATTCAGATCAGTTTTTATAACCTCTACAATGGAAAACAGTAACAATGCGGTGGATAGTGTACCTTTGTTTCGTGACGTAAAACAAACCGAAAGTGTTTTTAAATTACCAATTTGTGAATTATCATTTAATGGTACCATACTAAAACATGTTCAGAAACTTAAACCTATAGTTGATCCATTAACTGCTTTAAGCATGTCAAATAATGTTCTTAATTTAAATTACGAGCGATGCACAGATGTAATATCAAATTGTTGCAAAAATGAGTCTAGTGTTGATAGAAAATCACAAATATCAAGTGATATGGAAGACAAAAACTCAAATGAATCTGTTAAAAACTTAAATCCCACTGAGAGTAGCGAGAGCTCATGTGGAGAGGATAGTTCAAAGAAGGAACAGACTAATAATGATGGAAATTCTAAGTGGAACTTGGACCACAGTTATTCATCCCTGGAGGCATCCTTTGATAGTGGGATGCGCTCTCCTGACATGTTTTCAGATGAAGATGAACCTGAACCGAGTCCACCGCCTGAACCATTTTGGAACTTTTTAAAAGATTTTGAGCAATATGATAAACGAAAAGTTAGAAAGATAGAGGAAACATTACAAGGCGTACTACCACCTCCGTCTGTAACAACACTTAAGACTGATGTGACTCAAATGTTGAAGAAATATTACTGTTTTCTGCCAGCTTTCAATGGGGAGGAGAATCTTAATATTGAAGCTAATTCTATGACACCCACAAAAAAGGTTTCATTCGTACAAATACCGGAGACAACAAATGTACACAGTTTTTCAGAAACTGTACATTTGGATAAGACTGATACGAGTTCAATACAGCCAGATGTTTTGCAAAAATCTCAAGCTAACACAAGTATCAGCGATAAGAGCATAGAGATAAAAATGTGTTCGGAAATTGAGGTTTTAGAAGCGTCATGGCCGGATGTGTTGAAATGCAAATACTATGATGTTTACTACAATCTAACGTCCCACTCGGAAAAATATGAGATGCTCATGCAAAAGTATGCGGAAAGGTTCATTGGTGCTGAAACTGATACGAGCGTCAATATATACTCTGGAGGATTGCAGTCACCAAGTAGTGCTTGCAAAAGGAAAGCGCTGAGATTGAAGATGTCTCAAGTCAAATCTCCCGGTCGGAGGCTGTCTCACTTAGCACGTCGTCGGCAGGCCTTTTGCAGCGCGGCCACTATCAATGAGAAGGCACAGACGTCTTCCAAAATGGTGTTAATTGATAAAAAAAAGCTAATCAACTCTGCGGAGAGGAAAAGTCCTAGAACCCGTCGCACACCAGGTAAGAAGACCCCCGGGAAGAAGACGCCGTCGAAGACACCAAAGACAAAGAGCGGGGGCTCCAGTAAGAAGAAAGCGATGCGGCGACTTCTCATGGATTCGGATCTGTCGAAGACTCAACCGTCCAGGGATACGCTGAAACGAGCTTTGTTCATCAGCCCGGATAACAAAAAGCCCGTCGCCACATGTTCCTCCGTCCCAAATCAGGCTTTGAAATCCAAACGAGCGTTATTCGGATCGCCGGTGAGGCAAGCGGAAAC
CAAGAGCCTGGATGGAACAGCCAGCGATCAGTTCCTGAAGCGGAAAAGAGATACGCTGGATGATGAACCGGAAACCAGTAGGAACAAGATAGCGAAGAGTCTCTCGTTCGGTGGGGATAGTCGACTGTCGTTCAGTTCAGAAAATAGATTGACGTTTGGCGTTGAAAACCGAAGGGCGTCAGAGTGTTTGACGACGAAAACTATGGCTGAACTTAACGAAACCCATAAGAAGAAACTTCTTTGGGCCGTGACCGAAGCCCTACGTCTCCACGGCTGGCGCATGTCTTCCCCGGGTTTCCGTGAGAAAGCTTCATCCCTGGCTCGCCTGACACGCAAGCTGTTGACTCTTCCGCCTCACGCGGCTCGGCTTGCAGCACCCAACCTGTCGACTTCTGATACTATGTTTAAGTTAGCTCGCCAATATGTATTTGCAATAATTCAAGGCCGTACAGTCGATGAATGCTATCAAGATGAGGTCCTCAAAATCTCTAACGAGAATAACAAGATAACTGGCTACATATCAGCCACGGCCTACCAACAGATGAAGACCAAACAAGTTCCGTCGACGCTGACCTCGCAGATCAAAGAAAACACTTTTGGGGAAAGGTCAACTAAACTGGAACAGCCTAGGAGCACCTCCAAGAATATACTACAAGACAAATGGATGAATATAGACTGTAATTCAAACTCAAATAGCAACAGCAGTGGTAGTTTTAGTGTTCTAGATAAGGCTGGAGTTTTCAAGTCCAATTCGATGCCTTCATTCGAAGAAGCAGCCAAGATGAGGGCGAGGAGACAAATCAGTTTTGATAATGTAGATTTTCCAAAGAGGTGA

Protein sequence:

>DPOGS210436-PA
MVVMESFEDSNCPSLSDSLSLYERYMRSFNVVNSFNDNNKEINEPSSNCESSNSGGNIDLSTVIQSNLSNHIDVNSVKPEYSPIQTILDDVDKVSDPDIVKVGSPCNSLAASLTETSSATNPCLNDKSDNTDVPVSSEESKFEGFRSVFITSTMENSNNAVDSVPLFRDVKQTESVFKLPICELSFNGTILKHVQKLKPIVDPLTALSMSNNVLNLNYERCTDVISNCCKNESSVDRKSQISSDMEDKNSNESVKNLNPTESSESSCGEDSSKKEQTNNDGNSKWNLDHSYSSLEASFDSGMRSPDMFSDEDEPEPSPPPEPFWNFLKDFEQYDKRKVRKIEETLQGVLPPPSVTTLKTDVTQMLKKYYCFLPAFNGEENLNIEANSMTPTKKVSFVQIPETTNVHSFSETVHLDKTDTSSIQPDVLQKSQANTSISDKSIEIKMCSEIEVLEASWPDVLKCKYYDVYYNLTSHSEKYEMLMQKYAERFIGAETDTSVNIYSGGLQSPSSACKRKALRLKMSQVKSPGRRLSHLARRRQAFCSAATINEKAQTSSKMVLIDKKKLINSAERKSPRTRRTPGKKTPGKKTPSKTPKTKSGGSSKKKAMRRLLMDSDLSKTQPSRDTLKRALFISPDNKKPVATCSSVPNQALKSKRALFGSPVRQAETKSLDGTASDQFLKRKRDTLDDEPETSRNKIAKSLSFGGDSRLSFSSENRLTFGVENRRASECLTTKTMAELNETHKKKLLWAVTEALRLHGWRMSSPGFREKASSLARLTRKLLTLPPHAARLAAPNLSTSDTMFKLARQYVFAIIQGRTVDECYQDEVLKISNENNKITGYISATAYQQMKTKQVPSTLTSQIKENTFGERSTKLEQPRSTSKNILQDKWMNIDCNSNSNSNSSGSFSVLDKAGVFKSNSMPSFEEAAKMRARRQISFDNVDFPKR-
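
The lengths listed in this record can be cross-checked with a little arithmetic. The sketch below is only an illustration: it assumes the transcript is coding sequence only (it starts with ATG and ends with the stop codon TGA, shown as "-" in the protein), and that the genomic coordinates are 1-based and inclusive, which the record does not state. The function names are hypothetical helpers, not part of any database tool.

```python
# Values copied from the record above.
TRANSCRIPT_BP = 2835
PROTEIN_AA = 944
GENOMIC_START, GENOMIC_END = 245570, 250564


def expected_protein_length(cds_bp: int) -> int:
    """Residue count for a CDS that ends in a stop codon (assumption)."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # The final codon is the stop codon, so it does not add a residue.
    return cds_bp // 3 - 1


def genomic_span(start: int, end: int) -> int:
    """Span in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end - start + 1


# Does the transcript length agree with the listed protein length?
print(expected_protein_length(TRANSCRIPT_BP) == PROTEIN_AA)
# Length of the genomic interval covered by the gene model.
print(genomic_span(GENOMIC_START, GENOMIC_END))
```

Under these assumptions, 2835 bp corresponds to 945 codons, that is, 944 residues plus the stop codon, which matches the listed protein length.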