Monarch geneset OGS2.0

DPOGS203358
Transcript: DPOGS203358-TA  2142 bp
Protein: DPOGS203358-PA  713 aa
Genomic position: DPSCF300003 + 10093-16469
RNAseq coverage: 108x (Rank: top 60%)
Annotation
Heliconius: HMEL011846  1e-115  84.09%
Bombyx: BGIBMGA014334-TA  5e-108  72.10%
Drosophila: CG2063-PA  1e-43  62.60%
EBI UniRef50: UniRef50_D6WE90  4e-57  48.68%  Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WE90_TRICA
NCBI RefSeq: XP_966537.1  6e-58  48.68%  PREDICTED: similar to CG2063 CG2063-PA [Tribolium castaneum]
NCBI nr blastp: gi|91079458  1e-56  48.68%  PREDICTED: similar to CG2063 CG2063-PA [Tribolium castaneum]
NCBI nr blastx: gi|312380643  1e-60  48.77%  hypothetical protein AND_07235 [Anopheles darlingi]
Group
KEGG pathway 
InterPro domain: [56-253]  IPR012479  6.1e-62  HCNGP-like
Orthology group: MCL14246  Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203358-TA
ATGGAAGATGGAGATCCTACACCTGAGAAATCGGTCACTCATCACACACAGTCAGCTCCAACCAGTCCCAAGAACATCGACGACACCAAACAATCTGCTTCCGCACCAGTTTCTCCAAAACGAAGTTTGGTCTCGTACGTAGACGACACTATCGTATCCGATGACGAACAATTGTCTCCTAACGCGGAAACTCAGGACGATATGAGAAGATTATCGATGGAAACCGACACAGATGAAGCTGTCCCACGATCAGATCCCGACGACTCAGAGGATAGTGTCCTTATACCTCCGGAACCAACAGCCAAATGTCCCAAGGAATTACAAGACAAAATAACAAAATTCTACACAAGAATGGTCAACGAAGGTTACGACATGAACAAAATAATTCAGGATAAAAAGAATTTCAGAAATCCAAGCATATACGAGAAGTTGATACAATTCTGCGACATCAACGAGCTAGACACGAACTACCCACCAGAAATATACGATCCTCTAAAATGGGGCAAGGAATCCTACTACGATGAGCTCGCTAAAGTCCAAAAACTAGAGATGGAGAAACGGGAAAAGGATCGCAAAGAGAAGTCCAAAATAGATTTCATCACCGGAGTGGCAAAGAAGTCGGACAGCGACGATGACAAGAAACGGAAGTCCAAGTGGGACCAAGCGGCGCCCAACGTAGCCAACAAACCCAGCATCAAACAACCGGGGCTCCTCCAGCAACCGCTGACGAGCAACGTCACCGGCACCAAGGGCACTGTATCAGACTTTGAAGATTCATTATCGAAGTATGGCATCAAGAAGGTGCTCCCGGTCGCTACGTCGACTCCAAAGAAACCAGTCAAGAGTCATTCTTTACACAGCAGCATACAGAAGAAAGAAAAACAGAAACTCTCTAATAAAAGTGACAGCACGGTCACCTGTGATAGCTTCGTCAAAACAGACTCTACAAGCTCCTACAGGCAGGCCGTGAGCACAGAGCTCGATTACTCTTCAATGTCTCCGAACTCCGCGGCGTCATCGGACAGCGCGGGCGCCGTCCAGGGCACTTTGGTGTCTCCACATACCCACGATCTTCAAGACTCCAGAAACGAAGGATTACACCTAACACCGAGAGAAACACAGAACATCATTAAATGCGCCCACATCCTTGGGAATGTCCTCACTAAAGCGATCGAAAGACAATCAAAAGAGTATGAATACAGCCAAGAGAAAATACAAGAGCTGTACGTTGAAAAACCGATGCCTGAAACTGAGATAAAAAAAAAGAATCTGACATTAGATCTGAAAGAGACAGTCTTACCTCTGGAGGTTAAAGAGGAAAAGAGATGGGAGAGCGTCCCGACACAAACTGACATTTCGCTGCCGAATACGAAAAGCGCGCCGAAAATATTTGAAAGCATTTTAAGACAATTATCAAGGAGTTCAATAGACGAAGCCGAAAAGACGATAATAGAATGCCAAGAAGAGAATACAGAGGAGAATGAGACGGGACAGTGGAAAACCAGCACGGGTATCAGTACTTTCCGCGAGGACAACTCCGGTGAGTGGGGCCAGTTCTGGGCGAACTACAACAACTCGCTGGCGAGCGTGCCCAGCAGATACTACGACCAATGTCCAACGCCGTACAGGACTGAGGACATCGACCTTGCGGATTTAGAATTCTCAACAGAGGGTTCAAGGAAACGTTCACCAGAAAACATTAAAACAATCAACAATATCATAAGAAACGAAGGATTACACCTAACACCGAGAGAAACACAGAACATCATTAAATGCGCCCACATCCTTGGGAATGTCCTCACTAAAGCGATCGAAAGACAATCAAAAGAGTATGAATACAGCCAAGAGAAAATACAAGAGCTGTACGTTGAAAAACCGATGCCTGAAACTGAGATAAAAAAAAAGAATCTGACATTAGATCTGAAAGAGACAGTCTTACCTCTGGAGGTTAAAGAGGAAAAGAGATGGGAGAGCGTCCCGACACAAACTGACATTTCGCT
GCCGAATACGAAAAGCGCGCCGAAAATATTTGAAAGCATTTTAAGACAATTATCAAGGAGTTCAATAGACGAAGCCGAAAAGACGATAATAGAATGCCAAGAAGAGAATACAGAGGAGAATGAGAGTAAGGAGACAAAATAG

Protein sequence:

>DPOGS203358-PA
MEDGDPTPEKSVTHHTQSAPTSPKNIDDTKQSASAPVSPKRSLVSYVDDTIVSDDEQLSPNAETQDDMRRLSMETDTDEAVPRSDPDDSEDSVLIPPEPTAKCPKELQDKITKFYTRMVNEGYDMNKIIQDKKNFRNPSIYEKLIQFCDINELDTNYPPEIYDPLKWGKESYYDELAKVQKLEMEKREKDRKEKSKIDFITGVAKKSDSDDDKKRKSKWDQAAPNVANKPSIKQPGLLQQPLTSNVTGTKGTVSDFEDSLSKYGIKKVLPVATSTPKKPVKSHSLHSSIQKKEKQKLSNKSDSTVTCDSFVKTDSTSSYRQAVSTELDYSSMSPNSAASSDSAGAVQGTLVSPHTHDLQDSRNEGLHLTPRETQNIIKCAHILGNVLTKAIERQSKEYEYSQEKIQELYVEKPMPETEIKKKNLTLDLKETVLPLEVKEEKRWESVPTQTDISLPNTKSAPKIFESILRQLSRSSIDEAEKTIIECQEENTEENETGQWKTSTGISTFREDNSGEWGQFWANYNNSLASVPSRYYDQCPTPYRTEDIDLADLEFSTEGSRKRSPENIKTINNIIRNEGLHLTPRETQNIIKCAHILGNVLTKAIERQSKEYEYSQEKIQELYVEKPMPETEIKKKNLTLDLKETVLPLEVKEEKRWESVPTQTDISLPNTKSAPKIFESILRQLSRSSIDEAEKTIIECQEENTEENESKETK-
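
Length check: the reported lengths (2142 bp transcript, 713 aa protein) fit a coding-only transcript, since 3 x (713 + 1) = 2142 once the stop codon is counted. The following Python sketch shows one way to check this for any pair of FASTA records. It is illustrative only: the helper names `read_fasta` and `lengths_consistent` are hypothetical, and it assumes the transcript carries no UTR and that a trailing `-` or `*` on the protein marks the stop.

```python
def read_fasta(text):
    """Parse FASTA text into {record_id: sequence}, joining wrapped lines."""
    records = {}
    current = None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            # Record ID is the first whitespace-delimited token after '>'.
            current = line[1:].split()[0]
            records[current] = ""
        elif current is not None:
            records[current] += line
    return records


def lengths_consistent(transcript, protein):
    """True if the transcript length equals the protein's codons plus one stop codon.

    Assumes a CDS-only transcript; a trailing '-' or '*' stop marker
    on the protein is ignored.
    """
    residues = protein.rstrip("-*")
    return len(transcript) == 3 * (len(residues) + 1)


# Lengths reported in this record: 2142 bp transcript, 713 aa protein.
assert 2142 == 3 * (713 + 1)

# Toy records (not real data) showing a sequence wrapped over two lines.
toy = read_fasta(">toy-TA\nATGGAA\nTAG\n>toy-PA\nME-\n")
print(lengths_consistent(toy["toy-TA"], toy["toy-PA"]))  # True
```

Because `read_fasta` joins every line up to the next `>` header, a sequence wrapped over several lines is read as one record.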