Monarch geneset OGS2.0

DPOGS202558
Transcript: DPOGS202558-TA, 1158 bp
Protein: DPOGS202558-PA, 385 aa
Genomic position: DPSCF300355 - 181383-188584
RNAseq coverage: 235x (Rank: top 43%)
Annotation
Heliconius: HMEL013191, 2e-109, 80.16%
Bombyx: BGIBMGA004335-TA, 1e-96, 74.41%
Drosophila: CG14982-PB, 2e-16, 29.88%
EBI UniRef50: UniRef50_D7EM10, 8e-29, 48.95%, Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D7EM10_TRICA
NCBI RefSeq: XP_967438.2, 1e-28, 48.37%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastp: gi|270016056, 3e-28, 48.95%, hypothetical protein TcasGA2_TC012929 [Tribolium castaneum]
NCBI nr blastx: gi|189238817, 4e-28, 48.37%, PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology group: MCL30242, Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS202558-TA
ATGGCTGAGGAGGTGTGCCCCGACCTAAAAAAAAAATTTGAAAATATAAGTCTTTCTGCAAGAACATGTACTCGCCGAACTGAAGATTTGGGCGATAATTTATGCCAGCAACTCCGGGAAAAAGCACAACAATTTGAATGGTTCTCTTTAGCTACCGATGAAAGTAATGATGTTACTGACACTGCCCAGTTTTTAATATTTGTTCGCGGAATCGATAAGAATTTTAATGTCTATGAGGAGCTTTTACAGTTATGTAGTTTGAAGGGCACGACTACCGGAGAAGATTTGTTTTGTAACTTAGAACAAGCGCTAATATCAATGCAATTACCATGGGAAAAATTGATCTTAAACTTAAACTCCGCCCCGCACTACACGCCCGCAGCACCCGCCGCGGGCACACACGGGGCTCACCCCCGGGTCGGGATACATTTCCAGGCCAGCGCCCCCCCTCCCCAGTTCCAGGACGAGCCGCTGTACGACTACAACGGCTACGACTACGACTGCACCTACAAGACGCCGCTGAAATGTCTCACGGGGACTCCGCCGAGGAAGGACTATACTAGGGAATACTCCAGATATCACGAGCCGAGCACGGAGCTGATCAGCAAGCAGCCTCAGTACCAGGAGTCGAGGCTGAGACGTCTGGACGAACAGCAGCAGTACGTCAACATTGACGCGAGAAAGATGTGTACCCCGGAAGAGGCGACTTCGCCCCTCGGGACTTTCAAACGGCAGCGGTGTCTCAGATACAAACAAAGACGACCTAGACCCATACTGAGATCCAAGTCAGATATATCAGACAGATACAGGCGTTCGGACGGCCGCAGTCCGTCGTCGGCGCCGTGTGAGGTGTCCCCCAGCAGTCCGTCCGAGGGCTCCGGTCGCCTGCACCGCTTCTTCGACTACCTCGGCTTGGAGTCGCGCCAGTACGAGGCGCTGTTCAACGACGCCGATGAAGACAGCCCCGTGTTCTTCTCGTCCGCCTCCACCGTCGACTCCAACCAGGTCGCCGCGGCCGCTGACTATACTGTACAAGGTCCGGGGGTGCAGAAGCAAATATACCGCAACACGGAGCCGCCCAGCGTGGTGGAGAGGAACGCTAGGATCATTAAGTGGCTGTGTCAGTGTCGGAAGGCGCAGATACCGCCGCCGCAATAG

Protein sequence:

>DPOGS202558-PA
MAEEVCPDLKKKFENISLSARTCTRRTEDLGDNLCQQLREKAQQFEWFSLATDESNDVTDTAQFLIFVRGIDKNFNVYEELLQLCSLKGTTTGEDLFCNLEQALISMQLPWEKLILNLNSAPHYTPAAPAAGTHGAHPRVGIHFQASAPPPQFQDEPLYDYNGYDYDCTYKTPLKCLTGTPPRKDYTREYSRYHEPSTELISKQPQYQESRLRRLDEQQQYVNIDARKMCTPEEATSPLGTFKRQRCLRYKQRRPRPILRSKSDISDRYRRSDGRSPSSAPCEVSPSSPSEGSGRLHRFFDYLGLESRQYEALFNDADEDSPVFFSSASTVDSNQVAAAADYTVQGPGVQKQIYRNTEPPSVVERNARIIKWLCQCRKAQIPPPQ-