Monarch geneset OGS2.0

DPOGS210289
TranscriptDPOGS210289-TA1362 bp
ProteinDPOGS210289-PA453 aa
Genomic positionDPSCF300216 + 380720-387465
RNAseq coverage40x (Rank: top 73%)
Annotation
HeliconiusHMEL0169744e-1377.27% 
BombyxBGIBMGA014532-TA1e-5185.00% 
DrosophilaCG5882-PA7e-3830.66% 
EBI UniRef50UniRef50_UPI0000D5713F3e-9045.26%UPI0000D5713F related cluster n=1 Tax=unknown RepID=UPI0000D5713F
NCBI RefSeqXP_974953.16e-9145.26%PREDICTED: similar to LOC779580 protein [Tribolium castaneum]
NCBI nr blastpgi|910910021e-8945.26%PREDICTED: similar to LOC779580 protein [Tribolium castaneum]
NCBI nr blastxgi|910910029e-9944.86%PREDICTED: similar to LOC779580 protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL12585 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210289-TA
ATGGAGAACCTTCAAATAATAAACCGTCTGGAACAGCGTCGCAAACAGCTGGAGACGGAGCTGGATGTGAGCGCGGTGCAGCTCAACAAACAGAGGCTGGTCGTCAGACAGCTGGAGAAAGATAAAGACAGAATGCTGGAAGAAACTATAGCTTTGAACGAAAAAATAGACGAAGTCTCTGAGGAGGTGCGGTTGCGCACCGCGGACATCCTGGACCTTAAGAAGGCTCTGCGCGAGGAGTTGATCAAGAGTCGCAAGCTATCAGTGGCCTTGGACACCACACGCGCCGAAAGAAATATGCTGCATAAAAACTATACCGAAGCGCTGGACGAGATACAGGACTTGAAACAGAAGTTGAAGATGCTGGCCTATCAGATAGAACAGCTTAAGGAGGATATAAGCGGGAAGGAGACCGGCCTCAAGTCCTGTGAGGGTGCTCTCCTGAAGTGCAACAAGAAGACGGAACAGCTCAAGTCCGAAGTGCAGGCTGGACTGACCAAGCTGTCGGAGGCGAAGGCCGACATCACAGCTCTAAGACAGGAAGAGGCCAGGCTCAATAGGATCGTCCAGGAAGGAGATTCAGCGAGAGCGAAGCTTATGAAAGAATTGGAGGGTTTGATTAATGAAAGAGACGTAGTGGGAGCCCAGCTCGTCAGGAGGAATGACGAGATATCACTCCTGTACGAGAAGATAAGAATATTGGAAGTTACGCTGCATAGAGGTGAAAGACAGTATGAACAGCGAGTTGAAGATATAAGACTGCTTCGTCTAGAAATCATAAGATTGAGAAAAGAAAAGAATTTGCTGTCAAAGGGCATCGAAAACATGACGGATCTGCGGTTGGAGGTCTTCAATTTGGAACGTGAGCTGGGTCGTGAACGGCTCAGGGTGCGGGCCTTGGAGGAGGCCTTGGAGACGCCGCTGAACGTTCATCGATGGAGAAAGCTGCAGGGAACAGACCCTGAGAGTGTACATCTAACACAAAAATTGAGACTCACACAGAAAAAAGTCCTTGCCCAAAGCGAGATGCTCGTACTCAGAGACCGTGAGCTGAAGGAGACTAGGAATTTGTACAGCGCCGTCAAGGATATGCTGGCCTTACAACCCAGTCCAGAAATACAGCAAACGTTGAACCGAACGCAGCGTGCCCTAGTCCAGAGAACCAGCAAAATGAAATGTCTCATCGCAGAGCTGAGCATGCGGGAGAGACAAGTGACGGACTTCAGACTGGAGCTGACAAAAGTCAGCGACGAGCTTCACTCCTACAAACAGAAGTACTTCGAAATGAAGCGCGCCCTGGACGCTGATGAGGCGAGGAGGCTCAGGGTCCCGTCCCCCGGCAGCGCGGATAACAAAATATAA

Protein sequence:

>DPOGS210289-PA
MENLQIINRLEQRRKQLETELDVSAVQLNKQRLVVRQLEKDKDRMLEETIALNEKIDEVSEEVRLRTADILDLKKALREELIKSRKLSVALDTTRAERNMLHKNYTEALDEIQDLKQKLKMLAYQIEQLKEDISGKETGLKSCEGALLKCNKKTEQLKSEVQAGLTKLSEAKADITALRQEEARLNRIVQEGDSARAKLMKELEGLINERDVVGAQLVRRNDEISLLYEKIRILEVTLHRGERQYEQRVEDIRLLRLEIIRLRKEKNLLSKGIENMTDLRLEVFNLERELGRERLRVRALEEALETPLNVHRWRKLQGTDPESVHLTQKLRLTQKKVLAQSEMLVLRDRELKETRNLYSAVKDMLALQPSPEIQQTLNRTQRALVQRTSKMKCLIAELSMRERQVTDFRLELTKVSDELHSYKQKYFEMKRALDADEARRLRVPSPGSADNKI-