Monarch geneset OGS2.0

DPOGS210010
TranscriptDPOGS210010-TA1812 bp
ProteinDPOGS210010-PA603 aa
Genomic positionDPSCF300327 + 10877-15188
RNAseq coverage106x (Rank: top 60%)
Annotation
HeliconiusHMEL0101866e-11552.21% 
BombyxBGIBMGA008403-TA1e-15153.33% 
DrosophilaCG33964-PA9e-5231.54% 
EBI UniRef50UniRef50_D6WAX86e-6133.20%Putative uncharacterized protein n=2 Tax=cellular organisms RepID=D6WAX8_TRICA
NCBI RefSeqXP_001866633.11e-6637.04%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|1700623563e-6537.04%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastxgi|1700623565e-6837.04%conserved hypothetical protein [Culex quinquefasciatus]
Group
KEGG pathway 
Orthology groupMCL10503 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210010-TA
ATGGCTGAAGAAATTGACTTTAACGATGCCCTCGACAGTATAGTGTTAAGTGAAAATTCTTTATGCAAGGAAAGTTACGACGAAGGTTACAAAAGTGGTTATGAGGCTGGAAATCCTGAAGGTTATCATTTAGGCTACCACAGAGGTGCTGAATTGGGCAGAGAATTAGGATACTACTTTGGCGTTGTTACGAACCACATAGAAAACAAAGAATCTTTATTTATCTCCGAGAAAGTTTTGAAACAGTTGGAAAAAGTTCGAGATCTGATAAACTTATTTCCTCAGACAAATTCAGAAGATCATGATCTTCTAAATTTGGCGGAGAACATACGAGCGCAGTATAAGAGAGCTTGTGCCTTATTAAGGATTCCATCCAAAAAATTTAGTATGGAATCGAGTATAGCTAAAGTGAAAGGTGATTTAGATGCCGCCACAACATTCCTCGATCAATACTTACATCTTGCTAATTGTCATATGGTTGAATTCTTCACTGAGAGCCATTGGGACAGATTAGTGCCAAAAAAACTCAGGAATTATCTAGACGTATGTGAATTATCTCAAGCCATTGATAACTTTTGGAAGTATGCTGATGGAAATTGTTGTGATGATAATGAATTGAACAAATGGATCAAGGAATCAAGGAAATATTACACCGCATTAAACACATACTGTATATCAACGGAAAAACTACAAGAAATAATTAAGTCCTGGGGCGGAGAAATAAAACCTCAAGTTCAAATAACTGAGTTTATGACAAGTAAAAAGAGTTATGAAGTAAAAACTATGTCCCACTTAATAGCATCACTGTGTACCGTCTGTGATGTGACCCACTGTGTGGAGGCTGGTGGGGGTAAAGGTAACTTACCTGTGGCGCTCTCACTAAGTTATCACTTACCCAGCCTTACCATTGATTGTAACCCAATCGCTGTAAACAATGGTGAGAAACGGGTTAAGATTATACAGAAACAATGGCACGCCATATCAAAGAAGGTGAAAGATGGCTCAAAACATCTTGCATCAGACAGCATAGAAACCAATCTTCACAGGTTCGCCGCAGCATACATCACTACAGACACGGACTTTACGCGACTCGTCAGAGAGAAGTTCCCGGAATATTCTGGAGATGTCAAATTACTTTTGACAGGTCTTCATACATGTGGTAACCTCGGTCCGGATTCTCTCGTTATCTTCACCACTAACCCGTCTATATCTTCGCTCTTCAACGTGCCTTGCTGTTATCACCTCCTCACTGAGGACGTGGATGTGGAACTGTTCGATGTGTTCCAGAGGTACGGCGAGGGCTGCGGCGGAAGCAAAGGATTTCCAATGTCTGAAGGTTTAAAAGGTTATAATTTAGGAAGAAATGCTCGTATGTTAGCTGCGCAATCAATACACAGAGTTGTTTACAATAAACAGATTCCGGACAAGGGGCTCTTGTACAGGGCTTTGATACAGATTATTATAAAACAACGTTTACCGGATTTACATGTGTCAGAGGGTAAGCTGAAAGGTATATCTTCGAAATGTCAAAACTTCGACGACTATGCCAAGATGGCGGACGCGATACTCAAAATCGGCGTTGACCAAAACTCTGAGATTTACCTTGAAGTACAAAAAGACATAGATGTTAAGTGGAAGAAAATAGTTATGTTTTATTTATTGAGGCTGTGCCTGGCGCAGGTCATAGAGCATGTGATTCTGTTGGACAGATTGTTGTTTTTATTGGAAAATGGTTTCCAAAAATGTTTTCTCGTCAAATTGTTCGATCCCGTCACGTCGCCGAGGTGTCACGGGCTGGTAGCTGTGAGGTAG

Protein sequence:

>DPOGS210010-PA
MAEEIDFNDALDSIVLSENSLCKESYDEGYKSGYEAGNPEGYHLGYHRGAELGRELGYYFGVVTNHIENKESLFISEKVLKQLEKVRDLINLFPQTNSEDHDLLNLAENIRAQYKRACALLRIPSKKFSMESSIAKVKGDLDAATTFLDQYLHLANCHMVEFFTESHWDRLVPKKLRNYLDVCELSQAIDNFWKYADGNCCDDNELNKWIKESRKYYTALNTYCISTEKLQEIIKSWGGEIKPQVQITEFMTSKKSYEVKTMSHLIASLCTVCDVTHCVEAGGGKGNLPVALSLSYHLPSLTIDCNPIAVNNGEKRVKIIQKQWHAISKKVKDGSKHLASDSIETNLHRFAAAYITTDTDFTRLVREKFPEYSGDVKLLLTGLHTCGNLGPDSLVIFTTNPSISSLFNVPCCYHLLTEDVDVELFDVFQRYGEGCGGSKGFPMSEGLKGYNLGRNARMLAAQSIHRVVYNKQIPDKGLLYRALIQIIIKQRLPDLHVSEGKLKGISSKCQNFDDYAKMADAILKIGVDQNSEIYLEVQKDIDVKWKKIVMFYLLRLCLAQVIEHVILLDRLLFLLENGFQKCFLVKLFDPVTSPRCHGLVAVR-