Monarch geneset OGS2.0

DPOGS212106
TranscriptDPOGS212106-TA1818 bp
ProteinDPOGS212106-PA605 aa
Genomic positionDPSCF300038 - 574700-578277
RNAseq coverage671x (Rank: top 19%)
Annotation
HeliconiusHMEL0125330.074.35% 
BombyxBGIBMGA006607-TA0.084.53% 
DrosophilaCG2182-PA2e-9948.43% 
EBI UniRef50UniRef50_E0VCU66e-12345.61%Putative uncharacterized protein n=1 Tax=Pediculus humanus corporis RepID=E0VCU6_PEDHC
NCBI RefSeqXP_001655865.15e-12944.91%hypothetical protein AaeL_AAEL012102 [Aedes aegypti]
NCBI nr blastpgi|1571314771e-12744.91%hypothetical protein AaeL_AAEL012102 [Aedes aegypti]
NCBI nr blastxgi|910913904e-14548.20%PREDICTED: similar to ZNF403 [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL12760 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212106-TA
ATGGCTAAGTTAGTCGATATTTACAATTGCGATAAAGATCCCCTTATAAAGCGGCGACAATTGCCTTTAGTTATTTACGAAAATTTAACTATGATTATGGATTTAAATACTATGGGCCTAATTTGTGACAATCCGCAAGTTAAAGGTCGCGAGTATGAAGAATTTATGCGCAAGTATAGAATTCTTACGACAGACGAATTAAAAGCAGCGCTTAGGGTCGACGCCTCTGATATATTTAACGTTTTAAACCAAAGCATTCCTTGTGTTGGCTGTCGGAGAAGTGTTGAAAGACTTTTTTATCAGCTTTGTAAATCAGGACATCCAACCTTAGACCCATTAATTATTACACCTGATGAAATAATGTCCATTAAAGAGGATAAAATCACACATCCACAGTCATTGGCTACTTTATTAAATGGTCACAGTACGGGATTAGAATGCTTATTACTCAGCCAACCTAGGTGGAAGAAATCACAAAGGTGTACGCTTCATTCTTTAGAAGCGGGTAGCGCACGAGGTTGGGGTGCAACAGTTGGTTGTGGTTGGGGCGGCCGGGCGGCATGGAGAGCTGCTTGGGACGCAATGAGAGCTTCCGCCAGGGAACATGTCACCCTCGTCCATTTTAACACTCTACACGATACTCTTCACAATTATTTGAGGAAGCATAGATTTTGTGCAGATTGTAAGACTAAGGTTTTGCGAGCTTATCAGTTACTTGTTGAAGAAAAAGAACCTCAAAAGGAAAAGGGATATGTTGGTGCCTTATACGGGGGTATAAAACGTTGTTTGTTAGACAAACACCTTCATTTGGAAGCAAAGACAGATTATATAGCACACCTTATAGCCAGGGCGGAACCTGAACTGCTTGGTAACCACCGTGAACGGCATGCCAAGACATTAGAAATAGCACAGGAAGAGGTGTTAATTTGTCTGGGAATATGTATATATGAGCGACTCCAACGCATATCTTTAAGGTTGCGTGAAGAGGAAGGCACATGCCAGGCTCTCGTTGCAGTTGGTATTGAGGCACTCTATCGAAAATTTGAGACGGCTGTTGAGCTTAAAAGCGGAGTGTCCAAACTTCAATTATTATATGATGAAATAACACAAGAAGAACTAACAAGGCAACAGAGAAAGGAACAGAAAAAACTTAAACGGCGTAAGAAGAAGGAACGGCAAGCAGTTGAGAGCAAGTCCAAGGAAGGAGATTCTGAGGAACCGGAAGAAAAGTGTGAATGTGAAGAATGCTTACTAGAAGAAAGAGACACTCCAATGGATCTCATGTCGCCGGATTGCAACCAGTGTTGTGACGTGATGCAGTCGAATGTGGATAGCTGCCAGTCATGTCAGGACGACTATTGTTTACCCAAGAAATCGAACGCAAAGTTTATTAAAAATAATCTTTTGGCAAACTTTAGTCATGACTGTGGATATTCGTCGGGTAATAACGTTGGTTGTTGTGAAACAATGTCTGGTTCTTCGTCACTTATGAGTTCACCGGAAGGCTCTGAAGTGGCTTGTTCTGAAGGATTCTGCAATCACGAGAGAGGGGACTGTCTAGATTTACCAAAACATGAAAAAATATCCTGTTCCGGTTTCACACTTTCCTTGCAGGAAATGCTGGATACGTGTTCCTCAGACGAAGAACAAGATACTTGTTACATACCCATAGAAGAAGTCCTGGAATTCAAATCACGAAGGAACATCACGGAGAAACGCCAAGAACTGAGACAGAATTTACGACAAAAGTTTGCACAGCTTTGTGTGAACACACCCCACTCACATCCCATGGCCTTACAGAGAAAAAACCAAGTCTAG

Protein sequence:

>DPOGS212106-PA
MAKLVDIYNCDKDPLIKRRQLPLVIYENLTMIMDLNTMGLICDNPQVKGREYEEFMRKYRILTTDELKAALRVDASDIFNVLNQSIPCVGCRRSVERLFYQLCKSGHPTLDPLIITPDEIMSIKEDKITHPQSLATLLNGHSTGLECLLLSQPRWKKSQRCTLHSLEAGSARGWGATVGCGWGGRAAWRAAWDAMRASAREHVTLVHFNTLHDTLHNYLRKHRFCADCKTKVLRAYQLLVEEKEPQKEKGYVGALYGGIKRCLLDKHLHLEAKTDYIAHLIARAEPELLGNHRERHAKTLEIAQEEVLICLGICIYERLQRISLRLREEEGTCQALVAVGIEALYRKFETAVELKSGVSKLQLLYDEITQEELTRQQRKEQKKLKRRKKKERQAVESKSKEGDSEEPEEKCECEECLLEERDTPMDLMSPDCNQCCDVMQSNVDSCQSCQDDYCLPKKSNAKFIKNNLLANFSHDCGYSSGNNVGCCETMSGSSSLMSSPEGSEVACSEGFCNHERGDCLDLPKHEKISCSGFTLSLQEMLDTCSSDEEQDTCYIPIEEVLEFKSRRNITEKRQELRQNLRQKFAQLCVNTPHSHPMALQRKNQV-