Monarch geneset OGS2.0

DPOGS214316
TranscriptDPOGS214316-TA1434 bp
ProteinDPOGS214316-PA477 aa
Genomic positionDPSCF300020 - 817186-821144
RNAseq coverage116x (Rank: top 58%)
Annotation
HeliconiusHMEL0050114e-15861.20% 
BombyxBGIBMGA004131-TA6e-14356.88% 
Drosophilamute-PC5e-1829.41% 
EBI UniRef50UniRef50_D6WQE82e-2425.27%Putative uncharacterized protein n=1 Tax=Tribolium castaneum RepID=D6WQE8_TRICA
NCBI RefSeqXP_001849259.15e-2441.13%conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastpgi|2700105126e-2425.27%hypothetical protein TcasGA2_TC009918 [Tribolium castaneum]
NCBI nr blastxgi|1700431447e-2241.13%conserved hypothetical protein [Culex quinquefasciatus]
Group
Gene OntologyGO:00056344.1e-05nucleus
GO:00063554.1e-05regulation of transcription, DNA-dependent
KEGG pathway 
Orthology groupMCL24990 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214316-TA
ATGTTGCCTTCTATGCCAATTTTATTTACAAGAGACAGCCAAGATGCATTCGGAAACAAAATAGAACCGAGCACTAGTTTTTTTAACCAAAATAATAAAGAGAATCCTATCTTTCAATCAGAGGTTGTTTTATCTAAAAAAGATAAAAAAGAAGACAATACTAATAACAATGGAACACAAATAAAGCCTAATGATATTCAAAAGGAGGATACAATAAAGAAAAATGATGACAATCTCTTAGAGACTGCCCTGATAAGATCATATTACACAAAAGTACAATCGAGGTTCGCCTCACACCCAGCACTGCCTTCTAACAAGTTTGTACAATTTAGAGATATTTTGAAGTCTTTTGATCCCACGTTGGAAACTCCTGTTGATTTATACAGAAAAATAGAGGAGCATTTTGGTAATGAACATTCAGATATTGTTGAGGAATTCCTTCTATTTCTGAAACCCGGTCAAACGGCTGAAGTTGGGAGATTTATGGATTATCTGATGCTGACAAAGATGACAGGATTCATTCAGTTGTTACAAACAACATTTAGCCGTAAACCGACCGTACTGAGGAAGATAATGCGCACAATGACATCTGGTATCAACAACGGCACTAGTGAGGAGATGAAGGCACGAGTCCTGCCACATTTAAGATCGAGTCCACATCTCACGCAAATGTTTAAGTCTCTTTTCCCTGATGAACGACCCCCTGATAGTTTATATGAAATGGGTACGGATATTTTGGACGAAGGATTCCTCACTTGCGATAGAGGCTATGAAGTGTTTGAATTCGAAGACAAAGCTGGCAAAAGAAAGGAGACTAAAGGACTGGATTCTGAGTACTTACACGGAAGAGTATTTATACAGCACGGCAGGATGTTAAGAAATGCATCGGTCATCTATCCCTATAGTAAGGAGCCTTATAGAGTACACGCGAGGCGCTTGGCCCCTAATCATTGCATCTCACCACCCGAATCTGATTCCGAACGATCATCGCCGAAAAGGAGTAACAAGTCTGTGTGTAAATTGCCTCAGAAGAGATCAAGAAAGCAGCTCAAGTCACCGACAAAAACCGTTAAAGATGTCAATGATAATACAAAGAAAGAATGTGTCCCTAATTCATTTAGCAATAAAATTAAAGGAAAGTCCCATTGCAAGAGCAAAATTTGCAAAAAGGACGACATAGCTTTAAAAAAGAACGCCAGCACAAAGCGAGACGACGCCACCAAGGAGGATGCTCCGAAAAACGAAATAAGAAGCTGGACTCGAGAAGAGGATAAAACTATGCTAGAGGTTTTGAAAGGCGAGCCGGGATCGGAACTAGTGTTCGGCAGAATACGTGAACTGCTGCCGCACAGAACCACGGCCGAAATAAAAGATAGATTCTGTCATGTCATGCATCTTTTACAGCAAATGGCGGGCGGTGAAGTGAATCAGTAG

Protein sequence:

>DPOGS214316-PA
MLPSMPILFTRDSQDAFGNKIEPSTSFFNQNNKENPIFQSEVVLSKKDKKEDNTNNNGTQIKPNDIQKEDTIKKNDDNLLETALIRSYYTKVQSRFASHPALPSNKFVQFRDILKSFDPTLETPVDLYRKIEEHFGNEHSDIVEEFLLFLKPGQTAEVGRFMDYLMLTKMTGFIQLLQTTFSRKPTVLRKIMRTMTSGINNGTSEEMKARVLPHLRSSPHLTQMFKSLFPDERPPDSLYEMGTDILDEGFLTCDRGYEVFEFEDKAGKRKETKGLDSEYLHGRVFIQHGRMLRNASVIYPYSKEPYRVHARRLAPNHCISPPESDSERSSPKRSNKSVCKLPQKRSRKQLKSPTKTVKDVNDNTKKECVPNSFSNKIKGKSHCKSKICKKDDIALKKNASTKRDDATKEDAPKNEIRSWTREEDKTMLEVLKGEPGSELVFGRIRELLPHRTTAEIKDRFCHVMHLLQQMAGGEVNQ-