Monarch geneset OGS2.0

DPOGS214556
Transcript: DPOGS214556-TA (1299 bp)
Protein: DPOGS214556-PA (432 aa)
Genomic position: DPSCF300266 - 62206-63704
RNAseq coverage: 51x (Rank: top 70%)
Annotation
Heliconius       HMEL009649         5e-14    25.22%
Bombyx           BGIBMGA003280-TA   1e-112   47.83%
Drosophila       CG2906-PD          7e-11    21.51%
EBI UniRef50     UniRef50_E2C3H1    2e-58    34.37%   UPF0431 protein C1orf66-like protein n=2 Tax=Formicidae RepID=E2C3H1_HARSA
NCBI RefSeq      XP_001850616.1     2e-38    32.12%   conserved hypothetical protein [Culex quinquefasciatus]
NCBI nr blastp   gi|307195667       8e-58    34.37%   UPF0431 protein C1orf66-like protein [Harpegnathos saltator]
NCBI nr blastx   gi|332017621       3e-56    34.73%   UPF0431 protein C1orf66-like protein [Acromyrmex echinatior]
Group
KEGG pathway:
Orthology group: MCL19022 (Insect specific)
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS214556-TA
ATGGCATTACCAGTAAAATTCGAAGAACCGTTTGATTACTTCACAAAATGTCTGCAGTGTTTTAAGGAATATCAGTATTTATTTAAATTCCCAAACACTGATCTATTGACAGAAAATGTCTTGGATTATATAGAGGTTGATGAAATAAATGAGCTTACTATTAACAAGAACTGTGACCTTAGATATACAGGCTGCACTTATTTGGACATATTTTTTGAGAAATTTGATGAATTGAAATCTGAATATACTACAATTAAGTACACAGATTTAGATTTCTTGTTAAATGTACCTCTAAGTCCTAAAAAGAAACACGAAATTACATATTTAGCGAAAGAAATAAAAGATTTTTGTGAGGAAATTGATTGCGAGACTGTTGTCGATTTCGGTTCCGGGCTGGGTTATTTAGATCAACAATTATACGAAACCACAAATCTCAATGTTCTGGGATTGGAGTGTAATGAAAACAATTACGTTGCAGCAAAACGTCGTCAAAGAAAATACCACATAAATTCACTTGCACGAGTTAAATATATAAAACATACGATAAATGAAAATTCTAGAAATAATATAGAAGAATACTTGTGTGATAAATTCCTAAACTGTGATAGTTTTTGCATAACCGGACTCCACGCATGTGCAGATTTGACAATAGATGCCATTAATATATTTTTAAATTCAAAAACAGCAAACGGTTTAGTCATTATGTCTTGCTGTTATCATAGAATGATATCAGAAAATGGAAGGTTTAAGAATTTTCCTTTGAGCGACGCTTTAAAAGAATGTTGTGATGAAAACTCCTTAGAAATATTATCTATACCATTTTTAAGATTGGCAGCACAGACGTCAAACCTTGGCACTAGAATTGAGGATTCGGTTTTTAATTTATTAGCTCGAGCTGTTTTGCAGGTGTACGCATTCAGAAATAATTTAATTCTAAAAAGAACGAAACGCAAAGCAGTTAGATTAAAATCCGTAAAAAACAATTTCGAAGATTACGTACAGGATGCGTTGGCTGGCTATCAATTGATATACGGAGACCGATTAAATAAAACAGAAAATATCAATTTTGATGTCGGAGAAATTATTTCAATTTGGCGGGAGTTATCCGATACGACATTCAAAAAGGCAGCCATATTTGTATTCCTTCAAAATTACTTGCAACCAGTGTTTGAAAATTTCATCTTATACGACAGATTAATATATTTGCAAGAACGTGGTATATTATCTTGTAAATATAAAAAAATTGTAAACAACAATATATCACCCAGATGTTTAGCTTTAATAGTGAAAAAGCATTAA

Protein sequence:

>DPOGS214556-PA
MALPVKFEEPFDYFTKCLQCFKEYQYLFKFPNTDLLTENVLDYIEVDEINELTINKNCDLRYTGCTYLDIFFEKFDELKSEYTTIKYTDLDFLLNVPLSPKKKHEITYLAKEIKDFCEEIDCETVVDFGSGLGYLDQQLYETTNLNVLGLECNENNYVAAKRRQRKYHINSLARVKYIKHTINENSRNNIEEYLCDKFLNCDSFCITGLHACADLTIDAINIFLNSKTANGLVIMSCCYHRMISENGRFKNFPLSDALKECCDENSLEILSIPFLRLAAQTSNLGTRIEDSVFNLLARAVLQVYAFRNNLILKRTKRKAVRLKSVKNNFEDYVQDALAGYQLIYGDRLNKTENINFDVGEIISIWRELSDTTFKKAAIFVFLQNYLQPVFENFILYDRLIYLQERGILSCKYKKIVNNNISPRCLALIVKKH-
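
Consistency check (sketch):

The transcript and protein entries above should agree: 432 codons plus one stop codon give 3 x 433 = 1299 bp, and the protein record ends in "-" where the transcript ends in TAA. The Python sketch below shows one way to check this by translating a coding sequence with the standard genetic code. It is a minimal illustration, not part of the geneset tooling: the function name translate and the choice of "-" as the stop symbol (mirroring the record above) are assumptions, and it does not handle ambiguous bases such as N.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Hypothetical helper: stop codons are written as `stop_symbol`
    so the output can be compared directly with the protein record.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on non-ACGT characters
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First nine codons and last four codons of DPOGS214556-TA, copied from above.
first_codons = "ATGGCATTACCAGTAAAATTCGAAGAA"
last_codons = "AAAAAGCATTAA"

print(translate(first_codons))
print(translate(last_codons))
```

Passing the full DPOGS214556-TA sequence to translate and comparing the result with the DPOGS214556-PA record would extend the same check to the whole entry.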