Monarch geneset OGS2.0

DPOGS208470
TranscriptDPOGS208470-TA1989 bp
ProteinDPOGS208470-PA662 aa
Genomic positionDPSCF300064 - 1543706-1547067
RNAseq coverage319x (Rank: top 36%)
Annotation
HeliconiusHMEL0042950.060.98% 
BombyxBGIBMGA010652-TA2e-14750.56% 
DrosophilaCG13272-PA2e-1246.24% 
EBI UniRef50UniRef50_UPI00021A68EC1e-1659.42%UPI00021A68EC related cluster n=2 Tax=unknown RepID=UPI00021A68EC
NCBI RefSeqXP_001813653.15e-2049.19%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastpgi|1892364231e-1849.19%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
NCBI nr blastxgi|1892364232e-3830.46%PREDICTED: similar to conserved hypothetical protein [Tribolium castaneum]
Group
KEGG pathway 
Orthology groupMCL25754 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208470-TA
ATGGAGCAGGACAAATTAAATAAAAAGAAACTCTGTTTGATCGCTTTTGTGTCTACGTGTTTGGAAATTTTAGCGACCCCCGAACCCAATACATCATTGTTCAAACAAACTCCCAACGCATCTTATAATGATAAAACAGACTTAAGCAGTATTGCTTCCATTTTCTCTCCTCGTATGGACTTCGATCAATGGAAACCATTGACTGGTCGTGGAGATCCCCTACGCAATGATCCTACTTACGATTACGAACCACCGGTTCTAGAAAGAGTCCATTATTGGGCTGATGATAGCCGATCGGAACGTGAAAAATATCCGGAGAGAAAGTCCGAAGTGCTCGTTTTAGGAGTGTCTTCACGCAAGCCCAGTGTCGCGTCAAGGCCGCCCGTGCCTACGCCATCCAGAAGACTGACCCGGCCACCAGGGTTTTCTAACAAATACGAAGATTATACTTACAAATTTGGTGAACATTATCCAATGACAATACTTGTACCACCGCCACCACCTCCAGCTGGTCAAAAACCACCTCTATTTATACTCAATGATGATAAATTCGCTCTGCCAACATTCCCTCCGAAACCTACCGATTCAACTCAAATCATAAGAGATACTACAACTACACCGGAACTTATAACCTCTTATGCATTCCAAGAAGCTAATCTCATTTATCAAGAATCTACATTGAAACAACAAAATTGGTTCAATAGCGTCAACAAAACAACAAGCCTCCCCAATAATACAGTGAGCAGTGATTATGCTGGCTGGGGTCCAACTACACCCTTCGATGATGTCAATGACTCGCATAACATTATATCATACAATGATCACCATAACATAGAGTTTACAAAAGAACCGCTCGCATACTACAAGCCAATGTTTTCACAAGCTCCACCGTTGCCAGAAACAACTGAAAAAACTATAACATCGCCAGCATTTTTGCCGACAGTACTTCCACCTCTATCTACTAACAGTGATGTCAATGAAGATTGGCCATCAGAAGAAACGACGCAAGAGACATCAACTGAGACTCAACATTACTATAGGGAAACAACGACGGAAAAAGCTACAACACCATCTTTTCATCCCACGCCTGCTCCAGTGAAATCCCAATCAAGTATAATAAATATGTTAGGTTCGATGATATCTATGCCTATGGTAACAGATCCCGATAGACCAGAAGACCATTTGTATGCACATGCATCAAATACCGTTCAAATATTTAAAGAACATACAACTCAAGAGGCTGATTTAGAAAAAATGCAAACAATGCAACCACCTCCACCAATAAAGCATTCAGAAACCGTAACAGAAACACCGAGACAACCATTCCACATAAATCCGCATATATTAAACAATTTTATACATGAAAAAGCTACTGGTCACACCAACGATCCCTATCTGCATATGCGTTTTACAACACCACCTCCTACTGTGACTGAAGCCCAAACTATACCAACTTATTTAATTATACAAGGTCATTCTAAAGTAAAAACCTACGGTTCAAAACCAAAACCATTAAGTGGCGCTGGGAATAACGAAATACAAAAACCAAATGAAACCAACGAAGTGAAACACCTCCATCCTATTAAAGATAAATATTCCAAGAAGACGGAAAAAATCGATCCAAGCAGGGAAAGTAGAGCGATAAACTTAAAATCACTTGTGGATAATGGTCTCGGTTCTATTGAAATCCAGGAAGCCGATGTCGGCATCAAATACGATGTTAGTGATGGTAGTAAGGTGCCTATAGAAATATATAGAAAAGGAATAGTTGACAGTGATGAAAATGATTACTCGCATAAACATAAGCTTTCAACAAACGAAAGAACCAAGAGACATATAAATATAGAAAATATATTGTCGATTGAAGATTCCTTAGAAGATGACGTATACGATTTATTTTCAAATAAGAAAAATGGCACCGTATTTTCAAAACTAATTTCTGAAGAAATTTTAAATGACGATGATGATGAGAGCACAGATGATAGATGA

Protein sequence:

>DPOGS208470-PA
MEQDKLNKKKLCLIAFVSTCLEILATPEPNTSLFKQTPNASYNDKTDLSSIASIFSPRMDFDQWKPLTGRGDPLRNDPTYDYEPPVLERVHYWADDSRSEREKYPERKSEVLVLGVSSRKPSVASRPPVPTPSRRLTRPPGFSNKYEDYTYKFGEHYPMTILVPPPPPPAGQKPPLFILNDDKFALPTFPPKPTDSTQIIRDTTTTPELITSYAFQEANLIYQESTLKQQNWFNSVNKTTSLPNNTVSSDYAGWGPTTPFDDVNDSHNIISYNDHHNIEFTKEPLAYYKPMFSQAPPLPETTEKTITSPAFLPTVLPPLSTNSDVNEDWPSEETTQETSTETQHYYRETTTEKATTPSFHPTPAPVKSQSSIINMLGSMISMPMVTDPDRPEDHLYAHASNTVQIFKEHTTQEADLEKMQTMQPPPPIKHSETVTETPRQPFHINPHILNNFIHEKATGHTNDPYLHMRFTTPPPTVTEAQTIPTYLIIQGHSKVKTYGSKPKPLSGAGNNEIQKPNETNEVKHLHPIKDKYSKKTEKIDPSRESRAINLKSLVDNGLGSIEIQEADVGIKYDVSDGSKVPIEIYRKGIVDSDENDYSHKHKLSTNERTKRHINIENILSIEDSLEDDVYDLFSNKKNGTVFSKLISEEILNDDDDESTDDR-