Monarch geneset OGS2.0

DPOGS209695
TranscriptDPOGS209695-TA1284 bp
ProteinDPOGS209695-PA427 aa
Genomic positionDPSCF300309 - 99439-100913
RNAseq coverage276x (Rank: top 39%)
Annotation
HeliconiusHMEL0098377e-10353.46% 
BombyxBGIBMGA002150-TA4e-9450.56% 
DrosophilaRlb1-PA6e-0624.08% 
EBI UniRef50UniRef50_UPI00022470FE2e-1232.54%UPI00022470FE related cluster n=1 Tax=unknown RepID=UPI00022470FE
NCBI RefSeqXP_001605276.13e-1332.54%PREDICTED: similar to Flj25286-prov protein [Nasonia vitripennis]
NCBI nr blastpgi|3504177885e-1326.93%PREDICTED: hypothetical protein LOC100745427 [Bombus impatiens]
NCBI nr blastxgi|3454881853e-3026.43%PREDICTED: hypothetical protein LOC100121666 [Nasonia vitripennis]
Group
KEGG pathway 
InterPro domain[344-425] IPR0151581.6e-11Bud-site selection protein, BUD22
Orthology groupMCL26707 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS209695-TA
ATGGAAGTTGGTGCGGTCAAACAAGCATTTAATAATGAGATAATATTGCTAAAAAAGAATTTAATTCAAGCAAAAACTCAGACAATACACAAGTTAACGAGAAAAGCAAAACAATTGGTAGAGAAGAAGGCTCCAGATGAAGCAAAGGAAAAGTTAAAGAGAAAAGCCGAGTCCGCTGTGAAAGAAGTTTTGATTCTTAAGAAAATCAAACCGAGAGATATAGCACAGTTCATAGTTACTCACAAAGGTGCCCTCAATGAATACATGAACAAGCCAGTGGATAAGGACAAAGCATGTGCAAGACTATTGCTGCATAAGGCCATGCAAGACAAATACAAACTAATACGGGAGAAATTCTCAAATGTGTCCATTAAAGATTTATTCATGTCGCGCCAGGAGAGGCTCAAACTTAAAAAGGAAGCTCGGGAAAAGCAGAATAGTAAGAAAGATAAAAATAAGGCAGTAAATATTGAAGGTGAATGGAATGTGGAGGCTGCTAATGACAATATAGAATCCATTGTACAATCCAATGATGATGTAGCAGATAATATGTCTGAAGGTAGTGATCCCGGTGGTTATTCAGACGAAGTCAAAGCTGAATGTGATTTATCTGAGGAAGACATCTCTAAGGACAGTATCAACGGCAATAGTGATGACTCCGAAGTGAACGATGAAACCACCAAAGATGGTAATTTATCACAAGTCAAAAGCTCAATACAAAAACAGTCAGATAAATCAGATGTTATTACAAATATTAATAAGAAAGATGTGCAAAAAATAAAAAAACCAAAAGAGAATAAGAACAAGAATTTGAATAAAAAATTGCAGGACAGGAAATTTCATAAAGTATCTGACGATGATCATTCACAAGCTACAAGAGTTGTTGATCCATTTTTCATAACATCGTCAGGAGAAAATTATATGTCTCTTGTTGAACCGCGGCAGCCGGACGAAATAAAGGAGGTTCATAAGCAAGGGAACAGGAAATTTAGACGGGCTGTAATGTTTGGACATGTGCCTAAACCTAAAGTTAGAAAAAGCTACAATGACAGATATAACGATATTGACAGTAGTGAAAATAATTTTGTTGGTGGCAAAACCGATAATAGACAAAAAACTAAAGACAGTTCTAGACAAAATGGTAACAAATTTGACAGAAGAAAAAAATATGATAATAACGAAAGCGATGTACCAGAAAAATTGCATCCGTCATGGGAAGCCAAAAAGAAACAATCGTCTATTTTACCTTTCCAGGGTAAGAAAATTGTTTTCGATGAAAGCTGA

Protein sequence:

>DPOGS209695-PA
MEVGAVKQAFNNEIILLKKNLIQAKTQTIHKLTRKAKQLVEKKAPDEAKEKLKRKAESAVKEVLILKKIKPRDIAQFIVTHKGALNEYMNKPVDKDKACARLLLHKAMQDKYKLIREKFSNVSIKDLFMSRQERLKLKKEAREKQNSKKDKNKAVNIEGEWNVEAANDNIESIVQSNDDVADNMSEGSDPGGYSDEVKAECDLSEEDISKDSINGNSDDSEVNDETTKDGNLSQVKSSIQKQSDKSDVITNINKKDVQKIKKPKENKNKNLNKKLQDRKFHKVSDDDHSQATRVVDPFFITSSGENYMSLVEPRQPDEIKEVHKQGNRKFRRAVMFGHVPKPKVRKSYNDRYNDIDSSENNFVGGKTDNRQKTKDSSRQNGNKFDRRKKYDNNESDVPEKLHPSWEAKKKQSSILPFQGKKIVFDES-