Monarch geneset OGS2.0

DPOGS210188
TranscriptDPOGS210188-TA1632 bp
ProteinDPOGS210188-PA543 aa
Genomic positionDPSCF300283 - 117559-125806
RNAseq coverage3083x (Rank: top 4%)
Annotation
HeliconiusHMEL0095201e-3843.45% 
BombyxBGIBMGA003261-TA9e-3242.59% 
DrosophilaCG8042-PA6e-5130.98% 
EBI UniRef50UniRef50_UPI00022C8F705e-7432.72%UPI00022C8F70 related cluster n=4 Tax=unknown RepID=UPI00022C8F70
NCBI RefSeqXP_391980.17e-6732.71%PREDICTED: similar to UBX domain containing 2 isoform 1 [Apis mellifera]
NCBI nr blastpgi|3407233114e-7533.33%PREDICTED: UBX domain-containing protein 4-like isoform 2 [Bombus terrestris]
NCBI nr blastxgi|3071720243e-7633.04%UBX domain-containing protein 2 [Camponotus floridanus]
Group
Gene OntologyGO:00055152.1e-16protein binding
KEGG pathway 
InterPro domain[366-437] IPR0010122.1e-16UBX
Orthology groupMCL16059 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210188-TA
ATGCACTGGTACGGAGGCAGTATGGCGGAGGCGGTGACGCTGTCCAAACAGAGGAACGCTATTTTCGTTGTATTTGTAGAAGATGATAACAATCTATCCAAGGAGCTCCTGTCTACGATAGACGACAGTGCGGTAGTCAAACGTCTTACGGATCAGAATAATTTCTTGGCTGTTAAGTTGAAGAGCGGATCGGAGAATTACACATACTTCGCACAGATCTACCAGTTCGTGCCAGTGCCCTCGCTGTTCTTCATCGGCCGTAACGGCACTCCGCTGGAGGTGGTGTGTGCTGGTGTGGAACCACACAACCTGGCCACGAGGATCGACAGGATCCTCAAGGAACACCGCAAAGAACAGCCATGCGACCAGGTCGAACCGTCGACTTCATCTAAAAACATTAAAGATGAAACTCTCAGCTTCATACAATCAGAGGCCACAGCCAGCACCGCACCGCCAGCAGACAAAACCCCACAGACCAGTGACAGTGCACCAGAGACCCCCAACACAGCCTCAGCAACCCCCAACACGACACTCGAAGAACCCAAAACAACCCCGGAACCACCCAAAAGTACAGCTGCAGAGGAATCAGAATCCGGTCCAGCTGCCAAGATCCAGAAGATGGATAAACCCGAGTCCCCGGAATACGACGTGGTGTGTGTGGGAGACACCTGTGTTAGGAAACCTCGCGCCGGGGACCCTGAAGCCTCCAGCTCTAATAAGCCGGAACCGAACCCTGTCCCGGCTCAGGAGAGCTCCAACAGCGCTGACGACAAGTTGGAGAAGGCCAAGGAACTGATAGAAATAAGACGGAGAGAGAAGGCGGCCAAGGAAAAAGAGTTGGAGAAGCAAAAGGAGTTGGAGCGTCGGTCGGTGGGTCAGGGAGTGTCGGAGCTGAAGAGATGGCAGGCGGAACAGGAGATGAAGCAGATACAGGAGGAGAGGAAGAGAGAGAAAATGGAGAATGACTTAGCGAGACAACGGATACTGGAACAGATAGCACAGGACCGAGCGGAGAGGAGGGCCAGGGAGATAGTCACCACACAGAATGTGGTGCAGACCCCACCGCCACCACCAGCCACATCAGGCGACACCTTCAGTGCTCGAGTGCAGTTCAAGTTACCGGATGGTACCACCCACGTGACCACCTTCCCCGCCAGAGCGACCGTGCGTGACGTCACCACGTACGTCACCCAACAACTACAGCCGGAGGGTCTGTTCTCACTGTGGACGGCGTTCCCTCGTCGTGAGTTATCGTCTCCGGCCAGCACACTCCTGGAGCTACAGCTGGCGCCCTCTGCAGCTCTGCTGGTGTTGCCCCAGAGGTCGGGACTCCAACCGACTATAAGACCTGGACCGTTCGCGTTCCTGACTACTCTCTTCACAACGCTCTTCCTAAACCCGACATACAATATATTCTACTGGATACGAGACCGCTTCTTCCCAGTGTTATCTAGACCGAGCGCCCCAGACCAGCCTAGACCGGAGCCGGCCAGACCGGAGCAGTCTAGACCGGAACAGTCTAGACCGAGGAGCGGTCTGCGGCGACGAGGGAACGTTCACAGGCTGGCCAGTGACGGGAGCAATGATGATGATAATAACACCTGGAATGGAAACTCAACGCAGCAGATGTAG

Protein sequence:

>DPOGS210188-PA
MHWYGGSMAEAVTLSKQRNAIFVVFVEDDNNLSKELLSTIDDSAVVKRLTDQNNFLAVKLKSGSENYTYFAQIYQFVPVPSLFFIGRNGTPLEVVCAGVEPHNLATRIDRILKEHRKEQPCDQVEPSTSSKNIKDETLSFIQSEATASTAPPADKTPQTSDSAPETPNTASATPNTTLEEPKTTPEPPKSTAAEESESGPAAKIQKMDKPESPEYDVVCVGDTCVRKPRAGDPEASSSNKPEPNPVPAQESSNSADDKLEKAKELIEIRRREKAAKEKELEKQKELERRSVGQGVSELKRWQAEQEMKQIQEERKREKMENDLARQRILEQIAQDRAERRAREIVTTQNVVQTPPPPPATSGDTFSARVQFKLPDGTTHVTTFPARATVRDVTTYVTQQLQPEGLFSLWTAFPRRELSSPASTLLELQLAPSAALLVLPQRSGLQPTIRPGPFAFLTTLFTTLFLNPTYNIFYWIRDRFFPVLSRPSAPDQPRPEPARPEQSRPEQSRPRSGLRRRGNVHRLASDGSNDDDNNTWNGNSTQQM-