Monarch geneset OGS2.0

DPOGS203051
TranscriptDPOGS203051-TA1362 bp
ProteinDPOGS203051-PA453 aa
Genomic positionDPSCF300206 + 47243-48839
RNAseq coverage218x (Rank: top 45%)
Annotation
HeliconiusHMEL0161475e-15463.04% 
BombyxBGIBMGA006541-TA5e-11753.79% 
Drosophila% 
EBI UniRef50UniRef50_E1ZZM47e-3131.15%Uncharacterized protein C17orf85 n=6 Tax=Formicidae RepID=E1ZZM4_CAMFO
NCBI RefSeqXP_001605907.12e-2628.65%PREDICTED: hypothetical protein [Nasonia vitripennis]
NCBI nr blastpgi|3071887172e-3031.15%Uncharacterized protein C17orf85 [Camponotus floridanus]
NCBI nr blastxgi|3320304942e-3526.98%Uncharacterized protein C17orf85 [Acromyrmex echinatior]
Group
Gene OntologyGO:00001661.6e-08nucleotide binding
KEGG pathway 
InterPro domain[93-144] IPR0194161.4e-12Protein of unknown function DUF2414
[103-161] IPR0126771.6e-08Nucleotide-binding, alpha-beta plait
Orthology groupMCL17807 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS203051-TA
ATGGAACACGACAAAGAGGAAGGAGAAATGAGCGATGACCATGAAATGCTTATTGACGATCAACCCCTTGATAGTCCAACTAAAAGCACTGTGGTTTTTATCGCAGATAAAAATGGACTTTTGGTTCAACAATTAGAAAAAGTTGATGCTTGTAGACTTGAGGAAAGGGCTAAAAGATTTGGTTTAAACTTGACTGGAAATAGGATTGTAACACAAAAACAGATTGACGAACTGTATAATAATTTTGGTATTGAAGGTGGAAATGAAAGACATTTCAGATTTGATACACTTCATTTAAATGGTGTCAATGGTTTAATAACAAAGGATATATTTGAGTATTTGGTAGATTACAAGCCAGTATCTCTAGAATGGGTTGATGATAATTCATGTAATGTTGTTTGTCAGGACCATATATCTGCTGCATTGGCATTATTGGTACATTCCAGGGAAATTAAAAGTGAACACATTAAAGATATGCTGCAAAAAAAATCTTCACATTACTGGAGAGAAGGTGTTCCACACCCAAATAAAGACTTGATTTTAATGAGATTTGCTACAAATAGTGATAAGAAGTCAACAAAAGTTGAACCTGAACAAAAACATAGACTAGACTCTGACAAGAATATAAATAATGAGGGTAAGAATCCCTGGGGTGACTTGTGCAGGTCCTGGGGCATCTATGATCACCAAGAAGTGTTTCAAAGAAATTTATCAAAAACTGACTATGAGGAAGAACTTGAAGAACCATTTGAAAAAGTCCAAGTTAGGAACAAGAAGCTAGCTTCACGGCTTGGTAAAAGAAACCATAGTATAGAGGTTGCCACCAGTGATTCCGATTCTGAGTGGAAGAAGAAGTCTAAGACACCCAGAATGAGAATGCGAGCTGACGATGAGGAGTCAAAGCAAAAGAATCACAATCAAACGAAACAAAATGATTCAGATGAAGATGATTATGCACCCTTGTCAATAGAAATTCTGAACTCCAGTAGTAAATTCACTTCTAAACATTCGAAGAGAATATCTGAGAAATTTAGGAATTCAGACCAACATTTCAAGAGCATGCCTCGGAATGTACACTCAAGATTAGGTATCAAAGTAGTGGATAATGAAAGAAGTTATAGTGATGAATCTTCATCAAATGAATCAGACTATAATGTAACAAGCCGGGTGCAAAAAGTAACAACTGGTTCTAAAAATACATCAAATGTTTGGTCACGATTGGAGATTAAACCCAAGAATTCAGGACAGAAAGATTTGAGACAAATATTAACAACACGTAAACCTAAACATAAAGACGATTTAAGAGACAGACTTGGGAAGTCAAAACAATGTAATATTCGCATAGAAATAGACAATAGTTAA

Protein sequence:

>DPOGS203051-PA
MEHDKEEGEMSDDHEMLIDDQPLDSPTKSTVVFIADKNGLLVQQLEKVDACRLEERAKRFGLNLTGNRIVTQKQIDELYNNFGIEGGNERHFRFDTLHLNGVNGLITKDIFEYLVDYKPVSLEWVDDNSCNVVCQDHISAALALLVHSREIKSEHIKDMLQKKSSHYWREGVPHPNKDLILMRFATNSDKKSTKVEPEQKHRLDSDKNINNEGKNPWGDLCRSWGIYDHQEVFQRNLSKTDYEEELEEPFEKVQVRNKKLASRLGKRNHSIEVATSDSDSEWKKKSKTPRMRMRADDEESKQKNHNQTKQNDSDEDDYAPLSIEILNSSSKFTSKHSKRISEKFRNSDQHFKSMPRNVHSRLGIKVVDNERSYSDESSSNESDYNVTSRVQKVTTGSKNTSNVWSRLEIKPKNSGQKDLRQILTTRKPKHKDDLRDRLGKSKQCNIRIEIDNS-