Monarch geneset OGS2.0

DPOGS207854
Transcript: DPOGS207854-TA (1278 bp)
Protein: DPOGS207854-PA (425 aa)
Genomic position: DPSCF300042 + 1392687-1398040
RNAseq coverage: 25x (Rank: top 77%)
Annotation
Heliconius: HMEL022540, E-value 2e-59, identity 73.61%
Bombyx: BGIBMGA009818-TA, E-value 4e-32, identity 55.81%
Drosophila: Obp59a-PB, E-value 8e-16, identity 33.59%
EBI UniRef50: UniRef50_E3WKD5, E-value 1e-20, identity 29.51%, Putative uncharacterized protein n=1 Tax=Anopheles darlingi RepID=E3WKD5_ANODA
NCBI RefSeq: XP_001655335.1, E-value 3e-21, identity 28.57%, hypothetical protein AaeL_AAEL011416 [Aedes aegypti]
NCBI nr blastp: gi|312385327, E-value 4e-20, identity 29.51%, hypothetical protein AND_00895 [Anopheles darlingi]
NCBI nr blastx: gi|157129255, E-value 2e-31, identity 29.64%, hypothetical protein AaeL_AAEL011416 [Aedes aegypti]
Group
Gene Ontology: GO:0005549, E-value 1.8e-09, odorant binding
KEGG pathway:
InterPro domain: [328-417] IPR023316, E-value 2e-10, Pheromone/general odorant binding protein, PBP/GOBP, domain
InterPro domain: [323-409] IPR006170, E-value 1.8e-09, Pheromone/general odorant binding protein, PBP/GOBP
Orthology group: MCL16816, Insect specific

Nucleotide sequence:

>DPOGS207854-TA
ATGAGATTTTTAAAACATTACGTCTTATTTTCCTGTTTGCTGTTCATTACTTCTGATGCACTGTCATGTCGTTCAAAATACGGCGCAAAAGATGAACGGTTAAAGCGTGTTTTTAATGATTGCTTGAAACAACAAGGGAGTAACAGCAGTAGTGATAGATACAATCAAGAGAGAAGAGAGCGATCAAACAATAACAGACATGATCATGGTAGAAGAAACAATCAGGATCAAAGAAACGGCGACGAAGACTACAAGAGAGAAGATTACAATGACGGTGCATACGGAAGAAACAGCAGAGGTCAAATGAATGAAGGCACGAGGGTTGAAAGTGGGAGAAATAACGGAAAGGGAGATACGAGATTTGATGGTCGAAATGAAAGGAGTCGTAACAGTAACTTAAGAGGACAAGACGAATTCTTACAGAGCGAAGAATATGGGAACGCACTGTCATGTCGTTCAAAATACGGCGCAAAAGATGAACGGTTAAAGCGTGTTTTTAATGATTGCCTGAAACAACAAGGGAGTAACAGCAGTAGTGATAGATACAATCAAGAGAGAAGAGAGCGATCAAACAATAACAGATATGATCATGGTAGAAGAAACAATCAGGATCAAAGAAACGGCGACGAAGACTACAAGAGAGAAGATTACAATGACGGTGCATACGGAAGAAACAGCAGAGGTCAAATGAATGAAGGCACGAGGGTTGAAAGTGGGAGAAATAACGGAAAGGGAGATACGAGATTTGATGGTCGAAATGAAAGGAGTCGTAACAGTAACTTAAGAGGACAAGACGAATTCTTACAGAGCGAAGAATATGGGAACGGTTTCCAGCAGAATTATTACTCATCTTCTCAATCAAACGGACGATACAAACGCGAAAAGAAGACGGAAATTAATTCTGGACAAAGAAGTCAGTATAACCCACATTCAAAAAACTCTAATGGTAACAGAGATGATTCAAATGAGAACAATTCCAGTGAGAACACAAACCAAGACATTGCATGTGTGCTGCATTGCTTTCTAGAGAATCTGCAGATGACCGGAGACAATGGAATGCCAGATAGATATTTAATTACTCACGCTCTTACAAAAGATGAAAGGAATGAAGATTTGAGAGATTTCTTACAGGAGTCTGTGGAAGAATGTTTCCAAATCCTCGATAACGAAAACACAGATGATAAATGTGAATTTTCAAAAAATCTATGGATGTGCTTATCGGAAAAGGGAAGATCGAATTGCGACGATTGGCCTAAGAAGACCACTTTTATGTTTTAG

Protein sequence:

>DPOGS207854-PA
MRFLKHYVLFSCLLFITSDALSCRSKYGAKDERLKRVFNDCLKQQGSNSSSDRYNQERRERSNNNRHDHGRRNNQDQRNGDEDYKREDYNDGAYGRNSRGQMNEGTRVESGRNNGKGDTRFDGRNERSRNSNLRGQDEFLQSEEYGNALSCRSKYGAKDERLKRVFNDCLKQQGSNSSSDRYNQERRERSNNNRYDHGRRNNQDQRNGDEDYKREDYNDGAYGRNSRGQMNEGTRVESGRNNGKGDTRFDGRNERSRNSNLRGQDEFLQSEEYGNGFQQNYYSSSQSNGRYKREKKTEINSGQRSQYNPHSKNSNGNRDDSNENNSSENTNQDIACVLHCFLENLQMTGDNGMPDRYLITHALTKDERNEDLRDFLQESVEECFQILDNENTDDKCEFSKNLWMCLSEKGRSNCDDWPKKTTFMF-
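
The transcript and protein entries above can be cross-checked against each other. The following is a minimal Python sketch, not part of the original record, that assumes the standard genetic code applies and that the trailing "-" in the protein sequence marks the stop codon. The names `translate`, `CODON_TABLE`, and the short test fragments are illustrative choices; only the first 30 and last 15 nucleotides of DPOGS207854-TA are used here to keep the example short.

```python
# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate a coding sequence codon by codon.

    Stop codons are written as `stop_symbol` ("-" here, an assumption made
    to match the convention used in the protein record above).
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# Length check: 425 residues plus one stop codon should account for 1278 bp.
transcript_bp = 1278
protein_aa = 425
lengths_consistent = transcript_bp == 3 * (protein_aa + 1)

# First 30 nt and last 15 nt of DPOGS207854-TA, copied from the record.
head = translate("ATGAGATTTTTAAAACATTACGTCTTATTT")
tail = translate("ACTTTTATGTTTTAG")

print(lengths_consistent, head, tail)
```

Passing the full nucleotide sequence to `translate` in the same way should reproduce the whole DPOGS207854-PA entry if the two records agree.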