Monarch geneset OGS2.0

DPOGS208921
Transcript: DPOGS208921-TA (2238 bp)
Protein: DPOGS208921-PA (745 aa)
Genomic position: DPSCF300009 - 291069-306515
RNAseq coverage: 308x (Rank: top 37%)
Annotation
Heliconius: HMEL016067 | 0.0 | 90.81%
Bombyx: BGIBMGA002496-TA | 4e-114 | 75.08%
Drosophila: CG5807-PA | 5e-115 | 52.85%
EBI UniRef50: UniRef50_F4X5V0 | 1e-136 | 61.81% | Protein LMBR1L n=10 Tax=Endopterygota RepID=F4X5V0_ACREC
NCBI RefSeq: XP_397358.2 | 8e-142 | 62.47% | PREDICTED: similar to CG5807-PA isoform 1 [Apis mellifera]
NCBI nr blastp: gi|350407234 | 8e-148 | 62.03% | PREDICTED: protein LMBR1L-like [Bombus impatiens]
NCBI nr blastx: gi|383856040 | 2e-150 | 63.80% | PREDICTED: protein LMBR1L-like [Megachile rotundata]
Group
KEGG pathway 
InterPro domain: [22-455] IPR006876 | 5.2e-92 | LMBR1-like membrane protein
                 [69-91] IPR008075 | 3.1e-53 | Lipocalin-1 receptor
Orthology group: MCL12611 Single-copy universal gene

Nucleotide sequence:

>DPOGS208921-TA
ATGGATGATGAGGAGGCGGACTTGCGTGAGCAGATATTCCACAATAATGTTAGGGAGCAGATAATATTTCTTCTGCTTTTTATACTCCTGTATCTATTATCGTTTATTCTCATCGAGAAGTTCCGTCGTCGTGACAGTGAGGATTATTTTACGGCTGATGAAGATGAAGTTAAGGTTTATAGAATCAGCTTGTGGCTTTGTACCTTCTCACTGGCCGTATCTCTAGGTTCTGCACTCCTGCTGCCAGTATCAATAGTCAGCAATGAAGTGCTCATACTGTACCCAAACAGCTATTATGTTAAGTGGCTCAATAGCTCACTCATTCAAGGGCTATGGAATCATGTGTTCTTATTTTCAAATTTGTCATTGTTTGTGTTCCTGCCATTTGCATATCTTTTTTCTGAGTCCACTGGATTCCCAGGCTGTAGAGGTCTGAAGGGCCGTGTATATGAGACCTTTATTGTGTTGGCTCTATTAGGAGTGGCTATGCTGGGCTTGGCATATGTTATCTCAGCTTGGCTGGAAGGTGACAGATCTAGTTTGGATGCATTATTGAATCTATGGACATATCTGCCGTTTCTCTACTCCTGTGTGTCGTTTGTTGGTGTCCTCATGTTACTTGTGTTCACTCCGTTGGGCTTCGTCCGTCTCTTTGGTGTGGTGGGTGGAGTGCTCGTGAAGCCGCAGTTCCTGAGGGACCTCAACGAGGAATACTACGTGTACTCATTTGAAGAAGATACCATAAGACGTAGAATTAACAATGCGGTAAACACAGGAGTGGGCTACATATCACCTGAGCCGATGTACCCTGATCGCGGAGACATATCCGCTCCTGTTACTCCTGATGTCTCGAAGGACGGAATTTCAGTTAACAAGGAAACGCCGTTATTGAGACTCAGTAATGGCGCCCTACAAGCCGGCCTGAATCAGAGGCTCAGACAGGTCGTCGCTATGAGGAAGGAAGTAGAGAGCCAACGGAAGACGTCATCGTTCCAGCGTAACGTGGTGTATCCCCTGGCTATGCTGCTGTTGCTGGCGTTGACCACCATCACCGTCCTCATGGTGCTGCAGAATACACTCGAGCTGCTCATCGGCATTAAGGCACTGCCGCTCAGTACCCGGCAATTCACATTGGGCATCGCGTCTCTATCTAAATTGGGTTGGGCGGGCGCTTCGTTGGAGGCTGGTTTGATCCTGTACTTGCACGCGGCCTCCTTGACCGGCCTGGGTACGCCGTTGCGTGCTCTGCGCGTGCTGCCTAGAGCACGACGCACGCCGCTCGCACGAATCATCGCGCTATGTACGGTGCTGCTCGCGCACTCCACGGCTCAACCTCTACTGGTTAAGATACTCGGAGTGGGCTACATATCACCTGAGCCGATGTACCCTGATCGCGGAGACATATCCGCTCCTGTTACTCCTGATGTCTCGAAGGACGGAATTTCAGTTAACAAGGAAACGCCGTTATTGAGACTCAGTAATGGCGCCCTACAAGCCGGCCTGAATCAGAGGCTCAGACAGGTCGTCGCTATGAGGAAGGAAGTAGAGAGCCAACGGAAGACGTCATCGTTCCAGCGTAACGTGGTGTATCCCCTGGCTATGCTGCTGTTGCTGGCGTTGACCACCATCACCGTCCTCATGGTGCTGCAGAATACACTCGAGCTGCTCATCGGCATTAAGGCACTGCCGCTCAGTACCCGGCAATTCACATTGGGCATCGCGTCTCTATCTAAATTGGGTTGGGCGGGCGCTTCGTTGGAGGCTGGTTTGATCCTGTACTTGCACGCGGCCTCCTTGACCGGCCTGGGTACGCCGTTGCGTGCTCTGCGCGTGCTGCCTAGAGCACGACGCACGCCGCTCGCACGAATCATCGCGCTATGTACGGTGCTGCTCGCGCACTCCACGGCTCAACCTCTACTGGTTAAGATACTCGGTATAACAAATTTCGATCTGCTGGGAGAGTTCGGTCGTATAGAGTGGCTCGGTAATTTCAAACTGGT
GCTCCTCTACAACGCTATCTTCGCGGCCACTGTGACTCTATGTCTGGTCAGCAAGTTCACAGCGAGCGTGAGAAGAGAACTATATAACAGATTTAAATGGGTGGCCAATATGTTCAGTGTGCCAGAAGAAGTGCCGAGTGGGAATATTACAAAAACGAATAAACGAGTAAGTACGGAACGCTCGTTCAACATTGAGAAGACTGAATCACTAGATCCATCACATTTGAAAACAGAGTAG

Protein sequence:

>DPOGS208921-PA
MDDEEADLREQIFHNNVREQIIFLLLFILLYLLSFILIEKFRRRDSEDYFTADEDEVKVYRISLWLCTFSLAVSLGSALLLPVSIVSNEVLILYPNSYYVKWLNSSLIQGLWNHVFLFSNLSLFVFLPFAYLFSESTGFPGCRGLKGRVYETFIVLALLGVAMLGLAYVISAWLEGDRSSLDALLNLWTYLPFLYSCVSFVGVLMLLVFTPLGFVRLFGVVGGVLVKPQFLRDLNEEYYVYSFEEDTIRRRINNAVNTGVGYISPEPMYPDRGDISAPVTPDVSKDGISVNKETPLLRLSNGALQAGLNQRLRQVVAMRKEVESQRKTSSFQRNVVYPLAMLLLLALTTITVLMVLQNTLELLIGIKALPLSTRQFTLGIASLSKLGWAGASLEAGLILYLHAASLTGLGTPLRALRVLPRARRTPLARIIALCTVLLAHSTAQPLLVKILGVGYISPEPMYPDRGDISAPVTPDVSKDGISVNKETPLLRLSNGALQAGLNQRLRQVVAMRKEVESQRKTSSFQRNVVYPLAMLLLLALTTITVLMVLQNTLELLIGIKALPLSTRQFTLGIASLSKLGWAGASLEAGLILYLHAASLTGLGTPLRALRVLPRARRTPLARIIALCTVLLAHSTAQPLLVKILGITNFDLLGEFGRIEWLGNFKLVLLYNAIFAATVTLCLVSKFTASVRRELYNRFKWVANMFSVPEEVPSGNITKTNKRVSTERSFNIEKTESLDPSHLKTE-