Monarch geneset OGS2.0

DPOGS208286
TranscriptDPOGS208286-TA2058 bp
ProteinDPOGS208286-PA685 aa
Genomic positionDPSCF300079 + 343070-351073
RNAseq coverage805x (Rank: top 16%)
Annotation
HeliconiusHMEL0028350.068.78% 
BombyxBGIBMGA006456-TA0.063.87% 
DrosophilaCG5397-PA1e-10636.47% 
EBI UniRef50UniRef50_Q1HQ050.064.94%Carboxylesterase n=3 Tax=Obtectomera RepID=Q1HQ05_BOMMO
NCBI RefSeqNP_001040411.10.064.94%carboxylesterase clade H, member 1 [Bombyx mori]
NCBI nr blastpgi|1140508710.064.94%carboxylesterase clade H, member 1 precursor [Bombyx mori]
NCBI nr blastxgi|1140508710.062.07%carboxylesterase clade H, member 1 precursor [Bombyx mori]
Group
KEGG pathwaytca:6598521e-36 
 K07378 (NLGN)maps-> Cell adhesion molecules (CAMs)
InterPro domain[1-582] IPR0020182.5e-91Carboxylesterase, type B
Orthology groupMCL15825 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208286-TA
ATGTGGTGGCTGTTGTTGTGTTCGCTGTTCGCGCCCGCACACGCGGTGGTAGGTGGTAGGCAGGCCGCACCACCCGAACCTGATGCCGCGGTGGTCTTCACACAGAAACATGGGCTGAGTGCGCGTTTAGAAGGCATCAAGGATGACGCTCGTGGATACTACGTGTTCGCCGGTCTGAGGTATGCGGAACCTCCGCTGGGTTCCAGGAGGTTTCAGAGGCCGGTGCGTCGTTTCTTGGCTGGTGAACAGCTTGCTAAGAGACACTGTCTCCCCTGTCCTCAAATGGACCCCAGGGGATCTGGCAGGGTTATTGGACACGAGGATTGCCTCTGTTTGAACGTTTACGCACCGAAAATGCCCGGCGACGAAGAAGGTTGTCCTGTGATTTTCTTTATTCACGGTGGCAACTACCGAACTGGGTCAGCTGCTCCGTATGGGGGTGTACATTTCACTCAAAAGGACACGATACTCGTAACAGCGCAGTACCGCTTGGGATCCCTTGGTTTCTTAAGTACTGGGGAGATCGATGCTTCTGGAAATACAGGTTTATTTGACCTTAGAGCCGCCATGGCATGGGTAAAAGACTACATATCATTCTTCGGTGGAGACAGTTCCAGGATAACTGTAATGGGCCATGGTTCTGGTGGTAGTGCGGCGTCTCTGATGGCTTTGTCACCTGAAGGTCGTTCAGCGACTGGGGTGGCGGCTCTTTCTGGAGCACCCCTGTCCCCGGGCGCTGTCAGGGAGAATCCATATAAACACGCTGAGGCCCTGTCAGAAAAAACTAATTGTCCTAAAGCACCCCCAGAAAAGCTTCTAAACTGTTTAAAATCTCTACCTGCTGAGAAGATTATAATGGCTGATAATGAAATTAAAGGCGATATGGTCGACGTAAACTCTTTCCTAGACGAGCTGTCCGGTCGCTCTGGTGCGGGTTCACGTGTGGAGGGTCCGCACGATCTTCGTTCTCTGCCGCCGCTGGTGGAGGAGGCGCCTACCGACTCCCTCAAACGGAAACACCAACGAGTGCCCATGTTCACAGGAGTGACCTCCGCGGAGACCGCCAATGCCGTCTTTGGTAAATACAGAAATTTCCTGACGAACCAAGTGACAGCTGTAAAAGATTTCATTAAGAAAGATCTGATTGGTGGTTTGCGTGGTGTGGTGCAGACAGTGCAAGGACTTAACCCGTTGAAGTTGACGGGAATCGACCAGGTCGTTCCTCTACCTGATTATTACCAAGCCGTCTTCGACAACTCCCTCAAGGCTATCGATGCACTGAGCGAAATCGTTGAAGCTACCAGTGATGCTCTATTCAATTTCCCTGCATACCAAAGCGTCCGTGAGTGGAGTGCCGGAGCCCCTGCATTCCTCTACAGCTTCGAACATCTCGGCAATTTGACACGCGGCAGTCACTTCCTCCCAGGACTTGCTCTCACACAGAGATCGGCTAATGCTACTGATGGTAAAACTAGCAAAATCAAGGGCCCAGCTCACGGTGACGAGCTGGCGTATCTGTTTGAACCACTTGACGAAGAAGGCAATCCAATTAATGAAGTGGTCTCGTCAACGGATGCTCGTGTCAGGGAGTCTTTTATTGAGCTTTTTGCTAAATTCGCTCATAGAACTGCATCAAACAATAAGAGCAACAACACAAGACCCAAAATATTTGACTTCCTGCCGTTTTCATCGGACAAAGAGCAATATTTAAAAATATCCGATTCAGTAACAACCGAGAAAGATTTTAGATTTTGTCAAATGGGTTTGTGGGGAGGTATGGCAGAACGTGTGACAGGAAGTTTATGCAAAAATATTTTAGGTCAGATCCTTAACCTTCCTAAATTACCTTTCGTCGGAGGGAATCAAATAAGTGGCATCATCGACCCTCTAGCAACGAATATTGGATCAAACATTAACCCGAACCTTGGCCCTGTTTTGAATGTCGGCAACAATGCGGGAACGAATAGTCCTTTAGGTTTACCCTTGATTGCAAAGAAACCTGCCAATTCTGTAAAACCTATGTGGACTAGTCCTTTTGGTAATCCTTTCGGAATATGA

Protein sequence:

>DPOGS208286-PA
MWWLLLCSLFAPAHAVVGGRQAAPPEPDAAVVFTQKHGLSARLEGIKDDARGYYVFAGLRYAEPPLGSRRFQRPVRRFLAGEQLAKRHCLPCPQMDPRGSGRVIGHEDCLCLNVYAPKMPGDEEGCPVIFFIHGGNYRTGSAAPYGGVHFTQKDTILVTAQYRLGSLGFLSTGEIDASGNTGLFDLRAAMAWVKDYISFFGGDSSRITVMGHGSGGSAASLMALSPEGRSATGVAALSGAPLSPGAVRENPYKHAEALSEKTNCPKAPPEKLLNCLKSLPAEKIIMADNEIKGDMVDVNSFLDELSGRSGAGSRVEGPHDLRSLPPLVEEAPTDSLKRKHQRVPMFTGVTSAETANAVFGKYRNFLTNQVTAVKDFIKKDLIGGLRGVVQTVQGLNPLKLTGIDQVVPLPDYYQAVFDNSLKAIDALSEIVEATSDALFNFPAYQSVREWSAGAPAFLYSFEHLGNLTRGSHFLPGLALTQRSANATDGKTSKIKGPAHGDELAYLFEPLDEEGNPINEVVSSTDARVRESFIELFAKFAHRTASNNKSNNTRPKIFDFLPFSSDKEQYLKISDSVTTEKDFRFCQMGLWGGMAERVTGSLCKNILGQILNLPKLPFVGGNQISGIIDPLATNIGSNINPNLGPVLNVGNNAGTNSPLGLPLIAKKPANSVKPMWTSPFGNPFGI-