Monarch geneset OGS2.0

DPOGS210608
TranscriptDPOGS210608-TA1191 bp
ProteinDPOGS210608-PA396 aa
Genomic positionDPSCF300168 + 21415-23802
RNAseq coverage1009x (Rank: top 13%)
Annotation
HeliconiusHMEL0059003e-14172.93% 
BombyxBGIBMGA014413-TA1e-9965.22% 
DrosophilaHydr1-PC8e-10651.75% 
EBI UniRef50UniRef50_A1Z7Q71e-10351.75%Alpha/beta hydrolase 1, isoform A n=10 Tax=Diptera RepID=A1Z7Q7_DROME
NCBI RefSeqXP_001647781.11e-10850.26%hypothetical protein AaeL_AAEL015298 [Aedes aegypti]
NCBI nr blastpgi|1571269682e-10750.26%hypothetical protein AaeL_AAEL010695 [Aedes aegypti]
NCBI nr blastxgi|3123813866e-10451.75%hypothetical protein AND_06325 [Anopheles darlingi]
Group
Gene OntologyGO:00040911.4e-113carboxylesterase activity
KEGG pathway 
InterPro domain[1-391] IPR0120201.4e-113AB-hydrolase YheT, putative
Orthology groupMCL12128 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210608-TA
ATGTTAGGAATGTTCTGGTATATATTTGAAGTAAAGAAGGAGTTTCTAGTATGCTTTTCACTGTCTTTGCTGTATATAACCTATTATCTTGTGGAAGTCGTCAAGAAACCGATGCTGATATGTCGTAAGGGGGAATTCCGTCAATTACTAGAGGATAATGTTCCATTGTTGAGTGAGCCTTACTGGCCTACTCCTTGGTGCGTGGAATCTCGATTGCAAACTGTACTGGGCTCTGTTCTTCGGTCACATTTACTTCCACCAGTAGCTTACCGACGCGAGGTGCTGCGGCTGTCGGACGGCGGTCAGGTGGCTCTGGACTGGGCCGAGCTGGCGGATGGCGACAGGGAGAGCCGGCCCGTGCTGCTCGTGCTGCCCGGACTAACGGGCGGGGCGCAGGCGGACTACGTGCGCTGCCTGGTGGCGGCAGCGGCGCGTCTCGGGGCGCACGCGGTCGTCTTCAACAATCGCGGTCTCGGCGGTCTGCCGCTCACCACACCGCGGCTGTACTGCGCCGTGTCGCACGCGGACTTAGCGGAGGTGGTGGAGGCGGTGCGGCCTCGCGGCGCTCCCCTGCTGGCGGTGGGGGTGTCTCTGGGTGGCCTCATCCTCGGACACTATCTCACGGAGCACGCTCAGCGCGCCGCGCATACCTTGCACGCCGCGCTCGTGGTGTCTTCTCCGCTGGACGTCGTGCGGGGTGCGGAGTGTATCGAGCGGCCTCCTCTGAATTCCTTGTTGTCGTGGCACATGGCACGTAACCTCCGCAACACGGTGAACTCTCATTCTCCTCTCCGGAGCGGTCCCGGTGACTGGGCGGCCGTGGAGCGCTGCCGGTCCGTCCGTCAGTTCGACCAGGCCTTTACGACCAAACACTTCGGATTCCCTTCCGTCGACGACTACTACCGCGCGGCGACCCTCTGTGACAAGCTGAGCCGCGTGCGCGTGCCGCTCCTCTGCCTGTGCGCGGCTGACGACCCCTTCCAGCCCCTGGACGTGTTACCGCTGGCGGAGGTGGAAAGCAGTCCTTGCGTGGCGCTGGCGGTGACTGCTCGCGGCGGTCACATCGGCTTCCTGGAAGGTTGGTGGCCGGCACCCCCGTCCCGCTCTCCTCACTCGCAGTACATCGCTCGCCTCGCTCACCAGTACTTCGCGGCGCTGCTGTCGTCCCCGCGTCCCGTCAGCCCCCCCTGA

Protein sequence:

>DPOGS210608-PA
MLGMFWYIFEVKKEFLVCFSLSLLYITYYLVEVVKKPMLICRKGEFRQLLEDNVPLLSEPYWPTPWCVESRLQTVLGSVLRSHLLPPVAYRREVLRLSDGGQVALDWAELADGDRESRPVLLVLPGLTGGAQADYVRCLVAAAARLGAHAVVFNNRGLGGLPLTTPRLYCAVSHADLAEVVEAVRPRGAPLLAVGVSLGGLILGHYLTEHAQRAAHTLHAALVVSSPLDVVRGAECIERPPLNSLLSWHMARNLRNTVNSHSPLRSGPGDWAAVERCRSVRQFDQAFTTKHFGFPSVDDYYRAATLCDKLSRVRVPLLCLCAADDPFQPLDVLPLAEVESSPCVALAVTARGGHIGFLEGWWPAPPSRSPHSQYIARLAHQYFAALLSSPRPVSPP-