Monarch geneset OGS2.0

DPOGS210426
Transcript: DPOGS210426-TA (1221 bp)
Protein: DPOGS210426-PA (406 aa)
Genomic position: DPSCF300062 - 414206-419056
RNAseq coverage: 2006x (Rank: top 6%)
Annotation
Source          Hit               E-value  Identity  Description
Heliconius      HMEL021573        2e-159   77.54%
Bombyx          BGIBMGA002756-TA  2e-114   64.10%
Drosophila      CG1882-PA         2e-114   53.71%
EBI UniRef50    UniRef50_E1ZYV7   4e-124   56.25%    Abhydrolase domain-containing protein 4 n=14 Tax=Coelomata RepID=E1ZYV7_CAMFO
NCBI RefSeq     XP_001608148.1    5e-124   58.40%    PREDICTED: similar to GA15096-PA [Nasonia vitripennis]
NCBI nr blastp  gi|350424130      5e-125   59.53%    PREDICTED: abhydrolase domain-containing protein 4-like [Bombus impatiens]
NCBI nr blastx  gi|332017035      4e-127   62.20%    Abhydrolase domain-containing protein 4 [Acromyrmex echinatior]
Group
KEGG pathway 
InterPro domain: [114-129] IPR000073, 3.8e-10, Alpha/beta hydrolase fold-1
Orthology group: MCL11993, Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS210426-TA
ATGGGAAGTGAATTAAAACTTGAAAAAACAAAATCCAGTGTTCTACATAGAACCTGGATTGGAAATGTAATTTTATACTTGTGGAATTGGACCGGCCGTTGGTTCAGCTGGACTCGTCAATCGGACTCCATGTTGCGGAATGTCGAACAACTAATATTATCATGTGTGAAGACGGCGTACAAGCGATTCTATGTGGACATAGGTTCAGTGGTGGGACAATGTGACAAAATTTGGACGATATCTTTAAATGAAGACTCCCCAAAGACACCGCTCGTTATGTTACACGGAATGGCCTCCGGGCTAGCGTTGTGGTGTCCCAACCTTGACGCGCTCGCAGCCACACGACCCGTCTACGCCATGGACTTATTAGGTTTCGGTAGGAGTTCCCGCCCGAAGTTCTCGTCTGATGCTGAGAAGGTCGAGGCTCAGTGGGTGGAGTCGGTTGAGGAGTGGCGGCGGGAGGTGAAACTCGAACAGTTCATACTGCTGGGACACAGTCTTGGAGGGTACATCGCTACGGCGTACGCTCTCAAGTATCCCGAAAGAGTCCGTCACCTAATCCTGGCCGATCCCTGGGGCTTCGCAGAACGCCCGGACAATATCAACGAGAAGTTCCATATTCCTTTCTACATCCGGGTTGTGGCCACTATCTTCCAGCCTCTGAACCCTCTGTGGCCGGTGCGAGCCGCCGGTCCGGCCGGGAAATGGCTCGTCAGCAAAACCAGACCCGACATCGCAAGGAAGTACACCAACTACGTGAAGGACGCCGACACTGTTATACCGGAATATATATACCAGTGTAACTCACAGACACCTAGCGGCGAGAGCGCATTTCACGCGCTAATGAACGGTTTCGGGTGGGCGAAGCACCCTATGTCTCGTCGGGCGGGGCAGTTGTCTCCGTCCCTGGGAGTGACCGTGCTGTACGGGGCGCGCTCCTGGGTTCAGACCGGGGCGGGACAGATAGCTGAAAATAGACCCGGGGCTGAAACACACGTACAGGTAATAAATGGAGCTGGTCATCACATATATCTGGACAAAACGGAGTTGTTCAATAAGTACGTACTGGAGGCGTGCGAAAGAGGCGACAGTCCGCGCCGACTCGTGGACCAGCCTCGAGAACAGTCCGCCAACAGTGCCACAAGCGAGTCGACTGGTGGAGCGACTAGTCGACCAACTAGTGGAGCGGCCGGCGACCAAACTCCGTCTCCACAAACCTAG

Protein sequence:

>DPOGS210426-PA
MGSELKLEKTKSSVLHRTWIGNVILYLWNWTGRWFSWTRQSDSMLRNVEQLILSCVKTAYKRFYVDIGSVVGQCDKIWTISLNEDSPKTPLVMLHGMASGLALWCPNLDALAATRPVYAMDLLGFGRSSRPKFSSDAEKVEAQWVESVEEWRREVKLEQFILLGHSLGGYIATAYALKYPERVRHLILADPWGFAERPDNINEKFHIPFYIRVVATIFQPLNPLWPVRAAGPAGKWLVSKTRPDIARKYTNYVKDADTVIPEYIYQCNSQTPSGESAFHALMNGFGWAKHPMSRRAGQLSPSLGVTVLYGARSWVQTGAGQIAENRPGAETHVQVINGAGHHIYLDKTELFNKYVLEACERGDSPRRLVDQPREQSANSATSESTGGATSRPTSGAAGDQTPSPQT-
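
Consistency check (illustrative):

The transcript and protein records above should agree under the standard genetic code: 406 residues plus one stop codon account for the 1221 bp of DPOGS210426-TA. The Python sketch below is one way to check this. It assumes the standard code (NCBI translation table 1) and follows this record's convention of writing the stop codon as "-". The function name `translate` and the two test fragments are illustrative choices, not part of the geneset release.

```python
# Standard genetic code (NCBI translation table 1), codons listed in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(nucleotides, stop_symbol="-"):
    """Translate a coding sequence in frame 1; stop codons become stop_symbol."""
    seq = "".join(nucleotides.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        # Codons containing ambiguous bases (e.g. N) are reported as X.
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        protein.append(stop_symbol if residue == "*" else residue)
    return "".join(protein)


# First 30 and last 15 nucleotides of DPOGS210426-TA, copied from the record.
head = "ATGGGAAGTGAATTAAAACTTGAAAAAACA"
tail = "TCTCCACAAACCTAG"

print(translate(head))         # MGSELKLEKT
print(translate(tail))         # SPQT-
print((406 + 1) * 3 == 1221)   # True
```

Passing the full nucleotide sequence to `translate` in place of the fragments gives a string that can be compared directly with the DPOGS210426-PA record.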