Monarch geneset OGS2.0

DPOGS206018
TranscriptDPOGS206018-TA1941 bp
ProteinDPOGS206018-PA646 aa
Genomic positionDPSCF300253 + 207196-219412
RNAseq coverage308x (Rank: top 37%)
Annotation
HeliconiusHMEL0153763e-7729.21% 
BombyxBGIBMGA012637-TA2e-9549.12% 
DrosophilaCG9447-PB4e-4124.92% 
EBI UniRef50UniRef50_Q7PTG82e-5526.55%AGAP007079-PA n=3 Tax=Culicidae RepID=Q7PTG8_ANOGA
NCBI RefSeqXP_308678.43e-5626.55%AGAP007079-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|1582863256e-5526.55%AGAP007079-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|1582863256e-6326.55%AGAP007079-PA [Anopheles gambiae str. PEST]
Group
Gene OntologyGO:00167476.5e-17transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathwaydme:Dmel_CG333372e-10 
 K00680 (E2.3.1.-)maps-> Benzoate degradation via CoA ligation
    Limonene and pinene degradation
    Ethylbenzene degradation
    Tyrosine metabolism
    1- and 2-Methylnaphthalene degradation
InterPro domain[208-597] IPR0026566.5e-17Acyltransferase 3
Orthology groupMCL25993 Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206018-TA
ATGGAGTGTCTGAAGAAGTTGGTATACTTAGCGATGCTGTTGGCCGTCTGTGACGGCAGCTCTGAGGAAATAAATGAGACTTCGGTCGGGAGTTTCCCACCGTTGTACGCTTTGGAGGACTGGAAGCTGTGTCAGAACCCTGGAGACGTGTATTGTATTGTGGATGCTCTGCTCGTCTCCACGAACTCTTCGCCGCTGCTGCAAGAGATACAGGAATATTCATCGAGAACGCTCAAGCACTTCAACAGGACAGTTGTCCATCGTGGGGTGTGTGTGACGAAATGCGGGGGCCAGGATCCTGATACGTGGAATGTGGCCGCTGAGGGATGTATCAACACCAGCTTGGAGGAGTATGGACTTGAGGCGGACGTCCAGGACGTCGGCTGGTGTCGCAACAACCAGCCTACACCCATGAGTACGTCGGCCCGAGTGTTCGTCATAGTATGTGCGACGTTGCTGGCTATGACATTAATAGCAACGGGACTACACGCTTTGGAATATAGATTTGGGAAATTCTTTGGTAATAAATACATCTTGGCTTTTTCATTAAAACGAAATTGGAAAATGCTTATATATGATGACAAACAACGAAACAGAAACGAGCGCACGGAGGACTTGAGTTGCATCGATGGTATTAGGTTCATCGGAACACTCAGCGTCGTTCTGACTCACGTCACAATTATTCACGTGTTTGCTTTCATCGACAATCCGGATTTTATTGAAAATTTATATGAACATGTGAGTACTAAGTCAGCCTTCAACACGCCACTTTGGATCCAAGCCTTCATATCGATATCCGGATTCCTCTCCGCGTACTATCTCCTCATTTACACCGAGAAACACTCCTTCACTTGGAAAAAATGTGTAGTATCTGTTTTACATCGATACATCAGATTAACTCCCGTGTCGTTATTCACTCTGTGGTTCACGATATCGTGGTTACCTCGCCTGGGTTCAGGTCCTCAGTGGTCGTGGCTGGTGGAGCAGGAAGCCCAGTACTGCACCGAGCGTGGATGGTACCACGCGCTTTACATTCACAACTACCTCACTCTTGGAAAGTTGTGCATGGGGCATACGTGGTACTTGGCGGTAGACATGCAGCTTCATGTACTCGGCTCATTTCTTCTTCTGATTCTTATGAGATGGAGGAAGGCCGTCATCCCTGTGCTGGCTACCATAGTGATTGCATCAATGGCTGTTACAGGATTGCTCGTCTATTTCTTAAATCTAACTCCTATCATAAGCGCGCAGTCTCCTGAGACTGTTCGCAACATGTTTAAAGGTTCAGCGATAATGCCTACGATATATTTGCCTGTGTGGGTGCATTTTGCCGGCTATGCTTTAGGCATTGCTACTGCCTATATACATTACAATGATCAAAATAATGGATACAAGCTCAGGGATAGTAAGTGGTTTTCAGCTATCTTTCACACGTCCTTGTTGCTGGCCGCAGCTGTCAGCGTCGCTGGTGTGCCATTCCTCTCAGATTCTCCTCCTCCAAGCTGGGTGACTGCTCTATATGCCTCCGTTGATAAAATACTGGTGGCGCTCTTCTTTAATGTGTTTTTATTGGGGTGCTTGAGTCGTTGTAGATCGGTGTTCCGCGACCTGTTGTCGTGGCGAGGCTGGTACTCCCCCGGCAGGTTGTCATACTCCGTCTTCATCATACACTTCGTCATCATGAGGTTCACCATCGCCAACAACCCTCAGATTATTCATATCACCGGTTATTCCTCTTTATCTTTATTAATAGTTGGAACAGTGCTGTCATATCTTATATCTGTTCCGGTATTTTTGGTGATAGAGATGCCCTTTATCCAGCTCTGGAAGGCTGTCATGGGTCTCGATGGTCCCAAAAAAGATGCACAGGCACAAGAAACACAGAATAAGATTGATCTCGTGATGAACGGGAGTAGGAGAAGTGGACAGAATGTTGTTTGA

Protein sequence:

>DPOGS206018-PA
MECLKKLVYLAMLLAVCDGSSEEINETSVGSFPPLYALEDWKLCQNPGDVYCIVDALLVSTNSSPLLQEIQEYSSRTLKHFNRTVVHRGVCVTKCGGQDPDTWNVAAEGCINTSLEEYGLEADVQDVGWCRNNQPTPMSTSARVFVIVCATLLAMTLIATGLHALEYRFGKFFGNKYILAFSLKRNWKMLIYDDKQRNRNERTEDLSCIDGIRFIGTLSVVLTHVTIIHVFAFIDNPDFIENLYEHVSTKSAFNTPLWIQAFISISGFLSAYYLLIYTEKHSFTWKKCVVSVLHRYIRLTPVSLFTLWFTISWLPRLGSGPQWSWLVEQEAQYCTERGWYHALYIHNYLTLGKLCMGHTWYLAVDMQLHVLGSFLLLILMRWRKAVIPVLATIVIASMAVTGLLVYFLNLTPIISAQSPETVRNMFKGSAIMPTIYLPVWVHFAGYALGIATAYIHYNDQNNGYKLRDSKWFSAIFHTSLLLAAAVSVAGVPFLSDSPPPSWVTALYASVDKILVALFFNVFLLGCLSRCRSVFRDLLSWRGWYSPGRLSYSVFIIHFVIMRFTIANNPQIIHITGYSSLSLLIVGTVLSYLISVPVFLVIEMPFIQLWKAVMGLDGPKKDAQAQETQNKIDLVMNGSRRSGQNVV-