Monarch geneset OGS2.0

DPOGS206705
TranscriptDPOGS206705-TA1560 bp
ProteinDPOGS206705-PA519 aa
Genomic positionDPSCF300048 + 1998355-2020801
RNAseq coverage377x (Rank: top 32%)
Annotation
HeliconiusHMEL0048505e-11070.37% 
BombyxBGIBMGA008538-TA1e-8060.96% 
DrosophilaDat-PB3e-2629.09% 
EBI UniRef50UniRef50_A0EM562e-10063.09%Arylalkylamine N-acetyltransferase n=2 Tax=Bombycoidea RepID=A0EM56_BOMMO
NCBI RefSeqNP_001073122.15e-10163.09%arylalkylamine N-acetyltransferase [Bombyx mori]
NCBI nr blastpgi|1603335149e-10063.09%arylalkylamine N-acetyltransferase [Bombyx mori]
NCBI nr blastxgi|1603335141e-10263.09%arylalkylamine N-acetyltransferase [Bombyx mori]
Group
KEGG pathwaydme:Dmel_CG33182e-24 
 K00669 (E2.3.1.87, AANAT)maps-> Tryptophan metabolism
InterPro domain[51-279] IPR0161812.5e-12Acyl-CoA N-acyltransferase
Orthology groupMCL15894 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206705-TA
ATGGCAGTTACAAGTGCAAGGAGCATTGTGAATATGAAGGAGAAAGGACACTTTGAACATGAAGAAACTAGGATGGACAAGTATACGCTCGCCGACGAGTTGCCGGCGCAGCTCTCGGAACTTAATCTCCGTCCTCGTAAAATGGCTATTCCGTACACCCTGAGGAGACTCACCCCCAAGGACAAAGATGGCGTCATCGAGTTTATGAGAAGATTCTTCTTTCGCGACGAGCCTTTGAACTTCACAATCAACTTATTGGAGACTCCAGAATCTCGGTGCTACGAGTTAGAAGATTACACAGCCAGCAGCCTAGCTGATGGAGCCTCAGTGGCCGCTGTTGACGAGAACGGAGAGTTCGTGGGGATGGTCATCAACGGTGTTGTTAGACGAGAGGAGGTAGATTACACGGATAAATCCGAAGATTGTCCACATCCTAAATTCAAGAGGATTCTGAAGCTGTTGGGACATTTGGACCGTGAAGCCAGGATATGGGACAAGCTTCCGCTAAGCTGTCACACAGCAGAGGTAGATTACACGGATAAATCCGAAGATTGTCCACATCCTAAATTCAAGAGGATTCTGAAGCTGTTGGGACATTTGGACCGTGAAGCCAGGATATGGGACAAGCTTCCACTAAGCTGTCACACAGCAGTTGAGATAAGAATCGCTTCAACTCACACGGATTGGAGAGGAAGAGGCTTAATGAGAGTTCTCGTAGAGGAATCAGAGCGGATAGCTAGAGAAATCGGCGCCGGCGCAGTGCGTATGGATACGACCTCTGCGTTTTCCGCGGCCGCGGCCGAGCGACTCAACTACAAAAATGTCTACGGTTTGTTTTACTCCGACCTGCCTTACGCACCACATCCGGACCCTCCCCATCTAGAAGCTAGGGAAATGGAAGAGAGCAATTCGTACGACATAAATCTTATTACAGCAGAAGATGCTGAGGCGGTTATGTCGTTGGTTAAAAGGACATTTTATATCGATGAACCCTTGAATCAAGCTGTGGGGTTGTGTACTTCGGAGTCAGACCCTTGTCACGAACTCGACGATTACTGCTCCAGTTCACTTCTAAGTGGACTATCCTTCAAAGCTATCGATCACGAGGGAAATGTTATCGGTGCTATGATCAATGGAGTATGTCCTTTAAAAATGGATGACGAGATACTTAATCAAGCACAACATTGCAAAAATCCCAAGTTTCAAAGAATATTATACGTATTAGCACAGAGAGAAAATGGCGCGAAACTGTGGGAGAAATTTCCGAACGACAACGAAATTGTTGAAATAAAAGTCGCTGCTACAGATCCGAAATGGAGAAAGAAAGGCATCATGAAAGCATTGATAGACAAAACAGAAAAAGCCGTAAAGCAGAAGAATATTAGATTGCTAAGATTAGATACTTCCAGCGCGTATTCAGCCATGGCTGCCGAGAGGTACGGGTATACTTGTTATTATAAAGCATTATATAAAGATATCAAGATGAACGGACAACCTCTCATAGTGCCCGAACCACCTCACTTAGATGACAGAGTTTATGTTAAAGAAATATATTCTTAA

Protein sequence:

>DPOGS206705-PA
MAVTSARSIVNMKEKGHFEHEETRMDKYTLADELPAQLSELNLRPRKMAIPYTLRRLTPKDKDGVIEFMRRFFFRDEPLNFTINLLETPESRCYELEDYTASSLADGASVAAVDENGEFVGMVINGVVRREEVDYTDKSEDCPHPKFKRILKLLGHLDREARIWDKLPLSCHTAEVDYTDKSEDCPHPKFKRILKLLGHLDREARIWDKLPLSCHTAVEIRIASTHTDWRGRGLMRVLVEESERIAREIGAGAVRMDTTSAFSAAAAERLNYKNVYGLFYSDLPYAPHPDPPHLEAREMEESNSYDINLITAEDAEAVMSLVKRTFYIDEPLNQAVGLCTSESDPCHELDDYCSSSLLSGLSFKAIDHEGNVIGAMINGVCPLKMDDEILNQAQHCKNPKFQRILYVLAQRENGAKLWEKFPNDNEIVEIKVAATDPKWRKKGIMKALIDKTEKAVKQKNIRLLRLDTSSAYSAMAAERYGYTCYYKALYKDIKMNGQPLIVPEPPHLDDRVYVKEIYS-