Monarch geneset OGS2.0

DPOGS208811
TranscriptDPOGS208811-TA1785 bp
ProteinDPOGS208811-PA594 aa
Genomic positionDPSCF300036 - 152118-156786
RNAseq coverage496x (Rank: top 25%)
Annotation
HeliconiusHMEL0150890.081.45% 
BombyxBGIBMGA007665-TA0.096.13% 
DrosophilaElp3-PA0.086.48% 
EBI UniRef50UniRef50_Q9H9T30.082.75%Elongator complex protein 3 n=333 Tax=root RepID=ELP3_HUMAN
NCBI RefSeqXP_001968673.10.086.48%GG25004 [Drosophila erecta]
NCBI nr blastpgi|3123779990.087.38%hypothetical protein AND_10547 [Anopheles darlingi]
NCBI nr blastxgi|3123779990.087.38%hypothetical protein AND_10547 [Anopheles darlingi]
Group
Gene OntologyGO:00038247.1e-36catalytic activity
GO:00515367.1e-36iron-sulfur cluster binding
GO:00081526.3e-10metabolic process
GO:00080806.3e-10N-acetyltransferase activity
KEGG pathway 
InterPro domain[1-540] IPR0059100Histone acetyltransferase ELP3
[75-337] IPR0066387.1e-36Elongator protein 3/MiaB/NifB
[405-537] IPR0161814.4e-19Acyl-CoA N-acyltransferase
[93-279] IPR0071971.5e-17Radical SAM
[132-318] IPR0234042.7e-13Radical SAM, alpha/beta horseshoe
[428-529] IPR0001826.3e-10GCN5-related N-acetyltransferase (GNAT) domain
[556-582] IPR0189024.5e-09Uncharacterised protein family UPF0573/UPF0605
Orthology groupMCL12495 Single-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS208811-TA
ATGGTGATTGTTATATCTGAAATAATACAAGAATTGTTAATCGCCCATCGTCAAGGAAAAGATGTTAACCTCAACAAGATGAAAACACGGATTTCTTCAAAGTATGGGCTGGGCACGTCTCCAAGACTAGTCGATATAATAGCTGCAGTTCCTGCTGACGCTAAGAGTATCCTTCTACCGAAATTGAAGGCAAAACCCATACGCACAGCATCAGGGATTGCAGTTGTAGCTGTTATGTGCAAACCTCATCGTTGTCCGCATATAAACTTCACTGGTAACATTTGTGTTTACTGCCCTGGGGGTCCTGATTCGGATTTTGAATATTCTACCCAGAGTTATACTGGCTATGAGCCTACTTCGATGCGAGCTATTAGAGCCAGATACAACCCATATTTACAAACACGTCACAGAGTTGAACAGCTGAAGCAACTTGGCCATAGTGTTGATAAAGTTGAATTTATTGTTATGGGGGGTACTTTTATGAGTCTTCCTGAAGATTATAGGGACTATTTTATTAGAAATCTACATGATGCACTCTCGGGTCATACGTCAGGTAATGTAGCTGAGGCTGTAAAGTATTCTGAAAGGGCTAAAACAAAATGCATCGGTATAACAATTGAGACCCGGCCGGACTATTGCCTTCAACGACACATGAGTGATATGCTGAATTATGGATGTACCAGATTGGAAATTGGTGTACAGTCGGTATATGAAGATATTGCTAGAGACACCAATAGAGGTCACACTGTGAAGGCTGTGTGTGAAAATTTTAATCTGGCCAAGGATGCTGGATATAAGATTGTAGCACATATGATGCCAGATCTTCCCAATGTTGATTTTGAACGAGATGTTGAGCAATTTAAGGAGTTCTTTGAGAATCCTGCATTCAGAGCCGATGGCCTGAAGATCTACCCAACATTGGTTATAAGGGGAACGGGTCTGTATGAACTGTGGAAGACTGGGAGATACAGGAGCTATCCTCCTTCTACTCTAGTGGATTTGATTGCTAAGATATTGGCTCTGGTTCCACCATGGACAAGGGTGTATAGAGTTCAACGAGACATTCCAATGCCATTGGTATCGTCTGGTGTTGAACATGGCAACCTTCGGGAGCTGGCTTTGGCTCGCATGGCAGATCTTGGCACTGATTGCAGGGATGTTCGTACCCGAGAAGTTGGGATACAGGAGATACATAATAAAGTCAGGCCATATGAGGTGGAATTAATTAGGCGAGATTATGCAGCGAATGGAGGATGGGAGACATTTTTGTCTTATGAGGATCCAGATCAAGATATACTGGTCGGTCTATTGAGACTAAGGAAATGTGCTAAGGATACTTACAGGCCAGAATTGAAGCCTGGACCCAGTTCGAACTTCAAGCAATGTAGTATCGTCAGGGAGTTGCATGTTTACGGATCTGTTGTACCGGTAAATGCGAGAGATCCGACAAAATTCCAACATCAAGGATTTGGTATGCTCCTCATGGAGGAAGCTGAGAGGATAGCCAAGGAGGAACATGGTTCTGACAAAATGGCGGTCATATCCGGTGTTGGAACCCGCAATTATTACGCTAAGATTGGTTATCATTTGGAGGGACCGTACATGGTCAAGATGTTACCGGATCCTCCTCTATCCATAAATCCCACTGAGATCTATCACAAACACGTGGGCATGTTGCCCAATTACGCGGGTCACGTGCCGGGATGCGTGTTCAGATTTGGAAAAACATATGGAAATGATACGAGAGATGCTAAGAGGTGGTTGCGTGGTGACTTCACTTCATGA

Protein sequence:

>DPOGS208811-PA
MVIVISEIIQELLIAHRQGKDVNLNKMKTRISSKYGLGTSPRLVDIIAAVPADAKSILLPKLKAKPIRTASGIAVVAVMCKPHRCPHINFTGNICVYCPGGPDSDFEYSTQSYTGYEPTSMRAIRARYNPYLQTRHRVEQLKQLGHSVDKVEFIVMGGTFMSLPEDYRDYFIRNLHDALSGHTSGNVAEAVKYSERAKTKCIGITIETRPDYCLQRHMSDMLNYGCTRLEIGVQSVYEDIARDTNRGHTVKAVCENFNLAKDAGYKIVAHMMPDLPNVDFERDVEQFKEFFENPAFRADGLKIYPTLVIRGTGLYELWKTGRYRSYPPSTLVDLIAKILALVPPWTRVYRVQRDIPMPLVSSGVEHGNLRELALARMADLGTDCRDVRTREVGIQEIHNKVRPYEVELIRRDYAANGGWETFLSYEDPDQDILVGLLRLRKCAKDTYRPELKPGPSSNFKQCSIVRELHVYGSVVPVNARDPTKFQHQGFGMLLMEEAERIAKEEHGSDKMAVISGVGTRNYYAKIGYHLEGPYMVKMLPDPPLSINPTEIYHKHVGMLPNYAGHVPGCVFRFGKTYGNDTRDAKRWLRGDFTS-