Monarch geneset OGS2.0

DPOGS213754
TranscriptDPOGS213754-TA1725 bp
ProteinDPOGS213754-PA574 aa
Genomic positionDPSCF300212 - 468220-473629
RNAseq coverage727x (Rank: top 18%)
Annotation
HeliconiusHMEL0022667e-11985.77% 
BombyxBGIBMGA009240-TA0.087.48% 
DrosophilaNat1-PA0.064.51% 
EBI UniRef50UniRef50_Q7QKE30.065.02%AGAP002284-PA n=5 Tax=Coelomata RepID=Q7QKE3_ANOGA
NCBI RefSeqXP_975602.10.069.11%PREDICTED: similar to AGAP002284-PA [Tribolium castaneum]
NCBI nr blastpgi|910811910.069.11%PREDICTED: similar to AGAP002284-PA [Tribolium castaneum]
NCBI nr blastxgi|3407150080.069.35%PREDICTED: n-alpha-acetyltransferase 15, NatA auxiliary subunit-like [Bombus terrestris]
Group
Gene OntologyGO:00054883.5e-19binding
KEGG pathway 
InterPro domain[8-254] IPR0119903.5e-19Tetratricopeptide-like helical
Orthology groupMCL11431 Multiple-copy universal gene
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS213754-TA
ATGCCACATAGTAATTCATTGCCGCCAAAAGAAAATGCTCTTTTTAAAAGGATTTTGCGTTGCTATGAACATAAACAATACAAAAATGGGTTGAAATTTGCCAAACAAATACTATCTAATCCAAAATTTGCGGAACATGGAGAGACACTGGCTATGAAAGGTTTGACATTAAACTGTCTTGGTCGTAAAGACGAGGCTTATGAATACGTACGTCGTGGCCTCCGTAACGATCTGAAGTCGCCCAGGTCTGATAAGAAATACGATGAGGCCATCAAATGTTATCGGAATGCACTCAAATGGGAAAAGGAAAACATCCAGATTCTTCGTGATCTTTCTTTGCTTCAAATTCAGATGAGAGATTTAGAAGGCTACAAAGATACTAGGTACCAACTGTTTATTCTGAGGCCGACGCAGAGAGCTTCATGGATAGGGTTTGCGATGAGTTACCACCTCCTTGGAGATTATGAGATAGCAAACAGTATCTTGGATGCTTATCGTACAAATCAGATGAAGGGCCCTTATGATTATGAACATTCTGAATTGTTACTTTACCAAAACATGGTTCTTGCGGAATCCGGTCAATACGACAGGGCTTTGCAGCATCTCCATAAGTTCCAAAGTCAGATTCTAGATAAGCTTTCAATTAAGGAAACCTCTGGGGAATATTATTTGAAATTAAAGAGGTTTAAAGATGCGGAAGCTGTTTATGAGGATTTATTAAAAAGGAATCCCGAGAATGTTATGTATTATCAGAAATTAGTAGAAGCCAAACAGCTTAGTGATCCTGATGAAAAAGTGGCCTTTTATGATGTATATAAGAAGGAGTATCCGAGAGCAATAGCACCCCGGCGGTTGCAGCTTACCGAGGCCTGCCTGCCAGCCTTTGAATCTTTAGTAGATGAGTATCTTCGTCACGGACTACATAAAGGAATCCCGCCGCTATTTATGGATTTAAGATCATTATATGCGGATCAGAGTAAAGCGGACACAATAGAGAAGTTAATAGAACAATATATGGATTGTTTATCAAAGTCCGGTACATTTGGACCCAAAGCTGATGAAGTCAAGCAGCCGGCTAGTGCATTGCTGTGGACTTACTACTTTGCAGCCCAGCATTATGATTATAAACAGGATACGGACAGAGCACTGAAGTACATAGATGCAGCTATAGATCATACACCGACATTGATAGAGCTTTATATTGTTAAGGGAAGGATTTACAAGCATGCCGGTGACCCAGTTCGAGCTTATGGTTGGCTTGAAGAGGCTCAAGCTATGGACACAGCAGACCGATACGTGAACAGCAAATGCGCTAGATACATGCTGAGGGCGGGACACGTACAAAGGGCGGAGGATATGTGTGCCAAGTTCACAAGAGAAGGTGTACCAGCTACTGAGAATCTTAACGAGATGCAGTGCATGTGGTTCCAGACGGAGGCTGCTGCGGCGTACCAAAGACTTAAACAATGGGGCGAGGCGTTAAAGAAGGCCCATGAAGTTGATAGGCATTTCTCAGAAATAATGGAGGATCAATTCGACTTCCATTCATACTGTATGCGCAAGATGACGCTGCGATCGTACGTCGGCCTGCTCAGGCTCGAGGATGTTCTTCGAGCTCATCCCTTCTACTTCCGGTGTGCTCGCGTCGCCATACAAGTATATCTGCGACTCTACGCACATCCGTTGCAGGACGTGCCACAAACACAGGAGCCAGATACAGGTTAG

Protein sequence:

>DPOGS213754-PA
MPHSNSLPPKENALFKRILRCYEHKQYKNGLKFAKQILSNPKFAEHGETLAMKGLTLNCLGRKDEAYEYVRRGLRNDLKSPRSDKKYDEAIKCYRNALKWEKENIQILRDLSLLQIQMRDLEGYKDTRYQLFILRPTQRASWIGFAMSYHLLGDYEIANSILDAYRTNQMKGPYDYEHSELLLYQNMVLAESGQYDRALQHLHKFQSQILDKLSIKETSGEYYLKLKRFKDAEAVYEDLLKRNPENVMYYQKLVEAKQLSDPDEKVAFYDVYKKEYPRAIAPRRLQLTEACLPAFESLVDEYLRHGLHKGIPPLFMDLRSLYADQSKADTIEKLIEQYMDCLSKSGTFGPKADEVKQPASALLWTYYFAAQHYDYKQDTDRALKYIDAAIDHTPTLIELYIVKGRIYKHAGDPVRAYGWLEEAQAMDTADRYVNSKCARYMLRAGHVQRAEDMCAKFTREGVPATENLNEMQCMWFQTEAAAAYQRLKQWGEALKKAHEVDRHFSEIMEDQFDFHSYCMRKMTLRSYVGLLRLEDVLRAHPFYFRCARVAIQVYLRLYAHPLQDVPQTQEPDTG-