Monarch geneset OGS2.0

DPOGS212275
TranscriptDPOGS212275-TA1248 bp
ProteinDPOGS212275-PA415 aa
Genomic positionDPSCF300077 + 20850-29700
RNAseq coverage190x (Rank: top 48%)
Annotation
HeliconiusHMEL0066575e-15994.68% 
BombyxBGIBMGA011436-TA2e-11876.16% 
DrosophilaCG15628-PA4e-9558.25% 
EBI UniRef50UniRef50_Q9VR305e-9358.25%CG15628 n=7 Tax=Sophophora RepID=Q9VR30_DROME
NCBI RefSeqXP_974009.24e-9559.01%PREDICTED: similar to CG15628 CG15628-PA [Tribolium castaneum]
NCBI nr blastpgi|3504142221e-9456.64%PREDICTED: hypothetical protein LOC100742270 [Bombus impatiens]
NCBI nr blastxgi|1892348825e-9659.22%PREDICTED: similar to CG15628 CG15628-PA [Tribolium castaneum]
Group
Gene OntologyGO:00167479.2e-27transferase activity, transferring acyl groups other than amino-acyl groups
KEGG pathway 
InterPro domain[136-403] IPR0161818.7e-39Acyl-CoA N-acyltransferase
[329-409] IPR0136539.2e-27FR47-like
Orthology groupMCL15969 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS212275-TA
ATGCGCGACGTCAGCGCGCGCACCGTCGGCGGGATTTGGCTGATCAGTGATCACTCCGCCGGCAAACCGATCGCGAAGTACACGCGCTCAAGGATCCAGGAAGTAACGCCGGCCCCTCGCGGCCAGGAGCCCTGCCAGGCGCTTCCCGACCTACTCCTACACCGAACCATCACCGGGCAGAGCCGTTCGCAGACAGGGCTCCGCGGACTAAAGAAAATAACGTGGCCGCCCTCGTTTGATTTCCATTTGCACCGCAATGGCACATTTGTTAACGATCTGGCTAGGAACTTCCCCTATATCATATTCGCCATGCAGGGGCTGCCGCATTATGGCAGGGGTAGTGGTTCACGGCGGTGTGTTGTCACTGACAGCATTGCCATTCGACCTTCTTTCTTTGAGCGACAGATGGAAATCGTGCCCGAGGGGAAAAACTTCCGATTGGTAACAGAAGACGAGCTCCCTGCTATAGCTGATATATTAGTGCAGTACATGCCTGAATCTCTAAAGTTCCATCAGACGATCCAAACATATCTTAACAACAAGGTGTGGGATTTCCATTTTTATGTAGCGAAGAATTGGCCGGAGGATCCTATCTGCCTTCATTTCCCTGGATGCACCAGCACTCCAAACCGTCATCCATATGAGAGCGTGACAATATTCTGTCCATCCGAGCGCTCAGAGCTGGTGGACCTACTGAGCTCGGAGGACGTACTTCTGGACCTCACACAACCATTGTACCTGAACTTCACTCACGAGGCCATCGTGGAAAGATTTGAAAAACGATATGAAGCCCACGACAAAATCACGGGAGATGTATACGTTTGTGTCAACCCACCAGATGATAACATTGAAGAATTGCCACCGGACGTGGAGTTAGTCCGCCTTCAGCCGGAACACATGAAGGCTGTTCACGATCTGTATCCGGCTAGCGACATCGAATGTCGGGAGGTGTTCGAGAAGCTGGTGGCCGAACTGCCAGCGTACGGCATCTTCGTGGAAGGTCAACTGGCAGCCTGGATGGTGCAGTCATATTACGGCGCCATGTTCTCCATGCAGACCAGACCAGAGTTCCGTAGGAAAGGCTATGGTATATTCCTAGCGAGGCGTCTCACCAAGGAGGTAGCTGCGCGCGGCTATAAGCCGTTCGTCGTTATTCGTCCGGAAAACGACGCGTCCCGTTCCCTATACTCAAAACTGGGCTTCGAGAAACGTTTCCGAACGGTGCGCGCCGTTTTGCGTCCTCGCTAA

Protein sequence:

>DPOGS212275-PA
MRDVSARTVGGIWLISDHSAGKPIAKYTRSRIQEVTPAPRGQEPCQALPDLLLHRTITGQSRSQTGLRGLKKITWPPSFDFHLHRNGTFVNDLARNFPYIIFAMQGLPHYGRGSGSRRCVVTDSIAIRPSFFERQMEIVPEGKNFRLVTEDELPAIADILVQYMPESLKFHQTIQTYLNNKVWDFHFYVAKNWPEDPICLHFPGCTSTPNRHPYESVTIFCPSERSELVDLLSSEDVLLDLTQPLYLNFTHEAIVERFEKRYEAHDKITGDVYVCVNPPDDNIEELPPDVELVRLQPEHMKAVHDLYPASDIECREVFEKLVAELPAYGIFVEGQLAAWMVQSYYGAMFSMQTRPEFRRKGYGIFLARRLTKEVAARGYKPFVVIRPENDASRSLYSKLGFEKRFRTVRAVLRPR-