Monarch geneset OGS2.0

DPOGS206692
TranscriptDPOGS206692-TA1761 bp
ProteinDPOGS206692-PA586 aa
Genomic positionDPSCF300048 + 1297408-1303149
RNAseq coverage2334x (Rank: top 5%)
Annotation
HeliconiusHMEL0088370.072.20% 
BombyxBGIBMGA008524-TA0.077.99% 
DrosophilaCG9009-PA2e-10439.93% 
EBI UniRef50UniRef50_Q7PGI22e-18053.75%AGAP002503-PA n=9 Tax=Endopterygota RepID=Q7PGI2_ANOGA
NCBI RefSeqXP_966640.10.059.10%PREDICTED: similar to AMP dependent coa ligase [Tribolium castaneum]
NCBI nr blastpgi|910816150.059.10%PREDICTED: similar to AMP dependent coa ligase [Tribolium castaneum]
NCBI nr blastxgi|910816150.059.10%PREDICTED: similar to AMP dependent coa ligase [Tribolium castaneum]
Group
Gene OntologyGO:00081521.8e-101metabolic process
GO:00038241.8e-101catalytic activity
KEGG pathwayspu:5819115e-100 
 K01904 (E6.2.1.12)maps-> Phenylpropanoid biosynthesis
    Phenylalanine metabolism
    Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain[99-513] IPR0008731.8e-101AMP-dependent synthetase/ligase
Orthology groupMCL18872 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206692-TA
ATGTCTCATTTGAAAATATTACGCGGGAATTTAGCTCATAGAATTCTAAGTAATTTAACTCCTAAAACAAAATGTAAAAAAATTAGACAATTTCACGTATCTAACCGTAATTCAGCAATTAAAACAGCGACGGTGGTTGAGCATAATGTTTTAAGTTCGCCGTGGGGTGAAATTACTATAGGAAACGAGACTTTAACTCAACACGTATTTCAAGATGTCGAAAAATGGTCGGACGCTCCCTGTGTGACATGCGGCGCATCAGGTCGTTCATACGACTACGGCATGATGAGGATGATGATTGATAGATGTGCGAACGCTCTCGCCGGACATTTGAAACTCGCGCCGGGCGAGAGAGTAGGTCTCATACTACCGAATCTACCTGAATTCGTAGTGCTTATACATGGTGCTATGCAGGCTGGCCTCGTAGTTACATTCGCCAATCCCCTGTATACAGCTGATGAGGTCGGACGCCAATTTTCTGATTGTGGTGTTAAAGCTATTGCTACAATTGAAATGTTCATGCCGGTTGCTGAAAAAGTCAGCAAAATGTTAAAAGACTACAAGGGTACCATCTGGGTGGGTGGTGATGACGATAAAGCAAAAGGTATATACGGTCTGAAGTCCTTACTAATGGCTGATCATAAAGCCGACCTGCCGACATTGAATTGTGATGACGTGTGTTTGGTCCCGTACTCCAGCGGCACAACGGGTCTACCGAAAGGCGTCATGTTAACACACAAGAACTTGGTCTGCAATCTCAAGCAGGTCCAAGTGCCCAAGATGATGAAGTATGAAGGAGAGAAAGGTAAAGGAGACGTAATTCTAACTGTTCCGCCGTTTTTCCATATCTATGGCTTCAACGGGATACTGAACTACAATCTCATCTTAGGGTACCATTTAGTGTCTATCCCAAAATTCACTCCAGAGGATTATATCAACTGTCTGGTAGAATATCAGCCGACTACGTTGTTCGTGGTGCCGTCGTTGCTAGCTTTCTTGGCGACTCATCCCTCTGTGAAGAAGGAACATCTTCAGTCCGTGGAGACCATTATGGTCGGAGCCGCGCCCACTACTGACAGCATGTTAGAGAAGTTCCTCATCAAGTGTGAGAAGAGCAAGGACCAGATCAAGTTGCTTCAAGGTTATGGTATGACGGAGAGTTCTCCCGTGACGCTGATGACTCCATACTCGTACCCGTACAGTAAGGTGGGCTCTGTGGGTCAGCTGGTGCCGTCTACTCAGGCCAGGGTGACGTCACTGACTGACGGCACACCCCTCGGACCACACCACAGCGGGGAGCTGCTTCTGAGGGGACCGCAGGTAATGAAAGGTTACTGGAATAATGAGAAGGCGACGGCAGAAACGGTTGATAGTGAGGGCTGGCTGCATACAGGAGACGTGGCCTATTACGACGAGGACGGGTACTTCTATATAGTTGACAGAACCAAAGAGCTCATTAAAGTTAAAGGCAATCAGGTGTCACCAACAGAAATAGAGAGTATAATTATGGAAATACCTGAAATCGCGGATGTTGCGGTCGTGGGAATCCCCGATGCGTTAGCCGGGGAAGTACCACGAGCCTTCGTCGTTCTGAAACCAGGAAGTAAATTAACAGAAAAAAATATTTACGATGTCGTAGCAGAGAAACTCACCAAATATAAGCATCTCGAAGGAGGTGTTGTATTCGTAGAGGCTATTCCAAGAAATGTAGCTGGTAAAATATTGCGTAATGAACTTAAAGTATTAGGAAGGAAGAAGTGA

Protein sequence:

>DPOGS206692-PA
MSHLKILRGNLAHRILSNLTPKTKCKKIRQFHVSNRNSAIKTATVVEHNVLSSPWGEITIGNETLTQHVFQDVEKWSDAPCVTCGASGRSYDYGMMRMMIDRCANALAGHLKLAPGERVGLILPNLPEFVVLIHGAMQAGLVVTFANPLYTADEVGRQFSDCGVKAIATIEMFMPVAEKVSKMLKDYKGTIWVGGDDDKAKGIYGLKSLLMADHKADLPTLNCDDVCLVPYSSGTTGLPKGVMLTHKNLVCNLKQVQVPKMMKYEGEKGKGDVILTVPPFFHIYGFNGILNYNLILGYHLVSIPKFTPEDYINCLVEYQPTTLFVVPSLLAFLATHPSVKKEHLQSVETIMVGAAPTTDSMLEKFLIKCEKSKDQIKLLQGYGMTESSPVTLMTPYSYPYSKVGSVGQLVPSTQARVTSLTDGTPLGPHHSGELLLRGPQVMKGYWNNEKATAETVDSEGWLHTGDVAYYDEDGYFYIVDRTKELIKVKGNQVSPTEIESIIMEIPEIADVAVVGIPDALAGEVPRAFVVLKPGSKLTEKNIYDVVAEKLTKYKHLEGGVVFVEAIPRNVAGKILRNELKVLGRKK-