Monarch geneset OGS2.0

DPOGS207226
Transcript: DPOGS207226-TA (1575 bp)
Protein: DPOGS207226-PA (524 aa)
Genomic position: DPSCF300235, 375404-378534
RNAseq coverage: 641x (Rank: top 20%)
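
The genomic interval above is longer than the transcript, which can be checked with simple arithmetic. The sketch below assumes the scaffold coordinates are 1-based and inclusive (the record does not state this), and the helper name span_length is hypothetical. Under that assumption the locus spans 3131 bp, so 1556 bp of it lie outside the 1575 bp transcript; the record itself does not say whether those bases are intronic.

```python
def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    if end < start:
        raise ValueError("end must not be smaller than start")
    return end - start + 1


# Values copied from the record above.
GENOMIC_START = 375404
GENOMIC_END = 378534
TRANSCRIPT_LENGTH = 1575  # bp

genomic_span = span_length(GENOMIC_START, GENOMIC_END)
outside_transcript = genomic_span - TRANSCRIPT_LENGTH

print(f"genomic span: {genomic_span} bp")
print(f"not covered by transcript: {outside_transcript} bp")
```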
Annotation
Heliconius       HMEL012650        6e-167   55.75%
Bombyx           BGIBMGA008556-TA  0.0      61.33%
Drosophila       CG9009-PA         1e-132   44.80%
EBI UniRef50     UniRef50_B0W6R4   4e-137   45.90%   AMP dependent coa ligase n=2 Tax=Culicinae RepID=B0W6R4_CULQU
NCBI RefSeq      XP_001656463.1    2e-145   49.44%   AMP dependent coa ligase [Aedes aegypti]
NCBI nr blastp   gi|157134829      3e-144   49.44%   AMP dependent coa ligase [Aedes aegypti]
NCBI nr blastx   gi|157134829      1e-139   49.34%   AMP dependent coa ligase [Aedes aegypti]
Group
Gene Ontology    GO:0008152   5e-101   metabolic process
                 GO:0003824   5e-101   catalytic activity
KEGG pathway     tad:TRIADDRAFT_56202   5e-98
                 K01904 (E6.2.1.12) maps-> Phenylpropanoid biosynthesis
                                           Phenylalanine metabolism
                                           Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain  [33-454]   IPR000873   5e-101   AMP-dependent synthetase/ligase
Orthology group  MCL12152   Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS207226-TA
ATGCCCAATAGTACCCTTTACGACTACGTATGGATGAATCTAGATAAGTGGCCCGAGAGAACACTATCCGTATGTGCAGTCAGCGGCAGAGGTTACACTTACGAACAAGCATTTATCCTTTCTAATAATTTCGCAGCTAATCTTAGAAAAAAAATCAAATTAAGAGATGGGGATGCAGTTATAATAATGTTGCCAAATATTCCGGACTTTCCTCTTGTAGCTTTGGGAATATTGGAGGCTGGAGGGGTTATTAGCACTGTTAATCCCTTATACACAGCTCATGAAGTCCATCGCCAAATACTTATGTCTGATGCAAAAGTTATAGTGACTCTGGCAGAAACGGTGGATGTTGTAAGAAATGCTTTGAGACTCGCAAAAATGGACATTCCTATAATCGTTGTTAAAAACAACGGTGATGCATTGCCAGAGGGAACGGTGGCCTTTAACGAGCTCAGTGAAGATATCCACGTCGACAAATCCTGCTTAAAAGAAGTCAGACGAACACCTAAAGACATATGTTTTTTGCCCTATTCTAGCGGCACCACAGGACTTCCGAAAGGAGTTGAACTTACAAACAGCAATATTATAGCAAATTGTGAACAACTTAATGAACCATCTCTAAAATGTAACGAAGAAACCACAGCAACTCACCAAGATATAATAGTAGGGGTTCTCCCTTTCTTTCACATCTATGGAGCAACTGTCATAATGTTTAATAGTATCGCACAAGGACTGAAGATTGTTACTTTAGAAAAATTCCAGCCTGACGTGTTTATACAAATATTGGAAAAACATAAAATAAACATTCTGTATCTGGCACCGCCTTTAGTACTATTAATGATAAATCATTCTTTGTCATCGCCGGAGAGATTTCAATATTTGAAGCATATCATCAACGGCGCTGCTCCAGTCGCGTCCTCCGATATAGAAAGATTACTTGACAAAATCCAGAGAAAAATTCGTCTTGGTTCTGGTTATGGCTTATCAGAAACTTCACCGGTTATTGCAATGGCGGACAAGGCCTCAGAAAGATACGATATTATCGGTAACTCAATGGCGAATACTGAAATGAAGATCGTTAACGAGGATCTTAAAGCGTTAGGACCGAACCAGCTAGGAGAATTACTCGTCAGAGGTCCTCAGGTGATGCGAGGCTACAGAAATAATCCCGAAAGCAACGCTAGTGCATTTACTGATGATGGTTGGTTCAGAACTGGCGACTTAGCCACTGTTGACGAATCAGGTCGTTTGAAAATAGCAGATAGGCTAAAAGAACTTATTAAGGTCAAAGGTTTCCAAGTACCTCCTGCGGAATTAGAAGCCCTTCTCAGAGACCATCCGGCTGTATTCGACGCAGCTGTCATCGGTGTCCCTCATCCAACGAATGGAGAATCACCAAAAGCCTTTGTTGCTCTACGGCCCGGTGCTAATGTTAACACAAAGGAACTATGTGATTTTGTTTCTGAAAAAGTGGCGTCATATAAAAGAATTGATGATGTAGTCATCCTTGATAGTATCCCAAGAAGTGCTGCGGGAAAAATTTTGAGAAAGGACCTTAAAGCTAAATACTGCTAA
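
The relationship between the nucleotide record and the protein record can be illustrated with a minimal translation sketch. It assumes the standard genetic code and writes the stop codon as "-", matching the trailing "-" in the protein record; the function name translate and the short test fragments are illustrative choices, not part of the database. The lengths are consistent with this reading: 524 residues plus one stop codon is 525 codons, or 1575 bp.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest); "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stops as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 27 and last 12 bases of DPOGS207226-TA, copied from the record.
print(translate("ATGCCCAATAGTACCCTTTACGACTAC"))  # MPNSTLYDY
print(translate("AAATACTGCTAA"))                 # KYC-
```

Passing the full 1575 bp sequence to translate should reproduce the 524-residue protein followed by the stop symbol, provided the standard-code assumption holds for this gene.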

Protein sequence:

>DPOGS207226-PA
MPNSTLYDYVWMNLDKWPERTLSVCAVSGRGYTYEQAFILSNNFAANLRKKIKLRDGDAVIIMLPNIPDFPLVALGILEAGGVISTVNPLYTAHEVHRQILMSDAKVIVTLAETVDVVRNALRLAKMDIPIIVVKNNGDALPEGTVAFNELSEDIHVDKSCLKEVRRTPKDICFLPYSSGTTGLPKGVELTNSNIIANCEQLNEPSLKCNEETTATHQDIIVGVLPFFHIYGATVIMFNSIAQGLKIVTLEKFQPDVFIQILEKHKINILYLAPPLVLLMINHSLSSPERFQYLKHIINGAAPVASSDIERLLDKIQRKIRLGSGYGLSETSPVIAMADKASERYDIIGNSMANTEMKIVNEDLKALGPNQLGELLVRGPQVMRGYRNNPESNASAFTDDGWFRTGDLATVDESGRLKIADRLKELIKVKGFQVPPAELEALLRDHPAVFDAAVIGVPHPTNGESPKAFVALRPGANVNTKELCDFVSEKVASYKRIDDVVILDSIPRSAAGKILRKDLKAKYC-