Monarch geneset OGS2.0

DPOGS207227
Transcript: DPOGS207227-TA (2862 bp)
Protein: DPOGS207227-PA (953 aa)
Genomic position: DPSCF300235 - 358089-369868
RNAseq coverage: 8x (Rank: top 86%)
Annotation
Heliconius        HMEL012650            0.0       46.93%
Bombyx            BGIBMGA008557-TA      9e-137    50.92%
Drosophila        CG9009-PA             1e-86     37.03%
EBI UniRef50      UniRef50_D2A3J8       2e-174    38.43%    Putative uncharacterized protein GLEAN_07506 n=5 Tax=Tribolium castaneum RepID=D2A3J8_TRICA
NCBI RefSeq       XP_966892.1           3e-93     38.63%    PREDICTED: similar to AMP dependent coa ligase [Tribolium castaneum]
NCBI nr blastp    gi|270005448          7e-174    38.43%    hypothetical protein TcasGA2_TC007506 [Tribolium castaneum]
NCBI nr blastx    gi|270005448          6e-175    38.41%    hypothetical protein TcasGA2_TC007506 [Tribolium castaneum]
Group
Gene Ontology     GO:0008152            1.1e-48   metabolic process
                  GO:0003824            1.1e-48   catalytic activity
KEGG pathway      tad:TRIADDRAFT_56202  6e-77
                  K01904 (E6.2.1.12)    maps->    Phenylpropanoid biosynthesis
                                                  Phenylalanine metabolism
                                                  Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain   [719-885]             IPR000873 1.1e-48   AMP-dependent synthetase/ligase
Orthology group   MCL25504              Insect specific

Nucleotide sequence:

>DPOGS207227-TA
ATGTGTCCTGTTGATTTCAAAAATTATAATTCTGTTGGTTTTCCATTGCCTAACGTCCAATTGAGGATTGTGGACGAGAATGCAAAAAACCTAGGACCCAATAAGGTTTGTGGAGTAACCGAGAGAAAATATACATATTTGCAATTATATCGTAAATCTCAAACATTAGGGGCGAATTTAAGGAGAAACTTTGGCATTAAAAATGGTGATCTCATAGCTGTTATGTTGTCAAATATACCTGAATATCCTATTATAACTTTGGGAGTATTAAGCGCAGGAGGAATTGTAACTACTCTGAATCCAATTTATACTTCATATGAAGTACAAAGGCAATTGATGAGTGCGCATGTTAAGATAATAATAACTTCACCAGAGAATGTATCCACTATAAAACAAGCATTGGATTTAAACAAAATGAGTACACCCATAATTGTAGTCGATTTCAATAGCCCACGTCCCGATGGAACTATATCGTTTAATGAAATTATAAACGATACTATTGACACAAGTATTTTAAAAGAAGTCAAAACTAAACCTCGTGATGTATCATTTCTACTGTATTCGAGTGGAACAACAGGACTACCGAAAGGTGTAGAATTAACGCATAGAAATAAAGTTGCTAATTCTGTACAACAAGATTCCGAAGAGATCAAACATTATAATCTAACTACAGATTCAAATCAAGACACCATTTTGTTATATCTTCCCATGTTCCACTCTTATGGAATGTCTGTAAAAATGTTACACAAACTTTCGGTTGGCTTAAAGTTAGTCACTCTACCCAAATTTAAACCTGACACATTCATTAGTATTTTGGAAAAACATAAGATCAACCTAACGTACCTTGTTCCTCCTACAGACCCTCAAGTGAAAAGGAAACATTTTCAGTATCTAAACTATTTGGGAACCGGTGGGGCTCCATCTCCACGGGCAGACATAGAGAAGTTATTAGATAAAGTTGGCGTGAGTAATGTTATGGATTTCTACATAGATATTTTAGGAGAGTTGCTTATCAAAGGACCAAACGTAATGAAAGGTTATAGGAATAATCCAGAAGCTAACAAACTTGTTCTCACAAATGGCTGGTTTCGAACTGGAGACCGAGCGCAGTTCGAAGAAGATGGATCTCTTGTTATTGCGGATAGATATAAAGAACTAATAAAGGTAAATGCCTATCAAGTAGCACCAGCAGAGTTAGAAAGCGTAATAAAAGATCATCCTGGTGTGTTTGATGTGGCTGTAGTCGGTATACCTGACAGTAAAACGGGAGAGAAACCCAAAGCTTTTGTTGTTCTTAACAAAAATAGTCCAACAAATGAAGCGGATATTATAGAGTTTGCTAATAAGAAGGTCGCACCGTATAAACACATTAAGGAGGTCCAATTCATTGAGAGCATACCAAAAAATCCATCTGGGATTGGTTTTAGTAAAATGAAGCATGTCTGGTCGGCGAATAATATAGTATCATCACCCTTCGAAGATGTCATAATTCCCGATCTAACTATCCCTGAACATGTCTGGAGTAATATGCATAGATGGTCGGATAAGATAGCTATCGTTTGTGGAGTAACCGAGAGAAAATATACATATTTGCAATTATATCGTAAATCTCAAACATTAGGGGCGAATTTAAGGAGAAACTTTGGCATTAAAAATGGTGATCTCATAGCTGTTATGTTGTCAAATATACCTGAATATCCTATTATAACTTTGGGAGTATTAAGCGCAGGAGGAATTGTAACTACTCTGAATCCAATTTATACTTCATATGAAGTACAAAGGCAATTGATGAGTGCGCATGTTAAGATAATAATAACTTCACCAGAGAATGTATCCACTATAAAACAAGCATTGGATTTAAACAAAATGAGTACACCCATAATTGTAGTCGATTTCAATAGCCCACGTCCCGATGGAACTATATCGTTTAATGAAATTATAAACGATACTATTGACACAAGTATTTTAAAAGAAGTCAAAACTAAATCTAGTGATGTGTC
ATTACTACTGTATTCAAGTGGAACGACAGGACTACCGAAAGGTGTAGAATTAACGCATAGAAATATAGTTGCTAATTCTGTACAACAGGATCCAGCGGAACTCAGACATTACGATCTGACAACAGTTTTATTTTTGGGGTCAAACCAGGAGGTGAAATCGAAACATTTTGAATATTTAAAATACGTAGGCTCTGGTGCAGCGCCATCGCCAAAAGCAGATATCGAGAGGTTGTTAGCGAAATTCGGTCATGGAGTACGTTTCAGTCAACTATATGGCTTAACGGAAGTCTCCCCATTGGCTACAATATCCCCTGTTAACTGCAACAAATTTTTGACTGTGGGTTTTCCTTTGCCTAACATCCAATTCAGGATTGTTGACGACAATGACAACAATTTAGGGCCAGGACAGTTAGGAGAGTTGCTTATCAAAGGACCAAACGTAATGAAAGGTTATAGGAATAATCCAGAAGCTAACAAACTTGTTCTCACAAATGGCTGGTTTCGAACTGGAGACCGAGCGCAGTTCGAAGAAGATGGATCTCTTGTTATTGCGGATAGATATAAAGAACTAATAAAGGTAAATGCCTATCAAGTAGCACCAGCAGAGTTAGAAAGCGTAATAAAAGATCATCCTGGTGTGTTTGATGTGGCTGTAGTCGGTATACCTGATAGTAAAACGGGACAGAAGCCCAAAGCTTTTGTTGTTCCTAACAAAAATAGTCCAGCAAATGAAGCAGATATTATAGAGTTTGTTAATAAGAAGGTCGCACCGTATAAACATATTAAGGAGGTCCAATTCATTGAGAGCATACCAAAAAATCCTTCTGGTAAAATGTTAAGAAGGTTACTGCTTGAAAAATAA
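
The transcript record above is wrapped over more than one line, which FASTA allows, so the lines have to be joined before the sequence is used. A minimal sketch of doing that, assuming the record is available as plain FASTA text; the function name `parse_fasta` is hypothetical and not part of any tool associated with this gene set:

```python
def parse_fasta(text):
    """Return {record_id: sequence}, joining wrapped sequence lines.

    Hypothetical helper for illustration only.
    """
    records = {}
    current_id = None
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue  # blank lines between records carry no data
        if line.startswith(">"):
            # The record ID is the first whitespace-delimited token of the header.
            tokens = line[1:].split()
            current_id = tokens[0] if tokens else ""
            records[current_id] = []
        elif current_id is not None:
            records[current_id].append(line)
    return {rid: "".join(parts) for rid, parts in records.items()}


if __name__ == "__main__":
    # Toy input: only the first and last few bases of the transcript, wrapped.
    toy = ">DPOGS207227-TA\nATGTGTCCT\nGAAAAATAA\n"
    print(parse_fasta(toy))
```

Run on the full record, the joined `DPOGS207227-TA` sequence would be expected to match the 2862 bp transcript length listed for this gene.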

Protein sequence:

>DPOGS207227-PA
MCPVDFKNYNSVGFPLPNVQLRIVDENAKNLGPNKVCGVTERKYTYLQLYRKSQTLGANLRRNFGIKNGDLIAVMLSNIPEYPIITLGVLSAGGIVTTLNPIYTSYEVQRQLMSAHVKIIITSPENVSTIKQALDLNKMSTPIIVVDFNSPRPDGTISFNEIINDTIDTSILKEVKTKPRDVSFLLYSSGTTGLPKGVELTHRNKVANSVQQDSEEIKHYNLTTDSNQDTILLYLPMFHSYGMSVKMLHKLSVGLKLVTLPKFKPDTFISILEKHKINLTYLVPPTDPQVKRKHFQYLNYLGTGGAPSPRADIEKLLDKVGVSNVMDFYIDILGELLIKGPNVMKGYRNNPEANKLVLTNGWFRTGDRAQFEEDGSLVIADRYKELIKVNAYQVAPAELESVIKDHPGVFDVAVVGIPDSKTGEKPKAFVVLNKNSPTNEADIIEFANKKVAPYKHIKEVQFIESIPKNPSGIGFSKMKHVWSANNIVSSPFEDVIIPDLTIPEHVWSNMHRWSDKIAIVCGVTERKYTYLQLYRKSQTLGANLRRNFGIKNGDLIAVMLSNIPEYPIITLGVLSAGGIVTTLNPIYTSYEVQRQLMSAHVKIIITSPENVSTIKQALDLNKMSTPIIVVDFNSPRPDGTISFNEIINDTIDTSILKEVKTKSSDVSLLLYSSGTTGLPKGVELTHRNIVANSVQQDPAELRHYDLTTVLFLGSNQEVKSKHFEYLKYVGSGAAPSPKADIERLLAKFGHGVRFSQLYGLTEVSPLATISPVNCNKFLTVGFPLPNIQFRIVDDNDNNLGPGQLGELLIKGPNVMKGYRNNPEANKLVLTNGWFRTGDRAQFEEDGSLVIADRYKELIKVNAYQVAPAELESVIKDHPGVFDVAVVGIPDSKTGQKPKAFVVPNKNSPANEADIIEFVNKKVAPYKHIKEVQFIESIPKNPSGKMLRRLLLEK-
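
The listed lengths are mutually consistent: 2862 bp is 954 codons, that is 953 amino acids plus one stop codon, which the protein record writes as a trailing `-`. A minimal sketch of checking a transcript against its protein, assuming the standard genetic code and that `-` marks the stop; the names `CODON_TABLE` and `translate` are hypothetical:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (assumed to apply here).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds, stop_symbol="-"):
    """Translate a coding sequence codon by codon.

    Stop codons are written as stop_symbol; codons with ambiguous
    bases are written as 'X'.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        protein.append(stop_symbol if aa == "*" else aa)
    return "".join(protein)


if __name__ == "__main__":
    # First ten codons and last four codons of DPOGS207227-TA, copied from the record.
    print(translate("ATGTGTCCTGTTGATTTCAAAAATTATAAT"))
    print(translate("CTTGAAAAATAA"))
```

The two fragments translate to the first ten and last four characters of the protein record, `MCPVDFKNYN` and `LEK-`.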