Monarch geneset OGS2.0

DPOGS206979
TranscriptDPOGS206979-TA1575 bp
ProteinDPOGS206979-PA524 aa
Genomic positionDPSCF300001 + 381752-389235
RNAseq coverage850x (Rank: top 15%)
Annotation
HeliconiusHMEL0021190.060.56% 
BombyxBGIBMGA012941-TA2e-7432.82% 
DrosophilaCG4830-PA4e-5328.73% 
EBI UniRef50UniRef50_D6WF946e-5929.69%Putative uncharacterized protein n=2 Tax=Tribolium castaneum RepID=D6WF94_TRICA
NCBI RefSeqXP_001809806.12e-5929.69%PREDICTED: similar to Luciferase [Tribolium castaneum]
NCBI nr blastpgi|49598859e-6030.97%luciferase [Phrixothrix vivianii]
NCBI nr blastxgi|2545764749e-5830.68%luciferase [Phrixothrix hirtus]
Group
Gene OntologyGO:00081524.1e-71metabolic process
GO:00038244.1e-71catalytic activity
KEGG pathwayath:AT4G051608e-40 
 K01904 (E6.2.1.12)maps-> Phenylpropanoid biosynthesis
    Phenylalanine metabolism
    Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain[41-447] IPR0008734.1e-71AMP-dependent synthetase/ligase
Orthology groupMCL26327 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206979-TA
ATGAATAACTCAGAGATGCATTGCTCGCCCGATTACCATCTTGGACATGTCATTTTGGAGCATTCCAAACGACATGCCGACACTGTTTGTCAGATCGATGCAGCTACGGGCGACGAAGAGACATATTCTTCTGTTGTCTCCCGGTCTATTCGCTTGGCAAGAGCATTAAGAAACTATGGACTTAAGCCGGGAGATGTTGTTGCTGTCGGAGGAAGGAACCATTTAGATCTCCACATACCAGTTTATGCTGCTCTCTATGATGGCTTACCCAGTGTTGGTGTTGATCCGTACTTCAAATATGACGAAGTCCGTACTCTATTTAACTTGACCAAGCCTAAAATAGCATTTTGTCAGAATGAACATGTAGAAGTGTATGATAAGGCAGCACGGGATCTGGGCTTGGAACTCAAGATAGTGACCTTTGATCATGGGAACTGCACAATGTCGGAGTTCGTGAACAAATATGACACAGATGAGCCCTTGGATGAGTTTAAGGTTGCCAAAATCGATGTGGACAAAGTGAATGCATTCCTCGTGAGCACTAGCGGTACAACTGGGAAAGTTAAAGTAGCCGCCTTCAATCACCAGCCGTTTATGTTGAAGTGGCTCAAAGTACTTCAAATGTCGCGAATGGTTAAGGGTCACAAAAGGACGTTGTTGATATCGCCAATACACTGGATATCAACTTGCTTCACAATATTCTCAACTCCTCTGACAGGTGACACAAAGATACAAACGTCAAAACCGGATGATTTTGATCACATAGTGTACATCATCAACAAATATAAGCCGAGAAACGTTCTGATGAGTCCGACCCTAATGTCCTACTTGATGACCAGGAAGGACGTAGACTTGGAGTGCTTCAGGTCAGTCACGGTCACCGGCTCGAGGATATATCCGGATGTTTTCGAAAAATTCAAGACGTTATTATCGAGAGAAGCGGTCGCGTCCATAGCGTACGGACAAACAGAGATGCTGGGCCCCATTTTGCTACCGAACCCAGCAGGTCCCAGCGGTAACTGTGGTCAACCTCTGCCCTTCTATGACGTTAAGCTTATAGATCAGGAAACTCGAGCTGAAATAAAAGAACCTCATGTGACAGGAGAAATGTGGGTGAAGGGACCCTGCTTTACGGAATACTATCAGGACCCAGAGGAAACAGCGACGGCTTTCACAGCAGACGGTTATTTTAAAACCGGTGACCTTTTGTACAGAGACGAAAAAAATAATTACTTCTACGTAGAAAGAATAAAAGCCCTCATTAAATATAGGAATTCACATGTTATACCGATAGAGCTGGAGGATATTATACGCAAACATCCGAGTGTGAAGGATGTATGCATCATCGGCGTCAGCGACCCTCTGGACGGTGAACGACCGGTGGCCTGCGTCATTAAACGACAGGGTATGGAGATCACAGCTCAGGAGGTCAAGGATATGGTAGCTAGTAAATTATCTAAAAACAAAGAACTACGAGGCGGTGTCGTGTTCCTGAACGCATTCCCGCAGACATCGTCCGGTAAACTTGCTAGGGCGAAACTGTTGCAAGTTGTCATGAATTCTAAAAGGGAATAA

Protein sequence:

>DPOGS206979-PA
MNNSEMHCSPDYHLGHVILEHSKRHADTVCQIDAATGDEETYSSVVSRSIRLARALRNYGLKPGDVVAVGGRNHLDLHIPVYAALYDGLPSVGVDPYFKYDEVRTLFNLTKPKIAFCQNEHVEVYDKAARDLGLELKIVTFDHGNCTMSEFVNKYDTDEPLDEFKVAKIDVDKVNAFLVSTSGTTGKVKVAAFNHQPFMLKWLKVLQMSRMVKGHKRTLLISPIHWISTCFTIFSTPLTGDTKIQTSKPDDFDHIVYIINKYKPRNVLMSPTLMSYLMTRKDVDLECFRSVTVTGSRIYPDVFEKFKTLLSREAVASIAYGQTEMLGPILLPNPAGPSGNCGQPLPFYDVKLIDQETRAEIKEPHVTGEMWVKGPCFTEYYQDPEETATAFTADGYFKTGDLLYRDEKNNYFYVERIKALIKYRNSHVIPIELEDIIRKHPSVKDVCIIGVSDPLDGERPVACVIKRQGMEITAQEVKDMVASKLSKNKELRGGVVFLNAFPQTSSGKLARAKLLQVVMNSKRE-