Monarch geneset OGS2.0

DPOGS206946
TranscriptDPOGS206946-TA1668 bp
ProteinDPOGS206946-PA555 aa
Genomic positionDPSCF300001 - 390570-394598
RNAseq coverage358x (Rank: top 33%)
Annotation
HeliconiusHMEL0021236e-10741.37% 
BombyxBGIBMGA012941-TA2e-15250.00% 
DrosophilaCG4830-PA3e-5428.79% 
EBI UniRef50UniRef50_Q5TVT31e-6631.97%AGAP003482-PA n=4 Tax=Anopheles RepID=Q5TVT3_ANOGA
NCBI RefSeqXP_560023.32e-6832.08%AGAP003482-PA [Anopheles gambiae str. PEST]
NCBI nr blastpgi|3479699655e-6631.97%AGAP003482-PA [Anopheles gambiae str. PEST]
NCBI nr blastxgi|3800230134e-6630.33%PREDICTED: luciferin 4-monooxygenase-like [Apis florea]
Group
Gene OntologyGO:00081521e-77metabolic process
GO:00038241e-77catalytic activity
KEGG pathwayath:AT4G051608e-45 
 K01904 (E6.2.1.12)maps-> Phenylpropanoid biosynthesis
    Phenylalanine metabolism
    Ubiquinone and other terpenoid-quinone biosynthesis
InterPro domain[73-479] IPR0008731e-77AMP-dependent synthetase/ligase
Orthology groupMCL10359 Insect specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS206946-TA
ATGTCCACAACGGAGACGCGGAGGATGGCTCCAACAAGGATCAATGACGCTGTTCATTGGTACATGTCAAATCTTACTTCTCGGATCATAGCAAGGACTGGTATCCCGTCTGACAGATATCATATGGGAAAAGTTATACTGCAATGTTTAAAAGACTATCCCGAGGCAGTGACTCAAATCGATGGAGCGACTGGCGAATCGGAAACAAACGAAACCATTCTAGAAAGATCCGTGAAATGCGCCACAAGTTTTAGGAAATTTGGTCTGCAAAGTATGGATGTCATTGTACTGATGGCACCGAATCACATACATCTCTGTATACCATTCTACGCCGCACTGTACACAGGAAATGTTATAGCCGCTGTCGATTTTAATTTGGGGAAAATTGAACTACAGCAAACATTAGCTGTTTTGGAACCGAAAATCATATTCTGCCAAAGCTCGAAGGCACCAACGATACAATTAGCATTAAACGAAATTGATTCCAATGCATTTATTGTAGCTTTTGATAAAGGCCATTATCTGTGTGATTTCGACTCATTTATAGATAAATTCTATGACGGAACTACGATTGACCAATTTGAACCCACAGATTTTGATCCAGAGGAAGCAACGGCATTTCTGGTGTCTACGAGTGGCACCACCGGATTACCAAAAGCAGCAGAAGTGACCCACAAAAACTTTTTAATATCACTACCGAATCTTTTTTTACGTTATACAGAGTTTCCTACACCTACGAAAATGGCTCTGGTGGGCTCTCCTTTGCAGTGGCTGACGGCTTTGTTCAATTATGTTGCTTCGGCTATATTCAAATACACCCGTCTGCAATCTTCTTTGCCACTTACCAAGGAACACGCTTATTACTTGTTCCATACTTACAAGCCAACATTCAGTATTTTAAGTCCAACTCTGATAACGTCTCTTCTGAAAAATGAAAATAAATGCGACTTCTCCTCCTTTGAATTCATAATGTTGGGAGGAAGCGCGGTGCCAGCGTCTCTTATTGAAGAAATAAAGAATTTGTCACCAAATACAGAGGTGATTAATGTTTACGGTATGAGTGAGATTAGCAGTATTGCATTTATGGGAGATTACGGTCCACCCGATTCATGTGGTCGTCCGCTAGGGGTTTTCTACTATCGTTTAATTGATACGGAAACCCAAGAAGATATCTTAGAACCCAACAGGCCTGGCGAATTGTGGGTTAAAGGTCCCTCTGTGTTTAAGGGATATTATAAAAATAAGGAAGCCACTGAAGAAGCATTCGCAGAAGACGGCTGGTTTAAAACTGGAGACATGTTTTACAGAGATGAGAATTGGAATTATTATTTTTTGGAACGCATCAAATTGCTTTTAAAATATAAAAGTGATCAAATATCTCCAGTGGAAGTTGAGAACGTGATAAGACAAGTACCTGGTGTGGTTGATGTCGCTGTGGCTGGTCTGCCAGATCCAGAATGTGGAGATATACCTGTAGCCTGCGTTGTCATTCAAAATGGCGCTATTATTACAGCTGACGATATAAAAAACATTGTCCGAGACAAACTATCCGATTCAAAACAACTAAGAGGTGGAGTAATTTTTCTGGATAGTATACCAATGACAGCATCTACAAAAGTACATCGGAGAAAGCTGAAGGAAATAGTTATGAGTTCTAAAAGACTTTGA

Protein sequence:

>DPOGS206946-PA
MSTTETRRMAPTRINDAVHWYMSNLTSRIIARTGIPSDRYHMGKVILQCLKDYPEAVTQIDGATGESETNETILERSVKCATSFRKFGLQSMDVIVLMAPNHIHLCIPFYAALYTGNVIAAVDFNLGKIELQQTLAVLEPKIIFCQSSKAPTIQLALNEIDSNAFIVAFDKGHYLCDFDSFIDKFYDGTTIDQFEPTDFDPEEATAFLVSTSGTTGLPKAAEVTHKNFLISLPNLFLRYTEFPTPTKMALVGSPLQWLTALFNYVASAIFKYTRLQSSLPLTKEHAYYLFHTYKPTFSILSPTLITSLLKNENKCDFSSFEFIMLGGSAVPASLIEEIKNLSPNTEVINVYGMSEISSIAFMGDYGPPDSCGRPLGVFYYRLIDTETQEDILEPNRPGELWVKGPSVFKGYYKNKEATEEAFAEDGWFKTGDMFYRDENWNYYFLERIKLLLKYKSDQISPVEVENVIRQVPGVVDVAVAGLPDPECGDIPVACVVIQNGAIITADDIKNIVRDKLSDSKQLRGGVIFLDSIPMTASTKVHRRKLKEIVMSSKRL-