Monarch geneset OGS2.0

DPOGS211494
Transcript: DPOGS211494-TA | 1422 bp
Protein: DPOGS211494-PA | 473 aa
Genomic position: DPSCF300354 - 294167-296738
RNAseq coverage: 198x (Rank: top 47%)
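The lengths in the summary above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes that the genomic coordinates are 1-based and inclusive and that the transcript length counts the stop codon. Neither convention is stated in the record.

```python
def span_length(start: int, end: int) -> int:
    """Length of a genomic interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


def cds_length_from_protein(aa_count: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode aa_count residues (plus a stop codon if requested)."""
    return 3 * (aa_count + (1 if include_stop else 0))


genomic_span = span_length(294167, 296738)   # interval listed under "Genomic position"
cds_length = cds_length_from_protein(473)    # protein length listed under "Protein"

print(genomic_span)  # 2572
print(cds_length)    # 1422, equal to the listed transcript length
```

Under these assumptions the 473 aa protein plus one stop codon accounts for exactly the 1422 bp transcript, while the genomic interval is longer than the transcript.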
Annotation
Heliconius: HMEL013217 | 1e-120 | 81.15%
Bombyx: BGIBMGA003842-TA | 4e-115 | 73.95%
Drosophila: no hit listed
EBI UniRef50: UniRef50_B6QJD0 | 2e-65 | 53.10% | Aromatic ring-opening dioxygenase LigB subunit, putative n=4 Tax=Opisthokonta RepID=B6QJD0_PENMQ
NCBI RefSeq: XP_001618021.1 | 2e-26 | 34.62% | hypothetical protein NEMVEDRAFT_v1g155980 [Nematostella vectensis]
NCBI nr blastp: gi|242810643 | 2e-67 | 52.04% | conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
NCBI nr blastx: gi|242810643 | 5e-68 | 52.03% | conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Group
Gene Ontology: GO:0006725 | 6.9e-83 | cellular aromatic compound metabolic process
Gene Ontology: GO:0016491 | 6.9e-83 | oxidoreductase activity
Gene Ontology: GO:0008198 | 6.9e-83 | ferrous iron binding
KEGG pathway: bbt:BBta_6127 | 5e-34
  K05915 (E1.13.-.-) maps-> Bisphenol A degradation
  K05915 (E1.13.-.-) maps-> 1- and 2-Methylnaphthalene degradation
InterPro domain: [3-257] IPR004183 | 6.9e-83 | Extradiol ring-cleavage dioxygenase, class III enzyme, subunit B
Orthology group: MCL24945 | Lepidoptera specific
Genotypes for resequenced monarchs and outgroup Danaus species

Nucleotide sequence:

>DPOGS211494-TA
ATGGTGATGGTAGCTCCAGCGCTTTTCGTTAACCATGGCGGAGGTCCGATGCCTTTGCTTGGGGAAAAAGATCATTTAGGACTCACCAAGTTCCTAAGAGACGAAGTGAAGAAACATGTAAACCTTAAGGAAATTAAAGCGATAGTGCTTGTGACCGCACATTGGGAAGAAAGTGAGGTAACGATATCGTCGGGAGATCGCCACGAACTTTATTTCGACTACTACGGCTTTCCTCCAGAGACTTACAAGTACAAGTATGACGCCCCCGGTGATCCTGAACTCGCGAAACGAATTCAGACCGCACTGAAAAAGGCTGGCATACATTCTAAGTTGGATCCCAAAAGAGGTTGGGACCATGGAGTTTTTGTTCCAATGCTTTTGATAAATCCAGCGGCTGATATACCAATAATACAAATATCCGTGCTCTCGAACCAGGATCCCGAAGAGCATTACAACATTGGCCAAGTATTGAAACAATTCCGCAAAGAAGGTATCGCTATATTTGGTTCGGGAATGTCTTATCACAATATGAGAGAGTTCTTCTATGGACGTAACGCTGGCAGGGTTGTTAACGAGGAGTTTGATGAGTTTTTGAATGATGCTTGTACATCAGGGAACAGCGTGAGAAAGGAGAAGTTGTTGTTGTGGGACCAGCAACCGGGAGCGAGGGAGGCTCATCCGACACGAGCCGCAGAACATTTGATGCCACTGATAGTTATCGCTGGTGCTGGAGGGGATGGACCCGGTGAAAGGATCTTCAACTGGGACATGAGCGGAACTGAGGTAACGATATCGTCGGGAGATCGCCACGAACTTTATTTCGACTACTACGGCTTTCCTCCAGAGACTTACAAGTACAAGTATGACGCCCCCGGTGATCCTGAACTCGCGAAACGAATTCAGACCGCACTGAAAAAGGCTGGCATACATTCTAAGTTGGATCCCAAAAGAGGTTGGGACCATGGAGTTTTTGTTCCAATGCTTTTGATAAATCCAGCGGCTGATATACCAATAATACAAATATCCGTGCTCTCGAACCAGGATCCCGAAGAGCATTACAACATTGGCCAAGTATTGAAACAATTCCGCAAAGAAGGTATCGCTATATTTGGCTCGGGAATGTCTTATCACAATATGAGAGAGTTCTTCTATGGACGTAACGCTGGCAGGGTTGTTAACGAGGAGTTTGATGAGTTTTTGAATGATGCTTGTACATCAGGGAACAGCGTGAGAAAGGAGAAGTTGTTGTTGTGGGACCAGCAACCGGGAGCGAGGGAGGCTCATCCGACACGAGCCGCAGAACATTTGATGCCACTGATAGTTATCGCTGGTGCTGGAGGGGATGGACCCGGTGAAAGGATCTTCAACTGGGACATGAGCGGAACGTTTAGACTAAGTGGATTCATATGGAAAAATGACTGA

Protein sequence:

>DPOGS211494-PA
MVMVAPALFVNHGGGPMPLLGEKDHLGLTKFLRDEVKKHVNLKEIKAIVLVTAHWEESEVTISSGDRHELYFDYYGFPPETYKYKYDAPGDPELAKRIQTALKKAGIHSKLDPKRGWDHGVFVPMLLINPAADIPIIQISVLSNQDPEEHYNIGQVLKQFRKEGIAIFGSGMSYHNMREFFYGRNAGRVVNEEFDEFLNDACTSGNSVRKEKLLLWDQQPGAREAHPTRAAEHLMPLIVIAGAGGDGPGERIFNWDMSGTEVTISSGDRHELYFDYYGFPPETYKYKYDAPGDPELAKRIQTALKKAGIHSKLDPKRGWDHGVFVPMLLINPAADIPIIQISVLSNQDPEEHYNIGQVLKQFRKEGIAIFGSGMSYHNMREFFYGRNAGRVVNEEFDEFLNDACTSGNSVRKEKLLLWDQQPGAREAHPTRAAEHLMPLIVIAGAGGDGPGERIFNWDMSGTFRLSGFIWKND-
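
The nucleotide and protein records above can be checked against each other by translating the coding sequence. The following is a minimal sketch, not the pipeline used to build the geneset: it assumes the standard genetic code, assumes the sequence is already in frame starting at the first base, and uses a hypothetical `translate` helper. Stops are written as "-" to match the trailing "-" in the protein record.

```python
from itertools import product

# Standard genetic code (an assumption), listed in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, stop_symbol: str = "-") -> str:
    """Translate an in-frame coding sequence, writing stop codons as stop_symbol."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        residues.append(stop_symbol if aa == "*" else aa)
    return "".join(residues)


# First 45 and last 15 nucleotides of DPOGS211494-TA, copied from the record.
cds_start = "ATGGTGATGGTAGCTCCAGCGCTTTTCGTTAACCATGGCGGAGGT"
cds_end = "TGGAAAAATGACTGA"

print(translate(cds_start))  # MVMVAPALFVNHGGG
print(translate(cds_end))    # WKND-
```

Both fragments agree with the corresponding ends of DPOGS211494-PA. Passing the full nucleotide sequence to the same function gives a whole-record comparison.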