Monarch geneset OGS2.0

DPOGS215485
Transcript: DPOGS215485-TA; 1317 bp
Protein: DPOGS215485-PA; 438 aa
Genomic position: DPSCF300098 + 439468-443370
RNAseq coverage: 364x (Rank: top 33%)

Annotation
Heliconius: HMEL003429; 6e-132; 71.74%
Bombyx: BGIBMGA007324-TA; 0.0; 82.88%
Drosophila: hgo-PA; 0.0; 74.42%
EBI UniRef50: UniRef50_Q9VKJ0; 0.0; 74.42%; Homogentisate 1,2-dioxygenase n=176 Tax=root RepID=HGD_DROME
NCBI RefSeq: XP_001663831.1; 0.0; 78.47%; homogentisate 1,2-dioxygenase [Aedes aegypti]
NCBI nr blastp: gi|157136759; 0.0; 78.47%; homogentisate 1,2-dioxygenase [Aedes aegypti]
NCBI nr blastx: gi|157136759; 0.0; 78.47%; homogentisate 1,2-dioxygenase [Aedes aegypti]

Group:
Gene Ontology:
    GO:0004411; 0; homogentisate 1,2-dioxygenase activity
    GO:0055114; 0; oxidation-reduction process
    GO:0006570; 0; tyrosine metabolic process
    GO:0006559; 0; L-phenylalanine catabolic process
KEGG pathway: aag:AaeL_AAEL013637; 0.0
    K00451 (E1.13.11.5, hmgA) maps-> Styrene degradation
                                     Tyrosine metabolism
InterPro domain:
    [1-434] IPR005708; 0; Homogentisate 1,2-dioxygenase
    [2-438] IPR011051; 2.3e-177; Cupin, RmlC-type
Orthology group: MCL11665; Multiple-copy universal gene

Nucleotide sequence:

>DPOGS215485-TA
ATGGCAAATTTAAAGTATCTTTCTGGCTTCGGGTCAGAATTCTCAAGCGAGGATCCTCGTCGTCCCGGAGCGTTACCCGAGGGTCAGAACAGTCCTCAACGCTGTGCTTACGGCTTATACGCTGAACAGCTTTCAGGCAGTGCCTTTACGGCTCCACGAACAGAAAACAGGCGCTCTTGGCTCTACAGGATCCGGCCATCTGTGATCCACAAACCATTCGTTAAATCGAATATTTCGGAGCACCTGACACACAAATGGGATGACCAAGAACCAAATCCAAATCAATCGCGTTGGCTCCCCTTCGACATACCTACTCAAGGTTCGGTAGACTTCGCGTCGGGTCTGCACACAGTCTGCGGAGCTGGTGATCCTCGTTCCCGACATGGCATCGCCATACACATCTATCTCTGCAACGCGTCTATGGAGAACAGCGCATTTTATAACAGTGATGGGGACTTCCTCATAGTTCCGCAACAAGGAACTTTAAACATAACAACTGAATTTGGTAAAATGGAGATCCGACCAAATGAAATTGCTGTGATACAACAAGGGATGAGATTCGCTGTCGCTGTAGACGGGCCCACAAGAGGTTATATTTTGGAAGTGTTTGATGGGCATTTCAAACTACCCGACTTAGGGCCGATAGGTGCCAATGGTTTAGCAAACCCTCGCGACTTCCTTACACCAGTCGCATACTACGAAGATAAAGAAGTACCCGATTTCAAGATAATTAATAAATACCAGGGAGCTCTGTTCGAGGCTGTTCAAGGTCATTCTCCTTTCGATGTGGTAGCCTGGCACGGCAACTACGTCCCTTACAAATACGACCTCAGCAAGTTTATGGTCATCAATTCTGTTTCCTTCGATCATTGTGATCCATCTATATTTACTGTACTAACCTGTCCCTCAACAAAGCCCGGTGTTGCCATAGCAGATTTTGTGATATTTCCTCCTCGATGGTCGGTGCAAGAAAATACATTTAGACCTCCTTACTATCATAGAAATTGTATGAGCGAATTTATGGGTCTTATCCTGGGTTCGTATGAAGCGAAAGAAGGTGGTTTTCTACCAGGGGGAGCTTCTCTCCATTCAATGATGACTCCACACGGTCCTGATGCACAATGTTTTGAAGGAGCTTCCAAGGAAAAGCTGGTACCGCAGAAAATAGCCGTGGGGACTCAGGCTTTTATGTTCGAGTCATCTCTCAGTATGGCGATAACGAAGTGGGGCTTCGAGACGTGTAAAAAACTCGACGGCAATTACTATCAGTGCTGGCATAATTTACCTAAACTTTTCTCAGAAAAAATAGATATTTGA

Protein sequence:

>DPOGS215485-PA
MANLKYLSGFGSEFSSEDPRRPGALPEGQNSPQRCAYGLYAEQLSGSAFTAPRTENRRSWLYRIRPSVIHKPFVKSNISEHLTHKWDDQEPNPNQSRWLPFDIPTQGSVDFASGLHTVCGAGDPRSRHGIAIHIYLCNASMENSAFYNSDGDFLIVPQQGTLNITTEFGKMEIRPNEIAVIQQGMRFAVAVDGPTRGYILEVFDGHFKLPDLGPIGANGLANPRDFLTPVAYYEDKEVPDFKIINKYQGALFEAVQGHSPFDVVAWHGNYVPYKYDLSKFMVINSVSFDHCDPSIFTVLTCPSTKPGVAIADFVIFPPRWSVQENTFRPPYYHRNCMSEFMGLILGSYEAKEGGFLPGGASLHSMMTPHGPDAQCFEGASKEKLVPQKIAVGTQAFMFESSLSMAITKWGFETCKKLDGNYYQCWHNLPKLFSEKIDI-
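
The transcript and protein lengths in this record (1317 bp and 438 aa) can be cross-checked against the two sequences. The sketch below is not part of the OGS2.0 pipeline; it assumes that DPOGS215485-TA is a pure coding sequence (start codon through stop codon, no UTRs) and that the standard genetic code applies. The function names `translate` and `expected_protein_length` are illustrative. Stop codons are printed as `*` here, whereas the record above writes the terminal stop as `-`.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a coding sequence codon by codon; stop codons become '*'."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length_bp):
    """Residue count for a CDS that includes exactly one terminal stop codon."""
    codons, remainder = divmod(cds_length_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon does not encode a residue


# Assumption: 1317 bp is the full CDS length, stop codon included.
protein_length = expected_protein_length(1317)

# First ten codons and last four codons of DPOGS215485-TA, copied from the record.
n_terminus = translate("ATGGCAAATTTAAAGTATCTTTCTGGCTTC")
c_terminus = translate("ATAGATATTTGA")

print(protein_length, n_terminus, c_terminus)
```

Under these assumptions, 1317 bp corresponds to 439 codons, of which 438 encode residues, and the translated ends agree with the first and last residues of DPOGS215485-PA.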