DPGLEAN02390 in OGS1.0

New model in OGS2.0DPOGS215485 
Genomic Positionscaffold2697:+ 4677-8579
See gene structure
CDS Length1317
Paired RNAseq reads  525
Single RNAseq reads  1470
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA007324 (0.0)
Best Drosophila hit  homogentisate 1,2-dioxygenase (0.0)
Best Human hithomogentisate 1,2-dioxygenase (9e-175)
Best NR hit (blastp)  homogentisate 1,2-dioxygenase [Aedes aegypti] (0.0)
Best NR hit (blastx)  homogentisate 1,2-dioxygenase [Aedes aegypti] (0.0)
GeneOntology terms



  
GO:0004411 homogentisate 1,2-dioxygenase activity
GO:0006572 tyrosine catabolic process
GO:0006559 L-phenylalanine catabolic process
GO:0006570 tyrosine metabolic process
GO:0055114 oxidation reduction
InterPro families
  
IPR005708 Homogentisate 1,2-dioxygenase
IPR011051 Cupin, RmlC-type
Orthology groupMCL13529

Nucleotide sequence:

ATGGCAAATTTAAAGTATCTTTCTGGCTTCGGGTCAGAATTCTCAAGCGAGGATCCTCGT
CGTCCCGGAGCGTTACCCGAGGGTCAGAACAGTCCTCAACGCTGTGCTTACGGCTTATAC
GCTGAACAGCTTTCAGGCAGTGCCTTTACGGCTCCACGAACAGAAAACAGGCGCTCTTGG
CTCTACAGGATCCGGCCATCTGTGATCCACAAACCATTCGTTAAATCGAATATTTCGGAG
CACCTGACACACAAATGGGATGACCAAGAACCAAATCCAAATCAATCGCGTTGGCTCCCC
TTCGACATACCTACTCAAGGTTCGGTAGACTTCGCGTCGGGTCTGCACACAGTCTGCGGA
GCTGGTGATCCTCGTTCCCGACATGGCATCGCCATACACATCTATCTCTGCAACGCGTCT
ATGGAGAACAGCGCATTTTATAACAGTGATGGGGACTTCCTCATAGTTCCGCAACAAGGA
ACTTTAAACATAACAACTGAATTTGGTAAAATGGAGATCCGACCAAATGAAATTGCTGTG
ATACAACAAGGGATGAGATTCGCTGTCGCTGTAGACGGGCCCACAAGAGGTTATATTTTG
GAAGTGTTTGATGGGCATTTCAAACTACCCGACTTAGGGCCGATAGGTGCCAATGGTTTA
GCAAACCCTCGCGACTTCCTTACACCAGTCGCATACTACGAAGATAAAGAAGTACCCGAT
TTCAAGATAATTAATAAATACCAGGGAGCTCTGTTCGAGGCTGTTCAAGGTCATTCTCCT
TTCGATGTGGTAGCCTGGCACGGCAACTACGTCCCTTACAAATACGACCTCAGCAAGTTT
ATGGTCATCAATTCTGTTTCCTTCGATCATTGTGATCCATCTATATTTACTGTACTAACC
TGTCCCTCAACAAAGCCCGGTGTTGCCATAGCAGATTTTGTGATATTTCCTCCTCGATGG
TCGGTGCAAGAAAATACATTTAGACCTCCTTACTATCATAGAAATTGTATGAGCGAATTT
ATGGGTCTTATCCTGGGTTCGTATGAAGCGAAAGAAGGTGGTTTTCTACCAGGGGGAGCT
TCTCTCCATTCAATGATGACTCCACACGGTCCTGATGCACAATGTTTTGAAGGAGCTTCC
AAGGAAAAGCTGGTACCGCAGAAAATAGCCGTGGGGACTCAGGCTTTTATGTTCGAGTCA
TCTCTCAGTATGGCGATAACGAAGTGGGGCTTCGAGACGTGTAAAAAACTCGACGGCAAT
TACTATCAGTGCTGGCATAATTTACCTAAACTTTTCTCAGAAAAAATAGATATTTGA

Protein sequence:

MANLKYLSGFGSEFSSEDPRRPGALPEGQNSPQRCAYGLYAEQLSGSAFTAPRTENRRSW
LYRIRPSVIHKPFVKSNISEHLTHKWDDQEPNPNQSRWLPFDIPTQGSVDFASGLHTVCG
AGDPRSRHGIAIHIYLCNASMENSAFYNSDGDFLIVPQQGTLNITTEFGKMEIRPNEIAV
IQQGMRFAVAVDGPTRGYILEVFDGHFKLPDLGPIGANGLANPRDFLTPVAYYEDKEVPD
FKIINKYQGALFEAVQGHSPFDVVAWHGNYVPYKYDLSKFMVINSVSFDHCDPSIFTVLT
CPSTKPGVAIADFVIFPPRWSVQENTFRPPYYHRNCMSEFMGLILGSYEAKEGGFLPGGA
SLHSMMTPHGPDAQCFEGASKEKLVPQKIAVGTQAFMFESSLSMAITKWGFETCKKLDGN
YYQCWHNLPKLFSEKIDI