New model in OGS2.0 | DPOGS215485  |
---|---|
Genomic Position | scaffold2697:+ 4677-8579 |
CDS Length | 1317 |
Paired RNAseq reads   | 525 |
Single RNAseq reads   | 1470 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA007324 (0.0) |
Best Drosophila hit   | homogentisate 1,2-dioxygenase (0.0) |
Best Human hit | homogentisate 1,2-dioxygenase (9e-175) |
Best NR hit (blastp)   | homogentisate 1,2-dioxygenase [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | homogentisate 1,2-dioxygenase [Aedes aegypti] (0.0) |
GeneOntology terms    | GO:0004411 homogentisate 1,2-dioxygenase activity; GO:0006572 tyrosine catabolic process; GO:0006559 L-phenylalanine catabolic process; GO:0006570 tyrosine metabolic process; GO:0055114 oxidation reduction |
InterPro families    | IPR005708 Homogentisate 1,2-dioxygenase; IPR011051 Cupin, RmlC-type |
Orthology group | MCL13529 |
Nucleotide sequence:
ATGGCAAATTTAAAGTATCTTTCTGGCTTCGGGTCAGAATTCTCAAGCGAGGATCCTCGT
CGTCCCGGAGCGTTACCCGAGGGTCAGAACAGTCCTCAACGCTGTGCTTACGGCTTATAC
GCTGAACAGCTTTCAGGCAGTGCCTTTACGGCTCCACGAACAGAAAACAGGCGCTCTTGG
CTCTACAGGATCCGGCCATCTGTGATCCACAAACCATTCGTTAAATCGAATATTTCGGAG
CACCTGACACACAAATGGGATGACCAAGAACCAAATCCAAATCAATCGCGTTGGCTCCCC
TTCGACATACCTACTCAAGGTTCGGTAGACTTCGCGTCGGGTCTGCACACAGTCTGCGGA
GCTGGTGATCCTCGTTCCCGACATGGCATCGCCATACACATCTATCTCTGCAACGCGTCT
ATGGAGAACAGCGCATTTTATAACAGTGATGGGGACTTCCTCATAGTTCCGCAACAAGGA
ACTTTAAACATAACAACTGAATTTGGTAAAATGGAGATCCGACCAAATGAAATTGCTGTG
ATACAACAAGGGATGAGATTCGCTGTCGCTGTAGACGGGCCCACAAGAGGTTATATTTTG
GAAGTGTTTGATGGGCATTTCAAACTACCCGACTTAGGGCCGATAGGTGCCAATGGTTTA
GCAAACCCTCGCGACTTCCTTACACCAGTCGCATACTACGAAGATAAAGAAGTACCCGAT
TTCAAGATAATTAATAAATACCAGGGAGCTCTGTTCGAGGCTGTTCAAGGTCATTCTCCT
TTCGATGTGGTAGCCTGGCACGGCAACTACGTCCCTTACAAATACGACCTCAGCAAGTTT
ATGGTCATCAATTCTGTTTCCTTCGATCATTGTGATCCATCTATATTTACTGTACTAACC
TGTCCCTCAACAAAGCCCGGTGTTGCCATAGCAGATTTTGTGATATTTCCTCCTCGATGG
TCGGTGCAAGAAAATACATTTAGACCTCCTTACTATCATAGAAATTGTATGAGCGAATTT
ATGGGTCTTATCCTGGGTTCGTATGAAGCGAAAGAAGGTGGTTTTCTACCAGGGGGAGCT
TCTCTCCATTCAATGATGACTCCACACGGTCCTGATGCACAATGTTTTGAAGGAGCTTCC
AAGGAAAAGCTGGTACCGCAGAAAATAGCCGTGGGGACTCAGGCTTTTATGTTCGAGTCA
TCTCTCAGTATGGCGATAACGAAGTGGGGCTTCGAGACGTGTAAAAAACTCGACGGCAAT
TACTATCAGTGCTGGCATAATTTACCTAAACTTTTCTCAGAAAAAATAGATATTTGA
Protein sequence:
MANLKYLSGFGSEFSSEDPRRPGALPEGQNSPQRCAYGLYAEQLSGSAFTAPRTENRRSW
LYRIRPSVIHKPFVKSNISEHLTHKWDDQEPNPNQSRWLPFDIPTQGSVDFASGLHTVCG
AGDPRSRHGIAIHIYLCNASMENSAFYNSDGDFLIVPQQGTLNITTEFGKMEIRPNEIAV
IQQGMRFAVAVDGPTRGYILEVFDGHFKLPDLGPIGANGLANPRDFLTPVAYYEDKEVPD
FKIINKYQGALFEAVQGHSPFDVVAWHGNYVPYKYDLSKFMVINSVSFDHCDPSIFTVLT
CPSTKPGVAIADFVIFPPRWSVQENTFRPPYYHRNCMSEFMGLILGSYEAKEGGFLPGGA
SLHSMMTPHGPDAQCFEGASKEKLVPQKIAVGTQAFMFESSLSMAITKWGFETCKKLDGN
YYQCWHNLPKLFSEKIDI
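
The record can be checked for internal consistency: a CDS length of 1317 nt corresponds to 439 codons, that is 438 residues plus a stop codon, which matches the protein listed above. The following is a minimal sketch of that check, assuming the standard genetic code (NCBI translation table 1) applies to this model. The function name `translate` is illustrative, and only the first and last lines of the nucleotide sequence are used to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this nuclear gene model uses the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
# Each full line holds 60 nt (20 codons), so every line starts in frame.
first_line = "ATGGCAAATTTAAAGTATCTTTCTGGCTTCGGGTCAGAATTCTCAAGCGAGGATCCTCGT"
last_line = "TACTATCAGTGCTGGCATAATTTACCTAAACTTTTCTCAGAAAAAATAGATATTTGA"

print(translate(first_line))  # MANLKYLSGFGSEFSSEDPR
print(translate(last_line))   # YYQCWHNLPKLFSEKIDI*

# Expected protein length from the reported CDS length (stop codon excluded).
cds_length = 1317
print(cds_length // 3 - 1)    # 438
```

Both translated fragments agree with the start and end of the protein sequence in this record, and the trailing `*` corresponds to the terminal TGA stop codon.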