DPGLEAN01474 in OGS1.0

New model in OGS2.0  DPOGS205609
Genomic Position  scaffold513:+ 1830-4244
CDS Length  1320
Paired RNAseq reads  52
Single RNAseq reads  435
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA003943 (7e-48)
Best Drosophila hit  Cyp6g2 (2e-55)
Best Human hit  cytochrome P450 3A5 isoform 1 (1e-43)
Best NR hit (blastp)  cytochrome P450 332A5 [Manduca sexta] (9e-81)
Best NR hit (blastx)  cytochrome P450 332A4 [Manduca sexta] (3e-76)
GeneOntology terms
GO:0005792 microsome
GO:0009055 electron carrier activity
GO:0016020 membrane
GO:0020037 heme binding
GO:0004497 monooxygenase activity
InterPro families
IPR001128 Cytochrome P450
IPR017972 Cytochrome P450, conserved site
IPR002402 Cytochrome P450, E-class, group II
Orthology group  MCL10250
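
The genomic position and CDS length listed above can be compared with a few lines of Python. This is only a sketch: the function name `parse_position` is hypothetical, the `scaffold:strand start-end` layout is inferred from this one record, and the coordinates are assumed to be 1-based and inclusive, which the record does not state.

```python
import re

# Field values copied from the record above.
POSITION = "scaffold513:+ 1830-4244"   # "Genomic Position"
CDS_LENGTH = 1320                      # "CDS Length"


def parse_position(text):
    """Split 'scaffold:strand start-end' into its parts (assumed layout)."""
    match = re.fullmatch(r"(\S+):([+-])\s*(\d+)-(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised position string: {text!r}")
    scaffold, strand, start, end = match.groups()
    return scaffold, strand, int(start), int(end)


scaffold, strand, start, end = parse_position(POSITION)

# Assumption: coordinates are 1-based and inclusive.
span = end - start + 1

print(scaffold, strand, span, span - CDS_LENGTH)
```

Under those assumptions the gene spans 2415 bp on the plus strand of scaffold513, 1095 bp more than the 1320 bp CDS. A difference like this would be consistent with intronic sequence, although the record itself does not say so.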

Nucleotide sequence:

ATGTATGAACATTTCAAATCTCCGTACATAGGGATCTGGTTAATATGGAAACCAGCTTTA
ATTATAAACGACCCAGAAATCGCTCGGCGAATATTAGTTAAAGATAGTTTGATTTTTAGA
GACAGGTATTTGAGTTCTGGAAGCAGCGACCCTATCGGAGCACTTAATTTGTTTACTGTT
AATGATCCTGTGTGGACCAGCATTCGTCGTAAATTATCTAATGTATTCACTGTAGCTAAG
CTCAAGGCCCTCCACCATTATACTTTGAGTAAAGTTGAAGAGCTGATGAGAAGAATCGAA
AGAGATCGTGAAAAAGGTTTAGAACTTAAGAGACTTTTCGTTGATTACACAACAGATGTT
ACTGGAACATTTTCTTTCGGTATTGAAAGTAATGCAACTCTTACATCTAAGGGCCCTTTG
AGGGAAATCACCGCTGACTTTGGAAAATTCAGTATATATAGAGGAATATGTTGGTTCAGT
ATATTCTTTTGGCCAGACCTAGTTGACATATTTAGATTTACAATGTTCCCAAAGAAATCG
ATGCATAGCTTTAAAAGAATATTTGAAACCACTTTAAATCGGCATAGCAACGACATCGGA
GGCAAAGATTTCAAAGATATAGTCGATGGTCTTATAGAGTTTAAAAAAGAAAAAGAACAG
AAGCATCAAGAAGTGTCCGACGAATTTTTGATTGCACAAGCAGCAATCTTGTTATTTGGT
GGTTTTGATACAACTGCAAGTAACTTAACGTATATGACGTATGAACTAGCTTTTAACAGC
GAGTGCCAGGAAAAGTTATATAATGAACTCAAGGAAGCTGAAGAAAGAAATGGAGGAAAT
TTCGACGCTGACACCGTGTCTGAATTAACTTATCTGAATTGTGTTTTAAAAGAATGCCTC
AGAAAATATCCGCCAATGGGCTGGCTCGATAGAATAGCCGCTACGGACTATAAGATTGAC
GATAAATTGACCATCAAAGCTGGTACAGTAGTTTATGTGAACTCTATTGGTTTTCATTAT
GATCCAAAATACTTCCCCGAGCCTACAAAATTTAATCCTGATAGATTTTTACCAGAAAAT
ATCAACAAAATTAAGCCATATACGTTTTTACCGTTTGGAGACGGACCAAGAGTGTGCATA
GGTCAAAGATTTGCCATAATGACTGCACGAACAGCTGCGTCACAGCTGTTTCTAAAATAC
AAGGTTCGACCGCTCCCCAATACTCCTGCACCTAATGACGCCAAAATCGACTGTAAAGGC
CTTTTGTTGCATCCCGGAGAACCAATGCGTGTTGAGTTTATTCCGAGATCGATAAAGTAA

Protein sequence:

MYEHFKSPYIGIWLIWKPALIINDPEIARRILVKDSLIFRDRYLSSGSSDPIGALNLFTV
NDPVWTSIRRKLSNVFTVAKLKALHHYTLSKVEELMRRIERDREKGLELKRLFVDYTTDV
TGTFSFGIESNATLTSKGPLREITADFGKFSIYRGICWFSIFFWPDLVDIFRFTMFPKKS
MHSFKRIFETTLNRHSNDIGGKDFKDIVDGLIEFKKEKEQKHQEVSDEFLIAQAAILLFG
GFDTTASNLTYMTYELAFNSECQEKLYNELKEAEERNGGNFDADTVSELTYLNCVLKECL
RKYPPMGWLDRIAATDYKIDDKLTIKAGTVVYVNSIGFHYDPKYFPEPTKFNPDRFLPEN
INKIKPYTFLPFGDGPRVCIGQRFAIMTARTAASQLFLKYKVRPLPNTPAPNDAKIDCKG
LLLHPGEPMRVEFIPRSIK
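
As a consistency check, the nucleotide sequence can be translated and compared with the protein sequence. The sketch below assumes the standard genetic code and uses only the first and last 60-nucleotide lines of the CDS as input; the names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumed here).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence above (60 nt each, in frame).
FIRST_LINE = "ATGTATGAACATTTCAAATCTCCGTACATAGGGATCTGGTTAATATGGAAACCAGCTTTA"
LAST_LINE = "CTTTTGTTGCATCCCGGAGAACCAATGCGTGTTGAGTTTATTCCGAGATCGATAAAGTAA"

print(translate(FIRST_LINE))  # MYEHFKSPYIGIWLIWKPAL
print(translate(LAST_LINE))   # LLLHPGEPMRVEFIPRSIK*
```

Both fragments match the start and end of the protein sequence, with the final TAA read as a stop codon. The lengths agree as well: 1320 nt is 440 codons, that is, 439 residues plus the stop.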