DPGLEAN09010 in OGS1.0

New model in OGS2.0  DPOGS215094
Genomic Position  scaffold5075:+ 191-10345
CDS Length  1875
Paired RNAseq reads  7570
Single RNAseq reads  20543
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA014043 (1e-67)
Best Drosophila hit  cytochrome P450-4c3 (1e-56)
Best Human hit  cytochrome P450 4V2 (2e-52)
Best NR hit (blastp)  cytochrome p450 CYP4S1 [Helicoverpa armigera] (8e-115)
Best NR hit (blastx)  cytochrome p450 CYP4S1 [Helicoverpa armigera] (3e-111)
Gene Ontology terms
GO:0005792 microsome
GO:0009055 electron carrier activity
GO:0016020 membrane
GO:0020037 heme binding
GO:0055114 oxidation reduction
GO:0004497 monooxygenase activity
InterPro families
IPR001128 Cytochrome P450
IPR017972 Cytochrome P450, conserved site
IPR002401 Cytochrome P450, E-class, group I
Orthology group  ND

Nucleotide sequence:

ATGCTGTGGCCGATATTTTTTGGATTACTGGTTTGTGTGGTGGTGTGGAGACTTATCAAA
CAGGAGCCCAATGACCTTGCACACCTGCCTGGACCTCCAAGGACACCAATATTTGGATCA
GTTTTTATGTTCCTGGGAAAATCTCATAGCGAACTCTTCAAAATGTTGGTTGAGCTTCCA
AAGAAATATGGAAATCGTCTTGTTATCAAGGCAATGCACCGGTATATTCTACATGTTTAC
AAAGTCGAGGACATTGAGATTGTTCTAACACATTCGAGAAACATCAAGAAGAATAAACCT
TACACGTTCATAGAGCCGTGGTTGGGAACTGGTCTTCTTATTAGTAATGGCAGTAAATGG
CAGAAACGGCGAAAAATCTTGACACCGACATTCCATTTCGACATTTTAAAGGGATTCGTA
AAAGTATTCGAAGAGCAAAGTAGGAATCTGACAACAATGCTCAGGAAAAAACTGCAGGAG
TCAAATGTTGTCGATACTATGGCCATCATGAGCGATTTTACACTTTATATTATATGTGAG
ACGGCTATGGGTATAAGATTAAATGCGGATAAAAGCGCTGAAAAAATGATGTATAAGAAG
GCCATCATGGAAATAGGACAGATAGTGATGAAGAGGCTGACCACAGTGTGGCTTCACAGT
GACCTGATCTTTTACAATATGCCCATCGGAAAGAAATTCACCAAGTGTCTGGAAAACGTG
CATTCCTTCGCTGATAACGTGATCCTGGAGCGGAAAAAAAAATACGAGAGCGTCGCAAAT
GAGGATGGTGGGAGAAGGAGATTAGCGTTTTTAGACTTACTCCTTGAAGCGGAGAGGAAC
GGAGAAATAGATTTGGAGGGAGTAAGAGAAGAGGTTAATACGTTTATGTTTGAGGGTCAT
GACACAACAGCTACCGCTTTAGCATTTGGCCTGGTGTTGCTCGCCGACAGCGAGGAGGTT
CAGACGGCTATGGGTATAAGATTAAATGCGGATAAAAGCGCTGAAAAAATGATGTATAAG
AAGGCCATCATGGATATAGGACAGATAGTGATGAAGAGGCTGACCACAGTGTGGCTTCAC
AGTGACCTGATCTTTTACAATATGCCCATCGGAAAGAAATTCACCAAGTGTTTGGAAAAC
GTGCATTCCTTCGCTGATAACGTGATCCTGGAGCGGAAAAAAAAATACGAGAGCGTCGCA
AATGAGGATGGTGGGAGAAGGAGGTTAGCGTTTTTAGACTTACTCCTTGAAGCGGAGAGG
AACGGAGAAATAGATTTGGAGGGAGTAAGAGAAGAGGTTAATACGTTTATGTTTGAGGGT
CATGACACAACAGCTACCGCTTTAGCATTTGGCCTGGTGTTGCTCGCCGACAGCGAGGAG
GTTCAGGAACGTCTCTTCGAGGAGTGTCAGCGGGTTGGTCCTGAGCCGAGTGTGTCCGAG
TTGAACGACATGAAGTATTTAGAAGCTGTGGTCAAAGAAATCTTGAGGTTGTATCCAAGC
GTGCCGTTTATAGGACGAGAAATTACCGAGGACTTTATGTTAGATGACATCAAAGTAAAG
AAAGGCTGTGAAGTAGTCGTTCATATATACGACGTACATCGAAGACCGGATCTATATCCG
GATCCTGTAGCTTTCAAACCGGAAAGATTTCTGGACGAAGAGAAACGACATCCCTACTCC
TATGTACCGTTCAGTGCTGGGCCACGAAATTGCATTGGTCAAAAGTTCGCCAAGCTCCAG
ATGAAGGTCGTCATTAGTGAGATAGTCCGTAATTTCAAGTTGTCACCGCTGGTCGCTGGC
GCACGACCCGACCTCAAGGTCGATCTAGTACTGAGACCTGCTGAAACCATCTACGTGAAA
TTTTATCCTCGATAG

Protein sequence:

MLWPIFFGLLVCVVVWRLIKQEPNDLAHLPGPPRTPIFGSVFMFLGKSHSELFKMLVELP
KKYGNRLVIKAMHRYILHVYKVEDIEIVLTHSRNIKKNKPYTFIEPWLGTGLLISNGSKW
QKRRKILTPTFHFDILKGFVKVFEEQSRNLTTMLRKKLQESNVVDTMAIMSDFTLYIICE
TAMGIRLNADKSAEKMMYKKAIMEIGQIVMKRLTTVWLHSDLIFYNMPIGKKFTKCLENV
HSFADNVILERKKKYESVANEDGGRRRLAFLDLLLEAERNGEIDLEGVREEVNTFMFEGH
DTTATALAFGLVLLADSEEVQTAMGIRLNADKSAEKMMYKKAIMDIGQIVMKRLTTVWLH
SDLIFYNMPIGKKFTKCLENVHSFADNVILERKKKYESVANEDGGRRRLAFLDLLLEAER
NGEIDLEGVREEVNTFMFEGHDTTATALAFGLVLLADSEEVQERLFEECQRVGPEPSVSE
LNDMKYLEAVVKEILRLYPSVPFIGREITEDFMLDDIKVKKGCEVVVHIYDVHRRPDLYP
DPVAFKPERFLDEEKRHPYSYVPFSAGPRNCIGQKFAKLQMKVVISEIVRNFKLSPLVAG
ARPDLKVDLVLRPAETIYVKFYPR
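
Consistency check (sketch):

The record lists a CDS length of 1875 nt and a protein that ends in FYPR. The snippet below is a minimal sketch of how the two sequences could be checked against each other, assuming the standard genetic code (NCBI translation table 1). The names CODON_TABLE and translate_cds are hypothetical helpers introduced for illustration and are not part of the database. Only the first and last lines of the nucleotide sequence are used here; the full sequence could be passed in the same way.

```python
# Standard genetic code (NCBI table 1), built in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO
    )
}


def translate_cds(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Hypothetical helper: assumes an uppercase or lowercase DNA string
    whose length is a multiple of three and that starts in frame.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        residue = CODON_TABLE[cds[i:i + 3]]
        if residue == "*":  # stop codon ends the protein
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGCTGTGGCCGATATTTTTTGGATTACTGGTTTGTGTGGTGGTGTGGAGACTTATCAAA"
last_line = "TTTTATCCTCGATAG"

print(translate_cds(first_line))  # MLWPIFFGLLVCVVVWRLIK
print(translate_cds(last_line))   # FYPR

# A 1875 nt CDS holds 625 codons: 624 residues plus one stop codon.
print(1875 // 3 - 1)              # 624
```

Both translated fragments agree with the start and end of the protein sequence above, and the expected residue count of 624 matches the listed protein (ten lines of 60 residues plus a final line of 24).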