DPGLEAN14450 in OGS1.0

New model in OGS2.0DPOGS201819 
Genomic Positionscaffold12:+ 314800-320474
See gene structure
CDS Length2079
Paired RNAseq reads  3639
Single RNAseq reads  9342
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013115 (0.0)
Best Drosophila hit  CG8193 (1e-179)
Best Human hitND
Best NR hit (blastp)  prophenoloxidase 2 [Pieris rapae] (0.0)
Best NR hit (blastx)  prophenoloxidase 2 [Pieris rapae] (0.0)
GeneOntology terms


  
GO:0004503 monophenol monooxygenase activity
GO:0005576 extracellular region
GO:0006583 melanin biosynthetic process from tyrosine
GO:0006952 defense response
InterPro families





  
IPR000896 Hemocyanin, copper-containing
IPR005203 Hemocyanin, C-terminal
IPR005204 Hemocyanin, N-terminal
IPR013788 Arthropod hemocyanin/insect LSP
IPR014756 Immunoglobulin E-set
IPR008922 Di-copper centre-containing
IPR002227 Tyrosinase
Orthology groupMCL10094

Nucleotide sequence:

ATGGCGAACATTGTTACAGCTTTGAAGTTGTTGTTTGACCGTCCTAATGAACCCATGGTT
TCACCCAAGGGTGACAATCAAGTTGTCTTTCAACTCACAGAACAGCATCTCGACGATAAA
TACAAGAGCAATGGCATCGAGATCAATAATCGTTTTGGGAAGGACAAACCAATTATCCCG
TTAAAGGAACTAAAGACACTTCCTCAGTTTCCAAAAGCTAAACGGCTGCCAAGCGATGCC
GATTTCTCTATTCTTCTGCCTGCCCACCAGGAGATGGCTGATGAGGTCATTGATGCCCTT
CTAGCGGTGCCTGAAAACCAACTACCCGAATTTCTATCGACATGCGTTTATGCGCGTGTG
AATCTGAATCCTCAGTTATTTAACTACTGCTACTCTGTGGCTTTGTTGCACAGGAAGGAC
ACTAAAAATGTTCCACTTCAAAACTTCGCTGAGACCTTCCCGTCTAAGTTCGTTGATTCG
AAGTTTTTCAGTCAAGCACGCGAATCCGCCGCCCTTGCCAAACAAGGAGCTCCGCGTGTG
CCAATAATAATCCCGCGCGACTTTACCGCAAACGACTTAGACATTGAACACAGACTCGCT
TACTGGCGCGAAGACATCGGAATCAACCTTCACCACTGGCATTGGCATCTGGTGTACCCA
TTCAGCGCAACTAAAAGAGAAATTGTGGCTAAGGACCGTCGTGGCGAACTCTTCTTCTAC
ATGCACCAGCAAGTCATAGCCCGATACAACACGGAACGCCTGGCTAACCAGCTCGCACGT
GCTAAGAAGTTCAGTGACTTCACGGAACCGACTCCTGAGCCGTACTATCCTAAATTGGAC
AGTCTCACATCGTCCCGCAGCTACCCGCCGCGGCAGGCCAACATGAGGTGGTCGGATCTC
AACAGGCCCGTCGATGGTCTCGTGGTCACCATCGCCGATATGAACCGCTGGAAGAGGAAC
CTCGAAGAGGCCATCGCCACGGGCATGGTCAAACTGCCAAATGGCTCGACCCAGCCCCTG
GACATAGACACTCTGGGGAACATGGTGGAGTCGAGCATACTGTCACCGAACAGAGATTAC
TACGGAACCTTGCACAACAACGGACACAGCTTCGCTGGATACTTGCACGATCCTGACCAC
AGATATCTGGAATCCTTCAACGTAATAGCTGACGAGGCGGTGAATATGCGAGATCCCTTC
TTCTACCGCTGGCACGCGTTCATTGACGACCTTTTCCAGAAGTTCAAAGAGAGCAACAAC
GTGAGACGATACACGAGATCGGAGCTTTCGAACCCGGGGGTGCAAATCACGAATGCCAAG
ATCGTGAACAGCAATGGCGCCGCGGACAACACTCTACACACGTACTGGATGCAAAGCGAC
GTCGATCTGTCGCGCGGACTCGACTTCTCGGACCGCGGGCCGGTGTACGCGAGGTTCACT
CACCTCAACTACAGGCCGTTCAGATATGTCATCGACGTAGACAACACGGGCAGCGCTCGC
CGGACAACGGTCCGCATCTTCATAGCCCCCAAGTTTGATGAACGTGGCTTGCCGTGGATA
CTATCCGACCAACGCAAAATGTTCATCGAGATGGACAGATTCGTTGTGCCCTTGAACGCC
GGCAAGAATGTCATAACACGTGAGTCTACCGAATCCTCGCTGACTATCCCCTTCGAGCAG
ACCTTCCGCGACCTCTCCAGCCAGGGAAGCGACCCTCGACGTGAGGACCTCGCTAGCTTC
AATTTCTGCGGCTGTGGGTGGCCCCAACACATGCTTGTACCACGTGGCACTGAGAGCGGC
ATGCTGTTTGACTTTTTCGTTATGCTGTCAAACTACGACCTTGACGCCATAACTCAACCA
GAAGGTGTTGCACCGCTGTCCTGTACAGAAGCTTCTAGCTTCTGTGGTCTGAAGGATCGT
CTTTACCCCGACAAGCGCAACATGGGCTTCCCATTTGACAGACCTTCCAGCAGCGCTGCA
AACATCCAGGACTTCATTCTGCCAAACATGTTCCTTGCTGATGTTAGCATTCGTCTACAA
AACACCGTGGAAATAAATCCCAGAAATGCTAAAAACTAA

Protein sequence:

MANIVTALKLLFDRPNEPMVSPKGDNQVVFQLTEQHLDDKYKSNGIEINNRFGKDKPIIP
LKELKTLPQFPKAKRLPSDADFSILLPAHQEMADEVIDALLAVPENQLPEFLSTCVYARV
NLNPQLFNYCYSVALLHRKDTKNVPLQNFAETFPSKFVDSKFFSQARESAALAKQGAPRV
PIIIPRDFTANDLDIEHRLAYWREDIGINLHHWHWHLVYPFSATKREIVAKDRRGELFFY
MHQQVIARYNTERLANQLARAKKFSDFTEPTPEPYYPKLDSLTSSRSYPPRQANMRWSDL
NRPVDGLVVTIADMNRWKRNLEEAIATGMVKLPNGSTQPLDIDTLGNMVESSILSPNRDY
YGTLHNNGHSFAGYLHDPDHRYLESFNVIADEAVNMRDPFFYRWHAFIDDLFQKFKESNN
VRRYTRSELSNPGVQITNAKIVNSNGAADNTLHTYWMQSDVDLSRGLDFSDRGPVYARFT
HLNYRPFRYVIDVDNTGSARRTTVRIFIAPKFDERGLPWILSDQRKMFIEMDRFVVPLNA
GKNVITRESTESSLTIPFEQTFRDLSSQGSDPRREDLASFNFCGCGWPQHMLVPRGTESG
MLFDFFVMLSNYDLDAITQPEGVAPLSCTEASSFCGLKDRLYPDKRNMGFPFDRPSSSAA
NIQDFILPNMFLADVSIRLQNTVEINPRNAKN