New model in OGS2.0 | DPOGS205971  |
---|---|
Genomic Position | scaffold176:+ 12820-20579 |
See gene structure | |
CDS Length | 3267 |
Paired RNAseq reads   | 4131 |
Single RNAseq reads   | 10550 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA009401 (4e-22) |
Best Drosophila hit   | ND |
Best Human hit | ND |
Best NR hit (blastp)   | PREDICTED: similar to Y71H2B.5 [Tribolium castaneum] (3e-105) |
Best NR hit (blastx)   | PREDICTED: similar to Y71H2B.5 [Tribolium castaneum] (7e-97) |
GeneOntology terms    | GO:0008152 metabolic process GO:0009058 biosynthetic process GO:0016769 transferase activity, transferring nitrogenous groups GO:0030170 pyridoxal phosphate binding GO:0003824 catalytic activity |
InterPro families   | ND |
Orthology group | MCL17616 |
Nucleotide sequence:
ATGGTAAAACATCCGAGGGGTATCTTCCTGCATCATAACTTCATCTGTGCTGTTCTCAAC
GACGTGTTCGGTATCCAGGCCCGAGGGGGACTTAGCGGAGATTTGAATTACGGATCAGAT
ATCCTAGGGATAGATGATCATCTGTTGAAGGAATACGAAAAGTTATTGGATGTTGAAGCT
CAAAAAGAAATATCTCGAGTTCGTAAACTAAGTGTGGTTAAATCCCCGGAAGTTCCAATC
CACAATGAACCATTACGGCCCGGATTTTGTCGGTTATCACTACCTTTCTTTATGTCTGAG
AATGAACTGGCGTTCGTCTTAGAAGCTTTGAAAATGGTTGCCACGGAAGGATGGAAGATT
TTGCCACAGTATGTAGTGAACTCGGAAACCGGTCAATGGAGACATCATTCGTGCTCAGTA
CTTAGAGATAAGAAGTCATTGTATTCAATAAGGTTCAACGATGGAAAAATTACAGCGAAC
GAGAGACGAGTATCAGGTCCAGGAATATTCCCGCAGACTTTCGCGGAATGTTTGCAAACC
GCTCGTAATCTTTTCAATCGAGCTAGGAAACTGGCCATGAAATGTGCAACGGTTGAGCCC
GAAGTCAGTTTTAACCCGAAAATAGATTACTTGAGATGGTTCATGTTGCCGAAGGAGGCT
CACGACCTGTTGTTGGGCAAGTCCGCGAATGTGAAACACATTGTGCCATTCGATCCCGTG
GGCTACACGGGGACGAGAAAGAGCCTGAACAATTCGAGGTCATCTCACACCTCCTCCCCC
GTCCTGGGGACTACCTCCAGACACTTCAGCTTATCGGCCATCGACGACTGCCATTTACTG
AATCTGAAACAAAGACAGAAGTTCTTTTCAAGAGAATCCAGCTTGAAAGAAACCACGAAA
GAGAAAGCTGAGACGATGTCATCAAACCCGGTGCAATTCGCTGTTGGAGAATCCGTTTCC
CCTCTACGTATAGTACCACAGAATGCGCAGACGATGCTCGGCAGGTCCAGATGCTACTCC
TTAGGATCCGATCTGCCGCCGGTTCAGTTGAGTGCACGAGCGAGACTGAACCTTGGACTC
AAAGAAACTCCTGGAAACGGTGAAAAAACTATCGGCTTCTGTAACTGCGGAAGTCAAACC
GATCTACCCTCGTTGGACGATATGTCGCCCACTAAGAAGTATCCATACAGCACACAGAGC
AGTTCTTCGATATCTGATTGCAGTCAAGTGGGCCGTACCTCGCCAACTACATCAGTGACC
TCGCACACGTCTGAGGATCTCGAAGCTATAGTTAAAGTGACGACCAACGAAATCGCAACA
CAAATAAGATCCCAGTTAAGAGGGGTCATCTCTAAGGTGGACGATATACTAGAGAATTCC
GACTCTCTCGAGCAGTCCAATATGAGTATGACGTCCATATCTAGTCAGAGTGATAAGAAT
TCCGTATCAGTCGTGGACGTCGCCGAACTGTTGATCGGCATGTCGAGGGAAATAGCTTCG
GAGGTGAAACATGAGTTCAGAGAAATGGTTAATACTGTTGACGAGATGATTTCTCCAGAA
CTTTCCGGATCCAGAAGAAGTTCACCGCCGCAGACCGGAAGGAGGAGGATAGGATCCGGT
CCGGAACTCGGGTTGGACTTGAACGTCAGTCAAACCCAGGTGTTGAAAAAATGTCCGACC
AGTCCAGTACTTCCCATCCAGTATGATGATGATCATTGCTGCAAACGCTCACCCGGCGCC
CAGTCGCCCCTGTCAGCCCAGAACACTCCGAGTCATGAGGCATCAGCACCGAACAGCATC
TCGTTCAACTCCAGCGAGACTTCTACACCGGACACGATCGTACAAGTGATGACGTCACAG
AACTCTCCCATACTCTCCAAATCATCGAGCGCCAACAAACTATCCGACGATGAAACCTGT
ACGGACCCGAGATGCAGGCATTACTGTATCAAGAAGAACTGGTGCCAAAACCCGTCCATC
AGTTCCCAGGACAGCGGAATAAACCTGACATTCACAGAAACCGACTCCTACATGGACTTC
GACAAATGGCGAACATCCTCGGACACGTCATCGAACAAACTGAAAAAGCTCCAGGGTAGA
TTACGGATGTGTCAAAAGTACGAGAAGAGCGAGGTGCCGGACATTATAGAGGGTGTTCCA
GTGTGTTCAGGGGATCACGCGCGGACAGCCAAGAACTGTGATCCTGATACGGCGAGGGTT
GTCTTCCAGATACCTGATGATAATGAAAAGCAGAATCGCACAACGGCGCTGGATTCTAAG
CGTTCCAGCCGTTCGTCGAACGCGTCGTCTAGCAGTTCGCGCTCCAGCGGCTACGGCACC
GACCACAGGACTCCAGAGGAACAATTTTATGAGCGAAGTGAATCTGATCGTATCTTCAAG
CCGGATTGGGAGGCAGACAGTACGTGTAGCGAGGCGTCCCTCACAGACTTCACTTTGGAC
GATGAGGGGAAATGGCACTGTCCGCCCCGAGAAGTCTGGAGGGCCACTGTGGAGGCTATA
CACGAATACAACATGGTGAGAGCCGGTGACAAGATCCTCGTATGTCTGTCTGGTAGCCGG
GAGTCGGTCGCATTGCTCCATACCATGCACCAGTACCAATTCTATGCGAGATCCAAGGGA
ATACACTTCAGTATTGCATCTCGGGCTATTGAAGATGACCTCGTACACGGTTCAACAAAT
GCTTTATTTCAAAGAGCTCCGGTTCCTGCAACGGCTCGAGCGAGACTGTTTTTTAGTTTG
TACCCTCGCCTCTATTTATACCAACCATTTTGCTCATCCCAGGATACGCCTTGGCAAAAA
ATGGAGCACAACATCCGTATCCTCCGCCCGTTCATCTACGTCCGCTCTCAGGACCTGGAA
CACTTCGCTCGCTCCCAGGGTCTGCCTGACTTCGGAAGAGATTTGTCTGATAAACCTGTC
CTAGGTCCCAGTAAAAACAAATTGGATCGATCTATATCTCTACCATGCGGGTCTGACAAA
GGAGATTCCTTGGACCGGGAGGAGGATCTATCAGGGTCCCTGCCCGAGCTGGTGGATCCG
ATGTCTTCGGCTCGTGAAATCCTGAAAACTCATGAGAAATTATATCCTTATTTATTCTCC
AGCCTGAAGAACGCTCTGCATCCACTCATCAGCGGCAGGAACATTGATAAGGATAACAGA
CACAGAAAGAAATCTGTTATACAGATGAAGAACGGCTCCCCCGTGTATGACTCGGAGGAG
GGGACGGAGGAGGAGCCGGTGCCATAG
Protein sequence:
MVKHPRGIFLHHNFICAVLNDVFGIQARGGLSGDLNYGSDILGIDDHLLKEYEKLLDVEA
QKEISRVRKLSVVKSPEVPIHNEPLRPGFCRLSLPFFMSENELAFVLEALKMVATEGWKI
LPQYVVNSETGQWRHHSCSVLRDKKSLYSIRFNDGKITANERRVSGPGIFPQTFAECLQT
ARNLFNRARKLAMKCATVEPEVSFNPKIDYLRWFMLPKEAHDLLLGKSANVKHIVPFDPV
GYTGTRKSLNNSRSSHTSSPVLGTTSRHFSLSAIDDCHLLNLKQRQKFFSRESSLKETTK
EKAETMSSNPVQFAVGESVSPLRIVPQNAQTMLGRSRCYSLGSDLPPVQLSARARLNLGL
KETPGNGEKTIGFCNCGSQTDLPSLDDMSPTKKYPYSTQSSSSISDCSQVGRTSPTTSVT
SHTSEDLEAIVKVTTNEIATQIRSQLRGVISKVDDILENSDSLEQSNMSMTSISSQSDKN
SVSVVDVAELLIGMSREIASEVKHEFREMVNTVDEMISPELSGSRRSSPPQTGRRRIGSG
PELGLDLNVSQTQVLKKCPTSPVLPIQYDDDHCCKRSPGAQSPLSAQNTPSHEASAPNSI
SFNSSETSTPDTIVQVMTSQNSPILSKSSSANKLSDDETCTDPRCRHYCIKKNWCQNPSI
SSQDSGINLTFTETDSYMDFDKWRTSSDTSSNKLKKLQGRLRMCQKYEKSEVPDIIEGVP
VCSGDHARTAKNCDPDTARVVFQIPDDNEKQNRTTALDSKRSSRSSNASSSSSRSSGYGT
DHRTPEEQFYERSESDRIFKPDWEADSTCSEASLTDFTLDDEGKWHCPPREVWRATVEAI
HEYNMVRAGDKILVCLSGSRESVALLHTMHQYQFYARSKGIHFSIASRAIEDDLVHGSTN
ALFQRAPVPATARARLFFSLYPRLYLYQPFCSSQDTPWQKMEHNIRILRPFIYVRSQDLE
HFARSQGLPDFGRDLSDKPVLGPSKNKLDRSISLPCGSDKGDSLDREEDLSGSLPELVDP
MSSAREILKTHEKLYPYLFSSLKNALHPLISGRNIDKDNRHRKKSVIQMKNGSPVYDSEE
GTEEEPVP