DPGLEAN10992 in OGS1.0

New model in OGS2.0DPOGS203437 
Genomic Positionscaffold244:- 48730-55015
See gene structure
CDS Length3666
Paired RNAseq reads  1251
Single RNAseq reads  2921
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011156 (3e-24)
Best Drosophila hit  ND
Best Human hitWASH complex subunit FAM21A (1e-09)
Best NR hit (blastp)  PREDICTED: hypothetical protein LOC423772 [Gallus gallus] (9e-10)
Best NR hit (blastx)  PREDICTED: similar to Protein FAM21A [Tribolium castaneum] (9e-13)
GeneOntology terms

  
GO:0003674 molecular_function
GO:0005575 cellular_component
GO:0008150 biological_process
InterPro families  ND
Orthology groupMCL15160

Nucleotide sequence:

ATGGAGGGAGATACTACTCGTCTGAGGTTGTCGGCTCCCGACTGGTCTCTCGCAGGAGAC
TCACAGTTACTGGACATCTTACAGAGCTTGCATCAGACCATCATAACCAAATGCCAGGAG
ACCAATGTTCAGTTGGAGTCCATGATGTCGTCCCTGGATGAGGCCAGCATTCACTTGCAG
AATGTTAACAACAAATTCCTCGGACTCAGTAACAGTCAGTTTGTCGAAAGTCGCGTGTAC
GATGATCACACGGAGATCGCTGAAGATAATAACAACAAGGATCCTCCGCAGCGCGCCCCC
CTCAGTCCCGTGTCGTCTTTGAAGCTCTGTCTCCACACCCTGGAGAGTCTTCACGAAGCT
GTCCCCGTGATGGACTCCGACAGTGACGAAGAGGGCAGCGCCCGCGTGGTGCTCCGGCCT
CTGCTGCGTCGCGGGCGCGTCGCTCATGAAGCTGTATCATCAGACGCGGACAGCGAGGAG
TCCTCCCAGCAGGAGAGGCAGCTGGAGGCGGAGTACTCGGACTCGAGTTCTGAACACGAA
CAACAGACGGACGCACACGAACATACTATTCCACCGCCGCCTTCCGTGTTAGTGACGTCA
CACACGCCGCCCGACACGAGGACCACGGAGCCGGTCACCTCGCCCGAGTCGAATGTATCG
CCAAAAGTGAGGAAGCTGTACACAGTAGACAAACCCGTCACAGCTCAAATCTTCCCCGAG
GAGCCTCCGCCTCTTGACAAGTATGACTCCGACACTGACGATGACATCTTCGCTGACTTA
CACACACATGCACATACACATACACACACCCACACACACACAGCGCCAGACACGGGCGAC
ATCGTAAACGACCTGTTCGGAGGAGGAGGAGGGGGAGGAAGAGCAGGGTTTGACAGAGAT
GACGTCACAGAACACACGCGAGTGAGGCATTCACACTTTGTGAGAGAGGAGTCGCCTGGA
GCGACCAGTGTGGAGCCAGTGGAGCCAGAGTCAGAACAGACTCCGCCACGGGAATATACT
ACAAAGGAAAATGTTAAAAAACCCGCTGGTGGTATATCTCTGTTCGGCGGCGCGGGTCCT
GAAGCTATCGGAGCGGCCGTCCTGCGAAGAGCACGAAGACAGTCATCAAGTGACGGTGAG
GTCGCGGACACTCGCACCGACAGAACCAATGTCATCGACGAATTATTTATAAAACCAACT
AAAAATGTCAAAAAACCACCCGTCGATGTTAAGAAAGAACCGAAAGTTGCTAAAGATATA
GCTGAGAGTAGCGCTAAAGATAAAAAAGATAAAATAGATCTGTTCTCTGATGATATCTTT
GATGACATCGATGATATCTTTACGAGTAACGTTACGAACACGACAAAAGACAGCAAGGAA
ACGTTGTTTAATGATGATCTGTTCAATGATAACAATGATCTGTTTAACGATAACAGTAAG
TCTGTTAAGATTGAGAGCAGCGTTACTAAAGACGACAAAGTAAGAAACATATTTGATAGT
GACAGTGAAGACGATTTGTTCTTTGATGCTAAAGGAAAAGATAAAGATTCAGACACAAAA
GATAGCACTAAAGTTAAAGATTATAATTCAAATGAAAGCTTAACAGTCAAGAACACTAAA
GAAGAAAGTAAAGTTGAACTGAAAAATCAGTTGAGTCCCAATTTATTTGATGATGATGAT
GATGACCTGTTCAATGTGACGCCGTCCAGGAGAGTGGCGAGTGAACACGGTGATAGGAAC
GCTGAAGAAACACGAGATAATCAAAGACAAGACAAGAATGAGGCCGAAAAGATGGAAGGA
ATCAAGACAAGTGACACGCAGGGGGAAGACTGTTTGGAAGAAAAACATGTCGGTGATCCC
GTGACAACTGAAAGAAGTGATGCAAACATGCGCGTTCCAGAAAAAAACGTTGTACGAAAC
GAATTTCACGACGATTTTAATGATTCTGGACCAATAGAGGAAGATTCGGCAAAATCTACT
GATAGAGAAGATAATGATGAGAAATCAAAAGGTGATAATAGTCTGCCGAAAGAAAAAGAC
TTTATAAAAGAAACGAAAGAAAATAAAAGTGAGGAGGAAGCGATAGATGTAAAAGATACT
AACGCCAATGACATATTCGTCGACATCTTCAGTGATCTGCCTCCAGCCTTCGAGAAACCG
ATTGAACCGAAGAAGAGTAAAAACGTCAATGCTCTGTTCGACGATGACTCTGATGATGAG
GCGCTGTTCTTCAAGAAAGATGACGTCATCACCGACGAGAAACCGGAAATGGACTTCGGC
AGTGACAGGTTTAGAATATTCCATGACGAACCACCCGATATTGATGTGGATTTCACAACG
AAGTCTGCGAGCGGACCTCATACGACTGATGTGGCAGATGCTTTGGAAGCTGCGGCGGAC
GTTGAAGCTGTGACCCATGGAAAAGCTGACACGGCATCAGAAAAACAGATTGAAAAATGT
AAAGAGACGGGAAACGAAAATATGCCTCATGAAACAAAAAACAACAACAAAATAAACATA
CTGAAATTACTAGAGAATGAGGAAAACAATACTGATGGAGGAAATGAAAAGAAAGAGGAT
TTATTTACCGGCACAGAAAAAGATGATGCGAACTCTGCGAGTAAAACAAAAACAAGAGAT
GTCAAGACAGAAGAAGAATCAGACTCCTCGGAAAGAGAGAATAGAGTTATTGGAAAGCTG
AAGCCGACGAAGCTCAATATAAATGTTAATACGTTGTTACCGGGAGCTGTTCCGAAGAAA
CCTGTGAACTACGAAGAGACCGACGGACAGGTCACATCCAGAAGTAAAGAAGACTCCGCT
CTGGTTGAAGAGCACAAAGAAAAAGTAGTCAGCTTCAAGGAAGAAACGAACTCGGAAGTC
CTAGATAACAAACTATCCAAGGAGAGAGCTCGGATTCAGGTCAAAAGACGACCGTCGACT
AGACGAGCTAGACTTGAAGCTGTGAGGAAGACTGGTCTAGACTTCGGGTCAGACTCCACA
GACAACTCCAGCTCGTTTGACGAACCGGTCAGAGAGATACCAAGAGACAGCGCTCCTAAC
AAAGAAACAACGACGAAAGTGACCAAACAAGCAGACAACAAAGATGTCATCTCTAAAGTT
GTTTATGTTCTGAACGACGAGGACATCTTCGACATTCCTCCGACAGAAACAACTGCTGGA
AAACCTCGGAAAGAAGATCTCACGGAAACAATGAACTCTACTGGAATCAGACACCAAGAA
ACACAAGGAGACGAGAGTCGGAAAAAGAAGACAGAAGAAAAGAAAACAAAAACATCATTA
TTTGATGATAGCGACGAGGAAACGGATCTGTTTGGGAAACACACTAAGAGATATATATTC
GACTCGGACAGCGACAGCGAACTGTTCGGGAAAGATAAAGGAAAGATAGTGAAAGATACA
AGAACAGAGGAAAAAGATAAAGAAAGGAGAATCGACAAGGTACAAGCGAAAATACCTCTG
TTCAGTGACGACAGCGATGAAGACTTGTTCGGAGGAAAATCAAAAAAAATAGAAGTAAAG
AACACATCACAAGCGAGAGCAGTCCCTGGATCATCACAAGTGAGAGCAGTCCCTGGATCA
TCACAAGCCTTCGATGATCCGCTCTCAGTGCTCGGGGACGAGCGCTCACACAACGTGCAT
ATATAG

Protein sequence:

MEGDTTRLRLSAPDWSLAGDSQLLDILQSLHQTIITKCQETNVQLESMMSSLDEASIHLQ
NVNNKFLGLSNSQFVESRVYDDHTEIAEDNNNKDPPQRAPLSPVSSLKLCLHTLESLHEA
VPVMDSDSDEEGSARVVLRPLLRRGRVAHEAVSSDADSEESSQQERQLEAEYSDSSSEHE
QQTDAHEHTIPPPPSVLVTSHTPPDTRTTEPVTSPESNVSPKVRKLYTVDKPVTAQIFPE
EPPPLDKYDSDTDDDIFADLHTHAHTHTHTHTHTAPDTGDIVNDLFGGGGGGGRAGFDRD
DVTEHTRVRHSHFVREESPGATSVEPVEPESEQTPPREYTTKENVKKPAGGISLFGGAGP
EAIGAAVLRRARRQSSSDGEVADTRTDRTNVIDELFIKPTKNVKKPPVDVKKEPKVAKDI
AESSAKDKKDKIDLFSDDIFDDIDDIFTSNVTNTTKDSKETLFNDDLFNDNNDLFNDNSK
SVKIESSVTKDDKVRNIFDSDSEDDLFFDAKGKDKDSDTKDSTKVKDYNSNESLTVKNTK
EESKVELKNQLSPNLFDDDDDDLFNVTPSRRVASEHGDRNAEETRDNQRQDKNEAEKMEG
IKTSDTQGEDCLEEKHVGDPVTTERSDANMRVPEKNVVRNEFHDDFNDSGPIEEDSAKST
DREDNDEKSKGDNSLPKEKDFIKETKENKSEEEAIDVKDTNANDIFVDIFSDLPPAFEKP
IEPKKSKNVNALFDDDSDDEALFFKKDDVITDEKPEMDFGSDRFRIFHDEPPDIDVDFTT
KSASGPHTTDVADALEAAADVEAVTHGKADTASEKQIEKCKETGNENMPHETKNNNKINI
LKLLENEENNTDGGNEKKEDLFTGTEKDDANSASKTKTRDVKTEEESDSSERENRVIGKL
KPTKLNINVNTLLPGAVPKKPVNYEETDGQVTSRSKEDSALVEEHKEKVVSFKEETNSEV
LDNKLSKERARIQVKRRPSTRRARLEAVRKTGLDFGSDSTDNSSSFDEPVREIPRDSAPN
KETTTKVTKQADNKDVISKVVYVLNDEDIFDIPPTETTAGKPRKEDLTETMNSTGIRHQE
TQGDESRKKKTEEKKTKTSLFDDSDEETDLFGKHTKRYIFDSDSDSELFGKDKGKIVKDT
RTEEKDKERRIDKVQAKIPLFSDDSDEDLFGGKSKKIEVKNTSQARAVPGSSQVRAVPGS
SQAFDDPLSVLGDERSHNVHI