DPGLEAN07673 in OGS1.0

New model in OGS2.0  DPOGS212314
Genomic Position  scaffold605:- 25774-32958
CDS Length  2238
Paired RNAseq reads  1798
Single RNAseq reads  4346
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012037 (7e-114)
Best Drosophila hit  ND
Best Human hit  proline-, glutamic acid- and leucine-rich protein 1 (2e-11)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL014265 [Aedes aegypti] (3e-22)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL011984 [Aedes aegypti] (6e-30)
GeneOntology terms  GO:0071339 MLL1 complex
InterPro families  ND
Orthology group  MCL16699

Nucleotide sequence:

ATGGAACAAATATTTAAAAGAGTTCGGCATGTTGATCCAAATAACAGTGATGCTGTGAAA
GAAGTACTGTCAAACTTTTTCCAAAATTTGCCGAAGCGCAATGATATTAACAATAAAAAG
TTTTTGGACGAGTTATTGATTATAATTAACCGGTATCCTAAATATTGTATGTCATATCGT
AATATTATTGAGTCATTTATAAGTAATTACTTAAGCTCCAACAATTACTACAATGTTATT
AAAGCAGCTAAGTGTGCACATGCACTACAACAGGTTCGTCCTATTCAAGATAAGACAGCA
ACGCCGAAATCTTGTTGGCGGCAGCAGATGAACTCATTGTGTAATGCTGCACACTCACTC
ATAGAAGTGATATTTGCTGATGCTGTCGACATTTACAGAAGTAATGCTAAGCCTACAGAG
AGTCATTCAAATTCTCCGTTATCAACTATACTGGCAAATATCGTTAAGAAGTCCCAAGAC
AAAGACAGATCTCAGTTGATGATGACAAGACTCAAGAACGTATTCACATTCATACAGGCT
ATGTTGGTAGAAATATATCCTGTACCGAAACCAATCCAGCCTCGTTCAATACTAGATGTG
ATAGTGAGGTCTCTGAGTGTGAGCAGTCAACACGTCTCTCTGGACGTAGCCTCTGTTAAA
GTGCAGGCCTTGAAGACGTTGGATGCTATGATTCTGTGTCTGGGATCAAACCTGATACCA
TATTCACCTCTAGTCTTTAGACTTGCGACGCAGACTTTGAGATGGACCTCGGACAACATG
AGCAGAACTACCGGCAAAGTTCGTTGCACAGCGTACAGTACGCTGAGCAAGTGGCTTCTA
ACACTGCACATTTATAAGATGCCAGAAAAAAATACCTGGGAAGATGATCTGACGGCGCAT
GTCGTGAGGGACGTGACCCCGGCGAAGAGAGTTGTGGCGCTTACGATGGGGCCGCAGCCG
ACTAAAAATTTAAGCAAAAAAGCCAAAAGGAAATTAGCTAATTCTCAACTGTTGCAAAGC
TCAATCGCCGCTCACATGCCCGGGGAGAAAAATAAGATTGATATTCCAGAGGAAGTGAAC
AATGAGGTCACGGTATCAGCCTTGCAGTTCGCAGAAGTCTTCTTCACTGTTTGTGGAAGA
TTCCTCAAACCGGCCACACATAAGTTATTCCAAGAGCGTGTCATCCGCCATCACTCAGCA
GGTGAGACGTTGCTGTACCTGCGAGTGCTGGAGGCGAGTCGCAAGACGACGCCGGCGACC
GTGGCGCCACCGACACAGTACTGTCTTCATATATACAGTACGCTTGTGAACAGCTCCGAC
GCCGAGATATCAAAATTCTGTAGCCAAGCTCTACTGGATATAAGACTGCACCTATATTGC
TCGCCGCCGTCCATCAACCTGGCTATAGAAATACCTCAAGATGAAGAGGAAACGGCGAAT
AAAAGGAAGAAGGTCTCGTCCAAAAACAGGGCCATGTTAGAGTATCTATTAGGGCCGGAT
AAAGTGCCCCGAGATAAAGAAGACGATATTATAACGATTCCAGACGAACCGTCGAATAAG
AAACAACGTGTCGACGAATTGGATAGAATAAGTCTAAGCAGCGATTCCACCAGCACTGTT
AAGATACCGTACGGAACAGACATCAGCTCAGACTCGGACGGAGATAACGTCATGGAGGTC
GACGTAGTCGTCGAAATGAATCACACCACAGGCAGAGAGAGAGTTCTAGAAGCCGCCGAC
GTTCCGAAGATATCCATAAACGACGAAAAGCAATCATCCGATCAAATAAGTCTAAATGAT
ATAACAAACGATGAGAGCAACCAGGCAGACGCGATTACTAGTGAAGACGTCTCTGATGTT
ATACATGAAGCGCCTACACAACTGAACACATCAAGTGGCGCCCCCCAAGTAGTGTACGAC
CATCCGGATACAGGAACCGGCGACGTCACAGTCCTGGAGAGGATTGACGACGAAAATATA
CCAAACACGAACGATACGGACGAAGATGCGATAACTTGCGGGCAAATCGTACGAAGCTCG
CAAGAAATTGTCAACGGAAATAATGAGCCGGAAGTCAATGGTGTTGATAAAATTAATGAG
GACAGCGATGTGTATAATATTACCACGAAAGATATAAATAAGGGGGAAGATAATCTTGCT
GCCAAAATAGATGGAACCAGTGTGGAAGATATGATGGCGGATTTCGTTGATGAAGTAAAT
GAAGCTGTAGCTGTGTAA

Protein sequence:

MEQIFKRVRHVDPNNSDAVKEVLSNFFQNLPKRNDINNKKFLDELLIIINRYPKYCMSYR
NIIESFISNYLSSNNYYNVIKAAKCAHALQQVRPIQDKTATPKSCWRQQMNSLCNAAHSL
IEVIFADAVDIYRSNAKPTESHSNSPLSTILANIVKKSQDKDRSQLMMTRLKNVFTFIQA
MLVEIYPVPKPIQPRSILDVIVRSLSVSSQHVSLDVASVKVQALKTLDAMILCLGSNLIP
YSPLVFRLATQTLRWTSDNMSRTTGKVRCTAYSTLSKWLLTLHIYKMPEKNTWEDDLTAH
VVRDVTPAKRVVALTMGPQPTKNLSKKAKRKLANSQLLQSSIAAHMPGEKNKIDIPEEVN
NEVTVSALQFAEVFFTVCGRFLKPATHKLFQERVIRHHSAGETLLYLRVLEASRKTTPAT
VAPPTQYCLHIYSTLVNSSDAEISKFCSQALLDIRLHLYCSPPSINLAIEIPQDEEETAN
KRKKVSSKNRAMLEYLLGPDKVPRDKEDDIITIPDEPSNKKQRVDELDRISLSSDSTSTV
KIPYGTDISSDSDGDNVMEVDVVVEMNHTTGRERVLEAADVPKISINDEKQSSDQISLND
ITNDESNQADAITSEDVSDVIHEAPTQLNTSSGAPQVVYDHPDTGTGDVTVLERIDDENI
PNTNDTDEDAITCGQIVRSSQEIVNGNNEPEVNGVDKINEDSDVYNITTKDINKGEDNLA
AKIDGTSVEDMMADFVDEVNEAVAV