DPGLEAN15443 in OGS1.0

New model in OGS2.0DPOGS206872 
Genomic Positionscaffold1:- 1908380-1911508
See gene structure
CDS Length1392
Paired RNAseq reads  260
Single RNAseq reads  727
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA013149 (2e-156)
Best Drosophila hit  Es2 (3e-61)
Best Human hitprotein DGCR14 (6e-54)
Best NR hit (blastp)  AGAP001347-PA [Anopheles gambiae str. PEST] (2e-87)
Best NR hit (blastx)  AGAP001347-PA [Anopheles gambiae str. PEST] (2e-84)
GeneOntology terms



  
GO:0004757 sepiapterin reductase activity
GO:0005634 nucleus
GO:0007399 nervous system development
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0071013 catalytic step 2 spliceosome
InterPro families  IPR019148 Nuclear protein DGCR14
Orthology groupMCL13571

Nucleotide sequence:

ATGAAAATACTTAAAAATGAAAGTTCCCTTTTTAAGGTACCTAAAGTACCACCAATTAAG
AGAAAAGTCACGAAAACACATATCTTGGATGAAGAAGATTATGTACAGGGAATAGCACAG
ATCATACAGAGAGATTTCTTCCCAGATTTAGAAAAACTTAAGGCGCAAAATGATTACTTA
GAAGCTTCTGAAAACAAAGACTATGCCAGGCTACGACAAATTGCCAAAAAATACAGTGGG
CATAGACCACCTACAGAACCATATAATTCTCCTGCCACATTTGACACACCAAATGCCGAT
AGACCTTTTTCTCCATCCTCAGCAGAAGCACAATCAATACCTAAAGAAACTGTTGAGACC
TTTAAGGATATAACTGATAAACATACTTTAGATTCCTTCCTCGCGCTACATACAAGTGAA
GACAATGCAAGCTACAATCGCGTTATAGCACTTGAAAACAAGAAGAAAGCAAGTAAAATT
GCTTCCCAGTTGCAGTCAGAGGTCACCTCAGCTATTCAAGCTGACAATGCATTAGCATTG
CCATCTTTGGAGCAACAGGCTAATCAAGAAGAGAGACCTCATGAGCTGGACACATGGAGG
TATCGGGCCAAGAACTACATAATGTATGTGCCGGACGGGGCTGAATCTCAGCTGCCTAGC
CCCAAACCAGAGCTGCAACACCACAACACTAGACTTACAACACAAGTATTTGATTCAGCT
AAGAACAAAGAAGCGATAACTGCTTTGGCAAGAAGTCAGGATTCAGCTATCCAAGGCAAG
ATTGGTGTTGATGGTGTCAGTATAGGCGAGAAGCCGGAGTACAACTTCGTGTCGACTCCA
TCACCTAGACCAGGAGCTGGTCCGGACCAGTCCCCGCTCATGACCTGGGGAGAGATTGAG
GGCACTCCATTCAGGCTGGACGGAGGGGACACGCCTCTACCTGCTGTGGGCGCAGGCATG
GCTTATCGTATGCTGGAGTCGGGCTCCCGCGAGAGGATCGCACTGCAGCTGGCGGAGAGA
GCTGCGAAGCGGAGACGACCAACAACACCAACACATACCATGAAGACGCCTGGGAGTTTC
AGAACCAACACAGAGAGGTTGGCAAGCATGTCGCCAGCGGCAAGGAAATTGGCGGCAAAG
CATTTACTGTCGCCACGCTTGAAATTAACACCCAACGCCATGGGTATATCGCATAAAACA
CCAAAAATAACCCCGTCCCCAGGAACACCGTTGGTGGCAACTCCAAAAACACCATCTTCA
GCTAAAACATCTGAGAATCCATCTCAAACTCCAGAGAGCAGTTCACAGACGGATAAAAAT
CTCACAGATAATCTATTACAGATAAACCTTGCAAAGAGGACAAGGCTAAAAGCACAAGAT
TTCTTCAAATAA

Protein sequence:

MKILKNESSLFKVPKVPPIKRKVTKTHILDEEDYVQGIAQIIQRDFFPDLEKLKAQNDYL
EASENKDYARLRQIAKKYSGHRPPTEPYNSPATFDTPNADRPFSPSSAEAQSIPKETVET
FKDITDKHTLDSFLALHTSEDNASYNRVIALENKKKASKIASQLQSEVTSAIQADNALAL
PSLEQQANQEERPHELDTWRYRAKNYIMYVPDGAESQLPSPKPELQHHNTRLTTQVFDSA
KNKEAITALARSQDSAIQGKIGVDGVSIGEKPEYNFVSTPSPRPGAGPDQSPLMTWGEIE
GTPFRLDGGDTPLPAVGAGMAYRMLESGSRERIALQLAERAAKRRRPTTPTHTMKTPGSF
RTNTERLASMSPAARKLAAKHLLSPRLKLTPNAMGISHKTPKITPSPGTPLVATPKTPSS
AKTSENPSQTPESSSQTDKNLTDNLLQINLAKRTRLKAQDFFK