New model in OGS2.0 | DPOGS206872  |
---|---|
Genomic Position | scaffold1:- 1908380-1911508 |
See gene structure | |
CDS Length | 1392 |
Paired RNAseq reads   | 260 |
Single RNAseq reads   | 727 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA013149 (2e-156) |
Best Drosophila hit   | Es2 (3e-61) |
Best Human hit | protein DGCR14 (6e-54) |
Best NR hit (blastp)   | AGAP001347-PA [Anopheles gambiae str. PEST] (2e-87) |
Best NR hit (blastx)   | AGAP001347-PA [Anopheles gambiae str. PEST] (2e-84) |
GeneOntology terms    | GO:0004757 sepiapterin reductase activity GO:0005634 nucleus GO:0007399 nervous system development GO:0000398 nuclear mRNA splicing, via spliceosome GO:0071013 catalytic step 2 spliceosome |
InterPro families   | IPR019148 Nuclear protein DGCR14 |
Orthology group | MCL13571 |
Nucleotide sequence:
ATGAAAATACTTAAAAATGAAAGTTCCCTTTTTAAGGTACCTAAAGTACCACCAATTAAG
AGAAAAGTCACGAAAACACATATCTTGGATGAAGAAGATTATGTACAGGGAATAGCACAG
ATCATACAGAGAGATTTCTTCCCAGATTTAGAAAAACTTAAGGCGCAAAATGATTACTTA
GAAGCTTCTGAAAACAAAGACTATGCCAGGCTACGACAAATTGCCAAAAAATACAGTGGG
CATAGACCACCTACAGAACCATATAATTCTCCTGCCACATTTGACACACCAAATGCCGAT
AGACCTTTTTCTCCATCCTCAGCAGAAGCACAATCAATACCTAAAGAAACTGTTGAGACC
TTTAAGGATATAACTGATAAACATACTTTAGATTCCTTCCTCGCGCTACATACAAGTGAA
GACAATGCAAGCTACAATCGCGTTATAGCACTTGAAAACAAGAAGAAAGCAAGTAAAATT
GCTTCCCAGTTGCAGTCAGAGGTCACCTCAGCTATTCAAGCTGACAATGCATTAGCATTG
CCATCTTTGGAGCAACAGGCTAATCAAGAAGAGAGACCTCATGAGCTGGACACATGGAGG
TATCGGGCCAAGAACTACATAATGTATGTGCCGGACGGGGCTGAATCTCAGCTGCCTAGC
CCCAAACCAGAGCTGCAACACCACAACACTAGACTTACAACACAAGTATTTGATTCAGCT
AAGAACAAAGAAGCGATAACTGCTTTGGCAAGAAGTCAGGATTCAGCTATCCAAGGCAAG
ATTGGTGTTGATGGTGTCAGTATAGGCGAGAAGCCGGAGTACAACTTCGTGTCGACTCCA
TCACCTAGACCAGGAGCTGGTCCGGACCAGTCCCCGCTCATGACCTGGGGAGAGATTGAG
GGCACTCCATTCAGGCTGGACGGAGGGGACACGCCTCTACCTGCTGTGGGCGCAGGCATG
GCTTATCGTATGCTGGAGTCGGGCTCCCGCGAGAGGATCGCACTGCAGCTGGCGGAGAGA
GCTGCGAAGCGGAGACGACCAACAACACCAACACATACCATGAAGACGCCTGGGAGTTTC
AGAACCAACACAGAGAGGTTGGCAAGCATGTCGCCAGCGGCAAGGAAATTGGCGGCAAAG
CATTTACTGTCGCCACGCTTGAAATTAACACCCAACGCCATGGGTATATCGCATAAAACA
CCAAAAATAACCCCGTCCCCAGGAACACCGTTGGTGGCAACTCCAAAAACACCATCTTCA
GCTAAAACATCTGAGAATCCATCTCAAACTCCAGAGAGCAGTTCACAGACGGATAAAAAT
CTCACAGATAATCTATTACAGATAAACCTTGCAAAGAGGACAAGGCTAAAAGCACAAGAT
TTCTTCAAATAA
Protein sequence:
MKILKNESSLFKVPKVPPIKRKVTKTHILDEEDYVQGIAQIIQRDFFPDLEKLKAQNDYL
EASENKDYARLRQIAKKYSGHRPPTEPYNSPATFDTPNADRPFSPSSAEAQSIPKETVET
FKDITDKHTLDSFLALHTSEDNASYNRVIALENKKKASKIASQLQSEVTSAIQADNALAL
PSLEQQANQEERPHELDTWRYRAKNYIMYVPDGAESQLPSPKPELQHHNTRLTTQVFDSA
KNKEAITALARSQDSAIQGKIGVDGVSIGEKPEYNFVSTPSPRPGAGPDQSPLMTWGEIE
GTPFRLDGGDTPLPAVGAGMAYRMLESGSRERIALQLAERAAKRRRPTTPTHTMKTPGSF
RTNTERLASMSPAARKLAAKHLLSPRLKLTPNAMGISHKTPKITPSPGTPLVATPKTPSS
AKTSENPSQTPESSSQTDKNLTDNLLQINLAKRTRLKAQDFFK