DPGLEAN21208 in OGS1.0

New model in OGS2.0  DPOGS211992
Genomic Position  scaffold1845:+ 20521-23422
CDS Length  1704
Paired RNAseq reads  419
Single RNAseq reads  1166
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000115 (4e-118)
Best Drosophila hit  CG13625 (1e-54)
Best Human hit  BUD13 homolog isoform 1 (9e-36)
Best NR hit (blastp)  AGAP011249-PA [Anopheles gambiae str. PEST] (2e-93)
Best NR hit (blastx)  AGAP011249-PA [Anopheles gambiae str. PEST] (3e-80)
GeneOntology terms
  GO:0000398 nuclear mRNA splicing, via spliceosome
  GO:0071013 catalytic step 2 spliceosome
InterPro families  IPR018609 Bud13
Orthology group  MCL14916

Nucleotide sequence:

ATGGCAACGGCAATAGACCAGAAAGCATATTTAAAAAAATATCTAACTTCTGCACCGGAC
GAAAAGAAGAAGAAAAAGAAAAAAAACATTAAGGGCAAGGGGTTCAAGATAATAGATGAT
GATTTAGACATATCCAAGCTGAGACCTCTCGACGGAGATGAATTGGACATATATAATGAG
GGTGAAGATGCTCCACAAGTTGTGGGCATCGTAGACGAGAGACCGGAGGAATTGAGAAAG
TTAGACGGTGACAGTACAACAAAATGGAAAGTTATCAACCAGGATGACGGTTTCAATAGT
AAGTTGAGAGTGGAAGAAATTAAGAAAGACAGAAAAGAAGTTGAGAAAGAAAATGAAATA
GTGTTCGGTAAAATGTACAGTGATTCTGAGGATGACAACAAGAATGATTCGGATCTAACT
CCACCAAGGAAAAATGGTGATAAGATAAGTCAACGTAGAGACAGTGATTTTAGTCCACCC
CGGAAGAGAAATAAATCTAATTCGGACTCCGATATATCACCACCAAGAAGAAATTCTAGA
AAGAATGAACATGATCATAGTCGACAGAAGAAAACAATGACGGTTAAATATGATTCTGAT
GTTTCCCCTCCTAGAAGAAATAAAGATGATAGATACACAAAGAAACAAAGCAGAACACAT
CATGAACGGACTCAGAAGAATTATGACTCTGATGTTTCACCTCCAAGAAGAGAAAGTCAT
CAGACGAAAAGAAAATCAAGAAAAGATAGTGATAGCGATTTAAGTCCACCGAGAAATAGG
TCACAGAAGAATTACGATTCCGATCTATCACCACCAAGGAAAAAAGATAAACCAAAAGAA
AAAGCAAGGAAACCATCTCGTTGGGGACATTTTGAGGATAAAGATGACAATAACATGTCA
CCAGATAGATCAAGAAAGCATAGCCCCACTAACCAAAGAAGTTCTCACAAAGAAGATAAG
TCTCATAATAGAAGTATAGAGAAGTATAGAAAAGATTCTGCCCGACAGAAGAATAATACA
CCAGATACTGATCTCACCCCAACACGTAAACCCAGATCACCTGAAAGAAAATCTAGACAT
GGCAATAGTGGCAAGATTATGGAAAAAACTTTAGACGGGAAACAAGCTGGCTTACAAGAT
GCTAAAAGACTGAAAGAAGAAAACGACTCATTTAGACGGAGAGAAAATGAGATGTTCAGA
AACATGACTGATGATATATCTGGCAGGAACGCTAAAGCGGTCTCAAGGAAAGGCAAAAGA
GAAACTTCAGAGGATCGCCAGAAACAAAAAGAAAAGGCGGAGAGACAAAAGGAACTCGAT
GAGAAATATAAAAAATGGAGTAAAGGTTTAAAACAAGTTGAAGATCAACAAGCAGCGTTA
CAGGATTACATCCACGAAGCGTCCAAACCACTGGCTCGATACAAAGATGATGTGGACCTT
GAAGATAGATTGAGAGACATAGATAGAGATGGAGATCCAATGTTGAAATATATAAGAGAT
AGGAAGAGAGAGAGGGGGGAGTTGGGACCAGAAAAACCATCATATAAAGGAAATTTCCCC
CCAAATCGCTTCAATATTAGACCGGGTTATCGCTGGGATGGAGTGGACAGGTCTAATGGT
TATGAAAAGAAATACTTTGAACAACAAAGCAAAAGGAGAGCTCAAGCCGAAGAAGCTTAC
AAGTGGAGCACAGAAGATCTTTAG

Protein sequence:

MATAIDQKAYLKKYLTSAPDEKKKKKKKNIKGKGFKIIDDDLDISKLRPLDGDELDIYNE
GEDAPQVVGIVDERPEELRKLDGDSTTKWKVINQDDGFNSKLRVEEIKKDRKEVEKENEI
VFGKMYSDSEDDNKNDSDLTPPRKNGDKISQRRDSDFSPPRKRNKSNSDSDISPPRRNSR
KNEHDHSRQKKTMTVKYDSDVSPPRRNKDDRYTKKQSRTHHERTQKNYDSDVSPPRRESH
QTKRKSRKDSDSDLSPPRNRSQKNYDSDLSPPRKKDKPKEKARKPSRWGHFEDKDDNNMS
PDRSRKHSPTNQRSSHKEDKSHNRSIEKYRKDSARQKNNTPDTDLTPTRKPRSPERKSRH
GNSGKIMEKTLDGKQAGLQDAKRLKEENDSFRRRENEMFRNMTDDISGRNAKAVSRKGKR
ETSEDRQKQKEKAERQKELDEKYKKWSKGLKQVEDQQAALQDYIHEASKPLARYKDDVDL
EDRLRDIDRDGDPMLKYIRDRKRERGELGPEKPSYKGNFPPNRFNIRPGYRWDGVDRSNG
YEKKYFEQQSKRRAQAEEAYKWSTEDL
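
Consistency check (illustrative only):

A CDS length of 1704 nucleotides corresponds to 568 codons, that is, 567 amino acid residues plus a terminal stop codon, which agrees with the length of the protein sequence listed above. The minimal Python sketch below translates the first and last lines of the nucleotide sequence so they can be compared with the start and end of the protein sequence. It assumes the standard genetic code (the record does not state which translation table was used), and the names `CODON_TABLE` and `translate` are hypothetical helpers introduced only for this illustration.

```python
from itertools import product

# Standard genetic code with codons enumerated in TCAG order.
# Assumption: the record does not say which translation table applies.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate an in-frame nucleotide string; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide sequence in this record.
# The last line is in frame because the 1680 nucleotides before it
# are a whole number of codons.
first_line = "ATGGCAACGGCAATAGACCAGAAAGCATATTTAAAAAAATATCTAACTTCTGCACCGGAC"
last_line = "AAGTGGAGCACAGAAGATCTTTAG"

print(translate(first_line))  # MATAIDQKAYLKKYLTSAPD
print(translate(last_line))   # KWSTEDL*
print(1704 // 3 - 1)          # 567 residues, excluding the stop codon
```

To check the whole record, the same function could be applied to all nucleotide lines joined together with whitespace removed, and the result (minus the trailing `*`) compared with the joined protein lines.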