New model in OGS2.0 | DPOGS211992  |
---|---|
Genomic Position | scaffold1845:+ 20521-23422 |
See gene structure | |
CDS Length | 1704 |
Paired RNAseq reads   | 419 |
Single RNAseq reads   | 1166 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA000115 (4e-118) |
Best Drosophila hit   | CG13625 (1e-54) |
Best Human hit | BUD13 homolog isoform 1 (9e-36) |
Best NR hit (blastp)   | AGAP011249-PA [Anopheles gambiae str. PEST] (2e-93) |
Best NR hit (blastx)   | AGAP011249-PA [Anopheles gambiae str. PEST] (3e-80) |
GeneOntology terms    | GO:0000398 nuclear mRNA splicing, via spliceosome GO:0071013 catalytic step 2 spliceosome |
InterPro families   | IPR018609 Bud13 |
Orthology group | MCL14916 |
Nucleotide sequence:
ATGGCAACGGCAATAGACCAGAAAGCATATTTAAAAAAATATCTAACTTCTGCACCGGAC
GAAAAGAAGAAGAAAAAGAAAAAAAACATTAAGGGCAAGGGGTTCAAGATAATAGATGAT
GATTTAGACATATCCAAGCTGAGACCTCTCGACGGAGATGAATTGGACATATATAATGAG
GGTGAAGATGCTCCACAAGTTGTGGGCATCGTAGACGAGAGACCGGAGGAATTGAGAAAG
TTAGACGGTGACAGTACAACAAAATGGAAAGTTATCAACCAGGATGACGGTTTCAATAGT
AAGTTGAGAGTGGAAGAAATTAAGAAAGACAGAAAAGAAGTTGAGAAAGAAAATGAAATA
GTGTTCGGTAAAATGTACAGTGATTCTGAGGATGACAACAAGAATGATTCGGATCTAACT
CCACCAAGGAAAAATGGTGATAAGATAAGTCAACGTAGAGACAGTGATTTTAGTCCACCC
CGGAAGAGAAATAAATCTAATTCGGACTCCGATATATCACCACCAAGAAGAAATTCTAGA
AAGAATGAACATGATCATAGTCGACAGAAGAAAACAATGACGGTTAAATATGATTCTGAT
GTTTCCCCTCCTAGAAGAAATAAAGATGATAGATACACAAAGAAACAAAGCAGAACACAT
CATGAACGGACTCAGAAGAATTATGACTCTGATGTTTCACCTCCAAGAAGAGAAAGTCAT
CAGACGAAAAGAAAATCAAGAAAAGATAGTGATAGCGATTTAAGTCCACCGAGAAATAGG
TCACAGAAGAATTACGATTCCGATCTATCACCACCAAGGAAAAAAGATAAACCAAAAGAA
AAAGCAAGGAAACCATCTCGTTGGGGACATTTTGAGGATAAAGATGACAATAACATGTCA
CCAGATAGATCAAGAAAGCATAGCCCCACTAACCAAAGAAGTTCTCACAAAGAAGATAAG
TCTCATAATAGAAGTATAGAGAAGTATAGAAAAGATTCTGCCCGACAGAAGAATAATACA
CCAGATACTGATCTCACCCCAACACGTAAACCCAGATCACCTGAAAGAAAATCTAGACAT
GGCAATAGTGGCAAGATTATGGAAAAAACTTTAGACGGGAAACAAGCTGGCTTACAAGAT
GCTAAAAGACTGAAAGAAGAAAACGACTCATTTAGACGGAGAGAAAATGAGATGTTCAGA
AACATGACTGATGATATATCTGGCAGGAACGCTAAAGCGGTCTCAAGGAAAGGCAAAAGA
GAAACTTCAGAGGATCGCCAGAAACAAAAAGAAAAGGCGGAGAGACAAAAGGAACTCGAT
GAGAAATATAAAAAATGGAGTAAAGGTTTAAAACAAGTTGAAGATCAACAAGCAGCGTTA
CAGGATTACATCCACGAAGCGTCCAAACCACTGGCTCGATACAAAGATGATGTGGACCTT
GAAGATAGATTGAGAGACATAGATAGAGATGGAGATCCAATGTTGAAATATATAAGAGAT
AGGAAGAGAGAGAGGGGGGAGTTGGGACCAGAAAAACCATCATATAAAGGAAATTTCCCC
CCAAATCGCTTCAATATTAGACCGGGTTATCGCTGGGATGGAGTGGACAGGTCTAATGGT
TATGAAAAGAAATACTTTGAACAACAAAGCAAAAGGAGAGCTCAAGCCGAAGAAGCTTAC
AAGTGGAGCACAGAAGATCTTTAG
Protein sequence:
MATAIDQKAYLKKYLTSAPDEKKKKKKKNIKGKGFKIIDDDLDISKLRPLDGDELDIYNE
GEDAPQVVGIVDERPEELRKLDGDSTTKWKVINQDDGFNSKLRVEEIKKDRKEVEKENEI
VFGKMYSDSEDDNKNDSDLTPPRKNGDKISQRRDSDFSPPRKRNKSNSDSDISPPRRNSR
KNEHDHSRQKKTMTVKYDSDVSPPRRNKDDRYTKKQSRTHHERTQKNYDSDVSPPRRESH
QTKRKSRKDSDSDLSPPRNRSQKNYDSDLSPPRKKDKPKEKARKPSRWGHFEDKDDNNMS
PDRSRKHSPTNQRSSHKEDKSHNRSIEKYRKDSARQKNNTPDTDLTPTRKPRSPERKSRH
GNSGKIMEKTLDGKQAGLQDAKRLKEENDSFRRRENEMFRNMTDDISGRNAKAVSRKGKR
ETSEDRQKQKEKAERQKELDEKYKKWSKGLKQVEDQQAALQDYIHEASKPLARYKDDVDL
EDRLRDIDRDGDPMLKYIRDRKRERGELGPEKPSYKGNFPPNRFNIRPGYRWDGVDRSNG
YEKKYFEQQSKRRAQAEEAYKWSTEDL