New model in OGS2.0 | DPOGS209769  |
---|---|
Genomic Position | scaffold1680:- 39692-46131 |
CDS Length | 2319 |
Paired RNAseq reads   | 652 |
Single RNAseq reads   | 1473 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010283 (0.0) |
Best Drosophila hit   | septin interacting protein 1 (2e-125) |
Best Human hit | tuftelin-interacting protein 11 (9e-101) |
Best NR hit (blastp)   | septin and tuftelin interacting protein [Bombyx mori] (0.0) |
Best NR hit (blastx)   | septin and tuftelin interacting protein [Bombyx mori] (0.0) |
GeneOntology terms | GO:0005578 proteinaceous extracellular matrix; GO:0045449 regulation of transcription; GO:0003700 sequence-specific DNA binding transcription factor activity; GO:0071013 catalytic step 2 spliceosome; GO:0000398 nuclear mRNA splicing, via spliceosome; GO:0071011 precatalytic spliceosome |
InterPro families | IPR000467 D111/G-patch; IPR022783 GC-rich sequence DNA-binding factor domain; IPR022159 Tuftelin interacting protein N-terminal |
Orthology group | MCL13045 |
Nucleotide sequence:
ATGTCTGATGATGAGGTTATACGTTTTGAAATCACCGACTACGATTTGGATAATGAATTC
AATCCCAACAGAAGTCGGAGGGCTAAGAAGGAACACCAAATATACGGTGTTTGGTCGAAA
GATAGTGATGACGAGGAAAATGAAGACAATATAAGGCGACGTATACGTAAACCGAAAGAT
TTTTCAGCTCCAATAGATTTTGTAACTGGTGGAGTGCAGCAGGCCGGCAAGAAGAAGGAT
GAAAAGCAAGACATACAAAAATCGGAGTCGTCTACATCTCGTCCCAAATTTGCGGATAGT
TCTGATGATGAAGTTTTGGAACCGGAAGCGCGGGAGACTGCGGGGATAAGAAAAGCCGGA
CAGGGTTTGAGATCTGGACAAAATTTAGGTGGAGTTGGTGCTTGGGAGAGACATACTAAA
GGCATTGGAGCTAAATTATTATTACAGATGGGGTATCAACCTGGTAAGGGTTTGGGTAAA
GAGCTGCAAGGTATCTCCGCTCCCGTAGAAGCTACAGTCAGGAAGGGCAGAGGTGCTATT
GGGGCATATGGACCTGAAAAGGCTGCGCAAAAAGCTAAAAAGGAAGAACAGAAGCGGCTG
AAAGAAAAAGAGGGAGATAAAAGTACTACAGAAAAGAGTTATAACTGGAAGAAATCACAT
AAGGGCAGATACTTCTACCGAGATGCAGCCGATGTCATACAAGAGGGTAAACCCACCATG
CATACTATTAGCAGTAACGAGCTGGCCCGCGTGCCGGTTATAGACCTGACCGGCCGGGAG
AAGAAAGTATTGAGCGGCTACCACGCCCTGCGAGCCGCCGCGCCGCGGTATGAACACGAA
CCCCGGAGGGAGTGCACTAACTTCGCAGCACCAGAACTCACTCACAACTTGCAGCTGATG
GTGGATTGTTGTGAACAGGACATTATCCAAAACGCTCGCGAACTCCAACAGTCTGAGGAC
GAGATCGTGGTGATAGAGCGTGATCTCGAAGACTGTAAGATAAAGTTAGGTGAAGAAGAT
GACGTCATCAGGACGCTGGAAGGCATACTGGCGAGGGTGGAGGTACTGAACAGGCCGGAC
GCCTCGCTCGAGATGGGCTATGAGGTGCTGAGGGATCTCAAGGAAACATACCCCTTGGAA
TATGAGATGTTCAGTCTGGGTAATATAGGGGGTAACATAGTGAGTCCCCTATTCAGTGCC
ATGATGGCCTCGTGGAGTCCCCTGACGGACCCCGGGGGCGTAGCACCCGTGTTCCTCAAG
TGGAGGCCCCTTCTGACGGAAGAGGCCTACAATAATCTTCTATGGCAACATTTTGGAACT
AAAATCGAAATGACCGTTGAAGAGTGGAATCCTCGCAATCCGGAGCCTATGGTCCTCGTG
TTCAAGTCGTGGGTGTCCGTGTGTCCGTCCTGGCTGGTGAGCTCGTGTGTCACGAGATAC
GTCGCCCCCCGCCTGGTGACCGCGGTCCGGGACTGGGATCCCACGGGAGATACGCAGCCC
TTACACCAGTGGGTGCTGCCCTGGCACGAGTTCGCTGGGGAAGCTCTCAACGCTTCCGTG
TATCCGTTGATTCGTTCCCGGCTGTCGTCAGCGCTGGCTGCCTGGCACCCCGCGGACTCG
TCGGCCAGGCCGCTGCTGGCGGTTTGGCGAAGTTCGTGGGGCCCCGCTCTCACGACCCTC
TTACATCATCACATCGTACCTAAACTGGAGCACTGTTTGCAGAACGCTCCTTTGGAACTC
GTAGGAAGGGAGAATACCGCGTGGCTCTGGTGTGTGGATTGGGTGGAGTTGATCGGCGCC
CCGACAATAGCCAGCCTGGCGGGTCGAGCTCTAATGCCGCGCTGGTTGGCGGCATTGGCC
GCTTGGCTGAACACTTCCCCACCGCACGCCACTGTACTAAACTCGTACACGGAGTTCAAG
AAAATGTTCCCGGAAGACGTCCTCAAAGAACCGCCCGTGCGTGACGCGTTCCGCAAAGCC
TTGGACATGATGAACAGGAGTACGGACATAGATTCCATAGAGCCGCCCCCACCGCCGCGC
TTCACCATACCAGAACCGAAAGAATCTTCCAGGATAAGCGACGTCCTGGCAACGATAACG
CAACAGAAGAGCTTTTCAGAACTGCTCGAATCCAGGTGCATAGAACGCGGCATCACTTTT
GTGCCTATAGTGGGCAAGAGTAGGGAGGGCAGGCCGTTGTATAAGATCGGTGAACTGCAG
TGTTACGTCATCAGGAACGTCATCATGTACTCCGATGATGGCGGACGGAGCTTCGGGCCC
ATCGGCCTCGATAGGCTCCTGAGTTTGGTTGAGGATTAA
Protein sequence:
MSDDEVIRFEITDYDLDNEFNPNRSRRAKKEHQIYGVWSKDSDDEENEDNIRRRIRKPKD
FSAPIDFVTGGVQQAGKKKDEKQDIQKSESSTSRPKFADSSDDEVLEPEARETAGIRKAG
QGLRSGQNLGGVGAWERHTKGIGAKLLLQMGYQPGKGLGKELQGISAPVEATVRKGRGAI
GAYGPEKAAQKAKKEEQKRLKEKEGDKSTTEKSYNWKKSHKGRYFYRDAADVIQEGKPTM
HTISSNELARVPVIDLTGREKKVLSGYHALRAAAPRYEHEPRRECTNFAAPELTHNLQLM
VDCCEQDIIQNARELQQSEDEIVVIERDLEDCKIKLGEEDDVIRTLEGILARVEVLNRPD
ASLEMGYEVLRDLKETYPLEYEMFSLGNIGGNIVSPLFSAMMASWSPLTDPGGVAPVFLK
WRPLLTEEAYNNLLWQHFGTKIEMTVEEWNPRNPEPMVLVFKSWVSVCPSWLVSSCVTRY
VAPRLVTAVRDWDPTGDTQPLHQWVLPWHEFAGEALNASVYPLIRSRLSSALAAWHPADS
SARPLLAVWRSSWGPALTTLLHHHIVPKLEHCLQNAPLELVGRENTAWLWCVDWVELIGA
PTIASLAGRALMPRWLAALAAWLNTSPPHATVLNSYTEFKKMFPEDVLKEPPVRDAFRKA
LDMMNRSTDIDSIEPPPPPRFTIPEPKESSRISDVLATITQQKSFSELLESRCIERGITF
VPIVGKSREGRPLYKIGELQCYVIRNVIMYSDDGGRSFGPIGLDRLLSLVED
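
The listed CDS length and the two sequences can be cross-checked against each other: 2319 nt is 773 codons, which corresponds to the 772-residue protein plus the terminal TAA stop codon. The following is a minimal sketch of such a consistency check, not part of the original record. The function name `check_cds` and the toy input (the first five codons of this record followed by a stop codon) are illustrative assumptions; in practice the full nucleotide and protein sequences above would be passed in.

```python
# Hypothetical helper for sanity-checking a CDS against its protein translation.
# It tests structural consistency only; it does not translate the sequence.
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def check_cds(cds: str, protein: str) -> dict:
    """Return simple consistency checks between a CDS and its protein."""
    # Strip line breaks and spaces so multi-line sequence blocks can be pasted in.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    return {
        "length_multiple_of_3": len(cds) % 3 == 0,
        "starts_with_ATG": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        # The stop codon is not represented in the protein sequence.
        "protein_length_matches": len(cds) // 3 - 1 == len(protein),
    }


# Toy input: first five codons of the record (ATG TCT GAT GAT GAG) plus a stop.
toy_cds = "ATGTCTGATGATGAG" + "TAA"
toy_protein = "MSDDE"
result = check_cds(toy_cds, toy_protein)
```

For the full record, all four checks would be expected to pass if the CDS length of 2319 and the 772-residue protein are consistent as described.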