DPGLEAN14300 in OGS1.0

New model in OGS2.0DPOGS209769 
Genomic Positionscaffold1680:- 39692-46131
See gene structure
CDS Length2319
Paired RNAseq reads  652
Single RNAseq reads  1473
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA010283 (0.0)
Best Drosophila hit  septin interacting protein 1 (2e-125)
Best Human hittuftelin-interacting protein 11 (9e-101)
Best NR hit (blastp)  septin and tuftelin interacting protein [Bombyx mori] (0.0)
Best NR hit (blastx)  septin and tuftelin interacting protein [Bombyx mori] (0.0)
GeneOntology terms




  
GO:0005578 proteinaceous extracellular matrix
GO:0045449 regulation of transcription
GO:0003700 sequence-specific DNA binding transcription factor activity
GO:0071013 catalytic step 2 spliceosome
GO:0000398 nuclear mRNA splicing, via spliceosome
GO:0071011 precatalytic spliceosome
InterPro families

  
IPR000467 D111/G-patch
IPR022783 GC-rich sequence DNA-binding factor domain
IPR022159 Tuftelin interacting protein N-terminal
Orthology groupMCL13045

Nucleotide sequence:

ATGTCTGATGATGAGGTTATACGTTTTGAAATCACCGACTACGATTTGGATAATGAATTC
AATCCCAACAGAAGTCGGAGGGCTAAGAAGGAACACCAAATATACGGTGTTTGGTCGAAA
GATAGTGATGACGAGGAAAATGAAGACAATATAAGGCGACGTATACGTAAACCGAAAGAT
TTTTCAGCTCCAATAGATTTTGTAACTGGTGGAGTGCAGCAGGCCGGCAAGAAGAAGGAT
GAAAAGCAAGACATACAAAAATCGGAGTCGTCTACATCTCGTCCCAAATTTGCGGATAGT
TCTGATGATGAAGTTTTGGAACCGGAAGCGCGGGAGACTGCGGGGATAAGAAAAGCCGGA
CAGGGTTTGAGATCTGGACAAAATTTAGGTGGAGTTGGTGCTTGGGAGAGACATACTAAA
GGCATTGGAGCTAAATTATTATTACAGATGGGGTATCAACCTGGTAAGGGTTTGGGTAAA
GAGCTGCAAGGTATCTCCGCTCCCGTAGAAGCTACAGTCAGGAAGGGCAGAGGTGCTATT
GGGGCATATGGACCTGAAAAGGCTGCGCAAAAAGCTAAAAAGGAAGAACAGAAGCGGCTG
AAAGAAAAAGAGGGAGATAAAAGTACTACAGAAAAGAGTTATAACTGGAAGAAATCACAT
AAGGGCAGATACTTCTACCGAGATGCAGCCGATGTCATACAAGAGGGTAAACCCACCATG
CATACTATTAGCAGTAACGAGCTGGCCCGCGTGCCGGTTATAGACCTGACCGGCCGGGAG
AAGAAAGTATTGAGCGGCTACCACGCCCTGCGAGCCGCCGCGCCGCGGTATGAACACGAA
CCCCGGAGGGAGTGCACTAACTTCGCAGCACCAGAACTCACTCACAACTTGCAGCTGATG
GTGGATTGTTGTGAACAGGACATTATCCAAAACGCTCGCGAACTCCAACAGTCTGAGGAC
GAGATCGTGGTGATAGAGCGTGATCTCGAAGACTGTAAGATAAAGTTAGGTGAAGAAGAT
GACGTCATCAGGACGCTGGAAGGCATACTGGCGAGGGTGGAGGTACTGAACAGGCCGGAC
GCCTCGCTCGAGATGGGCTATGAGGTGCTGAGGGATCTCAAGGAAACATACCCCTTGGAA
TATGAGATGTTCAGTCTGGGTAATATAGGGGGTAACATAGTGAGTCCCCTATTCAGTGCC
ATGATGGCCTCGTGGAGTCCCCTGACGGACCCCGGGGGCGTAGCACCCGTGTTCCTCAAG
TGGAGGCCCCTTCTGACGGAAGAGGCCTACAATAATCTTCTATGGCAACATTTTGGAACT
AAAATCGAAATGACCGTTGAAGAGTGGAATCCTCGCAATCCGGAGCCTATGGTCCTCGTG
TTCAAGTCGTGGGTGTCCGTGTGTCCGTCCTGGCTGGTGAGCTCGTGTGTCACGAGATAC
GTCGCCCCCCGCCTGGTGACCGCGGTCCGGGACTGGGATCCCACGGGAGATACGCAGCCC
TTACACCAGTGGGTGCTGCCCTGGCACGAGTTCGCTGGGGAAGCTCTCAACGCTTCCGTG
TATCCGTTGATTCGTTCCCGGCTGTCGTCAGCGCTGGCTGCCTGGCACCCCGCGGACTCG
TCGGCCAGGCCGCTGCTGGCGGTTTGGCGAAGTTCGTGGGGCCCCGCTCTCACGACCCTC
TTACATCATCACATCGTACCTAAACTGGAGCACTGTTTGCAGAACGCTCCTTTGGAACTC
GTAGGAAGGGAGAATACCGCGTGGCTCTGGTGTGTGGATTGGGTGGAGTTGATCGGCGCC
CCGACAATAGCCAGCCTGGCGGGTCGAGCTCTAATGCCGCGCTGGTTGGCGGCATTGGCC
GCTTGGCTGAACACTTCCCCACCGCACGCCACTGTACTAAACTCGTACACGGAGTTCAAG
AAAATGTTCCCGGAAGACGTCCTCAAAGAACCGCCCGTGCGTGACGCGTTCCGCAAAGCC
TTGGACATGATGAACAGGAGTACGGACATAGATTCCATAGAGCCGCCCCCACCGCCGCGC
TTCACCATACCAGAACCGAAAGAATCTTCCAGGATAAGCGACGTCCTGGCAACGATAACG
CAACAGAAGAGCTTTTCAGAACTGCTCGAATCCAGGTGCATAGAACGCGGCATCACTTTT
GTGCCTATAGTGGGCAAGAGTAGGGAGGGCAGGCCGTTGTATAAGATCGGTGAACTGCAG
TGTTACGTCATCAGGAACGTCATCATGTACTCCGATGATGGCGGACGGAGCTTCGGGCCC
ATCGGCCTCGATAGGCTCCTGAGTTTGGTTGAGGATTAA

Protein sequence:

MSDDEVIRFEITDYDLDNEFNPNRSRRAKKEHQIYGVWSKDSDDEENEDNIRRRIRKPKD
FSAPIDFVTGGVQQAGKKKDEKQDIQKSESSTSRPKFADSSDDEVLEPEARETAGIRKAG
QGLRSGQNLGGVGAWERHTKGIGAKLLLQMGYQPGKGLGKELQGISAPVEATVRKGRGAI
GAYGPEKAAQKAKKEEQKRLKEKEGDKSTTEKSYNWKKSHKGRYFYRDAADVIQEGKPTM
HTISSNELARVPVIDLTGREKKVLSGYHALRAAAPRYEHEPRRECTNFAAPELTHNLQLM
VDCCEQDIIQNARELQQSEDEIVVIERDLEDCKIKLGEEDDVIRTLEGILARVEVLNRPD
ASLEMGYEVLRDLKETYPLEYEMFSLGNIGGNIVSPLFSAMMASWSPLTDPGGVAPVFLK
WRPLLTEEAYNNLLWQHFGTKIEMTVEEWNPRNPEPMVLVFKSWVSVCPSWLVSSCVTRY
VAPRLVTAVRDWDPTGDTQPLHQWVLPWHEFAGEALNASVYPLIRSRLSSALAAWHPADS
SARPLLAVWRSSWGPALTTLLHHHIVPKLEHCLQNAPLELVGRENTAWLWCVDWVELIGA
PTIASLAGRALMPRWLAALAAWLNTSPPHATVLNSYTEFKKMFPEDVLKEPPVRDAFRKA
LDMMNRSTDIDSIEPPPPPRFTIPEPKESSRISDVLATITQQKSFSELLESRCIERGITF
VPIVGKSREGRPLYKIGELQCYVIRNVIMYSDDGGRSFGPIGLDRLLSLVED