DPGLEAN08442 in OGS1.0

New model in OGS2.0DPOGS214207 
Genomic Positionscaffold49:- 48529-52950
See gene structure
CDS Length3372
Paired RNAseq reads  429
Single RNAseq reads  1276
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA005934 (0.0)
Best Drosophila hit  CG8211 (0.0)
Best Human hitintegrator complex subunit 2 (0.0)
Best NR hit (blastp)  PREDICTED: similar to integrator complex subunit 2 [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to integrator complex subunit 2 [Apis mellifera] (0.0)
GeneOntology terms






  
GO:0005622 intracellular
GO:0016020 membrane
GO:0016021 integral to membrane
GO:0005515 protein binding
GO:0016180 snRNA processing
GO:0032039 integrator complex
GO:0005634 nucleus
GO:0031965 nuclear membrane
InterPro families  ND
Orthology groupMCL14152

Nucleotide sequence:

ATGGATATCGAATTTATGAAACCCGTTAAGCCTCTAGTTTTCAAGGCTTTAAAAGATGTC
GATATTGAAACTTTAATAAAATGCACACCGGATGAAATAAGACCGATCATACCATGTCTA
GTCCGTATGGCTCTTATAGCACCTCTTGATATAACTAGATATTGTGCCGAGGCTAAAAAA
GACATCCTGACTCTACTATCTGGGATTGATCTAGTAAATTTCATCGTATCTTTACTGTCT
ATTGAATTTCATGCTCTAGAAGTGGATCTAAAGAAAGAACAACAAATGCGCTTAAAAAGT
GGATCCCAGAATACTGAATCCTTTTTAATACAGAATGTAGTAAATGGAATTGCAAATGAC
TTTGAACAGTCAGATTCCGCAAGAAGAGTCCGACTTGTTCTCTCTGAGTTGCTGCAGATG
CAAGCGCAGTTGGCAGAGTATAATCAGAATAAAAATTCAAATTCTGAATCTTCTATAAAA
CCATCGGAACTCTTTGATAATGAGGTGTATCTTGAAGAAATTACAGATGTTATTTGTATA
AGTCTTGCTGAATTACCAAATCTTTTAAATATATGTGAAATTGTTGAAGTATTACTGCAT
GTGAACAAGGGACCAATTATTATTTCTTGGGTTGTAGCAAATATGCCTGACACACTTTTA
GATGTGGCAGAATCTTTGGTTTTAAATGCTGAAAGAGGAGAAGAAGGTGGCATTAGAGCC
AAAACTTTATCCACATTATGTGACGCCTGTCCCTATATTGCAACAGCTGTTAGAGCAAAA
GCTGTATCTGCTTCCAGACTACCGTGTTTAATAATAAACCTCACTTTGACACATCATCAA
GACTTGGTATCCTTTATATCTGGTTTGCTATTGGGTTCAGACCAGAGTACCAGAACATGG
TTTGCAACATTCTTACGTAACTCCCATAAAAGGGGGAAAGGAGATGGCCATGCAATATTG
GTGAAGTTACGCCAAGAACTTCTGATTAGATTAAAAGAAGCTTCAGCTGGGGTTGATGCC
TCTGCATTATTAAGGTTATACTGTGCCTTGAGAGGAATCGCGGGAATAAAGTTCCAAGAT
GATGAGGTGTCAGGACTCTTACGACTTGTGACACAAAAGCCACCGCCAACTCCAGCTGGT
GTGAGATTTGTTTCCTTGAGTTTATGTATGATCCTAGCATGTCCTTCACTTATGGCTGCT
CCTGAATATGAGAAGAAAGCAATAGAATGGGTACAATGGCTTGTAAAGGAAGAAGCTTAT
TTTGAAAGCAATTCAGGCGTCACAGCTTCGTTTGGGGAGATGTTGCTGCTAATAGCAATC
CACTTCCACTCTGGACAGCTGACGGCCGTCGGTGAACTAGTCTGTGCTACACTTGGCATG
AGGGTCCCCGTGCGACCAAACGGACTTGCGAGGATCAAGCAGGCCTTCACACAGGAAATA
TTTACTGAGCAGGTCGTCACTGCACATGCTGTTAAAGTACCTGTCACTGCAAATCTCAAC
AGCAACATATCCGGTTATTTGCCTGTGCATTGTATTCACCAATTACTGAAGTCGCGAGCA
TTTTCGAAACATAAAGTGCCAATAAAAAATTGGATATATAGTCAAATTTGCAACTGTATT
GCTCCCTTACACCCTGTAATGCCAGCCCTCGTCGAAGTTTACGTCAATTCTATTCTGGTT
ATTAATAATAAAGGAACAAATGAATACTTCAACAAGCCAATAACAGAAGAAGAAATACGC
AGGGTATTCCGAAAATCTATTTTTGGTGTTAATTATGACTCAAACAGCAAACCATTTACT
TCTATGGATGTTGATAGTGATTCCACAGTTGACATAAACATTGAGAAACCAACTCTAGCC
TCACAACTATTATTGATCTATTACCTGCTCCTGTATGAAGATGTAAGATTGGCTAATACA
GCTATACTGATTGCCAATGGAAGAAAAGTGAAAAGTTATTCAACAACATTTCTTTCCGAA
TTGCCAATAAAGTATTTGCTACATCAAGCCCAGAAAGATCAAATGAGTTATGGTGGTCTT
TTCAGCCCGCTGCTTCGTTTGCTTGCGACTCATTTTCCGCAGCTATCGCTTGTAGATGAT
TGGATGGATGACCAGGTCTTTGGAGATTCCTGTCGTCACCAAATAGACATTAATCTTTCA
GAAGTATCTATAACTGAAGCATTCCAGTGCATCGAAGAAAATCCATATAAAACGGGTAAA
ATATTAAAAGCCATGCTTAATAAAAATCCTACTGACATATGGCCTTTTGCAGAAATATTT
GTTAAATACGTGAAGAGTGTGTTAGGAGGTAGAGTCCCAAGACATATACAAGAACTCTAC
AGAGAGGTTTGGTTGCGTTTAAACACGGTTCTACCCCGATGTTTGTGGATATTGACAATT
AACGCGTTGCTGGATATAAATAATGGATGCGGTAAATACGTTACCATAACACAGGAAAAC
GTTCTAGTTGATCCTTTACAAGTCTTAAGATGTGATATAAGAGTATTTAGATGTGGTCCT
ATATTAAAAATAATTCTGAGAATTTTAGAAGCGAGCTTAGCTGCATCGAGAAGCCAGTTA
AGTCGCCATTTATTGGACAAGCCACTTCTTGAAAAAAGCGGCCAATTGACATCAGACTCC
GAGAGGGAAGAATTGAAAAATGCCTTAGTTGCCGCTCAAGAAAGTGCAGCACTACAAATT
TTACTAGAAGCTTGTTTGGAGACTGAAGAAGACCAATCTAAACCCGAACTAATGTGGTCT
TTGAAAGAAGTACGAAGTATAATATGTTCGTTTTTACATCAAGTGTTTATAGCTGAGCCA
TCACTTGCAAAATTAGTACACTTCCAAGGATATCCGAGGGAATTATTGACAGTAACCGTC
CAAGGCATACCGTCAATGCACATATGTTTAGATTTTATTCCTGAACTTCTAAGTCAAGCT
TCTCTAGAGAAACAAATTTTTGCTGTGGACTTGGTATCTCATTTATCAATTCAGTATGCT
TTACCCAAAGCTATGTCCATTGCGAGGTTATGCGTGAATACTCTATCCACCCTCCTATCT
GTCCTACCAAGTGACCTGCGTCTGGAACTCTTCCAACCAGTTTTAAAATCGCTCGTACGG
ATTTGTATCGCATTTCCCTCCTTACTTGAAGATATTACATCGTTATTGTTACAGTTAGGT
CGAATTTGTGAATCTCAGGTATCACTTGGCCATTGTTGGAATGACACAAATATATTGGGC
GAAGGAGCTTATGTATCCTCTGAAGTTCACAATGACAGTAAAGTATTACTCGCCGAGGTT
TTATGTAGGGACATTAAATCAACAATGTCAGAAATTATACAGAAAGCACTTTTAAATGAT
AAACTGTATTGA

Protein sequence:

MDIEFMKPVKPLVFKALKDVDIETLIKCTPDEIRPIIPCLVRMALIAPLDITRYCAEAKK
DILTLLSGIDLVNFIVSLLSIEFHALEVDLKKEQQMRLKSGSQNTESFLIQNVVNGIAND
FEQSDSARRVRLVLSELLQMQAQLAEYNQNKNSNSESSIKPSELFDNEVYLEEITDVICI
SLAELPNLLNICEIVEVLLHVNKGPIIISWVVANMPDTLLDVAESLVLNAERGEEGGIRA
KTLSTLCDACPYIATAVRAKAVSASRLPCLIINLTLTHHQDLVSFISGLLLGSDQSTRTW
FATFLRNSHKRGKGDGHAILVKLRQELLIRLKEASAGVDASALLRLYCALRGIAGIKFQD
DEVSGLLRLVTQKPPPTPAGVRFVSLSLCMILACPSLMAAPEYEKKAIEWVQWLVKEEAY
FESNSGVTASFGEMLLLIAIHFHSGQLTAVGELVCATLGMRVPVRPNGLARIKQAFTQEI
FTEQVVTAHAVKVPVTANLNSNISGYLPVHCIHQLLKSRAFSKHKVPIKNWIYSQICNCI
APLHPVMPALVEVYVNSILVINNKGTNEYFNKPITEEEIRRVFRKSIFGVNYDSNSKPFT
SMDVDSDSTVDINIEKPTLASQLLLIYYLLLYEDVRLANTAILIANGRKVKSYSTTFLSE
LPIKYLLHQAQKDQMSYGGLFSPLLRLLATHFPQLSLVDDWMDDQVFGDSCRHQIDINLS
EVSITEAFQCIEENPYKTGKILKAMLNKNPTDIWPFAEIFVKYVKSVLGGRVPRHIQELY
REVWLRLNTVLPRCLWILTINALLDINNGCGKYVTITQENVLVDPLQVLRCDIRVFRCGP
ILKIILRILEASLAASRSQLSRHLLDKPLLEKSGQLTSDSEREELKNALVAAQESAALQI
LLEACLETEEDQSKPELMWSLKEVRSIICSFLHQVFIAEPSLAKLVHFQGYPRELLTVTV
QGIPSMHICLDFIPELLSQASLEKQIFAVDLVSHLSIQYALPKAMSIARLCVNTLSTLLS
VLPSDLRLELFQPVLKSLVRICIAFPSLLEDITSLLLQLGRICESQVSLGHCWNDTNILG
EGAYVSSEVHNDSKVLLAEVLCRDIKSTMSEIIQKALLNDKLY