New model in OGS2.0 | DPOGS214207  |
---|---|
Genomic Position | scaffold49:48529-52950 (- strand) |
CDS Length | 3372 |
Paired RNAseq reads   | 429 |
Single RNAseq reads   | 1276 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA005934 (0.0) |
Best Drosophila hit   | CG8211 (0.0) |
Best Human hit | integrator complex subunit 2 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to integrator complex subunit 2 [Apis mellifera] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to integrator complex subunit 2 [Apis mellifera] (0.0) |
GeneOntology terms | GO:0005622 intracellular; GO:0016020 membrane; GO:0016021 integral to membrane; GO:0005515 protein binding; GO:0016180 snRNA processing; GO:0032039 integrator complex; GO:0005634 nucleus; GO:0031965 nuclear membrane |
InterPro families   | ND |
Orthology group | MCL14152 |
Nucleotide sequence:
ATGGATATCGAATTTATGAAACCCGTTAAGCCTCTAGTTTTCAAGGCTTTAAAAGATGTC
GATATTGAAACTTTAATAAAATGCACACCGGATGAAATAAGACCGATCATACCATGTCTA
GTCCGTATGGCTCTTATAGCACCTCTTGATATAACTAGATATTGTGCCGAGGCTAAAAAA
GACATCCTGACTCTACTATCTGGGATTGATCTAGTAAATTTCATCGTATCTTTACTGTCT
ATTGAATTTCATGCTCTAGAAGTGGATCTAAAGAAAGAACAACAAATGCGCTTAAAAAGT
GGATCCCAGAATACTGAATCCTTTTTAATACAGAATGTAGTAAATGGAATTGCAAATGAC
TTTGAACAGTCAGATTCCGCAAGAAGAGTCCGACTTGTTCTCTCTGAGTTGCTGCAGATG
CAAGCGCAGTTGGCAGAGTATAATCAGAATAAAAATTCAAATTCTGAATCTTCTATAAAA
CCATCGGAACTCTTTGATAATGAGGTGTATCTTGAAGAAATTACAGATGTTATTTGTATA
AGTCTTGCTGAATTACCAAATCTTTTAAATATATGTGAAATTGTTGAAGTATTACTGCAT
GTGAACAAGGGACCAATTATTATTTCTTGGGTTGTAGCAAATATGCCTGACACACTTTTA
GATGTGGCAGAATCTTTGGTTTTAAATGCTGAAAGAGGAGAAGAAGGTGGCATTAGAGCC
AAAACTTTATCCACATTATGTGACGCCTGTCCCTATATTGCAACAGCTGTTAGAGCAAAA
GCTGTATCTGCTTCCAGACTACCGTGTTTAATAATAAACCTCACTTTGACACATCATCAA
GACTTGGTATCCTTTATATCTGGTTTGCTATTGGGTTCAGACCAGAGTACCAGAACATGG
TTTGCAACATTCTTACGTAACTCCCATAAAAGGGGGAAAGGAGATGGCCATGCAATATTG
GTGAAGTTACGCCAAGAACTTCTGATTAGATTAAAAGAAGCTTCAGCTGGGGTTGATGCC
TCTGCATTATTAAGGTTATACTGTGCCTTGAGAGGAATCGCGGGAATAAAGTTCCAAGAT
GATGAGGTGTCAGGACTCTTACGACTTGTGACACAAAAGCCACCGCCAACTCCAGCTGGT
GTGAGATTTGTTTCCTTGAGTTTATGTATGATCCTAGCATGTCCTTCACTTATGGCTGCT
CCTGAATATGAGAAGAAAGCAATAGAATGGGTACAATGGCTTGTAAAGGAAGAAGCTTAT
TTTGAAAGCAATTCAGGCGTCACAGCTTCGTTTGGGGAGATGTTGCTGCTAATAGCAATC
CACTTCCACTCTGGACAGCTGACGGCCGTCGGTGAACTAGTCTGTGCTACACTTGGCATG
AGGGTCCCCGTGCGACCAAACGGACTTGCGAGGATCAAGCAGGCCTTCACACAGGAAATA
TTTACTGAGCAGGTCGTCACTGCACATGCTGTTAAAGTACCTGTCACTGCAAATCTCAAC
AGCAACATATCCGGTTATTTGCCTGTGCATTGTATTCACCAATTACTGAAGTCGCGAGCA
TTTTCGAAACATAAAGTGCCAATAAAAAATTGGATATATAGTCAAATTTGCAACTGTATT
GCTCCCTTACACCCTGTAATGCCAGCCCTCGTCGAAGTTTACGTCAATTCTATTCTGGTT
ATTAATAATAAAGGAACAAATGAATACTTCAACAAGCCAATAACAGAAGAAGAAATACGC
AGGGTATTCCGAAAATCTATTTTTGGTGTTAATTATGACTCAAACAGCAAACCATTTACT
TCTATGGATGTTGATAGTGATTCCACAGTTGACATAAACATTGAGAAACCAACTCTAGCC
TCACAACTATTATTGATCTATTACCTGCTCCTGTATGAAGATGTAAGATTGGCTAATACA
GCTATACTGATTGCCAATGGAAGAAAAGTGAAAAGTTATTCAACAACATTTCTTTCCGAA
TTGCCAATAAAGTATTTGCTACATCAAGCCCAGAAAGATCAAATGAGTTATGGTGGTCTT
TTCAGCCCGCTGCTTCGTTTGCTTGCGACTCATTTTCCGCAGCTATCGCTTGTAGATGAT
TGGATGGATGACCAGGTCTTTGGAGATTCCTGTCGTCACCAAATAGACATTAATCTTTCA
GAAGTATCTATAACTGAAGCATTCCAGTGCATCGAAGAAAATCCATATAAAACGGGTAAA
ATATTAAAAGCCATGCTTAATAAAAATCCTACTGACATATGGCCTTTTGCAGAAATATTT
GTTAAATACGTGAAGAGTGTGTTAGGAGGTAGAGTCCCAAGACATATACAAGAACTCTAC
AGAGAGGTTTGGTTGCGTTTAAACACGGTTCTACCCCGATGTTTGTGGATATTGACAATT
AACGCGTTGCTGGATATAAATAATGGATGCGGTAAATACGTTACCATAACACAGGAAAAC
GTTCTAGTTGATCCTTTACAAGTCTTAAGATGTGATATAAGAGTATTTAGATGTGGTCCT
ATATTAAAAATAATTCTGAGAATTTTAGAAGCGAGCTTAGCTGCATCGAGAAGCCAGTTA
AGTCGCCATTTATTGGACAAGCCACTTCTTGAAAAAAGCGGCCAATTGACATCAGACTCC
GAGAGGGAAGAATTGAAAAATGCCTTAGTTGCCGCTCAAGAAAGTGCAGCACTACAAATT
TTACTAGAAGCTTGTTTGGAGACTGAAGAAGACCAATCTAAACCCGAACTAATGTGGTCT
TTGAAAGAAGTACGAAGTATAATATGTTCGTTTTTACATCAAGTGTTTATAGCTGAGCCA
TCACTTGCAAAATTAGTACACTTCCAAGGATATCCGAGGGAATTATTGACAGTAACCGTC
CAAGGCATACCGTCAATGCACATATGTTTAGATTTTATTCCTGAACTTCTAAGTCAAGCT
TCTCTAGAGAAACAAATTTTTGCTGTGGACTTGGTATCTCATTTATCAATTCAGTATGCT
TTACCCAAAGCTATGTCCATTGCGAGGTTATGCGTGAATACTCTATCCACCCTCCTATCT
GTCCTACCAAGTGACCTGCGTCTGGAACTCTTCCAACCAGTTTTAAAATCGCTCGTACGG
ATTTGTATCGCATTTCCCTCCTTACTTGAAGATATTACATCGTTATTGTTACAGTTAGGT
CGAATTTGTGAATCTCAGGTATCACTTGGCCATTGTTGGAATGACACAAATATATTGGGC
GAAGGAGCTTATGTATCCTCTGAAGTTCACAATGACAGTAAAGTATTACTCGCCGAGGTT
TTATGTAGGGACATTAAATCAACAATGTCAGAAATTATACAGAAAGCACTTTTAAATGAT
AAACTGTATTGA
Protein sequence:
MDIEFMKPVKPLVFKALKDVDIETLIKCTPDEIRPIIPCLVRMALIAPLDITRYCAEAKK
DILTLLSGIDLVNFIVSLLSIEFHALEVDLKKEQQMRLKSGSQNTESFLIQNVVNGIAND
FEQSDSARRVRLVLSELLQMQAQLAEYNQNKNSNSESSIKPSELFDNEVYLEEITDVICI
SLAELPNLLNICEIVEVLLHVNKGPIIISWVVANMPDTLLDVAESLVLNAERGEEGGIRA
KTLSTLCDACPYIATAVRAKAVSASRLPCLIINLTLTHHQDLVSFISGLLLGSDQSTRTW
FATFLRNSHKRGKGDGHAILVKLRQELLIRLKEASAGVDASALLRLYCALRGIAGIKFQD
DEVSGLLRLVTQKPPPTPAGVRFVSLSLCMILACPSLMAAPEYEKKAIEWVQWLVKEEAY
FESNSGVTASFGEMLLLIAIHFHSGQLTAVGELVCATLGMRVPVRPNGLARIKQAFTQEI
FTEQVVTAHAVKVPVTANLNSNISGYLPVHCIHQLLKSRAFSKHKVPIKNWIYSQICNCI
APLHPVMPALVEVYVNSILVINNKGTNEYFNKPITEEEIRRVFRKSIFGVNYDSNSKPFT
SMDVDSDSTVDINIEKPTLASQLLLIYYLLLYEDVRLANTAILIANGRKVKSYSTTFLSE
LPIKYLLHQAQKDQMSYGGLFSPLLRLLATHFPQLSLVDDWMDDQVFGDSCRHQIDINLS
EVSITEAFQCIEENPYKTGKILKAMLNKNPTDIWPFAEIFVKYVKSVLGGRVPRHIQELY
REVWLRLNTVLPRCLWILTINALLDINNGCGKYVTITQENVLVDPLQVLRCDIRVFRCGP
ILKIILRILEASLAASRSQLSRHLLDKPLLEKSGQLTSDSEREELKNALVAAQESAALQI
LLEACLETEEDQSKPELMWSLKEVRSIICSFLHQVFIAEPSLAKLVHFQGYPRELLTVTV
QGIPSMHICLDFIPELLSQASLEKQIFAVDLVSHLSIQYALPKAMSIARLCVNTLSTLLS
VLPSDLRLELFQPVLKSLVRICIAFPSLLEDITSLLLQLGRICESQVSLGHCWNDTNILG
EGAYVSSEVHNDSKVLLAEVLCRDIKSTMSEIIQKALLNDKLY
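
Consistency check (illustrative sketch, not part of the original record): the CDS length and genomic coordinates listed above can be cross-checked against the protein sequence with simple arithmetic. The function names below are hypothetical helpers, and the sketch assumes that the reported CDS length includes the terminal stop codon (the nucleotide sequence ends in TGA) and that the genomic coordinates are 1-based and inclusive.

```python
def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Return the number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a complete open reading frame, so its length
    must be a multiple of three.
    """
    if cds_length % 3 != 0:
        raise ValueError(f"CDS length {cds_length} is not a multiple of 3")
    codons = cds_length // 3
    # The stop codon is counted in the CDS but does not encode a residue.
    return codons - 1 if includes_stop else codons


def genomic_span(start: int, end: int) -> int:
    """Return the length of a genomic interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    # Values taken from the record above.
    print(expected_protein_length(3372))  # residues expected in the protein sequence
    print(genomic_span(48529, 52950))     # bases covered on scaffold49
```

Under these assumptions, a genomic span longer than the CDS would be consistent with the gene model containing introns, although the record itself does not list the exon structure.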