New model in OGS2.0 | DPOGS212366  |
---|---|
Genomic Position | scaffold101:+ 85612-92282 |
See gene structure | |
CDS Length | 3438 |
Paired RNAseq reads   | 2674 |
Single RNAseq reads   | 6983 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA004658 (0.0) |
Best Drosophila hit   | stromalin (0.0) |
Best Human hit | cohesin subunit SA-1 (0.0) |
Best NR hit (blastp)   | PREDICTED: similar to stromal antigen [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to stromal antigen [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005634 nucleus GO:0008278 cohesin complex GO:0005488 binding GO:0000922 spindle pole GO:0000780 condensed nuclear chromosome, centromeric region GO:0005721 centromeric heterochromatin GO:0035327 transcriptionally active chromatin |
InterPro families    | IPR013721 STAG IPR016024 Armadillo-type fold IPR020839 Stromalin conservative domain |
Orthology group | MCL10690 |
Nucleotide sequence:
ATGCACCGAAGAGGCGGGAAACGCATCCGAATGGATGACCCTCCCCCGGAGTATGTCAAT
CCTATGACACCGGCTACACCTATGACCGATTACGGTGGACAATCTGTACACGAGCCGGAA
ACCCCCAGCATTAATTATGCTGGCTTTAACACTGGAACCGTGGCAAACTCAACAGTAGGA
GAGTCCAACAGAGAAGATATCGAAGAACATCAAGAGGAAGAAACGAGGTCACCGTCGCCA
GCACCGACGAAAAGAGTAACTCGTAGCAGAGGACGAGGTGGGGATGGAGGATATGTGGGA
AGACTGGCTGAGTCACCACCTCCGCCACCGCTCCGAAGAAGAGGACGAGGTGGTCGCGGC
CGGGGACGAGGCAGGGGAGCCCCCCCACCACAGGCGTACTCTCCGCCTCCAGTATTGCTG
CCAGGAGATGATGAAAACAGTCTTTACAATATCCTCCGGTTTAATAAAACAGCTATAAAT
CAAGTGGTAGACATGTGGATAGAGGAATACAAGAGTAACCGTGAGAGTGCACTGGTTCAG
CTGATGCAGTTCTTTATTAATTCCTCTGGATGTCGGGGAAAGGTCACACCCAACATGGCT
CAGATGGACCATACCCTCATTATCAAGAAGATGACCCAGGAGTTTGATGAGGAAAGTGGT
GAATATCCATTAATAATGTCAGGACACACATGGAAAAAGTTCCGATCTAACTTCTGTGAG
TTCATCCAAACTCTGGTGAAGATGTGTCAGTACTCCATCATATACGACCAGTTCCTGATG
GACAACATCATATCCTTGCTCACGGGGCTGTCTGACTCTCAAGTGAGAGCATTCAGACAC
ACCGCTACACTCGCTGTGATGAAGCTGATGACGGCGCTGGTGGACGTGGCACTGCTCACG
TCTGTCAACTGCGACAACTGTCTGAGGCAGTACGAGGCCGAGCGGCTCAAGGCGCGCGAC
AAGAGAGCCAGCGAGCGGCTGGAGGTGCTGGTCGCCAAGAGACAGGAGCTGGAGGAGAAC
ATGGAGGAGATCAAGAACATGCTTTCCTACATGTTCAAGTCCGTGTTCGTGCATCGCTAC
AGAGACACCTTAGCCGAGATCAGAGCGATCACCATGTCCGAGATCGGGATCTGGATGGAG
AAATTTCCTGCTCATTTCCTGGACGATTTGTATCTGAAGTACATCGGTTGGACTCTCCAC
GACAAGGTGGGCGAGGTCCGTCTCCGCTGCCTGCAGGCGCTGCAGCCGTTATACGAGTGC
GAGGAGCTGAAGAGCAAATTAGAGCTGTTCACGTCCAAGTTCAAGGACCGCATCGTGTCA
ATGGCCCTCGACAAGGAGACCGAGGTGGCCGTCCACGCCGTGAGACTCGTCATCGCCATA
CTCAAGATGCATCCCGACGTCCTGACCGACAAGGACTGCGAGAACGTTTATGAACTGGTG
TACTCGTCGTGGCGCAGCGTGGCTGCGGCGGCGGGGGAGTTTCTGAACGTCCGCTTGTTC
CGCCCTGATGACCCGGGCGCTCCGCCTGCGCGCTCGCGGCGCGGCAAGCAACGTCTACCC
AACACGCCGCTGGTGCGCGACCTCGTGCAGTTCTTCATCGAGTCGGAGCTGCACGAGCAC
GGCGCCTACCTCGTGGACTCCCTCATAGAGTCCAACCCCATGATGAAGGACTGGGAGTGT
ATGACGGACCTGCTGCTCGAGGAGCCCGGGCCCACCGAGGAGCCGCTCGACAACAGACAG
GAATCGTCCCTGATCGAGCTGATGGTGTGCTGCGTGCGCCAGGCCAGTACGGGCGAGCCG
CCGGTGGGTCGGGGCGCGTCCCGCAAGCAGCACCAGGCGCTGTCCAAGGACCAGGCCAAG
GCGGCCAACGACGACCGCGTCAAGATGACAGCACACTTCATGGTGGCGCTGCCGGCGCTG
CTCGACAAATTCTGCGCCGACCCCGAGAAGCTCAATAACCTCGTCACCATCCCGCAGTAC
TTCGACCTCGAGCTCTACACCACGCAACGGCAGGAAGGAAATCTGACGCTGCTGTTGAAC
AAGATCCGGGAGATCGTCAGCACTCACACCGAGGCCGAAGTGCTGGAGACGTGCGGCCGG
ACGCTGGAGTACCTGTGCAGCGAGGAGCATGCAGTCTACACGCGCTGCAACGTGGCGCGC
GCCACGCTCACCGACATGTGCGTCAACAGATACAAGGAGGCCATCGACGACTACCGGAAC
CTCATCGAGGGGGGGGAGACTCCGGACGCCGACGAGGTGTTCAACGTGATCAACTCGCTG
CGCAAGGTGTCCATCATGTACATGTGCCACAACCTGAACGACACCAACATCTGGGACTCG
CTGTTCGAGGACCTGCCTAAGTGCGTATCGCCGGGGCTGATGCCGACGCAGGCGCTGGTG
TACGTGGTGCGCGCCTGCTTCTACTCCGTGCTGTGGTCGCTGCACGAGCTGGACGAGCGC
GGCGGGGACCCCGCGCCCCTGAGGGAGCGACTGTTGGCCTACGCCGCTCACTGTCGCAAT
ATCGTCGCGGCCGGCGCGACACCCGACCTCAAGGAGGAGGCCTACACGAGTCTTTGTGAC
CTGCTGATCTTCTTCGCGGAGTGTCCGCGCGGCGGCTCGTCGGCCCCGGGCGCCGGTCTG
CGCGCTCTGGAGGCGGACAGCGCCACTATGGACCTGCTCAACGCCTTCGTGCAAGAATTC
GTGTTCGTCCAAAACAACTACGACGGACAAGACGAGAGACGGATAGAGGAGCTCCACAAA
AGAAGGAACTTCTTGGCCGCCTACTGCAAGCTCATCGTGTACAACGTGGCACCGCTGAGG
CGCGCCGCAGAGGTCTTCAAACACTACATACGGTGCTACAACGACTACGGAGACATCATC
AAGGCCACGCTGAGTAAAGCTCGGGAGATCAACAAGCTGGGCTGCGCGCTCACCATGCAG
CTCGCCATGCAGATGCTGTTCACGGACGTGCTGCGGCTCCACCCGCGACCCTCGCGACAA
CTCACCGAGTTCCTGGAGGTCAAGGAGCTCGCCAAGCGGTTCGCGGTCATGTTCGGGCTG
GACGCCGTCAAGAACCGCGAGGCCCTCACGGCGCTACACCGCGCCGGCGTCGCCTTCGCC
GCTCTCGAGGGCCCAGGCCCCGGCCCGCCGCCCAATCTTCTGTTCCTAGAGCCACTGGCC
GAGTTCTCCGCCAAACTGCTCCGTCAGGACAAGCGTGCCGTGCTCAAGTTCGCAGAGACC
AAGTTCTCGAGCATGCAGTGGGGCGAAGAGTGGGCGCCGCTGCTCGCCTACAGGAACTCG
CTGCTCACGGACGCCCCGGACGAGAGGCCGCCGCCCGCCAGGAGACACTACACGAGACGC
ACGCGTGGTGGAGGAGGTGGCGCGGCGCCCGAGTCTGACGATGCCGACGACCCGCACTAC
TCAGACCCCGAGATATGA
Protein sequence:
MHRRGGKRIRMDDPPPEYVNPMTPATPMTDYGGQSVHEPETPSINYAGFNTGTVANSTVG
ESNREDIEEHQEEETRSPSPAPTKRVTRSRGRGGDGGYVGRLAESPPPPPLRRRGRGGRG
RGRGRGAPPPQAYSPPPVLLPGDDENSLYNILRFNKTAINQVVDMWIEEYKSNRESALVQ
LMQFFINSSGCRGKVTPNMAQMDHTLIIKKMTQEFDEESGEYPLIMSGHTWKKFRSNFCE
FIQTLVKMCQYSIIYDQFLMDNIISLLTGLSDSQVRAFRHTATLAVMKLMTALVDVALLT
SVNCDNCLRQYEAERLKARDKRASERLEVLVAKRQELEENMEEIKNMLSYMFKSVFVHRY
RDTLAEIRAITMSEIGIWMEKFPAHFLDDLYLKYIGWTLHDKVGEVRLRCLQALQPLYEC
EELKSKLELFTSKFKDRIVSMALDKETEVAVHAVRLVIAILKMHPDVLTDKDCENVYELV
YSSWRSVAAAAGEFLNVRLFRPDDPGAPPARSRRGKQRLPNTPLVRDLVQFFIESELHEH
GAYLVDSLIESNPMMKDWECMTDLLLEEPGPTEEPLDNRQESSLIELMVCCVRQASTGEP
PVGRGASRKQHQALSKDQAKAANDDRVKMTAHFMVALPALLDKFCADPEKLNNLVTIPQY
FDLELYTTQRQEGNLTLLLNKIREIVSTHTEAEVLETCGRTLEYLCSEEHAVYTRCNVAR
ATLTDMCVNRYKEAIDDYRNLIEGGETPDADEVFNVINSLRKVSIMYMCHNLNDTNIWDS
LFEDLPKCVSPGLMPTQALVYVVRACFYSVLWSLHELDERGGDPAPLRERLLAYAAHCRN
IVAAGATPDLKEEAYTSLCDLLIFFAECPRGGSSAPGAGLRALEADSATMDLLNAFVQEF
VFVQNNYDGQDERRIEELHKRRNFLAAYCKLIVYNVAPLRRAAEVFKHYIRCYNDYGDII
KATLSKAREINKLGCALTMQLAMQMLFTDVLRLHPRPSRQLTEFLEVKELAKRFAVMFGL
DAVKNREALTALHRAGVAFAALEGPGPGPPPNLLFLEPLAEFSAKLLRQDKRAVLKFAET
KFSSMQWGEEWAPLLAYRNSLLTDAPDERPPPARRHYTRRTRGGGGGAAPESDDADDPHY
SDPEI