DPGLEAN10321 in OGS1.0

New model in OGS2.0  DPOGS212366
Genomic Position  scaffold101:+ 85612-92282
CDS Length  3438
Paired RNAseq reads  2674
Single RNAseq reads  6983
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA004658 (0.0)
Best Drosophila hit  stromalin (0.0)
Best Human hit  cohesin subunit SA-1 (0.0)
Best NR hit (blastp)  PREDICTED: similar to stromal antigen [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to stromal antigen [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0005634 nucleus
GO:0008278 cohesin complex
GO:0005488 binding
GO:0000922 spindle pole
GO:0000780 condensed nuclear chromosome, centromeric region
GO:0005721 centromeric heterochromatin
GO:0035327 transcriptionally active chromatin
InterPro families
IPR013721 STAG
IPR016024 Armadillo-type fold
IPR020839 Stromalin conservative domain
Orthology group  MCL10690

Nucleotide sequence:

ATGCACCGAAGAGGCGGGAAACGCATCCGAATGGATGACCCTCCCCCGGAGTATGTCAAT
CCTATGACACCGGCTACACCTATGACCGATTACGGTGGACAATCTGTACACGAGCCGGAA
ACCCCCAGCATTAATTATGCTGGCTTTAACACTGGAACCGTGGCAAACTCAACAGTAGGA
GAGTCCAACAGAGAAGATATCGAAGAACATCAAGAGGAAGAAACGAGGTCACCGTCGCCA
GCACCGACGAAAAGAGTAACTCGTAGCAGAGGACGAGGTGGGGATGGAGGATATGTGGGA
AGACTGGCTGAGTCACCACCTCCGCCACCGCTCCGAAGAAGAGGACGAGGTGGTCGCGGC
CGGGGACGAGGCAGGGGAGCCCCCCCACCACAGGCGTACTCTCCGCCTCCAGTATTGCTG
CCAGGAGATGATGAAAACAGTCTTTACAATATCCTCCGGTTTAATAAAACAGCTATAAAT
CAAGTGGTAGACATGTGGATAGAGGAATACAAGAGTAACCGTGAGAGTGCACTGGTTCAG
CTGATGCAGTTCTTTATTAATTCCTCTGGATGTCGGGGAAAGGTCACACCCAACATGGCT
CAGATGGACCATACCCTCATTATCAAGAAGATGACCCAGGAGTTTGATGAGGAAAGTGGT
GAATATCCATTAATAATGTCAGGACACACATGGAAAAAGTTCCGATCTAACTTCTGTGAG
TTCATCCAAACTCTGGTGAAGATGTGTCAGTACTCCATCATATACGACCAGTTCCTGATG
GACAACATCATATCCTTGCTCACGGGGCTGTCTGACTCTCAAGTGAGAGCATTCAGACAC
ACCGCTACACTCGCTGTGATGAAGCTGATGACGGCGCTGGTGGACGTGGCACTGCTCACG
TCTGTCAACTGCGACAACTGTCTGAGGCAGTACGAGGCCGAGCGGCTCAAGGCGCGCGAC
AAGAGAGCCAGCGAGCGGCTGGAGGTGCTGGTCGCCAAGAGACAGGAGCTGGAGGAGAAC
ATGGAGGAGATCAAGAACATGCTTTCCTACATGTTCAAGTCCGTGTTCGTGCATCGCTAC
AGAGACACCTTAGCCGAGATCAGAGCGATCACCATGTCCGAGATCGGGATCTGGATGGAG
AAATTTCCTGCTCATTTCCTGGACGATTTGTATCTGAAGTACATCGGTTGGACTCTCCAC
GACAAGGTGGGCGAGGTCCGTCTCCGCTGCCTGCAGGCGCTGCAGCCGTTATACGAGTGC
GAGGAGCTGAAGAGCAAATTAGAGCTGTTCACGTCCAAGTTCAAGGACCGCATCGTGTCA
ATGGCCCTCGACAAGGAGACCGAGGTGGCCGTCCACGCCGTGAGACTCGTCATCGCCATA
CTCAAGATGCATCCCGACGTCCTGACCGACAAGGACTGCGAGAACGTTTATGAACTGGTG
TACTCGTCGTGGCGCAGCGTGGCTGCGGCGGCGGGGGAGTTTCTGAACGTCCGCTTGTTC
CGCCCTGATGACCCGGGCGCTCCGCCTGCGCGCTCGCGGCGCGGCAAGCAACGTCTACCC
AACACGCCGCTGGTGCGCGACCTCGTGCAGTTCTTCATCGAGTCGGAGCTGCACGAGCAC
GGCGCCTACCTCGTGGACTCCCTCATAGAGTCCAACCCCATGATGAAGGACTGGGAGTGT
ATGACGGACCTGCTGCTCGAGGAGCCCGGGCCCACCGAGGAGCCGCTCGACAACAGACAG
GAATCGTCCCTGATCGAGCTGATGGTGTGCTGCGTGCGCCAGGCCAGTACGGGCGAGCCG
CCGGTGGGTCGGGGCGCGTCCCGCAAGCAGCACCAGGCGCTGTCCAAGGACCAGGCCAAG
GCGGCCAACGACGACCGCGTCAAGATGACAGCACACTTCATGGTGGCGCTGCCGGCGCTG
CTCGACAAATTCTGCGCCGACCCCGAGAAGCTCAATAACCTCGTCACCATCCCGCAGTAC
TTCGACCTCGAGCTCTACACCACGCAACGGCAGGAAGGAAATCTGACGCTGCTGTTGAAC
AAGATCCGGGAGATCGTCAGCACTCACACCGAGGCCGAAGTGCTGGAGACGTGCGGCCGG
ACGCTGGAGTACCTGTGCAGCGAGGAGCATGCAGTCTACACGCGCTGCAACGTGGCGCGC
GCCACGCTCACCGACATGTGCGTCAACAGATACAAGGAGGCCATCGACGACTACCGGAAC
CTCATCGAGGGGGGGGAGACTCCGGACGCCGACGAGGTGTTCAACGTGATCAACTCGCTG
CGCAAGGTGTCCATCATGTACATGTGCCACAACCTGAACGACACCAACATCTGGGACTCG
CTGTTCGAGGACCTGCCTAAGTGCGTATCGCCGGGGCTGATGCCGACGCAGGCGCTGGTG
TACGTGGTGCGCGCCTGCTTCTACTCCGTGCTGTGGTCGCTGCACGAGCTGGACGAGCGC
GGCGGGGACCCCGCGCCCCTGAGGGAGCGACTGTTGGCCTACGCCGCTCACTGTCGCAAT
ATCGTCGCGGCCGGCGCGACACCCGACCTCAAGGAGGAGGCCTACACGAGTCTTTGTGAC
CTGCTGATCTTCTTCGCGGAGTGTCCGCGCGGCGGCTCGTCGGCCCCGGGCGCCGGTCTG
CGCGCTCTGGAGGCGGACAGCGCCACTATGGACCTGCTCAACGCCTTCGTGCAAGAATTC
GTGTTCGTCCAAAACAACTACGACGGACAAGACGAGAGACGGATAGAGGAGCTCCACAAA
AGAAGGAACTTCTTGGCCGCCTACTGCAAGCTCATCGTGTACAACGTGGCACCGCTGAGG
CGCGCCGCAGAGGTCTTCAAACACTACATACGGTGCTACAACGACTACGGAGACATCATC
AAGGCCACGCTGAGTAAAGCTCGGGAGATCAACAAGCTGGGCTGCGCGCTCACCATGCAG
CTCGCCATGCAGATGCTGTTCACGGACGTGCTGCGGCTCCACCCGCGACCCTCGCGACAA
CTCACCGAGTTCCTGGAGGTCAAGGAGCTCGCCAAGCGGTTCGCGGTCATGTTCGGGCTG
GACGCCGTCAAGAACCGCGAGGCCCTCACGGCGCTACACCGCGCCGGCGTCGCCTTCGCC
GCTCTCGAGGGCCCAGGCCCCGGCCCGCCGCCCAATCTTCTGTTCCTAGAGCCACTGGCC
GAGTTCTCCGCCAAACTGCTCCGTCAGGACAAGCGTGCCGTGCTCAAGTTCGCAGAGACC
AAGTTCTCGAGCATGCAGTGGGGCGAAGAGTGGGCGCCGCTGCTCGCCTACAGGAACTCG
CTGCTCACGGACGCCCCGGACGAGAGGCCGCCGCCCGCCAGGAGACACTACACGAGACGC
ACGCGTGGTGGAGGAGGTGGCGCGGCGCCCGAGTCTGACGATGCCGACGACCCGCACTAC
TCAGACCCCGAGATATGA

Protein sequence:

MHRRGGKRIRMDDPPPEYVNPMTPATPMTDYGGQSVHEPETPSINYAGFNTGTVANSTVG
ESNREDIEEHQEEETRSPSPAPTKRVTRSRGRGGDGGYVGRLAESPPPPPLRRRGRGGRG
RGRGRGAPPPQAYSPPPVLLPGDDENSLYNILRFNKTAINQVVDMWIEEYKSNRESALVQ
LMQFFINSSGCRGKVTPNMAQMDHTLIIKKMTQEFDEESGEYPLIMSGHTWKKFRSNFCE
FIQTLVKMCQYSIIYDQFLMDNIISLLTGLSDSQVRAFRHTATLAVMKLMTALVDVALLT
SVNCDNCLRQYEAERLKARDKRASERLEVLVAKRQELEENMEEIKNMLSYMFKSVFVHRY
RDTLAEIRAITMSEIGIWMEKFPAHFLDDLYLKYIGWTLHDKVGEVRLRCLQALQPLYEC
EELKSKLELFTSKFKDRIVSMALDKETEVAVHAVRLVIAILKMHPDVLTDKDCENVYELV
YSSWRSVAAAAGEFLNVRLFRPDDPGAPPARSRRGKQRLPNTPLVRDLVQFFIESELHEH
GAYLVDSLIESNPMMKDWECMTDLLLEEPGPTEEPLDNRQESSLIELMVCCVRQASTGEP
PVGRGASRKQHQALSKDQAKAANDDRVKMTAHFMVALPALLDKFCADPEKLNNLVTIPQY
FDLELYTTQRQEGNLTLLLNKIREIVSTHTEAEVLETCGRTLEYLCSEEHAVYTRCNVAR
ATLTDMCVNRYKEAIDDYRNLIEGGETPDADEVFNVINSLRKVSIMYMCHNLNDTNIWDS
LFEDLPKCVSPGLMPTQALVYVVRACFYSVLWSLHELDERGGDPAPLRERLLAYAAHCRN
IVAAGATPDLKEEAYTSLCDLLIFFAECPRGGSSAPGAGLRALEADSATMDLLNAFVQEF
VFVQNNYDGQDERRIEELHKRRNFLAAYCKLIVYNVAPLRRAAEVFKHYIRCYNDYGDII
KATLSKAREINKLGCALTMQLAMQMLFTDVLRLHPRPSRQLTEFLEVKELAKRFAVMFGL
DAVKNREALTALHRAGVAFAALEGPGPGPPPNLLFLEPLAEFSAKLLRQDKRAVLKFAET
KFSSMQWGEEWAPLLAYRNSLLTDAPDERPPPARRHYTRRTRGGGGGAAPESDDADDPHY
SDPEI
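
Consistency check (illustrative sketch):

The CDS length listed above (3438 nt) corresponds to 1146 codons, that is, the 1145 residues of the protein sequence plus one stop codon. The short Python sketch below shows one way to check that the nucleotide and protein sequences agree. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene; the function name `translate` and the variable names are hypothetical and not part of the record. Only the first and last lines of the nucleotide sequence are used here, and the same function could be applied to the full CDS.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in T, C, A, G order.
# Assumption: the standard code applies to this nuclear gene.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGCACCGAAGAGGCGGGAAACGCATCCGAATGGATGACCCTCCCCCGGAGTATGTCAAT"
last_line = "TCAGACCCCGAGATATGA"

print(translate(first_line))  # expected to match the start of the protein sequence
print(translate(last_line))   # expected to match the end of the protein sequence

# Length bookkeeping: 1145 residues plus one stop codon account for the CDS length.
cds_length = 3438
protein_length = 1145
print(cds_length == 3 * (protein_length + 1))
```

Under these assumptions, the two translated fragments should reproduce the first 20 residues (MHRRGGKRIRMDDPPPEYVN) and the last 5 residues (SDPEI) of the protein sequence above.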