DPGLEAN12956 in OGS1.0

New model in OGS2.0  DPOGS210356
Genomic Position  scaffold199:- 101-9656
CDS Length  5769
Paired RNAseq reads  3153
Single RNAseq reads  8818
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA011914 (0.0)
Best Drosophila hit  virilizer (3e-61)
Best Human hit  protein virilizer homolog isoform 1 (6e-42)
Best NR hit (blastp)  PREDICTED: similar to virilizer CG3496-PA [Tribolium castaneum] (0.0)
Best NR hit (blastx)  PREDICTED: similar to virilizer CG3496-PA [Tribolium castaneum] (0.0)
GeneOntology terms
GO:0000375 RNA splicing, via transesterification reactions
GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome
GO:0005634 nucleus
GO:0007539 primary sex determination, soma
InterPro families  ND
Orthology group  MCL15376

Nucleotide sequence:

ATGCCTCGTGTCGTCTGCGGCGACGGTGGCCGCCAGCAGGAGCCAGCAGGCCTGCCAGGC
GCGCATCTTACCGCCACTTACCACATCAACACAACAACACAGAACACACGCGCACTCAGG
GAAAGTTTCTCGAAAGTTGCTGACGCGTTTCTCTCGACCGACGACCTCCCTCCTCGACCG
CGCGGGAGACACTGGCGACCGGCGGCTGGCAAACCGCAAACTACACACGACACACGACGC
TCAGACAAACCGGATCTATTATTCTTTGATACATTTTCACATGATACTAGTGAAGAGCTT
AATTTAGACCTTGTACAGTTTCCGAAGTCTGTGTATGTGAGAGAAATACGAATTATTCCC
CTCGGTGCTCGTGTGGAAGGAGACTTCCCAGGTGGAGTGAGACTCGGTGCCACGAACCCC
ACAAAGTTTCACATTGACTTCTTTGTGAATGATTTGAGCAAACCTGGAGCTGCTACATTT
GAGGCTCTCGGCAGTTTAGATTATTGTCAGAATGGACAGATTCATATGGAATGCGGAGAC
AATGCTGAGCAGACAAGAATTCCTACAGACGGTCTTGTCCTTAGAGGTTGGTATACAACT
ATAACTTTGGCTGTATATGGTACCTTGACTCAAGTCTTACCTGACAATGTCGCCGCTGTG
AACCCACAGCCCAATAGTCAGAGACCGGTCTCTAGAGAAGTGAGCGCACTACCCGCGAAC
GTGCCACAGTCATCAGATTGGAATCCAGAAACCTCTAATCCCATACCAGCCTACACCAGC
AATGTCACCGCCACCAATCCTGAAGCTTATGGTGCGGGAAACTACCCGAACCCAGAGAAT
TATGATAATCAAATGTATAGAGGAGAATATTACGACAACGAGGCACCCAAAGACCCTCGT
ACATATCACCACATAGACGAGAACGACTGGGAAAAGGAGAGAAGAGATATTAGTTGTGAA
AGAGATGGAGATAGAATGAGACACAGTCCACGATCGATGGAGTTGAACAGAGAGCGCCGG
GAAGGACGGAGCAGGAGACGCTCACTGGACCGTGGATTGAGTCGAGAGTCTTCACGACAC
CGTGACAGAAGCAGAGATGTCGACCGACTTGACAGGCTTCACTCTAGATCGAGAAGTCGC
GATAGAGATTATGTTGGTAAAGGAGAGTACCGTCCTCTGAGCCGGTCAAGGTCACGCAGC
TTAGATCGTGACTGGGACAGGGGCTCGTACAAAAAAGATGACTACCGTCGTCATCGTGAT
GCCTCCTATGATCGGTCGCGCGGTGGTTCGTACGAGCCGAGGTCACCGAGAGCGCCTTCT
TACGAACGCAAACCTCCTTACGACAAAGGTGGTGCGTACGAAAAGAGGCTGTCGCCGTAT
GACAAGAGAAGTTCGTCTTACGAGCGACGAGCTGCATCCTACGACAAACAAACGCCGTAC
GATAGAAGGAGACACTCCCCTTACAGCCGCATGAGAGGATCTAGTTACGGCAGTAGGTCG
CCCAGTCGAGACGATCCCAGGAAGAGACCTAGAACTCCACCCGTGGAAACCAGGCGGCCA
CTGTCACCCAGGGAAGGTGAAACGACGAGCCCGATGAATTCGGTTCGATCTGAGGAAGGC
GCTGAATATGACCGCGGTGACCGCAGCGGGAAGCAAATTCCACGTATCGACTTTTATCAT
CAGAGCTACCGACACAAGAGTTCCATCAGGAGTCCCTCTCAGGAGGTCGATAACAACTAC
GTTGAACTTCAACACTCCAGCCTAGTGACAGTTCCCATCGTCGATACAACGGTAGCTCCG
AAACCCATAGAATCTCCGAATCGCAATCCCGATGAAGAAAAATCCATGGACGCCGAGCCC
TTCGAGCCCATTCTGTCCGATGAAGATATCTGCGATGACTTGGACGGTAATTCCTACATG
GAAGTTGAATATGACGTCAACGAGTACTGCGGAGTGGACGACATCATCAAATATTACAAT
CCATTCAAAGACGAATGGAAGAAATATGAGCGCGTCAACCAAATATGCAGGCTGGCGGAT
GTCAGGCGGGGCAGTGTCAAAGACGTCATGTCGGCCAGCTTCGAAGAACTGTGTGATGCC
AGCAGTGATCTGTTAAAAATATCAGAGATCACGAATGCTGCTAAGAGAAGGGAGCAGAAG
TTTTCGACGGATACGTTTTTGGAAATCGATAATTCTGCGCGGGAAGATTGGGTACATCAA
TGCGAACAATTGTTCGTGTCTCTTATAAATTTATGCAAAAACACCGACATAATACTGCGA
GTCTTCCGCACGGACCGATCAGATTCAGACGAATTTTCTGACATGTATCACCTCCTGATC
ACATTCTGCAGAATCGGTTTAAACTTCGATTTCGTGCTGTCTCAACAGCAGCCAACCTAT
AAGATCCGCCACATGAAATGCGGGATCCGACTCGCCGAGGCTTTGATGTGTCACGAATAC
AGCGGCCGAGTCATGAAGACGTTGCTGAGTTCCGGGATTGACGTTCCGTTACAACTGCTG
GAAATGTATTCCAAGGAATATATGGCGCTGAGTATTCGCCTCATGATATTAAAAGCTTTA
AGCGCCTGCCTTTCCTCTAAAGAGGCCGTCGAGCATTTCATGCAAGATGCCATGTACCCT
AAGTTCGATAAGAACGATGAAAGTCCTAGAAAAGCCATGAACGGTTACCAAGCTCTGATA
GCCATGATAAAGGCCAACCCACTAGGGAGGATCAAATTTTCCGTTAGTTCCCTCCTGAAG
AAACTAAATATTTATGAAGTTCTGATGAAACTGCGTGTTCTCGTGTTTAACTTCAATAAG
GCCGCCGCTGATAACAATCACTCCGACGACTCCGAACTGTCGGAGAGCGATGTAAGCTTC
ATTGTGGATTCCTTTGACGAAATTCTAAACATGTACAGAACTCAATGCTTCCATTTATCT
CAACCGAAAAGGTTCCTGCCTGTGGCTGCGCAGTTTGAAATAAGCAAGGAATGTTCAAAC
GAGATACTCCTGCAGTTTTTTGACATTCATAAAGTTCTACAAGTTTGCTTATACTTGCTG
ACCTGTCCGTCGACCTGCAACAATCTGGTCATCACGAGTCCGATCCACGATCTCATATTC
GAATTAGTCAATTCTCACAACGGGCTCCTATTTCTGTACAAGAATATGGAAATGAGTGAA
CTGTTATTTAAGGTTTTGGAACAACCGTTCGGTCAGCAGACGAGCGAAGAAACGGCGTAC
CCTCATGACAGCACACTTAGTAGTTACAGTGACCTACAAATCGTTGGACTCGAACTCGCC
TATAGGCTGAAGGCGATGTACTACTTGCAATCGATTAGTGATGTGCAAGTCGGAAATAAT
GATGAATACAAATTGATAGACAGACTACAAGCTTTATACTGCCTCAGCTTTGGAAGCATC
GGGAAGACGGCCGTGCCGAGTGTTATAGTTATGGGAAACAACTCGGAGTGTCTCATGGAA
GTGTTCGAGAATGATCTGAAAAGCAAAAACAAAAGCGAGTCGCCATCAAAACAGAAGTCT
CCGGCAAAGGGCTACGCTATCGAGCTTATCGTATCGGCCGTGAGATTTGCTGCAAATGTT
CCATTCTTGAAGAAATACGGACAAAGATTAATAAACGTGTCAAAAGAACATGACAGATTT
GAACCGAGCGTGTCCAGTGTACTTCAAGAAGTCATTCCGTACTTGAAGCCCGTGGAGAAA
CCTAACGTCCTTTCGGGAGAAGACCTCAGCGAGTGTGTCGAGATCTTGAAGGCGAGTTTA
GAACATTCGTCCGATTTACCCGGGGAGCTGATGACTTGCCTGAGAATATTACGACACTTT
TGTATATCCGATTATGAAAAAAACGTTTCCATTGTTTGCGAAAACCCGGTGGAGGAATAT
GTCGAACTGAAATACAAGTATAATATATTACAATTGTTTTCTTTGGACGGCGTCGCCCAT
ATTTCTGGTATCTTAGATCGGCTCGCTACTCATTTCGATCAACCAAGTATGCACACATCA
TTCTTCGCTTCGGTGAGAGGACTTCAGTTGGCCCAGATGTTATTACCTTGTTTGAGTCTC
TTAGACGAGATGTTAGCGAGAGTAGTTCGCTGTCGAGGCCCTAGATTTAGAGATTTAACG
GCGGCACCTATAATATTGAAGACTTACGGGATAGCCAAAACCTTTCCAGTCGGTTCTATC
GGCTACCGAACTGCGGCCAAGGCTGCCGAAGCGGCGGTGAGAGCATTACTGGCTTACGCC
CAGCCGATAGCCGACGACGCCAATGACGGAGATTCAATACGTCGTGGGCCGTGGACGTCG
CTGTGTTCCGAAGTTATTTCCTACACAATGACCGCACCTTACACATTCGTTCCCGGACTT
CTCATATTTTCGGAACTCCTGCCGCTGCCGTTGCCGATGCAAACGAAGCTCCCTCCCACC
GACCGAGAGCTGGCCGACGCATCTAACGAAAGGCGGCTGTGGTCTGCTCATTTACACGCC
TTGTCTAATGACTTGACGGACATGATACAGGTGATATGTATGTCCACGTACCGCCCGGTC
GTGCACATGCTACGGAGAGTGTGCGTTCAAATCGCAGACCTGGCTCCGAACACGGCGGCC
ACAGTGGCTCGGGCGGCTGTTGGTGCTGTCATGAGAGAAATGAAAGAAGAAGACCCGCCC
ACCGCCAGCCTCGCCAGAGTCTTAGGATTCTTGGCATGTCTCGTGTCACACGCGCCAGTC
AAGTGTGCCGTGTTGCACGCTATGAGCGCCGGAGGTCCCAAAGCTAGCGAAGTACAAACG
GCGCTTTGTAATGTTTTCAACTTGTCAAACTCGTCGAACGAACACGCCGCGGCCCAAGAG
TATGCTGCTCATGTGTTAGCGGCGTTCTGTGACATTGAAATAACTTTAACTCCTCTCACA
GCTAACATCGATAGCGTTCTCGCCAATTCATTACCCACTAAAGACGCTCTTACGGCGTTT
TTAGACGCAACGGCAAATTGTCTAGAATCTACGACGAAAACCTGTTCAGTGGCCTCGTCG
GTATTACGCGCCTACTACGTCTTAACGGATCACGAATACGGGTTCCTGCAATTTAGAAAA
TTCGTGACAAAGAGAACGGAAGCCCTCGGTAAATATTTTAAGTGGACGACGGAGAGCGCG
GGAGAAGACAGGACGGACTGCCTGAACCTGTTCCTCGACTTAATAAAGATACTGAGAGTT
GAACACGGCGAGGGCCCGGGCGGGAGGAGAAGCGTCCTGACGCTTAGTGAGATGGCAGAC
ATGGTCGGATACACGGCGTCCGACAAGGAGCATCCGGTCATGACACTAGAAAAAGTTCTC
AAGGAAAGGAACTCCGAAGAAGACGCGATATCAAACGCGACCTTCTTGATCTCGAATCTG
AAAAACCTCGCGGACGAAGAGGTCGGGACTTCCCCGGAGGTCTCGGAAGCTGCGACGCCG
GCCCCCGAACCCTTGGTGGCTCAGTTTGCGGCCCGACAGATTTATTGTATCGGAGATTCA
AGCGACGAACGGCTCACCACCAGCTACTGGCTGAACGTGCCCTCCGCCTCGCTAGAAGAC
GACGTCAATGATAACGAACTGGTGTCGTGCGACATCGTGGAGGTGGCGGGCAGCCTCCTG
TCCGGGGGCAGCGAGTCCCTGGCGGCCGGCGTCAGGAGGCTGGCCGGCTGTCTCGATACC
AGGCCCGAGGCTCTGCCCGCCGACGACAAGAGACCCGCAGGTCACTATGTGCAATACCTA
TCAGTCTGA

Protein sequence:

MPRVVCGDGGRQQEPAGLPGAHLTATYHINTTTQNTRALRESFSKVADAFLSTDDLPPRP
RGRHWRPAAGKPQTTHDTRRSDKPDLLFFDTFSHDTSEELNLDLVQFPKSVYVREIRIIP
LGARVEGDFPGGVRLGATNPTKFHIDFFVNDLSKPGAATFEALGSLDYCQNGQIHMECGD
NAEQTRIPTDGLVLRGWYTTITLAVYGTLTQVLPDNVAAVNPQPNSQRPVSREVSALPAN
VPQSSDWNPETSNPIPAYTSNVTATNPEAYGAGNYPNPENYDNQMYRGEYYDNEAPKDPR
TYHHIDENDWEKERRDISCERDGDRMRHSPRSMELNRERREGRSRRRSLDRGLSRESSRH
RDRSRDVDRLDRLHSRSRSRDRDYVGKGEYRPLSRSRSRSLDRDWDRGSYKKDDYRRHRD
ASYDRSRGGSYEPRSPRAPSYERKPPYDKGGAYEKRLSPYDKRSSSYERRAASYDKQTPY
DRRRHSPYSRMRGSSYGSRSPSRDDPRKRPRTPPVETRRPLSPREGETTSPMNSVRSEEG
AEYDRGDRSGKQIPRIDFYHQSYRHKSSIRSPSQEVDNNYVELQHSSLVTVPIVDTTVAP
KPIESPNRNPDEEKSMDAEPFEPILSDEDICDDLDGNSYMEVEYDVNEYCGVDDIIKYYN
PFKDEWKKYERVNQICRLADVRRGSVKDVMSASFEELCDASSDLLKISEITNAAKRREQK
FSTDTFLEIDNSAREDWVHQCEQLFVSLINLCKNTDIILRVFRTDRSDSDEFSDMYHLLI
TFCRIGLNFDFVLSQQQPTYKIRHMKCGIRLAEALMCHEYSGRVMKTLLSSGIDVPLQLL
EMYSKEYMALSIRLMILKALSACLSSKEAVEHFMQDAMYPKFDKNDESPRKAMNGYQALI
AMIKANPLGRIKFSVSSLLKKLNIYEVLMKLRVLVFNFNKAAADNNHSDDSELSESDVSF
IVDSFDEILNMYRTQCFHLSQPKRFLPVAAQFEISKECSNEILLQFFDIHKVLQVCLYLL
TCPSTCNNLVITSPIHDLIFELVNSHNGLLFLYKNMEMSELLFKVLEQPFGQQTSEETAY
PHDSTLSSYSDLQIVGLELAYRLKAMYYLQSISDVQVGNNDEYKLIDRLQALYCLSFGSI
GKTAVPSVIVMGNNSECLMEVFENDLKSKNKSESPSKQKSPAKGYAIELIVSAVRFAANV
PFLKKYGQRLINVSKEHDRFEPSVSSVLQEVIPYLKPVEKPNVLSGEDLSECVEILKASL
EHSSDLPGELMTCLRILRHFCISDYEKNVSIVCENPVEEYVELKYKYNILQLFSLDGVAH
ISGILDRLATHFDQPSMHTSFFASVRGLQLAQMLLPCLSLLDEMLARVVRCRGPRFRDLT
AAPIILKTYGIAKTFPVGSIGYRTAAKAAEAAVRALLAYAQPIADDANDGDSIRRGPWTS
LCSEVISYTMTAPYTFVPGLLIFSELLPLPLPMQTKLPPTDRELADASNERRLWSAHLHA
LSNDLTDMIQVICMSTYRPVVHMLRRVCVQIADLAPNTAATVARAAVGAVMREMKEEDPP
TASLARVLGFLACLVSHAPVKCAVLHAMSAGGPKASEVQTALCNVFNLSNSSNEHAAAQE
YAAHVLAAFCDIEITLTPLTANIDSVLANSLPTKDALTAFLDATANCLESTTKTCSVASS
VLRAYYVLTDHEYGFLQFRKFVTKRTEALGKYFKWTTESAGEDRTDCLNLFLDLIKILRV
EHGEGPGGRRSVLTLSEMADMVGYTASDKEHPVMTLEKVLKERNSEEDAISNATFLISNL
KNLADEEVGTSPEVSEAATPAPEPLVAQFAARQIYCIGDSSDERLTTSYWLNVPSASLED
DVNDNELVSCDIVEVAGSLLSGGSESLAAGVRRLAGCLDTRPEALPADDKRPAGHYVQYL
SV
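
The stated CDS length and the protein sequence can be cross-checked by translating the nucleotide sequence. The following is a minimal sketch and not part of the database record. It assumes the standard genetic code (NCBI translation table 1), and the helper names join_wrapped, translate_cds and check_record are hypothetical, introduced only for illustration.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_wrapped(block):
    """Join a line-wrapped sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def translate_cds(cds):
    """Translate a CDS codon by codon; unknown codons become 'X'."""
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


def check_record(cds_block, protein_block):
    """Compare a wrapped CDS block with a wrapped protein block."""
    cds = join_wrapped(cds_block)
    protein = join_wrapped(protein_block)
    translated = translate_cds(cds)
    return {
        "cds_length": len(cds),
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": translated.endswith("*"),
        # The protein block carries no stop symbol, so strip it before comparing.
        "matches_protein": translated.rstrip("*") == protein,
    }


if __name__ == "__main__":
    # First ten codons of the nucleotide sequence listed in this record.
    print(translate_cds("ATGCCTCGTGTCGTCTGCGGCGACGGTGGC"))  # MPRVVCGDGG
```

Passing the full nucleotide and protein blocks from this record to check_record gives a quick consistency check: the CDS length should be a multiple of three, begin with ATG, and end in a stop codon that the protein sequence omits.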