New model in OGS2.0 | DPOGS210356  |
---|---|
Genomic Position | scaffold199:101-9656 (- strand) |
CDS Length | 5769 |
Paired RNAseq reads   | 3153 |
Single RNAseq reads   | 8818 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA011914 (0.0) |
Best Drosophila hit   | virilizer (3e-61) |
Best Human hit | protein virilizer homolog isoform 1 (6e-42) |
Best NR hit (blastp)   | PREDICTED: similar to virilizer CG3496-PA [Tribolium castaneum] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to virilizer CG3496-PA [Tribolium castaneum] (0.0) |
GeneOntology terms | GO:0000375 RNA splicing, via transesterification reactions; GO:0000381 regulation of alternative nuclear mRNA splicing, via spliceosome; GO:0005634 nucleus; GO:0007539 primary sex determination, soma |
InterPro families   | ND |
Orthology group | MCL15376 |
Nucleotide sequence:
ATGCCTCGTGTCGTCTGCGGCGACGGTGGCCGCCAGCAGGAGCCAGCAGGCCTGCCAGGC
GCGCATCTTACCGCCACTTACCACATCAACACAACAACACAGAACACACGCGCACTCAGG
GAAAGTTTCTCGAAAGTTGCTGACGCGTTTCTCTCGACCGACGACCTCCCTCCTCGACCG
CGCGGGAGACACTGGCGACCGGCGGCTGGCAAACCGCAAACTACACACGACACACGACGC
TCAGACAAACCGGATCTATTATTCTTTGATACATTTTCACATGATACTAGTGAAGAGCTT
AATTTAGACCTTGTACAGTTTCCGAAGTCTGTGTATGTGAGAGAAATACGAATTATTCCC
CTCGGTGCTCGTGTGGAAGGAGACTTCCCAGGTGGAGTGAGACTCGGTGCCACGAACCCC
ACAAAGTTTCACATTGACTTCTTTGTGAATGATTTGAGCAAACCTGGAGCTGCTACATTT
GAGGCTCTCGGCAGTTTAGATTATTGTCAGAATGGACAGATTCATATGGAATGCGGAGAC
AATGCTGAGCAGACAAGAATTCCTACAGACGGTCTTGTCCTTAGAGGTTGGTATACAACT
ATAACTTTGGCTGTATATGGTACCTTGACTCAAGTCTTACCTGACAATGTCGCCGCTGTG
AACCCACAGCCCAATAGTCAGAGACCGGTCTCTAGAGAAGTGAGCGCACTACCCGCGAAC
GTGCCACAGTCATCAGATTGGAATCCAGAAACCTCTAATCCCATACCAGCCTACACCAGC
AATGTCACCGCCACCAATCCTGAAGCTTATGGTGCGGGAAACTACCCGAACCCAGAGAAT
TATGATAATCAAATGTATAGAGGAGAATATTACGACAACGAGGCACCCAAAGACCCTCGT
ACATATCACCACATAGACGAGAACGACTGGGAAAAGGAGAGAAGAGATATTAGTTGTGAA
AGAGATGGAGATAGAATGAGACACAGTCCACGATCGATGGAGTTGAACAGAGAGCGCCGG
GAAGGACGGAGCAGGAGACGCTCACTGGACCGTGGATTGAGTCGAGAGTCTTCACGACAC
CGTGACAGAAGCAGAGATGTCGACCGACTTGACAGGCTTCACTCTAGATCGAGAAGTCGC
GATAGAGATTATGTTGGTAAAGGAGAGTACCGTCCTCTGAGCCGGTCAAGGTCACGCAGC
TTAGATCGTGACTGGGACAGGGGCTCGTACAAAAAAGATGACTACCGTCGTCATCGTGAT
GCCTCCTATGATCGGTCGCGCGGTGGTTCGTACGAGCCGAGGTCACCGAGAGCGCCTTCT
TACGAACGCAAACCTCCTTACGACAAAGGTGGTGCGTACGAAAAGAGGCTGTCGCCGTAT
GACAAGAGAAGTTCGTCTTACGAGCGACGAGCTGCATCCTACGACAAACAAACGCCGTAC
GATAGAAGGAGACACTCCCCTTACAGCCGCATGAGAGGATCTAGTTACGGCAGTAGGTCG
CCCAGTCGAGACGATCCCAGGAAGAGACCTAGAACTCCACCCGTGGAAACCAGGCGGCCA
CTGTCACCCAGGGAAGGTGAAACGACGAGCCCGATGAATTCGGTTCGATCTGAGGAAGGC
GCTGAATATGACCGCGGTGACCGCAGCGGGAAGCAAATTCCACGTATCGACTTTTATCAT
CAGAGCTACCGACACAAGAGTTCCATCAGGAGTCCCTCTCAGGAGGTCGATAACAACTAC
GTTGAACTTCAACACTCCAGCCTAGTGACAGTTCCCATCGTCGATACAACGGTAGCTCCG
AAACCCATAGAATCTCCGAATCGCAATCCCGATGAAGAAAAATCCATGGACGCCGAGCCC
TTCGAGCCCATTCTGTCCGATGAAGATATCTGCGATGACTTGGACGGTAATTCCTACATG
GAAGTTGAATATGACGTCAACGAGTACTGCGGAGTGGACGACATCATCAAATATTACAAT
CCATTCAAAGACGAATGGAAGAAATATGAGCGCGTCAACCAAATATGCAGGCTGGCGGAT
GTCAGGCGGGGCAGTGTCAAAGACGTCATGTCGGCCAGCTTCGAAGAACTGTGTGATGCC
AGCAGTGATCTGTTAAAAATATCAGAGATCACGAATGCTGCTAAGAGAAGGGAGCAGAAG
TTTTCGACGGATACGTTTTTGGAAATCGATAATTCTGCGCGGGAAGATTGGGTACATCAA
TGCGAACAATTGTTCGTGTCTCTTATAAATTTATGCAAAAACACCGACATAATACTGCGA
GTCTTCCGCACGGACCGATCAGATTCAGACGAATTTTCTGACATGTATCACCTCCTGATC
ACATTCTGCAGAATCGGTTTAAACTTCGATTTCGTGCTGTCTCAACAGCAGCCAACCTAT
AAGATCCGCCACATGAAATGCGGGATCCGACTCGCCGAGGCTTTGATGTGTCACGAATAC
AGCGGCCGAGTCATGAAGACGTTGCTGAGTTCCGGGATTGACGTTCCGTTACAACTGCTG
GAAATGTATTCCAAGGAATATATGGCGCTGAGTATTCGCCTCATGATATTAAAAGCTTTA
AGCGCCTGCCTTTCCTCTAAAGAGGCCGTCGAGCATTTCATGCAAGATGCCATGTACCCT
AAGTTCGATAAGAACGATGAAAGTCCTAGAAAAGCCATGAACGGTTACCAAGCTCTGATA
GCCATGATAAAGGCCAACCCACTAGGGAGGATCAAATTTTCCGTTAGTTCCCTCCTGAAG
AAACTAAATATTTATGAAGTTCTGATGAAACTGCGTGTTCTCGTGTTTAACTTCAATAAG
GCCGCCGCTGATAACAATCACTCCGACGACTCCGAACTGTCGGAGAGCGATGTAAGCTTC
ATTGTGGATTCCTTTGACGAAATTCTAAACATGTACAGAACTCAATGCTTCCATTTATCT
CAACCGAAAAGGTTCCTGCCTGTGGCTGCGCAGTTTGAAATAAGCAAGGAATGTTCAAAC
GAGATACTCCTGCAGTTTTTTGACATTCATAAAGTTCTACAAGTTTGCTTATACTTGCTG
ACCTGTCCGTCGACCTGCAACAATCTGGTCATCACGAGTCCGATCCACGATCTCATATTC
GAATTAGTCAATTCTCACAACGGGCTCCTATTTCTGTACAAGAATATGGAAATGAGTGAA
CTGTTATTTAAGGTTTTGGAACAACCGTTCGGTCAGCAGACGAGCGAAGAAACGGCGTAC
CCTCATGACAGCACACTTAGTAGTTACAGTGACCTACAAATCGTTGGACTCGAACTCGCC
TATAGGCTGAAGGCGATGTACTACTTGCAATCGATTAGTGATGTGCAAGTCGGAAATAAT
GATGAATACAAATTGATAGACAGACTACAAGCTTTATACTGCCTCAGCTTTGGAAGCATC
GGGAAGACGGCCGTGCCGAGTGTTATAGTTATGGGAAACAACTCGGAGTGTCTCATGGAA
GTGTTCGAGAATGATCTGAAAAGCAAAAACAAAAGCGAGTCGCCATCAAAACAGAAGTCT
CCGGCAAAGGGCTACGCTATCGAGCTTATCGTATCGGCCGTGAGATTTGCTGCAAATGTT
CCATTCTTGAAGAAATACGGACAAAGATTAATAAACGTGTCAAAAGAACATGACAGATTT
GAACCGAGCGTGTCCAGTGTACTTCAAGAAGTCATTCCGTACTTGAAGCCCGTGGAGAAA
CCTAACGTCCTTTCGGGAGAAGACCTCAGCGAGTGTGTCGAGATCTTGAAGGCGAGTTTA
GAACATTCGTCCGATTTACCCGGGGAGCTGATGACTTGCCTGAGAATATTACGACACTTT
TGTATATCCGATTATGAAAAAAACGTTTCCATTGTTTGCGAAAACCCGGTGGAGGAATAT
GTCGAACTGAAATACAAGTATAATATATTACAATTGTTTTCTTTGGACGGCGTCGCCCAT
ATTTCTGGTATCTTAGATCGGCTCGCTACTCATTTCGATCAACCAAGTATGCACACATCA
TTCTTCGCTTCGGTGAGAGGACTTCAGTTGGCCCAGATGTTATTACCTTGTTTGAGTCTC
TTAGACGAGATGTTAGCGAGAGTAGTTCGCTGTCGAGGCCCTAGATTTAGAGATTTAACG
GCGGCACCTATAATATTGAAGACTTACGGGATAGCCAAAACCTTTCCAGTCGGTTCTATC
GGCTACCGAACTGCGGCCAAGGCTGCCGAAGCGGCGGTGAGAGCATTACTGGCTTACGCC
CAGCCGATAGCCGACGACGCCAATGACGGAGATTCAATACGTCGTGGGCCGTGGACGTCG
CTGTGTTCCGAAGTTATTTCCTACACAATGACCGCACCTTACACATTCGTTCCCGGACTT
CTCATATTTTCGGAACTCCTGCCGCTGCCGTTGCCGATGCAAACGAAGCTCCCTCCCACC
GACCGAGAGCTGGCCGACGCATCTAACGAAAGGCGGCTGTGGTCTGCTCATTTACACGCC
TTGTCTAATGACTTGACGGACATGATACAGGTGATATGTATGTCCACGTACCGCCCGGTC
GTGCACATGCTACGGAGAGTGTGCGTTCAAATCGCAGACCTGGCTCCGAACACGGCGGCC
ACAGTGGCTCGGGCGGCTGTTGGTGCTGTCATGAGAGAAATGAAAGAAGAAGACCCGCCC
ACCGCCAGCCTCGCCAGAGTCTTAGGATTCTTGGCATGTCTCGTGTCACACGCGCCAGTC
AAGTGTGCCGTGTTGCACGCTATGAGCGCCGGAGGTCCCAAAGCTAGCGAAGTACAAACG
GCGCTTTGTAATGTTTTCAACTTGTCAAACTCGTCGAACGAACACGCCGCGGCCCAAGAG
TATGCTGCTCATGTGTTAGCGGCGTTCTGTGACATTGAAATAACTTTAACTCCTCTCACA
GCTAACATCGATAGCGTTCTCGCCAATTCATTACCCACTAAAGACGCTCTTACGGCGTTT
TTAGACGCAACGGCAAATTGTCTAGAATCTACGACGAAAACCTGTTCAGTGGCCTCGTCG
GTATTACGCGCCTACTACGTCTTAACGGATCACGAATACGGGTTCCTGCAATTTAGAAAA
TTCGTGACAAAGAGAACGGAAGCCCTCGGTAAATATTTTAAGTGGACGACGGAGAGCGCG
GGAGAAGACAGGACGGACTGCCTGAACCTGTTCCTCGACTTAATAAAGATACTGAGAGTT
GAACACGGCGAGGGCCCGGGCGGGAGGAGAAGCGTCCTGACGCTTAGTGAGATGGCAGAC
ATGGTCGGATACACGGCGTCCGACAAGGAGCATCCGGTCATGACACTAGAAAAAGTTCTC
AAGGAAAGGAACTCCGAAGAAGACGCGATATCAAACGCGACCTTCTTGATCTCGAATCTG
AAAAACCTCGCGGACGAAGAGGTCGGGACTTCCCCGGAGGTCTCGGAAGCTGCGACGCCG
GCCCCCGAACCCTTGGTGGCTCAGTTTGCGGCCCGACAGATTTATTGTATCGGAGATTCA
AGCGACGAACGGCTCACCACCAGCTACTGGCTGAACGTGCCCTCCGCCTCGCTAGAAGAC
GACGTCAATGATAACGAACTGGTGTCGTGCGACATCGTGGAGGTGGCGGGCAGCCTCCTG
TCCGGGGGCAGCGAGTCCCTGGCGGCCGGCGTCAGGAGGCTGGCCGGCTGTCTCGATACC
AGGCCCGAGGCTCTGCCCGCCGACGACAAGAGACCCGCAGGTCACTATGTGCAATACCTA
TCAGTCTGA
Protein sequence:
MPRVVCGDGGRQQEPAGLPGAHLTATYHINTTTQNTRALRESFSKVADAFLSTDDLPPRP
RGRHWRPAAGKPQTTHDTRRSDKPDLLFFDTFSHDTSEELNLDLVQFPKSVYVREIRIIP
LGARVEGDFPGGVRLGATNPTKFHIDFFVNDLSKPGAATFEALGSLDYCQNGQIHMECGD
NAEQTRIPTDGLVLRGWYTTITLAVYGTLTQVLPDNVAAVNPQPNSQRPVSREVSALPAN
VPQSSDWNPETSNPIPAYTSNVTATNPEAYGAGNYPNPENYDNQMYRGEYYDNEAPKDPR
TYHHIDENDWEKERRDISCERDGDRMRHSPRSMELNRERREGRSRRRSLDRGLSRESSRH
RDRSRDVDRLDRLHSRSRSRDRDYVGKGEYRPLSRSRSRSLDRDWDRGSYKKDDYRRHRD
ASYDRSRGGSYEPRSPRAPSYERKPPYDKGGAYEKRLSPYDKRSSSYERRAASYDKQTPY
DRRRHSPYSRMRGSSYGSRSPSRDDPRKRPRTPPVETRRPLSPREGETTSPMNSVRSEEG
AEYDRGDRSGKQIPRIDFYHQSYRHKSSIRSPSQEVDNNYVELQHSSLVTVPIVDTTVAP
KPIESPNRNPDEEKSMDAEPFEPILSDEDICDDLDGNSYMEVEYDVNEYCGVDDIIKYYN
PFKDEWKKYERVNQICRLADVRRGSVKDVMSASFEELCDASSDLLKISEITNAAKRREQK
FSTDTFLEIDNSAREDWVHQCEQLFVSLINLCKNTDIILRVFRTDRSDSDEFSDMYHLLI
TFCRIGLNFDFVLSQQQPTYKIRHMKCGIRLAEALMCHEYSGRVMKTLLSSGIDVPLQLL
EMYSKEYMALSIRLMILKALSACLSSKEAVEHFMQDAMYPKFDKNDESPRKAMNGYQALI
AMIKANPLGRIKFSVSSLLKKLNIYEVLMKLRVLVFNFNKAAADNNHSDDSELSESDVSF
IVDSFDEILNMYRTQCFHLSQPKRFLPVAAQFEISKECSNEILLQFFDIHKVLQVCLYLL
TCPSTCNNLVITSPIHDLIFELVNSHNGLLFLYKNMEMSELLFKVLEQPFGQQTSEETAY
PHDSTLSSYSDLQIVGLELAYRLKAMYYLQSISDVQVGNNDEYKLIDRLQALYCLSFGSI
GKTAVPSVIVMGNNSECLMEVFENDLKSKNKSESPSKQKSPAKGYAIELIVSAVRFAANV
PFLKKYGQRLINVSKEHDRFEPSVSSVLQEVIPYLKPVEKPNVLSGEDLSECVEILKASL
EHSSDLPGELMTCLRILRHFCISDYEKNVSIVCENPVEEYVELKYKYNILQLFSLDGVAH
ISGILDRLATHFDQPSMHTSFFASVRGLQLAQMLLPCLSLLDEMLARVVRCRGPRFRDLT
AAPIILKTYGIAKTFPVGSIGYRTAAKAAEAAVRALLAYAQPIADDANDGDSIRRGPWTS
LCSEVISYTMTAPYTFVPGLLIFSELLPLPLPMQTKLPPTDRELADASNERRLWSAHLHA
LSNDLTDMIQVICMSTYRPVVHMLRRVCVQIADLAPNTAATVARAAVGAVMREMKEEDPP
TASLARVLGFLACLVSHAPVKCAVLHAMSAGGPKASEVQTALCNVFNLSNSSNEHAAAQE
YAAHVLAAFCDIEITLTPLTANIDSVLANSLPTKDALTAFLDATANCLESTTKTCSVASS
VLRAYYVLTDHEYGFLQFRKFVTKRTEALGKYFKWTTESAGEDRTDCLNLFLDLIKILRV
EHGEGPGGRRSVLTLSEMADMVGYTASDKEHPVMTLEKVLKERNSEEDAISNATFLISNL
KNLADEEVGTSPEVSEAATPAPEPLVAQFAARQIYCIGDSSDERLTTSYWLNVPSASLED
DVNDNELVSCDIVEVAGSLLSGGSESLAAGVRRLAGCLDTRPEALPADDKRPAGHYVQYL
SV
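
Consistency check (illustrative): the CDS length of 5769 nt listed above corresponds to 1923 codons, that is 1922 residues plus a terminal stop codon (TCA GTC TGA at the end of the nucleotide sequence, matching the final SV of the protein). The sketch below shows one way such a check could be scripted. The function name `cds_summary` and the toy input are hypothetical and not part of this record, and the standard stop codons TAA/TAG/TGA are assumed.

```python
# Minimal sketch: basic sanity checks on a coding sequence (CDS).
# Assumption: standard genetic code stop codons; names here are illustrative.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def cds_summary(cds: str) -> dict:
    """Return simple consistency facts about a CDS given as (possibly line-wrapped) text."""
    seq = "".join(cds.split()).upper()            # drop line breaks and spaces
    usable = len(seq) - len(seq) % 3              # ignore a trailing partial codon
    codons = [seq[i:i + 3] for i in range(0, usable, 3)]
    in_frame = len(seq) % 3 == 0
    ends_with_stop = in_frame and bool(codons) and codons[-1] in STOP_CODONS
    return {
        "length": len(seq),
        "in_frame": in_frame,
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": ends_with_stop,
        # stop codons anywhere before the last codon would truncate the protein
        "internal_stops": sum(c in STOP_CODONS for c in codons[:-1]),
        # residues encoded, excluding the terminal stop codon if present
        "residues": len(codons) - (1 if ends_with_stop else 0),
    }


if __name__ == "__main__":
    # Toy input (hypothetical): start codon, two sense codons, stop codon.
    print(cds_summary("ATGCCT\nCGTTGA"))
    # Arithmetic for this record: 5769 nt -> 1923 codons -> 1922 residues + stop.
    print(5769 // 3 - 1)
```

To apply it to this record, pass the full nucleotide sequence as one string; line breaks are ignored, so the 60-column wrapping can be left in place.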