New model in OGS2.0 | DPOGS206291  |
---|---|
Genomic Position | scaffold1055:+ 12598-21901 |
CDS Length | 3345 |
Paired RNAseq reads   | 1541 |
Single RNAseq reads   | 4027 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA010803 (0.0) |
Best Drosophila hit   | CG6511 (2e-157) |
Best Human hit | erythroid differentiation-related factor 1 (6e-99) |
Best NR hit (blastp)   | PREDICTED: similar to conserved hypothetical protein [Nasonia vitripennis] (0.0) |
Best NR hit (blastx)   | PREDICTED: similar to CG6511-PA [Apis mellifera] (0.0) |
GeneOntology terms | GO:0045449 regulation of transcription; GO:0005634 nucleus; GO:0006350 transcription; GO:0005575 cellular_component; GO:0008150 biological_process; GO:0003674 molecular_function |
InterPro families   | ND |
Orthology group | MCL14432 |
Nucleotide sequence:
ATGGATGACGTCGAGAATGAAAATGAGTTCAATCGAAACAAGGGGCGAAGTCCATCTCCA
GGTGTTAAATCAACAGCGGTTGTAAAATATACAGCATTCCAGACTCCTGCTAGTTATGCC
AGGTTACAGTGCAACACAGATATCAACCTCCCTCCATCCAACTGGGGTGGGATAGACACT
TACGGCTTGAAGCAGATACTCACAAGAGATTCGGGACTTTCAAGCTTCAGAATGGCTCAC
ATGTTCCCCGACTGTGTCGGAGAGGTGGACGTTATATCTGATGCAGATTGCATAAAGAAT
CTCCTCAAGCTACCCTACCAACCTAATGGAACTGTTAGTATGATGGTGCATAGAGTTGAA
AATACATTGCTATTGGACGACTTTGATGTCTACGAATACCTGATGAAGTCGGAGTGGTCC
TGGTTAAAGGATTTCTTCTACGAGAATGTACTGAAGACTATGTCGGAAAAGGATCGTATC
TCTCTGACGTCATCGGCTAGTAGGAGTGCCCTCCAGCTGACGCACAAGTTCCTATCCCAC
AGTGTGGTGGCGCCACCTCTGCCAGCCAGCCAGCCCTGCCAGCCGATATGTTTACCAGGA
CCATTTCTCCCGGAGCCGGAGACTCGTCCGGAAGAACCAGCTAAAGAACAGAGTTTCAAT
AGGAACGTGCTCTGGACATTCGAAGACATACACATGTTGATAGGTAGCAACCTACCTATT
TTCGGCGACAAGGACAGGCCTTGCGTCAGTTTGCGTTTACGAGACGCGAGGGAACCGATA
AACGTACTCACGGGCATCGACTACTGGCTGGACAACCTCATGTGCAACGTCCCTGAGGTT
CTGATGTGCTACCACTTGGACGGTATCGTGCAGAAATACGAGCCAATGAAGACAGAAGAT
TTGCCGCACATGGAGAACTCCAAGTTCTCGCCAAAGGTTATAAGGAATGTCGCACAGAAT
ATTCTATCGTTTTTAAAATCTAACGCGACTAAGGCCGGTCACACATACTGGTTATTCAAA
GGCCCCCACGACGATGTTGTCAAGTTGTACGACCTGACGACTCTTTGCCCCGATGACATG
GACAATCCCTTCACGACGCCCGTGGCGATGCTATTGTACAGGGTGGCCAGGAACATGAGG
ATGATGAACAGGTCCAAACATGTCCGGCAGCTGCTGGAACACGTCGTGGAATTGCTCGGA
AGCGAGAGATACCCTCAGATTGTAGCGTCCTCGCATTATATGCTGGCTGATCTGTATGTA
CCCGCCACCACAAACCCGGCACATCCAGATTTCAAAGACGAAAGCTCGGACTCCGAAGAG
GAAGCTGAGTTTGGTAACTACGCGGAGTGCCCCTCGGCCGACAGGGGCAGACGGACAGAC
AAGGACGACGAGATTGTACGAGACGTCACTAATGACGATAAGTGCGAGGGAGATGGAAAT
ATAAACCGGGAAGAGTGCGAGCGAGACGGCGATAGTGCTGGCGAACTCACTCTCCGAGTA
CGGGGTCTGGCGTTAAGAGACATCGGCGATAGGCAGACACACGATACTACGAAGAAAACT
AAGAGATCCACTACAGGACTGGGCATCGAACCCGCTACCAGATGTGGGCGGGCGCTGAAA
CACGCGCTCACCGGACTCAAGGCTCTACATCATCTGACCATAGATAAATCTATGGAAGAG
GAAAGAGAACGTCTGAGACAGCAGAAGATCAAAGAGGAACAACATCCGAAAATGGCCAAT
CCTTACGAACCCATCAGAATGGGCTACAAGACGTCCAAGCTAAAAGATAAAGAGCACACC
TCGAGGAGCAGGCGTCGGCGGACGAGGCGGAACTCGTCTAACCACATAGAGACCAACTCG
AACGTAGACAAAAACGCTATTTTAGTGCGGAAAGAGAACACGATAACCCTGCAAGAACCG
AACCGGGACGACAACTTCGCCTGGAAACTACATCTGAAGACGCTGCTGTATGAAAAAATA
TGCCTCGCGTACGCCACGCTAGCCGAATACAGCTACTCACACGAGCAGTACGGCTTCTCC
TTGAAGTATATAGATCTGGCCAGCAAATGCCAGAAGCTGTTGAGCAATATGATCATCAAG
AGTCGCGTGGTGGACGCCAGCTGTCTCATAGGCAGGGTCGGCGACAACTACTTCCAACTG
AGCAAACACTGGCCCAGCTTGGACCAGTACAGCAAACAGTTCGGCATGGACCACGAGATC
GATAGGGAGATAAGAAACGAGATAGAGAGCGATATGGCGGAAGAAATGGAGGGCTTCGGG
GGAGATGAGTTTGAACTGGAGATATACATGTCGTCGTTGGACACTTCGGATACGTTGCCG
GAGGAGTTTCGTCATCTGTCCAAAAAAGCTGCCGAATATTTGGACGAGGCTACTGAAATA
TTCCAACACGTGAACGACGTCCCCAACCTGGCCTTGTTGTACTGCAATAAGGCCAGATAC
ATGAGGTTCAAAGTTCACTGCGACAAAGGAGTTTTCGATGATGAGAAGCGTCGGACTTAT
AATTCAGCCGAAGAGCTGTACTCCCAGGCGCTCAGGCTGGTGGGATCTCGGGAGGCAGCT
ATCAAAGACCTGGTATGCTGGGAATTGTCCTGCCACCTGTACACTAGGGCTGTACTCTTG
CAGGACCATCCGGAAATCTATGCTAGCGAAGTTACAGAAGTAGCGGAGGCTTTCAAACAC
GCTCTGAAGCACTGCCTGTTGAGTCCGGGCCCGAGACAGTACTTGTATCAATTCAGAGCC
GCTATGATATATCACCGGCTGGGATCGCTGTACCATTCACAGTACAGGAAGAGCCAAGAC
CCCTCCATCCGCCGGCGCATGTTATCCGCGACGTGTTCCCACTATGAGAAGGCGGCTCTC
CAATTCGCCTCCCTCGAGGACCCCGCCATGTTCCTCACAGCACGACTCGAGCACATTGCA
GCATTAGAGGCACACGCCGCAGTGTCGCCGAATCTGAAGTTGAAGTCGCTTCAAAACGCA
ATAGACTTACTTCGTCAGTGTCACTCAATAATGAAGCTGTTAAAAGATAGAGATCCGGAC
GAAAAGAAAGAGAAAGATAAACCAGAAGACGGCGACGAGAAAAGTCTGAAAAACGAACAT
AGTTTACTGAGCTTATACGAGAATAGGCTTCACTATATTTTGAAAAGTATCATACAATAC
TGCAGATCGAAGTCCAACAAAGACTATGACAAGATGACAGAGATGTACAAGAAGCTGTAC
AGCGCGTCCCTGAAGATAAGGAGAGACGAGGACGTGCGGCTGTACGCGGCCAGTGTGTGC
GACGTGCTCGCGGCCATGGACAGCATCATAAGCGAGTTCCAGTAG
Protein sequence:
MDDVENENEFNRNKGRSPSPGVKSTAVVKYTAFQTPASYARLQCNTDINLPPSNWGGIDT
YGLKQILTRDSGLSSFRMAHMFPDCVGEVDVISDADCIKNLLKLPYQPNGTVSMMVHRVE
NTLLLDDFDVYEYLMKSEWSWLKDFFYENVLKTMSEKDRISLTSSASRSALQLTHKFLSH
SVVAPPLPASQPCQPICLPGPFLPEPETRPEEPAKEQSFNRNVLWTFEDIHMLIGSNLPI
FGDKDRPCVSLRLRDAREPINVLTGIDYWLDNLMCNVPEVLMCYHLDGIVQKYEPMKTED
LPHMENSKFSPKVIRNVAQNILSFLKSNATKAGHTYWLFKGPHDDVVKLYDLTTLCPDDM
DNPFTTPVAMLLYRVARNMRMMNRSKHVRQLLEHVVELLGSERYPQIVASSHYMLADLYV
PATTNPAHPDFKDESSDSEEEAEFGNYAECPSADRGRRTDKDDEIVRDVTNDDKCEGDGN
INREECERDGDSAGELTLRVRGLALRDIGDRQTHDTTKKTKRSTTGLGIEPATRCGRALK
HALTGLKALHHLTIDKSMEEERERLRQQKIKEEQHPKMANPYEPIRMGYKTSKLKDKEHT
SRSRRRRTRRNSSNHIETNSNVDKNAILVRKENTITLQEPNRDDNFAWKLHLKTLLYEKI
CLAYATLAEYSYSHEQYGFSLKYIDLASKCQKLLSNMIIKSRVVDASCLIGRVGDNYFQL
SKHWPSLDQYSKQFGMDHEIDREIRNEIESDMAEEMEGFGGDEFELEIYMSSLDTSDTLP
EEFRHLSKKAAEYLDEATEIFQHVNDVPNLALLYCNKARYMRFKVHCDKGVFDDEKRRTY
NSAEELYSQALRLVGSREAAIKDLVCWELSCHLYTRAVLLQDHPEIYASEVTEVAEAFKH
ALKHCLLSPGPRQYLYQFRAAMIYHRLGSLYHSQYRKSQDPSIRRRMLSATCSHYEKAAL
QFASLEDPAMFLTARLEHIAALEAHAAVSPNLKLKSLQNAIDLLRQCHSIMKLLKDRDPD
EKKEKDKPEDGDEKSLKNEHSLLSLYENRLHYILKSIIQYCRSKSNKDYDKMTEMYKKLY
SASLKIRRDEDVRLYAASVCDVLAAMDSIISEFQ
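
The nucleotide and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `CODON_TABLE` and `translate` are illustrative and not part of the record. It translates only the first and last lines of the nucleotide sequence as a spot check.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0; stop codons become '*'."""
    cleaned = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cleaned) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(
        CODON_TABLE[cleaned[i:i + 3]] for i in range(0, len(cleaned), 3)
    )


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGATGACGTCGAGAATGAAAATGAGTTCAATCGAAACAAGGGGCGAAGTCCATCTCCA"
last_line = "GACGTGCTCGCGGCCATGGACAGCATCATAAGCGAGTTCCAGTAG"

print(translate(first_line))  # MDDVENENEFNRNKGRSPSP
print(translate(last_line))   # DVLAAMDSIISEFQ*

# A CDS length of 3345 nt corresponds to 1115 codons: 1114 residues plus the stop.
print(3345 // 3 - 1)          # 1114
```

Both fragments agree with the start and end of the protein sequence listed above, and the trailing `*` corresponds to the terminal `TAG` stop codon. Passing the full nucleotide sequence to `translate` would allow the same comparison over the entire protein.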