New model in OGS2.0 | DPOGS212314  |
---|---|
Genomic Position | scaffold605:- 25774-32958 |
See gene structure | |
CDS Length | 2238 |
Paired RNAseq reads   | 1798 |
Single RNAseq reads   | 4346 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA012037 (7e-114) |
Best Drosophila hit   | ND |
Best Human hit | proline-, glutamic acid- and leucine-rich protein 1 (2e-11) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL014265 [Aedes aegypti] (3e-22) |
Best NR hit (blastx)   | hypothetical protein AaeL_AAEL011984 [Aedes aegypti] (6e-30) |
GeneOntology terms   | GO:0071339 MLL1 complex |
InterPro families   | ND |
Orthology group | MCL16699 |
Nucleotide sequence:
ATGGAACAAATATTTAAAAGAGTTCGGCATGTTGATCCAAATAACAGTGATGCTGTGAAA
GAAGTACTGTCAAACTTTTTCCAAAATTTGCCGAAGCGCAATGATATTAACAATAAAAAG
TTTTTGGACGAGTTATTGATTATAATTAACCGGTATCCTAAATATTGTATGTCATATCGT
AATATTATTGAGTCATTTATAAGTAATTACTTAAGCTCCAACAATTACTACAATGTTATT
AAAGCAGCTAAGTGTGCACATGCACTACAACAGGTTCGTCCTATTCAAGATAAGACAGCA
ACGCCGAAATCTTGTTGGCGGCAGCAGATGAACTCATTGTGTAATGCTGCACACTCACTC
ATAGAAGTGATATTTGCTGATGCTGTCGACATTTACAGAAGTAATGCTAAGCCTACAGAG
AGTCATTCAAATTCTCCGTTATCAACTATACTGGCAAATATCGTTAAGAAGTCCCAAGAC
AAAGACAGATCTCAGTTGATGATGACAAGACTCAAGAACGTATTCACATTCATACAGGCT
ATGTTGGTAGAAATATATCCTGTACCGAAACCAATCCAGCCTCGTTCAATACTAGATGTG
ATAGTGAGGTCTCTGAGTGTGAGCAGTCAACACGTCTCTCTGGACGTAGCCTCTGTTAAA
GTGCAGGCCTTGAAGACGTTGGATGCTATGATTCTGTGTCTGGGATCAAACCTGATACCA
TATTCACCTCTAGTCTTTAGACTTGCGACGCAGACTTTGAGATGGACCTCGGACAACATG
AGCAGAACTACCGGCAAAGTTCGTTGCACAGCGTACAGTACGCTGAGCAAGTGGCTTCTA
ACACTGCACATTTATAAGATGCCAGAAAAAAATACCTGGGAAGATGATCTGACGGCGCAT
GTCGTGAGGGACGTGACCCCGGCGAAGAGAGTTGTGGCGCTTACGATGGGGCCGCAGCCG
ACTAAAAATTTAAGCAAAAAAGCCAAAAGGAAATTAGCTAATTCTCAACTGTTGCAAAGC
TCAATCGCCGCTCACATGCCCGGGGAGAAAAATAAGATTGATATTCCAGAGGAAGTGAAC
AATGAGGTCACGGTATCAGCCTTGCAGTTCGCAGAAGTCTTCTTCACTGTTTGTGGAAGA
TTCCTCAAACCGGCCACACATAAGTTATTCCAAGAGCGTGTCATCCGCCATCACTCAGCA
GGTGAGACGTTGCTGTACCTGCGAGTGCTGGAGGCGAGTCGCAAGACGACGCCGGCGACC
GTGGCGCCACCGACACAGTACTGTCTTCATATATACAGTACGCTTGTGAACAGCTCCGAC
GCCGAGATATCAAAATTCTGTAGCCAAGCTCTACTGGATATAAGACTGCACCTATATTGC
TCGCCGCCGTCCATCAACCTGGCTATAGAAATACCTCAAGATGAAGAGGAAACGGCGAAT
AAAAGGAAGAAGGTCTCGTCCAAAAACAGGGCCATGTTAGAGTATCTATTAGGGCCGGAT
AAAGTGCCCCGAGATAAAGAAGACGATATTATAACGATTCCAGACGAACCGTCGAATAAG
AAACAACGTGTCGACGAATTGGATAGAATAAGTCTAAGCAGCGATTCCACCAGCACTGTT
AAGATACCGTACGGAACAGACATCAGCTCAGACTCGGACGGAGATAACGTCATGGAGGTC
GACGTAGTCGTCGAAATGAATCACACCACAGGCAGAGAGAGAGTTCTAGAAGCCGCCGAC
GTTCCGAAGATATCCATAAACGACGAAAAGCAATCATCCGATCAAATAAGTCTAAATGAT
ATAACAAACGATGAGAGCAACCAGGCAGACGCGATTACTAGTGAAGACGTCTCTGATGTT
ATACATGAAGCGCCTACACAACTGAACACATCAAGTGGCGCCCCCCAAGTAGTGTACGAC
CATCCGGATACAGGAACCGGCGACGTCACAGTCCTGGAGAGGATTGACGACGAAAATATA
CCAAACACGAACGATACGGACGAAGATGCGATAACTTGCGGGCAAATCGTACGAAGCTCG
CAAGAAATTGTCAACGGAAATAATGAGCCGGAAGTCAATGGTGTTGATAAAATTAATGAG
GACAGCGATGTGTATAATATTACCACGAAAGATATAAATAAGGGGGAAGATAATCTTGCT
GCCAAAATAGATGGAACCAGTGTGGAAGATATGATGGCGGATTTCGTTGATGAAGTAAAT
GAAGCTGTAGCTGTGTAA
Protein sequence:
MEQIFKRVRHVDPNNSDAVKEVLSNFFQNLPKRNDINNKKFLDELLIIINRYPKYCMSYR
NIIESFISNYLSSNNYYNVIKAAKCAHALQQVRPIQDKTATPKSCWRQQMNSLCNAAHSL
IEVIFADAVDIYRSNAKPTESHSNSPLSTILANIVKKSQDKDRSQLMMTRLKNVFTFIQA
MLVEIYPVPKPIQPRSILDVIVRSLSVSSQHVSLDVASVKVQALKTLDAMILCLGSNLIP
YSPLVFRLATQTLRWTSDNMSRTTGKVRCTAYSTLSKWLLTLHIYKMPEKNTWEDDLTAH
VVRDVTPAKRVVALTMGPQPTKNLSKKAKRKLANSQLLQSSIAAHMPGEKNKIDIPEEVN
NEVTVSALQFAEVFFTVCGRFLKPATHKLFQERVIRHHSAGETLLYLRVLEASRKTTPAT
VAPPTQYCLHIYSTLVNSSDAEISKFCSQALLDIRLHLYCSPPSINLAIEIPQDEEETAN
KRKKVSSKNRAMLEYLLGPDKVPRDKEDDIITIPDEPSNKKQRVDELDRISLSSDSTSTV
KIPYGTDISSDSDGDNVMEVDVVVEMNHTTGRERVLEAADVPKISINDEKQSSDQISLND
ITNDESNQADAITSEDVSDVIHEAPTQLNTSSGAPQVVYDHPDTGTGDVTVLERIDDENI
PNTNDTDEDAITCGQIVRSSQEIVNGNNEPEVNGVDKINEDSDVYNITTKDINKGEDNLA
AKIDGTSVEDMMADFVDEVNEAVAV