New model in OGS2.0 | DPOGS203686  |
---|---|
Genomic Position | scaffold120:- 146204-153351 |
See gene structure | |
CDS Length | 2604 |
Paired RNAseq reads   | 2242 |
Single RNAseq reads   | 5509 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA003480 (0.0) |
Best Drosophila hit   | CG7843, isoform C (3e-111) |
Best Human hit | serrate RNA effector molecule homolog isoform e (1e-93) |
Best NR hit (blastp)   | arsenite-resistance protein, putative [Pediculus humanus corporis] (0.0) |
Best NR hit (blastx)   | arsenite-resistance protein [Aedes aegypti] (0.0) |
GeneOntology terms    | GO:0005654 nucleoplasm GO:0031053 primary microRNA processing |
InterPro families    | IPR012677 Nucleotide-binding, alpha-beta plait IPR007042 Arsenite-resistance protein 2 IPR021933 Protein of unknown function DUF3546 |
Orthology group | MCL14206 |
Nucleotide sequence:
ATGGCTGATAGCGACGACGAGTATGACCGCAAGAGACGCGATAAATTCCGTGGTGAAAGA
GGAGCAGCAGAAGGCAGCAGTTATCGAAACAGCGATAGACGAGAGGAACGAGGACGCGGT
AGAGAGGAATGGTCAGAAAGGTCTCGTGGAGGTAGATCAGGACCCGACTACAGAGATTAT
AGAGGTGGAGCAAGTGCAAGTCGTGGGTATTCTCCAGTTAGAGGTGAAGGGCCTCCTAGC
AAACGAATCAGACCAGACTGGCCAGTTGATGATAGAAGATATGGCGGTATGCCTCATGAC
TCATATGGCTCATATGGTTGGGCCCACGACCACTTTGGACCGCATCCTGCTCATCAAGGA
TATGGTCAACCAATGCCTCCTGTACCAGCCAGAGATGCTGTTCTGCCGATGGGACCAACT
GATGGGCCACCTTCAATGATGTCATTCAAGGCTTTCCTGGCTGCTCAAGACGATGCTATC
ACCGTTGATGACGCTATACAGAAGTATAATGAATACAAACTTGAATTTAGGAGGCAGCAA
TTAAATGAATTTTTTGTAGCACACAAAGATGAGGAGTGGTTCAAGATAAAATACCATCCA
GAGGAATCGGTGAAACGTAAAGAGGAGCAACTAGCTGCGCTTAAGAACCGACTTAACGTA
TTTCTTGAGCTCCTGGAACAACGCGAATTGGACAAAGTATCAGTGGATGTTGACAAGTCG
GACAAGTTGATACGTTTACTTGACACTGTTGTTATTAAATTAGAAGGCGGAACAGAGGAA
GATCTTAAAGCTCTCGACGAACCCAACCCAGCGGAAAATGCAAATGACAAACAAGATAAG
AATGACACGAATAAGGCAATTGTAATTGAAGACGATGCGGTCAAAGAAATTAAGGATGAG
AAAGACACCGAACAGAACAAGGATAAGATTAATTCAACTGAGGAAAAAAAGGAAGATTCT
CCAAAAAAAACCGCACCGCTTACGATGGAGATCGACCCTCACCTCCGTCAGCTACAAGAA
CAGGCCAAACTTTTTTCTCGTTACAACAGTGTGCCTGGAACGGAATCGGAACAAGTTGTA
CCCGAGAAAGAACCTCCTCCGCCAGGTTCTTCATCGAGCTCATCATCATCGAGTTCATCG
TCATCAAGTTCGGAAGACGAAGGGGAAACCGGAACGCGCAGGAAATCAAAATCCAAGTCT
AAATCCAAAACTCCCGACAAGTCGCCGAAACAAAAGGAACGGACCGCGTCACCGAGCGCA
GAGAAAGTCGTCGAAATAAAAGACAAGGAGGCGAGCAATGATAATAATGAAACTTCCATT
GATGTGACGGAGAAGAAAGAATCCAGGGCTCTCCATAAAACAACTTCCATTTTCTTAAGA
AATCTAGCTCCAACGATCACGAAGGCTGAAGTAGAAGCTATGTGTAAACGTTACGGTGGG
TTCCTGCGCGTGGCGCTCGCAGACCCACTGCCCGAAAGACGATGGTTCCGCAGAGGTTGG
GTCACGTTCCGACGAGAGGTCAACATCAAGGACATCTGCTGGAATCTTAATAATATAAGG
CTTCGCGAGTGTGAGTTGGGTGCGATAGTAAACCGCGATCTTCAGCGTCGTATCCGAGCT
GTCTCCGGCGTCACTTTGGAGCGAGCGGTGTTGCGGGCTGACGCTAGACTCGCGGCTAGA
CTCGCACACCATCTAGACACTAGGTCTAGACTGTGGGACGGACCTGGTGAAGATGGACCG
CAGACTGAGAACTTCAGTTTGAGCTCCAAAAACCCGGTGCTACATAAGATAACGGAACAT
CTCATAGAAGAGGCTTCAACGGAGGAAGAAGAGTTGCTCGGTCTGGAGGCGTCGTCGGAG
GCCGCGGCGCATGAACAACCGGATCCGGAACTCATCAAAGTGTTGGACCGCCTAGTGTTG
TACCTTCGTATTGTACACTCTGTGGACTATTACAATCATTGTGAATATCCATACGAGGAC
GAGATGCCAAACCGTTGTGGTATTATGCATGCGCGCTCCGGACCTCCTCCTAACAAGCCC
ACTCAGCAGGAGATCCAAGATTATATTAAAACTTTCGAAGGTAAAATGTCAGCTTTTCTG
CAAGATGTCAAACCGCTGACAGACGAAGAGCTGCAGAAACTAGGAATTAAGGACTCCGAG
GCAGAAGTAGAAAAGTTCATTCAAGCTAACACTCAAGAGCTGTCTCAAGACAAATGGTTG
TGCCCACTCAGCGGTAAAAAGTTCAAGGGACCAGATTTCATAAGAAAGCACATCTTCAAT
AAACACGCTGAAAAGGTGGATGAGGTCCGCCGCGAGGTGTCGTACTTCAACGCGTACGTT
AAAGACGTGCGACGTCCCCAACAACCCGAGCAACCGGCTCGCGCCGCGCCACAACCCGTG
CACGCGCCGCCGGCACATCCATATAGTGGAGCTGGTGGAGCAGGCGGTGGTCGCGGCTGG
GGATGGGGCGGCTGGGCACCACCCGCGCCTTACATGCCCAGACACCCGCGGTTCTCGAGA
CCCAGGGCCGGTGCCGCGGAGTTCCGTCCGGTGATACACTATCGCGACTTGGACGCGCCG
CGGGAACCCGACGAGTTCATTTAA
Protein sequence:
MADSDDEYDRKRRDKFRGERGAAEGSSYRNSDRREERGRGREEWSERSRGGRSGPDYRDY
RGGASASRGYSPVRGEGPPSKRIRPDWPVDDRRYGGMPHDSYGSYGWAHDHFGPHPAHQG
YGQPMPPVPARDAVLPMGPTDGPPSMMSFKAFLAAQDDAITVDDAIQKYNEYKLEFRRQQ
LNEFFVAHKDEEWFKIKYHPEESVKRKEEQLAALKNRLNVFLELLEQRELDKVSVDVDKS
DKLIRLLDTVVIKLEGGTEEDLKALDEPNPAENANDKQDKNDTNKAIVIEDDAVKEIKDE
KDTEQNKDKINSTEEKKEDSPKKTAPLTMEIDPHLRQLQEQAKLFSRYNSVPGTESEQVV
PEKEPPPPGSSSSSSSSSSSSSSSEDEGETGTRRKSKSKSKSKTPDKSPKQKERTASPSA
EKVVEIKDKEASNDNNETSIDVTEKKESRALHKTTSIFLRNLAPTITKAEVEAMCKRYGG
FLRVALADPLPERRWFRRGWVTFRREVNIKDICWNLNNIRLRECELGAIVNRDLQRRIRA
VSGVTLERAVLRADARLAARLAHHLDTRSRLWDGPGEDGPQTENFSLSSKNPVLHKITEH
LIEEASTEEEELLGLEASSEAAAHEQPDPELIKVLDRLVLYLRIVHSVDYYNHCEYPYED
EMPNRCGIMHARSGPPPNKPTQQEIQDYIKTFEGKMSAFLQDVKPLTDEELQKLGIKDSE
AEVEKFIQANTQELSQDKWLCPLSGKKFKGPDFIRKHIFNKHAEKVDEVRREVSYFNAYV
KDVRRPQQPEQPARAAPQPVHAPPAHPYSGAGGAGGGRGWGWGGWAPPAPYMPRHPRFSR
PRAGAAEFRPVIHYRDLDAPREPDEFI