New model in OGS2.0 | DPOGS201250  |
---|---|
Genomic Position | scaffold687:+ 2992-4710 |
CDS Length | 1719 |
Paired RNAseq reads   | 1032 |
Single RNAseq reads   | 2742 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA014340 (0.0) |
Best Drosophila hit   | CG11526, isoform B (1e-134) |
Best Human hit | hypothetical protein LOC85369 (4e-156) |
Best NR hit (blastp)   | hypothetical protein AaeL_AAEL007865 [Aedes aegypti] (0.0) |
Best NR hit (blastx)   | hypothetical protein TcasGA2_TC010759 [Tribolium castaneum] (0.0) |
GeneOntology terms    | GO:0005515 protein binding; GO:0005634 nucleus; GO:0003674 molecular_function; GO:0008150 biological_process |
InterPro families    | IPR021819 Protein of unknown function DUF3402; IPR012486 N1221-like |
Orthology group | MCL11637 |
Nucleotide sequence:
ATGAAAAAAGTGCTTTTACTGCTCTGGAAGATATTACTAGTGTTTCTTGGTGGTTCTAAA
GAATTGAAAGAACGTAAAGCTAGATTGCGTGAAAAGTATGGTTTACCACCTTGTACTGAA
GATACCCTTGAAATAATAAAGGGCATGCGTTCAAGTTCTCCACCACCGAGTGCAGCTGAC
ATGTTGGAAAACCAGAATCCTGGGCAGAGGAAGTTAAAAGAGAATGAAAGAGCTCTTAGG
CGACAGACATTTATGAAACAAACTTCTCTTGATGAATCTGATGAACAAATCTTCGTAGAC
AAAGAGGAAACCAGTAACGGATCTGAAGATTACTCTATGGAATTTCAGTCAATGTCAGCT
GAGGGAGTTCAGAATGCCAACACAAACCAGAACTGCCCATACTACATGTATATAAAAAGA
TTAGACAGTCCCCCACCCCCTCCACCACTCCCTAGAAGCCTGCCCTGGAGATCAAAAGTC
CGACAAAAAGACATAGACACATTTCTTGATAATGTAAGAGTGAAATTTGTAGGTTACTCG
CTGCCTGGTGATAGACAAACCATTGCTGGCCTTCCACAACCTATACATGAAGGAATCGAG
ATATTAAAAAAGCATACATACACAAGCCTCGCTGAGGTTCAGGCCGAGAGGGAGCTGGAG
ATTATCCGCAGCCCTCTCACGAAAGGCGAAAAAGAAGTTGAGGAAACTGAAGCGGAGATA
TTATATAGAGCTGTACAGCCAAACCTTCCACAGTATATAATCGCTCTACTTAAAATTCTC
TTAGCCGCCACACCCACGTCAACGGCGAAAACATACTCCATGAATATAATGGCTGACGTG
TTGCCAGAAGAGATGCCAATGACCGTGCTACAATCACTCAAACTGGGTATAGATGTTAAT
AGACATAAAGAGATCATAGTTAAAGCCGTGACCGCCATCCTGCTGCTTCTGTTGAAACAT
TTCAAATTAAATCATATATATCAATTTGAATTCATGTCACAGAATCTAGTATTCTCTAAC
TGCATGCCTCTCGTTTTAAAATTCTTCAACCAGAACATCTTATCGTATATAGGAGCTAAG
AATTCAATACCCATATTTGATTTCCCCGCTTGTGTTATCGGCGAGCAACCGGAATTGACG
AGAGATTGTTTGGACATCGGAGACTCGTCAGTACCATACTCGTGGAGGAACGTGTTCTCA
TGCATAAACTTGTTGCGAATTCTCAACAAACTCACTAAGTGGAAGAACGCTAGAATCATG
ATGCTTGTCGTATTTAAGAGCGCTCCGATACTAAAACGTACACTCAAAGTCCGTCACGCT
CTCATGCAGTTCTATGTATTGAAGCTGCTCAAAATGCAAACGAGATATTTGGGGCGACAG
TGGAGGAAGACGAATATGAAGACTATTAGTGCCATTTACTCCAAAGTCCGGCATCGGTTG
AACGATGACTGGGCGTTCGGTAACGAGGTCGATGCTCGGCCATGGGATTTCCAGGACGAA
GAATGTGCATTGAGAGTAAGCGTAGAGAGATTCAATCAGAGACGGTATGGAAACGCCAGT
GAACTGGAAACTGAACTCACCCCGGTGGACACAGATATAAATAGTGTTCTCGATAGTAAT
ATAGAGCTGGACGAGGAATTCAAGTCTAACTATGAGTTGTGGCTGGAGCAAGAGGTGTAT
AACAATGAAATCAACTGGGACGTTCTGCTCTCCACATAA
Protein sequence:
MKKVLLLLWKILLVFLGGSKELKERKARLREKYGLPPCTEDTLEIIKGMRSSSPPPSAAD
MLENQNPGQRKLKENERALRRQTFMKQTSLDESDEQIFVDKEETSNGSEDYSMEFQSMSA
EGVQNANTNQNCPYYMYIKRLDSPPPPPPLPRSLPWRSKVRQKDIDTFLDNVRVKFVGYS
LPGDRQTIAGLPQPIHEGIEILKKHTYTSLAEVQAERELEIIRSPLTKGEKEVEETEAEI
LYRAVQPNLPQYIIALLKILLAATPTSTAKTYSMNIMADVLPEEMPMTVLQSLKLGIDVN
RHKEIIVKAVTAILLLLLKHFKLNHIYQFEFMSQNLVFSNCMPLVLKFFNQNILSYIGAK
NSIPIFDFPACVIGEQPELTRDCLDIGDSSVPYSWRNVFSCINLLRILNKLTKWKNARIM
MLVVFKSAPILKRTLKVRHALMQFYVLKLLKMQTRYLGRQWRKTNMKTISAIYSKVRHRL
NDDWAFGNEVDARPWDFQDEECALRVSVERFNQRRYGNASELETELTPVDTDINSVLDSN
IELDEEFKSNYELWLEQEVYNNEINWDVLLST
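
The fields in this record can be cross-checked against each other. The sketch below is illustrative and not part of the database: it assumes the genomic coordinates are 1-based and inclusive and that the standard genetic code (NCBI translation table 1) applies. The helper names `span_length` and `translate` are hypothetical. Under those assumptions the span 2992-4710 is 1719 bases long, which equals the listed CDS length and suggests the model has no introns, and 1719 bases correspond to 572 codons plus a stop codon, which matches the 572 residues of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons enumerated in
# T, C, A, G order. Assumed here; the record does not state which table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(start, end):
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def translate(dna):
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the nucleotide sequence listed in this record.
FIRST_60 = "ATGAAAAAAGTGCTTTTACTGCTCTGGAAGATATTACTAGTGTTTCTTGGTGGTTCTAAA"
LAST_39 = "AACAATGAAATCAACTGGGACGTTCTGCTCTCCACATAA"

cds_length = span_length(2992, 4710)   # 1719, same as the CDS Length field
protein_length = cds_length // 3 - 1   # 572, one codon is the stop

print(cds_length, protein_length)
print(translate(FIRST_60))  # MKKVLLLLWKILLVFLGGSK
print(translate(LAST_39))   # NNEINWDVLLST*
```

The two translated fragments agree with the start and end of the protein sequence listed above. Running `translate` on the full concatenated nucleotide sequence would extend the check to the whole record.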