New model in OGS2.0 | DPOGS209660  |
---|---|
Genomic Position | scaffold5575:+ 27-6508 |
CDS Length | 1845 |
Paired RNAseq reads   | 306 |
Single RNAseq reads   | 1132 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA008144 (8e-99) |
Best Drosophila hit   | CG6236, isoform A (3e-105) |
Best Human hit | ND |
Best NR hit (blastp)   | AGAP003596-PA [Anopheles gambiae str. PEST] (1e-119) |
Best NR hit (blastx)   | conserved hypothetical protein [Culex quinquefasciatus] (8e-118) |
GeneOntology terms | GO:0003824 catalytic activity; GO:0008152 metabolic process |
InterPro families | IPR004245 Protein of unknown function DUF229; IPR017850 Alkaline-phosphatase-like, core domain |
Orthology group | MCL13108 |
Nucleotide sequence:
ATGAGTCTCTCTTATGGTTTCCAAAGGGAGGAGAAGCTGGACATTACGAGACGGTATATG
GATAGACAATTAGAACAATCCCTGAAACTGGACAAGGGTTCCGGATGTGAGATACCCAGG
TTAGATCCATTCCCGAAAGAAGTCACGCAGTTTGATAAGGATATACCCAAGCAAACCAAA
TGTAGGGTAGTTCCTAAAATATTAAATACCACCGATAACATAGTGTGTTCATATCAAGAC
ATAATATATGAGAGTGACCAAAAATATACAATAGGACCTCCCGTAGAGGTCAGAGGGGAT
AACGAATATGTTCTCACTAAGAGCGATCACGTGAAGATTAAGTGTTCGGGAAAACATAGA
GACAGTATACTCCCCTCCAAATGGATCGGCCACTCTCTTGGCTTGCGTTCTACTGGCTAT
GCAAAACTCTCCCCGCAGGGAAGAGGTGACTCCTTGAACGTTCTCATCCTGGGATTCGAC
TCCACCGCCAAAAACGGTTTCATACGAAGAATGCCGAAAAGCTATAAAGTTTTAAAGGAA
ATATTGGGAGCTACGATTTTGAATGGGTACAATATAGTAGGCGATGGCACGCCGGCTGCC
TTATTCCCGATCCTGACGGGGAAGACAGAGCTGGAGCTGCCGGATGTGAGGAAGAAGATG
AAGAACAACAGAACCTTGGACTCCATGCCCTTCATATTCTATAAGTTGAAAGATGAAGGT
TACCGAACAGCATTCTTTGAAGACATGCCCTGGATAGGTACATTTCAGTACAGATTCAAT
GGTTTCAAAAAGCAACCTGCGGATCATTACTTGCTGGCGTTTTACATGGAGGAGTCGAAC
GGTGGCAAGAAGTGGTGGACGAGCAGCCAGAACAAATACTGCGTGGGAGACACGCCGCAG
TATAGACTGATGTTGGATATTACGGATCAGTTTCTCCGTCTGGATGGAAAACGTTTTGCT
TTCACGTTTATAGTTGACATATCCCATGATGATTTCAACATGATATCCATCGCTGACGAT
GATACCGCAGATTATCTTAGAAGGTTCCACGACCGCTATAGAGAGGACACCTTGTTGATT
GTCATGGGGGACCATGGACCAAGGTACGCAAACGTTCGAGATACTCTTCAAGGGAAACTC
GAAGAGAGATTACCGCTCATGGCGATCAGACTACCAGACAAACTGACGAAGACCAGAACA
GAGGCGGAGAAGAATCTGAGGAACAACGCGGAAGTGTTGACGACACCTCACGACATATAC
GCCACGGTCTTAGATGTTCTGGACCTGACTCAGTTCACTAATCCCTACAAAGTTAAAGGA
GCCGACCTAACCAGAGGACTTAGTCTTTTGGAACCGATACCAAAGAACAGGTCGTGTAGC
GAGGCCGGTGTGGAGGCTCATTGGTGTTCCTGTCTGTCCTGGCAGAACGTCTCTGACGAT
GACGTCATGTTCAGTAGGACGGCCGCCGCGCTGGTCGACTTCATCAATCATCTCACTGAG
GAGAGGCGGTCGGTTTGCGCGGTGCGCACGCTCAAGTCGGTGTCGTGGGTGATGCGAGCG
CGGCCCAACAGCGGTGTACTGACCTTCGTCGAGGCTCGCGATCAAGACGGATATGTCGGC
AAGTTTGGTAACAGAGTGAAACAGACCAGGGAAAACTACCAGCTCAAGATCGCAGTGGGA
CCCGGCCATGGTATATATGAGGCGTTAGTGACTTACGTCATTACTGAGGATAGATTTGAA
ATCAATACGAGAGAAATATCACGGACTAACGCTTACAACAACGAGCCGAGCTGCATCAGC
GACACTCACCCGCACCTCAACATGTACTGCTACTGTCGTCACTAG
Protein sequence:
MSLSYGFQREEKLDITRRYMDRQLEQSLKLDKGSGCEIPRLDPFPKEVTQFDKDIPKQTK
CRVVPKILNTTDNIVCSYQDIIYESDQKYTIGPPVEVRGDNEYVLTKSDHVKIKCSGKHR
DSILPSKWIGHSLGLRSTGYAKLSPQGRGDSLNVLILGFDSTAKNGFIRRMPKSYKVLKE
ILGATILNGYNIVGDGTPAALFPILTGKTELELPDVRKKMKNNRTLDSMPFIFYKLKDEG
YRTAFFEDMPWIGTFQYRFNGFKKQPADHYLLAFYMEESNGGKKWWTSSQNKYCVGDTPQ
YRLMLDITDQFLRLDGKRFAFTFIVDISHDDFNMISIADDDTADYLRRFHDRYREDTLLI
VMGDHGPRYANVRDTLQGKLEERLPLMAIRLPDKLTKTRTEAEKNLRNNAEVLTTPHDIY
ATVLDVLDLTQFTNPYKVKGADLTRGLSLLEPIPKNRSCSEAGVEAHWCSCLSWQNVSDD
DVMFSRTAAALVDFINHLTEERRSVCAVRTLKSVSWVMRARPNSGVLTFVEARDQDGYVG
KFGNRVKQTRENYQLKIAVGPGHGIYEALVTYVITEDRFEINTREISRTNAYNNEPSCIS
DTHPHLNMYCYCRH