New model in OGS2.0 | DPOGS210375  |
---|---|
Genomic Position | scaffold1126:+ 57-4202 |
See gene structure | |
CDS Length | 1839 |
Paired RNAseq reads   | 362 |
Single RNAseq reads   | 928 |
Migratory profiles | Query via corresponding ESTs |
Best Bmobyx hit | BGIBMGA011941 (7e-176) |
Best Drosophila hit   | CG2938 (8e-166) |
Best Human hit | CAS1 domain-containing protein 1 precursor (3e-105) |
Best NR hit (blastp)   | AGAP001402-PA [Anopheles gambiae str. PEST] (0.0) |
Best NR hit (blastx)   | AGAP001402-PA [Anopheles gambiae str. PEST] (0.0) |
GeneOntology terms    | GO:0016021 integral to membrane GO:0016020 membrane |
InterPro families   | IPR012419 Cas1p-like |
Orthology group | MCL15530 |
Nucleotide sequence:
CAATATTCCGACAGGCCTCCATCTGTGATAGTGGCTAGCATCGGTCTCAATTTGGTGAAG
ATCCACAACGCTACGGAACCTATACTAGAGGAATATAAACGGAACCTCACACAGCTGGTC
CAGCCGATAGATTCCTTATCGGGGAGAGGAACCCAGGTGTTGTGGAAGTTGTTGGAGGAC
GTTGATCAGAAGACGGTCAAGATCAGCAACAGTGATATAGACGCTTACAACAGAGCTGCC
ATGGAGATCCTCCAACACAGCGCCACCAAGATATGGAACTCCGCCCGCCTGGCCGGAGCC
CCGGGCGCCGGGCCGGGGCTGCAGCACACGGCTCAGATCCTCCTCAACATGTTCTGCAAC
GACCACATGAACTTCAACGACGGCACTTGCTGCGCCCAACCCGAGCCCTGCACACAACTA
CAGTTACTCACATTTGCGTTGTTCCTGCTCTGCGCAGTACTGGCCTGCGGACGATGGTTG
TGGAAGTGGTCGCAGGGCATCAAGCAGCGCATGGAAGGTTACGCTCTCGTCAACGCAGTA
CACAACGAAACGCCATCAGCCATGGTGGCTATGGCGAAGTTGGGCATGATTATGGCCTAC
TTCTATCTATGTGATAGAACTAATTTCTTCATGAAGGAAAACAAATATTATTCTGAATGG
AGTTTTTGGCTACCCGTCGGCTATGTGTTCGCGTTGGGGCTGTTCTTTACCGATGAATCT
AGATCCAGCAGGGTGCTCCATCGCGAACAGACGAACGAATGGAAGGGCTGGATGCAGCTG
GTGATTCTGGTGTACCAGGTCACAGGTGCCAGCAAGGTCCTGCCGATATACATGATGGTG
AGGGCGCTCGTGTCCTCGTACCTGTTCCTGACCGGATATGGTCACTTCTACTACACGTGG
AAGACCGGGGACACGGGCCTCGTGAGATACTTCAGGGTTATCTTCAGACTGAACTTCCTG
ACCGTGGTCCTCTGTCTGACCATGAACAGACCGTATCAGTTCTACAGCTTCATACCGCTG
GTGTCGTTTTGGTACACCTTGATGTTCGCGATTTTCTCGCTGCCTCCTCAACTCTCTCCG
CCTCATACCCTGGAGCCTTACCAGCCTGTGTACACAGTTATAAAGACCCTGGGCCTGCTG
GCGATGGTGACCGTGCTGTACATGAGCGAAGTGTTCTTCCAGAAGATCTTCCTCATGAGA
CCCTGGAAGGCGCTGTTTGTGAACTCCGACGACGACATCCGACAGTGGTGGCTGGACTGG
AAACAGGACCGCTACTCGATGGCGTACGGCATAATATTCGCGGCGGCTTACCTTTTAGCG
CAGAAGTATAGCTTACTGGACGACAACAACCACAGCAACCTGTTCACGCCGGGCATCGCG
TTGACCGCTACCCTGCTGGCGTTCATCGCGCTCGGAAGTTACATAACGTTCACATTTTTC
TGCACCAACACATTCGACTGCAACGAGATACACTCCTACGTGACCTTTCTGCCCATCATC
GGGTACATCATATTGAGGAACGTGTCCGGCGTGCTCCGCACGAGACATTCGAGTCTTTTC
GCGTGGTTTGGGACCATAACGCTCGAACTGTTCGCCAGTCAGTCCCATATCTGGTTGGCC
GCCGATACTCACGGCGTGTTGGTCCTAGTTCCCGGCGTGCCCGTCTTCAATCTGATCCTG
ACCTCGTATATTTTCATATTCACCGCCCACGAAATACATAAATTAACAGGAATCATTCTC
CCCTACGCCGTTCCGGACGACTGGCGGCTAGTTTTAAGGAATTTTGCTATTTTCCTAGCG
ATTTTGGTACCAATTGGCATCCACGATGGTATGTTTTAA
Protein sequence:
QYSDRPPSVIVASIGLNLVKIHNATEPILEEYKRNLTQLVQPIDSLSGRGTQVLWKLLED
VDQKTVKISNSDIDAYNRAAMEILQHSATKIWNSARLAGAPGAGPGLQHTAQILLNMFCN
DHMNFNDGTCCAQPEPCTQLQLLTFALFLLCAVLACGRWLWKWSQGIKQRMEGYALVNAV
HNETPSAMVAMAKLGMIMAYFYLCDRTNFFMKENKYYSEWSFWLPVGYVFALGLFFTDES
RSSRVLHREQTNEWKGWMQLVILVYQVTGASKVLPIYMMVRALVSSYLFLTGYGHFYYTW
KTGDTGLVRYFRVIFRLNFLTVVLCLTMNRPYQFYSFIPLVSFWYTLMFAIFSLPPQLSP
PHTLEPYQPVYTVIKTLGLLAMVTVLYMSEVFFQKIFLMRPWKALFVNSDDDIRQWWLDW
KQDRYSMAYGIIFAAAYLLAQKYSLLDDNNHSNLFTPGIALTATLLAFIALGSYITFTFF
CTNTFDCNEIHSYVTFLPIIGYIILRNVSGVLRTRHSSLFAWFGTITLELFASQSHIWLA
ADTHGVLVLVPGVPVFNLILTSYIFIFTAHEIHKLTGIILPYAVPDDWRLVLRNFAIFLA
ILVPIGIHDGMF