DPGLEAN16173 in OGS1.0

New model in OGS2.0DPOGS210375 
Genomic Positionscaffold1126:+ 57-4202
See gene structure
CDS Length1839
Paired RNAseq reads  362
Single RNAseq reads  928
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011941 (7e-176)
Best Drosophila hit  CG2938 (8e-166)
Best Human hitCAS1 domain-containing protein 1 precursor (3e-105)
Best NR hit (blastp)  AGAP001402-PA [Anopheles gambiae str. PEST] (0.0)
Best NR hit (blastx)  AGAP001402-PA [Anopheles gambiae str. PEST] (0.0)
GeneOntology terms
  
GO:0016021 integral to membrane
GO:0016020 membrane
InterPro families  IPR012419 Cas1p-like
Orthology groupMCL15530

Nucleotide sequence:

CAATATTCCGACAGGCCTCCATCTGTGATAGTGGCTAGCATCGGTCTCAATTTGGTGAAG
ATCCACAACGCTACGGAACCTATACTAGAGGAATATAAACGGAACCTCACACAGCTGGTC
CAGCCGATAGATTCCTTATCGGGGAGAGGAACCCAGGTGTTGTGGAAGTTGTTGGAGGAC
GTTGATCAGAAGACGGTCAAGATCAGCAACAGTGATATAGACGCTTACAACAGAGCTGCC
ATGGAGATCCTCCAACACAGCGCCACCAAGATATGGAACTCCGCCCGCCTGGCCGGAGCC
CCGGGCGCCGGGCCGGGGCTGCAGCACACGGCTCAGATCCTCCTCAACATGTTCTGCAAC
GACCACATGAACTTCAACGACGGCACTTGCTGCGCCCAACCCGAGCCCTGCACACAACTA
CAGTTACTCACATTTGCGTTGTTCCTGCTCTGCGCAGTACTGGCCTGCGGACGATGGTTG
TGGAAGTGGTCGCAGGGCATCAAGCAGCGCATGGAAGGTTACGCTCTCGTCAACGCAGTA
CACAACGAAACGCCATCAGCCATGGTGGCTATGGCGAAGTTGGGCATGATTATGGCCTAC
TTCTATCTATGTGATAGAACTAATTTCTTCATGAAGGAAAACAAATATTATTCTGAATGG
AGTTTTTGGCTACCCGTCGGCTATGTGTTCGCGTTGGGGCTGTTCTTTACCGATGAATCT
AGATCCAGCAGGGTGCTCCATCGCGAACAGACGAACGAATGGAAGGGCTGGATGCAGCTG
GTGATTCTGGTGTACCAGGTCACAGGTGCCAGCAAGGTCCTGCCGATATACATGATGGTG
AGGGCGCTCGTGTCCTCGTACCTGTTCCTGACCGGATATGGTCACTTCTACTACACGTGG
AAGACCGGGGACACGGGCCTCGTGAGATACTTCAGGGTTATCTTCAGACTGAACTTCCTG
ACCGTGGTCCTCTGTCTGACCATGAACAGACCGTATCAGTTCTACAGCTTCATACCGCTG
GTGTCGTTTTGGTACACCTTGATGTTCGCGATTTTCTCGCTGCCTCCTCAACTCTCTCCG
CCTCATACCCTGGAGCCTTACCAGCCTGTGTACACAGTTATAAAGACCCTGGGCCTGCTG
GCGATGGTGACCGTGCTGTACATGAGCGAAGTGTTCTTCCAGAAGATCTTCCTCATGAGA
CCCTGGAAGGCGCTGTTTGTGAACTCCGACGACGACATCCGACAGTGGTGGCTGGACTGG
AAACAGGACCGCTACTCGATGGCGTACGGCATAATATTCGCGGCGGCTTACCTTTTAGCG
CAGAAGTATAGCTTACTGGACGACAACAACCACAGCAACCTGTTCACGCCGGGCATCGCG
TTGACCGCTACCCTGCTGGCGTTCATCGCGCTCGGAAGTTACATAACGTTCACATTTTTC
TGCACCAACACATTCGACTGCAACGAGATACACTCCTACGTGACCTTTCTGCCCATCATC
GGGTACATCATATTGAGGAACGTGTCCGGCGTGCTCCGCACGAGACATTCGAGTCTTTTC
GCGTGGTTTGGGACCATAACGCTCGAACTGTTCGCCAGTCAGTCCCATATCTGGTTGGCC
GCCGATACTCACGGCGTGTTGGTCCTAGTTCCCGGCGTGCCCGTCTTCAATCTGATCCTG
ACCTCGTATATTTTCATATTCACCGCCCACGAAATACATAAATTAACAGGAATCATTCTC
CCCTACGCCGTTCCGGACGACTGGCGGCTAGTTTTAAGGAATTTTGCTATTTTCCTAGCG
ATTTTGGTACCAATTGGCATCCACGATGGTATGTTTTAA

Protein sequence:

QYSDRPPSVIVASIGLNLVKIHNATEPILEEYKRNLTQLVQPIDSLSGRGTQVLWKLLED
VDQKTVKISNSDIDAYNRAAMEILQHSATKIWNSARLAGAPGAGPGLQHTAQILLNMFCN
DHMNFNDGTCCAQPEPCTQLQLLTFALFLLCAVLACGRWLWKWSQGIKQRMEGYALVNAV
HNETPSAMVAMAKLGMIMAYFYLCDRTNFFMKENKYYSEWSFWLPVGYVFALGLFFTDES
RSSRVLHREQTNEWKGWMQLVILVYQVTGASKVLPIYMMVRALVSSYLFLTGYGHFYYTW
KTGDTGLVRYFRVIFRLNFLTVVLCLTMNRPYQFYSFIPLVSFWYTLMFAIFSLPPQLSP
PHTLEPYQPVYTVIKTLGLLAMVTVLYMSEVFFQKIFLMRPWKALFVNSDDDIRQWWLDW
KQDRYSMAYGIIFAAAYLLAQKYSLLDDNNHSNLFTPGIALTATLLAFIALGSYITFTFF
CTNTFDCNEIHSYVTFLPIIGYIILRNVSGVLRTRHSSLFAWFGTITLELFASQSHIWLA
ADTHGVLVLVPGVPVFNLILTSYIFIFTAHEIHKLTGIILPYAVPDDWRLVLRNFAIFLA
ILVPIGIHDGMF