DPGLEAN19734 in OGS1.0

New model in OGS2.0DPOGS212980 
Genomic Positionscaffold3038:- 440-6357
See gene structure
CDS Length1473
Paired RNAseq reads  564
Single RNAseq reads  1487
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA011623 (5e-165)
Best Drosophila hit  ND
Best Human hitarmadillo repeat-containing protein 8 isoform 2 (4e-100)
Best NR hit (blastp)  hypothetical protein AaeL_AAEL012608 [Aedes aegypti] (7e-141)
Best NR hit (blastx)  hypothetical protein AaeL_AAEL012608 [Aedes aegypti] (1e-118)
GeneOntology terms  GO:0005488 binding
InterPro families


  
IPR016024 Armadillo-type fold
IPR000225 Armadillo
IPR011989 Armadillo-like helical
IPR000357 HEAT
Orthology groupMCL16676

Nucleotide sequence:

ATGCAACAGCTCACAATATTCATGGATATCGAGAGTTCACGTTCATATATAGACGAGATT
TATTCGTCGAATCCAGAGAAACTATTGGAGGCTCTGGTCACTTTGAAGAATTCAGTGATT
GGCAGTAACAGACAAAAGAGTTCGGTGATACAACAAGGAATAGTTCCTAGACTGTTGCAG
CTTATGACGGACGATGGCCTCGATCCGAATATAAGGCTTGAAGCAACGATCACAATTGGA
TCCCTGGCCAAAGGAACACCGGAAAATGTGGCGTCACTAGTTGAGCAAGGGACAACTATA
GTTCTTGTGGAGTTGTTGAAAGTAGTGCCGACTGGTACTAAGCTAGCAGAAGCATGTCTG
TGTGCACTCAGGAGTATATTTCAACATCCACCAGCACCAATTGGTGCCCTCCCTGCTGAT
ATGAGACTGCTGGGAAGATTAACAGTAATAGCCAGGGAGGGATCTCTGACGGCGCGTGCG
TGCGTGGTTCGCATCCTGTCAATATGGTGCGGAGGTCCCCTGGAACAGGAGGCTTTATGT
GCGGCGGGGGCATGTGCGGCGGTGGCAGCCCTGCTGGCGGCCAGGCCAGATGCCGCACCA
CCCATGGCTAGAGCGCTGCCGGCCTTGGATCTTTTAGCTGCCATGTGCTTTGAAAATGCC
AGTGTTTCGCAGGTCGCACTCACAACCAGACACGGCGACAAAACAATTCCTGAGTTATTA
ATGGCATTGGTATCAAGAGATAAACCATTGCCGGTGGCTATGGGTGCAGCTAGATGTCTC
ACATTCATACATCGAGCGGGTGCATTAGGAGCTGATGATAATAGGGTGGTATTTGGAGCG
TTGCCTTGTTTGGCGAGATTGTGTACAAAGGATATGCCCGAAGACATCAGAGCGACGGCT
GCTGAGACTTTGGCTTATTTGGCAGAAGTGGACACATCCCTTCAAAGGCTGGCGGCTATA
TCGAATCACCTGATGAGTTCGTTAGCTGATATAGTCACGTGTTCGTCGTCAGCGGCCAAG
CAGGGTGCCTTCAAGTGTTTCGCATCACTGGGAGCCAACGATGAAGACATACGGAAGAAG
ATCATAGAAACCCACAGCATCATGGTCCATGTGGTGAATGGGATGAACAATCAAGAGGCG
TCGGTGAGGCTGGCGGCCGTGAGATGTCTCCATTCACTGTCAAGGTCAGTTCAGCAGTTA
CGGACGACATTTCAGGATCACGAAGTTTGGCGTCCATTAATGTTCCTCTTGAACGACTCC
CCGGGAACCGAACTCCTGACCGTTGGATCTTCGACGCTATGCAATCTGCTCCTTGAGTTC
TCTCCCGCCAAAGAACCCATGTTAGACCAAGGTGCCGTTGAGATGTTGTGTGGTCTCACC
AGGCGACCGGAGGCAGCGCTGAGGCTCAACGGGATATGGGCGCTCATGAATATGGCCTTC
CAGGTAAAAAATCATTTATTTTATAGATTATAA

Protein sequence:

MQQLTIFMDIESSRSYIDEIYSSNPEKLLEALVTLKNSVIGSNRQKSSVIQQGIVPRLLQ
LMTDDGLDPNIRLEATITIGSLAKGTPENVASLVEQGTTIVLVELLKVVPTGTKLAEACL
CALRSIFQHPPAPIGALPADMRLLGRLTVIAREGSLTARACVVRILSIWCGGPLEQEALC
AAGACAAVAALLAARPDAAPPMARALPALDLLAAMCFENASVSQVALTTRHGDKTIPELL
MALVSRDKPLPVAMGAARCLTFIHRAGALGADDNRVVFGALPCLARLCTKDMPEDIRATA
AETLAYLAEVDTSLQRLAAISNHLMSSLADIVTCSSSAAKQGAFKCFASLGANDEDIRKK
IIETHSIMVHVVNGMNNQEASVRLAAVRCLHSLSRSVQQLRTTFQDHEVWRPLMFLLNDS
PGTELLTVGSSTLCNLLLEFSPAKEPMLDQGAVEMLCGLTRRPEAALRLNGIWALMNMAF
QVKNHLFYRL