DPGLEAN01123 in OGS1.0

New model in OGS2.0DPOGS210234 
Genomic Positionscaffold42:- 9053-12576
See gene structure
CDS Length1449
Paired RNAseq reads  271
Single RNAseq reads  724
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA002373 (1e-17)
Best Drosophila hit  CG32343, isoform C (5e-25)
Best Human hitGA-binding protein subunit beta-1 isoform beta 1 (1e-29)
Best NR hit (blastp)  AGAP006384-PA [Anopheles gambiae str. PEST] (3e-55)
Best NR hit (blastx)  AGAP006384-PA [Anopheles gambiae str. PEST] (1e-36)
GeneOntology terms

  
GO:0005575 cellular_component
GO:0003674 molecular_function
GO:0008150 biological_process
InterPro families
  
IPR020683 Ankyrin repeat-containing domain
IPR002110 Ankyrin repeat
Orthology groupMCL16379

Nucleotide sequence:

ATGACTTCAGATCTCTGTGAAGAAGATGTAGTCGTTCGATTATCCTCACCTCATATAATA
TCATCAGAGACGATAGTACCAGCTGCCAGTAATGTTGGCGGTGGACGAATACAGACCGGA
GGAGTGGAGCTGGGTCGCAGACTGCTCTTAGCAGCCAGAGCAGGAGATACCGCTACTGTA
CTTGATCTCATGGCCAAAGGTGCACCATTTACCACTGACTGGCTGGGTACATCACCGCTG
CACCTGGCTGCTGCCAACAACCATGTGGAGACATGCGGTGTATTACTGAGGGCGGGTGTG
TCTCGGGATGCTCGGACTAAAGTTGAACGAACACCGCTGCACCTGGCCGCACATGCTGGG
CATGCCGCTGTAGTTGCACTGCTGCTCGACCATGGAGCTATGGTGGACTGTCGCGACATG
CTCCACATGACGCCGCTGCACTGGGCGAGTGCTCGAGGTCACGTGGCCGTGGTCCGCGAG
CTAGTGTGTCGCGGCGCGGATTTGCTCGCTCGCTGCAAGTTCAGGAAGACGCCGCGCTGC
CTCGCCGTCCGCGCCGGGGCCAGTGACGTCATGGCTGTCCTCGACCAAGCTGCCAAGGAA
CACGACCGACCCACAGTGACTGAGGAAACGCCAAAGATTCAACATTTTGAAACAATCCAA
AGACTACAGGAGGTCAGACAGCAGACCAAAACCAAGCCTCCGGAGAAGACTATCGTAATA
GAATCTAAGACTGAGCCGGCGTCGGGTCTGTCCGGGGCGGCGTTACTCCGCGCACACGGC
ATCACTCTCCTACCCCGGGACCGCGGCTCCACTGTACTCAGCGCACTGAGGAGCGGACGG
ACCGTCGTACTGTCCGATGCCGGGAAGCTGATGTTGAAGGAGAGCACCAACGCCCCGGTG
ATGGTCAGCGCCACCAGCGCCTCTGTGGACGCGAGCAACAACACAGCCAGCAACAGTCAG
TCAAGCTTGCCCACAACTAACATAGTGACCAGTTCAAACATCACCGACGCTAAAGGGGTC
ATGGTCCGAGCGAGGACTCTCAACACCATCAAGGGCGTCAAAGGCTTGCAAATGCTCTCC
GTCAACAGATCCGACCACACTGTTAAGAAGGTCATCAGTTCACATGACTTGCAGAAAGTT
AAATTACTCGGCGTGAAAGAGAACAAGTCACCCCGCCGTCCAGCTCTCAAGATCCTTCTC
AACAAAGCCAACCTCACACGACTACTAGCCAACACCACTAACGCTTCTACCACCAACAAC
ACACAGATATCGATCGAGCCTTCCGGCGAGCTGAGCGAGTCGCCGGTTCAAAGTGACGCG
GTGATGGAGGACGCGTCGGAATCGTCTCTGAGGGTTCAACTGCAACAAGCGCACGCCGCC
CTGGCCAGCCTGGCCGCAGAGTTACGACACTGTAAGGCTAAACTGGCCAAATACGAACAC
ACGCACTGA

Protein sequence:

MTSDLCEEDVVVRLSSPHIISSETIVPAASNVGGGRIQTGGVELGRRLLLAARAGDTATV
LDLMAKGAPFTTDWLGTSPLHLAAANNHVETCGVLLRAGVSRDARTKVERTPLHLAAHAG
HAAVVALLLDHGAMVDCRDMLHMTPLHWASARGHVAVVRELVCRGADLLARCKFRKTPRC
LAVRAGASDVMAVLDQAAKEHDRPTVTEETPKIQHFETIQRLQEVRQQTKTKPPEKTIVI
ESKTEPASGLSGAALLRAHGITLLPRDRGSTVLSALRSGRTVVLSDAGKLMLKESTNAPV
MVSATSASVDASNNTASNSQSSLPTTNIVTSSNITDAKGVMVRARTLNTIKGVKGLQMLS
VNRSDHTVKKVISSHDLQKVKLLGVKENKSPRRPALKILLNKANLTRLLANTTNASTTNN
TQISIEPSGELSESPVQSDAVMEDASESSLRVQLQQAHAALASLAAELRHCKAKLAKYEH
TH