DPGLEAN18116 in OGS1.0

New model in OGS2.0  DPOGS210274
Genomic Position  scaffold274:+ 73587-84224
CDS Length  1563
Paired RNAseq reads  11
Single RNAseq reads  55
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA000027 (7e-06)
Best Drosophila hit  ND
Best Human hit  ND
Best NR hit (blastp)  allergen sol i III (2e-09)
Best NR hit (blastx)  Chain A, Crystal Structure Of The Major Allergen From Fire Ant Venom, Sol I 3 (2e-09)
GeneOntology terms  ND
InterPro families
  IPR014044 CAP domain
  IPR001283 Allergen V5/Tpx-1-related
Orthology group  ND

Nucleotide sequence:

ATGTGCACGGATTATAATCCGAACTATACAATGGGCCCAACATGCGCGGGTGTAGAAAAT
GCGACTATGACGGAGGACAATGCGGCTCTGATACTAGATTTGATCAACAGTATAAGAAGC
AGAGCAGCTCGTGGTCTGGCCATGGGCTATGAGAAAGAATTACTCCCAAGAGCCTATGGA
ATGTACAGAGTCGAATGGGATCCAGAATTGGCCACATTGGCCCAAGTGTGGGCTAATCAA
TGTGTTTTAGAACGAGACAATTGTCGAGCAACTAAAAATTTTCCCGATCCCGGACAACAG
GCTTCGATAGCTCGCTTTGTAACGGACAAATGGATACCTATAAGTAAAACGAAAGACAAA
ACTTATAATGAATCATCTGGTTTTAATTCACACAAGGTAATCCGTCTGGTTTGCAATTTT
TCTTCAAGAGTTTACGACGATCGTGGGATATATAACGTCACTGCTCCAACCACATCAGAG
TTCACTCCCCAATGTGGCTGTCCGCCAGGATACGACGAGGATTCCTGGTGTTTGTGCTAC
AAAAGCGAAAAGAATAAAAAAAATCAGTACAGAAGCTCTTTGCCAATAAAAAATATTAAT
AAAAAGAATAATATTGTTGAAAACAAATTCCATAAGAAAGATACAGCTGAAGCTAGCATC
ATGATGCAACAAAAACGCCCAAGCCCTGCTAGCAAATCAATACAAGATGACGGAAATCTA
TTAAGATTAATACATAAATTAGAAAGTGAAGCTAAACATATAAATTTAAATGAACATGAT
AGAAAGATACTTAATAACGAAATTCGTGAATTATACAACGTCGTCAAAAACACGTATTAT
ATATTGAGAGACAATAAAAAAGTTTACATTGATATCGGGACTTCCAAAAGGAATGAAATT
TTCACTAATGATTTGAAAAATAGATACTTCAATAACCAAACTGGAGATAATAAGGTTATT
ACGATAAGAGAATTTCCTACGTCTACAAACAACTTCCAATATAGCACTAATTATAAAAAC
TCATTGCCCGGAAATGCATTACGAACACTCACATATTACTCTGGTAAACACGTCCACAAA
ACTAACGAAAAAATCAAAGACATGGACGAAAATTTAGGAAAACATTCAAAATCTGATGGA
CGCAAATTATCGCTGACGAAAAAAATGTATTACCAGAAAAGAATTAATGACATAAAAAGG
AAACTTACTCTGAAATATAATTATCATAATAAAAGCCAGACCCAATTAAATTATAATAAT
AATCTATACGAAAAAGAAAGCACCGAGAAAGACAAAGCCGAATTTCCAATTTTCTTAAGA
AGAAGAAATAAGAAGAAGCATCATGGCAACAACAATAACGACCTACAACCAAAAAGTTCT
AGAAAGAACGAGAAAAGAAAAAAATTAAATAAAAAACGTAAACCAAAAGCCAAATACAAC
AAGAAATTCATAAAAACAACTTTAAAAGACATCGACAGGGATTCCAATATGTCCGGTTCG
AAAGACAGTGATAGCAAAAACAAACCCGTCGACATTATTGTTCATATTAAAATGAACGAA
TAA

Protein sequence:

MCTDYNPNYTMGPTCAGVENATMTEDNAALILDLINSIRSRAARGLAMGYEKELLPRAYG
MYRVEWDPELATLAQVWANQCVLERDNCRATKNFPDPGQQASIARFVTDKWIPISKTKDK
TYNESSGFNSHKVIRLVCNFSSRVYDDRGIYNVTAPTTSEFTPQCGCPPGYDEDSWCLCY
KSEKNKKNQYRSSLPIKNINKKNNIVENKFHKKDTAEASIMMQQKRPSPASKSIQDDGNL
LRLIHKLESEAKHINLNEHDRKILNNEIRELYNVVKNTYYILRDNKKVYIDIGTSKRNEI
FTNDLKNRYFNNQTGDNKVITIREFPTSTNNFQYSTNYKNSLPGNALRTLTYYSGKHVHK
TNEKIKDMDENLGKHSKSDGRKLSLTKKMYYQKRINDIKRKLTLKYNYHNKSQTQLNYNN
NLYEKESTEKDKAEFPIFLRRRNKKKHHGNNNNDLQPKSSRKNEKRKKLNKKRKPKAKYN
KKFIKTTLKDIDRDSNMSGSKDSDSKNKPVDIIVHIKMNE