DPGLEAN14433 in OGS1.0

New model in OGS2.0  DPOGS201800
Genomic Position  scaffold12:+ 17198-47544
CDS Length  1911
Paired RNAseq reads  936
Single RNAseq reads  3347
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA013228 (2e-146)
Best Drosophila hit  CG32137, isoform B (3e-58)
Best Human hit  bicaudal D-related protein 1 (4e-17)
Best NR hit (blastp)  hypothetical protein TcasGA2_TC015685 [Tribolium castaneum] (9e-83)
Best NR hit (blastx)  AGAP001308-PA [Anopheles gambiae str. PEST] (6e-81)
GeneOntology terms
GO:0008150 biological_process
GO:0005575 cellular_component
GO:0003674 molecular_function
InterPro families  ND
Orthology group  MCL13317

Nucleotide sequence:

ATGGTGTGTGAAATTAAAACTCCATTAAAACATGCTGGCCCAGAGCCCCGTAAAGCACGA
CGCAGATGCTTCACAGTCCGCTGCCCCAAACCGATTCCCACAACTGTTTCCGTCACATGT
CTGAACAACTACTATTCGGTTAATAATTTATTGTTTATTTATCTGTCGCTGGAAATTATA
TATCTCGAGTGGTTTCCACACGCCGAGGACGCGGTTCGTGATATTTCGTGGCCTGATAGA
GACGTGATAACGTTCGAGTACTGGATTCACAAGGCGCAGTTTACTGTAAGGTTACTGGAA
ACAGAATGGGGTCGTTGTCATGAAAAAAGTTTAGTTCACACTGACGGATACAAAAGAATT
GCGAGAAGATGTACGCTATTGTTTGTTTACAAGAATCAAGAAAATAAGCCACTTTACATC
AACTCCATGGTTCACATATCCACGAGCATCCAGGTCACGTCGATGGTCCTTATCTCGGAG
TTGGAGCAAGAAAAGCATCTCCTGCGACGACGTTTGGACACCGAGCAAGGGGAATACGAA
GCCAGGCTTGTGGAACTACAGAATGACATCAAAGAACTCACCGCCAAGATCGACTCCAAG
GACAATTCAGTTAAACAAAGAGAAGAAGAAAAGACGGGTCTTATAGCAGAGTTGACAGCA
CAGAATTCTCGTCTAACTAATCAACTGAAGGAGTCATCTGCTGTCGAGGCGCAGCTCTTA
GCGCAACTGGAGTTGCTCAAAGATCAGTGCTCCATAAGAAAGACAAGCTTGCAAGACCAT
GTCCAAAGCTTGGAATCCCTCAAAGCTGAGCTGGCTCTCATGAGTGATAAAAGGGCGGAT
TTGGAAAGACGGCTGACCACATCGCTAAAAGACAAAGACAGCTTAACACAGCAATTAGAT
GAAGCTAACGACAGGATCTCAGCCCTGGAGAGGCAGTTGAAGGAACAGGAACATCTATAC
CAGAACACGCTCAAGGAGTTGGAGCGTCTACAGAGATCTCACGACACGCTGGCAGAAAGA
GTTGGATCTGATCCGGTGGAAATTACGAACACTCCGAGGTCCTTGCACGCGGAACTGGAA
TCGGAACCGGAAGAAGATGAGAACTGGCTAAGAACAGAGGCTGTTCAGGTCTTCAAGCAG
TTGAGGGCATTAGCCCTCCAACTGAACACGGGCCACGACGATGATTCCGTCACGTTCCTC
AAGGGTCCGCGCTCACTGGACTCTGCCGTATCTGTACCAGGAGTCAAACAGTTCTCTTCT
CCTTTCAAGCAGCCTTTGATTTCGTTTTACTTGAACAATGATCGTCTACATTCAGATCTA
TCTTTGTCGTCTCTCGATGGTGATGAAGGGGAGACTCTCCGTCGTGGAGCACTGTCCGCC
GCTTGTGCTGATGCCGTTGCAGCGTATGCAGCATTAGAGGGATCCAGAGTGAGGGACTCC
ATCGCCTCCCACGCGCGTCGTGCTATGGAGAGAGAGAGACAGATTGATGAAAAGAATGAG
ATCATAGCGGAACTGTCGTCCAAGCTGTCAGTGGCGGAAGTTGAACTGCGAGCGTCAGCT
GACGAGAGAGATAAGCTGCTGAACGACGCGACATACAGTAGCTTACAGCATGATGAAGCT
GTCACCAAAGCCAGGCAGGAGAGAGATGAAGCTATAGAGAGGAAAAAGGCCAGCGAGGTC
GCTCTGGCTAAGACACGCGTAGAATTGATGCAGGCTAACAGCCAGCTGTACGAGGCGGTG
AGACAGAAGATAGACCTGGGCCAACAGCTGGAGCAGTGGCAGATGGACATGCAGGAACTC
ATAGATGAACAGATGAAGCACAAACTGACGTCCCAGGAGAAACGCCGCAAACTCCCCCCG
CCGCGCGCACCGACTCGCACCGAGAGACTATTCGGGCTTTTTCACCGGTAA

Protein sequence:

MVCEIKTPLKHAGPEPRKARRRCFTVRCPKPIPTTVSVTCLNNYYSVNNLLFIYLSLEII
YLEWFPHAEDAVRDISWPDRDVITFEYWIHKAQFTVRLLETEWGRCHEKSLVHTDGYKRI
ARRCTLLFVYKNQENKPLYINSMVHISTSIQVTSMVLISELEQEKHLLRRRLDTEQGEYE
ARLVELQNDIKELTAKIDSKDNSVKQREEEKTGLIAELTAQNSRLTNQLKESSAVEAQLL
AQLELLKDQCSIRKTSLQDHVQSLESLKAELALMSDKRADLERRLTTSLKDKDSLTQQLD
EANDRISALERQLKEQEHLYQNTLKELERLQRSHDTLAERVGSDPVEITNTPRSLHAELE
SEPEEDENWLRTEAVQVFKQLRALALQLNTGHDDDSVTFLKGPRSLDSAVSVPGVKQFSS
PFKQPLISFYLNNDRLHSDLSLSSLDGDEGETLRRGALSAACADAVAAYAALEGSRVRDS
IASHARRAMERERQIDEKNEIIAELSSKLSVAEVELRASADERDKLLNDATYSSLQHDEA
VTKARQERDEAIERKKASEVALAKTRVELMQANSQLYEAVRQKIDLGQQLEQWQMDMQEL
IDEQMKHKLTSQEKRRKLPPPRAPTRTERLFGLFHR
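
Consistency check (sketch):

The record lists a CDS length of 1911 nt, which corresponds to 637 codons: 636 amino acids plus a stop codon. The following is a minimal sketch of how that relationship could be checked, assuming the standard genetic code applies to this nuclear gene. The helper names `translate` and `expected_protein_length` are hypothetical and not part of the database. Only the first and last lines of the nucleotide sequence above are used as input.

```python
from itertools import product

# Standard genetic code (assumed here), with codons ordered by the
# bases T, C, A, G at each position; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; whitespace is ignored."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int, has_stop: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length."""
    codons, remainder = divmod(cds_length, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop else codons


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGTGTGTGAAATTAAAACTCCATTAAAACATGCTGGCCCAGAGCCCCGTAAAGCACGA"
last_line = "CCGCGCGCACCGACTCGCACCGAGAGACTATTCGGGCTTTTTCACCGGTAA"

print(translate(first_line))          # start of the protein
print(translate(last_line))           # end of the protein, then '*'
print(expected_protein_length(1911))  # amino acids expected from the CDS
```

The translated first line matches the start of the protein sequence (MVCEIKTPLKHAGPEPRKAR), and the translated last line matches its end (PRAPTRTERLFGLFHR) followed by the stop codon.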