DPGLEAN04164 in OGS1.0

New model in OGS2.0DPOGS211368 
Genomic Positionscaffold5140:- 4838-10194
See gene structure
CDS Length1977
Paired RNAseq reads  34
Single RNAseq reads  93
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008366 (2e-153)
Best Drosophila hit  ND
Best Human hithypothetical protein LOC26074 isoform 1 (7e-56)
Best NR hit (blastp)  PREDICTED: hypothetical protein [Strongylocentrotus purpuratus] (6e-71)
Best NR hit (blastx)  PREDICTED: similar to Uncharacterized protein C20orf26 [Apis mellifera] (8e-63)
GeneOntology terms  ND
InterPro families  ND
Orthology groupMCL15015

Nucleotide sequence:

ATGTCGAAGGTTTACGTGAATAACAGTATTGTTATAGTTGGTGCAAGTCGTACTGGTCTG
GCGTTCCTGGAAACACTGCTTCTTGGCCCAACCTCCCCATATCTAACATTCACCCACATA
ACTTTGGTTTCTCACCACGGCCTGCCGACTGTAGCCGACAGTGCTCATGCTGCGGAAATA
TGTGTCCCAAGAGAAGGAAGATACACCGACAGATATCTTAAAAGTGTGCCGTTTTATTAC
TATGTTGATGTCATGTCAGCTGTCATGACACAGATCGATAGGAAGAAAAAATGCATACAT
TTTAAAGGCGGTGGCGTTAAATTTTACGATGAACTGGTATTGACTTGCGGTCAACAGTTC
CAACATCCAGAATATTTGAAAGACTCGATTACATTGGCTCAGGAAGTTGAACAAGGCAAG
CCATGCAATAGGATTCTTATGGACAATCCAAAATACGAGCCAGATCACGTACCACCTTCA
CCTGATATTCCTAACAATGTAATGCTTATCAATTCATTGTTCGAGGCCAACACTTGTTTG
AGAAACTTAAAAAGATTGATCTCTGAAACTAAAAATACATGCCAGTGTTTGAGTACGGAG
AATCAAGTGGTTGTTTACGGGGACTGTATAGAAGCATACAGTTGTATTTCAGCGTTAATT
GAATTCGGCATTTCGCCAAAAATGATAACATTCGTTGAGCCGTTTCCCCCGCAGGACAGG
ACTTCCTTAAGGGTCAACTGTTTCAATGATGAAACTGTGGATGAGCGAGTACAGGCCAGT
ATAGAGAAGCTGGGTATCCGTACATACCGCCAATGTCACTATGACGGCTGGTCCCAGCGA
GGCCCTCGGGTTGAGTTTCTACGTCTTATGTCTCCACTACACGCTATACATTTACCATGC
TTCGCACTCTTTTATTACGGAATTAAAGCCATCGATTTAAACGCCTTCAAAGCTATAAAT
GAGAGCGGTTTGGTATATGACGGGGGTCTGGTGGTTGGTCCGAAGTTCGACACAAACGAC
CCGTGCGTTTGGGCCGCCGGGCCGTGTGTGAGGTACTCAAGGAGACTGTACGCGCCCAAG
GAACTACATAAATATTACTACTCTGAAGATGTAGGAGAAGAGCTAGCAAGACAATTTTTG
AAGAAACTAAACCCATTCGAAGTTGCCAAATCACATATGGAAAGTATGTCCTCTGACACA
TTACACGCGCCCAGCTCTAGCCTGTTCAGATTTCAGAGTTCTCATGCAAGCATCTCTAGT
AGAACATCGTTCTGCAGCGCTAAGACAAAGTGGCAGCCAGTAATGAAATTCGACTCCCCG
ATAGTGGTGACAGCTACGTTACCAGGTGAGCTGCACTACATGAAGCTGAGAAAACCAGGC
GAAGATGTACCAATGGCCGTGCAACTCACACTGCCTTCTCAGGGGCACACGCTAATAACG
GACAAGCGACAGAATTATTTCAGGCTGCAAATGAACTCATTACACTGCATCGAGTCCATC
ACTTGTTTGTCGAAGAGGCAGTTCAGTTGCGAGACGCTCGCGCAGCTTTATGGAAGACAC
GAGGCCTTCTTCAACAATCTGCTTACCAGATTTAAGATGAATCTAATCGATGATCTGTAC
TCGTATTTCTCTAAGACATGGACCGCGGCTTTATATCAGGAACCGTTCGTTAACTTGTTG
CAAGATATTTACGATCACGGCGGTAACACAGTCTACGACGTGGTACAGACGAAGTACAAT
GAAATAAACGATAAAGATGTGACTGAACCAATAAAGGATATTACTTACGAAGACGTGATG
ATGGCGATGGACTGTAAAGAATGCGGTCACAATAGAACCTTACGTCAGGAAACCAGAATC
TTTTGGAACTCGATCGGCGGTGAAGACATTGTAAACACTCACCTCGCGCGATATCTTCAC
AAGAACATAATCACCAACCCACATTACGCTATGCCTGACCCTGAGTATCTTAAATAA

Protein sequence:

MSKVYVNNSIVIVGASRTGLAFLETLLLGPTSPYLTFTHITLVSHHGLPTVADSAHAAEI
CVPREGRYTDRYLKSVPFYYYVDVMSAVMTQIDRKKKCIHFKGGGVKFYDELVLTCGQQF
QHPEYLKDSITLAQEVEQGKPCNRILMDNPKYEPDHVPPSPDIPNNVMLINSLFEANTCL
RNLKRLISETKNTCQCLSTENQVVVYGDCIEAYSCISALIEFGISPKMITFVEPFPPQDR
TSLRVNCFNDETVDERVQASIEKLGIRTYRQCHYDGWSQRGPRVEFLRLMSPLHAIHLPC
FALFYYGIKAIDLNAFKAINESGLVYDGGLVVGPKFDTNDPCVWAAGPCVRYSRRLYAPK
ELHKYYYSEDVGEELARQFLKKLNPFEVAKSHMESMSSDTLHAPSSSLFRFQSSHASISS
RTSFCSAKTKWQPVMKFDSPIVVTATLPGELHYMKLRKPGEDVPMAVQLTLPSQGHTLIT
DKRQNYFRLQMNSLHCIESITCLSKRQFSCETLAQLYGRHEAFFNNLLTRFKMNLIDDLY
SYFSKTWTAALYQEPFVNLLQDIYDHGGNTVYDVVQTKYNEINDKDVTEPIKDITYEDVM
MAMDCKECGHNRTLRQETRIFWNSIGGEDIVNTHLARYLHKNIITNPHYAMPDPEYLK