DPGLEAN11713 in OGS1.0

New model in OGS2.0DPOGS200485 
Genomic Positionscaffold5812:+ 322-4456
See gene structure
CDS Length3702
Paired RNAseq reads  168
Single RNAseq reads  423
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitND
Best Drosophila hit  ND
Best Human hitND
Best NR hit (blastp)  ND
Best NR hit (blastx)  viral A-type inclusion protein [Trichomonas vaginalis G3] (5e-22)
GeneOntology terms  ND
InterPro families  ND
Orthology groupMCL20855

Nucleotide sequence:

ATGTCTTTCGATCTCATCATGGAGCCGAATAAAGATGAACGAGTCAGAAAATCGCACAGA
GGACTATTACAAATGTTTAGACCTGGAGGATGTCTCAGCATTCACAACGATGAAGAAGAA
TACTCCTATATGCCGGGCACTTCTAACGAATTAAATAGAAGCTCATCTGGACAGGAAGAC
TTCCATGTTAAAACTACGAATGAGGCATTTTCTTTAAAAAAGAATGTTTTTTCTCCCTCT
TACACGACTACATTTAAAACACCTAAGCCATTTTTGGGAAAATGCAAACCTGGTGGCTGC
CTTGATCCACCGTTTGGTGAAGATAAGTACATTTACAAACCTACGTTCGACGACAACCAA
ATTGCAGAGAAATCTCCACACACAAATGATATAAATAAAATGCTGCTAGATTCAGATAAA
AATAAACCGGAATTAGGAATAAAAAAACAAACTTTGGATCAAGAAGTAGAAGATTTGCAT
GAATCTTACACCGATTTGACCGAGAATATTGAAGATTCAGAATCTCTTGTTACTTACCCA
TTACCAGTATCATCTAGTTATATTAAAAATAGGGACAGTCAAAGTAAAAGGGACAGATCT
AGAATTAACTCTGAACAAACATCTCAAAAATTAACCAATGAATTTCCTGTTTCATTTTTT
ATGCCTCCTAGAAAAGAGAAAACAAAACAATTTAAGACCTATGATGATTACAATGACAAG
GAAACTAAAGTAAAAGAGGCATATATTAACAATGAGTATGCAATGCAAAGTCAAACACCT
AGCGAAATGTTATCTGATTCATTGACAACGACACATCCATCGCGTGAAGAACCTTCACAA
GACGTAACACGGGAACAAGATACATCTGTGAAATCGATACTTTCATCGAGAAATGATGAT
ATTTCTAAGGAACCAGTAAAATCTGAAGGTTTAGATTATATTTCTGATATCAAAGCCTTA
AGAAAAGGTGAGGAAATATTTTCACAGGAACAAATTACTGCAAACGATAAATCCGGGGCA
GGAAATCAGAACATAGGATCTAAGTTGCCAGAAAGTCATATTGATACATCGTATCAATCA
GATAAAATTAATTCAATACAAAAGATAGAAATAGAAAACATAGTGAGTGAGGATAAAAAT
ATAGAACCTGAAAAAAAACACCCTTCAGCAAAGGTTAAGTTAAGCCGTGAAAAGGTAAAA
ATAAACTCACTTTCTGAATTAAACAATCATTCAGATAATTCAGAAATAAATAGTGTTGGT
GACAAACCTGTGGAAGGAGAAACTGGATTACGCAACGGAATTAATAATCTTTTAGACCCA
ATTGAAAAAGAGAAATTTAAGAAGAATTCTTTGAAATATGAGACTTCAAAAGATTTTGGA
GTAAACGTTGAGAAGCATGATTCAACAAACCTTATCAAGCCTAATTTAAAATTGAGCCAT
GAAGAATTAACTAAAACATCAATTGATGATCCAAGCAAAATTTTAACATACTCAAGTAGT
GAAAGCATTGCTGATAAAGACATTAAAATAGAAAATCAATTACGTGATGGAGTTGAAAGT
AATTTAGATCCAACGCAAAAGGAAATGTTTGAAAAAGATTCTCTTAAATACGAAGCTTTA
AGACATTCTAAGGAAATAATTACAAATGATGACACTATAAAAGTAAGAAAATCTAATGTA
AAATTAAGCAAGGAAGATTTAGCTGAAATATCTATGGATCAATTAAACAGGTTTTCAAAA
GATCTAAAGAATGAAAGTATAACTGACATCAACGTTAAAAGAAAATCTCAATTGCCTAAT
AAAACGAGTAGCATTTCAGATCCAATGCAAAAAGATGCTTTAAAATACGAAGCTGTACTA
GATTCTGGCCCAACAATCACAAATGAAGACCCAATACAAATCGAGAAAACTGATTTAAAG
TTAGGCCAGGAGGAAATAACAAAAATTTCAATTGATCAATTAAGCAAGCTCATTGACCAG
GCAGAAAATGAAATTATTTCTGACACAAGTGCTAAAAGTGAAATCCGATTAACTGATGAA
TTGAGTAATTTGTTACACCCAGCCACTAAAGAAGTATTTGAAAAAGACTCTTTAACATAC
AAAGATTTAAAAGATTCAAGTAAAATAACTTTACAAAGCAGCGACGGGGATGGCATTAGT
AATATTAAAAATTTTGTTCCCTTACAAAAGAAATCATTGGATTCTAATGACAATTCTGAA
GTAAATTTAAAAGAGCCAACTAAGAACCTAGACGAAATTATAACGTTGCCAACAGAAAAA
TTAGTATTTCAGAATAAAACAAGTGTAAATGAAGACTTTGAGAGACATTCTGAAAGTTCC
AATAAAATAAATAATAAAGTCTCAGATATCGCAATAAAATCATTGTTATCTAAAATATCA
CTTAATGAGGTAAATTCTCAGAAAACTGCTGACTTAAATTTGGAAAATCTTAGTAAGAAA
AGCTCAAAATCTATCGTATTTGAAAAACCACTAAGTGCTGGTTCTGATGATTTGAAAGAA
ACAGCTCTTGAAAATAAATCATCGACATTTAAAGACTTACCAACAAACAATATCGATTCT
AAAAGTATGTTAATATCTTACGAAGAAAATCACGACATAGATAATGATAAAAATGGAAGT
CTTCGTAACATATCATTAAAACAAGAAATAAATCAATCAAGTTCTCATCATTCAAGAGAC
AGTCAATTAAGTCTTTTAAAAAAGCTCAGTTCTCTACCACAAATTCAAAATTCAGAAGGC
ATACTGTACGATAATGTTCAAGATAAAAATACATTCGACAACTCCTCTCATGAATTTACT
GATTCTAATATTTTACGAACTGGCCAATACTTGAATAATACTCCAACAAAAATAAAAGAC
GGCGATAGAGAAGGGATTGATTTAAAACAAAATATCTTACTTCATAAAGGTCTCGACAAT
AATTTGGATGAATGGATAAGGAAGGATGAAAAGTCCTTTCAAAATTATGTTACAAATAAT
GTTAACGAAGATATTTTAAGTACATCTCAAACCAATGATTTCATTGAAGACAGCACTCAA
TTAGATAAAGGTCCGAGAGAAACAGTCGATCATGAATTAATGTCCCTCAAATCTGGTAAT
AGGGACCAAGGCATTAGTAAAGATTATGCAAAATCACAAGTTTATGACTCGTCAAGTGAC
ATAAAACACACAAGCTCTTCTATATTCCTCAATTTTGCCTTCCACTCGTTAGCAAACGAA
TTTGGCTACACCATTACAACATTAGATCCTACAACAGATTTAACTGTATACCCACAAAAT
AAAATAACTACCATTAAAACTGCTCTTAAAGAAAATGATAAAGAAATTAGAATAAGGATG
GATTCCGAAACCGATATTATAATTCAAATAAAAAGGAACCATAAGCGTAACGAAGGGTCT
AAAAGCATCGTATCTAATGAGGGAAGGAATTTGATGCGTGGATACTGTTCAGAAACAATA
ATCAATAAAGATCAATTTTTTAAAAATACTCTTAAAACAGTTTATGACACGTTGGTGCCG
ATAGAAAAAATAATATCCAATCTTAAGGAAGAAGCAGATGTGCTATATCGGGAACAATTA
TTACTGAGAAAAATTTTGTCATCGAGGGAAATGAAGTCTAAAAGAATTATCCGCACTAAT
AAAAATTGTAGCTGCCTAGAAAAGGAAATGGGAATCAAGTGA

Protein sequence:

MSFDLIMEPNKDERVRKSHRGLLQMFRPGGCLSIHNDEEEYSYMPGTSNELNRSSSGQED
FHVKTTNEAFSLKKNVFSPSYTTTFKTPKPFLGKCKPGGCLDPPFGEDKYIYKPTFDDNQ
IAEKSPHTNDINKMLLDSDKNKPELGIKKQTLDQEVEDLHESYTDLTENIEDSESLVTYP
LPVSSSYIKNRDSQSKRDRSRINSEQTSQKLTNEFPVSFFMPPRKEKTKQFKTYDDYNDK
ETKVKEAYINNEYAMQSQTPSEMLSDSLTTTHPSREEPSQDVTREQDTSVKSILSSRNDD
ISKEPVKSEGLDYISDIKALRKGEEIFSQEQITANDKSGAGNQNIGSKLPESHIDTSYQS
DKINSIQKIEIENIVSEDKNIEPEKKHPSAKVKLSREKVKINSLSELNNHSDNSEINSVG
DKPVEGETGLRNGINNLLDPIEKEKFKKNSLKYETSKDFGVNVEKHDSTNLIKPNLKLSH
EELTKTSIDDPSKILTYSSSESIADKDIKIENQLRDGVESNLDPTQKEMFEKDSLKYEAL
RHSKEIITNDDTIKVRKSNVKLSKEDLAEISMDQLNRFSKDLKNESITDINVKRKSQLPN
KTSSISDPMQKDALKYEAVLDSGPTITNEDPIQIEKTDLKLGQEEITKISIDQLSKLIDQ
AENEIISDTSAKSEIRLTDELSNLLHPATKEVFEKDSLTYKDLKDSSKITLQSSDGDGIS
NIKNFVPLQKKSLDSNDNSEVNLKEPTKNLDEIITLPTEKLVFQNKTSVNEDFERHSESS
NKINNKVSDIAIKSLLSKISLNEVNSQKTADLNLENLSKKSSKSIVFEKPLSAGSDDLKE
TALENKSSTFKDLPTNNIDSKSMLISYEENHDIDNDKNGSLRNISLKQEINQSSSHHSRD
SQLSLLKKLSSLPQIQNSEGILYDNVQDKNTFDNSSHEFTDSNILRTGQYLNNTPTKIKD
GDREGIDLKQNILLHKGLDNNLDEWIRKDEKSFQNYVTNNVNEDILSTSQTNDFIEDSTQ
LDKGPRETVDHELMSLKSGNRDQGISKDYAKSQVYDSSSDIKHTSSSIFLNFAFHSLANE
FGYTITTLDPTTDLTVYPQNKITTIKTALKENDKEIRIRMDSETDIIIQIKRNHKRNEGS
KSIVSNEGRNLMRGYCSETIINKDQFFKNTLKTVYDTLVPIEKIISNLKEEADVLYREQL
LLRKILSSREMKSKRIIRTNKNCSCLEKEMGIK