DPGLEAN10283 in OGS1.0

New model in OGS2.0  DPOGS202527
Genomic Position  scaffold563:+ 106210-111475
CDS Length  1539
Paired RNAseq reads  3133
Single RNAseq reads  8435
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA001551 (0.0)
Best Drosophila hit  ubiquilin (5e-122)
Best Human hit  ubiquilin-1 isoform 1 (1e-111)
Best NR hit (blastp)  ubiquilin 1,2 [Aedes aegypti] (0.0)
Best NR hit (blastx)  ubiquilin 1,2 [Aedes aegypti] (2e-162)
GeneOntology terms  GO:0042982 amyloid precursor protein metabolic process
InterPro families
IPR000626 Ubiquitin
IPR000449 Ubiquitin-associated/translation elongation factor EF1B, N-terminal
IPR009060 UBA-like
IPR006636 Heat shock chaperonin-binding
IPR015940 Ubiquitin-associated/translation elongation factor EF1B, N-terminal, eukaryote
IPR015496 Ubiquilin
IPR019955 Ubiquitin supergroup
Orthology group  MCL12413

Nucleotide sequence:

ATGGCAGAAGGCCAGGAGGAACCTAAAAAGATTACAATTACTGTAAAAACACCAAAAGAA
AAGCAGCAAGTTGAAATCGAAGAAGATGCAGATATCAAAAAACTCAAAGAAGTGTTGTCC
CCTAAATTTAACGCGGAGCCCGAACAGCTATGTTTAATTTTTGCCGGAAAAATTATGAAC
GATTCAGATACTATGAAGCAACATAACATCAAAGATGGGTTGACAGTTCATCTTGTTATC
AAGACTCCTCCAAGACCTGAACCGGAAGGTGGAACACGGCGCCCTCCAGCTGATATTGGT
GCTACACCTTTTGGACTGAACTCTCTTGGGGGTCTAGCAGGCTTAGAAAGCCTTGGTCTA
GGCCAAAGCACTTTTATGGACCTGCAAGCTCGTATGCAACAAGAGCTTTTGTCGAACCCT
GATATGTTACGACAAGTGCTGGATAACCCACTCGTTCAGCAAATGATGAACGATCCTGAG
AATATGCGAACCCTTATTACATCCAACCCGCAGATGCAAGATTTGATGGCTAGGAACCCT
GAAATTAGTCATATGTTGAACAACCCTGAACTGTTACGACAAACAATGGAATTGGCACGC
AATCCTGCCATGCTTCAAGAGTTGATGAGGTCCCATGACCGAGCTTTGTCCAACTTGGAG
AGTATACCTGGTGGTTACAATGCTTTGCAGCGAATGTATCGAGACATCCAAGAACCAATG
TTGAATGTAGCCAGTAGCATGGCTGGAAATCCATTCTCTGGACTAGTAGACAATTCAGAT
GGCACCAATCCCCAACAGGGGGCAGAGAACCGTCAGCCCCTTCCAAACCCTTGGCAGCGT
GGAGGTTCTAATGCATCTAGCACACCAAACACAGGCCCAGGCCTTATCAATACACCTGGC
ATGCAGTCATTGCTACAACAGATGTCTGAAAATCCTCGTCTTGTACAATCAATGCTATCA
GCACCATACACTAATAGTATGCTACAAGCTCTCGCTGCCGACCCGGAGATGGCATCTCAA
CTTATTAACCAGAATCCCATGTTTGCCAATAATCCACAACTGCAAGAACAGATTCGTACT
ATGATGCCACAAATGCTAGCCCAGCTGCAGAATCCAGAAATGCAACAGATGATGTCTAAT
CCACAGGCGCTGAATGCCCTACTTCAGATCCAGCAGGGTATGGAACAATTGCGAGCGGCG
GCACCAAGTCTGGTCAATAATATGGGCTTCGGAGCAGCCGCTGCCACTGCCGCCCCACCC
CCACCTCCCACTACTAACACACCGCCAGCACAAGCGAGACAACAACAGAACTCTGAGCTG
TTCACACAGTTCATGCAAAGAATGGTATCGGCGATGGCCAACAACCAGACCAACACTCAG
CAACCCCCGGAACAACGCTACTCACAACAGCTAGAGCAACTTGCAGCCATGGGTTTCCTC
AACAGGGAGGCTAATTTACAAGCACTGATCGCAACATTTGGTGACGTGAACGCGGCAGTT
GAAAGGCTACTGGCTCTAGGTCAACTGTCCATGAGCTAA

Protein sequence:

MAEGQEEPKKITITVKTPKEKQQVEIEEDADIKKLKEVLSPKFNAEPEQLCLIFAGKIMN
DSDTMKQHNIKDGLTVHLVIKTPPRPEPEGGTRRPPADIGATPFGLNSLGGLAGLESLGL
GQSTFMDLQARMQQELLSNPDMLRQVLDNPLVQQMMNDPENMRTLITSNPQMQDLMARNP
EISHMLNNPELLRQTMELARNPAMLQELMRSHDRALSNLESIPGGYNALQRMYRDIQEPM
LNVASSMAGNPFSGLVDNSDGTNPQQGAENRQPLPNPWQRGGSNASSTPNTGPGLINTPG
MQSLLQQMSENPRLVQSMLSAPYTNSMLQALAADPEMASQLINQNPMFANNPQLQEQIRT
MMPQMLAQLQNPEMQQMMSNPQALNALLQIQQGMEQLRAAAPSLVNNMGFGAAAATAAPP
PPPTTNTPPAQARQQQNSELFTQFMQRMVSAMANNQTNTQQPPEQRYSQQLEQLAAMGFL
NREANLQALIATFGDVNAAVERLLALGQLSMS
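
The record's fields can be cross-checked against each other: a CDS Length of 1539 corresponds to 512 residues plus one stop codon (3 x 513 = 1539), and the nucleotide sequence should translate to the listed protein. The following is a minimal sketch of such a check, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene. The names `translate` and `expected_cds_length` are hypothetical helpers written for this illustration and are not part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered over T, C, A, G.
# Assumption: the record does not state which code was used for translation.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; stop codons are returned as '*'."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Ambiguous bases such as N are not handled and raise KeyError.
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(n_residues: int) -> int:
    """CDS length in nucleotides for a protein of n_residues plus one stop codon."""
    return 3 * (n_residues + 1)


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGCAGAAGGCCAGGAGGAACCTAAAAAGATTACAATTACTGTAAAAACACCAAAAGAA"
last_line = "GAAAGGCTACTGGCTCTAGGTCAACTGTCCATGAGCTAA"

print(translate(first_line))     # MAEGQEEPKKITITVKTPKE
print(translate(last_line))      # ERLLALGQLSMS*
print(expected_cds_length(512))  # 1539
```

Both translated fragments match the start and end of the protein sequence above, and the trailing `*` is the TAA stop codon that the protein listing omits.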