DPGLEAN21052 in OGS1.0

New model in OGS2.0  DPOGS210715
Genomic Position  scaffold2067:+ 571-5557
CDS Length  2028
Paired RNAseq reads  5082
Single RNAseq reads  12636
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA006327 (5e-23)
Best Drosophila hit  multiple ankyrin repeats single KH domain, isoform B (0.0)
Best Human hit  ankyrin repeat domain-containing protein 17 isoform a (0.0)
Best NR hit (blastp)  PREDICTED: similar to ankyrin repeat domain protein 17 isoform a [Apis mellifera] (0.0)
Best NR hit (blastx)  PREDICTED: similar to ankyrin repeat domain protein 17 isoform a [Apis mellifera] (0.0)
GeneOntology terms
GO:0005634 nucleus
GO:0003723 RNA binding
GO:0044419 interspecies interaction between organisms
GO:0005737 cytoplasm
InterPro families
IPR002110 Ankyrin repeat
IPR020683 Ankyrin repeat-containing domain
Orthology group  MCL10438

Nucleotide sequence:

ATGGAGGCCGCGAGTGCCGGTCACGTGGACATCGTGAGGCTGCTGGTCGCTCACGGCGCC
GACGTCAACGCTGTCTCGGGCTCCGGGAACACGCCCCTCATGTACGCCTGCGCCGGCGGA
CACGAGGACTGCGTGCGGGCGCTGCTCGATAACGGGGCCAATGTAGAAGATCACAACGAA
AACGGTCACACGCCGCTCATGGAGGCCGCATCAGCCGGTCACGTGGGCGTCGCGAAGATC
TTGCTGGAGCACGGCGCCGGCATCAACACGCACTCCAACGAGTTCAAGGAGTCCGCCCTC
ACGCTCGCATGCTACAAGGGTCACCTGGACATGGTCAGGTTCCTGTTGGCGGCCGGCGCC
GACCGCGAGCACAAGACTGACGAGATGCACACCGCCCTCATGGAGGCCAGCATGGACGGA
CACGTCGAGGTCGCCCGGCTGCTGTTGGACTCTGGAGCACAGGTTAACATGCCGACGGAC
AGTTTCGAGTCTCCGCTGACCCTGGCGGCGTGCGGGGGACACGTGGAGCTGGCTATGTTG
CTGTTGGAGAGAGGCGCCAACATAGAAGAAGTCAACGACGAGGGATACACGCCGCTCATG
GAGGCAGCTAGGGAAGGTCACGAGGAGATGGTGGCGCTGCTGCTCGGTCAGGGCGCGTCC
ATCAACGCTCAGACCGACGAGACGCAGGAGACGGCCCTCACCCTGGCCTGCTGCGGCGGC
TTCCTCGAGGTGGCGGACTTCCTCATCAAGGCGGGGGCGGATGCGGAACTGGGAGCTTCC
ACGCCGCTCATGGAGGCCTCGCAGGAAGGACACCTGGAGCTCGTACGATACCTGCTGCAA
GCCGGCGCGGAGGTCCACGCTCAGACGCAGACGGGCGACACGGCGTTGACGTACGCGTGC
GAGAACGGACACACGGACGTGGCGGACGTGCTGCTGCGGGCCGGGGCGCTGCTGGAGCAC
GAGAGCGAGGGAGGCAGGACGCCGCTCATGAAGGCCTGTCGCGCCGGACATCTCTGTACC
GTGCAGTTCCTCGTGGGCAAGGGTGCTGACGTGAACCGCATGACGGCCAACGGGGATCAC
ACGCCGCTGTCGCTGGCGTGCGCCGGCGGACATGCGGACGTGGTGAAGTTCCTGCTGGCG
TGCGACGCCGACCCCTTCCGCAAGCTCAAGGACAACTCTAGCACACTCATCGAGGCGGCC
AAGGGCGGACACACCACCGTCGTGCAGCTGCTGCTAGACTACCCCCACTCCCTCATGTTG
CCCAGAGGTAACACGGGTACGGAGGAGAGCGGGGGTCTGAGTTCCGCACAGGCGGCGGCG
CTCGGCCTGAGTCACGCCCCGGCGCCGGGCGCGCCCAGCCAGCGAGCGCTGCTCCCCGCG
CACGCACCCCCCTCGCACCCTCACGCACATGCCCACCCTCACGCGCACCCCCCGCCGCAC
GCGCATCCCTCGCACGCCGCTCACCCCGCGCATCCTGCTCACCCCGCCCTCCCCGCGGCC
GCGCCGCAGCAGGACGTGCCTCCCAACTTCGCCAAAGTCTATTTGGACGGAAGAAAGAAA
CAGGCGAGCGGCAACGGCACGGTCCAGCCGGGCGTCCCCGCGCACCCCCCGGCCGGGGCG
GCGGGCGCGGGAGGGGCCGGCAAGCACAAGTGCGGCCGCAAGCAGCGTCCCGCCGCGCCG
CACTCCGACCACCACCTGCCGCCGCCGCCCGACATACTGGAGGACCATATGTGCAACCAC
GACGTGGTGCACAAGCATAAGCTATCCCTCCCGCCCGGCTTTACTTGGAAGGATGTTAAC
AAGAAATTTAAAAATAAAAACAAAGTCGAAAAAACATGCAATAGGGCGGAGGCGCTCGCC
GGTCAGCCGAGGAATGAATCACTGCCACCGACTGAGAGGACGATGCTGGAACTAGCCGAC
GCCTCCGGTCCACCGACTCTCGCCTCGTCAGCTCTGCACGCCCTCGACCTGCAGTACTGC
GCCCAAGGCAAGTCGCCGTTCACAATCATATACGTGTCTAACAAGTGA

Protein sequence:

MEAASAGHVDIVRLLVAHGADVNAVSGSGNTPLMYACAGGHEDCVRALLDNGANVEDHNE
NGHTPLMEAASAGHVGVAKILLEHGAGINTHSNEFKESALTLACYKGHLDMVRFLLAAGA
DREHKTDEMHTALMEASMDGHVEVARLLLDSGAQVNMPTDSFESPLTLAACGGHVELAML
LLERGANIEEVNDEGYTPLMEAAREGHEEMVALLLGQGASINAQTDETQETALTLACCGG
FLEVADFLIKAGADAELGASTPLMEASQEGHLELVRYLLQAGAEVHAQTQTGDTALTYAC
ENGHTDVADVLLRAGALLEHESEGGRTPLMKACRAGHLCTVQFLVGKGADVNRMTANGDH
TPLSLACAGGHADVVKFLLACDADPFRKLKDNSSTLIEAAKGGHTTVVQLLLDYPHSLML
PRGNTGTEESGGLSSAQAAALGLSHAPAPGAPSQRALLPAHAPPSHPHAHAHPHAHPPPH
AHPSHAAHPAHPAHPALPAAAPQQDVPPNFAKVYLDGRKKQASGNGTVQPGVPAHPPAGA
AGAGGAGKHKCGRKQRPAAPHSDHHLPPPPDILEDHMCNHDVVHKHKLSLPPGFTWKDVN
KKFKNKNKVEKTCNRAEALAGQPRNESLPPTERTMLELADASGPPTLASSALHALDLQYC
AQGKSPFTIIYVSNK
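
Consistency check (illustrative sketch):

The record lists a CDS length of 2028 nt and a protein sequence of 675 residues, which fits a coding sequence of 676 codons where the last codon is a stop. The Python sketch below shows one way to check this kind of agreement. It assumes the standard genetic code applies to this nuclear gene; the helper names translate and expected_protein_length are hypothetical and not part of the database. Only the first and last lines of the nucleotide sequence above are used as input.

```python
from itertools import product

# Standard genetic code (assumed here), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_protein_length(cds_length: int, includes_stop: bool = True) -> int:
    """Number of residues implied by a CDS length in nucleotides."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    return codons - 1 if includes_stop else codons


# First and last lines of the nucleotide sequence in this record.
first_line = "ATGGAGGCCGCGAGTGCCGGTCACGTGGACATCGTGAGGCTGCTGGTCGCTCACGGCGCC"
last_line = "GCCCAAGGCAAGTCGCCGTTCACAATCATATACGTGTCTAACAAGTGA"

print(translate(first_line))          # MEAASAGHVDIVRLLVAHGA
print(translate(last_line))           # AQGKSPFTIIYVSNK*
print(expected_protein_length(2028))  # 675
```

The translated first line matches the start of the protein sequence, and the last line translates to its final 15 residues followed by a stop codon. Translating the last line on its own works because the 33 preceding lines hold 60 nt each (1980 nt), which keeps it in frame.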