DPGLEAN01153 in OGS1.0

Genomic Positionscaffold614:- 124052-130496
See gene structure
CDS Length1932
Paired RNAseq reads  833
Single RNAseq reads  3282
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA012065 (2e-92)
Best Drosophila hit  multiplexin, isoform K (1e-33)
Best Human hitcollagen alpha-1(XV) chain precursor (6e-23)
Best NR hit (blastp)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (2e-67)
Best NR hit (blastx)  PREDICTED: similar to ankyrin 2,3/unc44 [Strongylocentrotus purpuratus] (4e-68)
GeneOntology terms





  
GO:0005198 structural molecule activity
GO:0007155 cell adhesion
GO:0031012 extracellular matrix
GO:0005488 binding
GO:0040035 hermaphrodite genitalia development
GO:0030054 cell junction
GO:0005604 basement membrane
InterPro families

  
IPR000477 Reverse transcriptase
IPR008985 Concanavalin A-like lectin/glucanase
IPR003129 Laminin G, thrombospondin-type, N-terminal
Orthology groupMCL10014

Nucleotide sequence:

ATGGTCAGTTTCGATGTACAGTCATTATTCACTAGTATACCTGTTCTTGACTGCATTGAG
ATTGTAAGAGGTAAGTTAAAGGATAACAATATGCCTATAGAATATGCAGAGCTATTAAAG
CATTGCCTAACATCTGGCTACCTCATGTGGAAGGATGAATTCTACATACAAGTAGATGGA
GTTGCAATGGGTTCACCGGTTTCCCCCGTTGTCGCTGACATATTCATGGAGGACTTCGAG
GTGCGAGCCCTTTGCTCTCCTCCTATAAGACCTTTAATTTATAAACGGTATGTAGATGAC
ACCTTCACAATATTAAATAAAAATAAAACATCTGCTTTTCTGAACCATCTCAATTCTATC
AATAGTAAGATTCAGTGTACTATAGAATTGGAGGCAAATAATTCTTTAGCTTTCCTTGAT
ATACTTGTTGTTAGGAATCCTGACAATACTTTGGGACATACTGTTTATAGGAAACCCACA
CATACGGACAGGTACCTCAATGGTTACTCACACCACCACCCTATCCAGTTAGCTACCGTT
GGCAAATCTTTGTTACAGAGAGCCCAACATCTTTGTGATGCTGACCACCTAGAGGCCGAG
CTGCAGCATGTAAAACATGCTCTCACTATCAACAACCTGCCCGTGCCTCGCCAGCATCGC
AAGAAGCACCTGAAGCCACCCACAGTTGAACGACAACCTGCGATACTACCATATGTGAAG
GGAGTTACTGACAGAATAGGCAACATCTTGAAGAAGGTTTCCATTAAAACTATTTACAAA
CCACATAAGAAAGTGAGCCAATTCTTGAGACCAATCAAGAGTAACATTCCTTTACAACAA
GCGGGTGTATACAAACTCGACTGTGACTGTGTCTTGTCATACATTGGACAGACGAAGAGG
AGCATCGGTACAAGGGTTAAGGAACACATCTCAGATATCAAAAACAGGCGCGCGTCGAAG
TCAGCAGTGTGTGAACACACAATGGACAAACCAGGCCACTACATTCGTTTTGATAAACCT
CAAATCCTCGCTCGGGAAGACAAGTATATACCGAGATTAATTCGCGAGGCTATTGAAATT
AAAAAACATCCCAATTTCAATAGAGAAGATGGCTGGAATCTATCAAACACCTGGGACCCC
GTTCTTAAAAATATAAAATCCCATGTCCGAAACCACACCGCAGGACCTCAAGACACCGTG
AGCGCATTCTGCCGGCATCCAGAGCGGTACGCCAGAAAATTAAGAAATCGATGGCGGTAC
ATTTGGCTCACAATTGTATTATATTCAACAATATGTGCAGCCAACGACGGCATTTTTGGA
TCCAAATACCCTAACGAAATCCCGGAATACGACTTGCTACACGCGATTGGAGTTCCATTC
AGTAACCCAAAAACCCAGTATTTCGATGAAGGTCTCGACGGCTTTCCTGCATACGGCCTT
AAACCAGGCTCGGATATTAAATCACCATATAGGCTCTTCATGCCAGAGAAACTATATTCG
GAATTTTCAATAACCGCTACCGTACGCCCGGCCAACAAAGATGGCGGATTCCTTTTCTCC
GTTGTGAACCCTTTGGAGACTGTGGTCCAATTGGGAGTTCAATTAATACCTTCTGGGCCT
GGTTTGACAAACATTTCACTTCTCTATACAGATCCAAATATATATGCATTAAGTCAGACC
ATAGCATCGTTTGTCGTGCCATCATTTGCCAAAAAATGGAGTCGGTTTGCCCTCAAAGTA
ACCTCTGATAATGTTACTTTATTTTTGAACTGTCACGAGTTTGATACTTTAGTAGTTAAA
AGAAACCCTTTGGAGCTGGTGTTCGACTCGGCCTCTACTTTGTACGTCGGTCAGGCGGGA
CCCCTTATTACAGGTGCATTTCATGTAAGTATATTTTACATAAATAGCTACTTAAGAATA
TATTACATTTAG

Protein sequence:

MVSFDVQSLFTSIPVLDCIEIVRGKLKDNNMPIEYAELLKHCLTSGYLMWKDEFYIQVDG
VAMGSPVSPVVADIFMEDFEVRALCSPPIRPLIYKRYVDDTFTILNKNKTSAFLNHLNSI
NSKIQCTIELEANNSLAFLDILVVRNPDNTLGHTVYRKPTHTDRYLNGYSHHHPIQLATV
GKSLLQRAQHLCDADHLEAELQHVKHALTINNLPVPRQHRKKHLKPPTVERQPAILPYVK
GVTDRIGNILKKVSIKTIYKPHKKVSQFLRPIKSNIPLQQAGVYKLDCDCVLSYIGQTKR
SIGTRVKEHISDIKNRRASKSAVCEHTMDKPGHYIRFDKPQILAREDKYIPRLIREAIEI
KKHPNFNREDGWNLSNTWDPVLKNIKSHVRNHTAGPQDTVSAFCRHPERYARKLRNRWRY
IWLTIVLYSTICAANDGIFGSKYPNEIPEYDLLHAIGVPFSNPKTQYFDEGLDGFPAYGL
KPGSDIKSPYRLFMPEKLYSEFSITATVRPANKDGGFLFSVVNPLETVVQLGVQLIPSGP
GLTNISLLYTDPNIYALSQTIASFVVPSFAKKWSRFALKVTSDNVTLFLNCHEFDTLVVK
RNPLELVFDSASTLYVGQAGPLITGAFHVSIFYINSYLRIYYI