DPGLEAN14409 in OGS1.0

New model in OGS2.0  DPOGS207308
Genomic Position  scaffold1127:- 1689-18765
CDS Length  2784
Paired RNAseq reads  1227
Single RNAseq reads  3055
Migratory profiles  Query via corresponding ESTs
Best Bombyx hit  BGIBMGA012091 (4e-41)
Best Drosophila hit  multiplexin, isoform E (3e-47)
Best Human hit  collagen alpha-1(XVIII) chain isoform 1 precursor (4e-31)
Best NR hit (blastp)  multiplexin, isoform M [Drosophila melanogaster] (3e-71)
Best NR hit (blastx)  PREDICTED: similar to collagen alpha 1(xviii) chain [Tribolium castaneum] (1e-87)
Gene Ontology terms
GO:0005581 collagen
GO:0007155 cell adhesion
GO:0005198 structural molecule activity
GO:0031012 extracellular matrix
GO:0005488 binding
GO:0008045 motor axon guidance
InterPro families
IPR010515 Collagenase NC10/endostatin
IPR008160 Collagen triple helix repeat
IPR016187 C-type lectin fold
IPR016186 C-type lectin-like
Orthology group  MCL40713

Nucleotide sequence:

ATGGTGATAGCTGAACCAGACAACTACGATATCCTGAGTCTAGTCCGTGCGAATGTTTTG
ACCGATTTTATAGATATAGTCAAAGGCACAGATGTGTATGGCGCCATTAAACTTGTAAAA
AACGAATTAATTACTATTAAACTGGATCAATTCCCAGATCCAATAAACCATCTAGCAACT
CCTTTTGAAATATATGCCTTAGTGAAGTTGAACGTGGATGTGACGTCATGTTTGTTTCAA
ATCATATCGAATAAAGAAAACAAACTCAGTCTATGTTTTACACCTGAAGGAGAAGATTTA
ATTAGAATTACATTAAATGGTAGCGATCTACCAGAAAATGGAATATCTTTCCATTACTTG
ATAGAAGATTATAATGCATTTGTAAATATAATCTTAGCTGTGAATGATAAAAATGTGGAA
TTTTACTCTAACTGTGAAAAAATTGAAACCCAATATTTCGATTCCGACTACACCATCGAA
AACATAAATCTCGAAAAAGATTCTATACTACATTTTGGCAAATTGACCGAAGAAAGTAAT
TTATTTGAGGATTTCACAAAGGCAGAAAACCTCGAAGTGAATACGTTTATTGATTTTGAT
TCAAGCGAAAAAATATCAACCAACTCCCTGTTTGATAGCACTGAAGAAACTGTTGTTAAA
GGAGAAAAAGGTGATAAGGGGGACAAGGGCGAGAAAGGCGATCGAGGAGACAAGGGTGAA
CGAGGTGAATCTGTCATGGGTGAACGTGGTCCAATTGGTCCTGATGGAGCTCCTGGAACA
CCGGGTGTGATGGGAAAAGAAGGCTCCTGCAAATGTTCAGAAGCTATTGTGTCAGACTTA
CTACTAAAAATGCCAGAAATGAGAGGACCTCCAGGTGACTACGGGCTGAAAGGCGATAGA
GGTGAAAAGGGGGTGAAAGGAGATAGCGGATTACCAGGAAAAGATGGTAGAGATGGTAAT
GAGGGCGATCCCGGTATACAAGGTCCTCCTGGAACACCAGGTCTTGTTCGTAAGGAAATA
GTAGAGACAAAAGTGCCAGTTGTTGGAGAAAAAGGAGAAAGAGGGCCCGTTGGACCACCT
GGTACTCCTGGTAGAGACGGCTTAAGAGGAGAAAAAGGAGACAAAGGTGAACCGGGTCTC
ATGGGACTACCTGCAAAACTATCATCGATATTAGACGAGGACATCGATCCTAATGAAGAA
AAGGCTATCGTCGAAAAATTCAGAGGATATAAAGGGGCAAGTGGTCCTGAAGGACCGAAG
GGTGAAAAAGGGGATACAGGAGCAATTGGTCCTCAAGGTGAAACTGGCAGAGATGGTATT
CAGGGTCCCCCAGGAAAACATGGACATAAAGGAGAAACTGGCAAAGATGGATCAAAGGGT
GACAAAGGAGAACCAGGAATACCCGGTCCTCCTGGTACTGTGCCATCATCTCAAATAAGT
CTCATGAAAGGACCGAAAGGTGACCGTGGTCCACCAGGTCAGACAGGTCCTCGAGGACCA
ACGGGACATCATGGAAAAGTGGGCCCCATAGGACCACCGGGTAAAAGCCACAAGGGAGAG
CCTGGGAAACCAGGTCCTATGGGACCCAAAGGAGAAAAGGGTGCTACTGGACCTAGAGGA
GAAAAAGGTGAAGGGTTGTCGCCCAGTGATATCGAGAGGTTAAAAGGACATAAAGGTGAC
AGAGGTGAAATTGGTTTACCTGGTGAAGCTGGAAAGCCTGGGTTGCCGGGGACTTGTGGC
GAATGTGTTCGCGTATCAATCCCGGGCCCATCTGGACCACCGGGACCTCCGGGTCCATCA
GGTCCTCCTGGAGTCTCTATCATCGGTCCTAAAGGAGAACCTGGTGGATTAGTAACTAAG
AAATCATTTTTTGCATTCAATGACATTCATCATGAGAGCACAGATGAAGACGATGATTTT
TATACAGCAGCGACTGTCATTTTCAAAACAACTACCGGTCTTCTTAAGAGAACTACTGAC
ACCCCTCTGGGGACGCTGGCATATATATTACAAGAGAAAATATTATTAATGCGGGTTGAA
AATGGATGGCAATACGTTGTGATGGGTTCTTTTTTGCAAACAAGGGAATCACATACCAGC
ACAACATTCAGACCAACGTACTATTCATCAACTCCATCAAGTCCACCCTCTTCAGATGAA
ACGACAGAGAATAATGAAGATAATTACATACGTTTGGTCGCCTTAAACCAAGCATATGCA
GGAAATATACTTATGGCAAACAATAGAACTGGGCGTAATGCTGCTGACCAGGAATGTTAC
CGACAAGCTTATATACATAATTTTAAAAGCACTTTTGCAGCCTTCCTAGCTACTAGGGTT
GAAGATCTAAGATTTATTGTAAAAAGAAAACGAGACAGATATGTTCCGGTAGTCAACTTG
TACGGACAAGTTCTTTTCGATTCCTGGGCGAGCATGTTTAATGGTTCAGGAGCACTGTTT
GCAAAATCAAGTATTTACAGCTTTAATGGAAAAAATGTTCAGATTGATACTACTTGGCCT
TTAAAAGCTGTATGGCATGGCAGCAACTCTTTTGGCACAGTTTTATCAAGAGCAAATTGC
AATGAATGGACGAGTGACAGTCCGCTGAACGTTGGCGCGGCCTCCCTACTATATACCCAT
AGACTATTAGAGGAAGAACAACATACCTGTGATAAAAAACTAATTGTACTCTGTGTCGAA
GTTACATCAAATTCATACAAGCTTAGAAGACATTCACACAGTACAAGGTATGCAAAATTT
TACGCAAATGGATTTATTTTGTAG

Protein sequence:

MVIAEPDNYDILSLVRANVLTDFIDIVKGTDVYGAIKLVKNELITIKLDQFPDPINHLAT
PFEIYALVKLNVDVTSCLFQIISNKENKLSLCFTPEGEDLIRITLNGSDLPENGISFHYL
IEDYNAFVNIILAVNDKNVEFYSNCEKIETQYFDSDYTIENINLEKDSILHFGKLTEESN
LFEDFTKAENLEVNTFIDFDSSEKISTNSLFDSTEETVVKGEKGDKGDKGEKGDRGDKGE
RGESVMGERGPIGPDGAPGTPGVMGKEGSCKCSEAIVSDLLLKMPEMRGPPGDYGLKGDR
GEKGVKGDSGLPGKDGRDGNEGDPGIQGPPGTPGLVRKEIVETKVPVVGEKGERGPVGPP
GTPGRDGLRGEKGDKGEPGLMGLPAKLSSILDEDIDPNEEKAIVEKFRGYKGASGPEGPK
GEKGDTGAIGPQGETGRDGIQGPPGKHGHKGETGKDGSKGDKGEPGIPGPPGTVPSSQIS
LMKGPKGDRGPPGQTGPRGPTGHHGKVGPIGPPGKSHKGEPGKPGPMGPKGEKGATGPRG
EKGEGLSPSDIERLKGHKGDRGEIGLPGEAGKPGLPGTCGECVRVSIPGPSGPPGPPGPS
GPPGVSIIGPKGEPGGLVTKKSFFAFNDIHHESTDEDDDFYTAATVIFKTTTGLLKRTTD
TPLGTLAYILQEKILLMRVENGWQYVVMGSFLQTRESHTSTTFRPTYYSSTPSSPPSSDE
TTENNEDNYIRLVALNQAYAGNILMANNRTGRNAADQECYRQAYIHNFKSTFAAFLATRV
EDLRFIVKRKRDRYVPVVNLYGQVLFDSWASMFNGSGALFAKSSIYSFNGKNVQIDTTWP
LKAVWHGSNSFGTVLSRANCNEWTSDSPLNVGAASLLYTHRLLEEEQHTCDKKLIVLCVE
VTSNSYKLRRHSHSTRYAKFYANGFIL
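
The CDS length and the two sequences in this record can be checked against each other. The following is a minimal sketch, not part of the database record: the function name `check_cds` and the toy input are illustrative assumptions. It checks only that a coding sequence is in frame, starts with ATG, ends with a standard stop codon, and implies the same number of residues as the protein given.

```python
# Hypothetical helper (not part of the record): basic consistency checks
# between a coding sequence (CDS) and its listed protein translation.

STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code stop codons


def check_cds(cds: str, protein: str) -> dict:
    """Return simple consistency facts for a CDS/protein pair.

    Whitespace and line breaks are removed first, so sequences can be
    pasted exactly as they appear in a multi-line listing.
    """
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()
    return {
        "cds_length": len(cds),
        "in_frame": len(cds) % 3 == 0,
        "starts_with_atg": cds.startswith("ATG"),
        "ends_with_stop": cds[-3:] in STOP_CODONS,
        # One codon per residue, minus the terminal stop codon.
        "expected_protein_length": len(cds) // 3 - 1,
        "protein_length": len(protein),
    }


if __name__ == "__main__":
    # Toy input: the first four codons of the record plus its stop codon.
    result = check_cds("ATGGTGATA GCTTAG", "MVIA")
    for key, value in result.items():
        print(f"{key}: {value}")
```

Applied to this record, the stated CDS length of 2784 corresponds to 928 codons, that is 927 residues plus one stop codon; the check compares that figure with the length of the protein sequence listed above.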