New model in OGS2.0 | DPOGS207308  |
---|---|
Genomic Position | scaffold1127:1689-18765 (- strand) |
CDS Length | 2784 |
Paired RNAseq reads   | 1227 |
Single RNAseq reads   | 3055 |
Migratory profiles | Query via corresponding ESTs |
Best Bombyx hit | BGIBMGA012091 (4e-41) |
Best Drosophila hit   | multiplexin, isoform E (3e-47) |
Best Human hit | collagen alpha-1(XVIII) chain isoform 1 precursor (4e-31) |
Best NR hit (blastp)   | multiplexin, isoform M [Drosophila melanogaster] (3e-71) |
Best NR hit (blastx)   | PREDICTED: similar to collagen alpha 1(xviii) chain [Tribolium castaneum] (1e-87) |
GeneOntology terms    | GO:0005581 collagen; GO:0007155 cell adhesion; GO:0005198 structural molecule activity; GO:0031012 extracellular matrix; GO:0005488 binding; GO:0008045 motor axon guidance |
InterPro families    | IPR010515 Collagenase NC10/endostatin; IPR008160 Collagen triple helix repeat; IPR016187 C-type lectin fold; IPR016186 C-type lectin-like |
Orthology group | MCL40713 |
Nucleotide sequence:
ATGGTGATAGCTGAACCAGACAACTACGATATCCTGAGTCTAGTCCGTGCGAATGTTTTG
ACCGATTTTATAGATATAGTCAAAGGCACAGATGTGTATGGCGCCATTAAACTTGTAAAA
AACGAATTAATTACTATTAAACTGGATCAATTCCCAGATCCAATAAACCATCTAGCAACT
CCTTTTGAAATATATGCCTTAGTGAAGTTGAACGTGGATGTGACGTCATGTTTGTTTCAA
ATCATATCGAATAAAGAAAACAAACTCAGTCTATGTTTTACACCTGAAGGAGAAGATTTA
ATTAGAATTACATTAAATGGTAGCGATCTACCAGAAAATGGAATATCTTTCCATTACTTG
ATAGAAGATTATAATGCATTTGTAAATATAATCTTAGCTGTGAATGATAAAAATGTGGAA
TTTTACTCTAACTGTGAAAAAATTGAAACCCAATATTTCGATTCCGACTACACCATCGAA
AACATAAATCTCGAAAAAGATTCTATACTACATTTTGGCAAATTGACCGAAGAAAGTAAT
TTATTTGAGGATTTCACAAAGGCAGAAAACCTCGAAGTGAATACGTTTATTGATTTTGAT
TCAAGCGAAAAAATATCAACCAACTCCCTGTTTGATAGCACTGAAGAAACTGTTGTTAAA
GGAGAAAAAGGTGATAAGGGGGACAAGGGCGAGAAAGGCGATCGAGGAGACAAGGGTGAA
CGAGGTGAATCTGTCATGGGTGAACGTGGTCCAATTGGTCCTGATGGAGCTCCTGGAACA
CCGGGTGTGATGGGAAAAGAAGGCTCCTGCAAATGTTCAGAAGCTATTGTGTCAGACTTA
CTACTAAAAATGCCAGAAATGAGAGGACCTCCAGGTGACTACGGGCTGAAAGGCGATAGA
GGTGAAAAGGGGGTGAAAGGAGATAGCGGATTACCAGGAAAAGATGGTAGAGATGGTAAT
GAGGGCGATCCCGGTATACAAGGTCCTCCTGGAACACCAGGTCTTGTTCGTAAGGAAATA
GTAGAGACAAAAGTGCCAGTTGTTGGAGAAAAAGGAGAAAGAGGGCCCGTTGGACCACCT
GGTACTCCTGGTAGAGACGGCTTAAGAGGAGAAAAAGGAGACAAAGGTGAACCGGGTCTC
ATGGGACTACCTGCAAAACTATCATCGATATTAGACGAGGACATCGATCCTAATGAAGAA
AAGGCTATCGTCGAAAAATTCAGAGGATATAAAGGGGCAAGTGGTCCTGAAGGACCGAAG
GGTGAAAAAGGGGATACAGGAGCAATTGGTCCTCAAGGTGAAACTGGCAGAGATGGTATT
CAGGGTCCCCCAGGAAAACATGGACATAAAGGAGAAACTGGCAAAGATGGATCAAAGGGT
GACAAAGGAGAACCAGGAATACCCGGTCCTCCTGGTACTGTGCCATCATCTCAAATAAGT
CTCATGAAAGGACCGAAAGGTGACCGTGGTCCACCAGGTCAGACAGGTCCTCGAGGACCA
ACGGGACATCATGGAAAAGTGGGCCCCATAGGACCACCGGGTAAAAGCCACAAGGGAGAG
CCTGGGAAACCAGGTCCTATGGGACCCAAAGGAGAAAAGGGTGCTACTGGACCTAGAGGA
GAAAAAGGTGAAGGGTTGTCGCCCAGTGATATCGAGAGGTTAAAAGGACATAAAGGTGAC
AGAGGTGAAATTGGTTTACCTGGTGAAGCTGGAAAGCCTGGGTTGCCGGGGACTTGTGGC
GAATGTGTTCGCGTATCAATCCCGGGCCCATCTGGACCACCGGGACCTCCGGGTCCATCA
GGTCCTCCTGGAGTCTCTATCATCGGTCCTAAAGGAGAACCTGGTGGATTAGTAACTAAG
AAATCATTTTTTGCATTCAATGACATTCATCATGAGAGCACAGATGAAGACGATGATTTT
TATACAGCAGCGACTGTCATTTTCAAAACAACTACCGGTCTTCTTAAGAGAACTACTGAC
ACCCCTCTGGGGACGCTGGCATATATATTACAAGAGAAAATATTATTAATGCGGGTTGAA
AATGGATGGCAATACGTTGTGATGGGTTCTTTTTTGCAAACAAGGGAATCACATACCAGC
ACAACATTCAGACCAACGTACTATTCATCAACTCCATCAAGTCCACCCTCTTCAGATGAA
ACGACAGAGAATAATGAAGATAATTACATACGTTTGGTCGCCTTAAACCAAGCATATGCA
GGAAATATACTTATGGCAAACAATAGAACTGGGCGTAATGCTGCTGACCAGGAATGTTAC
CGACAAGCTTATATACATAATTTTAAAAGCACTTTTGCAGCCTTCCTAGCTACTAGGGTT
GAAGATCTAAGATTTATTGTAAAAAGAAAACGAGACAGATATGTTCCGGTAGTCAACTTG
TACGGACAAGTTCTTTTCGATTCCTGGGCGAGCATGTTTAATGGTTCAGGAGCACTGTTT
GCAAAATCAAGTATTTACAGCTTTAATGGAAAAAATGTTCAGATTGATACTACTTGGCCT
TTAAAAGCTGTATGGCATGGCAGCAACTCTTTTGGCACAGTTTTATCAAGAGCAAATTGC
AATGAATGGACGAGTGACAGTCCGCTGAACGTTGGCGCGGCCTCCCTACTATATACCCAT
AGACTATTAGAGGAAGAACAACATACCTGTGATAAAAAACTAATTGTACTCTGTGTCGAA
GTTACATCAAATTCATACAAGCTTAGAAGACATTCACACAGTACAAGGTATGCAAAATTT
TACGCAAATGGATTTATTTTGTAG
Protein sequence:
MVIAEPDNYDILSLVRANVLTDFIDIVKGTDVYGAIKLVKNELITIKLDQFPDPINHLAT
PFEIYALVKLNVDVTSCLFQIISNKENKLSLCFTPEGEDLIRITLNGSDLPENGISFHYL
IEDYNAFVNIILAVNDKNVEFYSNCEKIETQYFDSDYTIENINLEKDSILHFGKLTEESN
LFEDFTKAENLEVNTFIDFDSSEKISTNSLFDSTEETVVKGEKGDKGDKGEKGDRGDKGE
RGESVMGERGPIGPDGAPGTPGVMGKEGSCKCSEAIVSDLLLKMPEMRGPPGDYGLKGDR
GEKGVKGDSGLPGKDGRDGNEGDPGIQGPPGTPGLVRKEIVETKVPVVGEKGERGPVGPP
GTPGRDGLRGEKGDKGEPGLMGLPAKLSSILDEDIDPNEEKAIVEKFRGYKGASGPEGPK
GEKGDTGAIGPQGETGRDGIQGPPGKHGHKGETGKDGSKGDKGEPGIPGPPGTVPSSQIS
LMKGPKGDRGPPGQTGPRGPTGHHGKVGPIGPPGKSHKGEPGKPGPMGPKGEKGATGPRG
EKGEGLSPSDIERLKGHKGDRGEIGLPGEAGKPGLPGTCGECVRVSIPGPSGPPGPPGPS
GPPGVSIIGPKGEPGGLVTKKSFFAFNDIHHESTDEDDDFYTAATVIFKTTTGLLKRTTD
TPLGTLAYILQEKILLMRVENGWQYVVMGSFLQTRESHTSTTFRPTYYSSTPSSPPSSDE
TTENNEDNYIRLVALNQAYAGNILMANNRTGRNAADQECYRQAYIHNFKSTFAAFLATRV
EDLRFIVKRKRDRYVPVVNLYGQVLFDSWASMFNGSGALFAKSSIYSFNGKNVQIDTTWP
LKAVWHGSNSFGTVLSRANCNEWTSDSPLNVGAASLLYTHRLLEEEQHTCDKKLIVLCVE
VTSNSYKLRRHSHSTRYAKFYANGFIL
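
Consistency check (illustrative): the protein listing should be the translation of the nucleotide listing, and the CDS length of 2784 should correspond to 928 codons (927 residues plus a stop codon). The sketch below is one way to spot-check this, assuming the standard genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are hypothetical helpers written for this example and are not part of any database tooling. Only the first and last lines of the nucleotide listing are translated here.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First and last lines of the nucleotide listing in this record.
first_line = "ATGGTGATAGCTGAACCAGACAACTACGATATCCTGAGTCTAGTCCGTGCGAATGTTTTG"
last_line = "TACGCAAATGGATTTATTTTGTAG"

print(translate(first_line))  # MVIAEPDNYDILSLVRANVL
print(translate(last_line))   # YANGFIL*

# CDS length from the record: 2784 nt -> 928 codons (927 residues + stop).
cds_length = 2784
n_codons, remainder = divmod(cds_length, 3)
print(n_codons, remainder)    # 928 0
```

Both translated fragments agree with the start and end of the protein listing. Passing the full concatenated nucleotide listing to `translate` would extend the same check to every residue.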