DPGLEAN05223 in OGS1.0

New model in OGS2.0DPOGS213048 
Genomic Positionscaffold941:- 23912-47957
See gene structure
CDS Length1416
Paired RNAseq reads  42
Single RNAseq reads  160
Migratory profilesQuery via corresponding ESTs
Best Bmobyx hitBGIBMGA008935 (4e-06)
Best Drosophila hit  ND
Best Human hitfibronectin type-III domain-containing protein C4orf31 precursor (2e-10)
Best NR hit (blastp)  PREDICTED: hypothetical protein [Taeniopygia guttata] (4e-15)
Best NR hit (blastx)  PREDICTED: hypothetical protein [Taeniopygia guttata] (1e-12)
GeneOntology terms



  
GO:0010811 positive regulation of cell-substrate adhesion
GO:0030198 extracellular matrix organization
GO:0008201 heparin binding
GO:0019800 peptide cross-linking via chondroitin 4-sulfate glycosaminoglycan
GO:0031012 extracellular matrix
InterPro families  IPR019326 Protein of unknown function DUF2369
Orthology groupMCL16168

Nucleotide sequence:

ATGACCACGGACGACATGATATATGGGTGTCACCTACATCGCGTCTTCCAAAGTCCGGAC
ATCGTCACGACAAGGCTAGCAGCGCTGATCATACTTCACGCAGCTAGAGCTGGATTCGCA
GCTCCAAGTGTTCTCGTCACCAGACCAGTCAGCAGGCTTTCCGAAGAATGGCTGCCAATG
GATAACCAGACCTTGTACACATTGGAAGAAGGAGAAAGTCGAAATCCAATAGACCCTCAA
TCTACAACCTATTGCGTGGTCGCATCTCGGAGAAAGAACTACACTTCACTTTGCGCAGCT
CAATACGACCTCCGGAATACAAAAACAGAAAAAGATACATCCTCTGATCAGTCGCTGGAA
AACCATAAACCTAACCGTTCACAAAACAGCGGTTCGGAAGAGAATAAAATTGATATAGAT
AAAGACAGCATAAGTATTTTTGATGGAAATTATAGGACTTTGTATAGAAGAAAAAAGTTT
GGTCGTAGCACAAGAGTATCAAACGAAGACCCACTGGTTGTTTGTATAGGAGACAGAACA
CATCACTTTATTGAAAATTTAGATCCTGGCACAACCTATTTCGTCAGTATTTTCGGCATT
GCAAGAGATAGACAAATTGGGTCCTTATTAGCATCTGGAAGCGTTCGACCCAGGACGTCC
ACTGCTAAAAGACTAAGGGAAAATGTTCCTTACAAGGCTGACATAAAGGGAAAAAATGTT
TATTACTTGAAGACAACGACCAGTAGTACTTCAACAAACGCTGGTCTTTGGATTGCGACG
TCCACTTGCGGAGGTTCAGTTGACATCGAAGTTTATGTAAAAGGAAAGCGATTGTACGTG
GCCAAAAATATAGAAAATCATTCAAAGTTTTTCGTGCCATCACCAATACTTTCATCGACA
CAAGAAACAAGTGACGAGGGTTCTGTGCAGTTCGATTCGAGCTCAGAGGAATCAAAAATA
AGATACATTGTACGAGTTGTGCCTAACAAATGGGCTATTGATGGTGCCGTAACAATAGAG
TTACTGGCCTCAACATCAAGATGGGGCATGGATACGCCAGAACTTACCGACGGCGGAGTG
ATAAGGGAACTCCGGCCGAGGAGGTCTTGTAAATCCGTAGACATTGCCTTTTTACCAGCC
ACCCATAACGCGACTGATGTGATTAGATATTGCATTTCAACAAAAGAAATAGTCGACAAG
GACATAACAATCTGTGCGTTAACAAAAAAATCATCAACAAAAACACAATGCATAAGTCAC
ATGCAACGTCCTCAGTCAAGAGTAATAGTCCAGAAAGTGAGTGGCTTAAAACCGGGAAGA
AAATACGGAATTCAAGTAACTGCCGCCTCCAAAGGGATCTCAGTGCCATACAATGTCCTG
TATGTGGAAACTAATGCAACTTGCAAAGAAGAATAG

Protein sequence:

MTTDDMIYGCHLHRVFQSPDIVTTRLAALIILHAARAGFAAPSVLVTRPVSRLSEEWLPM
DNQTLYTLEEGESRNPIDPQSTTYCVVASRRKNYTSLCAAQYDLRNTKTEKDTSSDQSLE
NHKPNRSQNSGSEENKIDIDKDSISIFDGNYRTLYRRKKFGRSTRVSNEDPLVVCIGDRT
HHFIENLDPGTTYFVSIFGIARDRQIGSLLASGSVRPRTSTAKRLRENVPYKADIKGKNV
YYLKTTTSSTSTNAGLWIATSTCGGSVDIEVYVKGKRLYVAKNIENHSKFFVPSPILSST
QETSDEGSVQFDSSSEESKIRYIVRVVPNKWAIDGAVTIELLASTSRWGMDTPELTDGGV
IRELRPRRSCKSVDIAFLPATHNATDVIRYCISTKEIVDKDITICALTKKSSTKTQCISH
MQRPQSRVIVQKVSGLKPGRKYGIQVTAASKGISVPYNVLYVETNATCKEE